Natural language style packing is attractive because it avoids padding waste and lets the model see long continuous streams. For event data, it also introduces partial-event boundaries that sampling and untokenization must handle carefully.
The key design question is where to preserve event structure explicitly and where to let the packed stream behave like ordinary sequence data.