Hugging Face: Efficient MultiModal Data Pipeline: PyTorch IterableDataset and knapsack packing | SignalBreak | SignalBreak