Hugging Face: Text-to-Video capability expansion: ModelScope and diffusion-based models | SignalBreak