Hugging Face: nanoVLM: PyTorch Vision-Language training with google/siglip-base-patch16-224 and HuggingFaceTB/SmolLM2-135M | SignalBreak