Hugging Face: Smolagents adds vision support with vision-language models (VLMs) for autonomous agent workflows | SignalBreak | SignalBreak