Hugging Face: Google PaliGemma 2 Mix: new vision-language models for OCR, captioning, and VQA (3B/10B/28B) | SignalBreak | SignalBreak