Hugging Face: ConTextual: Benchmark for context-sensitive text-rich visual reasoning in multimodal models | SignalBreak