Hugging Face: DABStep: Data Agent Benchmark for Multi-step Reasoning — 16% accuracy for current AI models | SignalBreak | SignalBreak