Artificial Analysis LLM Performance Leaderboard launches on Hugging Face
AI Impact Summary
Artificial Analysis has extended its LLM Performance Leaderboard to Hugging Face, giving engineers a unified view of price, speed, and quality across more than 100 serverless LLM endpoints. The metrics cover context window, per-token pricing, throughput (tokens per second, TPS), and latency (time to first token, TTFT), reported as 14-day medians across workloads of roughly 100 to 10k tokens, including parallel queries. This enables data-driven model and provider selection for consumer chat apps and agent-based workflows, and it illustrates the trade-off between high-quality options (GPT-4 Turbo, Claude 3 Opus) and faster, cheaper alternatives, including open models (Llama 3, Mixtral, Gemma) and proprietary ones such as Gemini 1.5 Pro.
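As a rough illustration of how these metrics can feed a selection decision, the Python sketch below takes per-endpoint TTFT and TPS samples, reduces them to medians, and picks the fastest endpoint under a price cap. Everything in it is an assumption for illustration: the endpoint names, prices, and samples are made up, the helper names are hypothetical, and the response-time estimate (median TTFT plus output tokens divided by median TPS) is a common approximation, not necessarily how the leaderboard computes its figures.

```python
from dataclasses import dataclass
from statistics import median
from typing import List, Optional


@dataclass
class EndpointSamples:
    """Hypothetical measurements for one serverless endpoint."""
    name: str
    price_per_m_tokens: float  # USD per 1M tokens (illustrative value)
    ttft_s: List[float]        # time-to-first-token samples, in seconds
    tps: List[float]           # output tokens-per-second samples


def estimated_response_time(e: EndpointSamples, output_tokens: int) -> float:
    """Approximate end-to-end time: median TTFT + tokens / median TPS."""
    return median(e.ttft_s) + output_tokens / median(e.tps)


def pick_fastest_within_budget(
    endpoints: List[EndpointSamples], output_tokens: int, max_price: float
) -> Optional[EndpointSamples]:
    """Return the fastest endpoint priced at or below max_price, else None."""
    affordable = [e for e in endpoints if e.price_per_m_tokens <= max_price]
    if not affordable:
        return None
    return min(affordable, key=lambda e: estimated_response_time(e, output_tokens))


# Made-up data, not taken from the leaderboard.
endpoints = [
    EndpointSamples("large-model-a", 30.0, [0.6, 0.5, 0.7], [20, 25, 22]),
    EndpointSamples("open-model-b", 0.5, [0.3, 0.2, 0.4], [100, 90, 110]),
    EndpointSamples("open-model-c", 0.2, [0.9, 1.0, 1.1], [50, 40, 60]),
]

choice = pick_fastest_within_budget(endpoints, output_tokens=100, max_price=1.0)
print(choice.name if choice else "no endpoint within budget")
```

A quality threshold could be added as another filter in the same way. It is left out here because the summary does not say how the quality score is defined.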
Affected Systems
- Date: not specified
- Change type: capability
- Severity: info