OpenAI introduces RTEB: A New Standard for Retrieval Evaluation
Action Required
Developers need to migrate to this new benchmark to accurately assess the retrieval accuracy of their embedding models and ensure they perform well in real-world applications.
AI Impact Summary
OpenAI is introducing RTEB, a new benchmark designed to reliably evaluate the retrieval accuracy of embedding models. Existing benchmarks struggle to measure true generalization, and RTEB addresses this with a hybrid strategy of open and private datasets. This new benchmark is focused on enterprise use cases and includes multilingual datasets and domain-specific focus, offering a more robust and transparent way to assess retrieval model performance in real-world applications.
Models affected
- new
- Date
- Date not specified
- Change type
- capability
- Severity
- high