
Vectara HHEM Leaderboard — New Open-Source LLM Hallucination Evaluation

AI Impact Summary

Vectara has created a new Hugging Face leaderboard built on the Hugging Face leaderboard template and scored with its Hughes Hallucination Evaluation Model (HHEM). The leaderboard updates dynamically and evaluates LLMs such as GPT-4, Gemini, and Llama 2 on their propensity to hallucinate. Community members can submit new models for evaluation and track performance metrics such as Factual Consistency Rate and Hallucination Rate.
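To illustrate how the two metrics named above might relate, here is a minimal sketch. It assumes each summary receives a consistency score between 0 and 1, that a score at or above a cutoff counts as factually consistent, and that the two rates are complementary percentages. The function name `consistency_rates`, the 0.5 cutoff, and the sample scores are illustrative assumptions, not details taken from this summary.

```python
def consistency_rates(scores, threshold=0.5):
    """Return (factual_consistency_rate, hallucination_rate) as percentages.

    Assumption: a score >= threshold means the summary is consistent
    with its source document. The 0.5 default is a hypothetical cutoff.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    consistent = sum(1 for s in scores if s >= threshold)
    factual_consistency_rate = 100.0 * consistent / len(scores)
    # Assumption: every summary is either consistent or hallucinated,
    # so the two rates sum to 100.
    hallucination_rate = 100.0 - factual_consistency_rate
    return factual_consistency_rate, hallucination_rate


# Made-up scores for four summaries produced by one model.
example_scores = [0.92, 0.15, 0.78, 0.61]
fcr, hr = consistency_rates(example_scores)
print(f"Factual Consistency Rate: {fcr:.1f}%  Hallucination Rate: {hr:.1f}%")
```

Under these assumptions, a lower Hallucination Rate always means a higher Factual Consistency Rate, so ranking by either metric gives the same order.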

Affected Systems

Hugging Face Leaderboard
Hughes Hallucination Evaluation Model (HHEM)

Date: not specified
Change type: capability
Severity: info
