Learn languages naturally with fresh, real content!

Popular Topics
Explore By Region
AI startup Galileo Technologies ranks Claude 3.5 Sonnet, Google's Gemini, and Alibaba's Qwen2-72B-Instruct top in the Hallucination Index benchmark.
AI startup Galileo Technologies has ranked midrange and open-source large language models highly in a new benchmark test, the Hallucination Index.
The benchmark, which evaluates 22 leading generative AI models, measured their accuracy across three task collections.
Anthropic's Claude 3.5 Sonnet topped the ranking, while Google's Gemini 1.5 Flash performed best on cost.
Alibaba's Qwen2-72B-Instruct was the top-performing open-source model.
3 Articles
La startup de inteligencia artificial Galileo Technologies clasifica a Claude 3.5 Sonnet, Gemini de Google y Qwen2-72B-Instruct de Alibaba en los primeros puestos del índice de referencia Hallucination Index.