List of AI News about benchmark
| Time | Details |
|---|---|
| 09:35 |
AI Benchmark Accuracy Challenged: Scale AI Exposes Training Data Contamination in 2024 Analysis
According to God of Prompt on Twitter, recent findings by Scale AI published in May 2024 reveal that AI models are achieving over 95% accuracy on benchmark tests because many test questions are already present in their training data. This 'contamination' undermines the reliability of AI benchmark scores, making it unclear how intelligent these models truly are. As reported by God of Prompt, the industry faces significant challenges in evaluating real AI capabilities, highlighting an urgent need for improved benchmarking standards. |