News

A new study alleges Alibaba's Qwen2.5 AI model cheated on key math benchmarks by memorizing test data, raising serious questions about AI benchmark integrity.