Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 ...
Anthropic has released Claude Sonnet 4.5, which it unabashedly refers to as "the best coding model in the world." ...
Harness is acquiring Qwiet AI to expand its application security capabilities with advanced static application testing and ...
Meta has released Code World Model (CWM), a 32-billion-parameter AI model for researchers that simulates code execution to ...
Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...