Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 ...
Anthropic has released Claude Sonnet 4.5, which it unabashedly refers to as "the best coding model in the world." ...
Harness is acquiring Qwiet AI to expand its application security capabilities with advanced static application testing and ...
Meta has released Code World Model (CWM), a 32-billion-parameter AI model for researchers that simulates code execution to ...
Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of the SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
The agency's fleet of smartphones and tablets suffer from basic security failures that has left personal and sensitive ...
Although capable of reducing trivial mistakes, AI coding copilots leave enterprises at risk of increased insecure coding ...
AI is increasing both the number of pull requests and the volume of code within them, creating bottlenecks in code review, ...
Abstract: Scripting languages like Python or JavaScript are extremely popular among developers, in part due to their massive open-source ecosystems that enable smooth code reuse. However, recent work ...
Wang, S. (2025) A Review of Agent Data Evaluation: Status, Challenges, and Future Prospects as of 2025. Journal of Software ...
CodeRabbit's $60M funding highlights enterprise need for AI code review platforms, with organizations seeing 25% efficiency ...
Artificial intelligence code review startup CodeRabbit Inc. has raised $60 million in a Series B funding, a round that it ...