Software errors are becoming increasingly prevalent as the size and complexity of computer programs continue to increase and are more difficult to identify before widespread distribution of the ...
Software development has fundamentally changed in the past 18 months. AI-assisted coding and engineering went from novel and ...
OpenAI’s frontier model may not have astounded when it arrived earlier this year, but research indicates it’s now much better ...
Now, we're back with Opus 4.5. Anthropic, the company behind Claude claims, and I quote, "Our newest model, Claude Opus 4.5, is available today. It's intelligent, efficient, and the best model in the ...
Anthropic is solidifying its dominance in AI coding, with a new release that performed better on its engineering test than ...
It is a universal truth of human nature that the developers who build the code should not be the ones to test it. First of all, most of them pretty much detest that task. Second, like any good ...
A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...
In a new benchmark named Vibe Code Bench, OpenAI’s GPT-5.1 achieved the highest level of accuracy in completing a series of software engineering tasks, narrowly beating rival Anthropic’s Claude 4.5 ...
From generating test cases and transforming test data to accelerating planning and improving developer communication, AI is having a profound impact on software testing.
Katie Parrott, Dan Shipper, and Kieran Klaassen in Vibe Check Was this newsletter forwarded to you? Sign up to get it in your inbox. It’s appropriate that this week is Thanksgiving, because Anthropic ...