What’s Improved in AI Models Sonnet & Opus
Digest more
Anthropic's Claude Opus 4 outperforms OpenAI's GPT-4.1 with unprecedented seven-hour autonomous coding sessions and record-breaking 72.5% SWE-bench score, transforming AI from quick-response tool to day-long collaborator.
Claude Opus is Anthropic's largest version of the model family with more power for longer, complex tasks, whereas Sonnet is generally speedier and more efficient. Claude Opus 4 is
Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid reasoning models.
Bowman later edited his tweet and the following one in a thread to read as follows, but it still didn't convince the naysayers.
Anthropic's Claude 4 models show particular strength in coding and reasoning tasks, but lag behind in multimodality and context window size compared to Google and OpenAI offerings.
2don MSN
A third-party research institute Anthropic partnered with to test Claude Opus 4 recommended against deploying an early version because it tends to "scheme."
Founded by former OpenAI engineers, Anthropic is currently concentrating its efforts on cutting-edge models that are particularly adept at generating lines of code, and used mainly by businesses and professionals.