Claude Opus 4 achieved a 72.5% score on SWE-bench, outperforming competitors like OpenAI's GPT-4.1, which scored 54.6%. The models have been integrated into development workflows and are being used in GitHub Copilot.
Company Description
Anthropic is an AI research company that focuses on the safety and alignment of AI systems with human values.