Anthropic’s Claude Opus 4.7 delivers gains in instruction-following, safety, and software engineering benchmarks but has drawn criticism for reduced initiative, more frequent hallucinations, and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results