>Opus 4.5 has really startled me - it genuinely can do complex software engineer...

martinald · 2025-12-28T02:11:23 1766887883

I don't think the benchmarks catch this very well. Opus 4.5 is _significantly_ better than Sonnet 4.5 in my experience, far more than the SWE Bench scores would say. I can happily leave Opus 4.5 running for 20-30 minutes and come back to very high quality software on complex tasks/refactoring. Sonnet 4.5 would fall over within a couple of minutes on these tasks.

20251227 · 2025-12-28T02:20:55 1766888455

What does "very high quality" mean here