Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult simonwillison.net 1 points by gingersnap 9 hours ago
More discussion: https://news.ycombinator.com/item?id=46037637