Hacker News
ACCount37
2 days ago
| on:
Claude Opus 4.7
Constantly. Minor revisions can easily "wobble" on benchmarks that training didn't explicitly optimize for.
Whether that's a genuine loss of capability or just measurement noise is typically unclear.
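For a rough sense of how large a "wobble" has to be before it stands out from noise, here is a minimal sketch. It assumes each benchmark question is an independent pass/fail trial and uses a plain unpaired two-proportion z-score. The function name and the numbers are hypothetical, not taken from any real eval.

```python
import math

def score_diff_z(correct_a: int, correct_b: int, n: int) -> float:
    """z-score for the accuracy gap between two model versions on the same
    n-question benchmark, treating questions as independent Bernoulli trials
    (an unpaired approximation; a paired test would be tighter)."""
    p_a = correct_a / n
    p_b = correct_b / n
    # Standard error of the difference between two independent proportions.
    se = math.sqrt(p_a * (1 - p_a) / n + p_b * (1 - p_b) / n)
    if se == 0:
        raise ValueError("both scores are exactly 0% or 100%; z is undefined")
    return (p_b - p_a) / se

# Hypothetical numbers: a 500-question benchmark, old version 80.0%, new version 78.0%.
z = score_diff_z(400, 390, 500)
print(f"z = {z:.2f}")  # |z| is below 1.96, so this gap is within noise at the 95% level
```

Under these assumptions a 2-point drop on 500 questions is not distinguishable from sampling noise, which is about all a single benchmark number can tell you in that case.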