Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
eldenring
4 months ago
|
parent
|
context
|
favorite
| on:
Gemini 3 Pro Model Card [pdf]
Its the other way around too, HLE questions were selected adversarially to reduce the scores. I'd guess even if the questions were never released, and new training data was introduced, the scores would improve.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: