This is not a benchmark. They just want to give people the opportunity to try th...

YeGoblynQueenne · 2026-02-07T18:30:56 1770489056

Hey, sorry, totally out of context but I've always wanted to ask about the username. I keep reading it as "yoruba" in my mind. What does it mean, if I'm not being indiscreet?

yorwba · 2026-02-07T20:04:51 1770494691

You're not the first to have wondered: https://news.ycombinator.com/item?id=20730027

YeGoblynQueenne · 2026-02-07T22:51:42 1770504702

Well, now that I read that comment I remembered having read it before. My mind is going.

cocoto · 2026-02-07T16:54:30 1770483270

They could solve the problems and train the next models with the answers, as such the future models could “solve” theses.

fph · 2026-02-07T17:03:39 1770483819

The authors mention that before publications they tested these questions on Gemini and GPT, so they have been available to the two biggest players already; they have a head start.

data_maan · 2026-02-07T18:16:13 1770488173

Looks like very sloppy research.

pickleRick243 · 2026-02-07T23:39:30 1770507570

I don't think it's that serious...it's an interesting experiment that assumes people will take it in good faith. The idea is also of course to attach the transcript log and how you prompted the LLM so that anyone can attempt to reproduce if they wish.

data_maan · 2026-02-08T08:26:23 1770539183

If you want to do this rigorously, you should run it as a competition like the guys at the AI-MO Prize are doing on Kaggle.

That way you get all the necessary data.

I still think this is bro science.

yorwba · 2026-02-08T09:34:15 1770543255

If this were a competition, some people would try hard to win it. But the goal here is exploration, not exploitation. Once the answers are revealed, it's unlikely a winner will be identified, but a bunch of mathematicians who tried prompting AI with the questions might learn something from the exercise.

data_maan · 2026-02-10T19:11:46 1770750706

But everything has been explored in other datasets already.

If only a bunch of mathematicians learn something, why are so many people talking about this, why is the NY Times posting about this?

This is the attention economy at its worst.