How well will Grok 4 do on FrontierMath?


The highest score of any version of Grok 4 on the Epoch AI dashboard for the FrontierMath benchmark, within 1 week of the first appearance of Grok 4 on the dashboard.

(https://epoch.ai/data/ai-benchmarking-dashboard)


what happened?

@bh It got 12-14%

@Bayesian thanks, figured I'd missed it, but I didn't see anything on the epoch.ai dashboard. I wonder if they will evaluate the Heavy/multi-agent version.

@Bayesian Why are we sure Grok 4 Heavy won't count? Description implies it would


@Bayesian where can you see the score? The link in the description doesn't appear to talk about grok4

@SimoneRomeo huh, they tweeted about it, but I guess it's not on the site


@Bayesian @Fay42 do I understand right that if the score is not on their website within one week, the market should resolve 0%?

@SimoneRomeo I would have expected that to be NA.
