Will any plain transformer model achieve 60% or more on ARC-AGI-2 by 2030?
The inference cost to achieve this result does not matter.
The model that achieves this result must use the same "transformer recipe" that was common between 2023 and 2025: techniques like RLHF, RLAIF, CoT, RAG, and vision encoders are allowed, but any specialized components must themselves be built from vanilla transformer blocks (see the sketch after these criteria). New inductive biases, such as tree search or neurosymbolic logic, would not qualify.
The result must be verified by at least one reputable, unaffiliated organization (ARC, Epoch, OpenAI Evals, an academic lab, etc.) or be publicly re-runnable (e.g., a notebook on Kaggle).
Resolution uses the ARC-AGI-2 evaluation set and scoring script as published on arcprize.org on the day this market opens. Later revisions are ignored.
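For concreteness, here is a minimal sketch of what "vanilla transformer block" means for this question: a standard pre-norm block of self-attention plus an MLP, each wrapped in a residual connection. The hyperparameters and layer choices below (LayerNorm, GELU, the dimensions) are illustrative assumptions, not a spec taken from any particular model.

```python
import torch
import torch.nn as nn

class TransformerBlock(nn.Module):
    """Pre-norm transformer block: self-attention + MLP, each with a residual connection."""

    def __init__(self, d_model: int = 512, n_heads: int = 8, d_ff: int = 2048):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.GELU(),
            nn.Linear(d_ff, d_model),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Self-attention sublayer with residual connection.
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out
        # Position-wise feed-forward sublayer with residual connection.
        x = x + self.mlp(self.ln2(x))
        return x
```

A model qualifies if it is a stack of blocks like this (plus embeddings, a vision encoder also made of such blocks, etc.); bolting on a search procedure or a symbolic solver does not.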
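The script published on arcprize.org is the authoritative one for resolution. Purely as an illustration of ARC-style exact-match scoring, here is a rough sketch; the data layout and the idea that any one of a small number of attempts can count are my assumptions, not the published script:

```python
def task_solved(attempts: list[list[list[int]]], solution: list[list[int]]) -> bool:
    # A task counts as solved only if some attempt reproduces the target grid
    # exactly: every cell, every row, no partial credit. (Assumed convention,
    # not the official script.)
    return any(attempt == solution for attempt in attempts)

def eval_score(predictions: dict[str, list], solutions: dict[str, list]) -> float:
    # Overall score is the fraction of evaluation tasks solved. A value >= 0.60
    # on the ARC-AGI-2 evaluation set would resolve this market YES, subject to
    # the other criteria above.
    solved = sum(task_solved(predictions[task_id], solutions[task_id])
                 for task_id in solutions)
    return solved / len(solutions)
```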
Honestly, I don't believe that 60% or more on ARC-AGI-2 amounts to AGI in any meaningful sense:
Humans can score 100%, not just 60%.
It's a single benchmark that doesn't test the full breadth of capabilities; it's entirely possible for a system to be good at this benchmark while being useless at other tasks.
I propose renaming the question.