{"id":59067,"date":"2026-02-19T22:54:51","date_gmt":"2026-02-19T17:24:51","guid":{"rendered":"https:\/\/officechai.com\/?p=59067"},"modified":"2026-02-19T22:54:54","modified_gmt":"2026-02-19T17:24:54","slug":"google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark","status":"publish","type":"post","link":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/","title":{"rendered":"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark"},"content":{"rendered":"\n<p>ARC-AGI 2 had been created when ARC-AGI 1 seemed all but saturated, but it appears that ARC-AGI 2 won&#8217;t remain unsolved for much longer.<\/p>\n\n\n\n<p>Google DeepMind&#8217;s Gemini 3.1 Pro (Preview) has stormed to the top of both ARC-AGI leaderboards simultaneously, posting a 77.1% score on ARC-AGI-2 and a near-perfect 98.0% on ARC-AGI-1 \u2014 results that represent a dramatic leap over its predecessor and reframe the competitive landscape for frontier AI reasoning.<\/p>\n\n\n\n<p><strong>A Benchmark Built to Resist<\/strong><\/p>\n\n\n\n<p>The ARC-AGI-2 benchmark was purpose-built to outlast the first iteration of the challenge. When ARC-AGI-1 began to look like a solved problem \u2014 with models routinely cresting into the high nineties \u2014 the ARC Prize team raised the bar significantly, engineering tasks designed to resist brute-force pattern matching and demand more flexible, generalizable reasoning. For much of its early life, ARC-AGI-2 lived up to that ambition, with the best models struggling to clear even 50%. Gemini 3.1 Pro has now shattered that ceiling.<\/p>\n\n\n\n<p><strong>Gemini 3.1 Pro: The ARC-AGI Numbers<\/strong><\/p>\n\n\n\n<p>On ARC-AGI-2, Gemini 3.1 Pro <a href=\"https:\/\/x.com\/arcprize\/status\/2024522814557212908?s=20\">scores <\/a>77.1% at a cost of $0.962 per task, well clear of the next best verified entries on the leaderboard. On ARC-AGI-1, it posts 98.0% at just $0.522 per task \u2014 making it both the highest-scoring and among the more cost-efficient performers at the top of that chart.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" fetchpriority=\"high\" decoding=\"async\" width=\"640\" height=\"405\" src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?resize=640%2C405&#038;ssl=1\" alt=\"\" class=\"wp-image-59068\" srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?w=927&amp;ssl=1 927w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?resize=300%2C190&amp;ssl=1 300w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?resize=768%2C486&amp;ssl=1 768w\" sizes=\"(max-width: 640px) 100vw, 640px\" \/><figcaption class=\"wp-element-caption\">Gemini 3.1 Pro ARC-AGI 2<\/figcaption><\/figure>\n\n\n\n<p>The performance on ARC-AGI-2 is particularly striking in context. The leaderboard shows most frontier models \u2014 including GPT-5.2 variants, Grok 4, and various Claude configurations \u2014 clustered well below the 60% mark at comparable or higher cost points. <a href=\"https:\/\/officechai.com\/ai\/gemini-3-deep-think-benchmarks-arc-agi\/\">Gemini 3 Deep Think<\/a>, a more computationally intensive offering, reaches around 85% but at significantly greater expense. Gemini 3.1 Pro&#8217;s position on the Pareto frontier of performance versus cost is, by the data, unambiguous.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"640\" height=\"399\" data-src=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-47.png?resize=640%2C399&#038;ssl=1\" alt=\"\" class=\"wp-image-59069 lazyload\" data-srcset=\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-47.png?w=931&amp;ssl=1 931w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-47.png?resize=300%2C187&amp;ssl=1 300w, https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-47.png?resize=768%2C478&amp;ssl=1 768w\" data-sizes=\"(max-width: 640px) 100vw, 640px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 640px; --smush-placeholder-aspect-ratio: 640\/399;\" \/><figcaption class=\"wp-element-caption\">Gemini 3.1 Pro ARC-AGI 1<\/figcaption><\/figure>\n\n\n\n<p><strong>Doubling Down on Efficiency<\/strong><\/p>\n\n\n\n<p>Google DeepMind has framed Gemini 3.1 Pro&#8217;s results explicitly around the Pareto frontier \u2014 the line that defines the best achievable performance at each cost level. Across both benchmarks, the model appears to push that frontier outward, delivering scores that no other verified model achieves at sub-dollar-per-task pricing. That framing matters commercially: enterprise and API customers evaluating reasoning models care as much about inference cost as raw capability, and a model that is both smarter and cheaper to run changes procurement calculus meaningfully.<\/p>\n\n\n\n<p><strong>What It Means for ARC-AGI-2<\/strong><\/p>\n\n\n\n<p>A 77% score does not mean ARC-AGI-2 is solved; the benchmark designers have consistently argued that human-level performance on these abstract visual reasoning tasks requires scores approaching 100%. But 77% is a qualitative inflection point. It demonstrates that the gap between frontier models and the benchmark&#8217;s upper bound is now a matter of refinement rather than fundamental capability.<\/p>\n\n\n\n<p>The trajectory from ARC-AGI-1 saturation to ARC-AGI-2 dominance has arrived faster than many in the research community anticipated. If the pattern holds, the question is no longer whether ARC-AGI-2 will fall \u2014 but when, and which lab will get there first. Right now, Google holds the lead.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ARC-AGI 2 had been created when ARC-AGI 1 seemed all but saturated, but it appears that ARC-AGI 2 won&#8217;t remain unsolved for much&#8230;<\/p>\n","protected":false},"author":1,"featured_media":59068,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1029],"tags":[],"class_list":["post-59067","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark\" \/>\n<meta property=\"og:description\" content=\"ARC-AGI 2 had been created when ARC-AGI 1 seemed all but saturated, but it appears that ARC-AGI 2 won&#8217;t remain unsolved for much...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/\" \/>\n<meta property=\"og:site_name\" content=\"OfficeChai\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/OfficeChai\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-19T17:24:51+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-19T17:24:54+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png\" \/>\n\t<meta property=\"og:image:width\" content=\"927\" \/>\n\t<meta property=\"og:image:height\" content=\"587\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"OfficeChai Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:site\" content=\"@OfficeChai\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"OfficeChai Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/\",\"url\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/\",\"name\":\"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark\",\"isPartOf\":{\"@id\":\"https:\/\/officechai.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1\",\"datePublished\":\"2026-02-19T17:24:51+00:00\",\"dateModified\":\"2026-02-19T17:24:54+00:00\",\"author\":{\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\"},\"breadcrumb\":{\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1\",\"width\":927,\"height\":587,\"caption\":\"gemini 3.1 pro arc agi 2\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/officechai.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/officechai.com\/#website\",\"url\":\"https:\/\/officechai.com\/\",\"name\":\"OfficeChai\",\"description\":\"Startups, Businesses And Careers\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/officechai.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2\",\"name\":\"OfficeChai Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/officechai.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g\",\"caption\":\"OfficeChai Team\"},\"description\":\"Dotting the i's, crossing the t's.\",\"url\":\"https:\/\/officechai.com\/author\/admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/","og_locale":"en_US","og_type":"article","og_title":"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark","og_description":"ARC-AGI 2 had been created when ARC-AGI 1 seemed all but saturated, but it appears that ARC-AGI 2 won&#8217;t remain unsolved for much...","og_url":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/","og_site_name":"OfficeChai","article_publisher":"https:\/\/www.facebook.com\/OfficeChai\/","article_published_time":"2026-02-19T17:24:51+00:00","article_modified_time":"2026-02-19T17:24:54+00:00","og_image":[{"width":927,"height":587,"url":"http:\/\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png","type":"image\/png"}],"author":"OfficeChai Team","twitter_card":"summary_large_image","twitter_creator":"@OfficeChai","twitter_site":"@OfficeChai","twitter_misc":{"Written by":"OfficeChai Team","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/","url":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/","name":"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark","isPartOf":{"@id":"https:\/\/officechai.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage"},"image":{"@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1","datePublished":"2026-02-19T17:24:51+00:00","dateModified":"2026-02-19T17:24:54+00:00","author":{"@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2"},"breadcrumb":{"@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#primaryimage","url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1","contentUrl":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1","width":927,"height":587,"caption":"gemini 3.1 pro arc agi 2"},{"@type":"BreadcrumbList","@id":"https:\/\/officechai.com\/ai\/google-gemini-3-1-pro-doubles-performance-over-gemini-3-pro-on-arc-agi-2-tops-benchmark\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/officechai.com\/"},{"@type":"ListItem","position":2,"name":"Google Gemini 3.1 Pro Doubles Performance Over Gemini 3 Pro On ARC-AGI 2, Tops Benchmark"}]},{"@type":"WebSite","@id":"https:\/\/officechai.com\/#website","url":"https:\/\/officechai.com\/","name":"OfficeChai","description":"Startups, Businesses And Careers","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/officechai.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/officechai.com\/#\/schema\/person\/5861f1134993293cc28905de7624d6b2","name":"OfficeChai Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/officechai.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/61d744733248dc647d505d0676bb425323413132ee5447e86aa8eecbbb7b27d5?s=96&d=mm&r=g","caption":"OfficeChai Team"},"description":"Dotting the i's, crossing the t's.","url":"https:\/\/officechai.com\/author\/admin\/"}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/officechai.com\/wp-content\/uploads\/2026\/02\/image-46.png?fit=927%2C587&ssl=1","jetpack_shortlink":"https:\/\/wp.me\/p685C6-fmH","jetpack_likes_enabled":true,"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/59067","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/comments?post=59067"}],"version-history":[{"count":1,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/59067\/revisions"}],"predecessor-version":[{"id":59070,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/posts\/59067\/revisions\/59070"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media\/59068"}],"wp:attachment":[{"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/media?parent=59067"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/categories?post=59067"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/officechai.com\/wp-json\/wp\/v2\/tags?post=59067"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}