{"id":64426,"date":"2025-01-20T08:49:54","date_gmt":"2025-01-20T08:49:54","guid":{"rendered":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/"},"modified":"2025-01-20T08:52:37","modified_gmt":"2025-01-20T08:52:37","slug":"openai-secretly-funded-benchmarking-dataset-linked-to-o3-model","status":"publish","type":"post","link":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/","title":{"rendered":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model"},"content":{"rendered":"<div id=\"narrow-cont\">\n<p>Revelations that OpenAI secretly funded and had access to the FrontierMath benchmarking dataset are raising concerns about whether it was used to train its o3 AI reasoning model, and about the validity of the model\u2019s high scores.<\/p>\n<p>In addition to accessing the benchmarking dataset, OpenAI funded its creation, a fact that was withheld from the mathematicians who contributed to developing FrontierMath. Epoch AI belatedly disclosed OpenAI\u2019s funding only in the final paper published on Arxiv.org, which introduced the benchmark. 
Earlier versions of the paper omitted any mention of OpenAI\u2019s involvement.<\/p>\n<h3>Screenshot Of FrontierMath Paper<\/h3>\n<p><img decoding=\"async\" src=\"https:\/\/mailinvest.blog\/wp-content\/themes\/breek\/assets\/images\/transparent.gif\" data-lazy=\"true\" data-src=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-acknowledgement-347.png\" alt=\"\" width=\"400\" height=\"477\" class=\"alignnone wp-image-537772 size-full small-img\" data-srcset=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-acknowledgement-347.png 400w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-acknowledgement-347-384x458.png 384w\" data-sizes=\"auto, (max-width: 400px) 100vw, 400px\" loading=\"lazy\"\/><\/p>\n<h3>Closeup Of Acknowledgement<\/h3>\n<p><img decoding=\"async\" src=\"https:\/\/mailinvest.blog\/wp-content\/themes\/breek\/assets\/images\/transparent.gif\" data-lazy=\"true\" data-src=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-footer-970.png\" alt=\"\" width=\"746\" height=\"154\" class=\"alignnone size-full wp-image-537770\" data-srcset=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-footer-970.png 746w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-footer-970-480x99.png 480w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-footer-970-680x140.png 680w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-frontiermath-footer-970-384x79.png 384w\" data-sizes=\"auto, (max-width: 746px) 100vw, 746px\" loading=\"lazy\"\/><\/p>\n<h3>Earlier Version Of Paper That Lacked Acknowledgement<\/h3>\n<p><img decoding=\"async\" src=\"https:\/\/mailinvest.blog\/wp-content\/themes\/breek\/assets\/images\/transparent.gif\" 
data-lazy=\"true\" data-src=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-earlier-frontiermath-paper-652.png\" alt=\"\" width=\"700\" height=\"72\" class=\"alignnone size-full wp-image-537771\" data-srcset=\"https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-earlier-frontiermath-paper-652.png 700w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-earlier-frontiermath-paper-652-480x49.png 480w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-earlier-frontiermath-paper-652-680x70.png 680w, https:\/\/www.searchenginejournal.com\/wp-content\/uploads\/2025\/01\/screenshot-earlier-frontiermath-paper-652-384x39.png 384w\" data-sizes=\"auto, (max-width: 700px) 100vw, 700px\" loading=\"lazy\"\/><\/p>\n<h2>OpenAI o3 Model Scored Highly On FrontierMath Benchmark<\/h2>\n<p>News of OpenAI\u2019s secret involvement is raising questions about the high scores achieved by the o3 reasoning AI model and causing disappointment with the FrontierMath project. Epoch AI responded with transparency about what happened and what they\u2019re doing to check whether the o3 model was trained with the FrontierMath dataset.<\/p>\n<p>Giving OpenAI access to the dataset was unexpected because the whole point of it is to test AI models, and that can\u2019t be done if the models know the questions and answers beforehand.<\/p>\n<p>A <a href=\"https:\/\/www.reddit.com\/r\/singularity\/comments\/1i4n0r5\/comment\/m7y3x0j\/\" target=\"_blank\" rel=\"noopener\">post<\/a> in the r\/singularity subreddit expressed this disappointment and cited a document that claimed that the mathematicians didn\u2019t know about OpenAI\u2019s involvement:<\/p>\n<blockquote>\n<p>\u201cFrontier Math, the recent cutting-edge math benchmark, is funded by OpenAI. OpenAI allegedly has access to the problems and solutions. 
This is disappointing because the benchmark was sold to the public as a means to evaluate frontier models, with support from renowned mathematicians. In reality, Epoch AI is building datasets for OpenAI. They never disclosed any ties with OpenAI before.\u201d<\/p>\n<\/blockquote>\n<p><em>The Reddit discussion <a href=\"https:\/\/www.lesswrong.com\/posts\/cu2E8wgmbdZbqeWqb\/meemi-s-shortform\" target=\"_blank\" rel=\"noopener\">cited a publication<\/a> that revealed OpenAI\u2019s deeper involvement:<\/em><\/p>\n<blockquote>\n<p>\u201cThe mathematicians creating the problems for FrontierMath weren&#8217;t (actively)[2] communicated to about funding from OpenAI.<\/p>\n<p>\u2026Now Epoch AI or OpenAI don\u2019t say publicly that OpenAI has access to the exercises or answers or solutions. I&#8217;ve heard second-hand that OpenAI does have access to exercises and answers and that they use them for validation.\u201d<\/p>\n<\/blockquote>\n<p>Tamay Besiroglu (<a href=\"https:\/\/www.linkedin.com\/in\/tamay-besiroglu\/\" target=\"_blank\" rel=\"noopener\">LinkedIn Profile<\/a>), associate director at Epoch AI, acknowledged that OpenAI had access to the datasets but also asserted that there was a \u201choldout\u201d dataset that OpenAI didn\u2019t have access to.<\/p>\n<p><em>He wrote in the cited document:<\/em><\/p>\n<blockquote>\n<p>\u201cTamay from Epoch AI here.<\/p>\n<p>We made a mistake in not being more transparent about OpenAI\u2019s involvement. We were restricted from disclosing the partnership until around the time o3 launched, and in hindsight we should have negotiated harder for the ability to be transparent to the benchmark contributors as soon as possible. Our contract specifically prevented us from disclosing information about the funding source and the fact that OpenAI has data access to much but not all of the dataset. 
We own this mistake and are committed to doing better in the future.<\/p>\n<p>Regarding training usage: We acknowledge that OpenAI does have access to a large fraction of FrontierMath problems and solutions, excluding a unseen-by-OpenAI hold-out set that enables us to independently verify model capabilities. However, we have a verbal agreement that these materials won&#8217;t be used in model training.<\/p>\n<p>OpenAI has also been fully supportive of our decision to maintain a separate, unseen holdout set\u2014an extra safeguard to prevent overfitting and ensure accurate progress measurement. From day one, FrontierMath was conceived and presented as an evaluation tool, and we believe these arrangements reflect that purpose.\u201d<\/p>\n<\/blockquote>\n<h2>More Details About OpenAI &amp; FrontierMath Revealed<\/h2>\n<p>Elliot Glazer (<a href=\"https:\/\/www.linkedin.com\/in\/elliot-glazer-755190133\/\" target=\"_blank\" rel=\"noopener\">LinkedIn profile<\/a>\/<a href=\"https:\/\/www.reddit.com\/user\/elliotglazer\/\" target=\"_blank\" rel=\"noopener\">Reddit profile<\/a>), the lead mathematician at Epoch AI, confirmed that OpenAI has the dataset and that they were allowed to use it to evaluate OpenAI\u2019s o3 large language model, its next state-of-the-art AI, which is referred to as a reasoning AI model. He offered his opinion that the high scores obtained by the o3 model are \u201clegit\u201d and said that Epoch AI is conducting an independent evaluation to determine whether o3 had access to the FrontierMath dataset for training, which would cast the model\u2019s high scores in a different light.<\/p>\n<p><em>He wrote:<\/em><\/p>\n<blockquote>\n<p>\u201cEpoch\u2019s lead mathematician here. Yes, OAI funded this and has the dataset, which allowed them to evaluate o3 in-house. We haven\u2019t yet independently verified their 25% claim. 
To do so, we\u2019re currently developing a hold-out dataset and will be able to test their model without them having any prior exposure to these problems.<\/p>\n<p>My personal opinion is that OAI\u2019s score is legit (i.e., they didn\u2019t train on the dataset), and that they don&#8217;t have any incentive to lie about internal benchmarking performances. However, we can\u2019t vouch for them until our independent evaluation is complete.\u201d<\/p>\n<\/blockquote>\n<p><em>Glazer had also <a href=\"https:\/\/www.reddit.com\/r\/singularity\/comments\/1i4n0r5\/comment\/m7y3x0j\/\" target=\"_blank\" rel=\"noopener\">shared<\/a> that Epoch AI was going to test o3 using a \u201choldout\u201d dataset that OpenAI didn\u2019t have access to, saying:<\/em><\/p>\n<blockquote>\n<p>\u201cWe\u2019re going to evaluate o3 with OAI having zero prior exposure to the holdout problems. This will be airtight.\u201d<\/p>\n<\/blockquote>\n<p><em>Another <a href=\"https:\/\/www.reddit.com\/r\/singularity\/comments\/1i4n0r5\/comment\/m80tilj\/\" target=\"_blank\" rel=\"noopener\">post<\/a> on Reddit by Glazer described how the \u201choldout set\u201d was created:<\/em><\/p>\n<blockquote>\n<p>\u201cWe\u2019ll describe the process more clearly when the holdout set eval is actually done, but we\u2019re choosing the holdout problems at random from a larger set which will be added to FrontierMath. 
The production process is otherwise identical to how it\u2019s always been.\u201d<\/p>\n<\/blockquote>\n<h2>Waiting For Answers<\/h2>\n<p>That\u2019s where the drama stands until the Epoch AI evaluation is completed, which will indicate whether OpenAI trained its AI reasoning model with the dataset or only used it for benchmarking.<\/p>\n<p><em>Featured Image by Shutterstock\/Antonello Marangi<\/em><\/p>\n<\/div>\n<br \/><a href=\"https:\/\/www.searchenginejournal.com\/openai-secretly-funded-frontiermath-benchmarking-dataset\/537760\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Revelations that OpenAI secretly funded and had access to the FrontierMath benchmarking dataset are raising concerns about whether or not it was used to 
train&#8230;<\/p>\n","protected":false},"author":1,"featured_media":64427,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[],"class_list":["post-64426","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech-universe"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog<\/title>\n<meta name=\"description\" content=\"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what&#039;s new, what matters and how technology can enrich your life. mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog\" \/>\n<meta property=\"og:description\" content=\"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what&#039;s new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/\" \/>\n<meta property=\"og:site_name\" content=\"mailinvest.blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/freelanceracademic\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-01-20T08:49:54+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-01-20T08:52:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1600\" \/>\n\t<meta property=\"og:image:height\" content=\"840\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"admin@mailinvest.blog\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin@mailinvest.blog\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/\"},\"author\":{\"name\":\"admin@mailinvest.blog\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#\\\/schema\\\/person\\\/012701c4c204d4e4ebd34f926cfd31a4\"},\"headline\":\"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model\",\"datePublished\":\"2025-01-20T08:49:54+00:00\",\"dateModified\":\"2025-01-20T08:52:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/\"},\"wordCount\":1003,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2025\\\/01\\\/openai-frontiermath-access-445.jpg\",\"articleSection\":[\"Tech 
Universe\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/\",\"url\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/\",\"name\":\"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2025\\\/01\\\/openai-frontiermath-access-445.jpg\",\"datePublished\":\"2025-01-20T08:49:54+00:00\",\"dateModified\":\"2025-01-20T08:52:37+00:00\",\"description\":\"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what's new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#primaryimage\",\"url\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2025\\\/01\\\/openai-frontiermath-access-445.jpg\",\"contentUrl\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2025\\\/01\\\/openai-frontiermath-access-445.jpg\",\"width\":1600,\"height\":840},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/index.php\\\/2025\\\/01\\\/20\\\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/mailinvest.blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#website\",\"url\":\"https:\\\/\\\/mailinvest.blog\\\/\",\"name\":\"mailinvest.blog\",\"description\":\"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis. mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what&#039;s new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.\",\"publisher\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/mailinvest.blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#organization\",\"name\":\"mailinvest\",\"url\":\"https:\\\/\\\/mailinvest.blog\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2022\\\/01\\\/default.png\",\"contentUrl\":\"https:\\\/\\\/mailinvest.blog\\\/wp-content\\\/uploads\\\/2022\\\/01\\\/default.png\",\"width\":1000,\"height\":1000,\"caption\":\"mailinvest\"},\"image\":{\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/freelanceracademic\\\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/mailinvest.blog\\\/#\\\/schema\\\/person\\\/012701c4c204d4e4ebd34f926cfd31a4\",\"name\":\"admin@mailinvest.blog\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g\",\"caption\":\"admin@mailinvest.blog\"},\"sameAs\":[\"https:\\\/\\\/mailinvest.blog\",\"admin@mailinvest.blog\"],\"url\":\"https:\\\/\\\/mailinvest.
blog\\\/index.php\\\/author\\\/adminmailinvest-blog\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog","description":"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what's new, what matters and how technology can enrich your life. mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/","og_locale":"en_US","og_type":"article","og_title":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog","og_description":"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what's new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.","og_url":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/","og_site_name":"mailinvest.blog","article_publisher":"https:\/\/www.facebook.com\/freelanceracademic\/","article_published_time":"2025-01-20T08:49:54+00:00","article_modified_time":"2025-01-20T08:52:37+00:00","og_image":[{"width":1600,"height":840,"url":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg","type":"image\/jpeg"}],"author":"admin@mailinvest.blog","twitter_card":"summary_large_image","twitter_misc":{"Written by":"admin@mailinvest.blog","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#article","isPartOf":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/"},"author":{"name":"admin@mailinvest.blog","@id":"https:\/\/mailinvest.blog\/#\/schema\/person\/012701c4c204d4e4ebd34f926cfd31a4"},"headline":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model","datePublished":"2025-01-20T08:49:54+00:00","dateModified":"2025-01-20T08:52:37+00:00","mainEntityOfPage":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/"},"wordCount":1003,"commentCount":0,"publisher":{"@id":"https:\/\/mailinvest.blog\/#organization"},"image":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#primaryimage"},"thumbnailUrl":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg","articleSection":["Tech 
Universe"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/","url":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/","name":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model - mailinvest.blog","isPartOf":{"@id":"https:\/\/mailinvest.blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#primaryimage"},"image":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#primaryimage"},"thumbnailUrl":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg","datePublished":"2025-01-20T08:49:54+00:00","dateModified":"2025-01-20T08:52:37+00:00","description":"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis.mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what's new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.","breadcrumb":{"@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#primaryimage","url":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg","contentUrl":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2025\/01\/openai-frontiermath-access-445.jpg","width":1600,"height":840},{"@type":"BreadcrumbList","@id":"https:\/\/mailinvest.blog\/index.php\/2025\/01\/20\/openai-secretly-funded-benchmarking-dataset-linked-to-o3-model\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/mailinvest.blog\/"},{"@type":"ListItem","position":2,"name":"OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model"}]},{"@type":"WebSite","@id":"https:\/\/mailinvest.blog\/#website","url":"https:\/\/mailinvest.blog\/","name":"mailinvest.blog","description":"Technology is forever changing, and there are always new pieces of technology to replace obsolete ones. Tons of people enjoy reading tech blogs on a daily basis. mailinvest.blog tracks all the latest consumer technology breakthroughs and shows you what&#039;s new, what matters and how technology can enrich your life. 
mailinvest.blog also provides the information, tools, and advice that helps when deciding what to buy.","publisher":{"@id":"https:\/\/mailinvest.blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/mailinvest.blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/mailinvest.blog\/#organization","name":"mailinvest","url":"https:\/\/mailinvest.blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/mailinvest.blog\/#\/schema\/logo\/image\/","url":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2022\/01\/default.png","contentUrl":"https:\/\/mailinvest.blog\/wp-content\/uploads\/2022\/01\/default.png","width":1000,"height":1000,"caption":"mailinvest"},"image":{"@id":"https:\/\/mailinvest.blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/freelanceracademic\/"]},{"@type":"Person","@id":"https:\/\/mailinvest.blog\/#\/schema\/person\/012701c4c204d4e4ebd34f926cfd31a4","name":"admin@mailinvest.blog","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/98ed217bd0f3d6a6dcae2d9b0c76e305b049a07275e315e1407e19ec8b08e139?s=96&d=mm&r=g","caption":"admin@mailinvest.blog"},"sameAs":["https:\/\/mailinvest.blog","admin@mailinvest.blog"],"url":"https:\/\/mailinvest.blog\/index.php\/author\/adminmailinvest-blog\/"}]}},"_links":{"self":[{"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/posts\/64426","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/posts"}],"abo
ut":[{"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/comments?post=64426"}],"version-history":[{"count":1,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/posts\/64426\/revisions"}],"predecessor-version":[{"id":64428,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/posts\/64426\/revisions\/64428"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/media\/64427"}],"wp:attachment":[{"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/media?parent=64426"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/categories?post=64426"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mailinvest.blog\/index.php\/wp-json\/wp\/v2\/tags?post=64426"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}