{"id":683013,"date":"2025-08-03T18:50:18","date_gmt":"2025-08-03T15:50:18","guid":{"rendered":"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/"},"modified":"2025-08-03T18:50:18","modified_gmt":"2025-08-03T15:50:18","slug":"inside-openais-quest-to-make-ai-do-anything-for-you","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/","title":{"rendered":"Inside OpenAI\u2019s quest to make AI do anything for you"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a25b3cf5e508\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a25b3cf5e508\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link 
ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/#The_reinforcement_learning_renaissance\" >The reinforcement learning renaissance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/#Scaling_reasoning\" >Scaling reasoning<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/#What_does_it_mean_for_an_AI_to_%E2%80%9Creason%E2%80%9D\" >What does it mean for an AI to \u201creason?\u201d<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/inside-openais-quest-to-make-ai-do-anything-for-you\/#The_next_frontier_AI_agents_for_subjective_tasks\" >The next frontier: AI agents for subjective tasks<\/a><\/li><\/ul><\/nav><\/div>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Shortly after Hunter Lightman joined OpenAI as a researcher in 2022, he watched his colleagues launch ChatGPT, one of the fastest-growing products ever. 
Meanwhile, Lightman quietly worked on a team teaching OpenAI\u2019s models to solve high school math competitions.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Today that team, known as MathGen, is considered instrumental to OpenAI\u2019s industry-leading effort to create AI reasoning models: the core <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/technology\/\" data-internallinksmanager029f6b8e52c=\"4\" title=\"Technology\" target=\"_blank\" rel=\"noopener\">technology<\/a> behind AI agents that can do tasks on a computer like a human would.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe were trying to make the models better at mathematical reasoning, which at the time they weren\u2019t very good at,\u201d Lightman told TechCrunch, describing MathGen\u2019s early work.<\/p>\n<p class=\"wp-block-paragraph\">OpenAI\u2019s models are far from perfect today \u2014 the company\u2019s latest AI systems still hallucinate and its agents <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/www.wired.com\/story\/browser-haunted-by-ai-agents\/\">struggle with complex tasks.<\/a><\/p>\n<p class=\"wp-block-paragraph\">But its state-of-the-art models have improved significantly on mathematical reasoning. One of OpenAI\u2019s models recently won a gold medal at the International Math Olympiad, a math competition for the world\u2019s brightest high school students. 
OpenAI believes these reasoning capabilities will translate to other subjects, and ultimately power <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/general\/\" data-internallinksmanager029f6b8e52c=\"3\" title=\"General\" target=\"_blank\" rel=\"noopener\">general<\/a>-purpose agents that the company has always dreamed of building.<\/p>\n<p class=\"wp-block-paragraph\">ChatGPT was a happy accident \u2014 a low-key research preview turned viral consumer business \u2014 but OpenAI\u2019s agents are the product of a years-long, deliberate effort within the company.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cEventually, you\u2019ll just ask the computer for what you need and it\u2019ll do all of these tasks for you,\u201d said OpenAI CEO Sam Altman at the company\u2019s first developer conference in 2023. \u201cThese capabilities are often talked about in the AI field as agents. 
The upsides of this are going to be tremendous.\u201d<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"382\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?w=680\" alt=\"OpenAI CEO Sam Altman speaks during the OpenAI DevDay event on November 06, 2023 in San Francisco, California.\" class=\"wp-image-2625318\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg 2573w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=150,84 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=300,169 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=768,432 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=680,382 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=1200,675 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=1280,720 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=430,242 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=720,405 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=900,506 900w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=800,450 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=1536,864 1536w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=2048,1152 2048w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=668,375 668w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=1097,617 1097w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=708,398 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778704897.jpg?resize=50,28 50w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">OpenAI CEO Sam Altman speaks during the OpenAI DevDay event on November 06, 2023 in San Francisco, California.(Photo by Justin Sullivan\/Getty Images)<\/span><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Justin Sullivan \/ Getty Images<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Whether agents will meet Altman\u2019s vision remains to be seen, but OpenAI shocked the world with the release of its first AI reasoning model, o1, in the fall of 2024. Less than a year later, the 21 foundational researchers behind that breakthrough are the most highly sought-after talent in Silicon Valley.<\/p>\n<p class=\"wp-block-paragraph\">Mark Zuckerberg recruited five of the o1 researchers to work on Meta\u2019s new superintelligence-focused unit, offering some compensation packages north of $100 million. 
One of them, Shengjia Zhao, was recently named chief scientist of Meta Superintelligence Labs.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-the-reinforcement-learning-renaissance\"><span class=\"ez-toc-section\" id=\"The_reinforcement_learning_renaissance\"><\/span>The reinforcement learning renaissance<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The rise of OpenAI\u2019s reasoning models and agents is tied to a machine learning training technique known as reinforcement learning (RL). RL gives an AI model feedback, in simulated environments, on whether its choices were correct.<\/p>\n<p class=\"wp-block-paragraph\">RL has been used for decades. For instance, in 2016, about a year after OpenAI was founded, AlphaGo, an AI system created by Google DeepMind using RL, gained global attention after beating a world champion in the board <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/game\/\" data-internallinksmanager029f6b8e52c=\"7\" title=\"Game\" target=\"_blank\" rel=\"noopener\">game<\/a> Go.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"436\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?w=680\" alt=\"\" class=\"wp-image-3033346\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=150,96 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=300,193 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=768,493 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=680,436 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=430,276 430w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=720,462 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=900,578 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=800,513 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=668,429 668w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=584,375 584w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=962,617 962w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=708,454 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-515358462.jpg?resize=50,32 50w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">South Korean professional Go player Lee Se-Dol (R) prepares for his fourth match against Google\u2019s artificial intelligence program, AlphaGo, during the Google DeepMind Challenge Match on March 13, 2016 in Seoul, South Korea. Lee Se-dol played a five-game match against a computer program developed by a Google, AlphaGo.  (Photo by Google via Getty Images)<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Around that time, one of OpenAI\u2019s first employees, Andrej Karpathy, began pondering how to leverage RL to create an AI agent that could use a computer. 
But it would take years for OpenAI to develop the necessary models and training techniques.<\/p>\n<p class=\"wp-block-paragraph\">By 2018, OpenAI had pioneered its first large language model in the GPT <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/watch-movies-tv-seriess\/\" data-internallinksmanager029f6b8e52c=\"8\" title=\"Watch Movies &amp; TV Series\" target=\"_blank\" rel=\"noopener\">series<\/a>, pretrained on massive amounts of internet data using large clusters of GPUs. GPT models excelled at text processing, eventually leading to ChatGPT, but struggled with basic math.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">It took until 2023 for OpenAI to achieve a breakthrough, initially dubbed \u201cQ*\u201d and then \u201cStrawberry,\u201d by combining LLMs, RL, and a technique called test-time computation. The last of these gave the models extra time and computing power to plan and work through problems, verifying their steps, before providing an answer. <\/p>\n<p class=\"wp-block-paragraph\">This allowed OpenAI to introduce a new approach called \u201cchain-of-thought\u201d (CoT), which improved AI\u2019s performance on math questions the models hadn\u2019t seen before.<\/p>\n<p class=\"wp-block-paragraph\">\u201cI could see the model starting to reason,\u201d said El Kishky. \u201cIt would notice mistakes and backtrack, it would get frustrated. It really felt like reading the thoughts of a person.\u201d\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Though individually these techniques weren\u2019t novel, OpenAI uniquely combined them to create Strawberry, which directly led to the development of o1. OpenAI quickly identified that the planning and fact-checking abilities of AI reasoning models could be useful to power AI agents.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe had solved a problem that I had been banging my head against for a couple of years,\u201d said Lightman. 
\u201cIt was one of the most exciting moments of my research career.\u201d<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-scaling-reasoning\"><span class=\"ez-toc-section\" id=\"Scaling_reasoning\"><\/span>Scaling reasoning<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">With AI reasoning models, OpenAI determined it had two new axes that would allow it to improve AI models: using more computational power during the post-training of AI models, and giving AI models more time and processing power while answering a question.<\/p>\n<p class=\"wp-block-paragraph\">\u201cOpenAI, as a company, thinks a lot about not just the way things are, but the way things are going to scale,\u201d said Lightman.<\/p>\n<p class=\"wp-block-paragraph\">Shortly after the 2023 Strawberry breakthrough, OpenAI spun up an \u201cAgents\u201d team led by OpenAI researcher Daniel Selsam to make further progress on this new paradigm, two sources told TechCrunch. Although the team was called \u201cAgents,\u201d\u00a0 OpenAI didn\u2019t initially differentiate between reasoning models and agents as we think of them today. 
The company just wanted to make AI systems capable of completing complex tasks.<\/p>\n<p class=\"wp-block-paragraph\">Eventually, the work of Selsam\u2019s Agents team became part of a larger project to develop the o1 reasoning model, with leaders including OpenAI co-founder Ilya Sutskever, chief research officer Mark Chen, and chief scientist Jakub Pachocki.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"453\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?w=680\" alt=\"Ilya Sutskever, Russian Israeli-Canadian computer scientist and co-founder and Chief Scientist of OpenAI.\" class=\"wp-image-2797714\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg 1080w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=150,100 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=300,200 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=768,512 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=680,453 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=430,287 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=720,480 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=900,600 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=800,533 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=668,445 668w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=563,375 563w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=926,617 926w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=708,472 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/019A7E29-3482-4F68-9A29-624BA22F0334.jpeg?resize=50,33 50w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">Ilya Sutskever, Russian Israeli-Canadian computer scientist and co-founder and Chief Scientist of OpenAI, speaks at Tel Aviv University in Tel Aviv on June 5, 2023. (Photo by JACK GUEZ \/ AFP)<\/span><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Getty Images<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">OpenAI would have to divert precious resources \u2014 mainly talent and GPUs \u2014 to create o1. Throughout OpenAI\u2019s history, researchers have had to negotiate with company leaders to obtain resources; demonstrating breakthroughs was a surefire way to secure them.<\/p>\n<p class=\"wp-block-paragraph\">\u201cOne of the core components of OpenAI is that everything in research is bottom up,\u201d said Lightman. \u201cWhen we showed the evidence [for o1], the company was like, \u2018This makes sense, let\u2019s push on it.\u2019\u201d<\/p>\n<p class=\"wp-block-paragraph\">Some former employees say that the startup\u2019s mission to develop AGI was the key factor in achieving breakthroughs around AI reasoning models. 
By focusing on developing the smartest-possible AI models, rather than products, OpenAI was able to prioritize o1 above other efforts.\u00a0That type of large investment in ideas wasn\u2019t always possible at competing AI labs.<\/p>\n<p class=\"wp-block-paragraph\">The decision to try new training methods proved prescient. By late 2024, several leading AI labs started seeing diminishing returns on models created through traditional pretraining scaling. Today, much of the AI field\u2019s momentum comes from advances in reasoning models.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-what-does-it-mean-for-an-ai-to-reason\"><span class=\"ez-toc-section\" id=\"What_does_it_mean_for_an_AI_to_%E2%80%9Creason%E2%80%9D\"><\/span><strong>What does it mean for an AI to \u201creason?\u201d<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">In many ways, the goal of AI research is to recreate human intelligence with computers. Since the launch of o1, ChatGPT\u2019s UX has been filled with more human-sounding features such as \u201cthinking\u201d and \u201creasoning.\u201d<\/p>\n<p class=\"wp-block-paragraph\">When asked whether OpenAI\u2019s models were truly reasoning, El Kishky hedged, saying he thinks about the concept in terms of computer <a href=\"https:\/\/buradabiliyorum.com\/en\/category\/sciencee\/\" data-internallinksmanager029f6b8e52c=\"5\" title=\"Science\" target=\"_blank\" rel=\"noopener\">science<\/a>. <\/p>\n<p class=\"wp-block-paragraph\">\u201cWe\u2019re teaching the model how to efficiently expend compute to get an answer. 
So if you define it that way, yes, it is reasoning,\u201d said El Kishky.<\/p>\n<p class=\"wp-block-paragraph\">Lightman takes the approach of focusing on the model\u2019s results and not as much on the means or their relation to human brains.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"454\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?w=680\" alt=\"The OpenAI logo on screen at their developer day stage.\" class=\"wp-image-2625161\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg 1500w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=150,100 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=300,200 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=768,513 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=680,454 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=1200,802 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=1280,855 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=430,287 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=720,481 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=900,601 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=800,534 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=668,446 668w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=561,375 561w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=924,617 924w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=708,473 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/img_5619.jpg?resize=50,33 50w\" 
sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">The OpenAI logo on screen at their developer day stage. (Credit: Devin Coldewey)<\/span><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Devin Coldewey<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">\u201cIf the model is doing hard things, then it is doing whatever necessary approximation of reasoning it needs in order to do that,\u201d said Lightman. \u201cWe can call it reasoning, because it looks like these reasoning traces, but it\u2019s all just a proxy for trying to make AI tools that are really powerful and useful to a lot of people.\u201d<\/p>\n<p class=\"wp-block-paragraph\">OpenAI\u2019s researchers note people may disagree with their nomenclature or definitions of reasoning \u2014 and surely, <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/garymarcus.substack.com\/p\/a-knockout-blow-for-llms?r=8tdk6&amp;utm_campaign=post&amp;utm_medium=web&amp;triedRedirect=true\">critics have emerged<\/a> \u2014 but they argue the terminology is less important than the capabilities of their models. Other AI researchers tend to agree.<\/p>\n<p class=\"wp-block-paragraph\">Nathan Lambert, an AI researcher with the non-profit AI2, compares AI reasoning models to airplanes in a <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/www.interconnects.ai\/p\/the-rise-of-reasoning-machines\">blog post<\/a>. Both, he says, are manmade systems inspired by nature \u2014 human reasoning and bird flight, respectively \u2014 but they operate through entirely different mechanisms. 
That doesn\u2019t make them any less useful, or any less capable of achieving similar outcomes.<\/p>\n<p class=\"wp-block-paragraph\">A group of AI researchers from OpenAI, Anthropic, and Google DeepMind agreed in a recent position paper that AI reasoning models are not well understood today, and that more research is needed. It may be too early to confidently claim what exactly is going on inside them.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-the-next-frontier-ai-agents-for-subjective-tasks\"><span class=\"ez-toc-section\" id=\"The_next_frontier_AI_agents_for_subjective_tasks\"><\/span><strong>The next frontier: AI agents for subjective tasks<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The AI agents on the market today work best for well-defined, verifiable domains such as coding. OpenAI\u2019s Codex agent aims to help software engineers offload simple coding tasks. Meanwhile, Anthropic\u2019s models have become particularly popular in AI coding tools like Cursor and Claude Code \u2014 these are some of the first AI agents that people are willing to pay for.<\/p>\n<p class=\"wp-block-paragraph\">However, general-purpose AI agents like OpenAI\u2019s ChatGPT Agent and Perplexity\u2019s Comet struggle with many of the complex, subjective tasks people want to automate. When trying to use these tools for online shopping or finding a long-term parking spot, I\u2019ve found the agents take longer than I\u2019d like and make silly mistakes.<\/p>\n<p class=\"wp-block-paragraph\">Agents are, of course, early systems that will undoubtedly improve. 
But researchers must first figure out how to better train the underlying models to complete tasks that are more subjective.<\/p>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"453\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?w=680\" alt=\"\" class=\"wp-image-3033358\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg 8256w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=150,100 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=300,200 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=768,512 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=680,453 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=1200,800 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=1280,853 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=430,287 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=720,480 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=900,600 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=800,533 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=1536,1024 1536w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=2048,1365 2048w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=668,445 668w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=563,375 563w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=926,617 926w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=708,472 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/GettyImages-2197821080.jpg?resize=50,33 50w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">AI applications (Photo by Jonathan Raa\/NurPhoto via Getty Images)<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">\u201cLike many problems in machine learning, it\u2019s a data problem,\u201d said Lightman, when asked about the limitations of agents on subjective tasks. \u201cSome of the research I\u2019m really excited about right now is figuring out how to train on less verifiable tasks. We have some leads on how to do these things.\u201d\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Noam Brown, an OpenAI researcher who helped create the IMO model and o1, told TechCrunch that OpenAI has new general-purpose RL techniques which allow them to teach AI models skills that aren\u2019t easily verified. This was how the company built the model which achieved a gold medal at IMO, he said.<\/p>\n<p class=\"wp-block-paragraph\">OpenAI\u2019s IMO model was a newer AI system that spawns multiple agents, which then simultaneously explore several ideas, and then choose the best possible answer. These types of AI models are becoming more popular; Google and xAI have recently released state-of-the-art models using this technique.<\/p>\n<p class=\"wp-block-paragraph\">\u201cI think these models will become more capable at math, and I think they\u2019ll get more capable in other reasoning areas as well,\u201d said Brown. \u201cThe progress has been incredibly fast. 
I don\u2019t see any reason to think it will slow down.\u201d<\/p>\n<p class=\"wp-block-paragraph\">These techniques may help OpenAI\u2019s models become more performant, gains that could show up in the company\u2019s upcoming GPT-5 model. OpenAI hopes to assert its dominance over competitors with the launch of GPT-5, ideally offering the <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/www.theinformation.com\/articles\/inside-openais-rocky-path-gpt-5?rc=dp0mql\">best AI model<\/a> to power agents for developers and consumers. <\/p>\n<p class=\"wp-block-paragraph\">But the company also wants to make its products simpler to use. El Kishky says OpenAI wants to develop AI agents that intuitively understand what users want, without requiring them to select specific settings. He says OpenAI aims to build AI systems that understand when to call up certain tools, and how long to reason for.<\/p>\n<p class=\"wp-block-paragraph\">These ideas paint a picture of an ultimate version of ChatGPT: an agent that can do anything on the internet for you, and understands how you want it to be done. That\u2019s a very different product from what ChatGPT is today, but the company\u2019s research is squarely headed in this direction.<\/p>\n<p class=\"wp-block-paragraph\">While OpenAI undoubtedly led the AI industry a few years ago, the company now faces a host of worthy opponents. The question is no longer just whether OpenAI can deliver its agentic future, but whether it can do so before Google, Anthropic, xAI, or Meta gets there first.<\/p>\n<\/div>\n
<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/08\/03\/inside-openais-quest-to-make-ai-do-anything-for-you\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Shortly after Hunter Lightman joined OpenAI as a researcher in 2022, he watched his colleagues launch ChatGPT, one of the fastest-growing products ever. 
Meanwhile, Lightman quietly worked on a team teaching OpenAI\u2019s models to solve high school math competitions.\u00a0 Today that team, known as MathGen, is considered instrumental to OpenAI\u2019s industry-leading effort to create AI&#8230;<\/p>\n","protected":false},"author":1,"featured_media":683014,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/11\/GettyImages-1778705142.jpg?resize=1200,758","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,92249,138467,61594,141199],"class_list":["post-683013","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-agents","tag-chatgpt","tag-exclusive","tag-openai"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/683013","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=683013"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/683013\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/683014"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=683013"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=683013"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=683013"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":tr
ue}]}}