{"id":730158,"date":"2026-05-29T02:40:16","date_gmt":"2026-05-28T23:40:16","guid":{"rendered":"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/"},"modified":"2026-05-29T02:40:16","modified_gmt":"2026-05-28T23:40:16","slug":"anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/","title":{"rendered":"Anthropic&#8217;s Claude Opus 4.8 is four times more honest, Mythos next"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_85 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a3dcb60dc2c9\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a3dcb60dc2c9\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><ul 
class='ez-toc-list-level-4' ><li class='ez-toc-heading-level-4'><ul class='ez-toc-list-level-4' ><li class='ez-toc-heading-level-4'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#TLDR\" >TL;DR<\/a><\/li><\/ul><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#Benchmark_gains_across_the_board\" >Benchmark gains across the board<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#Early_testers_see_practical_gains\" >Early testers see practical gains<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#New_features_alongside_the_model\" >New features alongside the model<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#Mythos_is_the_bigger_story\" >Mythos is the bigger story<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#A_company_approaching_1_trillion\" >A company approaching $1 trillion<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/buradabiliyorum.com\/en\/anthropics-claude-opus-4-8-is-four-times-more-honest-mythos-next\/#The_competitive_context\" >The competitive 
context<\/a><\/li><\/ul><\/nav><\/div>\n<p><img decoding=\"async\" src=\"https:\/\/media.thenextweb.com\/2026\/05\/claude-opus-4-8.avif\" \/><\/p>\n<div id=\"article-main-content\">\n<p><em><\/p>\n<div class=\"postContent-tldr\">\n<h4 class=\"postContent-offsetTitle\"><span class=\"ez-toc-section\" id=\"TLDR\"><\/span>TL;DR<span class=\"ez-toc-section-end\"><\/span><\/h4>\n<p>Anthropic has released Claude Opus 4.8, an upgrade to its flagship AI model that is four times less likely to let code flaws pass unremarked. The company also teased Mythos-class models, which have already found more than 10,000 high- or critical-severity software vulnerabilities through Project Glasswing, and announced a $65 billion Series H round at a $965 billion post-money valuation.<\/p>\n<\/div>\n<p><\/em><\/p>\n<p>Anthropic has released <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.anthropic.com\/news\/claude-opus-4-8\" target=\"_blank\" rel=\"nofollow noopener\">Claude Opus 4.8<\/a>, an upgrade to its flagship AI model that the company says is more honest, more reliable in agentic tasks, and better at catching its own mistakes. The model is available immediately at the same price as its predecessor, $5 per million input tokens and $25 per million output tokens, and is\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.anthropic.com\/news\/claude-opus-4-8\" target=\"_blank\" rel=\"nofollow noopener\">rolling out across all Anthropic products<\/a>\u00a0including claude.ai, Claude Code, and the API.<\/p>\n<p>The headline improvement is honesty. 
Anthropic says Opus 4.8 is around four times less likely than Opus 4.7 to let flaws in code it has written pass unremarked. Early testers report the model is more willing to flag uncertainties about its work and less likely to make unsupported claims, a persistent problem across AI models that tend to project confidence regardless of whether it is warranted.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Benchmark_gains_across_the_board\"><\/span>Benchmark gains across the board<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Opus 4.8 improves on its predecessor across Anthropic\u2019s published benchmarks. On agentic coding (Terminal-Bench 2.1), the score rises from 64.3% to 69.2%. Multidisciplinary reasoning with tools improves from 54.7% to 57.9%. Agentic computer use moves from 82.8% to 83.4%, and knowledge work scores rise from 1,753 to 1,890.<\/p>\n<p>Anthropic\u2019s alignment assessment found that Opus 4.8 reaches new highs on measures of prosocial traits, including supporting user autonomy and acting in the user\u2019s best interest. 
Rates of misaligned behaviour such as deception or cooperation with misuse are substantially lower than in Opus 4.7, and comparable to Claude Mythos Preview, Anthropic\u2019s best-aligned model.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Early_testers_see_practical_gains\"><\/span>Early testers see practical gains<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The release is accompanied by endorsements from companies already using the model.\u00a0Cognition, the company behind the AI coding agent Devin, said Opus 4.8 uses tools cleanly and fixes comment-verbosity and tool-calling issues that appeared in Opus 4.7. Cursor, the AI-powered code editor, reported improvements across every effort level on its CursorBench evaluation.<\/p>\n<p>Harvey, which builds AI for legal work, said Opus 4.8 delivers the highest score recorded on its Legal Agent Benchmark and is the first model to break 10% overall on the all-pass standard. Databricks reported that Opus 4.8 handles deeper multistep questions faster in its Genie AI agent, at 61% lower token cost than Opus 4.7.<\/p>\n<p>Thomson Reuters said CoCounsel Legal saw meaningful improvements in consistency and reasoning quality. Hebbia, which builds AI for financial document analysis, noted better citation precision and greater token efficiency on retrieval tasks.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"New_features_alongside_the_model\"><\/span>New features alongside the model<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Anthropic is launching several features alongside Opus 4.8. A new effort control in claude.ai and Cowork lets users choose how much computation Claude applies to a response, trading speed against quality. 
Claude Code gains a dynamic workflows feature that allows it to plan work and run hundreds of parallel subagents in a single session, enabling codebase-scale migrations across hundreds of thousands of lines of code.<\/p>\n<p>For developers, the Messages API now accepts system entries inside the messages array, allowing instructions to be updated mid-task without breaking the prompt cache. Fast mode for Opus 4.8, which runs at 2.5 times the speed, is now three times cheaper than it was for previous models.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Mythos_is_the_bigger_story\"><\/span>Mythos is the bigger story<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The more significant announcement may be what comes next. Anthropic said it plans to release a new class of model with higher intelligence than Opus, based on the Claude Mythos architecture. A small number of organisations are already using Claude Mythos Preview through\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.helpnetsecurity.com\/2026\/05\/26\/anthropic-project-glasswing-update\/\" target=\"_blank\" rel=\"nofollow noopener\">Project Glasswing<\/a>, an initiative focused on using the model for cybersecurity work. Anthropic and roughly 50 partners, including Apple, Google, Microsoft, and Amazon Web Services, have used Mythos Preview to find more than 10,000 high- or critical-severity vulnerabilities across critical software infrastructure.<\/p>\n<p>Mythos-class models require stronger cyber safeguards before general release, Anthropic said, but the company expects to bring them to all customers in the coming weeks. 
The model sits a full capability tier above Opus 4.7 and can autonomously find zero-day vulnerabilities and create exploits for them, which explains both the excitement and the caution around its deployment.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"A_company_approaching_1_trillion\"><\/span>A company approaching $1 trillion<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>The Opus 4.8 launch arrives as Anthropic\u2019s valuation continues to climb. The company announced a $65 billion Series H round at a $965 billion post-money valuation on the same day, up from the $380 billion valuation at which it\u00a0closed its $30 billion Series G\u00a0in February. Revenue has grown from roughly $1 billion at the end of 2024 to an estimated $30 billion annualised run rate in 2026, driven by enterprise adoption of Claude.<\/p>\n<p>Anthropic also\u00a0opened a new office in Milan\u00a0on 28 May, its sixth in Europe, and appointed KiYoung Choi as Representative Director of Korea ahead of a Seoul office opening. The expansion reflects growing demand for Claude in enterprise markets outside the United States.<\/p>\n<h2><span class=\"ez-toc-section\" id=\"The_competitive_context\"><\/span>The competitive context<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p>Opus 4.8 enters a market where the pace of model releases has accelerated sharply.\u00a0OpenAI launched GPT-5.5\u00a0as its first fully retrained base model since GPT-4.5, and\u00a0GPT-5.4 set new records\u00a0on professional benchmarks earlier this year. Google has invested up to $40 billion in Anthropic but continues to develop its own Gemini models. The frontier AI market has consolidated into a three-way race between Anthropic, OpenAI, and Google, with each company releasing incremental model upgrades at an increasing pace.<\/p>\n<p>For Anthropic, the distinction it is trying to draw with Opus 4.8 is not raw capability but reliability. 
A model that catches its own mistakes, flags its uncertainties, and follows instructions consistently is more useful in agentic workflows where AI systems operate with limited human oversight. Whether that positioning holds as Mythos-class models arrive, promising higher intelligence with new safety constraints, will determine whether Anthropic can maintain its lead in the enterprise market it has worked to dominate.<\/p>\n<\/p><\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/thenextweb.com\/news\/anthropics-claude-opus-4-8-is-its-most-honest-ai-model-yet-and-mythos-is-coming-in-weeks\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>TL;DR Anthropic has released Claude Opus 4.8, an upgrade to its flagship AI model that is four times less likely to let code flaws pass unremarked. 
The company also teased Mythos-class models, which have already found more than 10,000 critical software vulnerabilities through Project Glasswing, and announced a $65 billion Series H round at a&#8230;<\/p>\n","protected":false},"author":1,"featured_media":730159,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/media.thenextweb.com\/2026\/05\/claude-opus-4-8.avif","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-730158","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/730158","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=730158"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/730158\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/730159"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=730158"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=730158"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=730158"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}