{"id":729472,"date":"2026-05-25T16:00:29","date_gmt":"2026-05-25T13:00:29","guid":{"rendered":"https:\/\/buradabiliyorum.com\/en\/microsofts-quiet-claude-code-retreat-and-the-real-cost-of-enterprise-ai\/"},"modified":"2026-05-25T16:00:29","modified_gmt":"2026-05-25T13:00:29","slug":"microsofts-quiet-claude-code-retreat-and-the-real-cost-of-enterprise-ai","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/microsofts-quiet-claude-code-retreat-and-the-real-cost-of-enterprise-ai\/","title":{"rendered":"Microsoft\u2019s quiet Claude Code retreat and the real cost of enterprise AI"},"content":{"rendered":"<div id=\"article-main-content\">\n<p>In December of last year, Microsoft told thousands of its engineers, product managers and designers that they could use Claude Code, Anthropic\u2019s command-line coding agent, on the company dime.<\/p>\n<p>By spring, the tool had spread well beyond engineering: into the kind of non-technical roles that, in earlier waves of enterprise software, would have waited years for a seat. Inside Microsoft, the rollout was framed as a learning exercise. Outside it, the surface signal was simpler.<\/p>\n<p>The world\u2019s largest software company, the one with its own foundation models and its own coding assistant, had just paid a competitor to put a rival product in front of its workforce.<\/p>\n<p>Six months later, that experiment is being wound down. 
According to\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.windowscentral.com\/microsoft\/microsoft-cancels-claude-code-licenses-shifting-developers-to-github-copilot-cli-a-move-likely-driven-by-financial-motives\" target=\"_blank\" rel=\"nofollow noopener\">reporting in Windows Central<\/a>\u00a0and other outlets following The Verge\u2019s original scoop, Microsoft is cancelling most direct Claude Code licences inside its Experiences and Devices group, the division that builds Windows, Microsoft 365, Outlook, Teams and Surface.<\/p>\n<p>Affected engineers have been told to migrate to GitHub Copilot CLI by 30 June, the last day of Microsoft\u2019s fiscal year. The official reason is toolchain unification. The unofficial reason is in the calendar.<\/p>\n<p>The Claude pullback is the most credible signal yet that the unit economics of enterprise AI coding do not, at current token prices, work. Not because the tools are bad. The opposite: they are good enough that engineers use them constantly, and the constant use is what breaks the maths.<\/p>\n<p>The clearest evidence is at Uber, which is not Microsoft and does not have Microsoft\u2019s financial cushion. 
Praveen Neppalli Naga, Uber\u2019s chief technology officer, told The Information in April that the company had burned through its entire planned 2026 AI coding budget in four months.<\/p>\n<p>By March,\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/aimagazine.com\/news\/why-uber-has-already-burned-through-its-ai-budget\" target=\"_blank\" rel=\"nofollow noopener\">Naga\u2019s own figures<\/a>\u00a0had Claude Code use jumping from 32 per cent to 84 per cent of his roughly 5,000-engineer organisation. Individual engineers were spending between $500 and $2,000 a month on tokens. Around 70 per cent of code committed at Uber now originates with AI, and on the order of one in ten live backend updates is shipped by an agent with no human in the loop.<\/p>\n<p><em>\u201cI\u2019m back to the drawing board,\u201d<\/em> Naga said, <em>\u201cbecause the budget I thought I would need is blown away already.\u201d<\/em><\/p>\n<p>That sentence is the whole story in miniature. The forecast was wrong because the variable being forecast, token consumption, behaves nothing like the licences and seats that finance teams know how to model. A traditional enterprise software deal is denominated in users.<\/p>\n<p>A token-priced deal is denominated in how much the model has to think. Agentic coding makes the model think a lot. Sessions run for hours, spawn parallel threads and generate volumes of context that bear no resemblance to the autocomplete interactions that shaped the original pricing structure.<\/p>\n<p>We have been tracking this fracture\u00a0for months. 
In November, GitHub paused new Copilot Pro and Pro+ sign-ups because the agentic workloads of paying customers were generating costs that exceeded their monthly plan price.<\/p>\n<p>Cost structures built for lightweight assistance, the company conceded, no longer held.<\/p>\n<p>This is not an Uber problem or a Microsoft problem. It is an industry condition. Bryan Catanzaro, vice-president of applied deep learning at Nvidia, told\u00a0Axios in April\u00a0that, for his team, the cost of compute is now far beyond the cost of the employees using it.<\/p>\n<p>This is the chip company saying it. Fortune followed in May with reporting that <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/fortune.com\/2026\/05\/22\/microsoft-ai-cost-problem-tokens-agents\/\" target=\"_blank\" rel=\"nofollow noopener\">token-based AI tooling<\/a>, when used heavily, can cost more per task than the human engineer it was supposed to augment.<\/p>\n<p>A 2024 MIT analysis circulated widely in finance circles since then suggests that, on current pricing, AI automation pencils out as cheaper than human labour for roughly a quarter of the jobs people thought it would replace.<\/p>\n<p>Set that against the spend forecasts. 
Gartner expects worldwide AI spending to reach $2.5 trillion this year, up 69 per cent on 2025.<\/p>\n<p>The same firm now places generative AI squarely in what it calls the trough of disillusionment, predicting in a\u00a0<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.gartner.com\/en\/newsroom\/press-releases\/2026-05-05-gartner-says-autonomous-business-and-artificial-intelligence-layoffs-may-create-budget-room-but-do-not-deliver-returns\" target=\"_blank\" rel=\"nofollow noopener\">May press release<\/a>\u00a0that 25 per cent of planned 2026 AI budget will slip into 2027 as proofs of concept die in the procurement pipeline.<\/p>\n<p>A separate Gartner read from April found that only 28 per cent of AI infrastructure projects fully deliver against their business case. That is not the curve of a technology going through an awkward adolescence. That is the curve of a market repricing itself.<\/p>\n<p>Microsoft\u2019s retreat lands inside this repricing, and not by accident. There are two ways to read the move. The first is the one Microsoft has briefed: that Copilot CLI is the strategic destination, that engineers will continue to have access to Claude models inside Copilot, and that the company simply wants a product it can shape directly with GitHub. That story is true.<\/p>\n<p>It is also a story that Microsoft could have told at any point in the past six months and chose not to. What changed was not the strategic logic. What changed was the bill.<\/p>\n<p>The second reading is harder to discount. Microsoft is uniquely positioned to know what enterprise-scale Claude usage actually costs, because its own engineers were the heaviest users outside Anthropic\u2019s customer base. Inside Experiences and Devices, Claude Code had become, by several accounts, the preferred tool.<\/p>\n<p>If the maths had improved with scale, this would be the moment Microsoft locked in a multi-year deal at favourable terms. 
Instead, it is unwinding the experiment in a window that conveniently closes the books on a fiscal year.<\/p>\n<p>When the company with the most leverage in the room walks away from a vendor whose product its own staff prefer, the signal is not about preference.<\/p>\n<p>Whether this constitutes a bubble depends on definitions. Token-level pricing will fall, as it has fallen by roughly a factor of ten every eighteen months for the past three years. The more interesting question is whether per-task token consumption falls faster than per-token cost.<\/p>\n<p>The evidence so far runs the other way. Each generation of agentic system, by design, consumes more tokens per unit of work, because it reasons longer, plans more elaborately and verifies itself against the world.<\/p>\n<p>Anthropic\u2019s own infrastructure team has spoken publicly about reasoning workloads generating an order of magnitude more compute per query than chat. That is the bet baked into the next twelve months of model releases. It is also the bet that put Uber back at the drawing board.<\/p>\n<p>There is a worked example in TNW\u2019s own coverage. In April,\u00a0Anthropic banned a popular open-source agentic framework\u00a0called OpenClaw from running on consumer Claude subscriptions, after discovering that single instances could chew through the equivalent of $1,000 to $5,000 in API costs in a day of autonomous operation. The framework was running on a $200-a-month Max plan.<\/p>\n<p>The economic transfer was so blatant that Anthropic had to write a new clause into its terms of service. Multiply that pattern across a Fortune 500 engineering organisation, and you have the Uber budget memo.<\/p>\n<p>The counterargument is real and worth stating. Compared with the cost of an additional senior engineer, the cost of a working AI coding agent is, even at current prices, often favourable on a per-feature basis. The productivity uplift is documented; the substitution is happening. 
What is breaking is not the value proposition.<\/p>\n<p>It is the procurement model. Companies that signed up for a productivity tool are discovering they signed up for a metered utility, and the meter runs when nobody is looking. The fix may be straightforward: capped budgets per engineer, tiered access for high-leverage roles, agent runtime quotas.<\/p>\n<p>Many of the larger buyers are already there. But the implication is that the era of \u201cgive every employee a Claude Code seat\u201d is closing, and what replaces it will look more like AWS billing than like Office licences.<\/p>\n<p>That is what Microsoft\u2019s quiet email to its Windows and Surface teams really announces. Not the end of AI coding. Not even the end of Anthropic at Microsoft, given that Claude models will continue to be reachable through Copilot CLI.<\/p>\n<p>It announces the end of the experimental phase, the phase in which the world\u2019s largest software companies were willing to absorb arbitrary token costs in exchange for learning. The learning is done.<\/p>\n<p>What comes next is the harder part. Enterprises will keep buying AI coding tools, because the productivity is real and the competitive pressure is unforgiving. But they will buy them the way they buy electricity, with usage caps, with shadow meters, with a finance team in the room.<\/p>\n<p>Somewhere in a Microsoft conference room earlier this spring, someone looked at a Claude Code invoice and did the arithmetic against a Copilot CLI roadmap, and made a decision.<\/p>\n<p>The same arithmetic is now being done in every CFO\u2019s office that bought into the December 2025 rollout. The retreat will not be loud. 
It will be a series of fiscal-year-end emails, sent on a deadline nobody noticed until the budget was already gone.<\/p>\n<\/p><\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/thenextweb.com\/news\/microsoft-claude-code-retreat-ai-cost\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In December of last year, Microsoft told thousands of its engineers, product managers and designers that they could use Claude Code, Anthropic\u2019s command-line coding agent, on the company dime. 
By spring, the tool had spread well beyond engineering: into the kind of non-technical roles that, in earlier waves of enterprise software, would have waited years&#8230;<\/p>\n","protected":false},"author":1,"featured_media":729473,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/media.thenextweb.com\/2026\/04\/microsoft-voluntary-retirement-us-workers-ai.avif","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-729472","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/729472","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=729472"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/729472\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/729473"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=729472"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=729472"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=729472"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}