{"id":653731,"date":"2025-02-16T17:55:16","date_gmt":"2025-02-16T14:55:16","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/open-source-llms-hit-europes-digital-sovereignty-roadmap\/"},"modified":"2025-02-16T17:55:16","modified_gmt":"2025-02-16T14:55:16","slug":"open-source-llms-hit-europes-digital-sovereignty-roadmap","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/open-source-llms-hit-europes-digital-sovereignty-roadmap\/","title":{"rendered":"Open source LLMs hit Europe&#8217;s digital sovereignty roadmap"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Large language models (LLMs) landed on Europe\u2019s digital sovereignty agenda with a bang last week, as news <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/openeurollm.eu\/launch-press-release\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">emerged<\/a> of a new program to develop a series of \u201ctruly\u201d open source LLMs covering all European Union languages.<\/p>\n<p class=\"wp-block-paragraph\">This includes the current 24 official EU languages, as well as languages for countries currently negotiating for entry to the EU market, <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.politico.eu\/article\/albania-begin-eu-accession-talk-enlargement\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">such as Albania<\/a>. Future-proofing is the name of the game.<\/p>\n<p class=\"wp-block-paragraph\"><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/openeurollm.eu\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">OpenEuroLLM<\/a> is a collaboration between some 20 organizations, co-led by <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/ufal.mff.cuni.cz\/jan-hajic\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Jan Ha<\/a><a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/ufal.mff.cuni.cz\/jan-hajic\">ji\u010d<\/a>, a computational linguist from the Charles University in Prague, and <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/fi.linkedin.com\/in\/psarlin\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Peter Sarlin<\/a>, CEO and co-founder of Finnish AI lab Silo AI, which <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.amd.com\/en\/newsroom\/press-releases\/2024-8-12-amd-completes-acquisition-of-silo-ai-to-accelerate.html\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">AMD acquired last year for $665 million<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">The project fits a broader narrative that has seen Europe push digital sovereignty as a priority, enabling it to 
bring mission-critical infrastructure and tools closer to home. Most of the cloud giants are investing in local infrastructure to ensure EU data stays local, while AI darling OpenAI recently unveiled a new offering that allows customers to process and store data in Europe.<\/p>\n<p class=\"wp-block-paragraph\">Elsewhere, the EU recently signed an $11 billion deal to create a sovereign satellite constellation to rival Elon Musk\u2019s Starlink.<\/p>\n<p class=\"wp-block-paragraph\">So OpenEuroLLM is certainly on-brand.<\/p>\n<p class=\"wp-block-paragraph\">However, the <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/digital-strategy.ec.europa.eu\/en\/news\/pioneering-ai-project-awarded-opening-large-language-models-european-languages\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">stated budget<\/a> just for building the models themselves is \u20ac37.4 million, with roughly \u20ac20 million coming from the EU\u2019s <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/digital-strategy.ec.europa.eu\/en\/activities\/digital-programme\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Digital Europe Programme<\/a> \u2014 a drop in the ocean compared to what the giants of the corporate AI world are investing. The actual budget is more when you factor in funding allocated for tangential and related work, and arguably the biggest expense is compute. 
The OpenEuroLLM project\u2019s partners include <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/eurohpc-ju.europa.eu\/supercomputers\/our-supercomputers_en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">EuroHPC<\/a> supercomputer centers in Spain, Italy, Finland, and the Netherlands \u2014 and the broader EuroHPC project has a budget of around \u20ac7 billion.<\/p>\n<p class=\"wp-block-paragraph\">But the sheer number of disparate participating parties, spanning academia, research, and corporations, has led many to <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.linkedin.com\/posts\/alek-tarkowski_publicai-openness-opendata-activity-7292502898838589441-XDNE\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">question whether<\/a> its goals are achievable. Anastasia Stasenko, co-founder of LLM company <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/pleias.fr\/\">Pleias<\/a>, questioned <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/www.linkedin.com\/posts\/anastasia-stasenko_it-is-not-a-response-to-deepseek-it-is-the-activity-7292204865106153473-LSzz\/\">whether<\/a> a \u201csprawling consortia of 20+ organizations\u201d could have the same measured focus as a homegrown private AI firm.<\/p>\n<p class=\"wp-block-paragraph\">\u201cEurope\u2019s recent successes in AI shine through small focused teams like Mistral AI and <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/www.reuters.com\/technology\/artificial-intelligence\/french-genai-startup-lighton-rises-market-debut-2024-11-26\/\">LightOn<\/a> \u2014 companies that truly own what they\u2019re building,\u201d Stasenko wrote. 
\u201cThey carry immediate responsibility for their choices, whether in finances, market positioning, or reputation.\u201d<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-up-to-scratch\"><span class=\"ez-toc-section\" id=\"Up_to_scratch\"><\/span>Up to scratch<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The OpenEuroLLM project is either starting from scratch or it has a head start \u2014 depending on how you look at it.<\/p>\n<p class=\"wp-block-paragraph\">Since 2022, Haji\u010d has also been coordinating the High Performance Language Technologies (<a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/hplt-project.org\/\">HPLT<\/a>) project, which has set out to develop free and reusable datasets, models, and workflows using high-performance computing (HPC). That project is scheduled to end in late 2025, but it can be viewed as a sort of \u201cpredecessor\u201d to OpenEuroLLM, according to Haji\u010d, given that most of the partners on HPLT (aside from the U.K. partners) are participating here, too.<\/p>\n<p class=\"wp-block-paragraph\">\u201cThis [OpenEuroLLM] is really just a broader participation, but more focused on generative LLMs,\u201d Haji\u010d said. \u201cSo it\u2019s not starting from zero in terms of data, expertise, tools, and compute experience. We have assembled people who know what they\u2019re doing \u2014 we should be able to get up to speed quickly.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Haji\u010d said that he expects the first version(s) to be released by mid-2026, with the final iteration(s) arriving by the project\u2019s conclusion in 2028. 
But those goals might still seem lofty when you consider that there isn\u2019t much to poke at yet beyond a bare-bones <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/github.com\/OpenLLM-Europe\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">GitHub profile<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">\u201cIn that respect, we are starting from scratch \u2014 the project started on Saturday [February 1],\u201d Haji\u010d said. \u201cBut we have been preparing the project for a year [the <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/ec.europa.eu\/info\/funding-tenders\/opportunities\/portal\/screen\/opportunities\/topic-details\/digital-2024-ai-06-finetune\">tender<\/a> process opened in February 2024].\u201d<\/p>\n<p class=\"wp-block-paragraph\">From academia and research, organizations spanning Czechia, the Netherlands, Germany, Sweden, Finland, and Norway are part of the OpenEuroLLM cohort, in addition to the EuroHPC centers. From the corporate world, Finland\u2019s AMD-owned AI lab Silo AI is on board, as are Aleph Alpha (Germany), Ellamind (Germany), Prompsit Language Engineering (Spain), and LightOn (France).<\/p>\n<p class=\"wp-block-paragraph\">One notable omission from the list is that of French AI unicorn Mistral, which has positioned itself as an open source alternative to incumbents such as OpenAI. 
While nobody from Mistral responded to TechCrunch for comment, Haji\u010d did confirm that he tried to initiate conversations with the startup, but to no avail.<\/p>\n<p class=\"wp-block-paragraph\">\u201cI tried to approach them, but it hasn\u2019t resulted in a focused discussion about their participation,\u201d Haji\u010d said.<\/p>\n<p class=\"wp-block-paragraph\">The project could still gather new participants as part of the EU program that\u2019s providing funding, though it will be limited to EU organizations. This means that entities from the U.K. and Switzerland won\u2019t be able to take part. This stands in contrast to the Horizon R&amp;D program, which the U.K. rejoined in 2023 after a prolonged Brexit stalemate and which provided funding to HPLT.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-build-up\"><span class=\"ez-toc-section\" id=\"Build_up\"><\/span>Build up<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The project\u2019s top-line goal, as per its tagline, is to create: \u201cA series of foundation models for transparent AI in Europe.\u201d Additionally, these models should preserve the \u201clinguistic and cultural diversity\u201d of all EU languages \u2014 current and future.<\/p>\n<p class=\"wp-block-paragraph\">What this translates to in terms of deliverables is still being ironed out, but it will likely mean a core multilingual LLM designed for general-purpose tasks where accuracy is paramount, along with smaller \u201cquantized\u201d versions, perhaps for edge applications where efficiency and speed are more important. 
<\/p>\n<p class=\"wp-block-paragraph\">\u201cThis is something we still have to make a detailed plan about,\u201d Haji\u010d said. \u201cWe want to have it as small but as high-quality as possible. We don\u2019t want to release something which is half-baked, because from the European point-of-view this is high-stakes, with lots of money coming from the European Commission \u2014 public money.\u201d<\/p>\n<p class=\"wp-block-paragraph\">While the goal is to make the model as proficient as possible in all languages, attaining equality across the board could also be challenging.  <\/p>\n<p class=\"wp-block-paragraph\">\u201cThat is the goal, but how successful we can be with languages with scarce digital resources is the question,\u201d Haji\u010d said. \u201cBut that\u2019s also why we want to have true benchmarks for these languages, and not to be swayed toward benchmarks which are perhaps not representative of the languages and the culture behind them.\u201c<\/p>\n<p class=\"wp-block-paragraph\">In terms of data, this is where a lot of the work from the HPLT project will prove fruitful, with <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/hplt-project.org\/datasets\/v2.0\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">version 2.0<\/a> of its dataset released four months ago. 
This dataset was built from 4.5 petabytes of web crawls and more than 20 billion documents, and Haji\u010d said that they will add further data from <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/commoncrawl.org\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Common Crawl<\/a> (an open repository of web-crawled data) to the mix.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-the-open-source-definition\"><span class=\"ez-toc-section\" id=\"The_open_source_definition\"><\/span>The open source definition<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">In traditional software, the perennial struggle between open source and proprietary revolves around the \u201ctrue\u201d meaning of \u201copen source.\u201d This can be resolved by deferring to the formal \u201cdefinition\u201d as per the Open Source Initiative, the industry steward of what are and aren\u2019t legitimate open source licenses. <\/p>\n<p class=\"wp-block-paragraph\">More recently, the OSI has formed a definition of \u201copen source AI,\u201d though not everyone is happy with the outcome. Open source AI proponents argue that not only models should be freely available, but also the datasets, pretrained models, weights \u2014 the full shebang. The OSI\u2019s definition doesn\u2019t make training data mandatory, because it says AI models are often trained on proprietary data or data with redistribution restrictions.<\/p>\n<p class=\"wp-block-paragraph\">Suffice it to say, OpenEuroLLM is facing these same quandaries, and despite its intentions to be \u201ctruly open,\u201d it will probably have to make some compromises if it\u2019s to fulfill its \u201cquality\u201d obligations.<\/p>\n<p class=\"wp-block-paragraph\">\u201cThe goal is to have everything open. Now, of course, there are some limitations,\u201d Haji\u010d said. 
\u201cWe want to have models of the highest quality possible, and based on the European copyright directive we can use anything we can get our hands on. Some of it cannot be redistributed, but some of it can be stored for future inspection.\u201d<\/p>\n<p class=\"wp-block-paragraph\">What this means is that the OpenEuroLLM project might have to keep some of the training data under wraps, but be made available to auditors upon request \u2014 as required for high-risk AI systems under the terms of the EU AI Act.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe hope that most of the data [will be open], especially the data coming from the Common Crawl,\u201d Haji\u010d said. \u201cWe would like to have it all completely open, but we will see. In any case, we will have to comply with AI regulations.\u201d<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-two-for-one\"><span class=\"ez-toc-section\" id=\"Two_for_one\"><\/span>Two for one<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Another criticism that emerged in the aftermath of OpenEuroLLM\u2019s formal unveiling was that a very similar project launched in Europe just a few short months previous. 
<a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/sites.google.com\/view\/eurollm\/home\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">EuroLLM<\/a>, which launched its first model in <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/huggingface.co\/utter-project\/EuroLLM-1.7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">September<\/a> and a follow-up in <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/huggingface.co\/utter-project\/EuroLLM-9B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">December<\/a>, is <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/interoperable-europe.ec.europa.eu\/collection\/open-source-observatory-osor\/news\/eurollm-pioneering-european-open-source-ai\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">co-funded by the EU<\/a> alongside a consortium of nine partners. These include academic institutions such as the University of Edinburgh and corporations such as Unbabel, which last year won millions of GPU training hours on EU supercomputers.<\/p>\n<p class=\"wp-block-paragraph\">EuroLLM shares similar goals to its near-namesake: \u201cTo build an open source European Large Language Model that supports 24 Official European Languages, and a few other strategically important languages.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Andre Martins, head of research at Unbabel, <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.linkedin.com\/posts\/andre-martins-31476745_ai-artificialintelligence-openlanguagemodels-activity-7292215869395394560-Msbl?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAKviRMBcX5r1q4mPiPDFxquTiZC0hbfIYQ\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">took to social media<\/a> to <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/x.com\/andre_t_martins\/status\/1886541654653280579\">highlight these similarities<\/a>, noting that OpenEuroLLM is appropriating a name that already exists. 
\u201cI hope the different communities collaborate openly, share their expertise, and don\u2019t decide to reinvent the wheel every time a new project gets funded,\u201d Martins wrote.<\/p>\n<p class=\"wp-block-paragraph\">Haji\u010d called the situation \u201cunfortunate,\u201d adding that he hoped they might be able to cooperate, though he stressed that due to the source of its funding in the EU, OpenEuroLLM is restricted in terms of its collaborations with non-EU entities, including U.K. universities. <\/p>\n<h2 class=\"wp-block-heading\" id=\"h-funding-gap\"><span class=\"ez-toc-section\" id=\"Funding_gap\"><\/span>Funding gap<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The arrival of China\u2019s DeepSeek, and the cost-to-performance ratio it promises, has given some encouragement that AI initiatives might be able to do far more with much less than initially thought. However, over the past few weeks, many have <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/semianalysis.com\/2025\/01\/31\/deepseek-debates\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">questioned the true costs<\/a> involved in building DeepSeek.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWith respect to DeepSeek, we actually know very little about what exactly went into building it,\u201d Peter Sarlin, who is technical co-lead on the OpenEuroLLM project, told TechCrunch.<\/p>\n<p class=\"wp-block-paragraph\">Regardless, Sarlin reckons OpenEuroLLM will have access to sufficient funding, as it\u2019s mostly to cover people. Indeed, a large chunk of the costs of building AI systems is compute, and that should mostly be covered through its partnership with the EuroHPC centers.<\/p>\n<p class=\"wp-block-paragraph\">\u201cYou could say that OpenEuroLLM actually has quite a significant budget,\u201d Sarlin said. 
\u201cEuroHPC has invested billions in AI and compute infrastructure, and have committed billions more into expanding that in the coming few years.\u201d<\/p>\n<p class=\"wp-block-paragraph\">It\u2019s also worth noting that the OpenEuroLLM project isn\u2019t building toward a consumer- or enterprise-grade product. It\u2019s purely about the models, and this is why Sarlin reckons the budget it has should be ample. <\/p>\n<p class=\"wp-block-paragraph\">\u201cThe intent here isn\u2019t to build a chatbot or an AI assistant \u2014 that would be a product initiative requiring a lot of effort, and that\u2019s what ChatGPT did so well,\u201d Sarlin said. \u201cWhat we\u2019re contributing is an open source foundation model that functions as the AI infrastructure for companies in Europe to build upon. We know what it takes to build models, it\u2019s not something you need billions for.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Since 2017, Sarlin has spearheaded AI lab Silo AI, which launched \u2014 in partnership with others, including the HPLT project \u2014 the family of <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.silo.ai\/blog\/poro-a-family-of-open-models-that-bring-european-languages-to-the-frontier\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Poro<\/a> and <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.silo.ai\/blog\/viking-13b-scaling-nordic-ai-models-using-an-open-source-training-framework-for-lumi\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Viking open models<\/a>. 
These already support a handful of European languages, but the company is now readying the next iteration \u201cEuropa\u201d models, which will cover all European languages.<\/p>\n<p class=\"wp-block-paragraph\">And this ties in with the whole \u201cnot starting from scratch\u201d notion espoused by Haji\u010d \u2014 there is already a bedrock of expertise and technology in place.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-sovereign-state\"><span class=\"ez-toc-section\" id=\"Sovereign_state\"><\/span>Sovereign state<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">As critics have noted, OpenEuroLLM does have a lot of moving parts \u2014 which Haji\u010d acknowledges, albeit with a positive outlook.<\/p>\n<p class=\"wp-block-paragraph\">\u201cI\u2019ve been involved in many collaborative projects, and I believe it has its advantages versus a single company,\u201d he said. \u201cOf course they\u2019ve done great things at the likes of OpenAI to Mistral, but I hope that the combination of academic expertise and the companies\u2019 focus could bring something new.\u201d<\/p>\n<p class=\"wp-block-paragraph\">And in many ways, it\u2019s not about trying to outmaneuver Big Tech or billion-dollar AI startups; the ultimate goal is digital sovereignty: (mostly) open foundation LLMs built by, and for, Europe.<\/p>\n<p class=\"wp-block-paragraph\">\u201cI hope this won\u2019t be the case, but if, in the end, we are not the number one model, and we have a \u2018good\u2019 model, then we will still have a model with all the components based in Europe,\u201d Haji\u010d said. \u201cThis will be a positive result.\u201d<\/p>\n<\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/02\/16\/open-source-llms-hit-europes-digital-sovereignty-roadmap\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Large language models (LLMs) landed on Europe\u2019s digital sovereignty agenda with a bang last week, as news emerged of a new program to develop a series of \u201ctruly\u201d open source LLMs covering all European Union languages. 
This includes the current 24 official EU languages, as well as languages for countries currently negotiating for entry to&#8230;<\/p>\n","protected":false},"author":1,"featured_media":653732,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/02\/GettyImages-1138358728-e1739188254812.jpg?resize=1200,798","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,151537,154395,151454],"class_list":["post-653731","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-government-policy","tag-openeurollm","tag-tc"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/653731","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=653731"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/653731\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/653732"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=653731"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=653731"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=653731"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}