{"id":666558,"date":"2025-05-02T14:48:47","date_gmt":"2025-05-02T11:48:47","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/ai2s-new-small-ai-model-outperforms-similarly-sized-models-from-google-meta\/"},"modified":"2025-05-02T14:48:47","modified_gmt":"2025-05-02T11:48:47","slug":"ai2s-new-small-ai-model-outperforms-similarly-sized-models-from-google-meta","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/ai2s-new-small-ai-model-outperforms-similarly-sized-models-from-google-meta\/","title":{"rendered":"Ai2&#8217;s new small AI model outperforms similarly-sized models from Google, Meta"},"content":{"rendered":"<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">\u2018Tis the week for small AI models, it seems. <\/p>\n<p class=\"wp-block-paragraph\">Nonprofit AI research institute Ai2 on Thursday <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/huggingface.co\/allenai\/OLMo-2-0425-1B\">released<\/a> Olmo 2 1B, a 1-billion-parameter model that Ai2 claims beats similarly-sized models from Google, Meta and Alibaba on several benchmarks. Parameters, sometimes referred to as weights, are the internal components of a model that guide its behavior.<\/p>\n<p class=\"wp-block-paragraph\">Olmo 2 1B is available under a permissive Apache 2.0 license on AI dev platform Hugging Face. Unlike most models, Olmo 2 1B can be replicated from scratch, as Ai2 has provided the code and data sets (<a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/huggingface.co\/datasets\/allenai\/olmo-mix-1124\">Olmo-mix-1124<\/a> and <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/huggingface.co\/datasets\/allenai\/dolmino-mix-1124\">Dolmino-mix-1124<\/a>) used to develop it.<\/p>\n<p class=\"wp-block-paragraph\">Small models might not be as capable as their behemoth counterparts, but importantly, they don\u2019t require beefy hardware to run. That makes them much more accessible for developers and hobbyists contending with the limitations of lower-end hardware and consumer machines.<\/p>\n<p class=\"wp-block-paragraph\">There\u2019s been a raft of small model launches over the past few days, from Microsoft\u2019s Phi 4 reasoning family to <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/venturebeat.com\/ai\/qwen-swings-for-a-double-with-2-5-omni-3b-model-that-runs-on-consumer-pcs-laptops\/\">Qwen\u2019s 2.5 Omni 3B<\/a>. Most of these, including Olmo 2 1B, can easily run on a modern laptop or even a mobile device.<\/p>\n<p class=\"wp-block-paragraph\">Ai2 says Olmo 2 1B was trained on a data set of 4 trillion tokens from publicly available, AI-generated, and manually created sources. Tokens are the raw bits of data that models ingest and generate, with a million tokens equivalent to about 750,000 words.<\/p>\n<p class=\"wp-block-paragraph\">On a benchmark measuring arithmetic reasoning, GSM8K, Olmo 2 1B scores better than Google\u2019s Gemma 3 1B, Meta\u2019s Llama 3.2 1B, and Alibaba\u2019s Qwen 2.5 1.5B. Olmo 2 1B also eclipses the performance of those three models on TruthfulQA, a test for evaluating factual accuracy.  <\/p>\n<div class=\"wp-block-techcrunch-inline-cta\">\n<div class=\"inline-cta__wrapper\">\n<p>Techcrunch event<\/p>\n<div class=\"inline-cta__content\">\n<p>\n\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__location\">Berkeley, CA<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__separator\">|<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__date\">June 5<\/span>\n\t\t\t\t\t\t\t<\/p>\n<p>\t\t\t\t\t<span>BOOK NOW<\/span><\/p><\/div>\n<\/p><\/div>\n<\/div>\n<figure class=\"wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter\">\n<div class=\"wp-block-embed__wrapper\">\n<blockquote class=\"twitter-tweet\" data-width=\"500\" data-dnt=\"true\">\n<p lang=\"en\" dir=\"ltr\">This model was pretrained on 4T tokens of high-quality data, following the same standard pretraining into high-quality annealing of our 7, 13, &amp; 32B models. We upload inter<a href=\"https:\/\/buradabiliyorum.com\/en\/category\/social-mediaa\/\" data-internallinksmanager029f6b8e52c=\"1\" title=\"Social Media\" target=\"_blank\" rel=\"noopener\">media<\/a>te checkpoints from every 1000 steps in training.<\/p>\n<p>Access the base model: <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/t.co\/xofyWJmo85\">https:\/\/t.co\/xofyWJmo85<\/a> <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/t.co\/7uSJ6sYMdL\">pic.twitter.com\/7uSJ6sYMdL<\/a><\/p>\n<p>\u2014 Ai2 (@allen_ai) <a rel=\"nofollow\" target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/twitter.com\/allen_ai\/status\/1917927467056058665?ref_src=twsrc%5Etfw\">May 1, 2025<\/a><\/p><\/blockquote>\n<\/div>\n<\/figure>\n<p class=\"wp-block-paragraph\">Ai2 has warned that Olmo 2 1B carries risks, however. Like all AI models, it can produce \u201cproblematic outputs,\u201d including harmful and \u201csensitive\u201d content, the organization said, as well as factually inaccurate statements. For these reasons, Ai2 recommends against deploying Olmo 2 1B in commercial settings.<\/p>\n<\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<blockquote><p><strong><span style=\"color: #ff6600;\">If you liked the article, do not forget to share it with your friends. Follow us on\u00a0<span style=\"color: #ff0000;\"><a style=\"color: #ff0000;\" href=\"https:\/\/news.google.com\/publications\/CAAqBwgKMN63nwsw68G3Aw\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Google News<\/a><\/span>\u00a0too, click on the star and choose us from your favorites.<\/span><\/strong><\/p><\/blockquote>\n<blockquote>\n<p style=\"text-align: center;\"><strong>If you want to read more like this article, you can visit our <span style=\"color: #ff9900;\"><a style=\"color: #ff9900;\" href=\"https:\/\/en.buradabiliyorum.com\/category\/technology\/\" target=\"_blank\" >Technology<\/a><\/span> category.<\/strong><\/p>\n<\/blockquote>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2025\/05\/01\/ai2s-new-small-ai-model-outperforms-similarly-sized-models-from-google-meta\/\" target=\"_blank\" >Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u2018Tis the week for small AI models, it seems. Nonprofit AI research institute Ai2 on Thursday released Olmo 2 1B, a 1-billion-parameter model that Ai2 claims beats similarly-sized models from Google, Meta and Alibaba on several benchmarks. Parameters, sometimes referred to as weights, are the internal components of a model that guide its behavior. Olmo&#8230;<\/p>\n","protected":false},"author":1,"featured_media":666559,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/10\/GettyImages-1335295270.jpg?resize=1200,675","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,83071,156047],"class_list":["post-666558","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-open-source","tag-ai2"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/666558","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=666558"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/666558\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/666559"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=666558"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=666558"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=666558"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}