{"id":635767,"date":"2024-09-10T21:50:05","date_gmt":"2024-09-10T18:50:05","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/"},"modified":"2024-09-10T21:50:05","modified_gmt":"2024-09-10T18:50:05","slug":"google-gemini-everything-you-need-to-know-about-the-generative-ai-models","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/","title":{"rendered":"#Google Gemini: Everything you need to know about the generative AI models"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_84 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a2554b115ab7\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a2554b115ab7\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list 
ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#What_is_Gemini\" >What is Gemini?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Whats_the_difference_between_the_Gemini_apps_and_Gemini_models\" >What\u2019s the difference between the Gemini apps and Gemini models?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_Advanced\" >Gemini Advanced<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_extensions_and_Gems\" >Gemini extensions and Gems<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_Live_in-depth_voice_chats\" >Gemini Live in-depth voice chats<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-6\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Image_generation_via_Imagen_3\" >Image generation via Imagen 3<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-7\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_for_teens\" >Gemini for teens<\/a><\/li><li 
class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-8\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_in_smart_home_devices\" >Gemini in smart home devices<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-9\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#What_can_the_Gemini_models_do\" >What can the Gemini models do?<\/a><ul class='ez-toc-list-level-3' ><li class='ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-10\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#What_you_can_do_with_Gemini_Ultra\" >What you can do with Gemini Ultra<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-11\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_Pros_capabilities\" >Gemini Pro\u2019s capabilities<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-12\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_Flash_is_for_less_demanding_work\" >Gemini Flash is for less demanding work<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-13\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Gemini_Nano_can_run_on_your_phone\" >Gemini Nano can run on your phone<\/a><\/li><\/ul><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-14\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#How_much_do_the_Gemini_models_cost\" >How much 
do the Gemini models cost?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-15\" href=\"https:\/\/buradabiliyorum.com\/en\/google-gemini-everything-you-need-to-know-about-the-generative-ai-models\/#Is_Gemini_coming_to_the_iPhone\" >Is Gemini coming to the iPhone?<\/a><\/li><\/ul><\/nav><\/div>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Google\u2019s trying to make waves with Gemini, its flagship suite of generative AI models, apps and services. But what\u2019s Gemini? How can you use it? And how does it\u00a0stack up against other generative AI tools such as OpenAI\u2019s ChatGPT, Meta\u2019s Llama and Microsoft\u2019s Copilot?<\/p>\n<p class=\"wp-block-paragraph\">To make it easier to keep up with the latest Gemini developments, we\u2019ve put together this handy guide, which we\u2019ll keep updated as new Gemini models, features and news about Google\u2019s plans for Gemini are released.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-what-is-gemini\"><span class=\"ez-toc-section\" id=\"What_is_Gemini\"><\/span>What is Gemini?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\" id=\"speakable-summary\">Gemini is Google\u2019s\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.wired.com\/story\/google-deepmind-demis-hassabis-chatgpt\/\">long-promised<\/a>, next-gen generative AI model family. 
Developed by Google\u2019s AI research labs DeepMind and Google Research, it comes in four flavors:<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Gemini Ultra<\/strong><\/li>\n<li class=\"wp-block-list-item\"><strong>Gemini Pro<\/strong><\/li>\n<li class=\"wp-block-list-item\"><strong>Gemini Flash<\/strong>, a speedier, \u201cdistilled\u201d version of Pro<\/li>\n<li class=\"wp-block-list-item\"><strong>Gemini Nano<\/strong>, two small models:\u00a0<strong>Nano-1<\/strong>\u00a0and the slightly more capable\u00a0<strong>Nano-2<\/strong>, which is meant to run offline<\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\">All Gemini models were trained to be natively multimodal \u2014 in other words, able to work with and analyze more than just text. Google says they were pre-trained and fine-tuned on a variety of public, proprietary and licensed audio, images and videos, a set of codebases and text in different languages.<\/p>\n<p class=\"wp-block-paragraph\">This sets Gemini apart from models such as\u00a0Google\u2019s own LaMDA, which was trained exclusively on text data. LaMDA can\u2019t understand or generate anything beyond text (like essays, emails and so on), but that isn\u2019t necessarily the case with Gemini models.<\/p>\n<p class=\"wp-block-paragraph\">We\u2019ll note here that the\u00a0ethics and legality\u00a0of training models on public data, in some cases without the data owners\u2019 knowledge or consent, are murky indeed. Google has an\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/cloud.google.com\/blog\/products\/ai-machine-learning\/protecting-customers-with-generative-ai-indemnification\">AI indemnification policy<\/a>\u00a0to shield certain Google Cloud customers from lawsuits should they face them, but this policy contains carve-outs. 
Proceed with caution \u2014 particularly if you\u2019re intending on using Gemini commercially.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-what-s-the-difference-between-the-gemini-apps-and-gemini-models\"><span class=\"ez-toc-section\" id=\"Whats_the_difference_between_the_Gemini_apps_and_Gemini_models\"><\/span>What\u2019s the difference between the Gemini apps and Gemini models?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Gemini is separate and distinct from the Gemini apps on the web and mobile (formerly Bard).<\/p>\n<p class=\"wp-block-paragraph\">The Gemini apps are clients that connect to various Gemini models and layer a chatbot-like interface on top. Think of them as front ends for Google\u2019s generative AI, analogous to\u00a0ChatGPT\u00a0and Anthropic\u2019s\u00a0Claude family of apps.<\/p>\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1200\" height=\"800\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?w=680\" alt=\"Google Gemini mobile app\" class=\"wp-image-2796832\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=150,100 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=300,200 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=768,512 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=680,453 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=430,287 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=720,480 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=900,600 900w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-mobile-app-google.jpg?resize=800,533 800w\" sizes=\"auto, (max-width: 1200px) 100vw, 1200px\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Gemini on the web lives\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/gemini.google.com\/\">here<\/a>. On Android, the\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/play.google.com\/store\/apps\/details?id=com.google.android.apps.bard&amp;hl=en_US\">Gemini app<\/a>\u00a0replaces the existing Google Assistant app. And on iOS, the\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/support.google.com\/gemini\/answer\/14554984?hl=en&amp;co=GENIE.Platform%3DiOS\">Google and Google Search apps<\/a>\u00a0serve as that platform\u2019s Gemini clients.<\/p>\n<p class=\"wp-block-paragraph\">On Android, it also recently became possible to bring up the Gemini overlay on top of any app to ask questions about what\u2019s on the screen (e.g., a YouTube video). Just press and hold a supported smartphone\u2019s power button or say, \u201cHey Google\u201d; you\u2019ll see the overlay pop up.<\/p>\n<p class=\"wp-block-paragraph\">Gemini apps can accept images as well as voice commands and text \u2014 including files like PDFs and soon videos, either uploaded or imported from Google Drive \u2014 and generate images. 
As you\u2019d expect, conversations with Gemini apps on mobile carry over to Gemini on the web and vice versa if you\u2019re signed in to the same Google Account in both places.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-gemini-advanced\"><span class=\"ez-toc-section\" id=\"Gemini_Advanced\"><\/span>Gemini Advanced<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">The Gemini apps aren\u2019t the only means of recruiting Gemini models\u2019 assistance with tasks. Slowly but surely, Gemini-imbued features are\u00a0making their way\u00a0into staple Google apps and services like Gmail and Google Docs.<\/p>\n<p class=\"wp-block-paragraph\">To take advantage of most of these, you\u2019ll need the Google One AI Premium Plan. Technically a part of\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/one.google.com\/about\">Google One<\/a>, the AI Premium Plan costs $20 per month and provides access to Gemini in Google Workspace apps like Docs, Slides, Sheets and Meet. It also enables what Google calls Gemini Advanced, which brings the company\u2019s more sophisticated Gemini models to the Gemini apps.<\/p>\n<p class=\"wp-block-paragraph\">Gemini Advanced users get extras here and there, too, like priority access to new features, the ability to run and edit Python code directly in Gemini, and a larger \u201ccontext window.\u201d Gemini Advanced can remember the content of \u2014 and reason across \u2014 roughly 750,000 words in a conversation (or 1,500 pages of documents). 
That\u2019s compared to the 24,000 words (or 48 pages) the vanilla Gemini app can handle.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1868\" height=\"884\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?w=680\" alt=\"Screenshot of a Google Gemini commercial\" class=\"wp-image-2816021\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg 1868w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=150,71 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=300,142 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=768,363 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=680,322 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=1200,568 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=1280,606 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=430,203 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=720,341 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=900,426 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=800,379 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/07\/Screenshot-2024-07-28-at-2.53.42\u202fPM.jpg?resize=1536,727 1536w\" sizes=\"auto, (max-width: 1868px) 100vw, 1868px\"\/><figcaption class=\"wp-element-caption\"><strong>Image 
Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Another Gemini Advanced exclusive is trip planning in Google Search, which creates custom travel itineraries from prompts. Taking into account things like flight times (from emails in a user\u2019s Gmail inbox), meal preferences and information about local attractions (from Google Search and Maps data), as well as the distances between those attractions, Gemini will generate an itinerary that updates automatically to reflect any changes.<\/p>\n<p class=\"wp-block-paragraph\">Gemini across Google services is also available to corporate customers through two plans, Gemini Business (an add-on for Google Workspace) and Gemini Enterprise. Gemini Business costs as little as $20 per user per month, and Gemini Enterprise \u2014 which adds meeting note-taking and translated captions as well as document classification and labeling \u2014 is priced at $30 and up per user per month. (Both plans require an annual commitment.)<\/p>\n<p class=\"wp-block-paragraph\">In Gmail, Gemini lives in a side panel that can write emails and summarize message threads. You\u2019ll find the same panel in Docs, where it helps you write and refine your content and brainstorm new ideas. Gemini in Slides generates slides and custom images. And Gemini in Google Sheets tracks and organizes data, creating tables and formulas.<\/p>\n<p class=\"wp-block-paragraph\">Gemini\u2019s reach extends to Drive, as well, where it can summarize files and give quick facts about a project. 
In Meet, meanwhile, Gemini translates captions into additional languages.<\/p>\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"1600\" height=\"1190\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?w=680\" alt=\"Gemini in Gmail\" class=\"wp-image-2800519\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png 1600w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=150,112 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=300,223 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=768,571 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=680,506 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=1200,893 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=1280,952 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=430,320 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=720,536 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=900,669 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=800,595 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/gemini-gmail.png?resize=1536,1142 1536w\" sizes=\"auto, (max-width: 1600px) 100vw, 1600px\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Gemini recently came to Google\u2019s Chrome browser\u00a0in the form of an AI writing tool. 
You can use it to write something completely new or rewrite existing text; Google says it\u2019ll consider the webpage you\u2019re on to make recommendations.<\/p>\n<p class=\"wp-block-paragraph\">Elsewhere, you\u2019ll find hints of Gemini in Google\u2019s\u00a0database products,\u00a0cloud security tools,\u00a0app development platforms\u00a0(including\u00a0Firebase\u00a0and\u00a0Project IDX), as well as apps like\u00a0Google Photos\u00a0(where Gemini handles natural language search queries), YouTube (where it helps brainstorm video ideas) and the\u00a0NotebookLM note-taking assistant.<\/p>\n<p class=\"wp-block-paragraph\">Code Assist\u00a0(formerly\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/cloud.google.com\/duet-ai?hl=en\">Duet AI for Developers<\/a>), Google\u2019s suite of AI-powered assistance tools for code completion and generation, is offloading heavy computational lifting to Gemini. So are Google\u2019s\u00a0security products, like\u00a0Gemini in Threat Intelligence, which can analyze large portions of potentially malicious code and let users perform natural language searches for ongoing threats or indicators of compromise.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-gemini-extensions-and-gems\"><span class=\"ez-toc-section\" id=\"Gemini_extensions_and_Gems\"><\/span>Gemini extensions and Gems<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Announced at Google I\/O 2024, Gems are custom chatbots powered by Gemini models, and Gemini Advanced users can create them. Gems can be generated from natural language descriptions \u2014 for example, \u201cYou\u2019re my running coach. Give me a daily running plan\u201d \u2014 and shared with others or kept private.<\/p>\n<p class=\"wp-block-paragraph\">Gems are available on desktop and mobile in 150 countries and most languages. 
Eventually, they\u2019ll be able to tap an expanded set of integrations with Google services, including Google Calendar, Tasks, Keep and YouTube Music, to complete custom tasks.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"450\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/ezgif-3-2a05707c0a.gif?w=680\" alt=\"Gemini Gems\" class=\"wp-image-2844369\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Speaking of integrations, the Gemini apps on the web and mobile can tap into Google services via what Google calls \u201cGemini extensions.\u201d Gemini today integrates with Google Drive, Gmail and YouTube to respond to queries such as \u201cCould you summarize my last three emails?\u201d Later this year, Gemini will be able to take additional actions with Google Calendar, Keep, Tasks, YouTube Music and Utilities, the Android-exclusive apps that control on-device features like timers and alarms, media controls, the flashlight, volume, Wi-Fi, Bluetooth and so on.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-gemini-live-in-depth-voice-chats\"><span class=\"ez-toc-section\" id=\"Gemini_Live_in-depth_voice_chats\"><\/span>Gemini Live in-depth voice chats<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">A\u00a0new experience called Gemini Live, exclusive to Gemini Advanced subscribers, allows users to have \u201cin-depth\u201d voice chats with Gemini. It\u2019s available in the Gemini apps on mobile and the Pixel Buds Pro 2, where it can be accessed even when your phone\u2019s locked.<\/p>\n<p class=\"wp-block-paragraph\">With Gemini Live enabled, you can interrupt Gemini while the chatbot\u2019s speaking (in one of several new voices) to ask a clarifying question, and it\u2019ll adapt to your speech patterns in real time. 
And sometime later this year, Gemini will be able to see and respond to your surroundings, via either photos or video captured by your smartphone\u2019s camera.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"800\" height=\"450\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/ezgif-7-612379e706.gif?w=680\" alt=\"Gemini Live\" class=\"wp-image-2835064\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Live is also designed to serve as a virtual coach of sorts, helping you rehearse for events, brainstorm ideas and so on. For instance, Live can suggest which skills to highlight in an upcoming job or internship interview, and it can give public speaking advice.<\/p>\n<p class=\"wp-block-paragraph\">In our review of Gemini Live, we concluded that the feature has a ways to go before it\u2019s super useful, though it\u2019s early days, admittedly.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-image-generation-via-imagen-3\"><span class=\"ez-toc-section\" id=\"Image_generation_via_Imagen_3\"><\/span>Image generation via Imagen 3<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Gemini users can generate artwork and images using Google\u2019s built-in Imagen 3 model.<\/p>\n<p class=\"wp-block-paragraph\">Google says that Imagen 3 understands the text prompts that it translates into images more accurately than its predecessor,\u00a0Imagen 2, and is more \u201ccreative and detailed\u201d in its generations. 
In addition, the model produces fewer artifacts and visual errors (at least according to Google), and is the best Imagen model yet for rendering text.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?w=680\" alt=\"Google Imagen 3\" class=\"wp-image-2844306\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png 1024w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=150,150 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=300,300 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=768,768 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=680,680 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=430,430 430w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=720,720 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=900,900 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/An-animated-image-of-a-tiny-dragon-hatching-from-an-egg-in-a-sunlit-meadow-surrounded-by-curious-glowing-butterflies.-Vibrant-colors-detailed-scales.png?resize=800,800 800w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\"\/><figcaption class=\"wp-element-caption\">A sample from Imagen 3.<\/figcaption><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Back in February, Google\u00a0was forced to pause\u00a0Gemini\u2019s ability to generate images of people after users complained of\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.theguardian.com\/technology\/2024\/feb\/28\/google-chief-ai-tools-photo-diversity-offended-users\">historical<\/a>\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/www.theverge.com\/2024\/2\/21\/24079371\/google-ai-gemini-generative-inaccurate-historical\">inaccuracies<\/a>. But in August, the company reintroduced people generation for certain users, specifically English-language users signed up for one of Google\u2019s paid Gemini plans (e.g. 
Gemini Advanced) as part of a pilot program.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-gemini-for-teens\"><span class=\"ez-toc-section\" id=\"Gemini_for_teens\"><\/span>Gemini for teens<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">In June, Google introduced a teen-focused Gemini experience, allowing students to sign up via their Google Workspace for Education school accounts.<\/p>\n<p class=\"wp-block-paragraph\">The teen-focused Gemini has \u201cadditional policies and safeguards,\u201d including a tailored onboarding process and an \u201cAI literacy guide\u201d to (as Google phrases it) \u201chelp teens use AI responsibly.\u201d Otherwise, it\u2019s nearly identical to the standard Gemini experience, down to the \u201cdouble check\u201d feature that looks across the web to see if Gemini\u2019s responses are accurate.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-gemini-in-smart-home-devices\"><span class=\"ez-toc-section\" id=\"Gemini_in_smart_home_devices\"><\/span>Gemini in smart home devices<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">A growing number of Google-made devices tap Gemini for enhanced functionality, from the Google TV Streamer to the Pixel 9 and 9 Pro to the newest Nest Learning Thermostat.<\/p>\n<p class=\"wp-block-paragraph\">On the Google TV Streamer, Gemini uses your preferences to curate content suggestions across your subscriptions and summarize reviews and even whole seasons of TV.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"8256\" height=\"5504\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?w=680\" alt=\"Google TV Streamer set up\" class=\"wp-image-2820329\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg 8256w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=150,100 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=300,200 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=768,512 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=680,453 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=1200,800 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=1280,853 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=430,287 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=720,480 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=900,600 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=800,533 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=1536,1024 1536w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/08\/Google-TV-Streamer-set-up.jpg?resize=2048,1365 2048w\" sizes=\"auto, (max-width: 8256px) 100vw, 8256px\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">On the latest Nest thermostat (as well as Nest speakers, cameras and smart displays) Gemini will soon bolster Google Assistant\u2019s conversational and analytic capabilities.<\/p>\n<p class=\"wp-block-paragraph\">Subscribers to Google\u2019s Nest Aware plan later this year will get a preview of new Gemini-powered experiences like AI descriptions for Nest camera footage, natural language video search and recommended automations. 
Nest cameras will understand what\u2019s happening in real-time video feeds (e.g. when a dog\u2019s digging in the garden), while the companion Google Home app will surface videos and create device automations given a description (e.g. \u201cDid the kids leave their bikes in the driveway?,\u201d \u201cHave my Nest thermostat turn on the heating when I get home from work every Tuesday\u201d). <\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"2880\" height=\"1600\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?w=680\" alt=\"Google Gemini in smart home\" class=\"wp-image-2851851\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png 2880w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=150,83 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=300,167 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=768,427 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=680,378 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=1200,667 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=1280,711 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=430,239 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=720,400 720w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=900,500 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=800,444 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=1536,853 1536w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/09\/Screenshot_2024-09-09_at_8.41.01a_\u00afPM-transformed.png?resize=2048,1138 2048w\" sizes=\"auto, (max-width: 2880px) 100vw, 2880px\"\/><figcaption class=\"wp-element-caption\">Gemini will soon be able to summarize security camera footage from Nest devices.<\/figcaption><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> Google<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Also later this year, Google Assistant will get a few upgrades on Nest-branded and other smart home devices to make conversations feel more natural. Improved voices are on the way, in addition to the ability to ask follow-up questions and \u201c[more] easily go back and forth.\u201d<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-what-can-the-gemini-models-do\"><span class=\"ez-toc-section\" id=\"What_can_the_Gemini_models_do\"><\/span>What can the Gemini models do?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Because Gemini models are multimodal, they can perform a range of multimodal tasks, from transcribing speech to captioning images and videos in real time. Many of these capabilities have reached the product stage (as alluded to in the previous section), and Google is promising much more in the not-too-distant future.<\/p>\n<p class=\"wp-block-paragraph\">Of course, it\u2019s a bit hard to take the company at its word. Google\u00a0seriously underdelivered\u00a0with the original Bard launch. 
More recently, it ruffled feathers\u00a0with a video purporting to show Gemini\u2019s capabilities\u00a0that was more or less aspirational \u2014 not live.<\/p>\n<p class=\"wp-block-paragraph\">Also, Google offers no fix for some of the\u00a0underlying problems\u00a0with generative AI tech today, like its\u00a0encoded\u00a0biases\u00a0and tendency to make things up (i.e.\u00a0hallucinate). Neither do its rivals, but it\u2019s something to keep in mind when considering using or paying for Gemini.<\/p>\n<p class=\"wp-block-paragraph\">Assuming for the purposes of this article that Google is being truthful with its recent claims, here\u2019s what the different tiers of Gemini can do now and what they\u2019ll be able to do once they reach their full potential:<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-what-you-can-do-with-gemini-ultra\"><span class=\"ez-toc-section\" id=\"What_you_can_do_with_Gemini_Ultra\"><\/span>What you can do with Gemini Ultra<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p class=\"wp-block-paragraph\">Google says that\u00a0Gemini Ultra\u00a0\u2014 thanks to its multimodality \u2014 can be used to help with things like physics homework, solving problems step-by-step on a worksheet and pointing out possible mistakes in already filled-in answers.<\/p>\n<p class=\"wp-block-paragraph\">Ultra can also be applied to tasks such as identifying scientific papers relevant to a problem, Google says. The model can extract information from several papers, for instance, and update a chart from one by generating the formulas necessary to re-create the chart with more timely data.<\/p>\n<p class=\"wp-block-paragraph\">Gemini Ultra technically supports image generation. But that capability hasn\u2019t made its way into the productized version of the model yet \u2014 perhaps because the mechanism is more complex than how apps such as ChatGPT generate images. 
Rather than feed prompts to an image generator (like\u00a0DALL-E 3, in ChatGPT\u2019s case), Gemini outputs images \u201cnatively,\u201d without an intermediary step.<\/p>\n<p class=\"wp-block-paragraph\">Ultra is available as an API through Vertex AI, Google\u2019s fully managed AI dev platform, and AI Studio, Google\u2019s web-based tool for app and platform developers.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-gemini-pro-s-capabilities\"><span class=\"ez-toc-section\" id=\"Gemini_Pros_capabilities\"><\/span>Gemini Pro\u2019s capabilities<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p class=\"wp-block-paragraph\">Google says that Gemini Pro is an improvement over LaMDA in its reasoning, planning and understanding capabilities. The latest version,\u00a0Gemini 1.5 Pro \u2014 which powers the Gemini apps for Gemini Advanced subscribers \u2014 exceeds even Ultra\u2019s performance in some areas.<\/p>\n<p class=\"wp-block-paragraph\">Gemini 1.5 Pro is improved in a number of areas\u00a0compared with its predecessor, Gemini 1.0 Pro, perhaps most obviously in the amount of data that it can process. Gemini 1.5 Pro can take in up to 1.4 million words, two hours of video or 22 hours of audio, and reason across or answer questions about that data (more or less).<\/p>\n<p class=\"wp-block-paragraph\">Gemini 1.5 Pro became generally available on Vertex AI and AI Studio in June alongside a feature called code execution, which aims to reduce bugs in code that the model generates by iteratively refining that code over several steps. (Code execution also supports Gemini Flash.)<\/p>\n<p class=\"wp-block-paragraph\">Within Vertex AI, developers can customize Gemini Pro to specific contexts and use cases via a fine-tuning or \u201cgrounding\u201d process. 
For example, Pro (along with other Gemini models) can be instructed to use data from third-party providers like Moody\u2019s, Thomson Reuters, ZoomInfo and MSCI, or source information from corporate data sets or Google Search instead of its wider knowledge bank. Gemini Pro can also be connected to external, third-party APIs to perform particular actions, like automating a back-office workflow.<\/p>\n<p class=\"wp-block-paragraph\">AI Studio offers templates for creating structured chat prompts with Pro. Developers can control the model\u2019s creative range and provide examples to give tone and style instructions \u2014 and also tune Pro\u2019s safety settings.<\/p>\n<p class=\"wp-block-paragraph\">Vertex AI Agent Builder\u00a0lets people build Gemini-powered \u201cagents\u201d within Vertex AI. For example, a company could create an agent that analyzes previous marketing campaigns to understand a brand style, and then applies that knowledge to help generate new ideas consistent with the style.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-gemini-flash-is-for-less-demanding-work\"><span class=\"ez-toc-section\" id=\"Gemini_Flash_is_for_less_demanding_work\"><\/span>Gemini Flash is for less demanding work<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p class=\"wp-block-paragraph\">For less demanding applications, there\u2019s Gemini Flash. The newest version is 1.5 Flash; Gemini app users <em>not<\/em> subscribed to Gemini Advanced get access to this version.<\/p>\n<p class=\"wp-block-paragraph\">An offshoot of Gemini Pro that\u2019s small and efficient, built for narrow, high-frequency generative AI workloads, Flash is multimodal like Gemini Pro, meaning it can analyze audio, video and images as well as text (but only generate text). 
Google says that Flash is particularly well suited for tasks like summarization and chat apps, plus image and video captioning and data extraction from long documents and tables.<\/p>\n<p class=\"wp-block-paragraph\">Devs using Flash and Pro can optionally leverage context caching, which lets them store large amounts of information (say, a knowledge base or database of research papers) in a cache that Gemini models can access quickly and relatively cheaply. Context caching incurs an additional fee on top of other Gemini model usage fees, however.<\/p>\n<h3 class=\"wp-block-heading\" id=\"h-gemini-nano-can-run-on-your-phone\"><span class=\"ez-toc-section\" id=\"Gemini_Nano_can_run_on_your_phone\"><\/span>Gemini Nano can run on your phone<span class=\"ez-toc-section-end\"><\/span><\/h3>\n<p class=\"wp-block-paragraph\">Gemini Nano is a much smaller version of the Gemini Pro and Ultra models, and it\u2019s efficient enough to run directly on (some) devices instead of sending the task to a server somewhere. So far, Nano powers a couple of features on the\u00a0Pixel 8 Pro, Pixel 8, Pixel 9 Pro, Pixel 9\u00a0and\u00a0Samsung Galaxy S24, including Summarize in Recorder and Smart Reply in Gboard.<\/p>\n<p class=\"wp-block-paragraph\">The Recorder app, which lets users push a button to record and transcribe audio, includes a Gemini-powered summary of recorded conversations, interviews, presentations and other audio snippets. 
Users get summaries even if they don\u2019t have a signal or Wi-Fi connection \u2014 and in a nod to privacy, no data leaves their phone in the process.<\/p>\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"507\" height=\"1080\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?w=319\" alt=\"\" class=\"wp-image-2792997\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg 507w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=70,150 70w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=141,300 141w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=319,680 319w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=202,430 202w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=338,720 338w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=423,900 423w, https:\/\/techcrunch.com\/wp-content\/uploads\/2024\/06\/Pixel8Pro_Recorder-Summaries.jpg?resize=376,800 376w\" sizes=\"auto, (max-width: 507px) 100vw, 507px\"\/><\/figure>\n<p class=\"wp-block-paragraph\">Nano is also in Gboard, Google\u2019s keyboard replacement. 
There, it powers a feature called Smart Reply, which helps to suggest the next thing you\u2019ll want to say when having a conversation in a messaging app such as WhatsApp.<\/p>\n<p class=\"wp-block-paragraph\">In the Google Messages app on supported devices, Nano drives Magic Compose, which can craft messages in styles like \u201cexcited,\u201d \u201cformal\u201d and \u201clyrical.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Google says that a future version of Android will tap Nano to\u00a0alert users to potential scams during calls.\u00a0The new weather app on Pixel phones uses Gemini Nano to generate tailored weather reports. And TalkBack, Google\u2019s accessibility service, employs Nano to\u00a0create aural descriptions of objects\u00a0for low-vision and blind users.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-how-much-do-the-gemini-models-cost\"><span class=\"ez-toc-section\" id=\"How_much_do_the_Gemini_models_cost\"><\/span>How much do the Gemini models cost?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">Gemini 1.0 Pro (the first version of Gemini Pro), 1.5 Pro and Flash are available through Google\u2019s Gemini API for building apps and services \u2014 all with free options. But the free options impose usage limits and leave out certain features, like context caching and <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/cloud.google.com\/vertex-ai\/generative-ai\/docs\/multimodal\/batch-prediction-gemini\">batching<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">Gemini models are otherwise pay-as-you-go. 
Here\u2019s the base pricing \u2014 not including add-ons like context caching \u2014 as of September 2024:<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><strong>Gemini 1.0 Pro:<\/strong>\u00a050 cents per 1 million input tokens, $1.50 per 1 million output tokens<\/li>\n<li class=\"wp-block-list-item\"><strong>Gemini 1.5 Pro:\u00a0<\/strong>$3.50 per 1 million input tokens (for prompts up to 128K tokens) or $7 per 1 million input tokens (for prompts longer than 128K tokens); $10.50 per 1 million output tokens (for prompts up to 128K tokens) or $21.00 per 1 million output tokens (for prompts longer than 128K tokens)<\/li>\n<li class=\"wp-block-list-item\"><strong>Gemini 1.5 Flash:<\/strong>\u00a07.5 cents per 1 million input tokens (for prompts up to 128K tokens), 15 cents per 1 million input tokens (for prompts longer than 128K tokens); 30 cents per 1 million output tokens (for prompts up to 128K tokens), 60 cents per 1 million output tokens (for prompts longer than 128K tokens)<\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\">Tokens are subdivided bits of raw data, like the syllables \u201cfan,\u201d \u201ctas\u201d and \u201ctic\u201d in the word \u201cfantastic\u201d; 1 million tokens is equivalent to about 700,000 words. 
<em>Input<\/em> refers to tokens fed into the model, while <em>output<\/em> refers to tokens that the model generates.<\/p>\n<p class=\"wp-block-paragraph\">Ultra pricing has yet to be announced, and Nano is still in\u00a0<a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/ai.google.dev\/gemini-api\/docs\/get-started\/android_aicore\">early access<\/a>.<\/p>\n<h2 class=\"wp-block-heading\" id=\"h-is-gemini-coming-to-the-iphone\"><span class=\"ez-toc-section\" id=\"Is_Gemini_coming_to_the_iPhone\"><\/span>Is Gemini coming to the iPhone?<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"wp-block-paragraph\">It might.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Apple has said that it\u2019s in talks to put Gemini and other third-party models to use\u00a0for a number of features in its Apple Intelligence suite. Following a\u00a0keynote presentation at WWDC 2024, Apple SVP Craig Federighi\u00a0confirmed plans to work with models\u00a0including Gemini, but he didn\u2019t divulge any additional details.<\/p>\n<p class=\"wp-block-paragraph\"><em>This post was originally published February 16, 2024, and has since been updated to include new information about Gemini and Google\u2019s plans for it.<\/em><\/p>\n<\/div>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/techcrunch.com\/2024\/09\/10\/what-is-google-gemini-ai\/\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google\u2019s trying to make waves with Gemini, its flagship suite of generative AI models, apps and services. But what\u2019s Gemini? How can you use it? And how does it\u00a0stack up to other generative AI tools such as OpenAI\u2019s ChatGPT, Meta\u2019s Llama and Microsoft\u2019s Copilot? 
To make it easier to keep up with the latest Gemini&#8230;<\/p>\n","protected":false},"author":1,"featured_media":635768,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/techcrunch.com\/wp-content\/uploads\/2023\/12\/google-bard-gemini-v2.jpg?resize=1200,675","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[77337,75885,74864,5055,151410,151759,147146,26293,151503],"class_list":["post-635767","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-ai","tag-enterprise","tag-gemini","tag-apps","tag-evergreens","tag-gemini-pro","tag-generative-ai","tag-google","tag-google-gemini"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/635767","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=635767"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/635767\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/635768"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=635767"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=635767"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=635767"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}