{"id":452592,"date":"2022-05-25T16:14:49","date_gmt":"2022-05-25T13:14:49","guid":{"rendered":"https:\/\/en.buradabiliyorum.com\/google-takes-on-openai-with-flashy-text-to-image-generator\/"},"modified":"2022-05-25T16:14:49","modified_gmt":"2022-05-25T13:14:49","slug":"google-takes-on-openai-with-flashy-text-to-image-generator","status":"publish","type":"post","link":"https:\/\/buradabiliyorum.com\/en\/google-takes-on-openai-with-flashy-text-to-image-generator\/","title":{"rendered":"#Google takes on OpenAI with flashy text-to-image generator"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_85 counter-hierarchy ez-toc-counter ez-toc-custom ez-toc-container-direction\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<label for=\"ez-toc-cssicon-toggle-item-6a4212fb590bc\" class=\"ez-toc-cssicon-toggle-label\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #dd3333;color:#dd3333\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #dd3333;color:#dd3333\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/label><input type=\"checkbox\"  id=\"ez-toc-cssicon-toggle-item-6a4212fb590bc\" checked aria-label=\"Toggle\" \/><nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 
ez-toc-heading-level-1'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/buradabiliyorum.com\/en\/google-takes-on-openai-with-flashy-text-to-image-generator\/#%E2%80%9CGoogle_takes_on_OpenAI_with_flashy_text-to-image_generator%E2%80%9D\" >&#8220;Google takes on OpenAI with flashy text-to-image generator&#8221;<\/a><ul class='ez-toc-list-level-2' ><li class='ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/buradabiliyorum.com\/en\/google-takes-on-openai-with-flashy-text-to-image-generator\/#Greetings_humanoids\" >Greetings humanoids<\/a><\/li><\/ul><\/li><\/ul><\/nav><\/div>\n<h1><span class=\"ez-toc-section\" id=\"%E2%80%9CGoogle_takes_on_OpenAI_with_flashy_text-to-image_generator%E2%80%9D\"><\/span>&#8220;Google takes on OpenAI with flashy text-to-image generator&#8221;<span class=\"ez-toc-section-end\"><\/span><\/h1>\n<div>\n                            The AI imagery competition is getting personal.<\/p>\n<p>Google this week unveiled a new challenger to OpenAI\u2019s vaunted DALLE-2 text-to-image generator\u00a0\u2014 and took shots at its rival\u2019s efforts.<\/p>\n<p>Both models convert text prompts into pictures. 
But Google\u2019s researchers claim their\u00a0system\u00a0provides \u201c<span>unprecedented photorealism and deep language understanding.\u201d<\/span><\/p>\n<div class=\"corona-wrapper neural-cta hs-embed-tnw\">\n<div class=\"neural-cta-wrapper\">\n<div class=\"neural-cta-img\"><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/events.tnw\/hardfork-2018\/uploads\/companies\/neural-newsletter_header.gif\"\/><\/div>\n<p><noscript><img decoding=\"async\" src=\"https:\/\/s3.amazonaws.com\/events.tnw\/hardfork-2018\/uploads\/companies\/neural-newsletter_header.gif\"\/><\/noscript><\/p>\n<div class=\"neural-cta-input\">\n<h2 class=\"neural-cta-title\"><span class=\"ez-toc-section\" id=\"Greetings_humanoids\"><\/span>Greetings humanoids<span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p class=\"neural-cta-tagline\">Subscribe now for a weekly recap of our favorite AI stories<\/p>\n<p><!--[if lte IE 8]><![endif]--><\/div>\n<\/div>\n<\/div>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387410 js-lazy\" alt=\": Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Conflicting category. We observe that both DALL-E 2 and Imagen struggle generating well aligned images for this category. However, Imagen often generates some well aligned samples, e.g. 
\u201cA panda making latte art.\u201d\" width=\"770\" height=\"856\" sizes=\"auto, (max-width: 770px) 100vw, 770px\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07.png\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07.png 770w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-189x210.png 189w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-243x270.png 243w\"\/><figcaption><a rel=\"nofollow noopener\" target=\"_blank\" href=\"#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2022%2F05%2F25%2Fgoogle-takes-on-openai-with-flashy-text-to-image-generator%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: Human raters preferred Imagen over DALLE-2 for both sample quality and image-text alignment. Credit: Saharia et al.\" data-title=\"Share Human raters preferred Imagen over DALLE-2 for both sample quality and image-text alignment. Credit: Saharia et al. on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share Human raters preferred Imagen over DALLE-2 for both sample quality and image-text alignment. Credit: Saharia et al. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"\/><\/a>Human raters preferred Imagen over DALLE-2 for both sample quality and image-text alignment. 
Credit: Saharia et al.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387410\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07.png\" alt=\": Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Conflicting category. We observe that both DALL-E 2 and Imagen struggle generating well aligned images for this category. However, Imagen often generates some well aligned samples, e.g. \u201cA panda making latte art.\u201d\" width=\"770\" height=\"856\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07.png 770w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-189x210.png 189w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.58.07-243x270.png 243w\"\/><\/noscript><\/figure>\n<p>The cringingly-named Imagen system uses a large pre-trained language model as a text encoder. 
A cascade of <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/lilianweng.github.io\/posts\/2021-07-11-diffusion-models\/\">diffusion models<\/a> then turns the user\u2019s words into pictures.<\/p>\n<p>In tests, the Google team said Imagen \u201csignificantly outperformed\u201d DALL-E 2.<\/p>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387407 js-lazy\" alt=\"Imagen vs DALL-E 2 on DrawBench a) image-text alignment, and b) image fidelity.\" width=\"914\" height=\"1098\" sizes=\"auto, (max-width: 914px) 100vw, 914px\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22.png\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22.png 914w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-175x210.png 175w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-112x135.png 112w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-225x270.png 225w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-796x956.png 796w\"\/><figcaption><a rel=\"nofollow noopener\" target=\"_blank\" href=\"#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2022%2F05%2F25%2Fgoogle-takes-on-openai-with-flashy-text-to-image-generator%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: Imagen particularly outshone DALL-E 2 in the colors, positional, text, and description categories. Credit: Saharia et al.\" data-title=\"Share Imagen particularly outshone DALL-E 2 in the colors, positional, text, and description categories. Credit: Saharia et al. 
on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share Imagen particularly outshone DALL-E 2 in the colors, positional, text, and description categories. Credit: Saharia et al. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"\/><\/a>Imagen particularly outshone DALL-E 2 in the colors, positional, text, and description categories. Credit: Saharia et al.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387407\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22.png\" alt=\"Imagen vs DALL-E 2 on DrawBench a) image-text alignment, and b) image fidelity.\" width=\"914\" height=\"1098\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22.png 914w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-175x210.png 175w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-112x135.png 112w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-225x270.png 225w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.51.22-796x956.png 796w\"\/><\/noscript><\/figure>\n<p>Imagen\u2019s developers have even invented a new method of measuring the supremacy of their creation.<\/p>\n<p>Dubbed DrawBench, the benchmark compares human judgments on the outputs of different text-to-image generators.<\/p>\n<p>Unsurprisingly, Google\u2019s metric gave strong scores to Google\u2019s system.<\/p>\n<p>\u201cWith DrawBench, extensive human evaluation shows that 
Imagen outperforms other recent methods by a significant margin,\u201d the researchers said in <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/arxiv.org\/pdf\/2205.11487.pdf\">their study paper<\/a>.<\/p>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387409 js-lazy\" alt=\"Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Colors category. We observe that DALL-E 2 generally struggles with correctly assigning the colors to the objects especially for prompts with more than one object.\" width=\"768\" height=\"858\" sizes=\"auto, (max-width: 768px) 100vw, 768px\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15.png\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15.png 768w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-188x210.png 188w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-242x270.png 242w\"\/><figcaption><a rel=\"nofollow noopener\" target=\"_blank\" href=\"#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2022%2F05%2F25%2Fgoogle-takes-on-openai-with-flashy-text-to-image-generator%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: DALL-E 2 can struggle to correctly assign colors to objects \u2014 especially for prompts with more than one object. Credit: Saharia et al.\" data-title=\"Share DALL-E 2 can struggle to correctly assign colors to objects \u2014 especially for prompts with more than one object. Credit: Saharia et al. 
on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share DALL-E 2 can struggle to correctly assign colors to objects \u2014 especially for prompts with more than one object. Credit: Saharia et al. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"\/><\/a>DALL-E 2 can struggle to correctly assign colors to objects \u2014 especially for prompts with more than one object. Credit: Saharia et al.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387409\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15.png\" alt=\"Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Colors category. We observe that DALL-E 2 generally struggles with correctly assigning the colors to the objects especially for prompts with more than one object.\" width=\"768\" height=\"858\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15.png 768w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-188x210.png 188w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.56.15-242x270.png 242w\"\/><\/noscript><\/figure>\n<p>The images and metrics certainly look impressive, but Google hasn\u2019t offered an opportunity to scrutinize the results.<\/p>\n<p>You can try some interactive demos at <a rel=\"nofollow noopener\" target=\"_blank\" href=\"https:\/\/imagen.research.google\/\">the Imagen website<\/a>, but these only let you use a small selection of phrases to form a constrained sentence.<\/p>\n<p>Until the model and code get a public release, cynics will suspect that Google\u2019s cherry-picking the results.<\/p>\n<figure 
class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387411 js-lazy\" alt=\"Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Text category. Imagen is significantly better than DALL-E 2 in prompts with quoted text.\" width=\"768\" height=\"854\" sizes=\"auto, (max-width: 768px) 100vw, 768px\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38.png\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38.png 768w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-189x210.png 189w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-243x270.png 243w\"\/><figcaption><a rel=\"nofollow noopener\" target=\"_blank\" href=\"#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2022%2F05%2F25%2Fgoogle-takes-on-openai-with-flashy-text-to-image-generator%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: Imagen was significantly better than DALL-E 2 in prompts with quoted text. Credit: Saharia et al.\" data-title=\"Share Imagen was significantly better than DALL-E 2 in prompts with quoted text. Credit: Saharia et al. on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share Imagen was significantly better than DALL-E 2 in prompts with quoted text. Credit: Saharia et al. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"\/><\/a>Imagen was significantly better than DALL-E 2 in prompts with quoted text. 
Credit: Saharia et al.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387411\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38.png\" alt=\"Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Text category. Imagen is significantly better than DALL-E 2 in prompts with quoted text.\" width=\"768\" height=\"854\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38.png 768w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-189x210.png 189w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.59.38-243x270.png 243w\"\/><\/noscript><\/figure>\n<p>Google\u2019s explanation for keeping the model private echoes one given by OpenAI: the system is too dangerous to release.<\/p>\n<p>The researchers warn that generative methods can spread misinformation, stir harassment, and exacerbate marginalization.<\/p>\n<p>\u201cOur preliminary assessment also suggests Imagen encodes several social biases and stereotypes, including an overall bias towards generating images of people with lighter skin tones and a tendency for images portraying different professions to align with Western gender stereotypes,\u201d said the researchers.<\/p>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387408 js-lazy\" alt=\" Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Reddit category.\" 
width=\"764\" height=\"854\" sizes=\"auto, (max-width: 764px) 100vw, 764px\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57.png\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57.png 764w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-188x210.png 188w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-242x270.png 242w\"\/><figcaption><a rel=\"nofollow noopener\" target=\"_blank\" href=\"#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2022%2F05%2F25%2Fgoogle-takes-on-openai-with-flashy-text-to-image-generator%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: Imagen significantly outperformed DALL-E 2 in the positional, text, and descriptions categories. Credit: Saharia et al.\" data-title=\"Share Imagen significantly outperformed DALL-E 2 in the positional, text, and descriptions categories. Credit: Saharia et al. on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share Imagen significantly outperformed DALL-E 2 in the positional, text, and descriptions categories. Credit: Saharia et al. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"\/><\/a>Imagen significantly outperformed DALL-E 2 in the positional, text, and descriptions categories. 
Credit: Saharia et al.<span class=\"commtext c00\"\/><\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1387408\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57.png\" alt=\" Example qualitative comparisons between Imagen and DALL-E 2 [54] on DrawBench prompts from Reddit category.\" width=\"764\" height=\"854\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57.png 764w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-188x210.png 188w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-121x135.png 121w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Screenshot-2022-05-25-at-10.54.57-242x270.png 242w\"\/><\/noscript><\/figure>\n<p>The team concludes that Imagen \u201cis not suitable for public use at this time\u201d \u2014\u00a0but does offer hope of a future release.<\/p>\n<p><span style=\"font-weight: 400;\">I await their update with caution. As someone who creates images for articles every day, the prospect of AI labs competing to offer better results is attractive.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">On the other hand, I don\u2019t want our robot overlords to replace artists with algorithms.<\/span>\n                        <\/div>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p><span style=\"color: black;\"><a style=\"color: #ff9900;\" href=\"https:\/\/thenextweb.com\/news\/google-takes-on-openai-with-flashy-text-to-image-generator\" target=\"_blank\" rel=\"noopener\">Source<\/a><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&#8220;Google takes on OpenAI with flashy text-to-image generator&#8221; The AI imagery competition is getting personal. Google this week unveiled a new challenger to OpenAI\u2019s vaunted DALLE-2 text-to-image generator\u00a0\u2014 and took shots at its rival\u2019s efforts. Both models convert text prompts into pictures. 
But Google\u2019s researchers claim their\u00a0system\u00a0provides \u201cunprecedented photorealism and deep language understanding.\u201d Greetings humanoids&#8230;<\/p>\n","protected":false},"author":1,"featured_media":452593,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"fifu_image_url":"https:\/\/img-cdn.tnwcdn.com\/image\/neural?filter_last=1&fit=1280,640&url=https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2022\/05\/Untitled-design-8.jpg&signature=70123fc3a3539270bacc97fa7a105ebc","fifu_image_alt":"","footnotes":""},"categories":[18],"tags":[],"class_list":["post-452592","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology"],"_links":{"self":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/452592","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/comments?post=452592"}],"version-history":[{"count":0,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/posts\/452592\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media\/452593"}],"wp:attachment":[{"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/media?parent=452592"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/categories?post=452592"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/buradabiliyorum.com\/en\/wp-json\/wp\/v2\/tags?post=452592"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}