Create your own Generative AI Text-to-Image API

<h1>The TL;DR</h1> <ul> <li>Recent advances in Generative AI have led to the launch of a whole host of services such as DALL-E 2, Midjourney and Stability AI that have the potential to drastically change the way we approach content creation.</li> <li>In this post I show you how to build and serve your very own, high performance, text-to-image service over an API. Based on&nbsp;<a href="https://github.com/CompVis/stable-diffusion" rel="noopener ugc nofollow" target="_blank">Stable Diffusion</a>&nbsp;via&nbsp;<a href="https://medium.com/towards-data-science/whats-hugging-face-122f4e7eb11a" rel="noopener">HuggingFace</a>, using Vertex AI Workbench and Endpoints.</li> </ul> <h1>How we got here</h1> <p>As George Lawton mentions in his&nbsp;<a href="https://www.techtarget.com/searchenterpriseai/definition/generative-AI" rel="noopener ugc nofollow" target="_blank">article</a>: &ldquo;Generative AI is a type of artificial intelligence technology that can produce various types of content including text, imagery, audio and synthetic data. The recent buzz around generative AI has been driven by the simplicity of new user interfaces for creating high-quality text, graphics and videos in a matter of seconds.&rdquo;[2]</p> <p>Machine Learning is nothing new, in fact it&rsquo;s been around in some shape or form since the 1960s[1]. &ldquo;But it was not until 2014, with the introduction of&nbsp;<a href="https://en.wikipedia.org/wiki/Generative_adversarial_network" rel="noopener ugc nofollow" target="_blank">generative adversarial networks&nbsp;</a>(GANs), a type of machine learning algorithm, that generative AI could create convincingly authentic images, videos and audio of real people.&rdquo;[2]</p> <p>Combined with the power of Large Language Models (LLMs) that can take a user prompt in natural language describing something and then produce photorealistic images, we&rsquo;ve come a very long way in a short period of time. The first to do this was&nbsp;<a href="https://openai.com/product/dall-e-2" rel="noopener ugc nofollow" target="_blank">OpenAI&rsquo;s DALL&middot;E</a>, in April 2022, followed by Disco Diffusion in August 2022, which was eventually succeeded by Stable Diffusion.</p> <p><a href="https://towardsdatascience.com/create-your-own-generative-ai-text-to-image-api-548c07a4d839"><strong>Website</strong></a></p>
Tags: Image API