Survival of the Fittest: Compact Generative AI Models Are the Future for Cost-Effective AI at Scale

<p>After a decade of rapid growth in artificial intelligence (AI) model complexity and compute, 2023 marks a shift in focus to efficiency and the broad application of generative AI (GenAI). As a result, a new crop of models with less than 15 billion parameters, referred to as nimble AI, can closely match the capabilities of ChatGPT-style giant models containing more than 100B parameters, especially when targeted for particular domains. While GenAI is already being deployed throughout industries for a wide range of business usages, the use of compact, yet highly intelligent models, is rising. In the near future, I expect there will be a small number of giant modes and a giant number of small, more nimble AI models embedded in countless applications.</p> <p>While there has been great progress with larger models, bigger is certainly not better when it comes to training and environmental costs.&nbsp;<a href="https://www.trendforce.com/presscenter/news/20230301-11584.html" rel="noopener ugc nofollow" target="_blank">TrendForce</a>&nbsp;estimates that ChatGPT training alone for GPT-4 reportedly costs more than $100 million, while nimble model pre-training costs are orders-of-magnitude lower (for example, quoted as&nbsp;<a href="https://www.mosaicml.com/blog/mpt-7b" rel="noopener ugc nofollow" target="_blank">approximately $200,000</a>&nbsp;for MosaicML&rsquo;s MPT-7B). Most of the compute costs occur during continuous inference execution, but this follows a similar challenge for larger models including expensive compute. Furthermore, giant models hosted on third-party environments raise security and privacy challenges. Nimble models are substantially cheaper to run and provide a host of additional benefits such as adaptability, hardware flexibility, integrability within larger applications, security and privacy, explainability, and more (see Figure 1). The perception that smaller models don&rsquo;t perform as well as larger models is also changing. Smaller, targeted models are not less intelligent &mdash; they can provide equivalent or superior performance for business, consumer, and scientific domains, increasing their value while decreasing time and cost investment.</p> <p><a href="https://towardsdatascience.com/survival-of-the-fittest-compact-generative-ai-models-are-the-future-for-cost-effective-ai-at-scale-6bbdc138f618"><strong>Read More</strong></a></p>