7 Essential Open-Source Generative AI Models Available Today
20 May 2024
There are many reasons that businesses may want to choose open-source over proprietary tools when getting started with generative AI.
This could be because of cost, opportunities for customization and optimization, transparency or simply the support that’s offered by the community.
There are disadvantages too, of course, and I cover the pros and cons of each option more fully in this article.
With software generally, the term open-source simply means that the source code is publicly available and can be used, free of charge, for pretty much any purpose.
When it comes to AI models, though, there has been some debate about exactly what this entails, as we will get into it as we discuss the individual models covered here. So, let’s dive in.
Stable Diffusion
One of the most powerful and flexible image generation models, and certainly the most widely-used open-source image models, Stable Diffusion 3 (the latest version as of writing) supports text-to-image as well as image-to-image generation and has become well-known for its ability to create highly realistic and detailed images.
As is common with open-source software, using Stable Diffusion isn’t quite as straightforward as using commercial, proprietary tools like ChatGPT. Rather than having its own web interface, it’s accessed through third-party tools built by commercial entities, including DreamStudio and Stable Diffusion Web. The alternative is to compile and run it yourself locally, and this requires providing your own compute resources as well as technical know-how.
Meta Llama 3
This is a family of language models available in various sizes, making it suitable for different applications, from lightweight mobile clients to full-size cloud deployments. The same model that powers the Meta AI assistant available across its social media platforms can be deployed by anyone for many uses including natural language generation and creating computer code. One of its strong points is its ability to run on relatively low-powered hardware. However, as with some of the other models covered here, there is some debate as to whether it can truly be considered open-source, as Meta has not disclosed exact details of its training data.
Mistral AI
Mistral is a French startup that has developed several generative AI models that it has made available under open-source licenses. These include Mistral 7B, which is designed to be lightweight and easy to deploy on low-power hardware, and the more powerful Mistral 8x22B. It has a strong user community offering support, and positions itself as a highly flexible and customizable generative language model.
GPT-2
OpenAI has open-sourced the second version of their LLM – essentially an earlier version of the engines that are now used to power ChatGPT. While it isn’t as big, powerful or flexible as the later GPT-3.5 or GPT-4 (built on 1.2 billion parameters compared to GPT-4’s one-trillion plus), it’s still considered to be perfectly adequate for many language-based tasks such as generating text or powering chatbots. GPT-2 is made available by OpenAI under the MIT license, which is generally considered to be compliant with open-source principles.
BLOOM
BLOOM is described as the world’s largest open, multilingual language model, built on 176 billion parameters. Development was led by Hugging Face, a repository of open-source AI resources working alongside a team of over 1,000 researchers as part of a global collaborative project known as BigScience. The aim was to create a truly open and transparent LLM available to anyone who agrees to the terms of the project’s Responsible AI License. Technically, this means it isn’t quite open source, but it is freely available to use and distribute, as long as it isn’t used for harmful purposes as defined by the terms of the license. This makes it a very interesting experiment in the critically important domain of developing and distributing ethical AI.
Grok.AI
This LLM also claims to be the world’s largest open-source model, although again there is some debate as to whether it technically fills all of the criteria for being truly open source.
Grok was designed and built by X.ai, a startup founded by Elon Musk following his split from OpenAI. This split has been reported as being caused by disagreements over exactly what “open” means when it comes to AI models.
Rather than using the term large language model, X describes Grok as a “mixture of experts” model, reflecting the fact that the base model is designed to be more general-purpose and is not specifically trained for creating dialogue, as is the case with, for example, ChatGPT.
As with Llama, skepticism of Grok’s open-source status is based on the fact that while X.ai has made the weights and architecture of the model publicly available, it hasn’t revealed all of the code or training data.
Falcon
Two models of this LLM architecture have been made freely available by its developers, the Technology Innovation Institute, a research institution founded by the government of Abu Dhabi. Both models – the more portable Falcon 40B and the more powerful 180B, have been released as open source and reportedly come second only to GPT-4 on Open Face’s leaderboard of LLM performance. While the smaller model is released under the Apache 2.0 license – generally considered to fit the definition of open-source - the larger model has had some conditions attached to its use and distribution.
This exploration into the realm of open-source generative AI tools illuminates the diverse array of options available and underscores the transformative potential these technologies hold for businesses eager to leverage AI's power while embracing transparency, cost-efficiency, and robust community support.
Related Articles
4 Smartphones Leading The AI Revolution
As enterprises increasingly rely on company-issued smartphones as primary computing devices, these mobile devices are becoming the frontline of workplace AI integration.[...]
The Rise Of AI-Enabled Virtual Pets: Why Millions Are Raising Digital Companions
Remember Tamagotchis? Those tiny digital pets that had millions of kids frantically pressing buttons to keep their virtual companions alive in the 1990s?[...]
The Dark Side Of AI: How Deepfakes And Disinformation Are Becoming A Billion-Dollar Business Risk
Every week, I talk to business leaders who believe they're prepared for AI disruption. But when I ask them about their defense strategy against AI-generated deepfakes and disinformation, I'm usually met with blank stares.[...]
Why You Should Be Polite To ChatGPT And Other AIs
In my latest conversation with ChatGPT, I caught myself saying "please" and "thank you." My wife, overhearing this, couldn't help but laugh at my politeness toward a machine.[...]
The 7 Revolutionary Cloud Computing Trends That Will Define Business Success In 2025
Picture this: A world where quantum computing is as accessible as checking your email, where AI automatically optimizes your entire cloud infrastructure, and where edge computing seamlessly melds with cloud services to deliver lightning-fast responses.[...]
AI And The Global Economy: A Double-Edged Sword That Could Trigger Market Meltdowns
The stock market's current AI euphoria, driven by companies like NVIDIA developing powerful processors for machine learning, might mask a more troubling reality.[...]
Sign up to Stay in Touch!
Bernard Marr is a world-renowned futurist, influencer and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity.
He is a best-selling author of over 20 books, writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations.
He has a combined following of 4 million people across his social media channels and newsletters and was ranked by LinkedIn as one of the top 5 business influencers in the world.
Bernard’s latest book is ‘Generative AI in Practice’.
Social Media