Stability AI Open Sources GPT-like Models, StableLM

The models can be used for both commercial and research purposes.

Share

Stability AI, the parent company of Stable Diffusion, has announced StableLM, a suite of open-source large language models available in “alpha” on GitHub and Hugging Face. Under the permissive license, developers can freely access, inspect, and utilize the models for both commercial and research purposes.
Check out the GitHub repository here.

Much like ChatGPT, the suite is designed to efficiently generate text and code. Despite its modest size of 3 and 7 billion parameters (15 to 65B parameter models en route) in contrast to GPT-3’s 175 billion. All thanks to a larger version of the open-source dataset known as the Pile. The AI company has decided to stay mum on whether these models suffer from toxicity and hallucinations. Given Pile’s profanity-laced content, it’s a possibility.

Alongside, the release also includes instructionally fine-tuned research models, utilizing a combination of open-source datasets such as Stanford’s Alpaca, GPT4All, Databricks’ Dolly, ShareGPT, and HH. These models, designed exclusively for research purposes, are made available under a noncommercial CC BY-NC-SA 4.0 license, aligning with Alpaca’ license.

Last year, the Emad Mostaque run company made its text-to-image AI available through a public demo, a software beta, and a full download of the model, allowing users to experiment with the tool and come up with various integrations. A similar trend can be expected with StableLM as Meta’s open-source LLaMa language model has been broadly adapted by the community within a few weeks of its release. 

Stability AI’s release has been met with criticism by some researchers, who worry about the models’ potential in nefarious activities. However, Stability AI stands firm in their belief that transparency and accessibility are key to advance technology and it sees open-sourcing as the right path forward.

Read: 14 Open Source LLMs You Need to Know

Share
Picture of Tasmia Ansari

Tasmia Ansari

Tasmia is a tech journalist at AIM, looking to bring a fresh perspective to emerging technologies and trends in data science, analytics, and artificial intelligence.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India