Now Build ChatGPT On Your Own Device

The company uses a PyTorch-based implementation that covers all three stages: pre-training, reward model training, and reinforcement learning.

Since OpenAI has not open-sourced the code for ChatGPT, replicating the chatbot is a herculean task, and even big tech companies are struggling with it. But AI startup Colossal-AI has found a way for developers to build their own ChatGPT with fewer computing resources.

To this end, the company uses a PyTorch-based implementation that covers all three stages: pre-training, reward model training, and reinforcement learning. It offers a demo version of the training process that requires only 1.62 GB of GPU memory and can run on a single consumer-grade GPU, with up to 10.3 times more model capacity on one GPU.
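
The article does not say which loss function Colossal-AI uses in the reward model stage. As an illustration only, a common choice in this kind of pipeline is a pairwise ranking loss, where the reward model is pushed to score a human-preferred response above a rejected one. The sketch below uses plain Python; the function name and the sample scores are hypothetical and are not taken from the Colossal-AI code.

```python
import math


def pairwise_ranking_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Return -log(sigmoid(reward_chosen - reward_rejected)).

    Assumption: the reward model emits one scalar score per response.
    The loss is small when the preferred response scores higher.
    """
    diff = reward_chosen - reward_rejected
    # -log(sigmoid(d)) equals log(1 + exp(-d)); branch to avoid overflow.
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))


# Hypothetical scores for two responses to the same prompt.
loss_when_ranked_correctly = pairwise_ranking_loss(2.0, 0.0)
loss_when_ranked_wrongly = pairwise_ranking_loss(0.0, 2.0)
print(loss_when_ranked_correctly < loss_when_ranked_wrongly)
```

The trained reward model would then supply the scalar reward that the reinforcement learning stage optimizes against.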

Check out the GitHub repository here.

Colossal-AI said that, compared to the original PyTorch, single-machine training is 7.7 times faster and single-GPU inference can be 1.42 times faster, which is achievable with a single line of code. For fine-tuning, users can increase the capacity of the model by up to 3.7 times on a single GPU with one line of code, while still running at high speed.

With the original PyTorch, an A100 80GB GPU, which costs $14,999, can typically handle a model of only 780 million parameters. Colossal-AI, on the other hand, boosts single-GPU capacity by 10.3 times, to 8 billion parameters.
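
As a quick sanity check on the figures quoted above, the short Python sketch below multiplies the 780 million parameter baseline by the 10.3 times capacity gain. The memory estimate at the end is an assumption added for scale: it counts 2 bytes per parameter for half-precision weights only, and ignores gradients, optimizer states, and activations, which the article does not break down.

```python
# Figures quoted in the article.
BASELINE_PARAMS = 780e6   # largest model the original PyTorch handles on one A100 80GB
CAPACITY_GAIN = 10.3      # capacity increase claimed by Colossal-AI


def weights_gib(num_params: float, bytes_per_param: int = 2) -> float:
    """Memory for the weights alone, in GiB.

    Assumption: 2 bytes per parameter (fp16); not a figure from the article.
    """
    return num_params * bytes_per_param / 2**30


boosted = BASELINE_PARAMS * CAPACITY_GAIN
print(f"{boosted / 1e9:.2f} billion parameters")
print(f"{weights_gib(boosted):.1f} GiB for fp16 weights alone")
```

The product comes to roughly 8.03 billion, which matches the article's rounded figure of 8 billion parameters.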

Several versions are available: a single-GPU scale, a multi-GPU scale on a single node, and a 175-billion-parameter scale. Developers can also import the OPT, GPT-3, and BLOOM pre-trained language models from Hugging Face.

Learn more about the process by checking out the documentation and code.

PS: The story was written using a keyboard.
Mohit Pandey

Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words. He also holds a keen interest in photography, filmmaking, and the gaming industry.