Last updated January 4, 2023

LLMs Finally Bloom with Petals

Running large language models (LLMs) BitTorrent-style makes inference many times faster than offloading on a single system, at roughly 1 second per token. Parallel inference can reach hundreds of tokens per second.


Published on January 2, 2023

by Mohit Pandey


Even when large language models such as BLOOM, PaLM, or GPT are open-sourced, fine-tuning them and running inference on your own system is memory-intensive. This can keep developers from running these models locally, slowing innovation and leaving it in the hands of a few big players.

BigScience Workshop has released Petals, which lets users run language models with more than 100 billion parameters at home: each user loads a small part of the model on their own machine and collaborates with others, who run the remaining parts, for inference and fine-tuning.
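The idea of each participant holding only a slice of the model can be illustrated with a toy sketch. This is not the Petals API: the `Peer` class, `split_blocks`, `run_pipeline`, and the choice of 12 blocks and 5 peers are all hypothetical, and each "block" is a trivial function standing in for a transformer layer. It only shows that passing activations through peers in order gives the same result as running every block on one machine.

```python
from dataclasses import dataclass
from typing import Callable, List

# A "block" is a stand-in for one transformer layer (hypothetical, for illustration).
Block = Callable[[float], float]


@dataclass
class Peer:
    """A participant that hosts a contiguous slice of the model's blocks."""
    name: str
    blocks: List[Block]

    def forward(self, hidden: float) -> float:
        # Run only the blocks this peer holds.
        for block in self.blocks:
            hidden = block(hidden)
        return hidden


def split_blocks(blocks: List[Block], num_peers: int) -> List[Peer]:
    """Assign contiguous, near-equal slices of blocks to each peer."""
    base, extra = divmod(len(blocks), num_peers)
    peers, start = [], 0
    for i in range(num_peers):
        size = base + (1 if i < extra else 0)
        peers.append(Peer(f"peer-{i}", blocks[start:start + size]))
        start += size
    return peers


def run_pipeline(peers: List[Peer], hidden: float) -> float:
    """Send the activation through every peer in order."""
    for peer in peers:
        hidden = peer.forward(hidden)
    return hidden


def make_block(i: int) -> Block:
    # Toy computation: add the block index to the activation.
    return lambda h: h + i


blocks = [make_block(i) for i in range(12)]  # 12 is an arbitrary block count
peers = split_blocks(blocks, 5)              # 5 is an arbitrary peer count

# Reference result: all blocks run locally on one machine.
local = 0.0
for block in blocks:
    local = block(local)

distributed = run_pipeline(peers, 0.0)
print([len(p.blocks) for p in peers], distributed == local)
```

In a real deployment the hand-off between peers happens over the network, which is where most of the latency comes from; the sketch leaves that out.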

The Petals repository is available on GitHub.

Running large language models BitTorrent-style makes inference many times faster than offloading on a single system, at roughly 1 second per token. Parallel inference can reach hundreds of tokens per second.

The installation script targets CUDA-enabled PyTorch, installs through Anaconda, and is available only for Linux for now.
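A quick way to see whether a machine meets these requirements is sketched below. It is a generic check, not part of Petals: the `environment_report` helper is a hypothetical name, and it only tests for Linux, an importable `torch` package, and a CUDA device visible to PyTorch.

```python
import importlib.util
import platform


def environment_report() -> dict:
    """Report whether this machine looks like a Linux box with CUDA-enabled PyTorch."""
    report = {
        "linux": platform.system() == "Linux",
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
    }
    if report["torch_installed"]:
        try:
            import torch
            # True only if PyTorch was built with CUDA and can see a GPU.
            report["cuda_available"] = bool(torch.cuda.is_available())
        except ImportError:
            # A broken install counts as "not available" for this check.
            report["torch_installed"] = False
    return report


print(environment_report())
```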

As the GitHub page explains, “Petals” is a metaphor: individual people each serve a different part of the model, and together they host the entire language model, BLOOM, which has 176 billion parameters.
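A rough calculation shows why a model of this size does not fit on one home machine. The sketch below counts weights only, ignoring activations and any caches, and the 24 GB device size is an assumed example of a consumer GPU rather than a figure from the project.

```python
import math

PARAMS = 176_000_000_000  # BLOOM parameter count cited above

# Bytes needed to store one parameter at common precisions.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}


def weights_gb(num_params: int, dtype: str) -> float:
    """Memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def min_devices(num_params: int, dtype: str, device_memory_gb: float) -> int:
    """Lower bound on devices needed to hold the weights, ignoring all overhead."""
    return math.ceil(weights_gb(num_params, dtype) / device_memory_gb)


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weights_gb(PARAMS, dtype):.0f} GB")

# Assumed 24 GB of memory per device (hypothetical consumer GPU).
print(min_devices(PARAMS, "bfloat16", 24))
```

Even at 16-bit precision the weights come to about 352 GB, so at least 15 such devices would be needed before accounting for any overhead, which is the gap that pooling machines is meant to close.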

Because privacy or security concerns might slow collaboration at first, the team has decided to award “bloom points” as an incentive to people who donate GPU time that others can use to fine-tune the model.




© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024