MITB Banner

SetFit – A New Text-Classification Model That Outperforms OpenAI’s GPT-3

The joint research, which was led by Intel Labs and the UKP Lab, and Hugging Face, outperforms GPT-3 in 7 out of 11 tasks – while being 1600x smaller

Share

Listen to this story

The amount of available labelled data is a barrier to producing a high-performing model in many ML applications. Developments in the past two years have shown the challenge of overcoming data limitations by using LLMs (Large Language Models), such as OpenAI GPT-3 to achieve good results. However, while these improve the missing labeled data situation, they introduce a new problem of the access and cost of LLMs. 

To counter this, a group of researchers have discovered a new approach called SetFit to create highly accurate text-classification models with limited labeled data. Intel Labs, UKP Lab, and Hugging Face led the joint research that outperforms GPT-3 in 7 out of 11 tasks – while being 1600x smaller. 

Source: Phil Schmid

According to the blog, SetFit has several unique features compared to other few-shot learning methods. One feature is using no prompts or verbalisers, as current techniques for few-shot fine-tuning require handcrafted prompts. SetFit altogether dispenses with prompts by generating embeddings directly from text examples. Moreover, it doesn’t require large-scale models like GPT-3 to achieve high accuracy. It also consists of multilingual support which can be used with Sentence Transformer on the hub. 

Source: Phil Schmid
The team has generated a high-performing text-classification model with 8 samples per class or only 32 labeled samples using the new approach. “This is huge! SetFit will help so many companies to get started with text-classification and transformers, without the need to label a lot of data and compute power. Compared to LLM training, the SetFit classifier takes less than 1 hour on a small GPU (NVIDIA T4) to train or less than $1 so to speak,” read the blog.

Share
Picture of Bhuvana Kamath

Bhuvana Kamath

I am fascinated by technology and AI’s implementation in today’s dynamic world. Being a technophile, I am keen on exploring the ever-evolving trends around applied science and innovation.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.