MITB Banner

Meta AI Chief Yann LeCun’s Self-Supervised Idea Finally Sees Open Source Light 

Meta AI successfully trained a visual transformer model with 632M parameters utilising 16 A100 GPUs within a span of 72 hours

Share

Meta AI Chief Yann LeCun’s Self-Supervised Idea Finally Sees Open Source Light
Listen to this story

Meta has been one of the biggest proponents of self-supervised learning when it comes to AI. Today, Meta AI has announced I-JEPA, a self-supervised computer vision model that learns the world by predicting it, based on Yann LeCun’s vision of autonomous machine intelligence to learn and reason similar to how humans and animals do. 

Click here to check it out. The paper is also being presented at CVPR2023 next week.

The model’s training code and checkpoints are open-sourced under a non-commercial licence

I-JEPA (Image Joint Embedding Predictive Architecture) learns by creating an internal model representing the outside world and compares abstract representation of images, instead of comparing the pixels. 

According to the paper, this model delivers strong performance on various computer vision tasks, and is highly efficient than other similar models. 

I-JEPA offers versatile applicability without requiring extensive fine-tuning. Meta AI successfully trained a visual transformer model with 632M parameters utilising 16 A100 GPUs within a span of 72 hours. This model attains state-of-the-art results for low-shot classification on ImageNet, with a mere 12 labelled examples per class. In comparison, alternative approaches often consume two to 10 times the GPU-hours and yield inferior error rates when trained with an equivalent dataset size.

By learning from representations instead of pixels, the model is able to avoid biases and issues that occur due to invariance-based pre-training. This also enables the model to learn directly from the images, instead of representation, which Meta AI says is a problem with the current LLM models. 

The Theory

Last year, Yann LeCun, Meta’s Chief AI Scientist, introduced an innovative architecture designed to address the significant constraints faced by contemporary AI systems, called the world model. LeCun envisions the development of machines capable of rapidly acquiring internal models of the world’s dynamics, enabling them to efficiently learn, strategise for complex tasks, and seamlessly adapt to novel circumstances.

This work by Meta AI is highly based on the hypothesis that common sense information is the key for enabling intelligent behaviour. This knowledge is achieved by passively observing the world which is stored on the background of the mind. Meta believes that self-supervised learning is the path towards human-like intelligence.

For this to work, the system needs to acquire these representations through self-supervised learning, which entails learning directly from unlabeled data like images or sounds, instead of relying on manually curated labelled datasets.

Meta AI demonstrates the potential of I-JEPA, showcasing the ability to learn competitive off-the-shelf image representations without relying on additional knowledge encoded through manually designed image transformations. 

Advancing JEPAs further to acquire broader world-models from richer modalities would be particularly intriguing. This advancement could enable making long-range spatial and temporal predictions about future events in videos based on a short context, while conditioning these predictions on audio or textual prompts.

Share
Picture of Mohit Pandey

Mohit Pandey

Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words. He also holds a keen interest in photography, filmmaking, and the gaming industry.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.