GPT-4 has been the most advanced development in the world of AI so far with its multimodal capabilities. Most recently, a group of researchers has announced MiniGPT-4, an open-source model that performs complex vision-language tasks like GPT-4. The code, demos, and training instructions are available on GitHub.
While OpenAI has announced that GPT-4 is indeed multimodal, the model's ability to process images has not yet been made publicly available. MiniGPT-4, however, can process images.
The researchers have also shown that MiniGPT-4 exhibits many capabilities similar to those of GPT-4, such as generating detailed image descriptions and creating websites from handwritten drafts.
To build MiniGPT-4, the researchers used Vicuna, which is built on LLaMA, as the language decoder, and the pretrained vision components from BLIP-2 as the visual encoder. Interestingly, both Vicuna and BLIP-2 are open source.
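The project reports that both pretrained components are kept frozen and aligned through a single trainable projection layer. The sketch below illustrates that general idea in PyTorch; the module wrappers, dimensions, and forward signature here are illustrative assumptions, not the project's actual code.

```python
import torch
import torch.nn as nn


class FrozenEncoderLLMSketch(nn.Module):
    """Minimal sketch of the alignment pattern described above: a frozen
    vision encoder's output is mapped by one trainable linear layer into
    the embedding space of a frozen language model. All dimensions and
    the placeholder submodules are hypothetical."""

    def __init__(self, vision_encoder: nn.Module, language_model: nn.Module,
                 vision_dim: int = 1408, llm_dim: int = 4096):
        super().__init__()
        self.vision_encoder = vision_encoder
        self.language_model = language_model

        # Freeze both pretrained components; neither receives gradients.
        for p in self.vision_encoder.parameters():
            p.requires_grad = False
        for p in self.language_model.parameters():
            p.requires_grad = False

        # The single trainable projection aligning the two models.
        self.proj = nn.Linear(vision_dim, llm_dim)

    def forward(self, image: torch.Tensor,
                text_embeds: torch.Tensor) -> torch.Tensor:
        # (batch, num_visual_tokens, vision_dim)
        visual_feats = self.vision_encoder(image)
        # Project into the LLM's embedding space:
        # (batch, num_visual_tokens, llm_dim)
        visual_embeds = self.proj(visual_feats)
        # Prepend projected visual tokens to the text embeddings and let
        # the frozen language model attend over the combined sequence.
        inputs = torch.cat([visual_embeds, text_embeds], dim=1)
        return self.language_model(inputs)
```

Because only the projection layer's parameters require gradients, training a setup like this is far cheaper than training either pretrained model end to end, which is part of what makes an open reproduction of this kind feasible.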
Given that OpenAI has not revealed many details about GPT-4's architecture (including model size), hardware, training compute, dataset construction, or training method, this open-source mini version of the powerful LLM could prove significant for research.