Apple’s ReALM Challenges OpenAI’s GPT-4

Apple has unveiled ReALM, an AI model for reference resolution that its researchers say outperforms OpenAI’s GPT-4 on that task.

Illustration by Nikhil Kumar

Amid the buzz surrounding last month’s unveiling of its MM1 model, Apple has now introduced ReALM (Reference Resolution As Language Modeling), another model positioned to beat OpenAI’s GPT-4.

The new model is built to understand context. Users can ask about items that are visible on the screen or running in the background and receive precise answers.

Apple believes its latest AI model surpasses OpenAI’s GPT-4. 

“We also benchmark against GPT-3.5 and GPT-4, with our smallest model achieving performance comparable to that of GPT-4, and our larger models substantially outperforming it,” said the researchers in the paper titled ReALM: Reference Resolution As Language Modeling. 

The researchers include Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu, and Nidhi Rajshree.

GPT-4 vs ReALM

Apple researchers said the two OpenAI baselines were given different inputs. GPT-3.5 accepts only text, so it received the text prompt alone. GPT-4 can also interpret images, so it was given a screenshot along with the prompt, and this combination of text and image helped GPT-4 perform much better.

ReALM, by contrast, does not read the screenshot itself. It works from a purely textual reconstruction of the screen, built from the parsed on-screen entities and their positions.

The researchers, however, said there are further ways to improve the GPT results, such as adding semantically similar example utterances to the prompt until it reaches the length limit. “This more complex approach deserves further, dedicated exploration, and we leave this to future work,” they said.

Further, they said that the ReALM model will be tested across three distinct entity types associated with diverse tasks: on-screen entities, conversational entities, and background entities. 

Decoding Reference Resolution 

Apple researchers further said that people resolve references like ‘they’ or ‘that’ in speech intuitively, using contextual cues without effort. Deciphering such references is a challenge for an LLM-based chatbot, however, because it struggles to identify the intended context.

This challenge is known as reference resolution, where the aim is to comprehend the specific entity or concept to which an expression refers. 
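
To make the task concrete, here is a minimal Python sketch of reference resolution posed as a text problem, using a made-up list of pharmacies as the on-screen content. It is illustrative only and is not the implementation from Apple's paper: the names `Entity`, `EntityType`, `build_prompt` and `parse_answer`, the prompt wording, and the pharmacy names and phone numbers are all hypothetical, and no model is actually called.

```python
from dataclasses import dataclass
from enum import Enum


class EntityType(Enum):
    """The three entity types named in the article (labels are illustrative)."""
    ON_SCREEN = "on-screen"
    CONVERSATIONAL = "conversational"
    BACKGROUND = "background"


@dataclass
class Entity:
    kind: EntityType
    text: str


def build_prompt(request: str, entities: list[Entity]) -> str:
    """Serialise the candidates as a numbered list, followed by the user request."""
    lines = ["Candidate entities:"]
    for index, entity in enumerate(entities, start=1):
        lines.append(f"{index}. [{entity.kind.value}] {entity.text}")
    lines.append(f"User request: {request}")
    lines.append("Which entity numbers does the request refer to?")
    return "\n".join(lines)


def parse_answer(answer: str, entities: list[Entity]) -> list[Entity]:
    """Map a reply such as "3" or "1, 2" back to entities, skipping invalid numbers."""
    chosen = []
    for token in answer.replace(",", " ").split():
        if token.isdecimal() and 1 <= int(token) <= len(entities):
            chosen.append(entities[int(token) - 1])
    return chosen


if __name__ == "__main__":
    # Hypothetical screen contents; a real system would parse these from the UI.
    candidates = [
        Entity(EntityType.ON_SCREEN, "Pharmacy A, 555-0101"),
        Entity(EntityType.ON_SCREEN, "Pharmacy B, 555-0102"),
        Entity(EntityType.BACKGROUND, "Alarm ringing"),
    ]
    print(build_prompt("Call the bottom one", candidates))
    # Stand-in for a language model's reply; the value "2" is assumed, not generated.
    print(parse_answer("2", candidates))
```

The point of the sketch is that once every candidate is written out as text, picking the referent becomes an ordinary language-modelling problem: the model reads the list and the request, and returns the numbers of the entities that the request refers to.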

The researchers believe that the low-power nature and latency constraints of on-device systems call for a ‘single LLM’ with extensive prompts to deliver a seamless experience.

For instance, a user asks Siri about nearby pharmacies and is shown a list. The user then asks to call the number at the bottom of the on-screen list. Siri would not be able to perform this particular task. With ReALM, however, the language model can understand the context by analysing on-device data and fulfil the query.

This also hints that Siri will most likely get a generative AI upgrade at WWDC 2024, scheduled for June 10-14, 2024, setting the stage for the arrival of ReALM. “It’s going to be Absolutely Incredible!” said Apple SVP of marketing Greg Joswiak in a recent post, hinting at the AI innovations to be unveiled at the developers’ conference.

Gopika Raj

With a Master's degree in Journalism & Mass Communication, Gopika Raj infuses her technical writing with a distinctive flair. Intrigued by advancements in AI technology and its future prospects, her writing offers a fresh perspective in the tech domain, captivating readers along the way.