MIT neuroscientists have developed a computer model that can localize sounds. The model consists of several convolutional neural networks and performs the task as well as humans do.
The human brain is tuned to recognize particular sounds and determine the direction of their origin. The brain estimates the location of a sound by comparing differences in timing and intensity between the signals that reach the right and left ears. “We now have a model that can actually localize sounds in the real world,” said Josh McDermott, an associate professor of brain and cognitive sciences and a member of MIT’s McGovern Institute for Brain Research. “And when we treated the model like a human experimental participant and simulated this large set of experiments that people had tested humans on in the past, what we found over and over again is that the model recapitulates the results that you see in humans.”
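The interaural comparison described above can be illustrated with a short sketch: cross-correlating the two ear signals and reading off the lag of the peak yields the interaural time difference. This is a simplified illustration of the general principle, not the model from the study; the function name and the toy click signal are invented for the example.

```python
import numpy as np

def estimate_itd(left, right, sample_rate):
    """Estimate the interaural time difference in seconds.

    A positive result means the sound arrived at the left ear first,
    i.e. the right-ear signal is a delayed copy of the left-ear signal.
    """
    corr = np.correlate(left, right, mode="full")
    # Index of the correlation peak, converted to the lag of `right`
    # relative to `left`.
    delay_samples = (len(right) - 1) - int(np.argmax(corr))
    return delay_samples / sample_rate

# Toy example: a click that reaches the right ear 5 samples later.
fs = 44_100
click = np.zeros(100)
click[20] = 1.0
left = click
right = np.roll(click, 5)  # delayed copy of the left-ear signal

itd = estimate_itd(left, right, fs)  # ≈ 0.000113 s (5 samples at 44.1 kHz)
```

Real auditory scenes also involve intensity differences and spectral cues, which is part of why the study's models needed to learn from rich, realistic training data.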
McDermott is the senior author of the paper, which appeared in Nature Human Behaviour. The paper’s lead author is MIT graduate student Andrew Francl. “The study also found that humans’ ability to perceive location is adapted to the specific challenges of the environment,” added McDermott.
Convolutional neural networks are also used extensively to model the human visual system.
Since convolutional neural networks can be designed with many different architectures, the MIT team first used a supercomputer to train and test about 1,500 different models and find the ones that would work best for localization. The researchers narrowed these down to the 10 best-performing models, which they trained further and used for subsequent studies.
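The selection procedure can be sketched as a search loop: sample many candidate architectures, score each one, and keep the best. The hyperparameter ranges and the scoring function below are placeholders for illustration, not details from the study, where each candidate was actually trained on the localization task.

```python
import random

rng = random.Random(0)

def sample_architecture():
    # Illustrative hyperparameter ranges, not the ones used in the study.
    return {
        "layers": rng.choice([4, 6, 8, 10]),
        "kernel": rng.choice([3, 5, 7]),
        "channels": rng.choice([32, 64, 128]),
    }

def validation_score(arch):
    # Placeholder: a real search would train the network on the
    # localization task and return its accuracy on held-out sounds.
    return rng.random()

candidates = [sample_architecture() for _ in range(1500)]
scored = sorted(candidates, key=validation_score, reverse=True)
top_models = scored[:10]  # keep the ten best performers for further study
```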
To train the models, the researchers created a virtual world in which they controlled the size of the room and the reflection properties of the walls. They used over 400 training sounds, including human voices, animal sounds, machine sounds and natural sounds. The researchers also ensured the models started with the same information human ears provide, which includes effects such as the way sounds are reflected and altered by the folds of the outer ear. The researchers simulated this effect by running each sound through a specialized mathematical function.
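Filtering a sound through such a function can be sketched as convolving it with a direction-specific impulse response for each ear. The impulse responses below are invented toy values, not measurements from the study; they merely illustrate how the same source sound ends up delayed and attenuated differently at each ear.

```python
import numpy as np

def apply_ear_filter(sound, impulse_response):
    """Filter a mono sound through a (hypothetical) ear impulse response.

    Convolving with a direction-specific impulse response mimics how the
    head and the folds of the outer ear reflect and alter incoming sound.
    """
    return np.convolve(sound, impulse_response)

# Hypothetical impulse responses for one direction: the right ear hears
# the sound slightly later and quieter than the left ear.
ir_left = np.array([1.0, 0.0, 0.3])        # direct path plus one reflection
ir_right = np.array([0.0, 0.0, 0.6, 0.2])  # delayed, attenuated version

rng = np.random.default_rng(0)
sound = rng.standard_normal(1000)          # stand-in for a training sound
left_channel = apply_ear_filter(sound, ir_left)
right_channel = apply_ear_filter(sound, ir_right)
```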
To test the models, the researchers placed a mannequin with microphones in its ears in an actual room, played sounds from different directions, and then fed those recordings into the models. The models performed very similarly to humans when asked to localize these sounds. “Although the model was trained in a virtual world, when we evaluated it, it could localize sounds in the real world,” Francl said.
The researchers are now applying the model to other aspects of audition, such as pitch perception and speech recognition, and to understanding other cognitive phenomena, such as the limits on what a person can pay attention to or remember. The research was funded by the National Science Foundation and the National Institute on Deafness and Other Communication Disorders.