AI News & Update

NVIDIA Develops Automatic Speech Recognition Model for Telugu

Telugu is one of the country’s most commonly spoken languages, with more than 75 million speakers in southern India. In the US, Telugu population was

05/12/2022

Tech & AI Blend

How This Startup Is Using Automatic Speech Recognition To Assist Sales Reps

Integrating AI to complement human intelligence in critical functions like sales and marketing is becoming a norm. Conversational AI has evolved from being a way

14/09/2020

AI Mysteries

Facebook Makes Advancements In Automatic Speech Recognition

Facebook AI Research (FAIR) recently trained a single acoustic model for multiple languages with the aim of improving automatic speech recognition (ASR) performance on low-resource

14/07/2020

AI Origins & Evolution

This Mozilla Project Can Be A Game Changer In The Automatic Speech Recognition Landscape

Mozilla is riding on its open-source initiatives and is continuously working on becoming a foundation for developers to innovate in machine learning landscape. The firm

09/12/2019

AI Origins & Evolution

Top 10 Automatic Speech Recognition Tools That’ll Relieve You Of The Keyboard

Speech recognition is the process of decoding human voices and is a part of machine learning. Organisations are implementing Automatic Speech Recognition (ASR) technology to

30/10/2019

AI Origins & Evolution

Automatic Speech Transcription And Speaker Recognition Simultaneously Using Apple AI

Last year, Apple witnessed several controversies regarding its speech recognition technology. To provide quality control in the company’s voice assistant Siri, Apple asked its contractors

08/02/2020

NVIDIA’s Parakeet Surpasses OpenAI's Whisper v3 in Speech Recognition

AI News & Update

NVIDIA’s Parakeet Surpasses OpenAI’s Whisper v3 in Speech Recognition

Under the CC BY 4.0 license, Parakeet distinguishes itself through its extensive training on a vast dataset of 64,000 hours of audio.

10/01/2024

AI News & Update

Now You Can Create Lifelike Avatars with AI Animation and Speech in NVIDIA ACE

NVIDIA’s Avatar Cloud Engine (ACE) update brings advanced animation and speech features to AI avatars, enabling realistic expressions and conversations.

05/12/2023

AI Mysteries

Meta’s 6 Premier Papers Presented at INTERSPEECH 2023

At the annual conference of the International Speech Communication Association, Meta presented more than 20 papers primarily focusing on NLP

22/08/2023

AI Mysteries

Google’s 6 Must-Read Papers Published at INTERSPEECH 2023

We’ve picked out the best of 20+ research papers Google will be presenting at the event

21/08/2023

AI News & Update

Google Unveils New Universal Speech Model, Performs Better than OpenAI Whisper

A critical first step towards supporting 1,000 languages.

08/03/2023

AI News & Update

Google USM Shatters Language Barriers with Multilingual Speech Recognition Model

The model’s encoder is pre-trained on a vast unlabeled multilingual dataset of 12 million hours that covers over 300 languages.

03/03/2023

Why speech separation is such a difficult problem to solve

AI Origins & Evolution

Why Speech Separation is Such a Difficult Problem to Solve

Researchers are making great progress in the field of speech separation and recognition using various methods, but the solution and the biggest challenge still is inferring sounds as separate sources of speech instead of a single speaker.

27/09/2022

AI News & Update

OpenAI Open-Sources ‘Whisper’ — a Multilingual Speech Recognition System

The company’s open-sourced models and inference code serve as a foundation for building useful applications and boost further research on robust speech processing.

26/09/2022

AI News & Update

Google rolls out visual interface for Speech-to-Text API in cloud

The Speech-to-text API is available in all Google Cloud regions and can be accessed by all GCP users.

08/02/2022

AI News & Update

Facebook AI Releases XLS-R, Self-Supervised Model For Speech Tasks

XLS-R substantively improves upon previous multilingual models by training on nearly ten times more public data in more than twice as many languages.

19/11/2021

AI Origins & Evolution

Facial Recognition Is Steadily Entering The Stage Of Large-Scale Deployments

Facial recognition technology is being leveraged way beyond unlocking our phones; it is aiming to identify every person on the planet, for good or bad.

19/10/2021

AI Origins & Evolution

Why Speech-to-Speech Translation Is So Important For Google

AI-assisted cross-lingual conversation is a challenging problem. To this end, Google introduced Translatotron in 2019.

07/10/2021

AI Origins & Evolution

Top Speech-To-Speech Translation Models & Tools In Market Today

Speech-to-speech translation can aid communication between people who speak different languages.

03/10/2021

AI Origins & Evolution

Google Upgrades Translatotron, Its Speech-to-Speech Translation Model

Google claims the revised version can successfully transfer voice even when the input speech consists of multiple speakers.

30/09/2021

AI Origins & Evolution

Graph Transformer Network: A New Framework For Language & Speech Processing

Last year, Facebook open-sourced graph transformer networks (GTN), a framework for automatic differentiation with a weighted finite-state transducer graph (WFSTs). To put things in perspective,

08/07/2021

AI Mysteries

Sound Pitch Recognition Using SPICE

Article is about Pitch Recognition, aka Pitch Estimation.

23/06/2021

AI Origins & Evolution

HuBERT: Facebook’s Latest Approach To Self-Supervised Speech Representation Learning

Facebook AI Research (FAIR) has published a research paper introducing Hidden Unit BERT (HuBERT), their latest approach for learning self-supervised speech representations. According to FAIR,

20/06/2021

Tech & AI Blend

How VSpeech.ai’s ML Model Understands Mixed Language Inputs Accurately

Ahmedabad-based VSpeech.ai was founded in 2015. The startup sensed an opportunity while working with Interactive Voice Response (IVR) call centres, and soon pivoted to IVR

03/06/2021

Tech & AI Blend

How This Startup Is Using Deep Learning To Decipher Speech From Lip Movements

Recent advances in computer vision, pattern recognition, and signal processing have led to a budding curiosity in automating the challenging task of lip reading. Visual

25/12/2020

AI Mysteries

Guide To LibriSpeech Datasets With Implementation in PyTorch and TensorFlow

The Librispeech dataset is SLR12 which is the audio recording of reading English speech.

11/12/2020

AI Mysteries

Guide To VoxCeleb Datasets For Audio-Visual of Human Speech

Guide To VoxCeleb Datasets For Visual-Audio of Human Speech.

08/12/2020

AI Mysteries

This New AI Model Can Convert Silent Words Into Audible Speech

Recently, researchers from UC Berkeley introduced a new AI model that can convert silently mouthed words to audible speech. The task of digitally voicing silent

27/11/2020

Gnani.ai Launches New Integrated Speech Solution For The Ministry of Defence

AI News & Update

Gnani.ai Launches New Integrated Speech Solution For The Ministry of Defence

Gnani.ai, a conversational AI startup, has announced the launch of a new integrated speech recognition based solution for the Indian Armed Forces. According to the

24/11/2020

AI Mysteries

Understanding Speech: Moving Beyond ASRs

Deep Learning DevCon 2020 or DLDC 2020 is another conference of the year that is hosted in partnership with Analytics India Magazine. Scheduled for 29th

31/10/2020

Results

Search Results for: automatic speech recognition

Contact Us

Subscribe to our newsletter

World's Biggest Media & Analyst firm specializing in AI

Advertise with us

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

Branded Content

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

Corporate Upskilling

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

Hackathons

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Talent Assessment

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

Research & Advisory

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Conferences & Events

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

Subscribe to Our Newsletter