
Voice Interoperability Movement Gathers Momentum, Thanks To Linux Foundation


“Future of voice assistance will be multi-assistant, multi-platform, multi-device, multimodal, and multi-use.”

Imagine using the same device to command Google Assistant to book tickets, Alexa to play a song and Cortana to schedule a meeting on Outlook. The Voice Interoperability Initiative was launched by Amazon in 2019 with support from 30 companies (now 80), including global brands like Baidu, BMW, Bose, Microsoft, Salesforce and Sony. Today, companies like Facebook and Intel are also part of this initiative. For example, Facebook’s video calling service, Portal, supports Alexa. Qualcomm, another member of Amazon’s interoperability initiative, already allows its chipsets to run multiple wake word engines simultaneously on a single device.

With the use of chat assistants on the rise, these initiatives couldn’t have come at a better time. Now, the Linux Foundation wants to chip in. The organisation thinks there is a “trust gap” raising critical questions of privacy, data security, ease of use, brand protection, interoperability, and equal and unbiased access for all.

To establish voice interoperability and standardisation and to win the user’s trust, the foundation has recently launched the Open Voice Network (OVN). The Open Voice Network is an independently funded and governed non-profit industry association which operates as a directed fund of the Linux Foundation. The founding members of OVN include Target, Schwarz Gruppe, Wegmans Food Markets, Microsoft, Veritone, and Deutsche Telekom.

The objective of voice interoperability initiatives is to provide customers with choice and flexibility through multiple, interoperable voice services. The Open Voice Network, in particular, is guided by four values:

  • Worthy of user trust
  • Enable user, ecosystem, and architectural choice
  • Inclusive and accessible
  • Open in software and hardware, serving as a foundation for commercial differentiation

Advantages of Voice Interoperability

  • Companies or designers will be incentivised to work with one another to ensure customers have the freedom to choose multiple voice services on a single device.
  • Consistent designs simplify requirements for end users and reduce development effort.
  • Developers and device makers will be committed to protecting the security and privacy of customers interacting with multiple voice services.

The OVN community claims voice will soon become a common interface for every digital device. But experts have started to wonder if automated speech recognition technology might fall short of its potential for acceptance and enterprise value. According to Mike Dolan, senior VP at the Linux Foundation, voice is expected to soon become a primary mode of communication with the digital world, connecting millions of users through billions of sites, smart environments and AI bots. It is already being used beyond smart speakers, in automobiles, smartphones and home electronics. “Impact of voice on industries including commerce, transportation, healthcare and entertainment is staggering and we’re excited to bring it under the open governance model of the Linux Foundation to grow the community and pave a way forward,” said Dolan.

“To speak is human, and voice is rapidly becoming the primary interaction modality between users and their devices and services at home and work.”

Ali Dalloul, GM, Microsoft Azure

Voice assistants depend upon technologies like Automatic Speech Recognition (ASR), Natural Language Processing (NLP) and Machine Learning (ML), and the aim is for more devices and services built on them to interact openly and safely with one another. In 2019, Amazon even proposed a multi-agent design that would allow users to interact with multiple assistants through multiple wake words. The Linux Foundation believes consumers and businesses can tap into technologies like conversational AI for customer service, commerce and more. According to the press release, the Open Voice Network will:

  • Commit to research and provide recommendations toward standardisation of attributes to enable user choice and trust.
  • Create industry-level awareness through the identification and sharing of best practices for conversational AI.
  • Collaborate with industries on relevant regulatory and legislative issues, including those of data privacy.

Going forward, the Linux Foundation aims to market its open governance model to a wider community to accelerate conversational AI standards rollout and adoption.


Ram Sagar

I have a master's degree in Robotics and I write about machine learning advancements.