“Future of voice assistance will be multi-assistant, multi-platform, multi-device, multimodal, and multi-use.”
Imagine using the same device to ask Google Assistant to book tickets, Alexa to play a song and Cortana to schedule a meeting on Outlook. Amazon launched the Voice Interoperability Initiative in 2019 with support from 30 companies (now 80), including global brands like Baidu, BMW, Bose, Microsoft, Salesforce and Sony. Today, companies like Facebook and Intel are also part of this initiative. For example, Facebook’s video calling device, Portal, also supports Alexa. Qualcomm, also part of Amazon’s interoperability initiative, already offers chipsets that allow multiple wake word engines to run simultaneously on a single device.
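To make the idea of several assistants sharing one device concrete, here is a minimal sketch of wake word routing. It is an illustration only, not the design used by Amazon, Qualcomm or any member of the initiative: real wake word engines work on audio, while this stand-in works on already-transcribed text, and the names `Assistant` and `WakeWordRouter` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Assistant:
    """A hypothetical voice service that turns a command into a reply."""
    name: str
    handler: Callable[[str], str]


class WakeWordRouter:
    """Hands an utterance to the assistant whose wake word starts it."""

    def __init__(self) -> None:
        self._assistants: Dict[str, Assistant] = {}

    def register(self, wake_word: str, assistant: Assistant) -> None:
        # Wake words are matched case-insensitively.
        self._assistants[wake_word.lower()] = assistant

    def route(self, utterance: str) -> Optional[str]:
        text = utterance.strip()
        lowered = text.lower()
        # Try longer wake words first so "hey google" wins over "hey".
        for wake_word in sorted(self._assistants, key=len, reverse=True):
            if lowered.startswith(wake_word):
                # Strip the wake word plus any separating comma or space.
                command = text[len(wake_word):].lstrip(" ,")
                return self._assistants[wake_word].handler(command)
        # No registered wake word was spoken, so no assistant is invoked.
        return None


router = WakeWordRouter()
router.register("alexa", Assistant("Alexa", lambda cmd: f"[Alexa] {cmd}"))
router.register(
    "hey google",
    Assistant("Google Assistant", lambda cmd: f"[Google Assistant] {cmd}"),
)

print(router.route("Alexa, play a song"))
print(router.route("Hey Google, book tickets"))
```

In this sketch each assistant only ever receives utterances that begin with its own wake word, which is one simple way a shared device could keep voice services separate.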
With the use of chat assistants on the rise, these initiatives couldn’t have come at a better time. Now, the Linux Foundation wants to chip in. The organisation thinks there is a “trust gap” raising critical questions of privacy, data security, ease of use, brand protection, interoperability, and equal and unbiased access for all.
To establish voice interoperability and standardisation and to win the user’s trust, the foundation has recently launched the Open Voice Network (OVN). The Open Voice Network is an independently funded and governed non-profit industry association which operates as a directed fund of the Linux Foundation. The founding members of OVN include Target, Schwarz Gruppe, Wegmans Food Markets, Microsoft, Veritone, and Deutsche Telekom.
The objective of voice interoperability initiatives is to provide customers with choice and flexibility through multiple, interoperable voice services. The Open Voice Network, in particular, is guided by four values:
- Worthy of user trust
- Enable user, ecosystem, and architectural choice
- Inclusive and accessible
- Open in software and hardware, serving as a foundation for commercial differentiation
Advantages of Voice Interoperability
- Companies or designers will be incentivised to work with one another to ensure customers have the freedom to choose multiple voice services on a single device.
- Design consistency across voice services simplifies requirements on the end-user side and reduces development effort.
- Developers and device makers will be committed to protecting the security and privacy of customers interacting with multiple voice services.
The OVN community claims voice will soon become a common interface for every digital device. But experts have started to wonder if automated speech recognition technology might fall short of its potential for acceptance and enterprise value. According to Mike Dolan, senior VP at the Linux Foundation, voice is soon expected to become a primary mode of communication with the digital world by connecting millions of users through billions of sites, smart environments and AI bots. It is already used beyond smart speakers, in automobiles, smartphones and home electronics. “Impact of voice on industries including commerce, transportation, healthcare and entertainment is staggering and we’re excited to bring it under the open governance model of the Linux Foundation to grow the community and pave a way forward,” said Dolan.
“To speak is human, and voice is rapidly becoming the primary interaction modality between users and their devices and services at home and work.” - Ali Dalloul, GM, Microsoft Azure
Since voice assistants depend on shared technologies like Automatic Speech Recognition (ASR), Natural Language Processing (NLP) and Machine Learning (ML), standardisation could let more devices and services interact openly and safely with one another. In 2019, Amazon even proposed a multi-agent design that would allow users to interact with multiple assistants through multiple wake words. The Linux Foundation believes consumers and businesses can tap into technologies like conversational AI for customer service, commerce and more. According to the press release, the Open Voice Network will:
- Commit to research and provide recommendations toward standardisation of attributes to enable user choice and trust.
- Create industry-level awareness by identifying and sharing best practices for conversational AI.
- Collaborate with industries on relevant regulatory and legislative issues, including those of data privacy.
Going forward, the Linux Foundation aims to market its open governance model to a wider community to accelerate conversational AI standards rollout and adoption.