MITB Banner

The Tech Behind Axis Bank’s Multilingual Voice Bot

Share

The Tech Behind Axis Bank’s Multilingual Voice Bot

Illustration by The Tech Behind Axis Bank’s Multilingual Voice Bot

Understanding the ever-changing customer aspirations and requirements will always be a challenge. However, the larger issue at hand lies in to bring the customers at par with the adoption of digital technology. The COVID era has emulated a significant impact at different levels of the organisation across businesses. The scenario at Axis Bank was no different as the status quo, and modus operandi across the bank was challenged during the initial phase of lockdown. 

The bank has observed extreme spikes in volumes, increment in complaints and escalations transpired by customer anxiety, which was a repercussion of limited services to customers during the lockdown phase. This reflected in Axis Bank’s customer experience index and the cross-functional service levels of the organisation. And thus, it became critical for the bank to re-define their role in the life of their customers by elevating their digital banking experiences to new domains of customer service. 

Also Read: What Does It Take To Design The Next Generation Chatbots

AI-Powered Conversational Voice BOT To The Rescue

In order to address the increasing number of queries from customers more effectively and promptly amid the pandemic, Axis Bank decided to partner with an AI-based SaaS voice automation platform — Vernacular.ai, to optimise voice AI solutions and automate their customer interactions via an intelligent human-like dialogue.

The company decided to create a next-gen multilingual voice bot — AXAA, for Axis Bank that can converse in English, Hindi as well as Hinglish. This bot leverages cutting edge automated speech recognition and natural language understanding technology boosted by AI-powered business algorithms to help accelerate the engagement strategy of the bank.

Explaining further, Sourabh Gupta, the co-founder and CEO of Vernacular.ai stated that the company uses all variants of standard sentence semantics techniques for the bot since the compute and performance requirements for many of the deployments can vary. For instance — a heavy neural sentence embedder for less loaded cases, but a much faster and simpler system for the opposite situation.

They, in turn, designed AXAA to provide an augmented customer experience which can automate the contact centre operations and is capable of handling lacs of customer queries and requests per day with the ability of faster scalability. This automated voice assistant has been deployed to have an in-depth understanding of their queries, context and the intent of the call. 

“Additionally, since the production environment will not give an ideal scenario, we also build custom models to work with the output of speech recognition systems which include real-life errors, such as, general noise and speech disfluencies,” said Gupta.

Also Read: How This NLP-Driven Literature Search Engine Can Help With COVID Information

The Tech Behind

Vernacular.ai started with an internal step called data-farming, which plugs into different sources of the conversational data repositories, and involves on-premise data sourcing to utilise previous conversations. To facilitate the process, the company gets the data points annotated by an active-learning based system that picks data points based on their performance.

“Model training process requires an overall system training which involves training the language-model and word embeddings for the business use-cases,” said Gupta. 

The bot used on-premise data sourcing, along with querying based on heuristics of model performance; “we pick data points that performed poorly,” said Gupta. “We used LM tuning, ASR Release, neural network, and resolution metrics for word embeddings.”

Furthermore, Vernacular.ai used state-of-the-art, deep neural networks trained on thousands of hours of acoustic data for text to speech recognition. And, to identify the pauses in between people communicating, the company employed recurrent models which worked on real-time audios with minimal latency. Alongside, the idiolect perception layer, which involves identifying various stylistic components from speech signals of utterances, includes identifying speaker characteristics like gender, age, location, etc. 

Post that, the modelling usage patterns in all the languages covers conversational nuances and semantic equivalence of words and phrases. The company also involved text to speech models where the systems are trained to synthesise audio which replicate nuances of intelligent human-like dialogue. “To discover novel user patterns in deployed systems, we used high recall sentinel models which helped in understanding new behaviour and handling concept drift during the lifetime of the product,” said Gupta.

Also Read: How AI-Powered Analytics & Monitoring System Helps In Reducing Water Losses

Benefits of AXAA In Customer Retention

Talking about the benefits, Ratan Kesh, the EVP and Head – Retail Operations and Service of Axis Bank said, “The launch of AXAA, in July’20, is in line with the bank’s philosophy of building a sharper customer focus and embark on a journey of constant innovation and enhancement.”

“With the capability of operating like a humanoid, AXAA has the power to change the paradigm of customer experience,” said Kesh.

Till date, the AI Voice BOT has catered to more than 2.23 million customers at an industry best success rate of an average of 85% and above. There were also approximately 65%-70% of customers who were being touched by the BOT and attempted to connect with the contact centre. In case, the bot — AXAA is unable to service or cater to a particular customer query, it has been designed to direct the call immediately to one of the human experts, minimising the navigation time on the conventional IVR. “AXAA has been working side by side with expert customer service officers in delivering a consistent and superior experience to customers,” said Kesh.

Taking digital banking to the next level, “moving forward to the next phase, we are working on expanding the language capabilities to 10 Indian languages and add 27 more self-service options,” concluded Gupta.

Share
Picture of Sejuti Das

Sejuti Das

Sejuti currently works as Associate Editor at Analytics India Magazine (AIM). Reach out at sejuti.das@analyticsindiamag.com
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India