AIM Banners_978 x 90

Microsoft’s DeepSinger Generates Voices That Can Sing In English and Chinese

A team of researchers from Microsoft and Zhejiang University recently developed a multi-lingual multi-singer singing voice synthesis (SVS) system known as DeepSinger. The system is built from scratch using singing training data mined from music websites. With the advancement of deep neural networks, Singing Voice Synthesis (SVS) generates singing voices from lyrics, which has attracted much traction in the field of research and industrial community in recent years. This technique is similar to the text-to-speech method that enables machines to speak. Traditional SVS mostly relies on human recording and annotations and requires a large number of high-quality singing recordings as training data as well as strict data alignments between lyrics and singing audio for accurate singing mode
Subscribe or log in to Continue Reading

Uncompromising innovation. Timeless influence. Your support powers the future of independent tech journalism.

Already have an account? Sign In.

📣 Want to advertise in AIM? Book here

Picture of Ambika Choudhury
Ambika Choudhury
A Technical Journalist who loves writing about Machine Learning and Artificial Intelligence. A lover of music, writing and learning something out of the box.
Related Posts
AIM Print and TV
Don’t Miss the Next Big Shift in AI.
Get one year subscription for ₹5999
Download the easiest way to
stay informed