Meta Open Sources AudioCraft: Generative AI Music Studio

The system comprises three models: MusicGen, AudioGen, and EnCodec.

Share

Published on August 2, 2023

by Siddharth Jindal

Listen to this story

Meta today announced that it is open sourcing AudioCraft, a new family of generative AI models built for generating high-quality, realistic audio & music from text.

We're publicly releasing AudioCraft for research purposes and to further understanding of the technology. Responsible innovation can’t happen in isolation. Opening up our research and resulting models helps ensure that everyone has equal access.

GitHub ⬇️https://t.co/hu1004mxX4
— Meta AI (@MetaAI) August 2, 2023

AudioCraft introduces a unified code base that encompasses music, sound, compression, and generation functionalities, providing a comprehensive solution in one place. The system comprises three models: MusicGen, AudioGen, and EnCodec.

The new release of AudioCraft is an improvement over the previous MusicGen version. It includes a better EnCodec decoder that allows for higher-quality music generation with fewer glitches. Moreover, it now has pre-trained AudioGen models, enabling the system to create environmental sounds and sound effects like a dog barking, cars honking, or footsteps on a wooden floor.

This release is exciting as it simplifies building on the top of the state of the art in audiogeneration. People can now build things like sound generators and compression algorithms with the same code base.

Meta in its blog stated that AudioCraft represents a significant advancement in generative AI research. They believe that the straightforward approach they developed for audio generation will have a profound influence on future technologies and how we interact with them.

Meta expressed its excitement over the creative potential of people using AudioCraft and looks forward to seeing what they will create with it.

Access the code here: https://bit.ly/3QnMya3

Access all our open Survey & Awards Nomination forms in one place