Meta AI Releases EnCodec, a Neural Network to Reconstruct Input Audio Signals

October 26, 2022

Meta has released EnCodec, a neural network trained to compress audio into much smaller files and then reconstruct the original signal from them. Meta researchers claim to have achieved state-of-the-art results in low-bit-rate audio hypercompression.

Encodec, our AI-powered compression neural net, has 3 parts:
1️⃣ Encoder: transforms raw data into higher dimensional + lower frame rate
2️⃣ Quantizer: compresses to target size, equiv. to mp3
3️⃣ Decoder: turns compressed signal back to waveform, most similar to the original

— Meta AI (@MetaAI) October 25, 2022

EnCodec uses a streaming encoder-decoder architecture with sequential modeling. Convolutional encoder-decoder architectures of this kind have proven effective in many audio tasks, including audio enhancement, audio bandwidth extension, and audio separation.

EnCodec has three main components: an Encoder, a Quantizer, and a Decoder. The Encoder network (E) transforms the input audio into a latent representation (z) with a higher dimension and a lower frame rate. The Quantizer (Q) then compresses that representation to the desired target size, comparable to the size of an MP3, and outputs z_q. Finally, the Decoder network (G) transforms the compressed representation back into a waveform (x̂) that is as close as possible to the original.
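The flow from E to Q to G can be illustrated with a toy sketch in plain Python. This is not EnCodec's implementation: the real Encoder and Decoder are trained neural networks, whereas here the "encoder" simply groups samples into frames, the "quantizer" is a nearest-neighbor lookup, and the codebook is a hypothetical hand-made one. The function names, `hop`, and `codebook` are assumptions made for illustration only.

```python
import math

def encode(x, hop):
    """Toy E: group `hop` samples into one latent vector (fewer frames, more dimensions)."""
    return [x[i:i + hop] for i in range(0, len(x) - hop + 1, hop)]

def quantize(z, codebook):
    """Toy Q: replace each latent vector with the index of its nearest codebook entry."""
    def dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [min(range(len(codebook)), key=lambda k: dist(frame, codebook[k]))
            for frame in z]

def decode(codes, codebook):
    """Toy G: look up each code and concatenate the vectors back into a waveform."""
    return [sample for k in codes for sample in codebook[k]]

# Input signal x: 64 samples of a sine wave with a period of 16 samples.
x = [math.sin(2 * math.pi * n / 16) for n in range(64)]
hop = 4

# Hypothetical codebook: the four distinct frames that make up one sine period.
codebook = [x[i:i + hop] for i in range(0, 16, hop)]

z = encode(x, hop)               # latent representation: 16 frames of 4 values
codes = quantize(z, codebook)    # compressed form: one small integer per frame
x_hat = decode(codes, codebook)  # reconstructed waveform

max_error = max(abs(a - b) for a, b in zip(x, x_hat))
print(len(x), "samples ->", len(codes), "codes, max error", max_error)
```

Only the integer codes would need to be stored or transmitted, which is where the size reduction comes from. The reconstruction is nearly exact here only because the codebook was built from the signal itself; a real codec learns its codebooks from training data and accepts some reconstruction error.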

Meta researchers claim to have achieved roughly a 10x compression rate compared with MP3 at 64 kbps, without compromising audio quality. According to Meta, this is also the first time 48 kHz stereo audio has been used as input for this kind of codec.

Meta has released a research paper describing the technical details and architecture behind EnCodec. The paper also reports that a small Transformer model can be used on top of EnCodec to compress its output further, reducing bandwidth by up to 40% without additional quality loss. To help developers and others with a technical background learn more about EnCodec, Meta has also released the code.
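To put these figures in perspective, the short calculation below works out the bitrates they imply. It is only back-of-the-envelope arithmetic on the numbers quoted in this article. The 16-bit sample depth for uncompressed audio is an assumption, as is treating the Transformer saving as a flat 40% applied on top of the 10x figure; all variable names are illustrative.

```python
def pcm_kbps(sample_rate_hz, channels, bits_per_sample):
    """Bitrate of uncompressed PCM audio in kilobits per second."""
    return sample_rate_hz * channels * bits_per_sample / 1000

# Figures quoted in the article.
mp3_kbps = 64.0
compression_vs_mp3 = 10        # "10x compression rate vs MP3"
transformer_saving = 0.40      # "by up to 40%", taken at its maximum

# Assumption: the uncompressed source is 16-bit PCM, 48 kHz, stereo.
raw_kbps = pcm_kbps(48_000, 2, 16)

codec_kbps = mp3_kbps / compression_vs_mp3
codec_with_transformer_kbps = codec_kbps * (1 - transformer_saving)

print(f"Uncompressed 48 kHz stereo: {raw_kbps:.0f} kbps")
print(f"MP3 reference:              {mp3_kbps:.1f} kbps")
print(f"10x smaller than MP3:       {codec_kbps:.2f} kbps")
print(f"With a further 40% saving:  {codec_with_transformer_kbps:.2f} kbps")
```

Under these assumptions, a tenth of the MP3 reference bitrate is about 6.4 kbps, a small fraction of the uncompressed rate.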
