Friday, April 12, 2024
HomeNewsNVIDIA Releases Maxine to Deliver Breakthrough Audio and Video Quality at Scale

NVIDIA Releases Maxine to Deliver Breakthrough Audio and Video Quality at Scale

The tech giant releases a cloud-native architecture for more qualitative communication.

NVIDIA releases Maxine, a suite of GPU-driven software development kits (SDKs) to deliver breakthrough audio and video quality. Maxine enables clear communications via its cloud-native microservices for augmented-reality effects and audio-video enhancement. 

With the early-access release of Maxine’s audio effects, the company said that Maxine would be re-architected for cloud-native microservices. Additionally, new SDK capabilities, including Speaker Focus and Face Expression Estimation, were announced, along with the availability of Eye Contact to all users. Updated versions of existing SDK functionalities are also included in NVIDIA Maxine.

Maxine provides three updated GPU-accelerated SDKs for audio, video, and AR effects that revolutionize real-time communications with AI. A new feature called Speaker Focus isolates the audio tracks of foreground and background speakers to make each voice more audible. Lastly, the Audio Super Resolution SDK function has also received an upgrade with better quality.

Read More: New NVIDIA DGX System Software and Infrastructure Solutions Supercharge Enterprise AI

The video effects SDK uses a regular webcam to produce AI-based video effects. Enhancements to temporal stability have been made to the Virtual Background function, which divides a person’s profile into sections and uses AI-powered background removal, replacement, or blur.

Additionally, the AR SDK offers typical web camera feed-based, real-time 3D face tracking and body pose estimation driven by AI.

Other cloud-native microservices offered by Maxine will enable developers to create real-time AI applications. These services may be autonomously managed and deployed on the cloud, speeding up implementation time. Some of these microservices are:

  • Background Noise Removal
  • Room Echo Removal
  • Audio Super Resolution
  • Acoustic Echo Cancellation

Maxine is a part of the NVIDIA Omniverse Avatar Cloud Engine, a set of cloud-based AI models and services that developers may use to create, personalize, and use interactive avatars. You can refer to the GTC keynote for more information. 

Subscribe to our newsletter

Subscribe and never miss out on such trending AI-related articles.

We will never sell your data

Join our WhatsApp Channel and Discord Server to be a part of an engaging community.

Disha Chopra
Disha Chopra
Disha Chopra is a content enthusiast! She is an Economics graduate pursuing her PG in the same field along with Data Sciences. Disha enjoys the ever-demanding world of content and the flexibility that comes with it. She can be found listening to music or simply asleep when not working!


Please enter your comment!
Please enter your name here

Most Popular