Monday, June 17, 2024

OpenAI Expands ChatGPT with Voice and Image-Based Abilities

The development marks a notable evolution in the generative AI field, as OpenAI integrates voice-based assistant features with its powerful large language models.

OpenAI has announced significant enhancements to its popular generative AI assistant, ChatGPT, expanding its capabilities beyond text-based interactions. ChatGPT, known for generating essays, poems, and summaries from text prompts, is now set to support voice conversations and image-based searches.

This development marks a notable evolution in the generative AI field, as OpenAI integrates voice-based assistant features with its powerful large language models (LLMs). Users can now hold voice conversations with ChatGPT, asking it questions aloud or making spontaneous spoken requests, such as asking for a bedtime story.

The voice functionality is powered by a new text-to-speech model capable of producing human-like voices from text inputs. OpenAI collaborated with established voice actors to create five distinct voices and utilized the open-source Whisper speech recognition system to transcribe spoken words into text.

In addition to voice capabilities, ChatGPT users can now submit image-based queries. For example, they can upload an image and ask ChatGPT to explain it or to provide instructions related to it.

These new features will roll out to paying Plus and Enterprise subscribers over the next two weeks. To activate voice features, users must navigate to the app’s “settings” menu, select “new features,” and opt in to voice conversations. They can then choose their preferred voice by tapping the headphone button in the top-right corner.

Initially, voice capabilities will be available in the ChatGPT Android and iOS apps on an opt-in beta basis, while image search will be accessible by default on all platforms. This expansion signifies OpenAI’s commitment to enhancing user interactions with ChatGPT and making it a more versatile and interactive AI assistant.

Sahil Pawar
I am a graduate with a bachelor's degree in statistics, mathematics, and physics. I have been working as a content writer for almost 3 years and have written for a plethora of domains. Besides, I have a vested interest in fashion and music.
