OpenAI has announced a major update to its ChatGPT, which will allow voice conversations and image interactions.
The voice feature, which can perform tasks like setting alarms, narrating bedtime stories, and providing information from the internet, relies on a new text-to-speech model that generates life-like audio from text and brief voice samples. It uses the Whisper speech recognition system to transcribe spoken words into text, generate responses, and convert them into spoken language for users.
The new features will be available to subscribers of the Plus and Enterprise plans over the next two weeks.
Analyst QuickTake: OpenAI is expanding the capabilities of ChatGPT by introducing voice and image-based interactions. Users will now be able to engage in voice conversations with ChatGPT, making it more interactive and versatile. This move also positions ChatGPT in the broader context of GenAI, where competition is heating up among tech giants like Amazon, Google, Meta (formerly Facebook), and Microsoft.
By using this site, you agree to allow SPEEDA Edge and our partners to use cookies for analytics and personalization. Visit our privacy policy for more information about our data collection practices.