Earlier this year, when ChatGPT maker OpenAI unveiled its latest large language model – GPT-4o, the US-based company also highlighted its multimodal capabilities and focused on the AI model’s ability to talk with humans in real time.
Today, in a reply to a post on X (previously Twitter), OpenAI CEO Sam Altman said that the much anticipated GPT-4o-powered Voice Mode will be available in alpha next week for paid subscribers. While ChatGPT currently has a Voice Mode, it is not that helpful because of delayed responses. Compared to the current version, which has an average latency of 2.8 seconds and 5.4 seconds for GPT 3.5 and GPT 4 respectively, GPT-4o has no noticeable delay.
In a demo shared by OpenAI, ChatGPT could be seen teaching users Portuguese, engaging in conversation with more than one person and responding to user queries with emotions and non-verbal cues. The GPT-4o-powered Voice Mode, which was to be made available to users to a small group of users last month was delayed citing the AI’s ability to “detect and refuse certain content.”
However, the new Voice Mode won’t be available to all ChatGPT Plus users until fall. OpenAI last week introduced GPT-4o mini, a streamlined version of its latest large language model which the company says is more capable than GPT-3.5 Turbo. Recently, OpenAI also announced a new AI-powered search engine called SearchGPT, which will eventually be integrated with ChatGPT.
© IE Online Media Services Pvt Ltd
First uploaded on: 26-07-2024 at 12:33 IST
Source: Sam Altman confirms ChatGPT Advanced Voice mode will be available to Plus users next week