OpenAI is rolling out a new advanced voice mode for ChatGPT, offering more natural and engaging audio responses to a select group of subscribers.
Points
- OpenAI is introducing an advanced voice mode for ChatGPT to ChatGPT Plus subscribers.
- The feature offers various voice options for more natural interactions.
- The new mode aims to enhance realism and emotional engagement.
- Legal and ethical concerns have been addressed to ensure ethical use of the voice mode.
- OpenAI plans to make the feature available to all ChatGPT Plus users by fall 2024.
OpenAI is beginning to roll out its new advanced voice mode for ChatGPT to a limited group of ChatGPT Plus subscribers. This feature offers a variety of voice options, providing more natural and engaging audio responses. Initially revealed at the GPT-4o launch event in May, this advanced voice mode has garnered significant attention for its sophisticated capabilities.
However, the feature drew criticism because its voice sounded remarkably similar to Scarlett Johansson’s, which raised ethical and legal concerns. At OpenAI’s event, the new voice mode showed notable advances over the previous version: OpenAI employees demonstrated how the chatbot could respond to interruptions in real time and change course as needed.
Despite these improvements, the voice used onstage, known as “Sky,” was criticized for resembling Johansson’s performance as an AI in the film Her. Johansson subsequently wrote to OpenAI asking for details about how the voice was created.
An alpha release was originally slated for late June but was delayed by one month. According to OpenAI, the extra time was needed to meet its safety requirements and to improve the model’s ability to detect and refuse certain kinds of content.
According to OpenAI spokesperson Taya Christianson, the voice model was tested extensively by more than a hundred outside specialists known as “red teamers,” who probe a technology for weaknesses that could be exploited. The delay was part of OpenAI’s response to heightened scrutiny of its safety practices.
OpenAI has added filters to the new voice mode to block requests to generate music or other copyrighted audio. In response to the criticism over the Johansson-like voice, OpenAI limited the mode to four preset voices created with voice actors. Spokesperson Taya Christianson said that ChatGPT will not impersonate other people’s voices and that any output differing from these presets will be blocked to prevent misuse.
How Will ChatGPT Evolve?
The new voice mode is designed to convey emotions such as sadness, excitement, and fear, and it can even enable ChatGPT to sing. User feedback is still pending, but these enhancements are intended to make interactions more engaging and emotionally resonant. OpenAI spokesperson Lindsay McCallum said that ChatGPT cannot replicate the voices of specific individuals or celebrities and that outputs differing from the preset voices are blocked. This measure is intended to prevent misuse, such as the creation of deepfakes or other deceptive content.
OpenAI plans to make the new advanced voice mode available to all ChatGPT Plus users by fall 2024. The rollout aims to give users a more interactive and versatile experience while maintaining high safety and ethical standards.
Analysis
- Enhanced Interaction: The advanced voice mode aims to provide a more engaging and realistic interaction with ChatGPT, improving user experience.
- Ethical Safeguards: OpenAI’s measures to address ethical and legal concerns demonstrate a commitment to responsible AI development.
- User Engagement: By introducing emotionally resonant voice interactions, OpenAI is likely to attract more users and enhance the appeal of ChatGPT Plus subscriptions.