OpenAI pauses a ChatGPT voice after some say it sounds like Scarlett Johansson

20 May 2024

24081 posts

OpenAI says it plans to halt the use of one of its ChatGPT voices after some users said it sounded like Scarlett Johansson, who famously voiced a fictional, and at the time futuristic, AI assistant in the 2013 film “Her.”

Quick Read

OpenAI Pauses ChatGPT Voice: OpenAI has decided to temporarily discontinue the use of “Sky,” one of ChatGPT’s voice options, after users pointed out that it sounded similar to actress Scarlett Johansson.
User Feedback Prompts Action: The decision came following feedback and queries regarding the selection process of its AI-generated voices, particularly Sky.
Not a Deliberate Mimicry: OpenAI clarified that Sky’s voice was not intended to mimic Johansson’s voice but was provided by another professional actress using her natural voice, whose identity remains undisclosed for privacy reasons.
Voice Mode Expansion: OpenAI had expanded its voice capabilities for ChatGPT in September, making the feature available to all mobile app users by November, enhancing user interaction with the AI.
Advanced AI Features: Recently, OpenAI introduced an updated version, GPT-4o, enhancing the AI’s ability to mimic human cadences and gauge users’ moods, sparking further discussions about AI-human interaction complexities.
Cultural Impact and Criticisms: The advanced capabilities of GPT-4o have drawn comparisons to the film “Her,” and raised concerns about gender dynamics in voice assistant programming, highlighted by critiques of AI reinforcing gender stereotypes.

The Associated Press has the story:

OpenAI pauses a ChatGPT voice after some say it sounds like Scarlett Johansson

Newslooks- NEW YORK (AP) —

In a post on the social media platform X Monday, OpenAI said it is “working to pause” Sky — the name of one of five voices that ChatGPT users can chose to speak with. The company said it had “heard questions” about how it selects the lifelike audio options available for its flagship artificial intelligence chatbot, particularly Sky, and wanted to address them.

OpenAI was also quick to debunk the internet’s theories about Johansson in an accompanying blog post detailing how ChatGPT’s voices were chosen.

FILE – Scarlett Johansson poses for photographers at the photo call for the film “Asteroid City” at the 76th international film festival, Cannes, southern France, May 24, 2023. OpenAI plans to halt the use of one of its ChatGPT voices after some drew similarities to Johansson, who famously portrayed a fictional AI assistant in the (perhaps no longer so futuristic) film “Her.” (Photo by Joel C Ryan/Invision/AP, File)

“We believe that AI voices should not deliberately mimic a celebrity’s distinctive voice — Sky’s voice is not an imitation of Scarlett Johansson but belongs to a different professional actress using her own natural speaking voice,” the company wrote. It said it could not share the name of its voice actors for privacy reasons.

San Francisco-based OpenAI did not comment further on why it still decided to pause Sky’s use.

OpenAI first rolled out voice capabilities for ChatGPT, which included the five different voices, in September, allowing users to engage in back-to-forth conversation with the AI assistant. “Voice Mode” was originally just available to paid subscribers, but in November, OpenAI announced that the feature would become free for all users with the mobile app.

And ChatGPT’s interactions are becoming more and more sophisticated. Last week, OpenAI said the latest update to its generative AI model can mimic human cadences in its verbal responses and can even try to detect people’s moods.

FILE – The OpenAI logo is seen on a mobile phone in front of a computer screen displaying output from ChatGPT, March 21, 2023, in Boston. OpenAI plans to halt the use of one of its ChatGPT voices after some drew similarities to actor Scarlett Johansson, who famously portrayed a fictional AI assistant in the (perhaps no longer so futuristic) film “Her.” (AP Photo/Michael Dwyer, File)

OpenAI says the newest model, dubbed GPT-4o, works faster than previous versions and can reason across text, audio and video in real time. In a demonstration during OpenAI’s May 13 announcement, the AI bot chatted in real time, adding emotion — specifically “more drama” — to its voice as requested. It also took a stab at extrapolating a person’s emotional state by looking at a selfie video of their face, aided in language translations, step-by-step math problems and more.

GPT-4o, short for “omni,” isn’t widely available yet. It will progressively make its way to select users in the coming weeks and months. The model’s text and image capabilities have already begun rolling out, and is set to reach even some of those that use ChatGPT’s free tier — but the new voice mode will just be available for paid subscribers of ChatGPT Plus.

While most have yet to get their hands on these newly announced features, the capabilities have conjured up even more comparisons to the Spike Jonze’s dystopian romance “Her,” which follows an introverted man (Joaquin Phoenix) who falls in love with an AI-operating system (Johansson), leading to many complications.

Scarlett Johansson poses for photographers upon arrival at the premiere of the film ‘Asteroid City’ at the 76th international film festival, Cannes, southern France, Tuesday, May 23, 2023. (Photo by Vianney Le Caer/Invision/AP)

OpenAI CEO Sam Altman appeared to tap into this, too — simply posting the word “her” on the social media platform X the day of GPT-4o’s unveiling.

Many reacting to the model’s demos last week also found some of the interactions struck a strangely flirtatious tone. In one video posted by OpenAI, a female-voiced ChatGPT compliments a company employee on “rocking an OpenAI hoodie,” for example, and in another the chatbot says “oh stop it, you’re making me blush” after being told that it’s amazing.

That’s sparked some conversation on the gendered ways critics say tech companies have long used to develop and engage with voice assistants, dating back far before the latest wave of generative AI advanced the capabilities of AI chatbots. In 2019, the United Nations’ culture and science organization pointed to “hardwired subservience” built into default female-voiced assistants (like Apple’s Siri to Amazon’s Alexa), even when confronted with sexist insults and harassment.

“This is clearly programmed to feed dudes’ egos,” The Daily Show senior correspondent Desi Lydic said of GPT-4o in a segment last week. “You can really tell that a man built this tech.”

For more entertainment news

‘The Apprentice,’ about a young Donald Trump, premieres in Cannes

Microsoft’s AI chatbot will ‘recall’ everything you do on a PC