OpenAI has unveiled a new AI voice generator called Voice Engine that can accurately mimic human voices using a 15-second sample of someone speaking. While the tool has potential applications for accessibility services, concerns have been raised about potential misinformation and abuse due to its convincing nature. OpenAI has a history of successfully launching AI tools, and Voice Engine is currently being tested by a small group of trusted partners in education and health technology.
The company acknowledges the risks associated with generating speech that resembles people’s voices, particularly in an election year. OpenAI is considering major changes as AI-generated audio becomes more widely available, including the potential phased-out use of voice-based authentication for bank accounts. The company emphasizes the importance of voice authentication experiences and a no-go voice list to prevent the creation of voices that are too similar to prominent figures.
Voice Engine has the ability to create replica voices that can speak in multiple languages using a voice sample in one language. OpenAI shared examples of AI-generated audio in various languages to demonstrate the tool’s capabilities in maintaining the tone and accent of the original speaker. The company is currently working on the public release of Sora, an AI-generated video tool that can create realistic looking 60-second videos from text instructions, as well as ChatGPT, which can generate images from a text prompt.
While OpenAI’s Voice Engine has potential benefits for translation, reading assistance, and aiding those who have lost their ability to speak, there are concerns about its potential misuse for disinformation and scams. The company is working with trusted partners to test the tool and gather feedback on its potential risks and benefits. OpenAI is committed to making any necessary changes to prevent misuse of AI-generated voices and ensure proper authentication and identification of AI-generated content.
As AI-generated audio technology becomes more widespread, OpenAI is considering how to responsibly deploy Voice Engine and similar tools. The company emphasizes the importance of verifying the original speaker’s consent and clearly identifying AI-generated content to listeners. OpenAI’s goal is to balance the potential benefits of AI voice technology with the need to prevent misuse and deception. The company’s cautious approach reflects the growing concerns surrounding the ethical use of AI technology in various applications.