The Editorial Staff at AIChief is a team of professional content writers with extensive experience in AI and marketing. Founded in 2025, AIChief has quickly grown into the largest free AI resource hub in the industry.
OpenAI Unveils Enhanced AI Models for Transcription and Voice
OpenAI introduces new transcription and voice AI models, enhancing user interaction and accuracy for developers and users alike.

Originally reported bytechcrunch
OpenAI has released upgraded transcription and voice-generating AI models, boasting significant improvements over previous versions. Aligning with its vision of more autonomous systems, OpenAI aims to create models that can independently perform tasks for users. Olivier Godement, OpenAI's Head of Product, describes these models as a step towards building chatbots capable of engaging in meaningful conversations with customers.
The new text-to-speech model, called “gpt-4o-mini-tts,” is designed to produce more realistic and nuanced speech while allowing developers to customize tones and styles. Users can guide the model on how to deliver lines, whether adopting a quirky style reminiscent of a “mad scientist” or a calm demeanor like a mindfulness teacher. Jeff Harris, a member of OpenAI’s product team, emphasizes the goal of enabling developers to craft both voice experience and emotional context for better user interactions.
In addition to the text-to-speech upgrades, OpenAI has introduced “gpt-4o-transcribe” and “gpt-4o-mini-transcribe” as replacements for the outdated Whisper transcription model. These new models were developed using diverse and high-quality audio datasets to improve the accuracy of speech recognition, especially in challenging environments. Harris notes a reduction in errors, claiming these models will not fabricate information as Whisper often did.
Despite these advancements, challenges remain, particularly for Indic and Dravidian languages, where OpenAI reports a word error rate nearing 30%. Interestingly, the new transcription models will not be available for open-source usage, diverging from OpenAI's tradition of releasing models under open licenses. Harris explains that these larger models are not suitable for local machine operation, prompting a more considered approach to their release.
#news
ES
Editorial Staff Editor
View all posts
Filter:
No comments yet. Be the first to comment!
Related stories
Erin Brockovich takes aim at data center secrecy
#ainews
Environmental activist Erin Brockovich has a new mission: Bringing more transparency to data center construction and the impact those data centers have on nearby communities. Brockovich — who was famo...
1h ago
Making sense of the debate over AI psychosis
#ainews
Box founder Aaron Levie got us talking this week with a social media post suggesting that tech CEOs are“uniquely prone to AI psychosis.” On the latest episode ofTechCrunch’s Equity podcast, Kirsten Ko...
7h ago
I went looking for the AI weed vape that gives you Bitcoin for smoking
#ainews
Gudtrip is the most ridiculous AI/crypto/weed product to ever touch the internet. Could it possibly be real? The crypto weed vape found me on 4/20, the high holiday of cannabis enthusiasts everywhere....
9h ago