On March 17, Google announced that Vertex AI would be initiating Chirp 3, an audio generative model, from next week. Chirp 3 is an AI generative model, claiming to be capable of converting speech-to-text and HD text-to-speech with 8 voices in 31 languages.  Though the efficiency and utility of this feature is yet to be tested by the customers, it seems to be a promising feature of AI models.

This ground-breaking announcement was made at an event “Gemini for the United Kingdom”, attended by Google DeepMind CEO Sir Demis Hassabis and Google Cloud CEO Thomas Kurian.  Along with this, the platform made other progressive statements about Gemini advances, UK skill training programme, and £280,000 startup cloud creditsfor UK based AI startups.

Applications of Vertex AI

Since its launch in 2021, Google has been using the Vertex AI platform to develop AI models for myriad applications, including visual content analysis, social media monitoring, medical assistance, and chatbot development. Now, it is about to expand its application to generating voice models via Chirp 3.

Uses of Chirp 3

Google is anticipating Chirp 3 to play a transformative role in assisting different digital industries by generating audiobooks, transcribe meetings, facilitate customer call services, podcast narration, and voice annotation. Most interesting it can be used to build voice assistants that could further expand the application of this AI model in enterprises and customer services centres.

Restricted Chip 3 for Safety

For now Chirp 3 will be available under restricted usage due to its potential misuse and Google’s high security and safety policies. For that Thomas Kurian said

“We’re uniquely able to provide secure, flexible infrastructure; leading AI models; and an open developer platform that integrates with existing IT investments while maintaining security, privacy, and access controls”

AI Voice Platforms

Chirp 3 is the first step for Google in the domain of AI voice models; it is not the only one in the league. Companies like Sesame and ElevenLabs have already launched their AI voice models and they are already providing voice services to their customers. Seasame’s Maya and Mile has gone viral due to ultra–natural voice expression, emotional intelligence, real-time conversation, and personalized voices. Similarly, ElevenLabs provides the services of dubbing, voice cloning, creating a video voiceover, and converting text into realistic sound effects. Besides these, Amazon, Speechify, and TurboScribe also assist their customers by providing AI voice services.

Although Chirp 3’s operationalization is expected within the next week, and Google seems very confident about this new AI voice model launch, it remains to be seen how Google will compete in this already developed market and satisfy its customers.