Prøve GULL - Gratis

Adam's Apple moment for AI models

Voice and Data

|

January 2025

Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?

- BY PRATIMA HARIGUNANI

Adam's Apple moment for AI models

Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.

Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?

NO MORE TONE-DEAF LMS

The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

image

FLERE HISTORIER FRA Voice and Data

Voice and Data

Voice and Data

Rebuilding enterprise DNA with AI-ready platforms

SAP is rebuilding enterprise foundations with AI-ready data fabrics and secure automation frameworks to create a scalable, intelligent infrastructure.

time to read

5 mins

November 2025

Voice and Data

Voice and Data

SECURING THE 5G ENGINE FOR A SAFER DIGITAL WORLD

India's 5G revolution demands a defence-first mindset as cyber threats escalate, making trust, resilience and Zero Trust security essential for a digital economy.

time to read

4 mins

November 2025

Voice and Data

Voice and Data

Get smarter SOCs in the age of intelligent threats

Al-powered SOCs are transforming security, combining automation and intelligence to enhance detection, response, and cyber resilience across the industry.

time to read

4 mins

November 2025

Voice and Data

Voice and Data

Building the nation's long-term digital spine

India needs a future-proof fibre backbone to deliver reliable, scalable, and mission-critical connectivity for a Viksit Bharat through 2047 and beyond.

time to read

4 mins

November 2025

Voice and Data

Voice and Data

Are telcos ready to let AI take the wheel?

AI is reshaping how networks run, decisions are made, and customer experiences evolve-pushing telcos to prepare for an era where intelligence drives the core.

time to read

4 mins

November 2025

Voice and Data

Voice and Data

IGNITING A NEW ORBIT FOR SPACE RESEARCH

From mission design to Earth observation, HPC is now the hidden engine accelerating simulations, autonomy, and discovery across global space science.

time to read

10 mins

November 2025

Voice and Data

Glasgow scientists develop AI model to decode protein talk

PLM-interact decodes how proteins communicate, predicting interactions and mutations to speed up disease and virus research.

time to read

1 mins

November 2025

Voice and Data

GPS spoofing at IGIA: A wake-up call for national security

The disruption at Indira Gandhi International Airport (IGIA), where more than 800 flights were delayed or diverted following an alleged GPS-spoofing incident, is a wake-up call for India's aviation and communication systems.

time to read

2 mins

November 2025

Voice and Data

Voice and Data

The wireless foundation of neo-industrial growth

India's next phase of growth will be shaped by secure, scalable wireless platforms that unify connectivity, strengthen security, and accelerate innovation.

time to read

2 mins

November 2025

Voice and Data

Voice and Data

Scalable, secure, fast: The Cloud CDN advantage

Cloud-based CDNs are redefining digital performance, delivering speed, security, and scalability at the edge for a seamless user experience.

time to read

4 mins

November 2025

Listen

Translate

Share

-
+

Change font size