Adam's Apple moment for AI models

Voice and Data | January 2025

Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?

- BY PRATIMA HARIGUNANI

Dolphins, lyrebirds, bats, mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some of it may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence, sometimes even their survival.

Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also share the common denominator of sound. In one way or another, many players, small and big, have now thrown a voice-dominant model into the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?

NO MORE TONE-DEAF LMS

The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, feed that text to a language model to synthesise a response, and then convert the response back into speech with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other subtextual cues in the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multicultural country like India.
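For readers who think in code, the three-step relay described above can be sketched as a toy Python pipeline. This is only an illustration under stated assumptions: the `Utterance` class, the three stage functions and the pitch and emotion values are hypothetical stand-ins invented for this example, not the API of any product named in this article. What the sketch tries to show is the bottleneck itself: once speech is flattened into plain text, pitch and emotion never reach the language model.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    """Toy stand-in for spoken audio: the words plus two vocal cues."""
    words: str
    pitch_hz: float   # hypothetical pitch value, for illustration only
    emotion: str      # hypothetical emotion label, for illustration only


def speech_to_text(utterance: Utterance) -> str:
    # Stage 1 (hypothetical speech-recognition stub): only the words survive.
    return utterance.words


def language_model(prompt: str) -> str:
    # Stage 2 (hypothetical language-model stub): it sees text and nothing else.
    return f"Reply to: {prompt}"


def text_to_speech(text: str) -> Utterance:
    # Stage 3 (hypothetical speech-synthesis stub): no vocal cues were passed
    # along, so the voice falls back to a default, neutral delivery.
    return Utterance(words=text, pitch_hz=120.0, emotion="neutral")


def cascaded_pipeline(utterance: Utterance) -> Utterance:
    # Each stage runs only after the previous one finishes, so delays add up.
    text = speech_to_text(utterance)
    reply = language_model(text)
    return text_to_speech(reply)


# Same words, very different delivery.
calm = Utterance("I am fine", pitch_hz=110.0, emotion="calm")
upset = Utterance("I am fine", pitch_hz=240.0, emotion="upset")

# The pipeline cannot tell the two speakers apart.
print(cascaded_pipeline(calm) == cascaded_pipeline(upset))
```

In this toy version the calm speaker and the upset speaker receive the same neutral reply, because the difference between them was discarded at the first step. A voice-native model, by contrast, would aim to keep those cues in play from input to output.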
