試す - 無料

Adam's Apple moment for AI models

Voice and Data

|

January 2025

Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?

- BY PRATIMA HARIGUNANI

Adam's Apple moment for AI models

Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.

Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?

NO MORE TONE-DEAF LMS

The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

image

Voice and Data からのその他のストーリー

Voice and Data

Voice and Data

DPDP Act sets new guardrails for India's Al ecosystem

India's data law reshapes the foundations of Al by enforcing trust, clarity, and responsible innovation across every layer of its evolving digital ecosystem.

time to read

3 mins

December 2025

Voice and Data

Voice and Data

Securing the nation in an age of silent cyber conflict

India's expanding digital ecosystem now sits at the centre of global cyber conflict, demanding resilience, sovereign control, and Al-driven defence at scale.

time to read

3 mins

December 2025

Voice and Data

Voice and Data

"We are building a Rapido, not a bus, for space"

Immanuel Louis, Co-founder and COO of Astrophel Aerospace, is attempting to rewire how India thinks about satellite launches-from large-ride dependency to faster, indigenous access to orbit. Alongside his co-founder, Louis is building a vertically integrated space-tech startup focused on Made-in-India rocket components, reusable engines, and sub-systems optimised for small and CubeSat missions.

time to read

6 mins

December 2025

Voice and Data

Voice and Data

Driving speed, stability, and next-gen wireless efficiency

Built on multi-link architecture, expanded spectrum, and higher-order modulation, Wi-Fi 7 lays the foundation for the next phase of immersive and connected living.

time to read

4 mins

December 2025

Voice and Data

Voice and Data

UNSHACKLING SATCOM: THE POLICY RESET INDIA NEEDS FOR VIKSIT BHARAT

India's connectivity gap demands a regulatory shift that can enable satellite networks to reach regions where terrestrial infrastructure cannot viably operate.

time to read

6 mins

December 2025

Voice and Data

Voice and Data

Telcos to Techcos: The long climb to a new success story

As operators climb out of the commodity-connectivity trap, they are rebuilding networks, services, and platforms to script a sustainable growth story.

time to read

3 mins

December 2025

Voice and Data

Voice and Data

When automation turned against the world wide web

A routine Cloudflare update triggered a global outage, exposing the security and stability risks of centralised cloud and why resilience must be re-engineered.

time to read

7 mins

December 2025

Voice and Data

Voice and Data

6G to need 2-3 GHz more mid-band spectrum by 2040

GSMA forecasts 2-3 GHz more mid-band spectrum needed by 2040 to avoid urban congestion and enable global 6G readiness.

time to read

1 min

December 2025

Voice and Data

Voice and Data

"Neutral networks will anchor 5G, satcom and 6G growth"

Salil Ahuja, Chief Strategy Officer at Shaurrya Teleservices, oversees strategy at one of India's emerging neutral digital infrastructure providers and among the early TRAI-empanelled Digital Connectivity Rating Agencies (DCRAs). With connectivity quality, fragmented in-building networks, private 5G readiness, satcom convergence, and AI-driven infrastructure becoming central to India's digital ecosystem, he is shaping how buildings, enterprises, and operators prepare for the next wave of digital services.

time to read

4 mins

December 2025

Voice and Data

Voice and Data

Shaping India's near-future digital playbook

As India enters 2026, these ten technologies will advance network design, strengthen digital infrastructure, and unlock new layers of enterprise value.

time to read

11 mins

December 2025

Listen

Translate

Share

-
+

Change font size