Prøve GULL - Gratis

Adam's Apple moment for AI models

Voice and Data

January 2025

Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?

- BY PRATIMA HARIGUNANI

Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.

Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?

NO MORE TONE-DEAF LMS

The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

Denne historien er fra January 2025-utgaven av Voice and Data.

Abonner på Magzter GOLD for å få tilgang til tusenvis av kuraterte premiumhistorier og over 9000 magasiner og aviser.

Allerede abonnent? Logg på

FLERE HISTORIER FRA Voice and Data

Vis alle

Voice and Data

The flight deck layer for autonomous AI networks

As AI networks act autonomously, embedded observability is evolving into a governing layer that orchestrates telemetry, policy and real-time corrective action.

3 mins

February 2026

Voice and Data

"Shopfloor change is now driven by data and intelligent networks"

India's factory floors are no longer defined only by machines, throughput, and shift rosters.

7 mins

February 2026

Voice and Data

DIGITAL TRANSFORMATION HITS BUDGET REALITY

As spending tightens, CIOs are cutting sprawl and proving value fast—turning cloud, networks, and platforms into disciplined systems built to perform.

10 mins

February 2026

Voice and Data

Securing the digital stack at the silicon core

As Al and hyperscale infrastructure expand, trust must be engineered into semiconductors-the foundational layer powering networks and cloud.

3 mins

February 2026

Voice and Data

China builds Meteor 1 parallel optical Al chip

A new photonic processor signals a shift in high-performance computing for Al and data centres amid rising power demands.

1 mins

February 2026

Voice and Data

Intelligent fibre for distributed Al ecosystems

As Al workloads stretch across hyperscale, edge, and GPU clusters, ultra-low- loss fibre and automation now define network performance and resilience.

4 mins

February 2026

Voice and Data

APP-LAYER FRAUD: IT IS TIME FOR A STRONGER TRUST ARCHITECTURE

With scams increasingly originating on messaging platforms, India must correct regulatory asymmetry and strengthen verification to protect digital trust.

5 mins

February 2026

Voice and Data

Beyond the 5G rollout, the age of execution

Industry leaders at the V&D 5G+ Conference debate how India can turn network scale into resilient, intelligent systems that deliver economic value.

12 mins

February 2026

Voice and Data

Building real-time risk engines on telco networks

BFSI firms are redesigning risk systems using AI, blockchain, and low-latency networks to enable real-time fraud prevention and compliance.

4 mins

February 2026

Voice and Data

Five signals for India's digital infrastructure shift

Nirmala Sitharaman outlines India's Al- and data-centre-led roadmap, signalling structural shifts in networks, compute and digital sovereignty.

6 mins

February 2026