Essayer OR - Gratuit
Adam's Apple moment for AI models
Voice and Data
|January 2025
Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?
Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.
Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?
NO MORE TONE-DEAF LMS
The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

Cette histoire est tirée de l'édition January 2025 de Voice and Data.
Abonnez-vous à Magzter GOLD pour accéder à des milliers d'histoires premium sélectionnées et à plus de 9 000 magazines et journaux.
Déjà abonné ? Se connecter
PLUS D'HISTOIRES DE Voice and Data
Voice and Data
The flight deck layer for autonomous AI networks
As AI networks act autonomously, embedded observability is evolving into a governing layer that orchestrates telemetry, policy and real-time corrective action.
3 mins
February 2026
Voice and Data
"Shopfloor change is now driven by data and intelligent networks"
India's factory floors are no longer defined only by machines, throughput, and shift rosters.
7 mins
February 2026
Voice and Data
DIGITAL TRANSFORMATION HITS BUDGET REALITY
As spending tightens, CIOs are cutting sprawl and proving value fast—turning cloud, networks, and platforms into disciplined systems built to perform.
10 mins
February 2026
Voice and Data
Securing the digital stack at the silicon core
As Al and hyperscale infrastructure expand, trust must be engineered into semiconductors-the foundational layer powering networks and cloud.
3 mins
February 2026
Voice and Data
China builds Meteor 1 parallel optical Al chip
A new photonic processor signals a shift in high-performance computing for Al and data centres amid rising power demands.
1 mins
February 2026
Voice and Data
Intelligent fibre for distributed Al ecosystems
As Al workloads stretch across hyperscale, edge, and GPU clusters, ultra-low- loss fibre and automation now define network performance and resilience.
4 mins
February 2026
Voice and Data
APP-LAYER FRAUD: IT IS TIME FOR A STRONGER TRUST ARCHITECTURE
With scams increasingly originating on messaging platforms, India must correct regulatory asymmetry and strengthen verification to protect digital trust.
5 mins
February 2026
Voice and Data
Beyond the 5G rollout, the age of execution
Industry leaders at the V&D 5G+ Conference debate how India can turn network scale into resilient, intelligent systems that deliver economic value.
12 mins
February 2026
Voice and Data
Building real-time risk engines on telco networks
BFSI firms are redesigning risk systems using AI, blockchain, and low-latency networks to enable real-time fraud prevention and compliance.
4 mins
February 2026
Voice and Data
Five signals for India's digital infrastructure shift
Nirmala Sitharaman outlines India's Al- and data-centre-led roadmap, signalling structural shifts in networks, compute and digital sovereignty.
6 mins
February 2026
Listen
Translate
Change font size
