Prøve GULL - Gratis
Adam's Apple moment for AI models
Voice and Data
|January 2025
Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?
Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.
Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?
NO MORE TONE-DEAF LMS
The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

Denne historien er fra January 2025-utgaven av Voice and Data.
Abonner på Magzter GOLD for å få tilgang til tusenvis av kuraterte premiumhistorier og over 9000 magasiner og aviser.
Allerede abonnent? Logg på
FLERE HISTORIER FRA Voice and Data
Voice and Data
Reimagining Earth through a living digital twin
A new geospatial alliance led by Aechelon seeks to build a real-time digital twin of Earth, merging satellite, radar, and Al for dynamic intelligence.
3 mins
October 2025
Voice and Data
Telcos find new lifeline in spectrum slicing
As revenues shrink and competition intensifies, spectrum-as-a-service offers telcos a shared model to unlock growth while empowering enterprises.
7 mins
October 2025
Voice and Data
Rethinking enterprise connectivity with managed Wi-Fi
As digital transformation accelerates, managed Wi-Fi is emerging as the backbone of agile, secure, and insight-driven enterprise connectivity in India.
4 mins
October 2025
Voice and Data
Beyond VPN: Building trust into network access
As cyberthreats grow more complex, enterprises are shifting from VPNs to ZTNA to achieve secure, scalable, and context-aware access for remote users.
4 mins
October 2025
Voice and Data
Banking on richer, safer digital conversations
RCS redefines banking communication with verified security, two-way interactivity, and measurable engagement that builds digital trust at lower cost.
3 mins
October 2025
Voice and Data
Powering India's cloud with sustainable data hubs
India's data centre boom is reshaping digital infrastructure, demanding clean energy, local innovation, and policy alignment to sustain its growth.
5 mins
October 2025
Voice and Data
Made in India: Building the backbone of loT hardware
From imported modules to home-grown chips, India's loT hardware story is evolving into one of design control, ecosystem depth, and strategic resilience.
4 mins
October 2025
Voice and Data
Breaking data barriers with light
Li-Fi uses the power of light to deliver ultra-fast, secure, and interference-free connectivity-illuminating a new frontier in digital communication.
7 mins
October 2025
Voice and Data
Cyber defence redefined as firewalls give way to foresight
Data security is evolving from reactive protection to predictive foresight— unifying platforms, intelligence, and AI to make organisations breach-ready.
3 mins
October 2025
Voice and Data
Connecting the currents of digital finance
India's telcos are building the invisible pipes that keep digital payments, inclusion, and innovation flowing across the nation's financial ecosystem.
5 mins
October 2025
Listen
Translate
Change font size

