Prøve GULL - Gratis
Adam's Apple moment for AI models
Voice and Data
|January 2025
Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?
Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.
Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?
NO MORE TONE-DEAF LMS
The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

Denne historien er fra January 2025-utgaven av Voice and Data.
Abonner på Magzter GOLD for å få tilgang til tusenvis av kuraterte premiumhistorier og over 9000 magasiner og aviser.
Allerede abonnent? Logg på
FLERE HISTORIER FRA Voice and Data
Voice and Data
Rebuilding enterprise DNA with AI-ready platforms
SAP is rebuilding enterprise foundations with AI-ready data fabrics and secure automation frameworks to create a scalable, intelligent infrastructure.
5 mins
November 2025
Voice and Data
SECURING THE 5G ENGINE FOR A SAFER DIGITAL WORLD
India's 5G revolution demands a defence-first mindset as cyber threats escalate, making trust, resilience and Zero Trust security essential for a digital economy.
4 mins
November 2025
Voice and Data
Get smarter SOCs in the age of intelligent threats
Al-powered SOCs are transforming security, combining automation and intelligence to enhance detection, response, and cyber resilience across the industry.
4 mins
November 2025
Voice and Data
Building the nation's long-term digital spine
India needs a future-proof fibre backbone to deliver reliable, scalable, and mission-critical connectivity for a Viksit Bharat through 2047 and beyond.
4 mins
November 2025
Voice and Data
Are telcos ready to let AI take the wheel?
AI is reshaping how networks run, decisions are made, and customer experiences evolve-pushing telcos to prepare for an era where intelligence drives the core.
4 mins
November 2025
Voice and Data
IGNITING A NEW ORBIT FOR SPACE RESEARCH
From mission design to Earth observation, HPC is now the hidden engine accelerating simulations, autonomy, and discovery across global space science.
10 mins
November 2025
Voice and Data
Glasgow scientists develop AI model to decode protein talk
PLM-interact decodes how proteins communicate, predicting interactions and mutations to speed up disease and virus research.
1 mins
November 2025
Voice and Data
GPS spoofing at IGIA: A wake-up call for national security
The disruption at Indira Gandhi International Airport (IGIA), where more than 800 flights were delayed or diverted following an alleged GPS-spoofing incident, is a wake-up call for India's aviation and communication systems.
2 mins
November 2025
Voice and Data
The wireless foundation of neo-industrial growth
India's next phase of growth will be shaped by secure, scalable wireless platforms that unify connectivity, strengthen security, and accelerate innovation.
2 mins
November 2025
Voice and Data
Scalable, secure, fast: The Cloud CDN advantage
Cloud-based CDNs are redefining digital performance, delivering speed, security, and scalability at the edge for a seamless user experience.
4 mins
November 2025
Listen
Translate
Change font size
