कोशिश गोल्ड - मुक्त
Adam's Apple moment for AI models
Voice and Data
|January 2025
Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?
Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.
Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?
NO MORE TONE-DEAF LMS
The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

यह कहानी Voice and Data के January 2025 संस्करण से ली गई है।
हजारों चुनिंदा प्रीमियम कहानियों और 10,000 से अधिक पत्रिकाओं और समाचार पत्रों तक पहुंचने के लिए मैगज़्टर गोल्ड की सदस्यता लें।
क्या आप पहले से ही ग्राहक हैं? साइन इन करें
Voice and Data से और कहानियाँ
Voice and Data
DPDP Act sets new guardrails for India's Al ecosystem
India's data law reshapes the foundations of Al by enforcing trust, clarity, and responsible innovation across every layer of its evolving digital ecosystem.
3 mins
December 2025
Voice and Data
Securing the nation in an age of silent cyber conflict
India's expanding digital ecosystem now sits at the centre of global cyber conflict, demanding resilience, sovereign control, and Al-driven defence at scale.
3 mins
December 2025
Voice and Data
"We are building a Rapido, not a bus, for space"
Immanuel Louis, Co-founder and COO of Astrophel Aerospace, is attempting to rewire how India thinks about satellite launches-from large-ride dependency to faster, indigenous access to orbit. Alongside his co-founder, Louis is building a vertically integrated space-tech startup focused on Made-in-India rocket components, reusable engines, and sub-systems optimised for small and CubeSat missions.
6 mins
December 2025
Voice and Data
Driving speed, stability, and next-gen wireless efficiency
Built on multi-link architecture, expanded spectrum, and higher-order modulation, Wi-Fi 7 lays the foundation for the next phase of immersive and connected living.
4 mins
December 2025
Voice and Data
UNSHACKLING SATCOM: THE POLICY RESET INDIA NEEDS FOR VIKSIT BHARAT
India's connectivity gap demands a regulatory shift that can enable satellite networks to reach regions where terrestrial infrastructure cannot viably operate.
6 mins
December 2025
Voice and Data
Telcos to Techcos: The long climb to a new success story
As operators climb out of the commodity-connectivity trap, they are rebuilding networks, services, and platforms to script a sustainable growth story.
3 mins
December 2025
Voice and Data
When automation turned against the world wide web
A routine Cloudflare update triggered a global outage, exposing the security and stability risks of centralised cloud and why resilience must be re-engineered.
7 mins
December 2025
Voice and Data
6G to need 2-3 GHz more mid-band spectrum by 2040
GSMA forecasts 2-3 GHz more mid-band spectrum needed by 2040 to avoid urban congestion and enable global 6G readiness.
1 min
December 2025
Voice and Data
"Neutral networks will anchor 5G, satcom and 6G growth"
Salil Ahuja, Chief Strategy Officer at Shaurrya Teleservices, oversees strategy at one of India's emerging neutral digital infrastructure providers and among the early TRAI-empanelled Digital Connectivity Rating Agencies (DCRAs). With connectivity quality, fragmented in-building networks, private 5G readiness, satcom convergence, and AI-driven infrastructure becoming central to India's digital ecosystem, he is shaping how buildings, enterprises, and operators prepare for the next wave of digital services.
4 mins
December 2025
Voice and Data
Shaping India's near-future digital playbook
As India enters 2026, these ten technologies will advance network design, strengthen digital infrastructure, and unlock new layers of enterprise value.
11 mins
December 2025
Listen
Translate
Change font size
