Essayer OR - Gratuit
Adam's Apple moment for AI models
Voice and Data
|January 2025
Traditional language models struggled with voice, losing time, accuracy, and nuance. Are voice-driven models the game-changing twist the world needs?

Dolphins, Lyrebirds, Bats, Mockingbirds, whales, and elephants may live in entirely different environments, but they have one thing in common: the power to communicate through sound. Some may be ultrasonic, some infrasonic, some mimicry, and some utterly akin to baby babble, but sound is always a crucial part of their existence—sometimes even survival.
Meta Spirit LM, GPT-4o, Gnani, DeepL and Sutra HiFi. These names seem to belong to different AI forests altogether, but they also have the denominator of sound running across them. In some way or another, many small and big players have now thrown a voice-dominant model in the Language Model (LM) ring. It is not hard to understand why when one looks at the apparent advantages. But are they also good enough to fix some deep-seated issues their elder siblings have faced? Or would the throat still cough differently again?
NO MORE TONE-DEAF LMS
The challenge with existing AI models was that they first had to convert speech to text through direct or multimodal approaches, take the input for synthesising it with a language model, and convert it all with text-to-speech techniques. This consumed time. This took up compute power. This needed data inputs. But above everything else, this process still missed out on subtle aspects like pitch, tone, emotion and other sub-text areas of the human voice. Not to mention the sheer diversity of accents, dialects and vernacular speech, especially in a multi-cultural country like India.

Cette histoire est tirée de l'édition January 2025 de Voice and Data.
Abonnez-vous à Magzter GOLD pour accéder à des milliers d'histoires premium sélectionnées et à plus de 9 000 magazines et journaux.
Déjà abonné ? Se connecter
PLUS D'HISTOIRES DE Voice and Data

Voice and Data
Rewiring AI infrastructure from core to edge
As GenAI shifts from pilots to production, enterprises must rethink infrastructure strategy to meet performance, cost, and compliance demands.
3 mins
September 2025

Voice and Data
SECURITY AND COMPETITION: BALANCING THE TELECOM ACT
The Telecom Act 2023 strengthens security and consumer trust, but gaps in competition, OTT parity, and licensing stability raise new challenges.
5 mins
September 2025

Voice and Data
Bet off the table: India rewrites online gaming future
The Online Gaming Act halts real-money play, pushing jobs, capital, and brands to pivot toward e-sports, free-to-play, and global growth paths.
5 mins
September 2025

Voice and Data
Chip packaging evolves to support connected futures
As 5G, edge, and loT scale globally, chip packaging plays a critical role in delivering low-latency, energy-efficient connectivity at both core and edge.
3 mins
September 2025

Voice and Data
Rising orbit: Startups power India's new space journey
India's space sector shifts from state-led missions to public-private partnerships, with startups driving agility, innovation, and global competitiveness.
3 mins
September 2025

Voice and Data
GPU cloud: Retail's new engine of relevance
Retailers must shift from basic personalisation to hyper-relevance, using GPU cloud to deliver fast, scalable, and privacy-first experiences.
3 mins
September 2025

Voice and Data
Charting telecom's path to a trusted digital future
At COAI Dialogues 2025, trust, policy, and infrastructure converge to shape India's telecom roadmap for an inclusive, secure digital future.
7 mins
September 2025

Voice and Data
Building India's quantum backbone with QKD
India's push for QKD networks is reshaping security, demanding policy clarity, legal reform, and early adoption to protect national infrastructure.
3 mins
September 2025

Voice and Data
Fibre or 5G? Convergence may be the real superhero
Fibre brings stability and 5G adds reach, but convergence offers enterprises the most resilient path to future-ready connectivity.
6 mins
September 2025

Voice and Data
Transcending the noise with tech-powered interactions
Technology helps businesses cut through digital clutter, creating trusted, seamless interactions that build loyalty and lasting customer value.
4 mins
September 2025
Listen
Translate
Change font size