Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?
DataQuest | October 2024
Compressing AI models has been both an adventure and a formidable inflection point in the many curves of AI innovation. Can we look for options that go beyond earlier approaches like pruning and SLMs?
What happens when you compress AI's heavier parts with a new centrifugal blade? You save plenty of storage, memory, GPU stacks and compute gas-tanks, of course, besides boosting speed, cutting inference latency and expanding compatibility with small devices and edge networks. But how does this 'squeeze' affect accuracy, error compensation and ease of application? And what about all the other grinders attacking the same problem, like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, Neural Magic and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. They also claimed that, when combined, these methods reduce model size by up to 8 times while preserving 95 per cent of response quality. Compressed models like Llama 2 13B can run on 1 GPU instead of 4, they added. So how does it all work, and does it address the issues we mentioned earlier?
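To put the 'up to 8 times' claim in numbers, here is a rough back-of-the-envelope sketch. It assumes the uncompressed baseline stores each weight in 16 bits and counts weights only; the helper name `model_size_gb` is ours for illustration, and real deployments also need memory for activations, caches and any compression metadata, so treat the figures as indicative rather than as the researchers' own accounting.

```python
def model_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (10^9 bytes), weights only."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 13e9    # nominal parameter count of Llama 2 13B
FP16_BITS = 16     # assumption: uncompressed weights are stored in 16 bits
COMPRESSION = 8    # the "up to 8 times" reduction quoted in the announcement

baseline_gb = model_size_gb(N_PARAMS, FP16_BITS)
compressed_gb = model_size_gb(N_PARAMS, FP16_BITS / COMPRESSION)

print(f"16-bit baseline : {baseline_gb:.2f} GB")
print(f"8x compressed   : {compressed_gb:.2f} GB "
      f"(about {FP16_BITS / COMPRESSION:.0f} bits per weight)")
```

Under these assumptions, an 8-fold reduction corresponds to roughly 2 bits per weight, which is why a model that once had to be spread across several GPUs could plausibly fit on one.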
We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.
Can you explain additive quantization and PV-tuning in simpler terms, for a layman? Is it similar to model pruning?
This story is from the October 2024 edition of DataQuest.
