Facebook Pixel Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It? | DataQuest - business - Read this story on Magzter.com

Try GOLD - Free

Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

DataQuest

|

October 2024

Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?

- Pratima H

Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

What happens when you compress Al's heavier parts with a new centrifugal blade? You save so much storage, memory, GPU stacks and compute gas-tanks, of course. Besides boosting speed, cutting inference latency and expanding compatibility for small devices and edge-networks. But how does this 'squeeze' affect accuracy, error compensation and application-ease? And what about all the other grinders that are attacking the same problem? Like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, NeuralMagic, and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. It was also claimed that, when combined, these methods allow for a reduction in model size by up to 8 times while preserving response quality by 95 per cent. Compressed models like Llama 2 13B can run on I GPU instead of 4- they added. So how does it all work and does it address the issues we mentioned earlier?

We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.

Can you explain additive quantization and PV-tuning in simpler terms- for a layman? Is it similar to model pruning?

MORE STORIES FROM DataQuest

DataQuest

DataQuest

Horse-Whisperers Win. Cat-Herders Lose.

And Dog-Walkers Stay. It's the post-AI world in coding and software engineering. More about humans.

time to read

4 mins

April 2026

DataQuest

DataQuest

The marketing team of tomorrow is being built today

AI tools are already inside marketing teams. The real advantage lies in building strategy, workflows, and measurement that turn speed into measurable revenue growth.

time to read

2 mins

April 2026

DataQuest

DataQuest

Inside the Autonomous Enterprise

For years, enterprise AI lived at the edge of work, surfacing insights, suggesting actions, and helping people move faster.

time to read

2 mins

April 2026

DataQuest

DataQuest

AI without subtitles. For how long now?

Explainability is not just about wiping away the mystery of that stubborn and evasive AI Black Box. It is also about interpretability, trust, safety and responsibility. Has the industry cracked this mystery? Will it?

time to read

12 mins

April 2026

DataQuest

DataQuest

Are You Ready for Sovereign AI?

Governments could function as orchestrator, investor, regulator & anchor customer to drive strategy.

time to read

6 mins

April 2026

DataQuest

DataQuest

Forget the Genie, Read The AI Bottle's Label

Who owns the data, where is that data coming from, how will it be used, is it all sovereign, what about input-indemnity, what about multiplier-pricing, what about new SLAs and pricing models – so many questions, so many what-ifs, so many words to squint at when you look closely at the shiny AI packaging!

time to read

6 mins

April 2026

DataQuest

DataQuest

AI- stuck in the petri-dish paradox

Legacy complexity is a big reason for the AI pilot quicksand. Most pilots stall, not because the model is weak, but because the surrounding environment is not ready to operationalise it. Let's get a closer look at this AI dead-end.

time to read

6 mins

April 2026

DataQuest

DataQuest

The Doorman may open the gates for AI, but he never leaves

With over 23 years of experience in this uniquely-human industry, Nikhil Dev, General Manager, IT, The Lalit Suri Hospitality Group has the room with the perfect view of humans intermingling with technology.

time to read

5 mins

April 2026

DataQuest

DataQuest

The autonomous enterprise: When AI moves from support to execution

AI is no longer just helping people work. In more and more enterprises, it is beginning to execute tasks, trigger workflows, and act inside live systems. But before businesses hand AI real authority, they must solve for trust, control, and accountability.

time to read

8 mins

April 2026

DataQuest

DataQuest

Cognizant CAIO Babak Hodjat explains how Agentic AI will transform enterprises

Cognizant CAIO Babak Hodjat discusses agentic AI, sovereign compute, multilingual models, and Green AI, outlining how India can translate its AI ambitions into scalable enterprise impact and ecosystem growth.

time to read

7 mins

March 2026

Listen

Translate

Share

-
+

Change font size