Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

DataQuest | October 2024

Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?

- Pratima H

What happens when you compress AI's heavier parts with a new centrifugal blade? You save so much storage, memory, GPU stacks and compute gas-tanks, of course, besides boosting speed, cutting inference latency and expanding compatibility for small devices and edge-networks. But how does this 'squeeze' affect accuracy, error compensation and application-ease? And what about all the other grinders that are attacking the same problem, like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, NeuralMagic, and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. They also claimed that, when combined, these methods allow for a reduction in model size by up to 8 times while preserving 95 per cent of response quality. Compressed models like Llama 2 13B can run on 1 GPU instead of 4, they added. So how does it all work, and does it address the issues we mentioned earlier?
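To put the 'up to 8 times' figure in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes the uncompressed weights are stored at 16 bits each, which the announcement does not state, and it counts only the weights, ignoring activations and other runtime memory. The function and variable names are ours, purely for illustration.

```python
def weight_footprint_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate storage for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


PARAMS_13B = 13e9    # parameter count of a 13B model such as Llama 2 13B
BASELINE_BITS = 16   # assumption: 16-bit weights before compression
REDUCTION = 8        # the "up to 8 times" figure quoted in the announcement

# An 8x reduction from a 16-bit baseline works out to 2 bits per parameter.
compressed_bits = BASELINE_BITS / REDUCTION

baseline_gb = weight_footprint_gb(PARAMS_13B, BASELINE_BITS)
compressed_gb = weight_footprint_gb(PARAMS_13B, compressed_bits)

print(f"Baseline weights:   {baseline_gb:.2f} GB at {BASELINE_BITS} bits per parameter")
print(f"Compressed weights: {compressed_gb:.2f} GB at {compressed_bits:.0f} bits per parameter")
```

Under that assumption, the weights shrink from roughly 26 GB to a little over 3 GB. Whether that translates into fewer GPUs depends on the hardware and on the rest of the memory budget, which this sketch does not model.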

We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.

Can you explain additive quantization and PV-tuning in simpler terms, for a layman? Is it similar to model pruning?
