Cold-Pressed AI Juice - Is That Bottle Here? Is It Worth It?

DataQuest | October 2024

Compressing AI models has been both an adventure and a formidable next-inflection-point in the many curves of AI innovation. Can we look for options better than, and beyond, erstwhile approaches like pruning and SLMs?

- Pratima H

What happens when you compress AI's heavier parts with a new centrifugal blade? You save on storage, memory, GPU stacks and compute gas-tanks, of course, besides boosting speed, cutting inference latency and expanding compatibility with small devices and edge networks. But how does this 'squeeze' affect accuracy, error compensation and ease of application? And what about all the other grinders attacking the same problem, like SLMs, GPTQ and QuIP? Recently, Yandex Research, IST Austria, NeuralMagic, and KAUST announced that they have developed what they call 'two innovative compression methods for large language models'. They also claimed that, when combined, these methods reduce model size by up to 8 times while preserving 95 per cent of response quality. Compressed models like Llama 2 13B can run on 1 GPU instead of 4, they added. So how does it all work, and does it address the issues we mentioned earlier?
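To put the 'up to 8 times' figure in perspective, here is a rough back-of-the-envelope sketch. It rests on assumptions that are not part of the announcement: that a '13B' model has about 13 billion parameters, that the uncompressed weights are stored as 16-bit numbers, and that only the weights are counted (no activations, caches or codebook overhead). Under those assumptions, an 8-times reduction works out to roughly 2 bits per parameter.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate storage for model weights only, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


# All three constants are illustrative assumptions, not figures from the researchers.
PARAMS_13B = 13e9        # assumed parameter count for a "13B" model
BASELINE_BITS = 16       # assumed uncompressed precision (16-bit weights)
COMPRESSION_FACTOR = 8   # the "up to 8 times" reduction quoted above

baseline_gb = weight_memory_gb(PARAMS_13B, BASELINE_BITS)
compressed_gb = weight_memory_gb(PARAMS_13B, BASELINE_BITS / COMPRESSION_FACTOR)

print(f"baseline weights:   {baseline_gb:.2f} GB")
print(f"compressed weights: {compressed_gb:.2f} GB")
```

How many GPUs that translates to depends on the memory of the cards in question and on everything else that has to fit alongside the weights, so the sketch says nothing on its own about the '1 GPU instead of 4' claim.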

We compress it all in this interview with Artem Babenko, Head of Yandex Research. He oversees scientific research at Yandex and the company's engagement with the international scientific community. He also supervises a team of approximately 30 researchers engaged in various areas of computer science. According to Artem, his main achievements are his scientific contributions in three key areas: neural networks for image search, high-dimensional vector compression, and fast search across massive databases containing billions of records. Who better than he to explain the ambition, ingredients and final taste of compression? Let's press those buttons.

Can you explain additive quantization and PV-tuning in simpler terms, for a layman? Is it similar to model pruning?

More stories from DataQuest

GCCs will not kill good old IT baby-sitters. But...

...outsourcing cribs will change as core-work control, intellectual information, ease of direct engagement and expertise become strong reasons for GCCs to sit in chairs where service providers once reigned.

Time to read: 5 mins | August 2025

From POS to car dashboards: Building the Internet of Payments

Your car just paid for coffee. No phone, no wallet, just ambient tech making payments disappear into the background. From dashboards to vending machines, the Internet of Payments is turning transactions into invisible moments of flow.

Time to read: 3 mins | August 2025

Hi Microwave! One Hot Code Please

They are fast, they are fuss-free, they pop up like hot toast without you having to burn your hands. But are they crisp enough? Well-cooked? Let's take a bite of low code, no code and vibe code today.

Time to read: 11 mins | August 2025

Will the future be Consolidated Platforms or Expanding Niches?

The enterprise SaaS industry is experiencing a major shift. What was once a chaotic assortment of niche tools is coalescing into a wave of consolidation, unified platforms, and AI-native reimagination. With companies evaluating their software stacks due to changing business priorities, inflation, and the potential of generative AI, the central question is whether the future of SaaS will reside in all-in-one giants or in a colourful ecosystem of specialised, modular offerings.

Time to read: 8 mins | August 2025

AI-Powered Precision: GE HealthCare's Vision for MedTech Innovation in India

GE HealthCare's CTO shares how AI-led innovations from India are redefining diagnostics, oncology, and access in the country's rapidly evolving MedTech landscape.

Time to read: 6 mins | August 2025

Why Enterprise AI won't be Plug-and-Play

From training its own LLMs on $20M worth of in-house infrastructure to rethinking AI privacy at the data layer, Zoho is proving that AI for enterprises isn't about size; it's about fit, context, and control. Ramprakash Ramamoorthy, Director of AI Research at Zoho, explains why real-world AI adoption demands more than APIs and flashy demos, and why Zoho is betting on contextual, private, ground-up intelligence.

Time to read: 5 mins | August 2025

Over 90% of Security Breaches Linked to Human Error or Malicious Activity

Proofpoint's Bikramdeep Singh discusses India's cybersecurity readiness, people-centric threats, AI-driven defences, sector risks, regulatory shifts, and talent challenges.

Time to read: 5 mins | August 2025

Cybercrime goes corporate: A trillion-dollar industry undermining global security

Cybercrime-as-a-Service (CaaS) makes hacking accessible, driving a booming illicit economy. Learn how AI fuels attacks and what businesses must do to defend.

Time to read: 4 mins | August 2025

Every accident also means a thousand accidents that did not happen

The best use of AI is for improving safety, says this art lover, innovator and railways veteran, who puts the common man's experience and safety on the same seat when he talks about future-forward contours. At the Zinnov Confluence 2025, we caught up with Dr. Sudhanshu Mani, Retired GM, Indian Railways, an independent consultant and an expert voice on the Vande Bharat Express as the leader of the project. We tried to get some fleeting glimpses of the superfast and super-exciting coaches of technology that are being added to our train journeys. Hop on.

Time to read: 3 mins | August 2025

At 30% time and effort savings, AI coding is worthwhile

With automation and AI, coding time and resources can be shrunk in a compelling way. But what about test-flakiness, patterns, regression testing, test coverage, robustness, redundancies and developer satisfaction? Richard Spence, Area Vice President of Growth Sales, International at UiPath decodes all that and more.

Time to read: 3 mins | August 2025
