يحاول ذهب - حر

What the DeepSeek disruption is all about

February 06, 2025

|

Hindustan Times East UP

Recently, Chinese Artificial Intelligence (AI) firm DeepSeek introduced its latest large language model, R1, sending shockwaves through the tech industry.

- Siddharth Pai

R1 wasn't just on par with the best AI models available; it was built at a fraction of the usual cost and released for free. The financial world reacted instantly, with the United States (US) stock market losing a staggering $1 trillion the day R1 was unveiled.

The implications of DeepSeek's move extended far beyond these financial tremors. By openly sharing the details of how R1 and its predecessor, V3, were developed and making these models freely accessible, DeepSeek shattered a long-held industry belief that reasoning-based AI models were extraordinarily difficult and expensive to create. This revelation had an immediate impact, triggering a rapid response from major AI competitors.

The reaction from competitors and the rapid shifts in the industry begs the question: What exactly did DeepSeek do to cause such a massive upheaval, and is the hype surrounding R1 justified? Understanding the impact requires a closer look at how large language models are built.

Training these models involves two primary phases: Pre-training and post-training. In pre-training, the model learns to generate text by analyzing vast amounts of publicly available documents (basically the internet's entire contents) and processing them repeatedly. This results in a base model that possesses extensive knowledge but lacks task-specific refinements.

The process is computationally intensive and represents the largest cost in AI development. The model then undergoes post-training to refine its capabilities. One key component of this is supervised fine-tuning (SFT), where human trainers curate question-answer pairs and teach the model to respond accurately.

المزيد من القصص من Hindustan Times East UP

Hindustan Times East UP

India's drug regulation is stuck in a time warp

Earlier this month, at least 20 children died in Madhya Pradesh and Rajasthan after consuming Coldrif cough syrup, which tested positive for diethylene glycol (DEG) —a highly toxic industrial chemical known to cause kidney failure.

time to read

3 mins

October 14, 2025

Hindustan Times East UP

CANADIAN FM DISCUSSES KEY TIES WITH MODI, JAISHANKAR

Canada’s foreign minister Anita Anand met Prime Minister Narendra Modi in New Delhi on Monday, an official statement issued by the Prime Minister's Office (PMO) said.

time to read

1 min

October 14, 2025

Hindustan Times East UP

Trio win Economics Nobel for work on innovation, growth

Joel Mokyr, Philippe Aghion and Peter Howitt won the 2025 Nobel economics prize for their work on how innovation and the forces of “creative destruction” can drive economic growth, the Royal Swedish Academy of Sciences said on Monday.

time to read

1 min

October 14, 2025

Hindustan Times East UP

Between New Delhi & Kabul, a fine balance

Pragmatism and convergence on Pakistan have replaced ideology and legacy concerns as the main drivers of India-Afghanistan relations

time to read

4 mins

October 14, 2025

Hindustan Times East UP

Hindustan Times East UP

Trump hails end of war as Hamas frees hostages

Hamas freed the last 20 surviving Israeli hostages on Monday under a US-brokered ceasefire deal, a big step towards ending two years of shattering war in Gaza as President Donald Trump urged Israel to turn military success into peace.

time to read

1 min

October 14, 2025

Hindustan Times East UP

CM okays Invest UP restructuring; satellite offices in metros soon

LUCKNOW: Satellite investment promotion offices of Invest UP will come up in Mumbai, Bengaluru, Hyderabad, Chennai, and New Delhi to enable direct engagement with domestic and global investors and ensure more investments in Uttar Pradesh.

time to read

1 min

October 14, 2025

Hindustan Times East UP

The changing ground in Bihar

NDA seat deal makes evident the shift in the power dynamic between the BJP and JD(U)

time to read

2 mins

October 14, 2025

Hindustan Times East UP

Why India does poorly in building large companies

We had in an earlier article (‘Superman entrepreneurs driving the Indian economy’, August 24) described how Superman entrepreneurs (SEs), a force-of-nature kind of entrepreneur, had been driving India's growth by overcoming the odds and creating dynamic companies.

time to read

4 mins

October 14, 2025

Hindustan Times East UP

Charges against Lalu, Tejashwi in IRCTC case

A Delhi court has framed charges under the Prevention of Corruption Act pertaining to criminal conspiracy and cheating against former Railways minister and Rashtriya Janata Dal (RJD) supremo Lalu Prasad in connection with the Indian Railway Catering and Tourism Corporation (IRCTC) hotel corruption case.

time to read

1 min

October 14, 2025

Hindustan Times East UP

In Bihar, the INDIA bloc fights a perception battle

Will it be the National Democratic Alliance (NDA) that will assume office in Patna on November 14, when the Bihar assembly election results are announced? Or will it be the Mahagathbandhan (a phalanx of the larger Opposition alliance, the INDIA bloc, in Bihar)? There's also a third player that is keenly watched — the Jan Suraaj Party, founded by pollster Prashant Kishor, who may emerge as a kingmaker.

time to read

4 mins

October 13, 2025

Listen

Translate

Share

-
+

Change font size