Versuchen GOLD - Frei

Innovate More Rapidly

DataQuest

|

November 2019

How do you undertake a journey to the data cloud? Digital transformation is going on around us. It is happening across all aspects of the society. We are now learning how to integrate new technologies

- Pradeep Chakraborty

Innovate More Rapidly

Change has accelerated in the past decade. Earlier, systems were deployed with the expectation that they would last forever. They were not designed to look at each other’s data, and were fairly limiting. Open source was a new idea in the early 2000s. People began to adopt Lucene, a software I had written. There was no institutional backing or publicity. Open source emerged as a tool for development.

A Distributing Computing Platform

Nutch started in 2003. Around 2005, Google published a paper on how they build search engines. They had a paper talking about how they had automated things. We started working on reworking Nutch in 2004. The tale of debugging is much longer. In 2006, I joined yahoo! I developed Hadoop. Hadoop was named after my son’s toy elephant. It was a distributing computing platform, based on Google’s ideas.

A group of people believed that Hadoop could be used much further. Together, they formed Cloudera. I joined Cloudera in 2009. Stepping back from my lesson in Hadoop, if you can increase the scale and focus on flexibility, you can permit them to store more data in raw form and experiment. They can innovate more quickly. The waterfall method inhibited process through data. This gave us a much more appropriate platform. Most of the past data was relational.

New sources of data are events, things recorded from sensors, etc. We need a different class of tools. Companies can run petabytes of data easily today. Software is also eating the world. In every industry, everywhere, the advances being made are predominantly using software. A company’s growth is fuelled more by data, today. The use of data is no longer isolated. It has emerged everywhere.

Challenges

WEITERE GESCHICHTEN VON DataQuest

DataQuest

DataQuest

Engineering India's Al-First Data Centres at Hyperscale

Rohan Sheth explains how AI and HPC are reshaping India's data centres, from density and cooling to power economics, sustainability, and hyperscale decision criteria.

time to read

4 mins

February 2026

DataQuest

DataQuest

From copilots to colleagues: Why agentic AI is forcing enterprises to rethink control, trust, and culture

As AI agents shift from assisting to acting, enterprises must redesign governance, data controls, and security guardrails so autonomy stays auditable, reversible, and trusted.

time to read

2 mins

February 2026

DataQuest

DataQuest

Reclaiming Control in the AI Era: A Conversation with Kalyan Kumar, CPO, HCLSoftware

Enterprises are reassessing cloud-first strategies as AI becomes core to operations. HCLSoftware's Kalyan Kumar explains why sovereignty, choice and control now shape decisions.

time to read

5 mins

February 2026

DataQuest

DataQuest

When infrastructure learns: The rise of the Al-native core

AI-native infrastructure is moving from concept to operational reality, reshaping how organisations build, govern, and scale intelligence across their digital core.

time to read

6 mins

February 2026

DataQuest

DataQuest

Bridging the gap between connectivity and compute at scale

As AI scales in India, data centres are evolving into high-density, low-latency platforms that unify connectivity, compute, and sustainability at national scale.

time to read

4 mins

February 2026

DataQuest

DataQuest

PUE is not a grapefruit metric, anymore

So what are the new high-hanging fruits for data centre strategists today? And are players going after them?

time to read

4 mins

February 2026

DataQuest

DataQuest

Even if Al demand fades, India need not worry - about data centres

For every megawatt (MW) of installed colocation capacity, users here generate approximately 13.2 PB of data monthly- compared to 0.3 PB for Australia and just 0.01 PB for Singapore. India's data centre growth is not dependent on one tech lever. Plus, it is phased and modular and not kneejerk. Manoj Paul explains these contours in detail.

time to read

7 mins

February 2026

DataQuest

DataQuest

AI infrastructure and systemic risk

What has been the biggest change in data centre industry-specially after AI workloads? Is Al-bubble a big risk for data centre infra- how much will it affect data centres if something cracks?

time to read

1 min

February 2026

DataQuest

DataQuest

Inside the Shift to High-Density, Al-Ready Data Centres

CtrlS' Vipin Jain discusses what it truly takes to build AI-ready data centres in India, balancing high density, liquid-ready cooling, resilience, and ESG accountability.

time to read

4 mins

February 2026

DataQuest

DataQuest

Sustainability is now the headline, not a footnote

Sanjay Agrawal, Head Presales and CTO at Hitachi Vantara India and SAARC opines that the conversation is moving beyond headline metrics like PUE toward a broader view of how data lifecycle management and infrastructure efficiency reduce the overall environmental footprint. Let's see why and how- while also touching upon adjacent (or not-so-adjacent) factors like redundancies, availability and AI-readiness

time to read

3 mins

February 2026

Translate

Share

-
+

Change font size