Versuchen GOLD - Frei
Innovate More Rapidly
DataQuest
|November 2019
How do you undertake a journey to the data cloud? Digital transformation is going on around us. It is happening across all aspects of the society. We are now learning how to integrate new technologies
Change has accelerated in the past decade. Earlier, systems were deployed with the expectation that they would last forever. They were not designed to look at each other’s data, and were fairly limiting. Open source was a new idea in the early 2000s. People began to adopt Lucene, a software I had written. There was no institutional backing or publicity. Open source emerged as a tool for development.
A Distributing Computing Platform
Nutch started in 2003. Around 2005, Google published a paper on how they build search engines. They had a paper talking about how they had automated things. We started working on reworking Nutch in 2004. The tale of debugging is much longer. In 2006, I joined yahoo! I developed Hadoop. Hadoop was named after my son’s toy elephant. It was a distributing computing platform, based on Google’s ideas.
A group of people believed that Hadoop could be used much further. Together, they formed Cloudera. I joined Cloudera in 2009. Stepping back from my lesson in Hadoop, if you can increase the scale and focus on flexibility, you can permit them to store more data in raw form and experiment. They can innovate more quickly. The waterfall method inhibited process through data. This gave us a much more appropriate platform. Most of the past data was relational.
New sources of data are events, things recorded from sensors, etc. We need a different class of tools. Companies can run petabytes of data easily today. Software is also eating the world. In every industry, everywhere, the advances being made are predominantly using software. A company’s growth is fuelled more by data, today. The use of data is no longer isolated. It has emerged everywhere.
Challenges
Diese Geschichte stammt aus der November 2019-Ausgabe von DataQuest.
Abonnieren Sie Magzter GOLD, um auf Tausende kuratierter Premium-Geschichten und über 9.000 Zeitschriften und Zeitungen zuzugreifen.
Sie sind bereits Abonnent? Anmelden
WEITERE GESCHICHTEN VON DataQuest
DataQuest
Engineering India's Al-First Data Centres at Hyperscale
Rohan Sheth explains how AI and HPC are reshaping India's data centres, from density and cooling to power economics, sustainability, and hyperscale decision criteria.
4 mins
February 2026
DataQuest
From copilots to colleagues: Why agentic AI is forcing enterprises to rethink control, trust, and culture
As AI agents shift from assisting to acting, enterprises must redesign governance, data controls, and security guardrails so autonomy stays auditable, reversible, and trusted.
2 mins
February 2026
DataQuest
Reclaiming Control in the AI Era: A Conversation with Kalyan Kumar, CPO, HCLSoftware
Enterprises are reassessing cloud-first strategies as AI becomes core to operations. HCLSoftware's Kalyan Kumar explains why sovereignty, choice and control now shape decisions.
5 mins
February 2026
DataQuest
When infrastructure learns: The rise of the Al-native core
AI-native infrastructure is moving from concept to operational reality, reshaping how organisations build, govern, and scale intelligence across their digital core.
6 mins
February 2026
DataQuest
Bridging the gap between connectivity and compute at scale
As AI scales in India, data centres are evolving into high-density, low-latency platforms that unify connectivity, compute, and sustainability at national scale.
4 mins
February 2026
DataQuest
PUE is not a grapefruit metric, anymore
So what are the new high-hanging fruits for data centre strategists today? And are players going after them?
4 mins
February 2026
DataQuest
Even if Al demand fades, India need not worry - about data centres
For every megawatt (MW) of installed colocation capacity, users here generate approximately 13.2 PB of data monthly- compared to 0.3 PB for Australia and just 0.01 PB for Singapore. India's data centre growth is not dependent on one tech lever. Plus, it is phased and modular and not kneejerk. Manoj Paul explains these contours in detail.
7 mins
February 2026
DataQuest
AI infrastructure and systemic risk
What has been the biggest change in data centre industry-specially after AI workloads? Is Al-bubble a big risk for data centre infra- how much will it affect data centres if something cracks?
1 min
February 2026
DataQuest
Inside the Shift to High-Density, Al-Ready Data Centres
CtrlS' Vipin Jain discusses what it truly takes to build AI-ready data centres in India, balancing high density, liquid-ready cooling, resilience, and ESG accountability.
4 mins
February 2026
DataQuest
Sustainability is now the headline, not a footnote
Sanjay Agrawal, Head Presales and CTO at Hitachi Vantara India and SAARC opines that the conversation is moving beyond headline metrics like PUE toward a broader view of how data lifecycle management and infrastructure efficiency reduce the overall environmental footprint. Let's see why and how- while also touching upon adjacent (or not-so-adjacent) factors like redundancies, availability and AI-readiness
3 mins
February 2026
Translate
Change font size
