Hadoop and Data Lakes
Discussion surrounding Hadoop and data lakes is as relevant as ever. The Hadoop ecosystem is considered THE technological breakthrough for enabling companies to capitalize on the big data revolution. A data lake, in turn, is viewed as a broad data management concept and a prerequisite for data-driven companies. It promises a fast, efficient, low-cost way to manage, use and analyze any amount of data from different systems with varying structures. As a source for any type of analytic task, it can also claim to be the technological backbone of digitalization and the (big) datafication of the entire economy.
Enterprise projects with Hadoop technology and data lakes in production have only emerged in recent years. As a result, many companies find it hard to distinguish between media hype and the benefits that can realistically be enjoyed. There is limited experience in terms of how and where it really makes sense to implement, which obstacles can arise during implementation, and what potential benefits are delivered in real-world scenarios.
“Hadoop and Data Lakes: Use Cases, Benefits and Limitations” provides answers and insights. It explores the current status of Hadoop and data lakes in general and real experiences from Hadoop use cases across the globe. It tackles important questions including:
- How widespread is the current usage of Hadoop and data lakes in companies?
- What are the plans for the future?
- How do companies utilize Hadoop or plan to use it?
- How do companies currently use data lakes?
- What problems do they face?
- What real-world benefits does Hadoop bring?
- What projects have companies implemented already?
This report is based on an empirical survey exploring the usage of and attitudes towards Hadoop and data lakes in companies across the world. Over 380 business and IT decision-makers completed the online survey in 2016.