A Data Science Central Community
Yes, we are marching towards New Year 2016! What happened to Resolution of 2014, 2015? Quit Habits? Practice Habits? Road ahead? Am into all, but i could not able to keep it up. Hence this New Year 2016 is no more resolutions, just implement the plan.
Extend to that, as we know big data is bringing more business value to enterprise by leveraging the data lake. Data Lake..... What is that? Data Lake is loosely defined word and the definition gets changed during implementation in terms with the data systems.
Hence we planned to share the artifacts called - "The Collective Definition of Data Lake by Big Data Community". Request to post your thoughts & definition......
Advance Happy New Year 2016!
Wikipedia: A data lake is a large storage repository and processing engine, they provide "massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs". The term was coined by James Dixon, Pentaho chief technology officer.
Gartner: A data lake is a collection of storage instances of various data assets additional to the originating data sources. These assets are stored in a near-exact, or even exact, copy of the source format.
Microsoft: Data Lake - Batch, real-time and interactive analytics made easy.
EMC2: Data Lake Foundation gives you a single system to capture, store, analyze, protect and manage your data.
Capgemini: Discover a new approach to addressing your company’s information challenges. Embracing Big Data satisfies both local and corporate needs from an integrated environment. We call it the Business Data Lake.
Cognizant: Your mission (whether or not you accept it) is to not only manage the sheer bulk of data, but to also draw meaning from the bits and bytes. This requires going way beyond traditional data repositories to what we call the data lake. You won't be able to afford the time, effort and cost of loading all this data into a big data repository, nor could you easily find and use the data you need in it.
Kumar Chinnakali [IGATE]: Data Lake empowers the data fidelity, where the data systems can drill down to data inception at any time in future.
Yours Please? Data……..
Please ping to [email protected]