Systematic mapping of literature of a data lake
(Enero - Junio)
Abstract
The exponential growth of data in organizations has generated the development of new technologies such as a Data Lake or Data Lagoon. In this work have been raised research questions that allowed to determine its definition, utility, importance, architecture, functions and contributions that generates the use of this technology, for it was carried out a Systematic Mapping of Literature (MSL). As a result, it was defined that a Data Lake is a low-cost data repository that allows the storage of structured, unstructured and semi-structured data. The technology that allows the implementation of a Data Lake is Hadoop, which forces data analysts to investigate its implementation.