Open Access Open Access  Restricted Access Subscription Access

Data Warehouse and Big Data Integration


Affiliations
1 Distrial F.J.C University, Bogota, Colombia
 

Big Data triggered furthered an influx of research and prospective on concepts and processes pertaining previously to the Data Warehouse field. Some conclude that Data Warehouse as such will disappear; others present Big Data as the natural Data Warehouse evolution (perhaps without identifying a clear division between the two); and finally, some others pose a future of convergence, partially exploring the possible integration of both. In this paper, we revise the underlying technological features of Big Data and Data Warehouse, highlighting their differences and areas of convergence. Even when some differences exist, both technologies could (and should) be integrated because they both aim at the same purpose: data exploration and decision making support. We explore some convergence strategies, based on the common elements in both technologies. We present a revision of the state-of-the-art in integration proposals from the point of view of the purpose, methodology, architecture and underlying technology, highlighting the common elements that support both technologies that may serve as a starting point for full integration and we propose a proposal of integration between the two technologies.

Keywords

Big Data, Data Warehouse, Integration, Hadoop, NoSql, MapReduce, 7V’s, 3C’s, M&G.
User
Notifications
Font Size


  • Data Warehouse and Big Data Integration

Abstract Views: 712  |  PDF Views: 346

Authors

Sonia Ordonez Salinas
Distrial F.J.C University, Bogota, Colombia
Alba Consuelo Nieto Lemus
Distrial F.J.C University, Bogota, Colombia

Abstract


Big Data triggered furthered an influx of research and prospective on concepts and processes pertaining previously to the Data Warehouse field. Some conclude that Data Warehouse as such will disappear; others present Big Data as the natural Data Warehouse evolution (perhaps without identifying a clear division between the two); and finally, some others pose a future of convergence, partially exploring the possible integration of both. In this paper, we revise the underlying technological features of Big Data and Data Warehouse, highlighting their differences and areas of convergence. Even when some differences exist, both technologies could (and should) be integrated because they both aim at the same purpose: data exploration and decision making support. We explore some convergence strategies, based on the common elements in both technologies. We present a revision of the state-of-the-art in integration proposals from the point of view of the purpose, methodology, architecture and underlying technology, highlighting the common elements that support both technologies that may serve as a starting point for full integration and we propose a proposal of integration between the two technologies.

Keywords


Big Data, Data Warehouse, Integration, Hadoop, NoSql, MapReduce, 7V’s, 3C’s, M&G.

References