Open Access Open Access  Restricted Access Subscription Access

Data Integration - Challenges, Techniques and Future Directions: A Comprehensive Study


Affiliations
1 Faculty of Computer Science and Engineering, Sathyabama University, Chennai - 600119, Tamil Nadu, India
2 School of Information Technology and Engineering, VIT University, Vellore - 632 014, Tamil Nadu, India
 

Objectives: This paper studies various query reformulation techniques, which are used to convert the intermediate schema to the targeted schema. The techniques such as Ontology based information integration and data integration languages are also reviewed. Methods/Statistical Analysis: This paper discusses the techniques used for data integration and also to resolve inconsistencies from the integrated data. Data integration techniques mainly focusing on integration of data in several levels and applying independent or unified query over the data available. Findings: Analysis of various techniques done in the paper has led to the identification of several shortcomings and scope for improvements in the available techniques. This identified research directions includes vertical enhancement of wrappers by utilizing a single unified wrapper for all the data sources. Optimizing the queries depending on the data source is also another major requirement to provide efficient and faster results reducing the data retrieval latencies. The paper also advocates other research directions that include identifying duplicates from the retrieved data and performing effective elimination strategies to reduce space consumption. Identifying conflicts and applying strategies to eliminate conflicts is another major area with a huge scope for improvement. Application/Improvements: The comprehensive survey also recommends further works in the area of data integration techniques.

Keywords

Conflict Identification, Conflict Resolution, Data Integration, Data Conflicts, Inconsistency Resolution.
User

Abstract Views: 226

PDF Views: 0




  • Data Integration - Challenges, Techniques and Future Directions: A Comprehensive Study

Abstract Views: 226  |  PDF Views: 0

Authors

Bazeer Ahamed
Faculty of Computer Science and Engineering, Sathyabama University, Chennai - 600119, Tamil Nadu, India
T. Ramkumar
School of Information Technology and Engineering, VIT University, Vellore - 632 014, Tamil Nadu, India

Abstract


Objectives: This paper studies various query reformulation techniques, which are used to convert the intermediate schema to the targeted schema. The techniques such as Ontology based information integration and data integration languages are also reviewed. Methods/Statistical Analysis: This paper discusses the techniques used for data integration and also to resolve inconsistencies from the integrated data. Data integration techniques mainly focusing on integration of data in several levels and applying independent or unified query over the data available. Findings: Analysis of various techniques done in the paper has led to the identification of several shortcomings and scope for improvements in the available techniques. This identified research directions includes vertical enhancement of wrappers by utilizing a single unified wrapper for all the data sources. Optimizing the queries depending on the data source is also another major requirement to provide efficient and faster results reducing the data retrieval latencies. The paper also advocates other research directions that include identifying duplicates from the retrieved data and performing effective elimination strategies to reduce space consumption. Identifying conflicts and applying strategies to eliminate conflicts is another major area with a huge scope for improvement. Application/Improvements: The comprehensive survey also recommends further works in the area of data integration techniques.

Keywords


Conflict Identification, Conflict Resolution, Data Integration, Data Conflicts, Inconsistency Resolution.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i44%2F125309