A Survey on Duplicate Detection Approaches in Hierarchical Data
Duplicate detection is the process of finding duplicate objects in data. It is an important part of the data cleansing step of data mining. A significant amount of work has been done on duplicate detection in relational data, but only recently have researchers shifted their focus towards duplicate detection in hierarchical and semi-structured data, e.g. XML. In this paper, we provide an overview of different methods for duplicate detection in hierarchical and semi-structured data.
Keywords
Data Cleansing, Duplicate Detection, XML, Data Mining, Hierarchical Data.