Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Review of Adaptive Data Stream Classification


Affiliations
1 Information Technology Department, Walchand College of Engineering, Sangli, India
2 Computer Science and Engineering Department, Walchand College of Engineering, Sangli, India
     

   Subscribe/Renew Journal


Data stream classification is a method of mining knowledge from continuous data points. It is classification and prediction task for evolving data streams. For a non-stationary dataset, the data stream classification is posed with the number of challenges like concept drift, infinite length, concept evolution and feature evolution. Data stream is an unending flow of data, which is generated continuously at a rapid rate. As data streams are of infinite length, traditional multi-pass learning algorithms are not applicable as they may require large amount of storage space and training time. Concept drift arrives when the class definition of some instances changes with time. Concept evolution is emergence of new class as stream progresses. However, it is possible that both concept drift and concept evolution may arrive at the same time. By considering these problems, it is challenging to learn a classification model that is consistent with the current concept. Feature evolution occurs when feature space changes with new stream instances, then the feature space of classification model and new unlabelled data would be different, which affects classification accuracy. This paper discusses the different approaches to solve the issues in data stream classification.

Keywords

Data Stream, Ensemble Learning, Outliers, Novel Class.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 258

PDF Views: 2




  • Review of Adaptive Data Stream Classification

Abstract Views: 258  |  PDF Views: 2

Authors

Mayura B. Shinde
Information Technology Department, Walchand College of Engineering, Sangli, India
Hetal V. Gandhi
Computer Science and Engineering Department, Walchand College of Engineering, Sangli, India

Abstract


Data stream classification is a method of mining knowledge from continuous data points. It is classification and prediction task for evolving data streams. For a non-stationary dataset, the data stream classification is posed with the number of challenges like concept drift, infinite length, concept evolution and feature evolution. Data stream is an unending flow of data, which is generated continuously at a rapid rate. As data streams are of infinite length, traditional multi-pass learning algorithms are not applicable as they may require large amount of storage space and training time. Concept drift arrives when the class definition of some instances changes with time. Concept evolution is emergence of new class as stream progresses. However, it is possible that both concept drift and concept evolution may arrive at the same time. By considering these problems, it is challenging to learn a classification model that is consistent with the current concept. Feature evolution occurs when feature space changes with new stream instances, then the feature space of classification model and new unlabelled data would be different, which affects classification accuracy. This paper discusses the different approaches to solve the issues in data stream classification.

Keywords


Data Stream, Ensemble Learning, Outliers, Novel Class.