Open Access
Subscription Access
Open Access
Subscription Access
Updating Solving Set Algorithm of Outlier Detection to Reduce the Iterations for Large Data Sets and its Application to Fault Diagnosis
Subscribe/Renew Journal
In this paper original solving set algorithm for detection of possible outliers is updated to have less iterations and thus there by less time. Original algorithm selects initial solving set randomly, but if we select this set carefully using standard deviation of each pattern with respect to each other. The proposed modification requires less time and iterations than the original one. Our experimentation says that this modification requires around half to two third of the patterns in the initial solving set having maximum standard deviation. We have compared original and updated algorithms using synthetic 2-dimensional data set, as described in section II, as well as a fault diagnosis data set from NASA. We observed that the time required to detect outliers for updated algorithm is less than the original one and it exhibit better outlier detection rate than the original one along with better cluster entropy. Better outlier detection rate, less time required and better cluster entropy are the key features of this modification that makes it suitable for outlier detection from large data sets.
Keywords
Data Mining, Distance-Based Outlier, Fault Diagnosis, Outlier Detection.
User
Subscription
Login to verify subscription
Font Size
Information
Abstract Views: 288
PDF Views: 3