Open Access Open Access  Restricted Access Subscription Access

Enhancing Outlier Detection and Dimensionality Reduction in Machine Learning for Extreme Value Analysis


Affiliations
1 Dept. of Computer Applications, Lakshmi Narain College of Technology (MCA), Bhopal, India
2 Dept. of Computer Engineering, Lakshmi Narain College of Technology Excellence, Bhopal, India

An outlier is generally defined as an observation that significantly deviates from the rest of the considered compliance. In the realm of ultramodern machine literacy, where analysis of multidimensional datasets is current, improving data quality is imperative for economists aiming to gain robust results. Numerous machine learning algorithms are sensitive to both the range and distribution of trait values within the input data. The presence of outliers in the input data can distort and misguide the training process of machine literacy algorithms, leading to prolonged training times, less precise models, and eventually inferior issues. Prior to the construction of prophetic models using training data, outliers can introduce incorrect representations, thereby impacting interpretations of the collected data. Traditional outlier discovery methods frequently concentrate on banning the tails of distributions and overlooking the data generation processes specific to individual datasets. Colorful styles are useful for detecting different types of outliers in high-dimensional datasets from two distinct perspectives: relating the devious aspects of a data object and setting devious data objects within a dataset. N-dimensional data poses a significant challenge within the realm of machine literacy, contributing to the diversity of challenges faced in the field.

Keywords

Outlier detection, Machine Learning, High-Dimensional Data, Intrinsic Dimension (ID), k-Nearest Neighbor (k-NN)
User
Notifications
Font Size

Abstract Views: 95




  • Enhancing Outlier Detection and Dimensionality Reduction in Machine Learning for Extreme Value Analysis

Abstract Views: 95  | 

Authors

Virendra Kumar Tiwari
Dept. of Computer Applications, Lakshmi Narain College of Technology (MCA), Bhopal, India
Ashish Jain
Dept. of Computer Applications, Lakshmi Narain College of Technology (MCA), Bhopal, India
Rohit Singh
Dept. of Computer Applications, Lakshmi Narain College of Technology (MCA), Bhopal, India
Priyanka Singh
Dept. of Computer Engineering, Lakshmi Narain College of Technology Excellence, Bhopal, India

Abstract


An outlier is generally defined as an observation that significantly deviates from the rest of the considered compliance. In the realm of ultramodern machine literacy, where analysis of multidimensional datasets is current, improving data quality is imperative for economists aiming to gain robust results. Numerous machine learning algorithms are sensitive to both the range and distribution of trait values within the input data. The presence of outliers in the input data can distort and misguide the training process of machine literacy algorithms, leading to prolonged training times, less precise models, and eventually inferior issues. Prior to the construction of prophetic models using training data, outliers can introduce incorrect representations, thereby impacting interpretations of the collected data. Traditional outlier discovery methods frequently concentrate on banning the tails of distributions and overlooking the data generation processes specific to individual datasets. Colorful styles are useful for detecting different types of outliers in high-dimensional datasets from two distinct perspectives: relating the devious aspects of a data object and setting devious data objects within a dataset. N-dimensional data poses a significant challenge within the realm of machine literacy, contributing to the diversity of challenges faced in the field.

Keywords


Outlier detection, Machine Learning, High-Dimensional Data, Intrinsic Dimension (ID), k-Nearest Neighbor (k-NN)