Open Access Open Access  Restricted Access Subscription Access

A Study on Regression Based Machine Learning Models to Predict the Student Performance


Affiliations
1 Bharati Vidyapeeth's College of Engineering for Women, Pune, India

   Subscribe/Renew Journal


This article discusses the use of three regression models (Linear Regression, Decision Tree Regression, and Random Forest Regression) to study the performance of high school students in India across three subjects: Physics, Chemistry, and Mathematics. The study identifies various factors that affect student performance, such as access to good internet connectivity, parental educational background, and lunch quality. The data was obtained from an educational firm and analyzed based on principles and methods that aid decision-making processes. The results showed that all three regression models produced accurate and plausible results, with an overall accuracy of approximately 95%. The study's primary objective was to provide a clear and concise comparative analysis of various Machine Learning techniques and their impact on the dataset and the predictive attributes analyzed. The findings from this study underscore the importance of considering various factors when analyzing student performance and highlight the effectiveness of Machine Learning techniques in this domain.

Keywords

Online Courses, Learning Analytics Dataset, Machine Learning, Tutor Marked Assessment, Receiver Operating Characteristic (ROC).
Subscription Login to verify subscription
User
Notifications
Font Size


Abstract Views: 37




  • A Study on Regression Based Machine Learning Models to Predict the Student Performance

Abstract Views: 37  | 

Authors

Kamlesh V. Patil
Bharati Vidyapeeth's College of Engineering for Women, Pune, India
Kiran D. Yesugade
Bharati Vidyapeeth's College of Engineering for Women, Pune, India
Kiran B. Naikwadi
Bharati Vidyapeeth's College of Engineering for Women, Pune, India

Abstract


This article discusses the use of three regression models (Linear Regression, Decision Tree Regression, and Random Forest Regression) to study the performance of high school students in India across three subjects: Physics, Chemistry, and Mathematics. The study identifies various factors that affect student performance, such as access to good internet connectivity, parental educational background, and lunch quality. The data was obtained from an educational firm and analyzed based on principles and methods that aid decision-making processes. The results showed that all three regression models produced accurate and plausible results, with an overall accuracy of approximately 95%. The study's primary objective was to provide a clear and concise comparative analysis of various Machine Learning techniques and their impact on the dataset and the predictive attributes analyzed. The findings from this study underscore the importance of considering various factors when analyzing student performance and highlight the effectiveness of Machine Learning techniques in this domain.

Keywords


Online Courses, Learning Analytics Dataset, Machine Learning, Tutor Marked Assessment, Receiver Operating Characteristic (ROC).



DOI: https://doi.org/10.16920/jeet%2F2024%2Fv38i2%2F24200