Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Predicting Institute Graduation Rate with Genetic Algorithm Assisted Regression for Education Data Mining


Affiliations
1 Department of Information Technology, Gujarat Technological University, India
2 Technology Department, Illinois State University, United States
     

   Subscribe/Renew Journal


In recent era of Digitization, large amount of computer generated data is accumulated on servers. However, these gathered data is useful only when interesting, novel and applicable knowledge is generated out of it. In the field of education, digitized data is generated by online academic activities. Data mining is discussed with every aspect of society however its use in academia is at infancy. Data mining is used to find novel, interesting and useful knowledge out of data which directs to actionable patterns on which academicians could work to enhance the productivity of academic activities. Education data mining is focused to use educational data for knowledge discovery to attain valuable insights in education domain. In this paper, an important task of prediction of institute graduation rate is addressed. Two novel approaches are proposed in the paper for effective graduation rate prediction. The first approach is genetic algorithm assisted regression model. The second approach investigates and uses various filter methods to further enhance the results in terms of time and number of features. Three regression models – multiple linear regression, decision tree regression and support vector regression are considered for experiments and comparative results are produced. The proposed methods provide better institute graduation rate prediction.

Keywords

Genetic Algorithm, Regression, Education Data Mining, Filter Method.
Subscription Login to verify subscription
User
Notifications
Font Size

  • B.L. Deekshatulu and Priti Chandra, “Classification of Heart Disease using K- Nearest Neighbor and Genetic Algorithm”, Proceedings of International Conference on Computational Intelligence: Modelling Techniques and Applications, pp. 85-93, 2013.
  • T. Santhanam and M.S Padmavathi, “Application of K-Means and Genetic Algorithms for Dimension Reduction by Integrating SVM for Diabetes Diagnosis”, Procedia Computer Science, Vol. 47, pp. 76-83, 2015.
  • Jingyuan Dai, “A Novel Medical Assistance System Based on Data Mining”, Proceedings of IEEE Workshop on Advanced Research and Technology in Industry Applications, pp. 264-273, 2014.
  • Antonis Lambrou, Harris Papadopoulos, and Alex Gammerman, “Reliable Confidence Measures for Medical Diagnosis with Evolutionary Algorithms”, IEEE Transaction on Information Technology in Biomedicine, Vol. 15, No. 1, pp. 1-23, 2011.
  • P. Johnson, “Genetic Algorithm with Logistic Regression for Prediction of Progression to Alzheimer’s Disease”, BMC Bioinformatics, Vol. 15, No. 16, pp. 1-14, 2014.
  • N. Devasenathipathi and Nilesh Modi, “Applying GA to Improve Students’ Academic Performance by Group Formation”, International Journal of Data Warehousing and Mining, Vol. 1, No. 2, pp. 142-146, 2011.
  • N. Devasenathipathi and Nilesh Modi, “Evolutionary Algorithm Approach to Pupils’ Pedantic Accomplishment”, Proceedings of International Conference on Frontiers of Intelligent Computing: Theory and Applications, pp. 415-423, 2013.
  • N. Devasenathipathi and Nilesh Modi, “Contemplating Crossover Operators of Genetic Algorithm for Student Group Formation Problem”, International Journal of Emerging Technology and Advanced Engineering, Vol. 2, No. 2, pp. 192-197, 2012.
  • Behrouz Minaei-Bidgoli and William F. Punch, “Using Genetic Algorithm for Data Mining Optimization in an Educational Web-based System”, Proceedings of International Conference on Genetic and Evolutionary Computation, pp. 2252-2258, 2003.
  • Fateme Moslehi and Abdorrahman Haeri, “A Novel Hybrid Wrapper–Filter”, Ambient Intelligence and Humanized Computing, Vol. 11, No. 3, pp.1105-1127, 2020.
  • N. Shelke and S. Gadage, “A Survey of Data Mining Approaches in Performance Analysis and Evaluation”, International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 5, No. 2, pp. 456-459, 2015.
  • Q. Hung Do and Jeng Fung Chen, “A Neuro-Fuzzy Approach in the Classification of Students’ Academic Performance”, Computational Intelligence and Neuroscience, Vol. 2013, pp. 1-8, 2013.
  • K. Barker, T.B. Trafalis and T. Reed, “Learning from Student Data”, Proceedings of International Conference on Systems and Information Engineering Design, pp. 79-84, 2004.
  • C. Romero and S. Ventura, “Data Mining in Education”, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Vol. 3, No. 1, pp. 12-27, 2013.
  • A. Merceron and K. Yacef, “Education Data Mining: A Case Study”, Proceedings of Conference on Artificial Intelligence in Education: Supporting Learning through Intelligent and Socially Informed Technology, pp. 467-474, 2005.
  • N. Thai-Nighe, Paul Janecek and P. Haddawy, “A Comparative Analysis of Techniques for Predicting Academic Performance”, Proceedings of International Conference on Frontiers in Education, pp. 1-7, 2007.
  • S. Huang and N. Fang, “Predicting Student Academic Performance in an Engineering Dynamics Course: A Comparison of Four Types Predictive Mathematical Models”, Computers and Education, Vol. 61, No. 1, pp. 133-145, 2013.
  • P. Thakar, A. Mehta and Manisha, “Performance Analysis and Prediction in Education Data Mining: A Research Travelogue”, International Journal of Computer Applications, Vol. 110, No. 15, pp. 60-68, 2015.
  • B. Bhardwaj and S. Pal, “Data Mining: A Prediction for Performance Improvement using Classification”, International Journal of Computer Science and Information Security, Vol. 9, No. 4, pp. 1-5, 2011.
  • G. Rao and D.L.S. Reddy, “An Analysis of Education Data Mining in Advanced Education System”, International Journal of Science and Research, Vol. 4, No. 12, pp. 2149-2153, 2015.
  • D. Fatima, S. Fatima and A.V. Krishna Prasad, “A Survey on Research Work in EDM”, IOSR Journal of Computer Engineering, Vol. 17, No. 2, pp. 43-49, 2015.
  • E. Jormanainen, “A Frame Work for Research on Technology-Enhanced Special Education”, Proceedings of 7th IEEE International Conference on Advanced Learning Technologies, pp. 1-9, 2007.
  • J. Kumar, “A Comprehensive Study of Education Data Mining”, International Journal of Electrical Electronics and Computer Science Engineering, Vol. 4, No. 2, pp. 58-63, 2015.
  • E. Taherifar and T. Banirostam, “Assessment of Student Feedback from the Training Course and Instructor Performance through the Combination of Clustering Methods and Decision Tree Algorithms” International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 6, No. 2, pp. 1-5, 2016.
  • E. Balraj and D. Malini, “A Survey on Predicting Student Dropout Analysis using Data Mining Algorithms”, International Journal of Pure and Applied Mathematics, Vol. 18, No. 8, pp. 621-626, 2018.
  • A. Mayra and D. Mauricio, “Factors to Predict Dropout at the Universities: A Case of Study in Ecuador”, Proceedings of IEEE International Conference on Global Engineering Education, pp. 1238-1243, 2018.
  • Tao Zhang, “Predicting the Performance Fluctuation of Students based on Long-Term and Short-Term Data”, Proceedings of International Conference of Educational Innovation Through Technology, pp. 126-134, 2017.
  • A. Daud, “Predicting Student Performance using Advanced Learning Analytics”, Proceedings of International Conference on World Wide Web Companion, pp. 415-423, 2017.
  • M. Bucos and B. Dragulescu, “Predicting Student Success using Data Generated in Traditional Educational Environments”, TEM Journal, Vol. 7, No. 3, pp. 617-625, 2018.
  • Ali Buldu and Kerem Ucgun, “Data Mining Application on Students’ Data”, Procedia-Social and Behavioral Sciences, Vol. 2, No. 2, pp. 5251-5259, 2010.
  • Dorina Kabakchieva, “Predicting Student Performance by using Data Mining Methods for Classification”, Cybernetics and Information Technologies, Vol. 13, No. 1, pp. 61-72, 2013.
  • C.J. Carmona, P. Gonzalez, M.J. del Jesus, C. Romero and S. Ventura, “Evolutionary Algorithms for Subgroup Discovery Applied to E-Learning Data”, Proceedings of IEEE International Conference on Education Engineering, pp. 1-7, 2010.
  • Cristobal Romero and Sebastian Ventura, “Education Data Mining: A Review of the State of the Art”, IEEE Transactions on Systems, Man and Cybernetics-Part C: Applications and Reviews, Vol. 40, No. 6, pp. 1-21, 2010.
  • C. Romero and S. Ventura, “Educational Data Mining: A Survey from 1995 to 2005”, Expert Systems with Applications, Vol. 33, No. 1, pp. 135-146, 2007.
  • NCES, “National Center for Education Statistics”, Available at: https://nces.ed.gov/ipeds/trendgenerator/, Accessed at 2019.
  • Brenda L. Bailey, “Let the Data Talk: Developing Models to Explain IPEDS Graduation Rates’ Data”, Wiley Inter Science-Special Issue: Data Mining in Action: Case Studies of Enrollment Management, Vol. 2006, No. 131, pp. 101-115, 2006.
  • Angela E. Henderson and William F. Punch, “Predicting U.S. News and World Report Ranking of Regional Universities in the South using Public Data”, Ph.D. Dissertation, School of Education, Colorado State University, pp. 1-126, 2017.
  • Abby Miller, Sue Clery and Amy Topper, “Assessing the Capacity of IPEDS to Collect Transfer Student Data”, Project Report, NPEC, Coffey Consulting, 2018.
  • Joshua Lee Whitlock, “Using Data Science and Predictive Analytics to Understand 4-Year University Student Churn”, Ph.D. Dissertation, Department of Educational Leadership and Policy Analysis, East Tennessee State University, pp. 1-157, 2018.
  • UCI, “University of California Irvine-Machine Learning Repository”. Available at: https://archive.ics.uci.edu/ml/index.php, Accessed at 2018.

Abstract Views: 176

PDF Views: 0




  • Predicting Institute Graduation Rate with Genetic Algorithm Assisted Regression for Education Data Mining

Abstract Views: 176  |  PDF Views: 0

Authors

Mala H. Mehta
Department of Information Technology, Gujarat Technological University, India
N. C. Chauhan
Department of Information Technology, Gujarat Technological University, India
Anu Gokhale
Technology Department, Illinois State University, United States

Abstract


In recent era of Digitization, large amount of computer generated data is accumulated on servers. However, these gathered data is useful only when interesting, novel and applicable knowledge is generated out of it. In the field of education, digitized data is generated by online academic activities. Data mining is discussed with every aspect of society however its use in academia is at infancy. Data mining is used to find novel, interesting and useful knowledge out of data which directs to actionable patterns on which academicians could work to enhance the productivity of academic activities. Education data mining is focused to use educational data for knowledge discovery to attain valuable insights in education domain. In this paper, an important task of prediction of institute graduation rate is addressed. Two novel approaches are proposed in the paper for effective graduation rate prediction. The first approach is genetic algorithm assisted regression model. The second approach investigates and uses various filter methods to further enhance the results in terms of time and number of features. Three regression models – multiple linear regression, decision tree regression and support vector regression are considered for experiments and comparative results are produced. The proposed methods provide better institute graduation rate prediction.

Keywords


Genetic Algorithm, Regression, Education Data Mining, Filter Method.

References