Open Access Open Access  Restricted Access Subscription Access

Generating Enhanced Web Log File using Advanced Data Cleansing Algorithm in Pre-Processing Phase


Affiliations
1 Department of Computer Science, Research and Development Centre, Bharathiyar University,Coimbatore – 641046, Tamil Nadu, India
2 Department of Computer Science and Engineering, Sri Ram Engineering College, Chennai – 602024, Tamil Nadu, India
 

Objectives: The major plan of this research concentrates to generate enhanced web log file using pre-processing stages. Designing an effective web site is a big challenge. With the development of Internet, web sites become a dynamic tool in the world. Methods/Statistical Analysis: Well management and good design are the vital individuality of a web site to congregate the consideration of visitors. There are various methods to generate web log files. An effective analysis is performed on user data on the Educational Institute web site. This can be done on web logs which are generated as a result of user’s access to web page. Data cleaning is performed on this web log file using Advanced Data Cleaning Algorithm (ADC) to generate Enhanced Web Log File. Findings: In this paper, the log file of an educational institute was taken, pre-processed and analyzed using an effective algorithm. The Advanced Data Cleaning Algorithm produced efficient and useful data for performing further stages of Pre-processing. It uses server log files to provide user access patterns. The Enhanced Log File contents are inserted into a database table. Application/Improvement: This paper mainly deals on techniques for enhanced data cleansing, user recognition and session detection for preprocessing stage.

Keywords

Enhanced Data Cleansing, Pre-processing, Session Detection, User Recognition, Web Server Logs.
User

Abstract Views: 179

PDF Views: 0




  • Generating Enhanced Web Log File using Advanced Data Cleansing Algorithm in Pre-Processing Phase

Abstract Views: 179  |  PDF Views: 0

Authors

J. Umarani
Department of Computer Science, Research and Development Centre, Bharathiyar University,Coimbatore – 641046, Tamil Nadu, India
S. Manikandan
Department of Computer Science and Engineering, Sri Ram Engineering College, Chennai – 602024, Tamil Nadu, India

Abstract


Objectives: The major plan of this research concentrates to generate enhanced web log file using pre-processing stages. Designing an effective web site is a big challenge. With the development of Internet, web sites become a dynamic tool in the world. Methods/Statistical Analysis: Well management and good design are the vital individuality of a web site to congregate the consideration of visitors. There are various methods to generate web log files. An effective analysis is performed on user data on the Educational Institute web site. This can be done on web logs which are generated as a result of user’s access to web page. Data cleaning is performed on this web log file using Advanced Data Cleaning Algorithm (ADC) to generate Enhanced Web Log File. Findings: In this paper, the log file of an educational institute was taken, pre-processed and analyzed using an effective algorithm. The Advanced Data Cleaning Algorithm produced efficient and useful data for performing further stages of Pre-processing. It uses server log files to provide user access patterns. The Enhanced Log File contents are inserted into a database table. Application/Improvement: This paper mainly deals on techniques for enhanced data cleansing, user recognition and session detection for preprocessing stage.

Keywords


Enhanced Data Cleansing, Pre-processing, Session Detection, User Recognition, Web Server Logs.



DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i48%2F140829