Open Access Open Access  Restricted Access Subscription Access

Text News Classification System using Naive Bayes Classifier


Affiliations
1 Punajbi University, Patiala, India
2 Punjabi University, Patiala, India
 

This paper describes the Naive Bayes text News classification system developed for Punjabi Language. News corpus is used for training and testing purpose of the classifiers. Language specific preprocessing techniques are applied on raw data to generate a standardized and reduced-feature lexicon. Punjabi language is morphological rich language which makes those tasks complex. Statistical characteristics of corpus and lexicon are measured which show satisfactory results of text preprocessing module. We are able to get satisfactory results using Naive Bayes Classifier.

Keywords

Naive Bayes, Text classification, Punjabi.
User
Notifications
Font Size

Abstract Views: 136

PDF Views: 0




  • Text News Classification System using Naive Bayes Classifier

Abstract Views: 136  |  PDF Views: 0

Authors

Shruti Bajaj Mangal
Punajbi University, Patiala, India
Vishal Goyal
Punjabi University, Patiala, India

Abstract


This paper describes the Naive Bayes text News classification system developed for Punjabi Language. News corpus is used for training and testing purpose of the classifiers. Language specific preprocessing techniques are applied on raw data to generate a standardized and reduced-feature lexicon. Punjabi language is morphological rich language which makes those tasks complex. Statistical characteristics of corpus and lexicon are measured which show satisfactory results of text preprocessing module. We are able to get satisfactory results using Naive Bayes Classifier.

Keywords


Naive Bayes, Text classification, Punjabi.