Open Access Open Access  Restricted Access Subscription Access

Natural Language Processing: Text Categorization and Classifications


Affiliations
1 Department of Computer science,Helwan University, Egypt
 

There are huge data from unstructured text obtained daily from various resources like emails, tweets, social media posts, customer comments, reviews, and reports in many different fields, etc. Unstructured text data can be analyzed to obtain useful information that will be used according to the purpose of the analysis also the domain that the data was obtained from it. Because of the huge amount of the data the human manually analysis of these texts is not possible, so we have to automatic analysis. The topic analysis is the Natural Language Processing (NLP) technology that organizes and understands large collections of text data, by identifying the topics, finding patterns and semantic. There two common approaches for topic analysis, topic modeling, and topic classification each approach has different algorithms to apply that will be discussed.

Keywords

Natural Language Processing, Topic Classification, Topic Modeling, Text Categorization.
User
Notifications
Font Size

Abstract Views: 191

PDF Views: 0




  • Natural Language Processing: Text Categorization and Classifications

Abstract Views: 191  |  PDF Views: 0

Authors

Mona Nasr
Department of Computer science,Helwan University, Egypt
Andrew karam
Department of Computer science,Helwan University, Egypt
Mina Atef
Department of Computer science,Helwan University, Egypt
Kirollos Boles
Department of Computer science,Helwan University, Egypt
Kirollos Samir
Department of Computer science,Helwan University, Egypt
Mario Raouf
Department of Computer science,Helwan University, Egypt

Abstract


There are huge data from unstructured text obtained daily from various resources like emails, tweets, social media posts, customer comments, reviews, and reports in many different fields, etc. Unstructured text data can be analyzed to obtain useful information that will be used according to the purpose of the analysis also the domain that the data was obtained from it. Because of the huge amount of the data the human manually analysis of these texts is not possible, so we have to automatic analysis. The topic analysis is the Natural Language Processing (NLP) technology that organizes and understands large collections of text data, by identifying the topics, finding patterns and semantic. There two common approaches for topic analysis, topic modeling, and topic classification each approach has different algorithms to apply that will be discussed.

Keywords


Natural Language Processing, Topic Classification, Topic Modeling, Text Categorization.