Open Access Open Access  Restricted Access Subscription Access

A Text Mining Approach to Extract Opinions from Unstructured Text


Affiliations
1 PG and Research Department of Computer Science, Quaid-E-Millath Government College for Women (Autonomous), Chennai - 600002, Tamil Nadu, India
 

Background/Objectives: To extract and interpret public opinion from informal description of text in social media websites. Methods: The informal descriptions containing opinions are Tokenized, Parts of Speech Tagging, Word-sense Disambiguation and Text Transformation/Attribute Generation are employed. Sample data pertaining to performance rating of cricket players was collected from twitter, cricinfo and cricbuzz. The reviews were rated against subjective evaluation criteria, the scale ranging from poor, moderate, good, excellent and the linguistic variables were converted into numerical values using fuzzy since the expressed opinions may be easy to understand. Findings: Opinions enhance any decision making process, so the influence of opinions from sample data pertaining to performance rating of cricket players are classified as poor, moderate, good and excellent. Application/Improvements: Using the proposed approach a generic prototype can be built that can be used to extract opinions and interpret.

Keywords

Information Extraction, Opinion Mining, Unstructured Text.
User

Abstract Views: 186

PDF Views: 0




  • A Text Mining Approach to Extract Opinions from Unstructured Text

Abstract Views: 186  |  PDF Views: 0

Authors

Ananthi Sheshasaayee
PG and Research Department of Computer Science, Quaid-E-Millath Government College for Women (Autonomous), Chennai - 600002, Tamil Nadu, India
R. Jayanthi
PG and Research Department of Computer Science, Quaid-E-Millath Government College for Women (Autonomous), Chennai - 600002, Tamil Nadu, India

Abstract


Background/Objectives: To extract and interpret public opinion from informal description of text in social media websites. Methods: The informal descriptions containing opinions are Tokenized, Parts of Speech Tagging, Word-sense Disambiguation and Text Transformation/Attribute Generation are employed. Sample data pertaining to performance rating of cricket players was collected from twitter, cricinfo and cricbuzz. The reviews were rated against subjective evaluation criteria, the scale ranging from poor, moderate, good, excellent and the linguistic variables were converted into numerical values using fuzzy since the expressed opinions may be easy to understand. Findings: Opinions enhance any decision making process, so the influence of opinions from sample data pertaining to performance rating of cricket players are classified as poor, moderate, good and excellent. Application/Improvements: Using the proposed approach a generic prototype can be built that can be used to extract opinions and interpret.

Keywords


Information Extraction, Opinion Mining, Unstructured Text.



DOI: https://doi.org/10.17485/ijst%2F2015%2Fv8i36%2F130062