Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Empirical Evaluation of Grid-Based Text Mining Tasks


Affiliations
1 Department of Information Technology and Science, Dr.G R Damodaran College of Science, Coimbatore, Tamil Nadu, India
2 Department of Computer Science, St. Joseph’s College for Women, India
     

   Subscribe/Renew Journal


The enormous amount of information stored in unstructured texts cannot simply be used for further processing by computers, which typically handle text as simple sequences of character strings. Text mining is the process of extracting interesting information and knowledge from unstructured text. This study implies an overview of our research activities aimed at efficient use of Grid infrastructure to solve various text mining tasks. Grid-enabling of various text mining tasks was mainly driven by increasing volume of processed data. Integration of text mining services into the distributed service oriented system enables plenty of various possibilities for building the distributed text mining services. Three different data driven distributed approaches for text mining have been proposed they are induction of decision trees, GHSOM clustering algorithm and FCA method. The objective of this work was to evaluate the concept that proposed by Min Sarnovskartý, which yielded favorable results.

Keywords

Decision Trees, GHSOM, Grid, Text Mining.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 242

PDF Views: 1




  • Empirical Evaluation of Grid-Based Text Mining Tasks

Abstract Views: 242  |  PDF Views: 1

Authors

R. Boobathiraj
Department of Information Technology and Science, Dr.G R Damodaran College of Science, Coimbatore, Tamil Nadu, India
J. Geetha
Department of Computer Science, St. Joseph’s College for Women, India

Abstract


The enormous amount of information stored in unstructured texts cannot simply be used for further processing by computers, which typically handle text as simple sequences of character strings. Text mining is the process of extracting interesting information and knowledge from unstructured text. This study implies an overview of our research activities aimed at efficient use of Grid infrastructure to solve various text mining tasks. Grid-enabling of various text mining tasks was mainly driven by increasing volume of processed data. Integration of text mining services into the distributed service oriented system enables plenty of various possibilities for building the distributed text mining services. Three different data driven distributed approaches for text mining have been proposed they are induction of decision trees, GHSOM clustering algorithm and FCA method. The objective of this work was to evaluate the concept that proposed by Min Sarnovskartý, which yielded favorable results.

Keywords


Decision Trees, GHSOM, Grid, Text Mining.