Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Multi-Document Text Summarization Using Clustering Techniques and Lexical Chaining


Affiliations
1 Department of Information Technology, Pondicherry Engineering College, Pondicherry, India
2 Microsoft R&D India Private Limited, Hyderabad, India
     

   Subscribe/Renew Journal


This paper investigates the use of clustering and lexical chains to produce coherent summaries of multiple documents in text format to generate an indicative, less redundant summary. The summary is designed as per user's requirement of conciseness i.e., the documents are summarized according to the percentage input by the user. For achieving the above, various clustering techniques are used. Clustering is done at two levels, one at single document level and then at multi-document level. The clustered sentences are scored based on five different methods and lexically linked to produce the final summary in a text document.

Keywords

Hierarchical Clustering, Lexical Chaining, Precision, Recall.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 194

PDF Views: 0




  • Multi-Document Text Summarization Using Clustering Techniques and Lexical Chaining

Abstract Views: 194  |  PDF Views: 0

Authors

S. Saraswathi
Department of Information Technology, Pondicherry Engineering College, Pondicherry, India
R. Arti
Microsoft R&D India Private Limited, Hyderabad, India

Abstract


This paper investigates the use of clustering and lexical chains to produce coherent summaries of multiple documents in text format to generate an indicative, less redundant summary. The summary is designed as per user's requirement of conciseness i.e., the documents are summarized according to the percentage input by the user. For achieving the above, various clustering techniques are used. Clustering is done at two levels, one at single document level and then at multi-document level. The clustered sentences are scored based on five different methods and lexically linked to produce the final summary in a text document.

Keywords


Hierarchical Clustering, Lexical Chaining, Precision, Recall.