Open Access Open Access  Restricted Access Subscription Access

Development of Algorithm for Extracting Acronym Definitions from Gurmukhi Text


Affiliations
1 Computer Science Department, UCOE, Punjabi University, Patiala, India
2 Department of Computer Science, UCOE, Punjabi University, Patiala, India
 

In the recent years, an exponential increase of using new linguistic feature Acronyms in the available electronic information documents to produce shorter text causes a big necessity to quickly understand large volumes of data. In the recent years, Automatic Recognition of Acronym-Definition Pairs has gained a One challenge derives from the common and uncontrolled use of acronyms in the medical and political field. In this paper we have developed an Algorithm for Extracting Acronym Definitions from Gurmukhi Text by using First-Letter matching technique. For improving the recall we have provided Common Acronym-Definition Pairs database. We tested our system over two types of data- Medical and Political data. We have applied three standard measures 1) Recall 2) precision 3) F-Score. The proposed system achieved 86% recall, 86.04% precision, yielding F-score of 86.01% for Medical document and 86.53% recall, 95.55% precision, yielding F-score of 90.81% for Political document by using Database of common Acronym-Definition pairs.

Keywords

Acronyms, Acronym identification, Abbreviation, Tokenization, Normalization, Recall, Precision, Medline.
User
Notifications
Font Size

Abstract Views: 109

PDF Views: 0




  • Development of Algorithm for Extracting Acronym Definitions from Gurmukhi Text

Abstract Views: 109  |  PDF Views: 0

Authors

Manpreet Kaur
Computer Science Department, UCOE, Punjabi University, Patiala, India
Jagroop Kaur
Department of Computer Science, UCOE, Punjabi University, Patiala, India

Abstract


In the recent years, an exponential increase of using new linguistic feature Acronyms in the available electronic information documents to produce shorter text causes a big necessity to quickly understand large volumes of data. In the recent years, Automatic Recognition of Acronym-Definition Pairs has gained a One challenge derives from the common and uncontrolled use of acronyms in the medical and political field. In this paper we have developed an Algorithm for Extracting Acronym Definitions from Gurmukhi Text by using First-Letter matching technique. For improving the recall we have provided Common Acronym-Definition Pairs database. We tested our system over two types of data- Medical and Political data. We have applied three standard measures 1) Recall 2) precision 3) F-Score. The proposed system achieved 86% recall, 86.04% precision, yielding F-score of 86.01% for Medical document and 86.53% recall, 95.55% precision, yielding F-score of 90.81% for Political document by using Database of common Acronym-Definition pairs.

Keywords


Acronyms, Acronym identification, Abbreviation, Tokenization, Normalization, Recall, Precision, Medline.