Extraction of Replicated Punjabi Multiword Expressions


Affiliations
1 Department of Computer Science, Punjabi University, Patiala, India
 

Authors

Kapil Dev Goyal
Department of Computer Science, Punjabi University, Patiala, India
Vishal Goyal
Department of Computer Science, Punjabi University, Patiala, India

Abstract


Multiword Expressions (MWEs) play a vital role in Natural Language Processing. A Multiword Expression is a combination of two or more words that is treated as a single word. Punjabi has many varieties of MWEs, and many of these are of types not found in English. In this paper, we discuss the different types of MWEs encountered in Punjabi. For example, replicated words, word combinations with an antonym, synonym, hyponym, gender or number variant, and the ‘waala’ morpheme have not been identified as MWEs in English. Rule-based approaches, statistical methods, and linguists’ approaches have been used for MWE identification and extraction. In this paper, we present a methodology for the identification and extraction of Punjabi MWEs using statistical methods, rule-based methods, and linguists’ approaches.
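
As a rough illustration of what a rule-based check for replicated words might look like, the sketch below flags any token that is immediately followed by an identical token. This is not the authors' method, which the abstract does not detail; the function name `find_replicated_words`, the whitespace tokenization, and the romanized sample sentence are all assumptions made for the example.

```python
def find_replicated_words(tokens):
    """Return (index, word) pairs where a token is immediately repeated.

    Assumption: replication is approximated as exact adjacent duplication.
    """
    matches = []
    i = 0
    while i < len(tokens) - 1:
        if tokens[i] == tokens[i + 1]:
            matches.append((i, tokens[i]))
            i += 2  # skip past the pair so a triple is not counted twice
        else:
            i += 1
    return matches


# Hypothetical romanized sample, not data from the paper.
sample = "oh hauli hauli turda hai".split()
print(find_replicated_words(sample))
```

A check like this would only produce candidates; partial replication and the statistical or linguist-driven filtering mentioned in the abstract are outside what this sketch covers.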
