
Automatic Extraction of Personal Names from Structured Web Texts


Affiliations
1 Department of Information Science, University of Madras, Chennai 600 005, India
     



This paper describes a method for the automatic identification and extraction of personal names from structured web texts. A set of rules was defined and used to develop and implement a mechanism that extracts names of persons from a corpus of electronic texts. The results and the problems identified are presented.



Authors

K. Sivasamy
Department of Information Science, University of Madras, Chennai 600 005, India
K. S. Raghavan
Department of Information Science, University of Madras, Chennai 600 005, India

