Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Named Entity Recognition from Biomedical Text-An Information Extraction Task


Affiliations
1 Department of Computer Science and Engineering, Manonmaniam Sundaranar University, India
2 Department of Computer Science and Engineering, Madanapalle Institute of Technology and Science, India
     

   Subscribe/Renew Journal


Biomedical Text Mining targets the Extraction of significant information from biomedical archives. Bio TM encompasses Information Retrieval (IR) and Information Extraction (IE). The Information Retrieval will retrieve the relevant Biomedical Literature documents from the various Repositories like PubMed, MedLine etc., based on a search query. The IR Process ends up with the generation of corpus with the relevant document retrieved from the Publication databases based on the query. The IE task includes the process of Preprocessing of the document, Named Entity Recognition (NER) from the documents and Relationship Extraction. This process includes Natural Language Processing, Data Mining techniques and machine Language algorithm. The preprocessing task includes tokenization, stop word Removal, shallow parsing, and Parts-Of-Speech tagging. NER phase involves recognition of well-defined objects such as genes, proteins or cell-lines etc. This process leads to the next phase that is extraction of relationships (IE). The work was based on machine learning algorithm Conditional Random Field (CRF).

Keywords

Information Extraction, Information Retrieval, Text Mining, Named Entity Recognition, Data Mining.
Subscription Login to verify subscription
User
Notifications
Font Size

Abstract Views: 231

PDF Views: 0




  • Named Entity Recognition from Biomedical Text-An Information Extraction Task

Abstract Views: 231  |  PDF Views: 0

Authors

N. Kanya
Department of Computer Science and Engineering, Manonmaniam Sundaranar University, India
T. Ravi
Department of Computer Science and Engineering, Madanapalle Institute of Technology and Science, India

Abstract


Biomedical Text Mining targets the Extraction of significant information from biomedical archives. Bio TM encompasses Information Retrieval (IR) and Information Extraction (IE). The Information Retrieval will retrieve the relevant Biomedical Literature documents from the various Repositories like PubMed, MedLine etc., based on a search query. The IR Process ends up with the generation of corpus with the relevant document retrieved from the Publication databases based on the query. The IE task includes the process of Preprocessing of the document, Named Entity Recognition (NER) from the documents and Relationship Extraction. This process includes Natural Language Processing, Data Mining techniques and machine Language algorithm. The preprocessing task includes tokenization, stop word Removal, shallow parsing, and Parts-Of-Speech tagging. NER phase involves recognition of well-defined objects such as genes, proteins or cell-lines etc. This process leads to the next phase that is extraction of relationships (IE). The work was based on machine learning algorithm Conditional Random Field (CRF).

Keywords


Information Extraction, Information Retrieval, Text Mining, Named Entity Recognition, Data Mining.