Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Semantic Annotator for Knowledge Graph Exploration : Pattern-Based NLP Technique


Affiliations
1 Indian Statistical Institute, Bangalore – 560059, Karnataka, India
     

   Subscribe/Renew Journal


Semantic Annotator for knowledge Graph Exploration, abbreviated as SAGE is a “Thing” annotation system. Here, “Thing” refers to any concept, named individuals (aka entities), entity relations, and attributes. The system is primarily built based on the idea of “string to thing” where the “string” is any given text (e.g., abstract of an article) as input by the user. For annotation, the system utilises knowledge graph(s). SAGE can be used by anyone for annotating Things and for their exploitation on the Web. The annotation of things is done through exact and partial matches. For exact matches, the system makes explicit the name of the knowledge graphs it is sourced from. It also shows the type hierarchies for the matched named entities. In the current work, we describe the SAGE annotation system, designed on pattern-based NLP techniques, along with its features and various usage, and the experimental results.

Keywords

Application, Automated Annotation, Entity Annotation, Knowledge Graph Exploration, NLP, Semantic Annotation, Semantic Annotation Platform, Thing Annotation, Thing Spotting.
User
About The Authors

Biswanath Dutta
Indian Statistical Institute, Bangalore – 560059, Karnataka
India

Puranjani Das
Indian Statistical Institute, Bangalore – 560059, Karnataka
India


Notifications

  • BioAssay Ontology. (n.d.). Retrieved from: https://bioportal. bioontology.org/ontologies/BAO
  • Blumaumer, A., and Kiryakov, A. (n.d.). Knowledge Graphs: 5 Use cases and 10 steps to get there - Ontotext. Retrieved from: https://www.ontotext.com/knowledgehub/webinars/knowledge-graphs-5-use-cases-and-10-steps-to-get-there/
  • Brat Rapid Annotation Tool. (n.d.). Retrieved from: https:// brat.nlplab.org/
  • Chabchoub, M., Gagnon, M. and Web, A. Z. (2018). FICLONE: Improving DBpedia spotlight using named entity recognition and collective disambiguation. Open Journal Semantic Web, 5(1), 12-28.
  • Chen, S., Karaoglu, A., Negreanu, C., Ma, T., Yao, J.-G., Williams, J., Jiang F, Gordon, A., Lin, C.-Y. (2022). LinkingPark: An automatic semantic table interpretation system. Journal of Web Semantics, 74. https://doi. org/10.1016/j.websem.2022.100733
  • Ciotti, M., Ciccozzi, M., Terrinoni, A., Jiang, W.-C., Wang C.-B., and Bernardini, S. (2020). The COVID-19 pandemic. Critical Reviews in Clinical Laboratory Sciences, 57(6), 365-388. https://doi.org/10.1080/10408363.2020. 1783198 PMid:32645276
  • CovidGraph. (n.d.). Retrieved from: https://healthecco.org/ covidgraph/
  • Daiber, J., Jakob, M., Hokamp, C., and Mendes, P. N. (2013). Improving efficiency and accuracy in multilingual entity extraction. In Proceedings of the 9th International Conference on Semantic Systems (I-SEMANTICS ‘13) (pp. 121-124.) Association for Computing Machinery, New York, NY, USA. https://doi. org/10.1145/2506182.2506198
  • DeBellis, M., and Dutta, B. (2021). The Covid-19 CODO development process: An agile approach to knowledge graph development. Communications in Computer and Information Science. 1459 CCIS, 153-168. https://doi. org/10.1007/978-3-030-91305-2_12
  • Doccano, GitHub. (n.d.). Retrieved from: https://github. com/doccano Dutta, B. and Das, P. (2023 April). SAGE: A semantic annotator for knowledge graph exploration. In ASIS&T Mid-Year Conference Expanding Horizons of Information Science and Technology and Beyond. Virtual. https://doi.org/10.5281/zenodo.7597207
  • Dutta, B., and DeBellis, M. (2020). CODO: An ontology for collection and analysis of Covid-19 data. In D. Aveiro, J. Dietz, & J. Filipe (Eds.), Proc of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management- KEOD (pp. 76-85). SciTePress. https://doi.org/10.5220/0010112500760085 PMid:32515358 PMCid:PMC7269891
  • Giunchiglia, F., Maltese, V., and Dutta, B. (2012). Domains and context: First steps towards managing diversity in knowledge. Journal of Web Semantics: science, Services and Agents on the World Wide Web, 12-13, 53-63. https://doi.org/10.1016/j.websem.2011.11.007
  • Google Knowledge Graph. (n.d.). Retrieved from: https://developers.google.com/knowledge-graph
  • Gupta, S., Szekely, P., Knoblock, C. A., Goel, A., Taheriyan, M., and Karma, M. M. (2012). A system for mapping structured sources into the Semantic Web. In Extended Semantic Web Conference (pp. 430-434). Springer, 2012. https://doi.org/10.1007/978-3-662-46641-4_40
  • He, Y., Yu, H., Ong, E., Wang, Y. and Liu, Y (2020). CIDO, a community-based ontology for coronavirus disease knowledge and data integration, sharing, and analysis. Scientific Data, 7(181). https://doi.org/10.1038/s41597-020-0523-6 PMid:32533075 PMCid:PMC7293349
  • Hogan, W. R., Hanna, J., Hicks, A., Amirova, S., Bramblett, B., Diller, M., Enderez, R., Modzelewski, T., Vasconcelos, M., and Delcher, C. (2017). Therapeutic indications and other use-case-driven updates in the drug ontology: Antimalarials, anti-hypertensives, opioid analgesics, and a large term request. Journal of Biomedical Semantics, 8(1). https://doi.org/10.1186/s13326-017-0121-5 PMid:28253937 PMCid:PMC5335794
  • Hogenboom, F., Frasincar, F., and Kaymak, U. (2010). An overview of approaches to extract information from natural language corpora. Information Foraging Lab, 69.
  • Huang, X., Zhang, J., Xu, Z. and Ou, L (2021). A knowledge graph-based question-answering method for medical domain. PeerJ Computer Science, 7. https://doi.org/10.7717/peerj-cs.667 PMid:34604514 PMCid:PMC8444078
  • Idehen, K. U. (2020). Linked data, ontologies, and knowledge graphs. Retrieved from: https://www.linkedin.com/pulse/linked-data-ontologies-knowledge-graphs-kingsley-uyi-idehen
  • Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., ... and Bizer, C. (2015). Dbpedia-a large-scale, multilingual knowledge base extracted from wikipedia. Semantic web, 6(2), 167-195. https://doi.org/10.3233/SW-140134
  • Lin, Y., Mehta, S., Küçük-McGinty, H., Turner, J. P., Vidovic, D., Forlin, M., Koleti, A., Nguyen, D. T., Jensen, L. J., Guha, R., Mathias, S. L., Ursu, O., Stathias, V., Duan, J., Nabizadeh, N., Chung, C., Mader, C., Visser, U., Yang, J. J., … and Schürer, S. C. (2017). Drug target ontology to classify and integrate drug discovery data. Journal of Biomedical Semantics, 8(1). https://doi.org/10.1186/s13326-017-0161-x PMid:29122012 PMCid:PMC5679337
  • Lotfi, M., Hamblin, M. and Acta, N. R. (2020). COVID19: Transmission, prevention, and potential therapeutic opportunities. Clinica Chimica Acta, 508, 254-266. https://doi.org/10.1016/j.cca.2020.05.044 PMid:32474009 PMCid:PMC7256510
  • Miller, G. A. (1995). WordNet: A lexical database for English. Communications of ACM, 38(11), 39-41. https://doi.org/10.1145/219717.219748
  • Nguyen, P., Kertkeidkachorn, N., Ichise, R., and Takeda, H. (2022) MTab4D: Semantic annotation of tabular data with DBpedia. Semantic Web. https://doi.org/10.3233/ SW-223098
  • Object Property Description, Protégé 5 Documentation, GitHub (n.d.). Retrieved from: http://protegeproject.github.io/protege/views/object-property-description/
  • Penn Part-of-Speech tags (n.d.) Retrieved from: https://cs.nyu.edu/~grishman/jet/guide/PennPOS.html
  • Vrandečić, D., and Krötzsch, M. (2014). Wikidata: A free collaborative knowledgebase. Communications of the ACM, 57(10), 78-85. https://doi.org/10.1145/2629489
  • Wolfram|Alpha. (n.d.). Retrieved from: https://www.wolframalpha.com/

Abstract Views: 197

PDF Views: 5




  • Semantic Annotator for Knowledge Graph Exploration : Pattern-Based NLP Technique

Abstract Views: 197  |  PDF Views: 5

Authors

Biswanath Dutta
Indian Statistical Institute, Bangalore – 560059, Karnataka, India
Puranjani Das
Indian Statistical Institute, Bangalore – 560059, Karnataka, India

Abstract


Semantic Annotator for knowledge Graph Exploration, abbreviated as SAGE is a “Thing” annotation system. Here, “Thing” refers to any concept, named individuals (aka entities), entity relations, and attributes. The system is primarily built based on the idea of “string to thing” where the “string” is any given text (e.g., abstract of an article) as input by the user. For annotation, the system utilises knowledge graph(s). SAGE can be used by anyone for annotating Things and for their exploitation on the Web. The annotation of things is done through exact and partial matches. For exact matches, the system makes explicit the name of the knowledge graphs it is sourced from. It also shows the type hierarchies for the matched named entities. In the current work, we describe the SAGE annotation system, designed on pattern-based NLP techniques, along with its features and various usage, and the experimental results.

Keywords


Application, Automated Annotation, Entity Annotation, Knowledge Graph Exploration, NLP, Semantic Annotation, Semantic Annotation Platform, Thing Annotation, Thing Spotting.

References





DOI: https://doi.org/10.17821/2023%2Fv60i1%2F170889