Open Access Open Access  Restricted Access Subscription Access

Automatic Extraction of Spatio-Temporal Information from Arabic Text Documents


Affiliations
1 Department of Computer Science, Mentouri 2 University, Constantine, Algeria
2 Hamma Lakhdar El-Oued University, Algeria
 

Unstructured Arabic text documents are an important source of geographical and temporal information. The possibility of automatically tracking spatio-temporal information, capturing changes relating to events from text documents, is a new challenge in the fields of geographic information retrieval (GIR), temporal information retrieval (TIR) and natural language processing (NLP). There was a lot of work on the extraction of information in other languages that use Latin alphabet, such as English, French, or Spanish, by against the Arabic language is still not well supported in GIR and TIR and it needs to conduct more researches. In this paper, we present an approach that support automated exploration and extraction of spatio-temporal information from Arabic text documents in order to capture and model such information before it can be utilized in search and exploration tasks. The system has been successfully tested on 50 documents that include a mixture of types of Spatial/temporal information. The result achieved 91.01% of recall and of 80% precision. This illustrates that our approach is effective and its performance is satisfactory.

Keywords

Arabic NLP, Information Extraction, Temporal Data, Spatial Data, Gazetteers, Gis.
User
Notifications
Font Size

Abstract Views: 307

PDF Views: 180




  • Automatic Extraction of Spatio-Temporal Information from Arabic Text Documents

Abstract Views: 307  |  PDF Views: 180

Authors

Abdelkoui Feriel
Department of Computer Science, Mentouri 2 University, Constantine, Algeria
Kholladi Mohamed Khireddine
Hamma Lakhdar El-Oued University, Algeria

Abstract


Unstructured Arabic text documents are an important source of geographical and temporal information. The possibility of automatically tracking spatio-temporal information, capturing changes relating to events from text documents, is a new challenge in the fields of geographic information retrieval (GIR), temporal information retrieval (TIR) and natural language processing (NLP). There was a lot of work on the extraction of information in other languages that use Latin alphabet, such as English, French, or Spanish, by against the Arabic language is still not well supported in GIR and TIR and it needs to conduct more researches. In this paper, we present an approach that support automated exploration and extraction of spatio-temporal information from Arabic text documents in order to capture and model such information before it can be utilized in search and exploration tasks. The system has been successfully tested on 50 documents that include a mixture of types of Spatial/temporal information. The result achieved 91.01% of recall and of 80% precision. This illustrates that our approach is effective and its performance is satisfactory.

Keywords


Arabic NLP, Information Extraction, Temporal Data, Spatial Data, Gazetteers, Gis.