Open Access Open Access  Restricted Access Subscription Access

Discovering Spatio-Temporal Patterns of Themes in Social Media


Affiliations
1 Department of Computer Science, Federal University of Technology, P. M. B. 704, Akure, Nigeria
 

Social networking website creates new ways for engaging people belonging to different communities, moral and social values to communicate and share valuable knowledge, therefore creating a large amount of data. The importance of mining social media cannot be over emphasized, due to significant information that are revealed which can be applied in different areas. In this paper, a systematic approach for traversing the content of weblog, considering location and time (spatiotemporal) is proposed. The proposed model is capable of searching for subjects in social media using Boyer Moore Horspool (BMH) algorithm with respect to location and time. BMH is an efficient string searching algorithm, where the search is done in such a way that every character in the text needs not to be checked and some characters can be skipped without missing the subject occurrence. Semantic analysis was carried out on the subject by computing the mean occurrence of the subject with the corresponding predicate and object from the total occurrence of the subject. Experiments were carried out on two datasets: the first category was crawled from twitter website from September to October 2014 and the second category was obtained from spinn3r dataset made available through the International AAAI (Association for the Advancement of Artificial Intelligence) Conference on Web and Social Media (ICWSM). The results obtained from tracking some subjects such as Islam and Obama shows that the mean occurrence of the analysis of the subject successfully reveals the pattern of the subject over a period of time for a specific location. Evaluation of the system which is based on performance and functionality reveals that the model performs better than some baseline models. The proposed model is capable of revealing spatiotemporal pattern for a subject, and can be applied in any area where spatiotemporal factor is to be considered.

Keywords

Boyer-Moore-Horspool Alogrithm, Search Processing, Spatio Temporal Pattern, Sementic Analysis.
User
Notifications
Font Size

  • Baumer, E. P. S., Sinclair, j., & Tomlinson, B. (2010).Human factor in computingsystems. America is like metamucil: fostering critical and creative thinking about metaphor in political blogs. Atlanta, GA, USA, 34-45.
  • Boyer, R. S., & Moore, J. S., A fast string searching algorithm. 20(10), 762-772, (1977). Communications of Association for Computing Machinery(ACM), New York City.
  • Charu, C. A., Text mining in social networks in social network data analytics. (2nd ed.). Springer, 353-374, (2011).
  • Blei, D., Ng, A.,& Jordan, M.,Latent Dirichlet allocation, Journal of Machine Learning Research. 3(1), 993–1022, (2003).
  • Budak, C., Agrawal, D., & El Abbadi, A., Structural trend analysis for online social networks. Proceedings of the VLDB Endowment, 4(10), 646-656, (2011).
  • El-Mabrouk, N., &Crochemore, M. (Ed).(1996) Boyer-More strategy to efficient approximate string matching. Combinatorial Pattern Matching, Labuna Beach, California, France
  • Hassan, S., Hurst, M., & Alexey, M. (2009). Event Detection and Tracking in Social Streams. Proceeding of International AAAI Conference on Weblogs and Social Media.Third International AAAI Conference on Weblogs and Social Media. Retrievedfrom http://aaai.org/ocs/index.php/ICWSM/09/paper/view/170/493
  • Hume A. and Sunday D., Fast String Searching software—Practice and experience, ACM Digital Library., 21(11), 1221–1248, (1991).
  • Jayanta, K. P., &Abhisek, S. , Identifying themes in social media and detectingsentiments. International journal of statistics and applications: 1(1) 14-19, (2011).
  • Kumar, R., Novak, P. R., & Tomkins A.,Structure and evolution of blogspace.Commun.ACM,47(12):35-39, 2004,.
  • Leetaru, K. H., Culturomics 2.0: Forecasting Large-Scale Human Behavior Using Global News Media Tone In Time And Space. Journal on the Internet, 16 (9), (2011). Retrieved from:http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/3663/3040.
  • Liu, B., Sentiment analysis and opinion mining. Synthesis lectures on human language technologies, 5(1), 1-167, (2012).
  • Mei, Q., Liu, C., Su, H., & Zhai,C.X.(2006). A probabilistic approach to spatiotemporal theme pattern mining on weblogs, WWW.
  • Mike, K., & Steve, M., (2008). Centre for Business Performance. The use of information in decision making-Literature review for the audit commission.Cranfield, U.S.A.
  • Mike, T., David, W., &Sukhvinder, U. (2009).Data Mining Emotion in Social Network Communication: Gender differences in MySpaceStatisticalCybermetrics Research Group. School of Computing and Information Technology, University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1SB, UK.
  • Nebel, M. E. (2011). Search Texts-But Fast! The Boyer-More-HorspoolAlgorithm.Springer- VerlagBerlin Heidelberg Germany, 10.1007/978-3-642-15328-0_6.
  • Ojokoh, B. A., Olayemi, O. C., &Adewale, O. S., .Generating Recommendation Status of Electronic Products from Online Reviews, 4, 1-10, (2012).Intelligent Control and Automation, doi:10.4236/ica.2013.41001.
  • Pang, B., & Lee, L., Opinion mining and sentiment analysis, 2(1), 1-35, (2008).Foundations and Trends in Information Retrieval, U.S.A.
  • Roick, O., &Heuser, S., Location based social networks–definition, current state of the art and research Agenda. Transactions in GIS, 17(5), 763-784, (2013).
  • Sowjanya,M., Ravindra,K., Kumar,R.Y.,(2014).Application of Concept-Based Mining Model in Text Clustering. International Journal of Computer Science and Information Technologies.5(5)6578-6582,(2014).
  • Twitter4J,(2014). Java library for the Twitter API2014.Retrieved from http://twitter4j.org/en/index.html.
  • Wang, X. & McCallum, A. (2006).Topics over time: a non-markov continuous-time model of topical trends, SIGKDD.
  • Wang, C., Wang, J., Xie X., and Ma W. Y. ( 2007). Mining geographic knowledge using location aware topic model.Proceedings of the 4th ACM Workshop On Geographic Information Retrieval, GIR. 65-70. DOI: 10.1145/1316948.1316967.
  • William, M. C, Charlie K. D., & Clifford J. W. (2013).Social Network Analysis with content and Graphs.
  • Zielinski, A., Middleton, S. E.,Tokarchuk, L. N., & Wang, X. (2013). Information systems for crisis response and management.Social media text mining and network analysis for decision support in natural crisis management.

Abstract Views: 166

PDF Views: 0




  • Discovering Spatio-Temporal Patterns of Themes in Social Media

Abstract Views: 166  |  PDF Views: 0

Authors

Tobore Igbe
Department of Computer Science, Federal University of Technology, P. M. B. 704, Akure, Nigeria
Bolanle Ojokoh
Department of Computer Science, Federal University of Technology, P. M. B. 704, Akure, Nigeria
Olumide Adewale
Department of Computer Science, Federal University of Technology, P. M. B. 704, Akure, Nigeria

Abstract


Social networking website creates new ways for engaging people belonging to different communities, moral and social values to communicate and share valuable knowledge, therefore creating a large amount of data. The importance of mining social media cannot be over emphasized, due to significant information that are revealed which can be applied in different areas. In this paper, a systematic approach for traversing the content of weblog, considering location and time (spatiotemporal) is proposed. The proposed model is capable of searching for subjects in social media using Boyer Moore Horspool (BMH) algorithm with respect to location and time. BMH is an efficient string searching algorithm, where the search is done in such a way that every character in the text needs not to be checked and some characters can be skipped without missing the subject occurrence. Semantic analysis was carried out on the subject by computing the mean occurrence of the subject with the corresponding predicate and object from the total occurrence of the subject. Experiments were carried out on two datasets: the first category was crawled from twitter website from September to October 2014 and the second category was obtained from spinn3r dataset made available through the International AAAI (Association for the Advancement of Artificial Intelligence) Conference on Web and Social Media (ICWSM). The results obtained from tracking some subjects such as Islam and Obama shows that the mean occurrence of the analysis of the subject successfully reveals the pattern of the subject over a period of time for a specific location. Evaluation of the system which is based on performance and functionality reveals that the model performs better than some baseline models. The proposed model is capable of revealing spatiotemporal pattern for a subject, and can be applied in any area where spatiotemporal factor is to be considered.

Keywords


Boyer-Moore-Horspool Alogrithm, Search Processing, Spatio Temporal Pattern, Sementic Analysis.

References