Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Survey on Performance Analysis of Hadoop ETL for Disaster Management


Affiliations
1 Department of Information Technology, Madurai Sivakasi Nadars Pioneer Meenakshi Women’s College, Poovanthi, India
     

   Subscribe/Renew Journal


The term big data refers to data sets whose volume, variability and Speed of velocity make them difficult to capture, manage, procedure or analyzed. To examine this huge amount of data Hadoop is able to be used. Hadoop is an open source software project that enables the Spread giving out of large data sets across a cluster of creationservers.ETL Tools extract important information from various data sources, various transFormation’s of data are established out transformation phase and then load into the big data. HDFS (Hadoop Distributed File System), is a spread file system design to hold the very huge of data (peta bytes or even zetta bytes), and there high throughput Admission to this information. Map Reduce method has been calculated n this paper which is required for implement Big Data Analysis using HDFS. In this paper the related topics of Big Data Analytics, and Hadoop, ETL, Map Reduce are reviewed.


Keywords

Big Data, Hadoop, ETL, Map Reduce, HDFS.
User
Subscription Login to verify subscription
Notifications
Font Size

  • Varsha B.Bobad, “International Research Journal of Engineering and Technology (IRJET)”, Volume: 03 Issue: 01
  • Inmon, William "Data Mart Does Not Equal Data Warehouse".DM Review.com. (2000-07-18)
  • Jeffrey R. Bocarsly, “The Data Warehouse Toolkit.” Complex ETL Testing-A Strategic Approach
  • R. Kimball and M. Ross. WileyPublishing, Inc., 2002.
  • “Survey of Recent Research Progress and Issues in Big Data” www.cse.wustl.edu/~jain/cse570-13/ftp/bigdata2/index.html 1/13
  • Ms. Vibhavari Chavan, Prof. Rajesh. N. Phursule, ―”Survey Paper on Big Data”‖ International Journal of Computer Science and Information Technologies, Vol. 5 (6), 2014.
  • Ms. Vibhavari Chavan, Prof. Rajesh. N. Phursule, ―”Survey Paper on Big Data” International Journal of Computer Science and Information Technologies, Vol. 5 (6), 2014.
  • Amogh Pramod Kulkarni, Mahesh Khandewal, ―”Survey on Hadoop and Introduction to YARN”, International Journal of Emerging Technology and Advanced Engineering Website: www.ijetae.com (ISSN 2250-2459, ISO 9001:2008 Certified Journal, Volume 4, Issue 5, May 2014)
  • Tekiner F. and Keane J.A., Systems, Man and Cybernetics (SMC), ―”Big Data Framework” 2013 IEEE International Conference on 13–16 Oct. 2013, 1494–1499
  • Inmon, William (2000-07-18). "Data Mart Does Not Equal Data Warehouse". DMReview.com
  • V. Hristidis, S. Chen, T. Li, S. Luis, and Y. Deng, "Survey of Data Management and Analysis in Disaster Situations," Journal of Systems and Software, vol. 83, no. 10, pp. 17011714, 2010
  • Kuldeep Deshpande, and dr. Bhimappa desai limitations of dataware house platforms and Assessment of Hadoop as an Alternative, Volume 5, Issue 2, pp. 51-58,IJITMIS, 2014
  • Saleem, K., Luis, S., Deng, Y., Chen, S.-C., Hristidis, V., Li, T., 2008. "Towards a business Continuity information network for rapid disaster recovery.” In: Proceedings of the 9th Annual International Conference on Digital Government Research, Montreal, Canada, May 18–21, pp. 107–116
  • Sagiroglu, S.Sinanc, D.,‖ Big Data: A Review‖,2013, 20-24.
  • Tekiner F. and Keane J.A., Systems, Man and Cybernetics (SMC), ―”Big Data Framework”‖ 2013 IEEE International Conference on 13–16 Oct. 2013, 1494–1499
  • S.Vikram Phaneendra & E.Madhusudhan Reddy “Big Data-solutions for RDBMS problems-
  • A survey” In 12th IEEE/IFIP Network Operations & Management Symposium (NOMS 2010) (Osaka, Japan, Apr 19{23 2013).
  • Senthi Vadivel Bhupatthi Rav” Disaster Management: A Global Issue” International journal of civil and structural engineering Volume 1, No 1, 2010
  • Wang, F. Hadoop High Availability through Metadata Replication. ACM (2009).
  • Vibhavari Chavan et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 7932-7939

Abstract Views: 255

PDF Views: 5




  • Survey on Performance Analysis of Hadoop ETL for Disaster Management

Abstract Views: 255  |  PDF Views: 5

Authors

M. Saranya
Department of Information Technology, Madurai Sivakasi Nadars Pioneer Meenakshi Women’s College, Poovanthi, India

Abstract


The term big data refers to data sets whose volume, variability and Speed of velocity make them difficult to capture, manage, procedure or analyzed. To examine this huge amount of data Hadoop is able to be used. Hadoop is an open source software project that enables the Spread giving out of large data sets across a cluster of creationservers.ETL Tools extract important information from various data sources, various transFormation’s of data are established out transformation phase and then load into the big data. HDFS (Hadoop Distributed File System), is a spread file system design to hold the very huge of data (peta bytes or even zetta bytes), and there high throughput Admission to this information. Map Reduce method has been calculated n this paper which is required for implement Big Data Analysis using HDFS. In this paper the related topics of Big Data Analytics, and Hadoop, ETL, Map Reduce are reviewed.


Keywords


Big Data, Hadoop, ETL, Map Reduce, HDFS.

References





DOI: https://doi.org/10.36039/ciitaas%2F10%2F4%2F2018%2F172808.80-83