MapReduce: A Technical Review

T. Y. J. Naga Malleswari; G. Vadivu

doi:10.17485/ijst/2016/v9i1/130117

MapReduce: A Technical Review

Affiliations
1 Department of Computer Science and Engineering, SRM University, Chennai - 603203, Tamil Nadu, India
2 Department of Information Technology, SRM University, Chennai - 603203, Tamil Nadu, India

Abstract
References
Article Metrics
Refbacks

MapReduce, a programming model, allows parallel processing of large amount of data sets where various data mining techniques are not quite useful. It’s Map and Reduce functions can be customized by the developers according to their application. This paper gives an idea of MapReduce, its advantages and disadvantages. This paper also focuses on how MapReduce is used, how map and reduce computations are customized, implemented in several scenarios such as in medical field to generate medical reports by processing large medical data sets, stream processing and workflow scheduling in multi core processors, in distributed environment, for processing distributed data sets by using pilot abstractions. We also represent how MapReduce used for deduplication of files to save disk space in data centers. MapReduce based Pre-Post (MRPre-Post) a parallel data mining algorithm is adapted in Hadoop platform to achieve scalability. MapReduce is implemented in vHadoop (Virtual Hadoop), a scalable hadoop virtual cluster to process machine learning algorithms. The scenarios discussed in this paper help developers and researchers how to customize and use MapReduce in their applications.

Keywords

Big Data, Data Management, Distribute Environment, Iterative Computation, Large Scale Data Processing, MapReduce

About the Journal

Editorial Board

Current Issue

Archives

Advanced Search

Article Submission

Registration

Subscription

User

Information

Journal Content
Browse

Donations

Abstract Views: 207

PDF Views: 0

MapReduce: A Technical Review

Abstract Views: 207 | PDF Views: 0

Authors

T. Y. J. Naga Malleswari
Department of Computer Science and Engineering, SRM University, Chennai - 603203, Tamil Nadu, India

G. Vadivu
Department of Information Technology, SRM University, Chennai - 603203, Tamil Nadu, India

Abstract

Keywords

Big Data, Data Management, Distribute Environment, Iterative Computation, Large Scale Data Processing, MapReduce

DOI: https://doi.org/10.17485/ijst%2F2016%2Fv9i1%2F130117

Username
Password
Remember me

Username
Password
Remember me

Indian Journal of Science and Technology

MapReduce: A Technical Review

Keywords

MapReduce: A Technical Review

Authors

Abstract

Keywords