Open Access
Subscription Access
Open Access
Subscription Access
Locality Aware Scheduling Using Prefetching Technique in Hadoop
Subscribe/Renew Journal
Hadoop is a hastily growing environment of components for fulfilling the Google MapReduce algorithms in a scalable fashion on commodity hardware. Hadoop qualifies users to store and process large capacities of data and analyze it in ways not previously potential with less scalable solutions or standard SQL-based tactics. MapReduce offers a favorable programming model for big data processing. Data Locality is of most concern in MapReduce as to improve the performance and to decrease the network traffic. Many algorithms are there for improving the performance based on locality of data. Somehow there are many defects or more future work is there to be done in this area. "Moving computation to data is cheaper than moving computation to data." By following this Hadoop principle, Data Locality is the more effective performance metric for effective computation. In the proposed system, a new different approach is given to achieve the data locality in map phase. Here, task is assigned to the requesting node if it has the local data. If requesting node has non local data then the data is pre-fetched to this node from the nearest node. We consider progress of node to start prefetching. This approach will improve performance with faster computation and reduce the network traffic.
Keywords
Hadoop, MapReduce, Data Locality, Prefetching.
User
Subscription
Login to verify subscription
Font Size
Information
Abstract Views: 307
PDF Views: 1