Open Access Open Access  Restricted Access Subscription Access

Storage and Computation on Big Data: A Comparative Study


Affiliations
1 Vardhaman College of Engineering, Hyderabad, India
 

A huge data space includes set of interesting points; Skyline is an important operation in many applications to return a set of interesting points from a potentially huge data space [1]. This survey paper highlights the characteristics of big data and their challenges. This paper also discusses the tools and techniques of big data. The existing algorithms like SaLSa, SSPL are novel computation algorithms. SaLSa exploits the idea of presorting the input data so as to effectively limit the number of tuples to be read and compared [2]. SSPL utilizes sorted positional index lists which require low space overhead to reduce I/O cost significantly [1]. SSPL consists of two phases. In phase 1, SSPL computes scan depth of the involved sorted positional index lists. During retrieving the lists in a round-robin fashion, SSPL performs pruning on any candidate positional index to discard the candidate whose corresponding tuple is not skyline result. Phase 1 ends when there is a candidate positional index seen in all of the involved lists. In phase 2, SSPL exploits the obtained candidate positional indexes to get skyline results by a selective and sequential scan on the table [1].

Keywords

SaLSa, SSPL, Big Data.
User
Notifications
Font Size

Abstract Views: 201

PDF Views: 0




  • Storage and Computation on Big Data: A Comparative Study

Abstract Views: 201  |  PDF Views: 0

Authors

C. H. Sravan Kumar
Vardhaman College of Engineering, Hyderabad, India
P. Buddha Reddy
Vardhaman College of Engineering, Hyderabad, India
K. Srinivas
Vardhaman College of Engineering, Hyderabad, India

Abstract


A huge data space includes set of interesting points; Skyline is an important operation in many applications to return a set of interesting points from a potentially huge data space [1]. This survey paper highlights the characteristics of big data and their challenges. This paper also discusses the tools and techniques of big data. The existing algorithms like SaLSa, SSPL are novel computation algorithms. SaLSa exploits the idea of presorting the input data so as to effectively limit the number of tuples to be read and compared [2]. SSPL utilizes sorted positional index lists which require low space overhead to reduce I/O cost significantly [1]. SSPL consists of two phases. In phase 1, SSPL computes scan depth of the involved sorted positional index lists. During retrieving the lists in a round-robin fashion, SSPL performs pruning on any candidate positional index to discard the candidate whose corresponding tuple is not skyline result. Phase 1 ends when there is a candidate positional index seen in all of the involved lists. In phase 2, SSPL exploits the obtained candidate positional indexes to get skyline results by a selective and sequential scan on the table [1].

Keywords


SaLSa, SSPL, Big Data.