Efficient Skew Handling for Outer Joins in a Cloud Computing Environment - 2018


Outer joins are ubiquitous in several workloads and Big Data systems. The question of a way to best execute outer joins in large parallel systems is notably difficult, as universe datasets are characterized by data skew resulting in performance issues. Although skew handling techniques have been extensively studied for inner joins, there is little printed work solving the corresponding problem for parallel outer joins, particularly within the extremely popular Cloud computing surroundings. Standard approaches to the problem like ones based on hash redistribution typically lead to load balancing issues whereas duplication-based approaches incur significant overhead in terms of network communication. In this Project, we propose a brand new approach for economical skew handling in outer joins over a Cloud computing atmosphere. We present an efficient implementation of our approach over the Spark framework. We tend to evaluate the performance of our approach on a 192-core system with giant take a look at datasets in excess of one hundred GB and with varying skew. Experimental results show that our approach is scalable and, a minimum of in cases of high skew, considerably faster than the state-of-the-art.

Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here

PROJECT TITLE :Efficient Secure Outsourcing of Large-Scale Sparse Linear Systems of Equations - 2018ABSTRACT:Solving large-scale sparse linear systems of equations (SLSEs) is one in all the foremost common and basic problems in
PROJECT TITLE :Distributed Feature Selection for Efficient Economic Big Data Analysis - 2018ABSTRACT:With the rapidly increasing popularity of economic activities, a large amount of economic data is being collected. Although
PROJECT TITLE :Efficient Wideband DOA Estimation Through Function Evaluation Techniques - 2018ABSTRACT:This Project presents an economical analysis methodology for the functions involved within the computation of direction-of-arrival
PROJECT TITLE :Efficient System Tracking With Decomposable Graph-Structured Inputs and Application to Adaptive Equalization With Cyclostationary Inputs - 2018ABSTRACT:This Project introduces the graph-structured recursive least
PROJECT TITLE :Efficient Partial-Sum Network Architectures for List Successive-Cancellation Decoding of Polar Codes - 2018ABSTRACT:List successive cancellation decoder (LSCD) architectures have been recently proposed for the decoding

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry