Fault Tolerant Stencil Computation on Cloud-based GPU Spot Instances - 2017


This paper describes a fault-tolerant framework for distributed stencil computation on cloud-based GPU clusters. It uses pipelining to overlap data movement with computation in the halo region, and it parallelizes data movement among the GPUs. Instead of running stencil codes on traditional clusters and supercomputers, the computation is performed on the Amazon Web Services GPU cloud, using spot instances to improve cost-efficiency. The implementation is based on a low-cost fault-tolerant mechanism that handles the possible termination of spot instances. With a price bidding module included, our stencil framework optimizes not only for performance but also for cost. Experimental results show that our framework outperforms state-of-the-art solutions, achieving a peak of 25 TFLOPS for 2-D decomposition running on 512 nodes. We also show that the use of spot instances yields good cost-efficiency, increasing the average TFLOPS/USD from 132 to 360.
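
The abstract does not say how the fault-tolerant mechanism works, so the following is only a minimal sketch of one common approach, periodic checkpointing with restart, applied to a small 5-point Jacobi stencil on a single process. Every name here (`jacobi_step`, `run`, `SpotTerminated`, `store`, `fail_at`) is hypothetical, the in-memory dictionary stands in for durable storage, and the raised exception stands in for a spot instance being reclaimed. It is not the paper's implementation.

```python
import copy


class SpotTerminated(Exception):
    """Hypothetical stand-in for a spot instance being reclaimed."""


def jacobi_step(grid):
    """Apply one 5-point Jacobi sweep; boundary cells are held fixed."""
    rows, cols = len(grid), len(grid[0])
    new = [row[:] for row in grid]
    for i in range(1, rows - 1):
        for j in range(1, cols - 1):
            new[i][j] = 0.25 * (grid[i - 1][j] + grid[i + 1][j]
                                + grid[i][j - 1] + grid[i][j + 1])
    return new


def run(grid, steps, checkpoint_every, store, fail_at=None):
    """Advance to `steps` sweeps, resuming from `store` if it holds a checkpoint."""
    start = 0
    if "grid" in store:
        # Resume from the last saved state instead of starting over.
        start, grid = store["step"], copy.deepcopy(store["grid"])
    for step in range(start, steps):
        if fail_at is not None and step == fail_at:
            raise SpotTerminated(step)  # simulated termination
        grid = jacobi_step(grid)
        if (step + 1) % checkpoint_every == 0:
            # Assumption: a real system would write this to durable storage.
            store["step"], store["grid"] = step + 1, copy.deepcopy(grid)
    return grid


def make_grid(n):
    """Square grid of zeros with the top edge held at 1.0."""
    grid = [[0.0] * n for _ in range(n)]
    grid[0] = [1.0] * n
    return grid


def run_with_restart(n=8, steps=20, checkpoint_every=5, fail_at=13):
    """Run once, get terminated at `fail_at`, then restart from the checkpoint."""
    store = {}
    try:
        return run(make_grid(n), steps, checkpoint_every, store, fail_at)
    except SpotTerminated:
        return run(make_grid(n), steps, checkpoint_every, store)
```

In this sketch the interrupted run loses only the sweeps performed since the last checkpoint, and the restarted run reaches the same final grid as an uninterrupted one. The checkpoint interval trades checkpoint overhead against recomputation after a termination.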


