A Switched Dynamical System Framework for Analysis of Massively Parallel Asynchronous Numerical Algorithms
Conference Paper
Overview
Identity
Additional Document Info
Other
View All
Overview
abstract
2015 American Automatic Control Council. In the near future, massively parallel computing systems will be necessary to solve computation intensive applications. The key bottleneck in massively parallel implementation of numerical algorithms is the synchronization of data across processing elements (PEs) after each iteration, which results in significant idle time. Thus, there is a trend towards relaxing the synchronization and adopting an asynchronous model of computation to reduce idle time. However, it is not clear what is the effect of this relaxation on the stability and accuracy of the numerical algorithm. In this paper we present a new framework to analyze such algorithms. We treat the computation in each PE as a dynamical system and model the asynchrony as stochastic switching. The overall system is then analyzed as a switched dynamical system. However, modeling of massively parallel numerical algorithms as switched dynamical systems results in a very large number of modes, which makes current analysis tools available for such systems computationally intractable. We develop new techniques that circumvent this scalability issue. The framework is presented on a one-dimensional heat equation as a case study for the partial differential equation (PDE), and the proposed analysis tools are verified by implementing asynchronous communications between cores on an nVIDIA Tesla GPU machine.