3rd International ICST Conference on Simulation Tools and Techniques

Research Article

μπ: a scalable and transparent system for simulating MPI programs

  • @INPROCEEDINGS{10.4108/ICST.SIMUTOOLS2010.8692,
        author={Kalyan S. Perumalla},
        title={μπ: a scalable and transparent system for simulating MPI programs},
        proceedings={3rd International ICST Conference on Simulation Tools and Techniques},
        publisher={ICST},
        proceedings_a={SIMUTOOLS},
        year={2010},
        month={5},
        keywords={MPI Virtual Execution Supercomputing Synchronization},
        doi={10.4108/ICST.SIMUTOOLS2010.8692}
    }
    
  • Kalyan S. Perumalla
    Year: 2010
    μπ: a scalable and transparent system for simulating MPI programs
    SIMUTOOLS
    ICST
    DOI: 10.4108/ICST.SIMUTOOLS2010.8692
Kalyan S. Perumalla1,*
  • 1: Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA
*Contact email: perumallaks@ornl.gov

Abstract

μπ is a scalable, transparent system for experimenting with the execution of parallel programs on simulated computing platforms. The level of simulated detail can be varied for application behavior as well as for machine characteristics. Unique features of μπ are repeatability of execution, scalability to millions of simulated (virtual) MPI ranks, scalability to hundreds of thousands of host (real) MPI ranks, portability of the system to a variety of host supercomputing platforms, and the ability to experiment with scientific applications whose source code is available. The set of source-code interfaces supported by μπ is being expanded to support a wider set of applications, and MPI-based scientific computing benchmarks are being ported. In proof-of-concept experiments, μπ has been successfully exercised to spawn and sustain very large-scale executions of an MPI test program given in source-code form. Low slowdowns are observed, due to its use of a purely discrete-event style of execution and to the scalability and efficiency of the underlying parallel discrete event simulation engine, μsik. In the largest runs, μπ has been executed on up to 216,000 cores of a Cray XT5 supercomputer, successfully simulating over 27 million virtual MPI ranks, each virtual rank containing its own thread context, and all ranks fully synchronized by virtual time.