9th EAI International Conference on Performance Evaluation Methodologies and Tools

Research Article

GPU Performance Prediction Through Parallel Discrete Event Simulation and Common Sense

Cite
  • @INPROCEEDINGS{10.4108/eai.14-12-2015.2262575,
        author={Guillaume Chapuis and Stephan Eidenbenz and Nandakishore Santhi},
        title={GPU Performance Prediction Through Parallel Discrete Event Simulation and Common Sense},
        proceedings={9th EAI International Conference on Performance Evaluation Methodologies and Tools},
        publisher={ACM},
        proceedings_a={VALUETOOLS},
        year={2016},
        month={1},
        keywords={parallel discrete event simulation gpgpu performance prediction},
        doi={10.4108/eai.14-12-2015.2262575}
    }
    
Guillaume Chapuis¹, Stephan Eidenbenz¹,*, Nandakishore Santhi¹
  • ¹ LANL
*Contact email: eidenben@lanl.gov

Abstract

We present the GPU Module of a Performance Prediction Toolkit developed at Los Alamos National Laboratory, which enables code developers to efficiently test novel algorithmic ideas, particularly for large-scale computational physics codes. The GPU Module is a heavily parameterized model of the GPU hardware. It takes as input a sequence of abstracted instructions, which the user either provides as a representation of the application or reads in from PTX, the GPU intermediate representation format. These instructions are then executed in a discrete event simulation framework of the entire computing infrastructure, which can include the multi-GPU and multi-node components typically found in high performance computing applications. Our GPU Module aims at a trade-off between the cycle accuracy of GPU simulators and the fast execution times of analytical models. We achieve this trade-off by simulating only a portion of the computations at cycle level and using this partial runtime to analytically predict the total execution time of the modeled application. We present GPU models that we validate against three benchmark applications ranging from bandwidth-limited to cycle-limited. Our runtime predictions are within an error of 20%. We then predict the performance of a next-generation GPU (Nvidia’s Pascal) for the same benchmark applications.
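The hybrid idea in the abstract (simulate a portion in detail, then extrapolate analytically) can be illustrated with a minimal Python sketch. This is not the toolkit's code or the authors' model: the function names, the per-instruction latency table, and the assumption that thread blocks execute in equal-length "waves" are all hypothetical simplifications chosen for illustration.

```python
import math

def simulate_block_cycles(instructions, latency):
    """Toy stand-in for the detailed step: add up the assumed latency
    (in cycles) of each abstracted instruction in one thread block."""
    return sum(latency[op] for op in instructions)

def predict_runtime_seconds(instructions, latency, total_blocks,
                            concurrent_blocks, clock_hz):
    """Extrapolate the total runtime from one simulated block.

    Assumption (hypothetical): all blocks behave alike and run in waves of
    `concurrent_blocks`, so total cycles = waves * cycles of one block.
    """
    block_cycles = simulate_block_cycles(instructions, latency)
    waves = math.ceil(total_blocks / concurrent_blocks)
    return waves * block_cycles / clock_hz

# Illustrative numbers only; they do not describe any real GPU.
latency = {"alu": 4, "mem": 400}
kernel = ["mem", "alu", "alu", "mem"]
runtime = predict_runtime_seconds(kernel, latency, total_blocks=1000,
                                  concurrent_blocks=64, clock_hz=1e9)
print(f"predicted runtime: {runtime:.3e} s")
```

In this sketch only one block is "simulated", so the cost of a prediction is independent of the number of blocks; the price is that any variation between blocks or contention between them is ignored.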

Keywords
parallel discrete event simulation, gpgpu, performance prediction
Published
2016-01-04
Publisher
ACM
http://dx.doi.org/10.4108/eai.14-12-2015.2262575
Copyright © 2015–2025 ICST