10th EAI International Conference on Simulation Tools and Techniques

Research Article

ScaleSimulator – A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration

  • @INPROCEEDINGS{10.1145/3173519.3173528,
        author={Ori Chalak and Weiguang Cai and Wei Li and Lei Fang and Libing Zheng and Jintang Wang and Zuguang Wu and Xiongli Gu and Haibin Wang and Avi Mendelson},
        title={ScaleSimulator -- A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration},
        proceedings={10th EAI International Conference on Simulation Tools and Techniques},
        publisher={ACM},
        proceedings_a={SIMUTOOLS},
        year={2018},
        month={8},
        keywords={scalesimulator simulation system level qemu parallel simulator},
        doi={10.1145/3173519.3173528}
    }
    
  • Ori Chalak
    Weiguang Cai
    Wei Li
    Lei Fang
    Libing Zheng
    Jintang Wang
    Zuguang Wu
    Xiongli Gu
    Haibin Wang
    Avi Mendelson
    Year: 2018
    ScaleSimulator – A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration
    SIMUTOOLS
    ACM
    DOI: 10.1145/3173519.3173528
Ori Chalak1,*, Weiguang Cai1, Wei Li1, Lei Fang1, Libing Zheng1, Jintang Wang1, Zuguang Wu1, Xiongli Gu1, Haibin Wang1, Avi Mendelson2
  • 1: Huawei
  • 2: Technion
*Contact email: ori.chalak@huawei.com

Abstract

Design of next generation computer systems should be supported by simulation infrastructure that must achieve a few contradictory goals such as fast execution time, high accuracy, and enough flexibility to allow comparison between large numbers of possible design points. Most existing architecture level simulators are designed to be flexible and to execute the code in parallel for greater efficiency, but at the cost of scarified accuracy. This paper presents the ScaleSimulator simulation environment, which is based on a new design methodology whose goal is to achieve near cycle accuracy while still being flexible enough to simulate many different future system architectures and efficient enough to run meaningful workloads. We achieve these goals by making the parallelism a first-class citizen in our methodology. Thus, this paper focuses mainly on the ScaleSimulator design points that enable better parallel execution while maintaining the scalability and cycle accuracy of a simulated architecture. The paper indicates that the new proposed ScaleSimulator tool can (1) efficiently parallelize the execution of a cycle-accurate architecture simulator, (2) efficiently simulate complex architectures (e.g., out-of-order CPU pipeline, cache coherency protocol, and network) and massive parallel systems, and (3) use meaningful workloads, such as full simulation of OLTP benchmarks, to examine future architectural choices.