Research Article
ScaleSimulator – A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration
@INPROCEEDINGS{10.1145/3173519.3173528, author={Ori Chalak and Weiguang Cai and Wei Li and Lei Fang and Libing Zheng and Jintang Wang and Zuguang Wu and Xiongli Gu and Haibin Wang and Avi Mendelson}, title={ScaleSimulator -- A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration}, proceedings={10th EAI International Conference on Simulation Tools and Techniques}, publisher={ACM}, proceedings_a={SIMUTOOLS}, year={2018}, month={8}, keywords={scalesimulator simulation system level qemu parallel simulator}, doi={10.1145/3173519.3173528} }
- Ori Chalak
Weiguang Cai
Wei Li
Lei Fang
Libing Zheng
Jintang Wang
Zuguang Wu
Xiongli Gu
Haibin Wang
Avi Mendelson
Year: 2018
ScaleSimulator – A Fast and Cycle-Accurate Parallel Simulator for Architectural Exploration
SIMUTOOLS
ACM
DOI: 10.1145/3173519.3173528
Abstract
Design of next generation computer systems should be supported by simulation infrastructure that must achieve a few contradictory goals such as fast execution time, high accuracy, and enough flexibility to allow comparison between large numbers of possible design points. Most existing architecture level simulators are designed to be flexible and to execute the code in parallel for greater efficiency, but at the cost of scarified accuracy. This paper presents the ScaleSimulator simulation environment, which is based on a new design methodology whose goal is to achieve near cycle accuracy while still being flexible enough to simulate many different future system architectures and efficient enough to run meaningful workloads. We achieve these goals by making the parallelism a first-class citizen in our methodology. Thus, this paper focuses mainly on the ScaleSimulator design points that enable better parallel execution while maintaining the scalability and cycle accuracy of a simulated architecture. The paper indicates that the new proposed ScaleSimulator tool can (1) efficiently parallelize the execution of a cycle-accurate architecture simulator, (2) efficiently simulate complex architectures (e.g., out-of-order CPU pipeline, cache coherency protocol, and network) and massive parallel systems, and (3) use meaningful workloads, such as full simulation of OLTP benchmarks, to examine future architectural choices.