Presentation

P17 - DisCosTiC: A DSL-based Parallel Simulation Framework using First-Principles Analytic Performance Models
Presenter
DescriptionDisCosTiC (Distributed Cost in Clusters) is a lightweight message passing simulation toolkit that simulates large-scale applications by taking the socket-level performance properties of the hardware-software interaction into account. It can reproduce and explore the dynamics of parallel programs on current and future supercomputers in a well-controlled environment, thereby saving resources and time. In contrast to existing trace-based simulators, we propose a domain-specific language (DSL) since traces do not comprise inter-process dependency information and are superimposed by many effects coming from the real system, such as system noise, variations in MPI implementations, etc. DisCosTiC has no dependencies on external libraries and uses analytical, first-principle models for execution and communication time predictions, taking socket-level bandwidth contention into account. For the execution part, it supports Roofline and ECM (execution-cache-memory) models, while communication is covered by Hockney and LogGOPS models. The structure of the parallel program is formulated within the DSL, and a configuration file holds hardware attributes. The resulting simulated traces can be visualized via Chromium’s Trace Event Profiling tool. In the simulator design, we tried to find a trade-off among modeling complexity, simulation accuracy, and user-friendliness.
TimeTuesday, June 289:00 - 11:00 CEST
LocationFoyer 2nd Floor
Event Type
Poster