Abstract
We present a new set of tools for the language-centric performance analysis and debugging of OpenMP programs that allows programmers to relate dynamic information from parallel execution to OpenMP constructs. Users can visualize execution traces, examine aggregate metrics on parallel loops and tasks, such as load imbalance or synchronization overhead, and obtain detailed information on specific events, such as the partitioning of a loop’s iteration space, its distribution to workers according to the scheduling policy and fine-grain synchronization. Our work is based on the Aftermath performance analysis tool and a ready-to-use, instrumented version of the LLVM/clang OpenMP run-time with negligible overhead for tracing. By analyzing the performance of the MG application of the NPB suite, we show that language-centric performance analysis in general and our tools in particular can help improve the performance of large-scale OpenMP applications significantly.
Original language | English |
---|---|
Title of host publication | OpenMP : memory, devices, and tasks : 12th International Workshop on OpenMP, IWOMP 2016, Nara, Japan, October 5-7, 2016, proceedings |
Editors | Naoya Maruyama, Bronis R. De Supinski, Mohamed Wahib |
Publisher | Springer Nature |
Pages | 237-250 |
Number of pages | 14 |
ISBN (Print) | 9783319455495 |
DOIs | |
Publication status | Published - 2016 |
Event | International Workshop on OpenMP - Nara Kasugano International Forum, Nara, Japan Duration: 5 Oct 2016 → 7 Oct 2016 Conference number: 12 http://iwomp2016.riken.jp/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 9903 |
Conference
Conference | International Workshop on OpenMP |
---|---|
Abbreviated title | IWOMP |
Country/Territory | Japan |
City | Nara |
Period | 5/10/16 → 7/10/16 |
Internet address |
Keywords
- OpenMP
- Performance Analysis
- Tracing