Parallel Programming Laboratory

Runtime Systems and Tools:

Topology Aware Mapping

Introduction

Petascale machines with hundreds of thousands of cores are being built. These machines have varying interconnect topologies and large network diameters. Computation is cheap and communication on the network is becoming the bottleneck for scaling of parallel applications. Network contention, specifically, is becoming an increasingly important factor affecting overall performance. The broad goal of this research is performance optimization of parallel applications through reduction of network contention.

Most parallel applications have a certain communication topology. Mapping of tasks in a parallel application based on their communication graph, to the physical processors on a machine can potentially lead to performance improvements. We have developed an automatic mapping framework and implemented a mapping library to map the communication graph for an application on to the interconnect topology of a machine while trying to localize communication.

A parallel quantum chemistry application, OpenAtom, runs twice as fast when its objects are mapped in a topology aware fashion on various machines (see figures below). NAMD, a molecular dynamics applications also sees performance improvements of 10-15% at the scaling end. Building on these ideas, we have developed algorithms and techniques for automatic mapping of parallel applications to relieve the application developers of this burden. We use the hop-bytes metric for the evaluation of mapping algorithms and suggest that it is a better metric than the previously used maximum dilation metric. The main focus of this research is on developing topology aware mapping algorithms for parallel applications with regular and irregular communication patterns. The automatic mapping framework is a suite of such algorithms with capabilities to choose the best mapping for a problem with a given communication graph.

Software available for download

Contributions of this research

Our work is among the first to discuss the effects of contention on Cray and IBM machines and to compare across multiple architectures. Contrary to popular belief that Cray machines do not stand to benefit from topology mapping, this research proves otherwise and provides detailed methods and results for performance improvements from topology mapping on them. Another contribution of this work is an API for topology discovery which works on both Cray and IBM machines.

We believe that the set of MPI benchmarks we have developed for quantifying message latencies would be useful for the HPC community to assess latencies on a supercomputer and to determine the message sizes for which number of hops makes a significant difference. The effective bandwidth benchmark in the HPC Challenge benchmark suite measures the total bandwidth available on a system but does not analyze the effects of distance or contention on message latencies. Results from MPI benchmarks re-establish the importance of mapping for the current supercomputers.

Our experience in developing mapping algorithms for production codes and insights discussed in this work will be useful to individual application writers trying to scale their codes to large supercomputers. We believe that the automatic mapping framework is applicable to a wide variety of communication scenarios and will relieve the application writers from the burden of finding good mapping solutions for their codes. Application developers can use this framework for mapping of their applications without any changes to their code base. Unlike most of the previous work, this research handles both cardinality and topological variations in the graphs. The framework provides scalable and fast, runtime solutions. Therefore, it will be useful to a large body of applications running on large parallel machines.

There has not been much research on mapping of unstructured mesh applications for performance optimizations and this research takes up at that task. We have also started developing scalable techniques for distributed load balancing in an effort to move away from centralized mapping decision algorithms.

People

Papers/Talks

17-02

2017
[Paper]

Automatic topology mapping of diverse large-scale parallel applications [ICS 2017]

| Juan Galvez | Nikhil Jain | Laxmikant Kale

[BIB] [PDF]

More...

16-21

2016
[Poster]

ACM SRC: Mapping Applications on Irregular Allocations [SC 2016]

| Seonmyeong Bak | Nikhil Jain | Laxmikant Kale

[PNG] [PDF] [PDF]

More...

16-11

2016
[Paper]

Evaluating HPC Networks via Simulation of Parallel Workloads [SC 2016]

[BIB] [PDF]

More...

16-05

2016
[Paper]

OpenAtom: Scalable Ab-Initio Molecular Dynamics with Diverse Capability [ISC 2016]

[BIB] [PDF]

More...

16-04

2016
[Paper]

Analyzing network health and congestion in dragonfly-based systems [IPDPS 2016]

[BIB]

More...

16-02

2016
[PhD Thesis]

Optimization of Communication Intensive Applications on HPC Networks [Thesis 2016]

| Nikhil Jain

[BIB] [PDF]

More...

15-03

2015
[Paper]

Identifying the Culprits behind Network Congestion [IPDPS 2015]

[BIB] [PDF]

More...

14-40

2014
[Talk]

Maximizing Throughput on a Dragonfly Network [SC 2014]

| Nikhil Jain

[KEY] [KEY]

More...

14-14

2014
[Paper]

Optimizing the performance of parallel applications on a 5D torus via task mapping [HiPC 2014]

[BIB] [PDF]

More...

14-04

2014
[Paper]

Maximizing Throughput on a Dragonfly Network [SC 2014]

[BIB] [PDF]

More...

13-18

2013
[Paper]

Predicting Application Performance using Supervised Learning on Communication Features [SC 2013]

[BIB] [PDF]

More...

13-13

2013
[Talk]

Predicting Application Performance using Supervised Learning on Communication Features [Lawrence Livermore Talk 2013]

| Nikhil Jain

[KEY] [PDF]

More...

11-47

2011
[Talk]

Avoiding Hot-Spots on Two-Level Direct Networks [SC 2011]

| Abhinav Bhatele

[KEY] [PDF]

More...

11-36

2011
[Talk]

Heuristic-based Techniques for Mapping Irregular Communication Graphs to Mesh Topologies [ESCAPE 2011]

| Abhinav Bhatele | Laxmikant Kale

[PDF]

More...

11-22

2011
[Paper]

Improving Communication Performance in Dense Linear Algebra via Topology Aware Collectives [SC 2011]

| Edgar Solomonik | Abhinav Bhatele | James Demmel

[PDF]

More...

11-21

2011
[Paper]

Avoiding Hot-Spots on Two-Level Direct Networks [SC 2011]

| Abhinav Bhatele | Nikhil Jain | William Gropp | Laxmikant Kale

[BIB] [PDF]

More...

11-12

2011
[Paper]

Heuristic-Based Techniques for Mapping Irregular Communication Graphs to Mesh Topologies [ESCAPE 2011]

| Abhinav Bhatele | Laxmikant Kale

[BIB] [PDF]

More...

11-02

2011
[Paper]

Topology Aware Task Mapping [Encyclopedia of Parallel Computing 2011]

| Abhinav Bhatele

[BIB]

More...

10-18

2010
[Paper]

Automated Mapping of Regular Communication Graphs on Mesh Interconnects [HiPC 2010]

| Abhinav Bhatele | Gagan Gupta | Laxmikant Kale | I-Hsin Chung

[BIB] [PDF]

More...

10-17

2010
[PhD Thesis]

Automating Topology Aware Mapping for Supercomputers [Thesis 2010]

| Abhinav Bhatele

[BIB] [PS] [PDF]

More...

10-07

2010
[Paper]

Automated Mapping of Structured Communication Graphs onto Mesh Interconnects [CS Res. & Tech. Report 2010]

| Abhinav Bhatele | I-Hsin Chung | Laxmikant Kale

[BIB] [PDF]

More...

10-04

2010
[Paper]

Optimizing Communication for Charm++ Applications by Reducing Network Contention [Concurrency and Computation: Practice and Experience 2010]

| Abhinav Bhatele | Eric Bohm | Laxmikant Kale

[BIB] [PDF]

More...

09-16

2009
[Poster]

Topology Aware Task Mapping Techniques: An API and Case Study [PPoPP 2009]

| Abhinav Bhatele | Eric Bohm | Laxmikant Kale

More...

09-07

2009
[Paper]

Quantifying Network Contention on Large Parallel Machines [PPL 2009]

| Abhinav Bhatele | Laxmikant Kale

[BIB] [PDF]

More...

09-04

2009
[Paper]

A Pattern Language for Topology Aware Mapping [ParaPLoP 2009]

| Abhinav Bhatele | Laxmikant Kale | Nicholas Chen | Ralph Johnson

[BIB] [PDF]

More...

09-02

2009
[Paper]

Dynamic Topology Aware Load Balancing Algorithms for Molecular Dynamics Applications [ICS 2009]

| Abhinav Bhatele | Laxmikant Kale | Sameer Kumar

[BIB] [PS] [PDF]

More...

09-01

2009
[Paper]

An Evaluative Study on the Effect of Contention on Message Latencies in Large Supercomputers [LSPP 2009]

| Abhinav Bhatele | Laxmikant Kale

[BIB] [PS] [PDF]

More...

08-18

2008
[Poster]

Effects of Contention on Message Latencies in Large Supercomputers [SC 2008]

| Abhinav Bhatele | Laxmikant Kale

More...

08-17

2008
[Poster]

Automatic Topology-Aware Task Mapping for Parallel Applications Running on Large Parallel Machines [IPDPS 2008]

| Abhinav Bhatele | Laxmikant Kale

More...

08-10

2009
[Paper]

A Case Study of Communication Optimizations on 3D Mesh Interconnects [Euro-Par 2009]

| Abhinav Bhatele | Eric Bohm | Laxmikant Kale

[BIB] [PS] [PDF]

More...

08-07

2008
[Paper]

Benefits of Topology Aware Mapping for Mesh Interconnects [PPL 2008]

| Abhinav Bhatele | Laxmikant Kale

[BIB] [PS] [PDF]

More...

08-02

2008
[Paper]

Application-specific Topology-aware Mapping for Three Dimensional Topologies [LSPP 2008]

| Abhinav Bhatele | Laxmikant Kale

[BIB] [PS] [PDF]

More...

07-12

2007
[MS Thesis]

Application-specific Topology-aware Mapping and Load Balancing for three-dimensional Torus Topologies [Thesis 2007]

| Abhinav Bhatele

[BIB] [PS] [PDF]

More...

07-03

2007
[Paper]

Fine Grained Parallelization of the Car-Parrinello ab initio MD Method on Blue Gene/L [IBM Journal of Research and Development 2007]

[BIB]

More...

05-18

2006
[Paper]

Topology-Aware Task Mapping for Reducing Communication Contention on Large Parallel Machines [IPDPS 2006]

| Tarun Agarwal | Amit Sharma | Laxmikant Kale

[BIB] [PS] [PDF]

More...

05-09

2005
[MS Thesis]

Low Diameter Regular Graph as a Network Topology in Direct and Hybrid Interconnection Networks [Thesis 2005]

| Yogesh Mehta

[BIB] [PS] [PDF]

More...

05-07

2005
[MS Thesis]

Strategies for topology-aware task mapping and for rebalancing with bounded migrations [Thesis 2005]

| Tarun Agarwal

[BIB] [PS] [PDF]

More...

Live Webcast 15th Annual Charm++ Workshop

Introduction

Software available for download

Contributions of this research