Runtime Systems and Tools:
  Heterogeneous Computing
  People
  
  Papers/Talks
    
    22-05 (2022) [Paper] Improving Scalability with GPU-Aware Asynchronous Tasks [HIPS 2022]
    21-02 (2021) [Poster] CharminG: A Scalable GPU-resident Runtime System [HPDC 2021]
    20-04 (2020) [Paper] Achieving Computation-Communication Overlap with Overdecomposition on GPU Systems [ESPM2 2020]
    20-02 (2020) [Paper] End-to-end Performance Modeling of Distributed GPU Applications [ICS 2020]
    19-06 (2019) [Poster] ACM SRC: Fast Profiling-based Performance Modeling of Distributed GPU Applications [SC 2019]
    17-11 (2017) [Poster] ACM SRC: Runtime Support for Concurrent Execution of Overdecomposed Heterogeneous Tasks [SC 2017]
    16-14 (2016) [Paper] Runtime Coordinated Heterogeneous Tasks in Charm++ [ESPM2 2016]
    12-06 (2012) [Paper] Dynamic Scheduling for Work Agglomeration on Heterogeneous Clusters [Workshop on Multicore and GPU Programming Models, Languages and Compilers at IPDPS 2012]
    10-16 (2010) [Paper] Scaling Hierarchical N-Body Simulations on GPU Clusters [SC 2010]
    09-09 (2009) [Paper] Towards a Framework for Abstracting Accelerators in Parallel Applications: Experience with Cell [SC 2009]
    09-06 (2009) [Paper] Flexible Hardware Mapping for Finite Element Simulations on Hybrid CPU / GPU Clusters [SAAHPC 2009]
    08-12 (2008) [MS Thesis] An Application Programming Interface for General Purpose Graphics Processing Units in an Asynchronous Runtime System [Thesis 2008]
    06-20 (2006) [Poster] Charm++ on Cell [PPL Poster 2006]
    06-19 (2006) [Poster] Charm++ Simplifies Programming for the Cell Processor [SC 2006]