hugo-cwpearson/index.md at 04273c76e7506cf16e43d7586de23fdf892a1865

cwpearson/hugo-cwpearson

Fork 0

Files

Carl Pearson 04273c76e7 SIAM PP talk and abstract

2022-02-22 14:48:41 -07:00

1.2 KiB

Raw Blame History

title, date, tags

title

date

Abstract

Developing a high-performance implementation of a distributed computational kernel for high-performacing computing is increasingly challenging. Systems are composed of heterogenous computational resources, and limited communication performance demands an asynchronous application design. Even if high-performance computation and communication libraries are available. the challenge becomes the best coordination of the provided operations to create an optimal result. This work presents a system that automatically generates design rules for a high-performance implementation of a compound operation provided as a dependence graph. The system searches among valid schedules to determine the fastest arrangement of operations. A post-processing step on the results of the search yields interpretable design rules. The fast implementation can be used directly, or experts can use the design rules to create a high-performance implementation.

Link

slides

1.2 KiB Raw Blame History

Abstract

Link

1.2 KiB

Raw Blame History