hugo-cwpearson/content/talk/20200311sandia/index.md
2020-03-09 16:17:51 -05:00


title, event, event_url, location, address, summary, abstract, date, date_end, all_day, publishDate, authors, tags, featured, image, links, url_code, url_pdf, url_slides, url_video, slides, projects, math
Optimizing Communication for CPU/GPU Nodes Sandia National Labs Seminar Sandia National Labs
street city region postcode country
Albuquerque, NM USA
Optimizing Multi-GPU Stencil Communication
High-performance distributed computing systems increasingly feature nodes that have multiple CPU sockets and multiple GPUs. The communication bandwidth between those components depends on the underlying hardware and system software. Consequently, the bandwidth is non-uniform, and these systems can expose different communication capabilities between different components. Using these capabilities optimally is a challenging and essential consideration on emerging architectures. This talk starts by describing the performance of different CPU-GPU and GPU-GPU communication methods on nodes with high-bandwidth NVLink interconnects. This foundation is then used for domain partitioning, data placement, and communication planning in a CUDA+MPI 3D stencil halo exchange library.
2020-03-11T00:00:00Z 2020-03-11T00:00:00Z false 2020-03-13T00:00:00Z
Carl Pearson
true
caption focal_point
Right
stencil_library
true