hugo-cwpearson/index.md at e9e64383559481b4f1034393ac13d33fabadcf1e

Files

Carl Pearson b64492005b add skeleton for Sandia talk

2020-03-09 16:17:51 -05:00

2.1 KiB

Raw Blame History

title, event, event_url, location, address, summary, abstract, date, date_end, all_day, publishDate, authors, tags, featured, image, links, url_code, url_pdf, url_slides, url_video, slides, projects, math

title

event

event_url

location

address

summary

abstract

date

date_end

all_day

publishDate

authors

tags

featured

image

links

url_code

url_pdf

url_slides

url_video

slides

projects

math

Optimizing Communication for CPU/GPU Nodes

Sandia National Labs Seminar

Sandia National Labs

street	city	region	postcode	country
	Albuquerque, NM			USA

Optimizing Multi-GPU Stencil Communication

High-performance distributed computing systems increasingly feature nodes that have multiple CPU sockets and multiple GPUs. The communication bandwidth between those components depends on the underlying hardware and system software. Consequently, the bandwidth between these components is non-uniform, and these systems can expose different communication capabilities between these components. Optimally using these capabilities is challenging and essential consideration on emerging architectures. This talk starts by describing the performance of different CPU-GPU and GPU-GPU communication methods on nodes with high-bandwidth NVLink interconnects. This foundation is then used for domain partitioning, data placement, and communication planning in a CUDA+MPI 3D stencil halo exchange library.

2020-03-11T00:00:00Z

false

2020-03-13T00:00:00Z

Carl Pearson

true

caption	focal_point
	Right

stencil_library

true

2.1 KiB Raw Blame History

2.1 KiB

Raw Blame History