add some bibtex
This commit is contained in:
@@ -19,6 +19,19 @@ Bespoke codes with optimized communication may be non-portable across run-time/s
|
||||
This work presents node-aware approaches for automatic data placement and communication implementation for 3D stencil codes on multi-GPU nodes with non-homogeneous communication performance and capabilities.
|
||||
Benchmarking results in the Summit system show that choices in placement can result in a 20% improvement in single-node exchange, and communication specialization can yield a further 6x improvement in exchange time in a single node, and a 16% improvement at 1536 GPUs.
|
||||
|
||||
```bibtex
|
||||
@INPROCEEDINGS{9150372,
|
||||
author={C. {Pearson} and M. {Hidayetoğlu} and M. {Almasri} and O. {Anjum} and I. {Chung} and J. {Xiong} and W. W. {Hwu}},
|
||||
booktitle={2020 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)},
|
||||
title={Node-Aware Stencil Communication for Heterogeneous Supercomputers},
|
||||
year={2020},
|
||||
volume={},
|
||||
number={},
|
||||
pages={796-805},
|
||||
doi={10.1109/IPDPSW50202.2020.00136}
|
||||
}
|
||||
```
|
||||
|
||||
* [pdf](/pdf/20200522_pearson_iwapt.pdf)
|
||||
* [code](https://github.com/cwpearson/stencil)
|
||||
* [slides](/pdf/20200522_pearson_iwapt_slides.pdf)
|
||||
Reference in New Issue
Block a user