fb88da915d72ca9e43e798d423f9aa342f9733e7
dist-spmv
vortex
module --force purge
module load StdEnv
module load xl/2021.03.11
module load cuda/10.1.243
module load spectrum-mpi/rolling-release
module load cmake/3.18.0
ascicgpu030
To get nsight, had to download the rpm and unpack into home directory with cpio
.
module purge
module load sierra-devel/nvidia
module load cde/v2/cmake/3.19.2
mpirun -n 2 ~/software/nsight-systems-cli/2021.2.1/bin/nsys profile -c cudaProfilerApi -t cuda,mpi,nvtx -o dist-spmv_%q{OMPI_COMM_WORLD_RANK} -f true ./main
Design Considerations
Minimize CUDA runtime calls
Description
Languages
Cuda
64.2%
C++
30.4%
CMake
5.4%