Performance tools

This section gives an overview of some of the gpu performance analysis tools available on Piz Daint. Feel free to contact us on #profiler_questions or #cluster_support_pizdaint in Slack for more information.

NVIDIA Nsight Systems

  • Nsight™ Systems is NVIDIA® system-wide performance analysis tool. Several versions are installed on Piz Daint. It is also recommended to download and install it on your laptop.

    Warning

    Nsight™ Compute is not supported on Piz Daint with NVIDIA P100 gpus.

Load nsys on Piz Daint

The latest version of Nsight Systems available on Piz Daint is 2023.3.1:

/apps/daint/UES/hackaton/software/nvidia-nsight-systems/2023.3.1/bin/nsys --version
NVIDIA Nsight Systems version 2023.3.1.92-233133147223v0

Run a job with nsys on Piz Daint

An example jobscript is available in /scratch/snx3000/piccinal/reframe/stage/daint/gpu/PrgEnv-gnu/nsys/. For example, add to your jobscript:

nsys='/apps/daint/UES/hackaton/software/nvidia-nsight-systems/2023.3.1/bin/nsys'

nsys_opt1='profile --force-overwrite=true'
nsys_opt2='-o %h.%q{SLURM_NODEID}.%q{SLURM_PROCID}.nsys-rep'
nsys_opt3='--trace=cuda,mpi,nvtx --mpi-impl=mpich'
nsys_opt4='--stats=true --delay=2'

srun nsys $nsys_opt1 $nsys_opt2 $nsys_opt3 $nsys_opt4 ./myexe -- myexe_args

Get more informations about the tool with:

Note

nsys profile --help

You may also want to track the GPU memory usage with the '--cuda-memory-usage=true' flag. This feature may cause significant runtime overhead. You can also deactivate cpu reporting with '--sample=none'.

Open a report file with nsys on your laptop

A succesfull completed job on Piz Daint will create 1 or more *.nsys-rep report file(s). Transfer the files to your laptop with scp and open them with the installed nsys client.

nsys

NVIDIA nvprof

  • nvprof and NVVP are NVIDIA® previous generation profiler and gui.

    Warning

    It is recommended to use Nsight Systems (nsys). If you need help with nvprof, contact us please.