[CP2K-user] CP2K 7.1-Cuda Bandgap and HF energies different from previous versions
Fabian Ducry
fabia... at gmail.com
Mon May 4 15:37:50 UTC 2020
Dear Andres,
I can confirm and reproduce the issue. Apparently it appears when combining
CUDA + OMP in hybrid calculations. In that case the energy becomes a
function of #OMP threads per rank. For your input I got (cp2k 8.0, revision
3e7b916, run on Piz Daint)
no cuda
OMP_NUM_THREADS = 1 OMP_NUM_THREADS = 3
OMP_NUM_THREADS = 6
Exchange-correlation energy: -433.84964308969535
-433.84964308969302 -435.33426106395467
-435.96513615032325
Hartree-Fock Exchange energy: -127.87395928499694
-127.87395928499325 -125.97109874333140
-125.24809389970088
Total energy: -1976.39722899739672
-1976.39722899739013
-1975.95046919253809 -1975.87080541858177
Without OMP parallelization the energies agrees with the calculation
without CUDA accelleration. Increasing OMP_NUM_THREADS beyond 1 increases
the Hartree-Fock Exchange energy.
Apparently you have to disable OMP to obtain correct results. This is
obviously not very satisfying and I hope this gets fixed. I see that you
used 1 MPI/12 OMP ranks per node. Try increasing the number of MPI ranks
per node. To do so you have to set
export CRAY_CUDA_MPS=1 in the submission script.
I hope this helps.
Best,
Fabian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.cp2k.org/archives/cp2k-user/attachments/20200504/e6588d2e/attachment.htm>
More information about the CP2K-user
mailing list