<div>Hello,</div><div><br></div><div>I've played around trying to optimize my cp2k compute resource inputs. How should one think regarding using MPI, OMP, and accelerators. I have a system of ~4000 atoms that I want to do some MD calculations on using xTB.</div><div><br></div><div>My hardware is <br><li>2x AMD EPYC™ 7763, 64-core, 2.45 GHz processors per node</li><li>1024 GB DDR4 3200MT/s of physical memory per node</li><li>8x GPU accelerator NVIDIA A100 per node, 320GB HBM2 memory per node</li></div><div><br></div><div>and my bash script looks as follows:</div><div><br></div><div>#PBS -l select=2:mpiprocs=2:openmpthreads=64<br>#PBS -m a<br><br>module load CP2K/8.2-fosscuda-2020b <br><br>mpirun -n 2 cp2k.psmp -o cp2k.out cp2k.inp</div><div><br></div><div>This however seem to cause severe I/O limitations. Any suggestions?<br></div><div> </div> <p></p> -- <br /> You received this message because you are subscribed to the Google Groups "cp2k" group.<br /> To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:cp2k+unsubscribe@googlegroups.com">cp2k+unsubscribe@googlegroups.com</a>.<br /> To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/31d7c4fa-7dfa-467d-8ef3-42180e165d9an%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/cp2k/31d7c4fa-7dfa-467d-8ef3-42180e165d9an%40googlegroups.com</a>.<br />