Hi Corrado,<br /><br />I have a similar (somewhat older) single node server with two Xeon Gold 6152 CPUs @ 2.10GHz, 22 cores each, 44 total, 88 threads (hyperthreading enabled). I was currently running calculations with only OMP parallelization as I only have a limited amount of RAM (32GB), but I was wondering if I could also benefit from using mpirun with the --bind-to hwthread option, perhaps with -np 2 and OMP_NUM_THREADS=44? <br /><br />Out of interest, how much RAM do you have in your machine? I was thinking about suggesting to put some more RAM into our machine so that I can do heavier calculations, because I easily hit the memory limit... For some calculations I'm doing now I have to idle half of my processors so that I can run a calculation with 44 cores while using all the RAM on the node...<br /><br />Kind regards,<div>Léon<br /><br /></div><div class="gmail_quote"><div dir="auto" class="gmail_attr">On Tuesday, 14 February 2023 at 11:54:21 UTC+1 Corrado Bacchiocchi wrote:<br/></div><blockquote class="gmail_quote" style="margin: 0 0 0 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;"><div>
Hi Everyone,</div><div><br></div><div>thanks for the many suggestions.</div><div><br></div><div>I have a single node server with two Xeon Gold 6154 CPU @ 3.00GHz, 18 cores each, 36 total, 72 threads.</div><div>I have found that the following launch command:<br></div><div><br></div><div>mpirun -np 72 --bind-to hwthread
cp2k.psmp -i cp2k.inp -o cp2k.out
</div><div><br></div><div>
performs about 5x faster than</div><div><br></div><div>mpirun -n 2 --bind-to numa --map-by numa -display-map cp2k.psmp -i cp2k.inp -o cp2k.out</div><div><br></div><div>Regards</div><div>Corrado<br>
</div><div class="gmail_quote"><div dir="auto" class="gmail_attr">On Monday, June 6, 2022 at 1:58:42 PM UTC+2 Matthew Graneri wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi Pierre,<div><br></div><div>Sorry it's taken so long to reply. Your reply really helped. Thank you!</div><div><br></div><div>Regards,</div><div><br></div><div>Matthew<br><br></div><div class="gmail_quote"><div dir="auto" class="gmail_attr">On Friday, May 20, 2022 at 7:16:43 PM UTC+8 <a rel="nofollow">wave...@gmail.com</a> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi Everyone<div><br></div><div>While I haven't figured out the GPU side of things (btw, only part of cp2k is GPU-optimized), I found this approach useful for mpirun. Note that many people do not recommend using hyper-threading for this kind of application, so this will not give hyper-threading.</div><div><br></div><div><div><div> <font face="Courier New">mpirun -n 2 --bind-to numa --map-by numa -display-map cp2k.psmp -i my-cp2k-run.inp > my-cp2k-run.out</font></div></div><br></div><div><ol><li>The 'bind-to numa' and 'map-by numa' make use of the os's understanding of the processor.</li><li>These two together neatly puts the mpi ranks per cpu socket.</li><li>The '-display-map' writes the mpi assignments at the beginning of the output.</li></ol><div>Hope this helps!</div></div><div><br></div><div><div>Kind Regards</div><div><br></div><div>Sam</div></div><div class="gmail_quote"><div dir="auto" class="gmail_attr">On Wednesday, May 18, 2022 at 12:23:50 PM UTC+2 <a rel="nofollow">pierre.an...@gmail.com</a> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-IE" link="blue" vlink="purple" style="word-wrap:break-word">
<div>
<p class="MsoNormal"><span>Hi Matthew,<u></u><u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span>Unfortunately, there’s no single way to determine the best MPI/OpenMP load. It is system, calculation type, and hardware dependant. I recommend testing the performance. The first thing you could
try is check if your CPUs are multithreaded. For example, if they are made of 34 cores and 2 virtual cores per physical core (68 virtual cores in total), you could try OMP_NUM_THREADS=2 and keep your mpirun -np (34*#nodes).<u></u><u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span>Roughly speaking, MPI creates multiple replica of the calculation (called process), each replica dealing with part of the calculation. CP2K is efficiently parallelized with MPI. OpenMP generated
multiple threads on the fly, generally to parallelize a loop. OpenMP can be used in a MPI thread but not the other way around. Typically, having more MPI processed consumes more memory than the same number of OpenMP threads. To use multiple nodes, MPI is mandatory
and more efficient. These are generalities and, again, combining both is best but the ideal ratio varies. Testing is the best course of action, check which combination yields the largest number of ps/day with the minimum hardware resources. Doubling the hardware
does not double the output, so increasing the number of nodes becomes a waste of resources at some point. A rule of thumb, if the increase in output is less than 75-80% of the ideal case, then, it is not worth it.<u></u><u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span>As you can see, there is a lot of try and error, no systematic rule I am afraid.<u></u><u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span>Regards,<u></u><u></u></span></p>
<p class="MsoNormal"><span>Pierre<u></u><u></u></span></p></div></div><div lang="EN-IE" link="blue" vlink="purple" style="word-wrap:break-word"><div>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<p class="MsoNormal"><span><u></u> <u></u></span></p>
<div style="border:none;border-top:solid #b5c4df 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-bottom:12.0pt"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black"><a rel="nofollow">cp...@googlegroups.com</a> <<a rel="nofollow">cp...@googlegroups.com</a>> on behalf of Matthew Graneri <<a rel="nofollow">mhvg...@gmail.com</a>><br>
<b>Date: </b>Wednesday, 18 May 2022 at 10:35<br>
<b>To: </b>cp2k <<a rel="nofollow">cp...@googlegroups.com</a>><br>
<b>Subject: </b>Re: [CP2K:16997] Running Cp2k in parallel using thread in a PC<u></u><u></u></span></p>
</div>
<p class="MsoNormal">Hi Pierre,<u></u><u></u></p>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I found this really valuable! Unfortunately, being very new to AIMD and very unfamiliar with computation in general, I was wondering if I might be able to get some advice? We have a HPC at my university where each node has 34 processors,
and ~750 GB RAM available for use. It runs on a slurm queuing system.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Until now, I've run all my jobs using: <span style="font-family:"Courier New"">
mpirun -np $SLURM_NTASKS cp2k.popt -I input.inp -o output.out</span><u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">where <span style="font-family:"Courier New"">$SLURM_NTASKS</span> is whatever number of processors I've allocated to the job via the
<span style="font-family:"Courier New"">--ntasks=x</span> flag.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">So instead, I'm thinking it might be more appropriate to use the .psmp executable, but I'm not sure what the difference between the OpenMP and the MPI threads are, and what kind of ratios between the OMP and MPI threads would be most effective
for speeding up an AIMD job, and how many threads of each type you can add before the parallelisation becomes less efficient.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Do you (or anyone else) have any advice on the matter? Is it better to have more OMP or MPI threads? And how many OMP threads per MPI thread would be appropriate? What kinds of ratios are most effective at speeding up calculations?<u></u><u></u></p>
</div>
<div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div>
<div>
<p class="MsoNormal">I would really appreciate any help I can get!<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Regards,<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Matthew<u></u><u></u></p>
</div>
<div>
<div>
<p class="MsoNormal">On Friday, September 20, 2019 at 10:45:55 PM UTC+8 <a rel="nofollow">pierre.an...@gmail.com</a> wrote:<u></u><u></u></p>
</div>
<blockquote style="border:none;border-left:solid #cccccc 1.0pt;padding:0cm 0cm 0cm 6.0pt;margin-left:4.8pt;margin-right:0cm">
<div>
<p class="MsoNormal">Hello Nikhil,<br>
<br>
Withe command "mpirun -n 42 cp2k.pop -i inp.inp -o -out.out", you are requesting 42 MPI threads and not 42 OpenMP threads. MPI usually relies on replicated data which means that, for a poorly program software, it will request a total amount of memory which
the amount of memory required by a scalar execution times the number of threads. This can very quickly become problematic, in particular for QM calculations. OpenMP, however relies on shared memory, the data is normally not replicated but shared between threads
and therefore, in an ideal scenario, the amount of memory needed for 42 OpenMP threads is the same as a single one.<br>
<br>
This might explains why you calculation freezes. You are out of memory. On your workstation, you should only use the executable "cp2k.ssmp" which is the OpenMP version. Then you don't need the mpirun command:<br>
<br>
cp2k.ssmp -i inp.inp -o -out.out<br>
<br>
To control the number of OpenMP threads, set the env variable: OMP_NUM_THREADS, e.g. in bash, export OMP_NUM_THREADS=48<br>
<br>
Now, if you need to balance between MPI and OpenMP, you should use the executable named cp2k.psmp. Here is such an example:<br>
<br>
export OMP_NUM_THREADS=24<br>
mpirun -n 2 cp2k.psmp -i inp.inp -o -out.out<br>
<br>
In this example, I am requesting two MPI threads and each of them can use up to 24 OpenMP threads.<br>
<br>
Hope this clarifies things for you.<br>
<br>
Regards,<br>
Pierre<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal" style="margin-bottom:12.0pt"><u></u> <u></u></p>
<div>
<p class="MsoNormal">On 20/09/2019 14:09, Nikhil Maroli wrote:<u></u><u></u></p>
</div>
</div>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal">Dear all, <u></u><u></u></p>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I have installed all the versions of CP2K in my workstation with 2 x 12 core processor, total thread=48<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I wanted to run cp2k in parallel using 42 threads, can anyone share the commands that i can use.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">I have tried <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">mpirun -n 42 cp2k.pop -i inp.inp -o -out.out<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">After this command there is a rise in memory to 100 % and the whole system freezes. (i have 128GB ram).<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Any suggestion will be greatly appreciated,<u></u><u></u></p>
</div>
</div>
</blockquote>
</div>
<div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal">-- <br>
You received this message because you are subscribed to the Google Groups "cp2k" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to
<span><a rel="nofollow">cp2k+uns...@googlegroups.com</a></span>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/39284c57-f6eb-463e-81a6-3a123596a9f2%40googlegroups.com?utm_medium=email&utm_source=footer" rel="nofollow" target="_blank" data-saferedirecturl="https://www.google.com/url?hl=en-GB&q=https://groups.google.com/d/msgid/cp2k/39284c57-f6eb-463e-81a6-3a123596a9f2%2540googlegroups.com?utm_medium%3Demail%26utm_source%3Dfooter&source=gmail&ust=1688153470710000&usg=AOvVaw1JDKwd9M4ZGWzzg7Mb6XDu">
https://groups.google.com/d/msgid/cp2k/39284c57-f6eb-463e-81a6-3a123596a9f2%40googlegroups.com</a>.<u></u><u></u></p>
</blockquote>
<p class="MsoNormal"><br>
<br>
<u></u><u></u></p>
<pre>-- <u></u><u></u></pre>
<pre>Dr Pierre Cazade, PhD<u></u><u></u></pre>
<pre>AD3-023, Bernal Institute,<u></u><u></u></pre>
<pre>University of Limerick,<u></u><u></u></pre>
<pre>Plassey Park Road,<u></u><u></u></pre>
<pre>Castletroy, co. Limerick,<u></u><u></u></pre>
<pre>Ireland<u></u><u></u></pre>
</div>
</blockquote>
</div>
</div></div><div lang="EN-IE" link="blue" vlink="purple" style="word-wrap:break-word"><div><p class="MsoNormal">-- <br>
You received this message because you are subscribed to the Google Groups "cp2k" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to
<a rel="nofollow">cp2k+uns...@googlegroups.com</a>.<br></p></div></div><div lang="EN-IE" link="blue" vlink="purple" style="word-wrap:break-word"><div><p class="MsoNormal">
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/010a2dd7-dc2c-4475-8a9b-17cdbb10d20dn%40googlegroups.com?utm_medium=email&utm_source=footer" rel="nofollow" target="_blank" data-saferedirecturl="https://www.google.com/url?hl=en-GB&q=https://groups.google.com/d/msgid/cp2k/010a2dd7-dc2c-4475-8a9b-17cdbb10d20dn%2540googlegroups.com?utm_medium%3Demail%26utm_source%3Dfooter&source=gmail&ust=1688153470710000&usg=AOvVaw2AqVqZjw0oDV-t9TOEkZ02">
https://groups.google.com/d/msgid/cp2k/010a2dd7-dc2c-4475-8a9b-17cdbb10d20dn%40googlegroups.com</a>.<u></u><u></u></p>
</div>
</div>
</blockquote></div></blockquote></div></blockquote></div></blockquote></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "cp2k" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:cp2k+unsubscribe@googlegroups.com">cp2k+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/5db6cb44-b485-4520-ae04-7e125bb2d47cn%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/cp2k/5db6cb44-b485-4520-ae04-7e125bb2d47cn%40googlegroups.com</a>.<br />