<div>Hello Matthias,</div><div><br></div><div>Thanks a lot for your fast reply and explanations. As you suggest, I tried with the CPU implementation and that solved the memory problem on Daint. It is working fine now, thank you again for your help.<br></div><div><br></div><div>Best regards,</div><div>Miriam<br></div><br><div class="gmail_quote"><div dir="auto" class="gmail_attr">Le jeudi 8 septembre 2022 à 13:28:12 UTC+2, Matthias Krack a écrit :<br/></div><blockquote class="gmail_quote" style="margin: 0 0 0 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<div lang="en-CH" link="blue" vlink="purple" style="word-wrap:break-word">
<div>
<p class="MsoNormal"><span lang="EN-US">Hello<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US">There is a hard limit (48*1024) coded in GPU grid routines of CP2K because of the limited GPU memory available. Using more nodes does not help here, because this won’t increase the shared
memory available per GPU. A work around is to use the CPU implementation of grid_integrate instead of the GPU implementation by selecting the grid
<a href="https://manual.cp2k.org/cp2k-2022_1-branch/CP2K_INPUT/GLOBAL/GRID.html#BACKEND" target="_blank" rel="nofollow" data-saferedirecturl="https://www.google.com/url?hl=fr&q=https://manual.cp2k.org/cp2k-2022_1-branch/CP2K_INPUT/GLOBAL/GRID.html%23BACKEND&source=gmail&ust=1662726004344000&usg=AOvVaw2wrAhOPIKLQuGdUA4cFbH5">
BACKEND</a> CPU explicitly (the default is AUTO which will then select GPU automatically on Piz Daint). Alternatively, you can try to change the code and increase that limit, e.g. to 51*1024, with the risk, however, of triggering other problems.<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US">I don’t know what causes the error on Eiger.<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US">HTH<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US">Matthias<u></u><u></u></span></p></div></div><div lang="en-CH" link="blue" vlink="purple" style="word-wrap:break-word"><div>
<p class="MsoNormal"><span lang="EN-US"><u></u> <u></u></span></p>
<div style="border:none;border-top:solid #b5c4df 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-left:36.0pt"><b><span style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">"<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>" <<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>><br>
<b>Reply to: </b>"<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>" <<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>><br>
<b>Date: </b>Thursday, 8 September 2022 at 11:37<br>
<b>To: </b>"<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>" <<a href data-email-masked rel="nofollow">cp...@googlegroups.com</a>><br>
<b>Subject: </b>[CP2K:17629] cellopt calculation on EIGER aborted<u></u><u></u></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:36.0pt"><u></u> <u></u></p>
</div>
<p class="MsoNormal" style="margin-left:36.0pt">Hello all,<br>
<br>
I am trying to run a cell-optimization for a metal-organic framework using the scan functional and rvv10 vdw functional. As I had problems with SCF convergence, I increased the cutoff and used the NN50_SMOOTH method for calculating the XC derivatives and the
nn50 density smoothing for the xc calculations, as suggested in another conversation here.<br>
The singlepoint calculation converged with these settings, but when I tried to run the cellopt on piz daint (32 nodes, 64GB RAM per node) I got an out-of memory error:<br>
"ERROR: Not enough shared memory in grid_gpu_integrate.<br>
cab_len: 4704, alpha_len: 1512, cxyz_len: 364, total smem_per_block: 51.406250 kb"<br>
<br>
<br>
So I tried running the calculations on Alps (Eiger) instead (256GB RAM/node). Now I get an error in the cp2k outfile as soon as the SCF calculation starts that I don't understand:<br>
"libfabric:187819:1662628695:cxi:core:cxip_ux_onload_cb():2259<warn> nid001534: RXC (0x2300:32:0): PtlTE 105LE resources not recovered during flow control. FI_CXI_RX_MATCH_MODE=[hybrid|software] is required.<br>
<br>
Program received signal SIGABRT: Process abort signal."<br>
<br>
Does someone have an idea what went wrong?<br>
I am using cp2k-9.1, I attach you my input file and the outfile with the complete error message.
<br>
Thank you!<u></u><u></u></p>
</div></div><div lang="en-CH" link="blue" vlink="purple" style="word-wrap:break-word"><div><p class="MsoNormal" style="margin-left:36.0pt">-- <br>
You received this message because you are subscribed to the Google Groups "cp2k" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an email to
<a href data-email-masked rel="nofollow">cp2k+uns...@googlegroups.com</a>.<br>
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/c0f4eecc-78a1-407c-a18d-20d35785d392n%40googlegroups.com?utm_medium=email&utm_source=footer" target="_blank" rel="nofollow" data-saferedirecturl="https://www.google.com/url?hl=fr&q=https://groups.google.com/d/msgid/cp2k/c0f4eecc-78a1-407c-a18d-20d35785d392n%2540googlegroups.com?utm_medium%3Demail%26utm_source%3Dfooter&source=gmail&ust=1662726004345000&usg=AOvVaw3nAi4Gj-4NpRnr8zUJeUCo">
https://groups.google.com/d/msgid/cp2k/c0f4eecc-78a1-407c-a18d-20d35785d392n%40googlegroups.com</a>.<br>
<br>
<u></u><u></u></p>
</div>
</div>
</blockquote></div>
<p></p>
-- <br />
You received this message because you are subscribed to the Google Groups "cp2k" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:cp2k+unsubscribe@googlegroups.com">cp2k+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/a4c8ead1-b7d0-4f3e-8674-25a0fbbdbfcbn%40googlegroups.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/cp2k/a4c8ead1-b7d0-4f3e-8674-25a0fbbdbfcbn%40googlegroups.com</a>.<br />