<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>Hello Krack,</p>
<p>We tried to run cp2k with the TRACE keyword, and the cp2k compute
job died ... <br>
</p>
<p>Is it possible to identify the issue from the attached cp2k log
file?<br>
</p>
<p>Can you help us identify the errors in the regression tests?</p>
<p><br>
</p>
<p>Regards</p>
<p>Salvatore<br>
</p>
<p><br>
</p>
<div class="moz-cite-prefix">On 03/08/2022 16:11, Krack Matthias
(PSI) wrote:<br>
</div>
<blockquote type="cite"
cite="mid:51CC24E7-9764-46D8-97A8-D82E5C22DE72@psi.ch">
<style>@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}span.EmailStyle20
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}div.WordSection1
{page:WordSection1;}</style>
<div class="WordSection1">
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US">Hi Salvatore<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US">Your CP2K 8.2 build obviously has issues,
because many tests completed with the status FAILED or
WRONG. Thus, that cp2k binary cannot be considered ready for
production.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US">From the error messages, it is not clear to me
what the problem could be. Maybe someone else has an idea.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US">BR<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US">Matthias<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"
lang="EN-US"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF
1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-left:36.0pt"><b><span
style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black">SALVATORE
LABONIA <a class="moz-txt-link-rfc2396E" href="mailto:salvatore.labonia@gmail.com">&lt;salvatore.labonia@gmail.com&gt;</a><br>
<b>Date: </b>Wednesday, 3 August 2022 at 14:40<br>
<b>To: </b><a class="moz-txt-link-rfc2396E" href="mailto:cp2k@googlegroups.com">"cp2k@googlegroups.com"</a>
<a class="moz-txt-link-rfc2396E" href="mailto:cp2k@googlegroups.com">&lt;cp2k@googlegroups.com&gt;</a>, "Krack Matthias (PSI)"
<a class="moz-txt-link-rfc2396E" href="mailto:matthias.krack@psi.ch">&lt;matthias.krack@psi.ch&gt;</a><br>
<b>Subject: </b>Re: [CP2K:17431] CP2K freeze<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:36.0pt"><o:p> </o:p></p>
</div>
<p style="margin-left:36.0pt">Hello,<o:p></o:p></p>
<p style="margin-left:36.0pt">cp2k 8.2 was compiled using
EasyBuild<o:p></o:p></p>
<p style="margin-left:36.0pt"><img
style="width:7.75in;height:3.0729in" id="_x0000_i1025"
src="cid:part1.SVSB6S6j.b7tDnOPg@gmail.com" class=""
width="744" height="295"><o:p></o:p></p>
<p style="margin-left:36.0pt">I attach some output from
the regression tests.<o:p></o:p></p>
<p style="margin-left:36.0pt"><o:p> </o:p></p>
<p style="margin-left:36.0pt">Regards<o:p></o:p></p>
<p style="margin-left:36.0pt">Salvatore<o:p></o:p></p>
<p style="margin-left:36.0pt"><o:p> </o:p></p>
<div>
<p class="MsoNormal" style="margin-left:36.0pt">On 03/08/2022
13:34, Krack Matthias (PSI) wrote:<o:p></o:p></p>
</div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US">Hi
Salvatore</span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US">You can
add the keyword
<a
href="https://manual.cp2k.org/cp2k-9_1-branch/CP2K_INPUT/GLOBAL.html#TRACE"
moz-do-not-send="true">TRACE</a> (or
<a
href="https://manual.cp2k.org/cp2k-9_1-branch/CP2K_INPUT/GLOBAL.html#list_TRACE_MASTER"
moz-do-not-send="true">
TRACE_MASTER</a> to trace only the MPI root process) to
the &GLOBAL section of the CP2K input to get more
detailed output.</span><o:p></o:p></p>
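<p class="MsoNormal" style="margin-left:36.0pt">As a minimal sketch of where the keyword goes, the short Python script below adds it to an existing input. The helper name add_trace_keyword, the project name freeze_debug and the sample &GLOBAL section are made up for illustration; only the TRACE keyword and its place in the &GLOBAL section come from the message above.</p>
<pre style="margin-left:36.0pt">
```python
def add_trace_keyword(input_text, keyword="TRACE"):
    """Return CP2K input text with `keyword` added to the &GLOBAL section.

    Assumption: the input already contains a line that reads exactly
    &GLOBAL (ignoring case and surrounding whitespace).
    """
    result = []
    inserted = False
    for line in input_text.splitlines():
        result.append(line)
        if not inserted and line.strip().upper() == "&GLOBAL":
            # Reuse the indentation of the section header, plus two spaces.
            indent = line[: len(line) - len(line.lstrip())]
            result.append(indent + "  " + keyword)
            inserted = True
    if not inserted:
        raise ValueError("no &GLOBAL section found in the input")
    return "\n".join(result) + "\n"


# Hypothetical sample input, used only to show the placement.
example = """&GLOBAL
  PROJECT freeze_debug
  RUN_TYPE ENERGY
&END GLOBAL
"""

print(add_trace_keyword(example))
```
</pre>
<p class="MsoNormal" style="margin-left:36.0pt">Passing a different keyword argument places another keyword, such as TRACE_MASTER, in the same section in the same way.</p>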
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US">Does the
run freeze for any kind of CP2K input? How did you compile
CP2K? Could you run the regression test successfully? It
is difficult to make any suggestion without further
information.</span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US">Best</span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US">Matthias</span><o:p></o:p></p>
<p class="MsoNormal" style="margin-left:36.0pt"><span
style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
<div style="border:none;border-top:solid #B5C4DF
1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-left:72.0pt"><b><span
style="font-size:12.0pt;color:black">From:
</span></b><span style="font-size:12.0pt;color:black"><a
href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">"cp2k@googlegroups.com"</a>
<a href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">&lt;cp2k@googlegroups.com&gt;</a>
on behalf of Salvatore Labonia
<a href="mailto:salvatore.labonia@gmail.com"
moz-do-not-send="true">&lt;salvatore.labonia@gmail.com&gt;</a><br>
<b>Reply to: </b><a href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">"cp2k@googlegroups.com"</a>
<a href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">&lt;cp2k@googlegroups.com&gt;</a><br>
<b>Date: </b>Wednesday, 3 August 2022 at 12:35<br>
<b>To: </b><a href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">"cp2k@googlegroups.com"</a> <a
href="mailto:cp2k@googlegroups.com"
moz-do-not-send="true">
&lt;cp2k@googlegroups.com&gt;</a><br>
<b>Subject: </b>[CP2K:17431] CP2K freeze</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:72.0pt"> <o:p></o:p></p>
</div>
<p class="MsoNormal" style="margin-left:72.0pt">Hello, <o:p></o:p></p>
<div>
<p class="MsoNormal" style="margin-left:72.0pt">we are
experiencing freezes when running CP2K on our HPC cluster.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="margin-left:72.0pt">We have
94 Dell servers in total. When running cp2k v9.1, compiled with the
Intel compiler and linked against the Intel MPI library, the customer
is experiencing freezes during runs.
<o:p></o:p></p>
<div>
<p style="margin-left:72.0pt">This happens regardless of the number
or type of nodes involved.<o:p></o:p></p>
<p style="margin-left:72.0pt">The freeze happens randomly,
not at the same iteration number, even when using the same
run command and the same input dataset.<o:p></o:p></p>
<p style="margin-left:72.0pt">Looking at the process status
on the nodes when the freeze occurs, the processes seem to be
running and using CPU, but if we attach to any process (and
its forked children, of course), we can see that they are all
sitting in a wait system call for data coming from (or going
to) a pipe.<o:p></o:p></p>
<p style="margin-left:72.0pt">No other system calls are
made by the processes…<o:p></o:p></p>
<p style="margin-left:72.0pt">Slurm thinks that the job is
still running.<o:p></o:p></p>
<p style="margin-left:72.0pt">Killing one of the stuck
processes causes the other processes to die, and
Slurm finally realizes that the job has crashed.<o:p></o:p></p>
<p style="margin-left:72.0pt">Is this behaviour usual in
some circumstances (so that the customer has to do something
to avoid it), or could it be caused by something else
(cp2k compilation, MPI version, Intel compiler
version)?<o:p></o:p></p>
<p style="margin-left:72.0pt">Is there any way to run
cp2k/MPI in a debugging mode with more or less
verbose output, in order to understand at which
point/call the freeze happens?<o:p></o:p></p>
<p style="margin-left:72.0pt"> Regards<o:p></o:p></p>
<p style="margin-left:72.0pt">Salvatore<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal" style="margin-left:72.0pt">-- <br>
You received this message because you are subscribed to the
Google Groups "cp2k" group.<br>
To unsubscribe from this group and stop receiving emails
from it, send an email to
<a href="mailto:cp2k+unsubscribe@googlegroups.com"
moz-do-not-send="true" class="moz-txt-link-freetext">cp2k+unsubscribe@googlegroups.com</a>.<br>
To view this discussion on the web visit <a
href="https://groups.google.com/d/msgid/cp2k/2bffd2de-1afd-4980-b3aa-6438990d81a9n%40googlegroups.com?utm_medium=email&utm_source=footer"
moz-do-not-send="true">
https://groups.google.com/d/msgid/cp2k/2bffd2de-1afd-4980-b3aa-6438990d81a9n%40googlegroups.com</a>.<br>
<br>
<br>
<o:p></o:p></p>
</blockquote>
</div>
</blockquote>
</body>
</html>