<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
  </head>
  <body>
    <p>Hello Krack,</p>
    <p>we have tried to run cp2k using TRACE keyword and cp2k compute
      job died ... <br>
    </p>
    <p>Is it possible to identify the issue using attached cp2k log file
      ?<br>
    </p>
    <p>Can you help us to identify the error in the regression tests ?</p>
    <p><br>
    </p>
    <p>Regards</p>
    <p>Salvatore<br>
    </p>
    <p><br>
    </p>
    <div class="moz-cite-prefix">Il 03/08/2022 16:11, Krack Matthias
      (PSI) ha scritto:<br>
    </div>
    <blockquote type="cite"
      cite="mid:51CC24E7-9764-46D8-97A8-D82E5C22DE72@psi.ch">
      <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
      <meta name="Generator" content="Microsoft Word 15 (filtered
        medium)">
      <!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]-->
      <style>@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}span.EmailStyle20
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:windowtext;}.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}div.WordSection1
        {page:WordSection1;}</style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
      <div class="WordSection1">
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US">Hi Salvatore<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US">Your CP2K 8.2 build has obviously issues,
            because many tests completed with the status FAILED or
            WRONG. Thus that cp2k binary cannot be considered ready for
            production.<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US">From the error messages, it is not clear to me
            what the problem could be. Maybe, someone else has an idea.<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US">BR<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US"><o:p> </o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US">Matthias<o:p></o:p></span></p>
        <p class="MsoNormal"><span style="mso-fareast-language:EN-US"
            lang="EN-US"><o:p> </o:p></span></p>
        <div style="border:none;border-top:solid #B5C4DF
          1.0pt;padding:3.0pt 0cm 0cm 0cm">
          <p class="MsoNormal" style="margin-left:36.0pt"><b><span
                style="font-size:12.0pt;color:black">From:
              </span></b><span style="font-size:12.0pt;color:black">SALVATORE
              LABONIA <a class="moz-txt-link-rfc2396E" href="mailto:salvatore.labonia@gmail.com"><salvatore.labonia@gmail.com></a><br>
              <b>Date: </b>Wednesday, 3 August 2022 at 14:40<br>
              <b>To: </b><a class="moz-txt-link-rfc2396E" href="mailto:cp2k@googlegroups.com">"cp2k@googlegroups.com"</a>
              <a class="moz-txt-link-rfc2396E" href="mailto:cp2k@googlegroups.com"><cp2k@googlegroups.com></a>, "Krack Matthias (PSI)"
              <a class="moz-txt-link-rfc2396E" href="mailto:matthias.krack@psi.ch"><matthias.krack@psi.ch></a><br>
              <b>Subject: </b>Re: [CP2K:17431] CP2K freeze<o:p></o:p></span></p>
        </div>
        <div>
          <p class="MsoNormal" style="margin-left:36.0pt"><o:p> </o:p></p>
        </div>
        <p style="margin-left:36.0pt">Hello,<o:p></o:p></p>
        <p style="margin-left:36.0pt">cp2k 8.2 was compiled using
          EasyBuild<o:p></o:p></p>
        <p style="margin-left:36.0pt"><img
            style="width:7.75in;height:3.0729in" id="_x0000_i1025"
            src="cid:part1.SVSB6S6j.b7tDnOPg@gmail.com" class=""
            width="744" height="295"><o:p></o:p></p>
        <p style="margin-left:36.0pt">I attach some output from
          regression test<o:p></o:p></p>
        <p style="margin-left:36.0pt"><o:p> </o:p></p>
        <p style="margin-left:36.0pt">Regards<o:p></o:p></p>
        <p style="margin-left:36.0pt">Salvatore<o:p></o:p></p>
        <p style="margin-left:36.0pt"><o:p> </o:p></p>
        <div>
          <p class="MsoNormal" style="margin-left:36.0pt">Il 03/08/2022
            13:34, Krack Matthias (PSI) ha scritto:<o:p></o:p></p>
        </div>
        <blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US">Hi
              Salvatore</span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US">You can
              add the keyword
              <a
href="https://manual.cp2k.org/cp2k-9_1-branch/CP2K_INPUT/GLOBAL.html#TRACE"
                moz-do-not-send="true">TRACE</a> (or
              <a
href="https://manual.cp2k.org/cp2k-9_1-branch/CP2K_INPUT/GLOBAL.html#list_TRACE_MASTER"
                moz-do-not-send="true">
                TRACE_MASTER</a> to trace only the MPI root process) in
              the &GLOBAL section of the CP2K input to get a more
              detailed output.</span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US">Does the
              run freeze for any kind of CP2K input? How did you compile
              CP2K? Could you run the regression test successfully? It
              is difficult to make any suggestion without further
              information.</span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US">Best</span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US">Matthias</span><o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt"><span
              style="mso-fareast-language:EN-US" lang="EN-US"> </span><o:p></o:p></p>
          <div style="border:none;border-top:solid #B5C4DF
            1.0pt;padding:3.0pt 0cm 0cm 0cm">
            <p class="MsoNormal" style="margin-left:72.0pt"><b><span
                  style="font-size:12.0pt;color:black">From:
                </span></b><span style="font-size:12.0pt;color:black"><a
                  href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true">"cp2k@googlegroups.com"</a>
                <a href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true"><cp2k@googlegroups.com></a>
                on behalf of Salvatore Labonia
                <a href="mailto:salvatore.labonia@gmail.com"
                  moz-do-not-send="true"><salvatore.labonia@gmail.com></a><br>
                <b>Reply to: </b><a href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true">"cp2k@googlegroups.com"</a>
                <a href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true"><cp2k@googlegroups.com></a><br>
                <b>Date: </b>Wednesday, 3 August 2022 at 12:35<br>
                <b>To: </b><a href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true">"cp2k@googlegroups.com"</a> <a
                  href="mailto:cp2k@googlegroups.com"
                  moz-do-not-send="true">
                  <cp2k@googlegroups.com></a><br>
                <b>Subject: </b>[CP2K:17431] CP2K freeze</span><o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal" style="margin-left:72.0pt"> <o:p></o:p></p>
          </div>
          <p class="MsoNormal" style="margin-left:72.0pt">Hello, <o:p></o:p></p>
          <div>
            <p class="MsoNormal" style="margin-left:72.0pt">we are
              facing freeze using CP2K on our HPC cluster.<o:p></o:p></p>
          </div>
          <div>
            <p class="MsoNormal" style="margin-left:72.0pt">We have
              totally 94 Dell server but running cp2k v9.1 compiled with
              intel compiler and linked with intel mpi library, customer
              is experiencing running freeze.
              <o:p></o:p></p>
            <div>
              <p style="margin-left:72.0pt">No matter the number or the
                type of involved nodes.<o:p></o:p></p>
              <p style="margin-left:72.0pt">The freeze happens randomly,
                not at the same interaction number, even using the same
                running command and the same dataset for input.<o:p></o:p></p>
              <p style="margin-left:72.0pt">Looking at processes status
                on nodes when freeze occurs, they seem to be running,
                using CPU but, if we try to attach to any process (and
                forked children of course), we can see that they all are
                sitting on a wait system call for data coming (orout
                going) from (to) a pipe.<o:p></o:p></p>
              <p style="margin-left:72.0pt">No other systems call are
                run by processes…<o:p></o:p></p>
              <p style="margin-left:72.0pt">Slurm thinks that job is
                still running.<o:p></o:p></p>
              <p style="margin-left:72.0pt">Killing one of the stuck
                processes causes the death of orher processes and
                finally slurm realizes that job has crashed.<o:p></o:p></p>
              <p style="margin-left:72.0pt">Is this behaviour usual in
                same circumstances (and therefore customer has something
                to do to avoid it) or could it be caused by some other
                reason (cp2k compilation, mpi version, intel compilers
                version)?<o:p></o:p></p>
              <p style="margin-left:72.0pt">Is there any way to have a
                debugging execution of cp2k/mpi with a more or less
                verbose output in order to understand at which
                point/call does the freeze happen?<o:p></o:p></p>
              <p style="margin-left:72.0pt"> Regards<o:p></o:p></p>
              <p style="margin-left:72.0pt">Salvatore<o:p></o:p></p>
            </div>
          </div>
          <p class="MsoNormal" style="margin-left:72.0pt">-- <br>
            You received this message because you are subscribed to the
            Google Groups "cp2k" group.<br>
            To unsubscribe from this group and stop receiving emails
            from it, send an email to
            <a href="mailto:cp2k+unsubscribe@googlegroups.com"
              moz-do-not-send="true" class="moz-txt-link-freetext">cp2k+unsubscribe@googlegroups.com</a>.<br>
            To view this discussion on the web visit <a
href="https://groups.google.com/d/msgid/cp2k/2bffd2de-1afd-4980-b3aa-6438990d81a9n%40googlegroups.com?utm_medium=email&utm_source=footer"
              moz-do-not-send="true">
https://groups.google.com/d/msgid/cp2k/2bffd2de-1afd-4980-b3aa-6438990d81a9n%40googlegroups.com</a>.<br>
            <br>
            <br>
            <o:p></o:p></p>
          <p class="MsoNormal" style="margin-left:36.0pt">-- <br>
            You received this message because you are subscribed to the
            Google Groups "cp2k" group.<br>
            To unsubscribe from this group and stop receiving emails
            from it, send an email to
            <a href="mailto:cp2k+unsubscribe@googlegroups.com"
              moz-do-not-send="true" class="moz-txt-link-freetext">cp2k+unsubscribe@googlegroups.com</a>.<br>
            To view this discussion on the web visit <a
href="https://groups.google.com/d/msgid/cp2k/4B492500-D071-47FB-B7F6-9D95EA33A429%40psi.ch?utm_medium=email&utm_source=footer"
              moz-do-not-send="true">
https://groups.google.com/d/msgid/cp2k/4B492500-D071-47FB-B7F6-9D95EA33A429%40psi.ch</a>.<o:p></o:p></p>
        </blockquote>
      </div>
    </blockquote>
  </body>
</html>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups "cp2k" group.<br />
To unsubscribe from this group and stop receiving emails from it, send an email to <a href="mailto:cp2k+unsubscribe@googlegroups.com">cp2k+unsubscribe@googlegroups.com</a>.<br />
To view this discussion on the web visit <a href="https://groups.google.com/d/msgid/cp2k/502df271-acd9-34d9-5975-a4617300c3df%40gmail.com?utm_medium=email&utm_source=footer">https://groups.google.com/d/msgid/cp2k/502df271-acd9-34d9-5975-a4617300c3df%40gmail.com</a>.<br />