<div dir="ltr"><div>Dear all,</div><div><br></div><div>when running cp2k on our local
cluster (SGI UV2000), I recently observe that sometimes during an mpi job all cp2k processes "freeze" at 100% cpu
usage (according to <span style="font-family: courier new, monospace;">top</span>). For example, when running a CELL_OPT calculation via</div><div><br></div><div><span style="font-family: courier new, monospace;"><div style="background-color: rgb(250, 250, 250); border-color: rgb(187, 187, 187); border-style: solid; border-width: 1px; overflow-wrap: break-word;" class="prettyprint"><code class="prettyprint"><div class="subprettyprint"><span style="color: #000;" class="styled-by-prettify">mpirun </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">n </span><span style="color: #066;" class="styled-by-prettify">144</span><span style="color: #000;" class="styled-by-prettify"> cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">popt </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">o cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">output cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">inp</span></div></code></div></span></div><div><span style="font-family: courier new, monospace;"><br></span></div><div><span style="font-family: arial, sans-serif;">the run 'freezes' after several steps, the last entries in the output file are:</span><br></div><div><br></div><div><span style="font-family: courier new, monospace;"><div style="background-color: rgb(250, 250, 250); border-color: rgb(187, 187, 187); border-style: solid; border-width: 1px; overflow-wrap: break-word;" class="prettyprint"><code class="prettyprint"><div class="subprettyprint"><span style="color: #660;" class="styled-by-prettify">>></span><span style="color: #000;" class="styled-by-prettify"> tail </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">f cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">output<br><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Information</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #008;" class="styled-by-prettify">for</span><span style="color: #000;" class="styled-by-prettify"> grid number </span><span style="color: #066;" class="styled-by-prettify">10584</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Bounds</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">1</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #066;" class="styled-by-prettify">62</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">62</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Points</span><span style="color: #660;" class="styled-by-prettify">:</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">125</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Bounds</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">2</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #066;" class="styled-by-prettify">72</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">71</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Points</span><span style="color: #660;" class="styled-by-prettify">:</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">144</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Bounds</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">3</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #066;" class="styled-by-prettify">144</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">143</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Points</span><span style="color: #660;" class="styled-by-prettify">:</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">288</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Real</span><span style="color: #000;" class="styled-by-prettify"> space distribution over </span><span style="color: #066;" class="styled-by-prettify">8</span><span style="color: #000;" class="styled-by-prettify"> groups<br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Real</span><span style="color: #000;" class="styled-by-prettify"> space distribution along direction </span><span style="color: #066;" class="styled-by-prettify">2</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Border</span><span style="color: #000;" class="styled-by-prettify"> size </span><span style="color: #066;" class="styled-by-prettify">37</span><span style="color: #000;" class="styled-by-prettify"><br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Real</span><span style="color: #000;" class="styled-by-prettify"> space distribution over </span><span style="color: #066;" class="styled-by-prettify">18</span><span style="color: #000;" class="styled-by-prettify"> groups<br> RS_GRID</span><span style="color: #660;" class="styled-by-prettify">|</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Real</span><span style="color: #000;" class="styled-by-prettify"> space distribution along direction </span><span style="color: #066;" class="styled-by-prettify">3</span></div></code></div><br></span></div><div>I compiled cp2k-6.1 with the toolchain script and recently changed to openmpi 3.1.4 due to a bug in 3.1.0 (https://github.com/open-mpi/ompi/issues/5638) that caused cp2k runs to crash. (mpi on our cluster is a bit outdated that's why I'm not using it). The regtest gave 1 COMPILE WARNING, 0 FAILED/WRONG, 3015 CORRECT, 16 NEW. <br></div><div><br></div><div>I inspected one of the "frozen" cp2k.popt processes:<br></div><div></div><div><br></div><div style="background-color: rgb(250, 250, 250); border-color: rgb(187, 187, 187); border-style: solid; border-width: 1px; overflow-wrap: break-word;" class="prettyprint"><code class="prettyprint"><div class="subprettyprint"><span style="color: #660;" class="styled-by-prettify">>></span><span style="color: #000;" class="styled-by-prettify"> strace </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">fp </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #606;" class="styled-by-prettify">Process</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #000;" class="styled-by-prettify"> attached </span><span style="color: #008;" class="styled-by-prettify">with</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">3</span><span style="color: #000;" class="styled-by-prettify"> threads<br></span><span style="color: #660;" class="styled-by-prettify">[</span><span style="color: #000;" class="styled-by-prettify">pid </span><span style="color: #066;" class="styled-by-prettify">38585</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"> epoll_wait</span><span style="color: #660;" class="styled-by-prettify">(</span><span style="color: #066;" class="styled-by-prettify">10</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify"><</span><span style="color: #000;" class="styled-by-prettify">unfinished </span><span style="color: #660;" class="styled-by-prettify">...></span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">[</span><span style="color: #000;" class="styled-by-prettify">pid </span><span style="color: #066;" class="styled-by-prettify">38582</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"> restart_syscall</span><span style="color: #660;" class="styled-by-prettify">(<...</span><span style="color: #000;" class="styled-by-prettify"> resuming interrupted call </span><span style="color: #660;" class="styled-by-prettify">...></span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify"><</span><span style="color: #000;" class="styled-by-prettify">unfinished </span><span style="color: #660;" class="styled-by-prettify">...></span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">[</span><span style="color: #000;" class="styled-by-prettify">pid </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"> poll</span><span style="color: #660;" class="styled-by-prettify">([{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">5</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">},</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">15</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">}],</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">2</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">(</span><span style="color: #606;" class="styled-by-prettify">Timeout</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">[</span><span style="color: #000;" class="styled-by-prettify">pid </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"> poll</span><span style="color: #660;" class="styled-by-prettify">([{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">5</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">},</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">15</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">}],</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">2</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">(</span><span style="color: #606;" class="styled-by-prettify">Timeout</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">[</span><span style="color: #000;" class="styled-by-prettify">pid </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"> poll</span><span style="color: #660;" class="styled-by-prettify">([{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">5</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">},</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">{</span><span style="color: #000;" class="styled-by-prettify">fd</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #066;" class="styled-by-prettify">15</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> events</span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify">POLLIN</span><span style="color: #660;" class="styled-by-prettify">}],</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">2</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">=</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">(</span><span style="color: #606;" class="styled-by-prettify">Timeout</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">...</span></div></code></div><div><span style="font-family: courier new, monospace;"></span></div><div><span style="font-family: courier new, monospace;"><br></span></div><div><span style="font-family: courier new, monospace;"><span style="font-family: arial, sans-serif;">the last line repeats until I stop strace. The file descriptors are:</span><br></span></div><div><span style="font-family: courier new, monospace;"><br></span></div><div style="background-color: rgb(250, 250, 250); border-color: rgb(187, 187, 187); border-style: solid; border-width: 1px; overflow-wrap: break-word;" class="prettyprint"><code class="prettyprint"><div class="subprettyprint"><span style="color: #660;" class="styled-by-prettify">>></span><span style="color: #000;" class="styled-by-prettify"> lsof </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">p </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">...</span><span style="color: #000;" class="styled-by-prettify"><br>cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">popt </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #000;" class="styled-by-prettify"> prahe </span><span style="color: #066;" class="styled-by-prettify">5u</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0000</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #660;" class="styled-by-prettify">,</span><span style="color: #066;" class="styled-by-prettify">10</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">10625</span><span style="color: #000;" class="styled-by-prettify"> anon_inode<br></span><span style="color: #660;" class="styled-by-prettify">...</span><span style="color: #000;" class="styled-by-prettify"><br>cp2k</span><span style="color: #660;" class="styled-by-prettify">.</span><span style="color: #000;" class="styled-by-prettify">popt </span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #000;" class="styled-by-prettify"> prahe </span><span style="color: #066;" class="styled-by-prettify">15u</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">IPv4</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">158047374</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">0t0</span><span style="color: #000;" class="styled-by-prettify"> TCP </span><span style="color: #660;" class="styled-by-prettify">*:</span><span style="color: #000;" class="styled-by-prettify">polestar </span><span style="color: #660;" class="styled-by-prettify">(</span><span style="color: #000;" class="styled-by-prettify">LISTEN</span><span style="color: #660;" class="styled-by-prettify">)</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">...</span><span style="color: #000;" class="styled-by-prettify"><br><br></span></div></code></div><div><span style="font-family: courier new, monospace;"><br></span></div><div style="background-color: rgb(250, 250, 250); border-color: rgb(187, 187, 187); border-style: solid; border-width: 1px; overflow-wrap: break-word;" class="prettyprint"><code class="prettyprint"><div class="subprettyprint"><span style="color: #660;" class="styled-by-prettify">>></span><span style="color: #000;" class="styled-by-prettify"> ls </span><span style="color: #660;" class="styled-by-prettify">-</span><span style="color: #000;" class="styled-by-prettify">l </span><span style="color: #660;" class="styled-by-prettify">/</span><span style="color: #000;" class="styled-by-prettify">proc</span><span style="color: #660;" class="styled-by-prettify">/</span><span style="color: #066;" class="styled-by-prettify">38571</span><span style="color: #660;" class="styled-by-prettify">/</span><span style="color: #000;" class="styled-by-prettify">fd<br></span><span style="color: #660;" class="styled-by-prettify">...</span><span style="color: #000;" class="styled-by-prettify"><br>lrwx</span><span style="color: #660;" class="styled-by-prettify">------</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">1</span><span style="color: #000;" class="styled-by-prettify"> prahe ustudent </span><span style="color: #066;" class="styled-by-prettify">64</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Apr</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">27</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">18</span><span style="color: #660;" class="styled-by-prettify">:</span><span style="color: #066;" class="styled-by-prettify">23</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">15</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">-></span><span style="color: #000;" class="styled-by-prettify"> socket</span><span style="color: #660;" class="styled-by-prettify">:[</span><span style="color: #066;" class="styled-by-prettify">158047374</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">...</span><span style="color: #000;" class="styled-by-prettify"><br>lrwx</span><span style="color: #660;" class="styled-by-prettify">------</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">1</span><span style="color: #000;" class="styled-by-prettify"> prahe ustudent </span><span style="color: #066;" class="styled-by-prettify">64</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #606;" class="styled-by-prettify">Apr</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">27</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">18</span><span style="color: #660;" class="styled-by-prettify">:</span><span style="color: #066;" class="styled-by-prettify">23</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #066;" class="styled-by-prettify">5</span><span style="color: #000;" class="styled-by-prettify"> </span><span style="color: #660;" class="styled-by-prettify">-></span><span style="color: #000;" class="styled-by-prettify"> anon_inode</span><span style="color: #660;" class="styled-by-prettify">:[</span><span style="color: #000;" class="styled-by-prettify">eventfd</span><span style="color: #660;" class="styled-by-prettify">]</span><span style="color: #000;" class="styled-by-prettify"><br></span><span style="color: #660;" class="styled-by-prettify">...</span></div></code></div><div><span style="font-family: courier new, monospace;"><br></span></div><div>The strace output is the same for three of the 144 processes, I haven't checked the others. At this point I understand the processes are waiting for some input, but I'm unfortunately lost otherwise. Any suggestions - or since I have these issues since using openmpi 3.1.4: Is this a bad choice? <br></div><div><br></div><div>Please let me know if you need any further files/info. <br></div><div><br></div><div>Thanks in advance and best regards,</div><div>Philipp<br></div><div><br></div><div><br></div></div>