<div dir="ltr">Hi Iain,<div><br></div><div>It is really helpful !</div><div>I'll respond/comment point by point if you allow me.</div><div><br></div><div>Points 1 & 4/5 :</div><div><br></div><div>I intended to use the regtest as a base test for optimising CP2K on our cluster. </div><div>First goal was to have a working "default" build. And then add more enhancement: the two you mentioned (libgrid and libsmm), also testing ELPA. (and maybe later move on onto MPI/OpenMP mostly for HF and memory needed for integrals)<br></div><div>So if -O2 it's not the determining factor in speed vs -O3 with ifort, I will stick to -O2 and some -O1/O0</div><div><br></div><div>Point 2:</div><div><br></div><div>Since we have intel procs, intel mkl/mpi I though intel ifort was the choice against gfortran. Maybe I was also dupped by intel advertising: "on intel machine intel is the best". And you know. if it costs money it should be better (Guess I was very wrong in this case and also relieved they didn't bought it for that.) and maybe because I'm <i>just</i> a theoretical chemist and this a new world (compilation/optimisation vs utilisation) but I'm learning ! </div><div><br></div><div>Maybe after, the OpenMP part, i'll go with gfortran (and still Intel MKL/MPI because they are better than other BLAS...ScaLAPACK stack/OpenMPI ?)</div><div><br></div><div>Point 3: </div><div><br></div><div>Yes, I was confused to when I reread (what arch result in what regtest) : "So I assume some files...-> Fail segfault", I ment some file has to be compiled with -O2 (for the -O3 to work) but since there is little difference in speed (what I see from benchmarking quickly and from your tests also), I'll start for all in -O2 and test with the 4 files you provided, and one by one. I know that qs_vxc_atom.F is vital. And I'll start from here.</div><div><br></div><div>From the tests I already ran, I've found the H2-BECKE-MD.inp test need et_coupling.F set to -O2 (instead of -O3) to avoid FAILED but if all is in -O2 I guess this problem is no more ! (And also find the others, and if they are dependant/independant)</div><div><br></div><div>Point 4:</div><div><br></div><div>Ok. But I was surprised about that. </div><div><br></div><div>For exemple the 4 NEB tests :</div><div>Correct in -O1</div><div>Wrong in -O2 (or -O3)</div><div>compared to EPCC Hydra cluster popt</div><div>But</div><div>Wrong in -O1</div><div>Correct in -O2 (or -O3)</div><div>compared to Sheffield Iceberg cluster popt</div><div><br></div><div>So... I was confused. </div><div><br></div><div>And back to point 5:</div><div><br></div><div>I'll test against that also. Thanks for the advice.</div><div><br></div><div>I have some long simulation (AIMD, good correlation with experience) to check after, but the reg-test was in a way of testing segfault/large error quickly to put a "Warning".</div><div>If no WRONG result be careful. If one WRONG be careful as hell for this type of calculation. But mostly to check if I don't break old thing by changing libs/optimisation and maybe even code.</div><div><br></div><div><br></div><div>So I'll stick with ifort -O2, maybe later test gfortran -O3.</div><div><br></div><div>Anyway thanks for throwing some (a lot of !) light on my problems/questions.</div><div><br></div><div>Kind regards</div><div><br></div><div>Rolf</div></div>