<br><br>On Tuesday, July 12, 2011 6:37:36 PM UTC-4, Maria Shadrina wrote:<blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;">The CPU is Intel-Xeon-E5462 (Harpertown). Is that normal situation for
<br>this type of xeon processors?
<br></blockquote><div><br></div><div><br></div><div>yes, as i already mentioned. those processors are extremely limited</div><div>in their available memory bandwidth by two CPUs having to share a </div><div>single memory controller per mainboard. all communication and data</div><div>access has to go through this single bottleneck.</div><div><br></div><div>typically, the memory bandwidth hungry applications (like cp2k but<br></div><div>not limited to it) are quite efficiently run with 4 cores per (dual processor)</div><div>node. this does require the use of processor affinity, though, so that MPI </div><div>tasks are evenly spread out across the cpus and cores. the reason for</div><div>this lies in the design of the CPU which is not a single quad-core CPU </div><div>but two dual core CPUs glued into one case. each of the dual-core CPUs</div><div>has a fairly large L2-cache which will help to make memory accesses</div><div>more efficient. by using only half the cores, you effectively double the</div><div>size of the cache and that helps massively. thus you are actually using</div><div>most of all CPUs and wasting less than you might think. sometimes,</div><div>the impact of the communication to the memory bandwidth contention</div><div>can be reduced by using a hybrid OpenMP+MPI binary, but i never tried</div><div>it for harpertown cpus.</div><div><br></div><div>cheers,</div><div> axel.</div><div> </div><blockquote class="gmail_quote" style="margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;">
<br>Best regards, Maria.
<br>
<br>
<br>>
<br>> this happens a lot with CPUs that have very limited memory bandwith.
<br>> the quickstep algorithm is very demanding in terms of memory bandwidth.
<br>>
<br>>
<br>>
<br>> what type of xeon processors exactly? that makes all the difference.
<br>> the 56xx (westmere) and 55xx (nehalem) series ones for example have
<br>> _much_ more memory bandwidth than 54xx (harpertown) series ones.
<br>>
<br>> cheers,
<br>> axel.
<br></blockquote>