[CP2K:3713] hwloc support in cp2k-trunk
pousa.ch... at gmail.com
Thu Jan 26 08:50:00 UTC 2012
> ok. it may be version specific, too.
> [akohlmey at g002 input]$ rpm -qif /usr/lib64/libnuma.so.1
> Name : numactl Relocations: (not relocatable)
> Version : 2.0.3 Vendor: Red Hat, Inc.
> Release : 9.el6 Build Date: Thu Jun 17
> 10:46:17 2010
The version that I use is the latest stable one. But, I don't believe that
the error come from there. I still have to take a look on this libnuma
> yes, this kind of behavior is what i would have expected.
> this should also help with the internal threading in OpenMPI.
The main goal is to avoid memory allocations and access from different MPIs
on remote NUMA nodes. But, If you want to pin also threads you can try the
Linear strategy, which will pin process and threads.
> please have a look at the attached file. you'll see that there
> are some entries that don't look right. particularly the node
> names are all that of MPI rank 0.
I did some changes to fix this. Could you try the latest version of CP2K?
> yes. our MPI installation is configured by default to have a 1:1 core to
> rank mapping (since there is practically nobody yet using MPI+OpenMP)
> with memory affinity for giving people the best MPI-only performance.
Ok. So, for threads, even with this installation you can not specify their
> at the end of the attached file i include a copy of the wrapper script,
> that is OpenMPI specific (since that is the only MPI library installed).
thanks for the script.
> overall, it looks to me like that default settings are giving a desirable
> processor and memory affinity (which is great) that is consistent with
> the best settings i could get using my wrapper script, but the diagnostics
> seems to be off and may be confusing people, particularly technical
> support in computing centers, that are often too literal and assume
> that any software is always giving 100% correct information. ;-)
Now, it should work :) Let me know if you find new bugs.
Considering your machine, the cores number problem comes from the fact that
I was using the number that the OS gives to the cores. Now, I'm using the
logical ones. BTW, is your machine intel?
Christiane Pousa Ribeiro
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the CP2K-user