libdbcsr, MPI error
nadler
rod... at gmx.ch
Thu Nov 25 14:53:52 UTC 2010
The error still comes up after several intents. Current version
installed is 2.2.45. Furthermore, as mentioned in the first post:
independently of the number of cpus chosen, the execution stops after
a certain amount of cpu hours. In my case it is around 100 +/-10
hours; the guy from the support team told me that the same happens to
him after 163 +/-2 cpu hours, using the same input file I am using.
Any ideas about what could be the problem? Following, I put
informations about the clusters, compiler and current archfile.
Thanks!
Compiler: IBM XL Fortran for Linux, V12.1
The information about the machines:
Once, 1036 nodes of eServer BladeCenter JS20, having 2 PPC cpus
(2.2GHz) with 4GB RAM per node.
Then, 168 nodes of eServer BladeCenter JS21, having 4 PPC cpus
(2.3GHz) with 8GB RAM.
Communication occurs via Myrinet.
The current archfile is:
PERL = perl
CC = xlc
CPP = cpp
FC = xlf95_r -qsuffix=f=F
LD = xlf95_r
AR = ar -r
DFLAGS = -D__AIX -D__ESSL -D__FFTSG -D__FFTW3 -D__parallel -D__BLACS
-D__SCALAPACK -D__LIBINT
CPPFLAGS = -C $(DFLAGS) -P -traditional \
-I/gpfs/apps/FFTW/3.2.1/64/include
FCFLAGS = -O2 -qstrict -q64 -qarch=ppc970 -qcache=auto -qmaxmem=-1 -
qtune=ppc970 \
-I/gpfs/apps/FFTW/3.2.1/64/include \
-I/gpfs/apps/LIBINT/1.1.4/64/include \
-I/gpfs/apps/MPICH2/mx/1.0.8p1..3/64/include
FCFLAGS2 = -O0 -qstrict -q64 -qarch=ppc970 -qcache=auto -qmaxmem=-1 -
qtune=ppc970 \
-I/gpfs/apps/FFTW/3.2.1/64/include \
-I/gpfs/apps/LIBINT/1.1.4/64/include \
-I/gpfs/apps/MPICH2/mx/1.0.8p1..3/64/include
LDFLAGS = $(FCFLAGS) \
-L/gpfs/apps/LAPACK/3.2.1/64/lib \
-L/gpfs/apps/SCALAPACK/1.8/mpich2/64 \
-L/gpfs/apps/FFTW/3.2.1/64/lib \
-L/gpfs/apps/LIBINT/1.1.4/64/lib \
-L/opt/ibmcmp/xlmass/5.0/lib64 \
-L/gpfs/apps/MPICH2/mx/1.0.8p1..3/64/lib \
-L/gpfs/apps/MPICH2/slurm/64/lib \
-L/opt/osshpc/mx/lib64 \
-L/usr/lib64
LIBS = -lscalapack \
/gpfs/apps/SCALAPACK/1.8/mpich2/64/blacs.a \
-lmass_64 \
-lmpich -lpmi -lmyriexpress -lpthread \
-llapack -lessl -lfftw3f -lfftw3 -lint -lderiv
OBJECTS_ARCHITECTURE = machine_aix.o
### To speed up compilation time ###
pint_types.o: pint_types.F
$(FC) -c $(FCFLAGS2) $<
md_run.o: md_run.F
$(FC) -c $(FCFLAGS2) $<
kg_energy.o: kg_energy.F
$(FC) -c $(FCFLAGS2) $<
integrator.o: integrator.F
$(FC) -c $(FCFLAGS2) $<
geo_opt.o: geo_opt.F
$(FC) -c $(FCFLAGS2) $<
qmmm_init.o: qmmm_init.F
$(FC) -c $(FCFLAGS2) $<
cp2k_runs.o: cp2k_runs.F
$(FC) -c $(FCFLAGS2) $<
mc_ensembles.o: mc_ensembles.F
$(FC) -c $(FCFLAGS2) $<
ep_methods.o: ep_methods.F
$(FC) -c $(FCFLAGS2) $<
mc_ge_moves.o: mc_ge_moves.F
$(FC) -c $(FCFLAGS2) $<
force_env_methods.o: force_env_methods.F
$(FC) -c $(FCFLAGS2) $<
cp_lbfgs_optimizer_gopt.o: cp_lbfgs_optimizer_gopt.F
$(FC) -c $(FCFLAGS2) $<
mc_types.o: mc_types.F
$(FC) -c $(FCFLAGS2) $<
f77_interface.o: f77_interface.F
$(FC) -c $(FCFLAGS2) $<
mc_moves.o: mc_moves.F
$(FC) -c $(FCFLAGS2) $<
More information about the CP2K-user
mailing list