Energy is NaN with CUFFT
Axel
akoh... at gmail.com
Tue Mar 17 15:01:20 UTC 2009
On Mar 17, 7:16 am, Ondrej Marsalek <ondrej.... at gmail.com> wrote:
> Hi again,
>
> > hi,
>
> > i don't see this. just recompiled on the tesla machine at uiuc.
>
> > as usual, please provide the smallest possible input that reproduces
> > it,
> > as well as the compiler flags and hardware specs that you were using.
> > otherwise it is impossible to track it down.
>
> It happens only in parallel runs. I get the same kind of trouble with
> the Ar.inp example (as well as my own inputs) when run in parallel. In
> serial, it all seems to work. Can anyone else reproduce this?
there is no support for CUDA with parallel compiles.
axel.
>
> I use a single node with two dual core Opterons, OpenMPI 1.3 and the
> Intel compiler, version 10.1.018. The arch file used is below.
>
> Best,
> Ondrej
>
> NVCC = nvcc
> CC = mpicc
> CPP =
> FC = mpif90
> LD = mpif90
> AR = ar -r
> DFLAGS = -D__INTEL -D__FFTSG -D__parallel -D__BLACS -D__SCALAPACK
> -D__CUDA -D__FFTSGL
> CPPFLAGS =
> FCFLAGS = $(DFLAGS) -I$(INCLUDE) -fpp -free -O2 -unroll -pc64
> -heap-arrays 64 -march=pentium3 -mtune=core2
> NVFLAGS = $(DFLAGS)
> LDFLAGS = $(FCFLAGS) -static-intel
>
> LIBS = -L$(LIBRARY_PATH) \
> -lmkl_scalapack_lp64 \
> -lmkl_blacs_openmpi_lp64 \
> -lmkl_intel_lp64 \
> -lmkl_sequential \
> -lmkl_core \
> -L/usr/local/cuda/lib \
> -lcudart \
> -lcufft \
> -lcublas \
> -lstdc++
>
> OBJECTS_ARCHITECTURE = machine_intel.o
More information about the CP2K-user
mailing list