[CP2K-user] [CP2K:18588] Install issues with IBM Power9 processors with Nvidia V100 GPU
Nathan Keilbart
nathankeilbart at gmail.com
Thu Mar 30 01:12:26 UTC 2023
Hello everyone,
I've been working on installing CP2K on a system with IBM Power9 processors
and Nvidia V100 GPUs. I'm using the toolchain with these options:
./install_cp2k_toolchain.sh -j --with-cmake=system --mpi-mode=openmpi
--enable-cuda --gpu-ver=V100
It installs all the dependencies without any errors so that I copy over the
files to the arch folder and then source the setup file followed by
make -j ARCH=local_cuda VERSION=psmp
The following is some of the last lines of output
/usr/bin/env python3
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/tools/build_utils/fypp/bin/fypp
-n --line-marker-format=gfortran5
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/dbcsr_tensor_test.F
dbcsr_tensor_test.F90
c -fno-omit-frame-pointer -fopenmp -g -mtune=native -O3 -funroll-loops
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/openblas-0.3.21/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/fftw-3.3.10/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libint-v2.6.0-cp2k-lmax-5/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libxc-6.0.0/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/COSMA-2.6.2/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/modules'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/elpa'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/gsl-2.7/include'
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/hdf5-1.12.0/include
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libvdwxc-0.4.0/include
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/spglib-1.16.2/include
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpFFT-1.0.6/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpLA-1.5.4/include/spla'
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/sirius-7.3.2/include/cuda
-fbacktrace -ffree-form -fimplicit-none -std=f2008 -Werror=aliasing
-Werror=ampersand -Werror=c-binding-type -Werror=intrinsic-shadow
-Werror=intrinsics-std -Werror=line-truncation -Werror=tabs
-Werror=target-lifetime -Werror=underflow -Werror=unused-but-set-variable
-Werror=unused-variable -Werror=unused-dummy-argument -Werror=conversion
-Werror=zerotrip -Wno-maybe-uninitialized -Wuninitialized
-Wuse-without-only -D__OFFLOAD_CUDA -D__DBCSR_ACC -D__FFTW3 -D__LIBINT
-D__LIBXC -D__SCALAPACK -D__COSMA -D__ELPA -D__ELPA_NVIDIA_GPU -D__GSL
-D__HDF5 -D__LIBVDWXC -D__SPGLIB -D__LIBVORI -D__SPFFT -D__OFFLOAD_GEMM
-D__SPLA -D__SIRIUS -D__CUDA -D__SHORT_FILE__="\"dbcsr_tensor_test.F\""
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src'
dbcsr_tensor_test.F90
/bin/sh: c: command not found
make[4]:
[/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:258:
dbcsr_tensor_test.o] Error 127 (ignored)
/usr/bin/env python3
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/tools/build_utils/fypp/bin/fypp
-n --line-marker-format=gfortran5
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/dbcsr_tensor_api.F
dbcsr_tensor_api.F90
c -fno-omit-frame-pointer -fopenmp -g -mtune=native -O3 -funroll-loops
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/openblas-0.3.21/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/fftw-3.3.10/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libint-v2.6.0-cp2k-lmax-5/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libxc-6.0.0/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/COSMA-2.6.2/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/modules'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/elpa'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/gsl-2.7/include'
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/hdf5-1.12.0/include
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libvdwxc-0.4.0/include
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/spglib-1.16.2/include
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpFFT-1.0.6/include'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpLA-1.5.4/include/spla'
-I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/sirius-7.3.2/include/cuda
-fbacktrace -ffree-form -fimplicit-none -std=f2008 -Werror=aliasing
-Werror=ampersand -Werror=c-binding-type -Werror=intrinsic-shadow
-Werror=intrinsics-std -Werror=line-truncation -Werror=tabs
-Werror=target-lifetime -Werror=underflow -Werror=unused-but-set-variable
-Werror=unused-variable -Werror=unused-dummy-argument -Werror=conversion
-Werror=zerotrip -Wno-maybe-uninitialized -Wuninitialized
-Wuse-without-only -D__OFFLOAD_CUDA -D__DBCSR_ACC -D__FFTW3 -D__LIBINT
-D__LIBXC -D__SCALAPACK -D__COSMA -D__ELPA -D__ELPA_NVIDIA_GPU -D__GSL
-D__HDF5 -D__LIBVDWXC -D__SPGLIB -D__LIBVORI -D__SPFFT -D__OFFLOAD_GEMM
-D__SPLA -D__SIRIUS -D__CUDA -D__SHORT_FILE__="\"dbcsr_tensor_api.F\""
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/'
-I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src'
dbcsr_tensor_api.F90
/bin/sh: c: command not found
make[4]:
[/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:258:
dbcsr_tensor_api.o] Error 127 (ignored)
Updating archive
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a
ar: creating
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a
ar: dbcsr_cuda_profiling.o: No such file or directory
make[4]: ***
[/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:330:
/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a]
Error 1
make[3]: ***
[/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr/Makefile:179:
libdbcsr] Error 2
make[2]: ***
[/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/Makefile.inc:38: dbcsr]
Error 2
make[1]: *** [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/Makefile:128: psmp]
Error 2
make: *** [Makefile:123: all] Error 2
It seems that it is having issues with the DBCSR module. I initially had an
issue with this because I seemed to have left off the --recursive option
and after making sure my git clone had that it at least let me build most
of the serial version. It at least gave me the cp2k.sopt binary and it
seems to at least take inputs. I didn't have a chance to test it too much
yet. When I got this binary I had done
make -j ARCH=local_cuda VERSION="ssmp sdbg psmp pdbg"
as suggested.
Also, I've attempted to install with spack by using
spack install
cp2k at 2023.1+cosma+cuda+elpa+libint+libxc+mpi+openmp+pexsi+plumed+sirius+spglib
smm=blas cuda_arch=70
These are some of the last lines of output
>> 4028 collect2: error: ld returned 1 exit status
>> 4029 collect2: error: ld returned 1 exit status
>> 4030 make[3]: ***
[/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/obj/linux-rhel7-power9le-gcc/psmp/
all.dep:178:
/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/exe/linux-rhel7-power9le-gcc/cp2k.p
smp] Error 1
4031 make[3]: *** Waiting for unfinished jobs....
>> 4032 make[3]: ***
[/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/obj/linux-rhel7-power9le-gcc/psmp/
all.dep:194:
/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/exe/linux-rhel7-power9le-gcc/libcp2
k_unittest.psmp] Error 1
>> 4033 make[2]: ***
[/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/Makefile:146:
all] Error 2
>> 4034 make[1]: ***
[/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/Makefile:128:
psmp] Error 2
>> 4035 make: *** [Makefile:123: all] Error 2
Finally, I also have some intel machines that I'm attempting to build on
and having issues as well but we can start with the IBM machine as we're
hoping to accelerate the simulations with the GPU.
Please let me know what other information I can provide. Thank you.
Nathan
--
You received this message because you are subscribed to the Google Groups "cp2k" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cp2k+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/cp2k/0ba3ac06-1cf6-44ef-a626-bc12491acc86n%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.cp2k.org/archives/cp2k-user/attachments/20230329/2bfd43bf/attachment.htm>
More information about the CP2K-user
mailing list