[CP2K-user] [CP2K:18594] Re: Install issues with IBM Power9 processors with Nvidia V100 GPU

Alfio Lazzaro alfio.lazzaro at gmail.com
Fri Mar 31 04:35:52 UTC 2023


There is still something wrong in your local_cuda.psmp file.
In your output above I cannot find the flag `-D__parallel` . Isee only the 
followings:

-D__OFFLOAD_CUDA -D__DBCSR_ACC   -D__FFTW3  -D__LIBINT -D__LIBXC 
-D__SCALAPACK -D__COSMA -D__ELPA -D__ELPA_NVIDIA_GPU -D__GSL -D__HDF5 
-D__LIBVDWXC -D__SPGLIB -D__LIBVORI -D__SPFFT  -D__OFFLOAD_GEMM  -D__SPLA 
-D__SIRIUS    -D__CUDA

So my guess is that the toolchain was not able to recognize MPI (no idea 
why). Could you add -D__parallel on top of those flags?

Il giorno venerdì 31 marzo 2023 alle 00:08:29 UTC+2 Nathan Keilbart ha 
scritto:

> Thank Alfio. I wasn't sure what file was controlling that. I updated the 
> file to have those compilers and then did a make realclean. Afterwards, I 
> am now getting this error:
>
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:192:19:
>
>              gcd_max = -1
>                    1
> Error: Symbol 'gcd_max' at (1) has no IMPLICIT type
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:193:18:
>
>              DO ipe = 1, CEILING(SQRT(REAL(npe, dp)))
>                   1
> Error: Symbol 'ipe' at (1) has no IMPLICIT type
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:194:18:
>
>                 jpe = npe/ipe
>                   1
> Error: Symbol 'jpe' at (1) has no IMPLICIT type
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:185:29:
>
>           my_blacs_grid_layout = BLACS_GRID_SQUARE
>                              1
> Error: Symbol 'my_blacs_grid_layout' at (1) has no IMPLICIT type; did you 
> mean 'blacs_grid_layout'?
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:221:25:
>
>        my_blacs_repeatable = .FALSE.
>                          1
> Error: Symbol 'my_blacs_repeatable' at (1) has no IMPLICIT type; did you 
> mean 'blacs_repeatable'?
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:213:18:
>
>        my_row_major = .TRUE.
>                   1
> Error: Symbol 'my_row_major' at (1) has no IMPLICIT type; did you mean 
> 'row_major'?
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:174:11:
>
>        npcol = 1
>            1
> Error: Symbol 'npcol' at (1) has no IMPLICIT type; did you mean 'ipcol'?
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:175:9:
>
>        npe = blacs_env%n_pid
>          1
> Error: Symbol 'npe' at (1) has no IMPLICIT type
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:173:11:
>
>        nprow = 1
>            1
> Error: Symbol 'nprow' at (1) has no IMPLICIT type; did you mean 'iprow'?
> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/src/fm/cp_blacs_env.F:188:22:
>
>           SELECT CASE (my_blacs_grid_layout)
>                       1
> Error: Argument of SELECT statement at (1) cannot be UNKNOWN
> make[3]: *** [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/Makefile:519: 
> cp_blacs_env.o] Error 1
> make[2]: *** [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/Makefile:146: all] 
> Error 2
>
> make[1]: *** [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/Makefile:128: psmp] 
> Error 2
> make: *** [Makefile:123: all] Error 2
>
> On Thursday, March 30, 2023 at 12:22:43 AM UTC-7 Alfio Lazzaro wrote:
>
>> There is no relation with the DBCSR compilation itself, you see a problem 
>> in DBCSR simply because it is the first to compile in CP2K.
>> The error message is:
>>
>> /bin/sh: c: command not found
>>
>> and indeed you are using the command
>>
>> c -fno-omit-frame-pointer -fopenmp -g -mtune=native  -O3 -funroll-loops  
>>   ...
>>
>> for compiling, therefore there is something wrong in the compiler call.
>> I think the problem is that the local_cuda.psmp file has something wrong 
>> in the definition of the compilers, namely the lines
>>
>> CC             := mpicc
>> FC             := mpif90
>> LD             := mpif90
>> AR             := ar -r
>>
>> could you check if they are linking to the rights commands?
>>
>>
>>
>>
>> Il giorno giovedì 30 marzo 2023 alle 03:12:26 UTC+2 Nathan Keilbart ha 
>> scritto:
>>
>>> Hello everyone,
>>>
>>> I've been working on installing CP2K on a system with IBM Power9 
>>> processors and Nvidia V100 GPUs. I'm using the toolchain with these options:
>>>
>>> ./install_cp2k_toolchain.sh -j --with-cmake=system --mpi-mode=openmpi 
>>> --enable-cuda --gpu-ver=V100
>>>
>>> It installs all the dependencies without any errors so that I copy over 
>>> the files to the arch folder and then source the setup file followed by
>>>
>>> make -j ARCH=local_cuda VERSION=psmp
>>>
>>> The following is some of the last lines of output
>>>
>>> /usr/bin/env python3 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/tools/build_utils/fypp/bin/fypp 
>>> -n --line-marker-format=gfortran5 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/dbcsr_tensor_test.F 
>>> dbcsr_tensor_test.F90
>>> c -fno-omit-frame-pointer -fopenmp -g -mtune=native  -O3 -funroll-loops 
>>>   
>>>  -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/openblas-0.3.21/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/fftw-3.3.10/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libint-v2.6.0-cp2k-lmax-5/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libxc-6.0.0/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/COSMA-2.6.2/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/modules' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/elpa' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/gsl-2.7/include' 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/hdf5-1.12.0/include 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libvdwxc-0.4.0/include 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/spglib-1.16.2/include 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpFFT-1.0.6/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpLA-1.5.4/include/spla' 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/sirius-7.3.2/include/cuda 
>>> -fbacktrace -ffree-form -fimplicit-none -std=f2008  -Werror=aliasing 
>>> -Werror=ampersand -Werror=c-binding-type -Werror=intrinsic-shadow 
>>> -Werror=intrinsics-std -Werror=line-truncation -Werror=tabs 
>>> -Werror=target-lifetime -Werror=underflow -Werror=unused-but-set-variable 
>>> -Werror=unused-variable -Werror=unused-dummy-argument -Werror=conversion 
>>> -Werror=zerotrip -Wno-maybe-uninitialized -Wuninitialized 
>>> -Wuse-without-only  -D__OFFLOAD_CUDA -D__DBCSR_ACC   -D__FFTW3  -D__LIBINT 
>>> -D__LIBXC -D__SCALAPACK -D__COSMA -D__ELPA -D__ELPA_NVIDIA_GPU -D__GSL 
>>> -D__HDF5 -D__LIBVDWXC -D__SPGLIB -D__LIBVORI -D__SPFFT  -D__OFFLOAD_GEMM 
>>>  -D__SPLA -D__SIRIUS    -D__CUDA -D__SHORT_FILE__="\"dbcsr_tensor_test.F\"" 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src' 
>>> dbcsr_tensor_test.F90 
>>> /bin/sh: c: command not found
>>> make[4]: 
>>> [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:258: 
>>> dbcsr_tensor_test.o] Error 127 (ignored)
>>> /usr/bin/env python3 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/tools/build_utils/fypp/bin/fypp 
>>> -n --line-marker-format=gfortran5 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/dbcsr_tensor_api.F 
>>> dbcsr_tensor_api.F90
>>> c -fno-omit-frame-pointer -fopenmp -g -mtune=native  -O3 -funroll-loops 
>>>   
>>>  -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/openblas-0.3.21/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/fftw-3.3.10/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libint-v2.6.0-cp2k-lmax-5/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libxc-6.0.0/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/COSMA-2.6.2/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/modules' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/elpa-2022.11.001/nvidia/include/elpa_openmp-2022.11.001/elpa' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/gsl-2.7/include' 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/hdf5-1.12.0/include 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/libvdwxc-0.4.0/include 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/spglib-1.16.2/include 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpFFT-1.0.6/include' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/SpLA-1.5.4/include/spla' 
>>> -I/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/tools/toolchain/install/sirius-7.3.2/include/cuda 
>>> -fbacktrace -ffree-form -fimplicit-none -std=f2008  -Werror=aliasing 
>>> -Werror=ampersand -Werror=c-binding-type -Werror=intrinsic-shadow 
>>> -Werror=intrinsics-std -Werror=line-truncation -Werror=tabs 
>>> -Werror=target-lifetime -Werror=underflow -Werror=unused-but-set-variable 
>>> -Werror=unused-variable -Werror=unused-dummy-argument -Werror=conversion 
>>> -Werror=zerotrip -Wno-maybe-uninitialized -Wuninitialized 
>>> -Wuse-without-only  -D__OFFLOAD_CUDA -D__DBCSR_ACC   -D__FFTW3  -D__LIBINT 
>>> -D__LIBXC -D__SCALAPACK -D__COSMA -D__ELPA -D__ELPA_NVIDIA_GPU -D__GSL 
>>> -D__HDF5 -D__LIBVDWXC -D__SPGLIB -D__LIBVORI -D__SPFFT  -D__OFFLOAD_GEMM 
>>>  -D__SPLA -D__SIRIUS    -D__CUDA -D__SHORT_FILE__="\"dbcsr_tensor_api.F\"" 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src/tensors/' 
>>> -I'/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/dbcsr/src' 
>>> dbcsr_tensor_api.F90 
>>> /bin/sh: c: command not found
>>> make[4]: 
>>> [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:258: 
>>> dbcsr_tensor_api.o] Error 127 (ignored)
>>> Updating archive 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a
>>> ar: creating 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a
>>> ar: dbcsr_cuda_profiling.o: No such file or directory
>>> make[4]: *** 
>>> [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr//Makefile:330: 
>>> /usr/gapps/qsg/codes/cp2k/lassen/v2023.1/lib/local_cuda/psmp/exts/dbcsr/libdbcsr.a] 
>>> Error 1
>>> make[3]: *** 
>>> [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/build_dbcsr/Makefile:179: 
>>> libdbcsr] Error 2
>>> make[2]: *** 
>>> [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/exts/Makefile.inc:38: dbcsr] 
>>> Error 2
>>> make[1]: *** [/usr/gapps/qsg/codes/cp2k/lassen/v2023.1/Makefile:128: 
>>> psmp] Error 2
>>> make: *** [Makefile:123: all] Error 2
>>>
>>> It seems that it is having issues with the DBCSR module. I initially had 
>>> an issue with this because I seemed to have left off the --recursive option 
>>> and after making sure my git clone had that it at least let me build most 
>>> of the serial version. It at least gave me the cp2k.sopt binary and it 
>>> seems to at least take inputs. I didn't have a chance to test it too much 
>>> yet. When I got this binary I had done 
>>>
>>> make -j ARCH=local_cuda VERSION="ssmp sdbg psmp pdbg"
>>>
>>> as suggested.
>>>
>>> Also, I've attempted to install with spack by using
>>>
>>> spack install 
>>> cp2k at 2023.1+cosma+cuda+elpa+libint+libxc+mpi+openmp+pexsi+plumed+sirius+spglib 
>>> smm=blas cuda_arch=70
>>>
>>> These are some of the last lines of output
>>>
>>>  >> 4028    collect2: error: ld returned 1 exit status
>>>   >> 4029    collect2: error: ld returned 1 exit status
>>>   >> 4030    make[3]: *** 
>>> [/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/obj/linux-rhel7-power9le-gcc/psmp/
>>>              all.dep:178: 
>>> /tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/exe/linux-rhel7-power9le-gcc/cp2k.p
>>>              smp] Error 1
>>>      4031    make[3]: *** Waiting for unfinished jobs....
>>>   >> 4032    make[3]: *** 
>>> [/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/obj/linux-rhel7-power9le-gcc/psmp/
>>>              all.dep:194: 
>>> /tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/exe/linux-rhel7-power9le-gcc/libcp2
>>>              k_unittest.psmp] Error 1
>>>   >> 4033    make[2]: *** 
>>> [/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/Makefile:146: 
>>> all] Error 2
>>>   >> 4034    make[1]: *** 
>>> [/tmp/keilbart/spack-stage/spack-stage-cp2k-2023.1-24dhoyt24tbnn4d423glgoeqqquibmb6/spack-src/Makefile:128: 
>>> psmp] Error 2
>>>   >> 4035    make: *** [Makefile:123: all] Error 2
>>>
>>> Finally, I also have some intel machines that I'm attempting to build on 
>>> and having issues as well but we can start with the IBM machine as we're 
>>> hoping to accelerate the simulations with the GPU.
>>>
>>> Please let me know what other information I can provide. Thank you.
>>>
>>> Nathan
>>>
>>

-- 
You received this message because you are subscribed to the Google Groups "cp2k" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cp2k+unsubscribe at googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/cp2k/fdc0d095-3de8-4fa5-a6c1-0d4d48af1a7cn%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.cp2k.org/archives/cp2k-user/attachments/20230330/1bb8a890/attachment-0001.htm>


More information about the CP2K-user mailing list