[CP2K-user] [CP2K:17966] Re: cudaErrorStubLibrary runtime error in CP2K
Amit Gupta
amit.welcomes.u at gmail.com
Mon Oct 31 15:14:26 UTC 2022
Could it be a compile-time issue on my end? I remember the toolchain
requiring libcuda for linking (for COSMA, I believe), and I found issues in
several GitHub repositories mentioning that libcuda resides in
lib/stubs/libcuda.so. So I linked against it and provided its location at
runtime in LD_LIBRARY_PATH.
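
A minimal sketch of how the runtime side of this could be checked. The stub libcuda.so shipped with the CUDA toolkit is meant only to satisfy the linker; at run time the loader should pick up the libcuda.so.1 installed by the NVIDIA driver. The helper name strip_stub_dirs is hypothetical (not part of CP2K or CUDA), and it assumes the stub directory is the only LD_LIBRARY_PATH entry ending in "/stubs".

```shell
# Hypothetical helper (not from CP2K or CUDA): print a colon-separated
# path list with every entry ending in "/stubs" (or "/stubs/") removed.
strip_stub_dirs() {
    printf '%s' "$1" \
        | tr ':' '\n' \
        | grep -v '/stubs/*$' \
        | paste -sd ':' -
}
```

On the compute node one might then run `export LD_LIBRARY_PATH="$(strip_stub_dirs "$LD_LIBRARY_PATH")"` followed by `ldd ./cp2k.psmp | grep libcuda` (the binary path is an assumption) and check that the resolved libcuda.so.1 does not live under a stubs directory. If nothing resolves, the driver library is probably not installed on that node.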
On Monday, October 31, 2022 at 3:55:13 AM UTC-5 Ole Schütt wrote:
> Hi Amit,
>
> if you are getting cudaErrorStubLibrary then the app failed to load the
> real driver. Presumably your local admin can help with this. See also the
> CUDA docs
> <https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__TYPES.html#group__CUDART__TYPES_1g3f51e3575c2178246db0a94a430e0038>
> :
>
> cudaErrorStubLibrary = 34 This indicates that the CUDA driver that the
> application has loaded is a stub library. Applications that run with the
> stub rather than a real driver loaded will result in CUDA API returning
> this error.
>
> -Ole
>
> On Sunday, October 30, 2022 at 9:59:11 PM UTC+1 Amit Gupta wrote:
>
>> Hi,
>> I compiled a CUDA compatible version of CP2K, but when I submit my jobs,
>> I get the following error:
>> --------------------------------------------------------------------------
>> ERROR: cudaErrorStubLibrary
>> /scratch/ag9288/softwares/cp2k-2022.2/src/offload/offload_library.c 49
>> forrtl: error (76): Abort trap signal
>> Image              PC                Routine            Line     Source
>> cp2k.psmp          0000000006B83CAB  Unknown            Unknown  Unknown
>> libpthread-2.28.s  000014D73245EB20  Unknown            Unknown  Unknown
>> libc-2.28.so       000014D712D4037F  gsignal            Unknown  Unknown
>> libc-2.28.so       000014D712D2ADB5  abort              Unknown  Unknown
>> cp2k.psmp          00000000031F8C29  offload_get_devic  49       offload_library.c
>> cp2k.psmp          0000000001387F46  f77_interface_mp_  276      f77_interface.F
>> cp2k.psmp          000000000042281D  MAIN__             284      cp2k.F
>> cp2k.psmp          0000000006BED096  Unknown            Unknown  Unknown
>> libc-2.28.so       000014D712D2C493  __libc_start_main  Unknown  Unknown
>> cp2k.psmp          000000000042195E  Unknown            Unknown  Unknown
>> --------------------------------------------------------------------------
>>
>> It looks like the executable is unable to find the card, while all the
>> libraries are correctly loaded.
>>
>> Version:
>> Intel fortran: 19.1
>> cuda : 11.3
>> openmpi 4.1.1
>> CP2K: 2022.2
>>
>> Steps I took to build the binary:
>> 1. Install the toolchain:
>> ./install_cp2k_toolchain.sh --with-intel=system --enable-cuda
>> --gpu-ver=V100 --with-openmpi=system --with-libxc=install
>> --with-libint=install --with-fftw=install --with-mkl=system
>> --with-libxsmm=install --with-elpa=install --with-ptscotch=no
>> --with-superlu=no --with-pexsi=no --with-quip=no
>> --with-plumed=no --with-sirius=no --with-gsl=no
>> --with-libvdwxc=no --with-spglib=no --with-hdf5=no --with-spfft=no
>> --with-spla=install --with-cosma=install --with-libvori=no
>> --with-openblas=no -j4
>>
>> 2. Manually build and copy Libint, to work around the Fortran makefile bug:
>> https://github.com/cp2k/libint-cp2k/releases
>> cmake -DCMAKE_C_COMPILER=icc -DCMAKE_C_FLAGS="-qopenmp -O2"
>> -DCMAKE_CXX_COMPILER=icpc -DCMAKE_CXX_FLAGS="-qopenmp -O2"
>> -DCMAKE_Fortran_COMPILER=ifort -DCMAKE_Fortran_FLAGS="-qopenmp -O2"
>> -DREQUIRE_CXX_API=OFF -DENABLE_FORTRAN=ON ..
>>
>> 3. Apply required CUDA patch to CP2K and compile:
>> https://github.com/cp2k/cp2k/commit/ee6c3aa.patch
>> make -j 4 ARCH=local_cuda VERSION="ssmp sdbg psmp pdbg"
>>
>> PS: The job node is different from the compile node; could that be an
>> issue? Also, while the compilation went fine, there was an error in
>> symlinking the binaries.
>>
>