[CP2K-user] [CP2K:22209] Re: Regarding improvement of speed calculation HSE06

Frederick Stein f.stein at hzdr.de
Tue Apr 28 15:18:45 UTC 2026


Just in case, be aware that MAX_MEMORY is per rank, i.e. you need to 
increase the memory if you use more OpenMP threads in favor of MPI ranks. 
Do you also have a timing report? This may also help to investigate. 
Usually, CUTOFF and REL_CUTOFF are less important with the 
Hartree-Fock-kernel, it is commonly more about EPS_DEFAULT, EPS_PGF_ORB or 
EPS_SCHWARZ.

Lorenzo Lagasco schrieb am Dienstag, 28. April 2026 um 16:30:01 UTC+2:

> Ok, maybe I can test a bit more carefully the EPS_DEFAULT and EPS_ORB (I 
> have already launched different tests for checking convergence on 
> NGRIDS,CUTOFF and REL_CUTOFF). Meanwhile, this is my slurm submission 
> script:
> #!/bin/bash
> #SBATCH --job-name=HSEO6
> #SBATCH --nodes=1
> #SBATCH --ntasks-per-node=52
> #SBATCH --partition=taras2-6230r
>
> module purge
> module load cp2k/may2025-gnu14.2.0-openmpi4.1.6-psm211.2.230
> export OMP_NUM_THREADS=1
>
> mpirun cp2k.popt -i DYE+SLAB.inp -o DYE+SLAB.out
>
>
> I tried to increase the number of OMP_NUM_THREADS from 1 to 2 or 3 and, as 
> a result, the time required for each scf step is longer and longer. 
> Moreover, I regulated the MAX_MEMORY in relation to the memory free of the 
> node. 
>
> Il giorno mar 28 apr 2026 alle ore 16:00 Frederick Stein <f.s... at hzdr.de> 
> ha scritto:
>
>> Dear Lorenzo,
>> Are you sure that you need these tight cutoffs (EPS_DEFAULT and 
>> especially EPS_PGF_ORB)? Can you check how many integrals are recalculated 
>> (check the output file)? If they are, is it possible for you to increase 
>> MAX_MEMORY? How are you running CP2K (number of MPI ranks, number of OpenMP 
>> threads)?
>> Best,
>> Frederick
>>
>> Lorenzo Lagasco schrieb am Dienstag, 28. April 2026 um 15:43:45 UTC+2:
>>
>>> Good afternoon everyone,
>>>
>>> I am writing to ask for advice on how to improve the computational 
>>> efficiency of an HSE06 calculation used to evaluate inter-state couplings 
>>> between diabatic states in a slab–organic dye system (using the Kondov 
>>> diabatization scheme, system containing 395 atoms). I have attached the 
>>> input file for reference. In a preliminary test on a CPU-only machine 
>>> (single node with 52 processors), a single SCF iteration takes 
>>> approximately 30 minutes.
>>>
>>> Any suggestions would be greatly appreciated.
>>>
>>> Best regards
>>>
>>> Lorenzo Lagasco
>>>
>>>
>>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "cp2k" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to cp2k+uns... at googlegroups.com.
>> To view this discussion visit 
>> https://groups.google.com/d/msgid/cp2k/439c2d51-5c6f-4fc9-be1b-1ff2884ab77an%40googlegroups.com 
>> <https://groups.google.com/d/msgid/cp2k/439c2d51-5c6f-4fc9-be1b-1ff2884ab77an%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups "cp2k" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cp2k+unsubscribe at googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/cp2k/6149761d-bdef-4923-9d4f-09b2da904dbdn%40googlegroups.com.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.cp2k.org/archives/cp2k-user/attachments/20260428/2101e93a/attachment-0001.htm>


More information about the CP2K-user mailing list