FFT, CUDA, and Parallelization
Tyler Gubb
tag... at gmail.com
Tue Feb 25 05:09:32 UTC 2014
Hello,
I am curious what the current CUDA support is (i.e. for the development
trunk and future development). I have a linux box with two Quadro cards
and would like to run simulations with FFT and possibly matrices on the
cards. Attempts to compile MPI parallel + CUDA support have gone
surprisingly swimmingly. However the poor cards are quickly overwhelmed
for memory as DBCSR loads (Intel MPI is being used and each process takes a
chunk), but the Global/CUDA/Memory flag was removed after revision 13531.
Strangely, running the executable compiled without the -D__DBCSR_CUDA flag
results in the same error.
On a somewhat related note, the computer in question is not in a cluster.
Is OpenMP thus the optimal method for running in parallel (versus MPI)?
If so, then how would CUDA be utilized (assuming it can be)?
Lastly, I'd like to give a sincere thank you to the developers for their
hard work and dedication to the CP2K project.
Cheers,
T. Gubb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.cp2k.org/archives/cp2k-user/attachments/20140224/b890e376/attachment.htm>
More information about the CP2K-user
mailing list