CUDA memory transfer at runtime

Question

CUDA memory transfer at runtime

I know that CUDA cores can be “overlapped” by putting them in separate threads, but I wonder if memory can be transferred at runtime. CUDA kernels are asynchronous after

+3

c ++ cuda pci nvidia

paulAl Apr 19 '12 at 21:47

source share

2 answers

For clarification only, the above are only valid if your device supports it. You can check it running a request to the device and checking for a parallel copy and attribute execution

+1

amanda Apr 20 '12 at 16:07

source share

Roger dahl · Accepted Answer · 2012-04-19T22:23:41+0000

You can run kernels, transfer from host to device and forward from device to host at the same time.

http://developer.download.nvidia.com/CUDA/training/StreamsAndConcurrencyWebinar.pdf

CUDA memory transfer at runtime

More articles: