Is cudaHostRegister equivalent to the mlock() system call?

Page-locked (pinned) host memory transfers to the GPU faster than pageable memory. CUDA provides the calls cudaHostAlloc and cudaHostRegister to allocate or register page-locked memory. On a memory transfer, the NVIDIA driver then checks whether the host memory is page-locked and takes a different code path for the copy.

Is it possible to do the page locking with the mlock() system call and get exactly the same effect (with respect to transfer speed) as with cudaHostRegister? Or do the corresponding CUDA calls update an internal database that the driver queries?
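
To make the mlock() side of the question concrete, here is a minimal sketch in plain C of what "locking the pages myself" would look like on a POSIX system. The helper names (memlock_soft_limit, alloc_locked_page, free_locked_page) are my own and purely illustrative. It only shows the OS-level locking and the RLIMIT_MEMLOCK limit that applies to it; it does not involve the CUDA driver, so whether the driver treats such a buffer as pinned is exactly what I am asking.

```c
#include <errno.h>
#include <stdlib.h>
#include <string.h>
#include <sys/mman.h>
#include <sys/resource.h>
#include <unistd.h>

/* Soft RLIMIT_MEMLOCK limit in bytes (may be RLIM_INFINITY); 0 if the query fails. */
static rlim_t memlock_soft_limit(void)
{
    struct rlimit lim;
    if (getrlimit(RLIMIT_MEMLOCK, &lim) != 0)
        return 0;
    return lim.rlim_cur;
}

/* Allocate one page-aligned page and try to lock it into RAM with mlock().
 * Returns the buffer (or NULL if allocation failed). *locked is 1 if mlock()
 * succeeded; otherwise it is 0 and *err holds the errno reported by mlock(),
 * typically ENOMEM or EPERM when RLIMIT_MEMLOCK is too small. */
static void *alloc_locked_page(size_t *len_out, int *locked, int *err)
{
    long page = sysconf(_SC_PAGESIZE);
    void *buf = NULL;

    if (page <= 0)
        return NULL;
    if (posix_memalign(&buf, (size_t)page, (size_t)page) != 0)
        return NULL;

    memset(buf, 0, (size_t)page);   /* touch the page so it is actually backed */
    *len_out = (size_t)page;

    if (mlock(buf, (size_t)page) == 0) {
        *locked = 1;
        *err = 0;
    } else {
        *locked = 0;
        *err = errno;
    }
    /* A CUDA program would instead call
     * cudaHostRegister(buf, len, cudaHostRegisterDefault) on this buffer. */
    return buf;
}

/* Unlock (if it was locked) and release the buffer. */
static void free_locked_page(void *buf, size_t len, int locked)
{
    if (locked)
        munlock(buf, len);
    free(buf);
}
```

A caller would check memlock_soft_limit() first, call alloc_locked_page(), and then pass the buffer to the copy routine. Note that mlock() can fail for an unprivileged process once the locked total exceeds RLIMIT_MEMLOCK.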

+3
2 answers

, NVIDIA , cudaHostAlloc .. mlock , , , RLIMIT_MEMLOCK, . , NVIDIA . , , .

, cudaHostRegister mlock() , , , . , cudaMemcpy .

+2

. cuMemHostRegister() , (, , ), . , , .

+1
