CUDA 5 Dynamic Parallelism Examples

I've been googling around and was only able to find a trivial example of the new dynamic parallelism in Compute Capability 3.0, in one of NVIDIA's technical briefs linked from here. I know that the HPC-specific cards will probably not be available until this time next year (after the national labs get theirs). And yes, I realize that the simple example they gave is enough to get you going, but the more the merrier.

Are there any other examples that I missed?

To save you the trouble, here is the whole example provided in the technical overview:

__global__ void ChildKernel(void* data){
    //Operate on data
}
__global__ void ParentKernel(void *data){
    ChildKernel<<<16, 1>>>(data);
}
// In Host Code
ParentKernel<<<256, 64>>>(data);

// Recursion is also supported
// (continueRecursion stands for an application-defined stopping condition)
__global__ void RecursiveKernel(void* data){
    if(continueRecursion == true)
        RecursiveKernel<<<64, 16>>>(data);
}
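For readers who want something they can actually compile, here is a minimal sketch of the same parent/child pattern as a complete program. The kernel bodies, the `runParentChild` helper, and the array size are illustrative assumptions, not part of the technical brief. It assumes a device of compute capability 3.5 or later and a build with relocatable device code, for example `nvcc -arch=sm_35 -rdc=true test.cu -lcudadevrt`.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Child grid: each thread doubles one element (illustrative workload).
__global__ void ChildKernel(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2;
}

// Parent grid: a single thread launches the child grid from the device.
__global__ void ParentKernel(int *data, int n) {
    if (blockIdx.x == 0 && threadIdx.x == 0) {
        int threads = 64;
        int blocks = (n + threads - 1) / threads;
        ChildKernel<<<blocks, threads>>>(data, n);
    }
}

// Hypothetical helper: copies host data to the device, runs the parent
// kernel, and copies the result back. Returns true on success.
bool runParentChild(int *host, int n) {
    int *dev = NULL;
    size_t bytes = n * sizeof(int);
    if (cudaMalloc(&dev, bytes) != cudaSuccess) return false;

    cudaError_t err = cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice);
    if (err == cudaSuccess) {
        ParentKernel<<<1, 32>>>(dev, n);
        err = cudaGetLastError();
    }
    // The parent grid is not complete until its child grids have finished,
    // so a host-side synchronize waits for both.
    if (err == cudaSuccess) err = cudaDeviceSynchronize();
    if (err == cudaSuccess)
        err = cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);

    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return false;
    }
    return true;
}

int main() {
    const int n = 256;
    int host[n];
    for (int i = 0; i < n; ++i) host[i] = i;

    if (!runParentChild(host, n)) return 1;
    printf("host[10] = %d\n", host[10]);
    return 0;
}
```

Only one parent thread launches the child here; if every parent thread launched its own child grid, the data would be doubled once per launch.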

EDIT: There is a GTC talk that covers the new dynamic parallelism in CUDA 5.

+5

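To confirm, dynamic parallelism is only supported on GPUs with compute capability 3.5 or higher.

I have a compute capability 3.0 GPU with CUDA 5.0 installed. I compiled a dynamic parallelism example with nvcc -arch=sm_30 test.cu and got the following compile error:

test.cu(10): error: calling a __global__ function("child_launch") from a __global__ function("parent_launch") is only allowed on the compute_35 architecture or above

GPU info:

Device 0: "GeForce GT 640"
  CUDA Driver Version / Runtime Version: 5.0 / 5.0
  CUDA Capability Major/Minor version number: 3.0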
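The same check can be made at run time before attempting a device-side launch. The following is a minimal sketch; the helper name `supportsDynamicParallelism` is made up for illustration, and the 3.5 threshold is taken from the compiler error above.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: dynamic parallelism needs compute capability >= 3.5.
bool supportsDynamicParallelism(int major, int minor) {
    return major > 3 || (major == 3 && minor >= 5);
}

int main() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count == 0) {
        printf("No CUDA device found\n");
        return 1;
    }
    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) continue;
        printf("Device %d: \"%s\", capability %d.%d, dynamic parallelism: %s\n",
               dev, prop.name, prop.major, prop.minor,
               supportsDynamicParallelism(prop.major, prop.minor) ? "yes" : "no");
    }
    return 0;
}
```

On the GeForce GT 640 described above (capability 3.0), the helper would return false.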


+5

Regarding "... CUDA 5...": Dynamic Parallelism is new in CUDA 5, not CUDA 4. CUDA 5.0 supports Dynamic Parallelism, but only on Compute Capability 3.5 and later (for example, GK110).

+4

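I think compute capability 3.0 does not include dynamic parallelism. It will come with the GK110 architecture, and I don't know which compute capability number it will be assigned (3.1, perhaps). As far as I know, 3.0 corresponds to the GK104 chips, such as the GTX 690 or the GT 640M.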

0

I just wanted to follow up, given that the CUDA 5 RC was recently released. I looked through the SDK examples and could not find any dynamic parallelism there. Someone correct me if I am wrong. I searched for kernel launches inside kernels by grepping for "<<<" and found nothing.

0