I don’t think that they use the Cooley-Taki algorithm because the index permutation phase makes it not very convenient for shared memory architectures. In addition, this algorithm works with power memory targets, which are also not suitable for memory collaboration. Most likely, they use some formulation of the Stockham FFT self-test: for example, the Bailey algorithm .
, , FFT , . , 512 1024- FFT ( , ) 128 . , radix-2 - . radix-8 radix-16, "" . , this "" .
user1545642