<< Chapter < Page | Chapter >> Page > |
We have found that by using the split nesting algorithm for circular convolution a new set of efficient prime length DFT modules that covera wide variety of lengths can be developed. We have also exploited the structure in the split nesting algorithmto write a program that automatically generates compact readable code for convolution and prime length FFT programs.
The resulting code makes clear the organization and structure of the algorithm and clearly enumerates the disjoint convolutions into whichthe problem is decomposed. These independent convolutions can be executed in parallel and, moreover, theindividual commands are of the form which can be executed as parallel/vector commands on appropriate computer architectures [link] . By recognizing also that the algorithms for different lengths share many ofthe same computational structures, the code we generate is made up of calls to a relatively small set of functions.Accordingly, the subroutines can be designed to specifically suit a given architecture.
The number of additions and multiplications incurred by the programs we have generated are the same as or are competitive with existing prime length FFT programs.We note that previously, prime length FFTs were made available for primes only up to 29.As in the original Winograd short convolution algorithms, the efficiency of the resulting prime point DFT algorithm depends largely upon the factorability of . For example, if is two times a prime, then an efficient point DFT algorithm is more difficult to develop.
It should be noted too that the programs for convolution developed above are useful in the convolution of long integer sequences when exact results are needed.This is because all multiplicative constants in an point integer convolution are integer multiples of and this division by can be delayed until the last stage or can simply be omitted if a scaled versionof the convolution is acceptable.
By developing a large library of prime point FFT programs we can extend the maximum length and the variety of lengths of a prime factor algorithm ora Winograd Fourier transform algorithm. Furthermore, because the approach taken in this paper gives a bilinear form,it can be incorporated into the dynamic programming technique for designing optimal composite length FFT algorithms [link] . The programs described in this paper can also be adapted to obtaindiscrete cosine transform (DCT) algorithms by simply permuting the input and output sequences [link] .
Notification Switch
Would you like to follow the 'Automatic generation of prime length fft programs' conversation and receive update notifications?