0.10 Implementing FFTs in practice

Y [k] = \sum_{ℓ = 0}^{n - 1} X [ℓ] ω_{n}^{ℓ k},

where $0 \leq k < n$ and $ω_{n} = \exp (- 2 π i / n)$ is a primitive $n$-th root of unity. Implemented directly, [link] would require $Θ (n^{2})$ operations; fast Fourier transforms are $O (n \log n)$ algorithms to compute the same result. The most important FFT (and the one primarily used in FFTW) is known as the “Cooley-Tukey” algorithm, after the two authors who rediscovered and popularized it in 1965 [link] , although it was known as early as 1805 to Gauss and to later re-inventors [link] . The basic idea behind this FFT is that a DFT of a composite size $n = n_{1} n_{2}$ can be re-expressed in terms of smaller DFTs of sizes $n_{1}$ and $n_{2}$: essentially, as a two-dimensional DFT of size $n_{1} \times n_{2}$ where the output is transposed. The choices of factorizations of $n$ , combined with the many different ways to implement the data re-orderings of the transpositions, have led to numerous implementation strategies for the Cooley-Tukey FFT, with many variants distinguished by their own names [link] , [link] . FFTW implements a space of many such variants, as described in "Adaptive Composition of FFT Algorithms" , but here we derive the basic algorithm, identify its key features, and outline some important historical variations and their relation to FFTW.
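As a point of reference for the $Θ (n^{2})$ cost mentioned above, the defining sum can be evaluated directly. The following is a minimal sketch, not code from FFTW; the function name `naive_dft` is an illustrative assumption.

```python
import cmath

def naive_dft(x):
    """Directly evaluate Y[k] = sum_l X[l] * w_n^(l*k), with w_n = exp(-2*pi*i/n).

    Illustrative helper (name is an assumption); costs Theta(n^2) operations.
    """
    n = len(x)
    if n == 0:
        return []
    y = []
    for k in range(n):
        acc = 0j
        for l in range(n):
            # w_n^n == 1, so the exponent can be reduced modulo n.
            acc += x[l] * cmath.exp(-2j * cmath.pi * ((l * k) % n) / n)
        y.append(acc)
    return y
```

Each of the $n$ outputs needs $n$ multiply-adds, which is the quadratic cost that the FFT algorithms discussed here avoid.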

The Cooley-Tukey algorithm can be derived as follows. If $n$ can be factored into $n = n_{1} n_{2}$ , [link] can be rewritten by letting $ℓ = ℓ_{1} n_{2} + ℓ_{2}$ and $k = k_{1} + k_{2} n_{1}$ . We then have:

Y [k_{1} + k_{2} n_{1}] = \sum_{ℓ_{2} = 0}^{n_{2} - 1} \left[ \left( \sum_{ℓ_{1} = 0}^{n_{1} - 1} X [ℓ_{1} n_{2} + ℓ_{2}] ω_{n_{1}}^{ℓ_{1} k_{1}} \right) ω_{n}^{ℓ_{2} k_{1}} \right] ω_{n_{2}}^{ℓ_{2} k_{2}},

where $k_{1, 2} = 0, ..., n_{1, 2} - 1$ . Thus, the algorithm computes $n_{2}$ DFTs of size $n_{1}$ (the inner sum), multiplies the result by the so-called [link] twiddle factors $ω_{n}^{ℓ_{2} k_{1}}$ , and finally computes $n_{1}$ DFTs of size $n_{2}$ (the outer sum). This decomposition is then continued recursively. The literature uses the term radix to describe an $n_{1}$ or $n_{2}$ that is bounded (often constant); the small DFT of the radix is traditionally called a butterfly .
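The three steps just described (inner DFTs, twiddle multiplication, outer DFTs) can be sketched as a short recursive routine. This is only an illustration under stated assumptions: the names `dft`, `smallest_factor`, and `cooley_tukey` are hypothetical, the choice of the smallest prime factor as $n_{1}$ is arbitrary, and prime sizes simply fall back to the direct sum. It is not how FFTW organizes its computation.

```python
import cmath

def dft(x):
    """Direct Theta(n^2) DFT, used here only for prime sizes."""
    n = len(x)
    return [sum(x[l] * cmath.exp(-2j * cmath.pi * ((l * k) % n) / n)
                for l in range(n))
            for k in range(n)]

def smallest_factor(n):
    """Smallest factor of n greater than 1 (returns n itself if n is prime)."""
    f = 2
    while f * f <= n:
        if n % f == 0:
            return f
        f += 1
    return n

def cooley_tukey(x):
    """One Cooley-Tukey step n = n1 * n2, applied recursively (illustrative)."""
    n = len(x)
    if n <= 1:
        return list(x)
    n1 = smallest_factor(n)      # assumption: n1 is the smallest prime factor
    if n1 == n:                  # prime size: no factorization available
        return dft(x)
    n2 = n // n1
    # Step 1: n2 DFTs of size n1 over the strided inputs X[l1*n2 + l2].
    inner = [cooley_tukey(x[l2::n2]) for l2 in range(n2)]   # inner[l2][k1]
    # Step 2: multiply by the twiddle factors w_n^(l2*k1).
    for l2 in range(n2):
        for k1 in range(n1):
            inner[l2][k1] *= cmath.exp(-2j * cmath.pi * l2 * k1 / n)
    # Step 3: n1 DFTs of size n2 over l2; the result goes to Y[k1 + k2*n1].
    y = [0j] * n
    for k1 in range(n1):
        column = cooley_tukey([inner[l2][k1] for l2 in range(n2)])
        for k2 in range(n2):
            y[k1 + k2 * n1] = column[k2]
    return y
```

Note how the inputs are read with stride $n_{2}$ while the outputs are written with stride $n_{1}$; this is the transposition built into the index mapping.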

Many well-known variations are distinguished by the radix alone. A decimation in time ( DIT ) algorithm uses $n_{2}$ as the radix, while a decimation in frequency ( DIF ) algorithm uses $n_{1}$ as the radix. If multiple radices are used, e.g. for $n$ composite but not a prime power, the algorithm is called mixed radix . A peculiar blending of radix 2 and 4 is called split radix , which was proposed to minimize the count of arithmetic operations [link] , [link] , [link] , [link] , [link] , although it has been superseded in this regard [link] , [link] . FFTW implements both DIT and DIF, is mixed-radix with radices that are adapted to the hardware, and often uses much larger radices (e.g. radix 32) than were once common. On the other end of the scale, a “radix” of roughly $\sqrt{n}$ has been called a four-step FFT algorithm (or six-step , depending on how many transposes one performs) [link] ; see "FFTs and the Memory Hierarchy" for some theoretical and practical discussion of this algorithm.
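To make the DIT/DIF distinction concrete, here is a hedged sketch of the radix-2 case for $n$ a power of two. Setting $n_{2} = 2$ gives decimation in time (the inputs are split into even and odd samples), while $n_{1} = 2$ gives decimation in frequency (the outputs are split into even and odd frequencies). The function names `fft_dit2` and `fft_dif2` are illustrative assumptions, and the recursive, out-of-place style is chosen for clarity rather than speed.

```python
import cmath

def _require_power_of_two(n):
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")

def fft_dit2(x):
    """Radix-2 decimation in time: n2 = 2, n1 = n/2 (illustrative)."""
    n = len(x)
    _require_power_of_two(n)
    if n == 1:
        return [complex(x[0])]
    even = fft_dit2(x[0::2])     # l2 = 0: inputs X[2*l1]
    odd = fft_dit2(x[1::2])      # l2 = 1: inputs X[2*l1 + 1]
    half = n // 2
    y = [0j] * n
    for k1 in range(half):
        t = cmath.exp(-2j * cmath.pi * k1 / n) * odd[k1]   # twiddle w_n^k1
        y[k1] = even[k1] + t           # size-2 butterfly, k2 = 0
        y[k1 + half] = even[k1] - t    # size-2 butterfly, k2 = 1
    return y

def fft_dif2(x):
    """Radix-2 decimation in frequency: n1 = 2, n2 = n/2 (illustrative)."""
    n = len(x)
    _require_power_of_two(n)
    if n == 1:
        return [complex(x[0])]
    half = n // 2
    # Size-2 butterflies first (k1 = 0 and k1 = 1), then twiddles w_n^l2.
    sums = [x[l2] + x[l2 + half] for l2 in range(half)]
    diffs = [(x[l2] - x[l2 + half]) * cmath.exp(-2j * cmath.pi * l2 / n)
             for l2 in range(half)]
    y = [0j] * n
    y[0::2] = fft_dif2(sums)     # outputs Y[2*k2]
    y[1::2] = fft_dif2(diffs)    # outputs Y[2*k2 + 1]
    return y
```

In the DIT version the butterflies come after the recursive calls; in the DIF version they come before. Both compute the same transform.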

A key difficulty in implementing the Cooley-Tukey FFT is that the $n_{1}$ dimension corresponds to discontiguous inputs $ℓ_{1}$ in $X$ but contiguous outputs $k_{1}$ in $Y$ , and vice versa for $n_{2}$ . This is a matrix transpose for a single decomposition stage, and the composition of all such transpositions is a (mixed-base) digit-reversal permutation (or bit-reversal , for radix 2). The resulting necessity of discontiguous memory access and data re-ordering hinders efficient use of hierarchical memory architectures (e.g., caches), so that the optimal execution order of an FFT for given hardware is non-obvious, and various approaches have been proposed.
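For radix 2, the bit-reversal permutation can be made explicit with a small sketch: the input is first reordered by reversing the bits of each index, after which all butterfly stages run in place on contiguous blocks. This is one textbook arrangement among the "various approaches" mentioned above, not a description of FFTW's strategy; the names `bit_reverse` and `fft_iterative` are illustrative assumptions.

```python
import cmath

def bit_reverse(i, bits):
    """Reverse the lowest `bits` bits of the index i."""
    r = 0
    for _ in range(bits):
        r = (r << 1) | (i & 1)
        i >>= 1
    return r

def fft_iterative(x):
    """Iterative radix-2 DIT FFT with an explicit bit-reversal pass (illustrative)."""
    n = len(x)
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")
    bits = n.bit_length() - 1
    # Data re-ordering: element i of the work array comes from bit-reversed index.
    a = [complex(x[bit_reverse(i, bits)]) for i in range(n)]
    size = 2
    while size <= n:
        half = size // 2
        w_step = cmath.exp(-2j * cmath.pi / size)
        for start in range(0, n, size):
            w = 1 + 0j
            for j in range(half):
                t = w * a[start + j + half]
                u = a[start + j]
                a[start + j] = u + t          # size-2 butterfly
                a[start + j + half] = u - t
                w *= w_step
        size *= 2
    return a
```

For $n = 8$ the permutation maps indices 0 through 7 to 0, 4, 2, 6, 1, 5, 3, 7. The reordering pass touches memory with a scattered access pattern, which is exactly the cache-unfriendly behavior the paragraph above refers to.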

Source: OpenStax, Fast fourier transforms. OpenStax CNX. Nov 18, 2012 Download for free at http://cnx.org/content/col10550/1.22
