<< Chapter < Page | Chapter >> Page > |
Suppose is piecewise Lipschitz and ia a piecewise constant.
where is a constant equal to average of on right and left side of discontinuity in this interval.
where is the width of the interval. Notice this rate is quite slow.
This problem naturally suggests the following remedy: use very small intervals near discontinuities and larger intervals insmooth regions. Specifically, suppose we use intervals of width to contain the discontinuities and the intervals ofwidth elsewhere. Then accordingly piecewise polynomial approximation satisfies
We can accomplish this need for "adaptive resolution" or "multiresolution" using recursive partitions and trees.
We discussed this idea already in our examination of classification trees. Here is the basic idea again, graphically.
Consider a function that contains no more than m points of discontinuity, and is away from these points.
LemmaConsider a complete RDP with n intervals, then there exists anassociated pruned RDP with intervals, such that an associated piecewise degree polynomial approximation , has a squared approximation error of .
Assume . Divide into intervals. If is smooth on a particular interval , then
In intervals that contain a discontinuity, recursively subdivide into two until the discontinuity is contained in an interval ofwidth . This process results in at most addition subintervals per discontinuity, and the squared approximationerror is on all of them accept the intervals of width containing the discontinuities where the error is at each point.
Thus, the overall squared norm is
and there are at most intervals in the partition. Since k>m, we can upperbound the number of intervals by .
Note that if the initial complete RDP has intervals, then the squared error is .
Thus, we only incur a factor of additional leafs and achieve the same overall approximation error as in the case. We will see that this is a small price to pay in order to handle not only smooth functions, but alsopiecewise smooth functions.
Let ; .
A wavelet approximation is a series of the form
where is a constant ,
and the basis functions are orthonormal, oscillatory signals, each with an associated scale and position . is called the wavelet at scale and position .
Suppose is piecewise constant with at most discontinuities. Let
Then, has at most non-zero wavelet coefficients; i.e., for all but terms, since at most one Haar Wavelet at each scale senses each point of discontinuity. Said another way, allbut at most of the wavelets at each scale have support over constant regions of .
itself will be piecewise constant with discontinuities only possible occurring at end points of the intervals . Therefore, in this case
Daubechies wavelets are the extension of the Haar wavelet idea. Haar wavelets have one "vanishing moment":
Daubechies wavelets are "smoother" basis functions with extra vanishing moments. The Daubechies- wavelet has vanishing moments.
The Daubechies-1 wavelet is just the Haar case.
If is a piecewise degree polynomial with at most m pieces, then using the Daubechies- wavelet system.
and
has at most non-zero wavelet coefficients. is called the Discrete Wavelet Transform (DWT) approximation of . The key idea is the same as we saw with trees.
We can also use DWT's to analyze and represent discrete, sampled functions. Suppose,
then we can write as
where
is a discrete time analog of the continuous time wavelets we considered before. In particular,
for the Daubechies- discrete wavelets.
Thus, we also have an analogous approximation result: If are samples from a piecewise degree polynomial function with a finite number of discontinuities, then has non-zero wavelet coefficients.
Suppose and has a finite number of discontinuities. Let denote piecewise degree- polynomial approximation to with pieces; a uniform partition into equal length intervals followed by addition splits at the points of discontinuity.
Then
and has non-zero coefficients according to our previous analysis.
Suppose is a 2-D image that is piecewise polynomial:
A pruned RDP of squares decorated with polyfits gives
Let sample range.
then
error on of the pixels, near zero elsewhere. The DWT of has non-zero wavelet coefficients. at scale
Notification Switch
Would you like to follow the 'Statistical learning theory' conversation and receive update notifications?