<< Chapter < Page | Chapter >> Page > |
In this example , we derived the maximum likelihood estimate of the mean andvariance of a Gaussian random vector. You might wonder why we chose to estimate the variance rather than the standard deviation . Using the same assumptions provided in the example, let's explore theconsequences of estimating a function of a parameter ( van Trees: Probs 2.4.9, 2.4.10 ).
Assuming that the mean is known, find the maximum likelihood estimates of first the variance, then thestandard deviation.
Are these estimates biased?
Describe how these two estimates are related. Assuming that is a monotonic function, how are and related in general? These results suggest a general question. Consider the problem of estimating somefunction of a parameter , say . The observed quantity is and the conditional density is known. Assume that is a nonrandom parameter.
What are the conditions for an efficient estimate to exist?
What is the lower bound on the variance of the error of any unbiased estimate of ?
Assume an efficient estimate of exists; when can an efficient estimate of some other function exist?
Let the observations consist of statistically independent, identically distributed Gaussian random variables having zero mean butunknown variance. We wish to estimate , their variance.
Find the maximum likelihood estimate and compute the resulting mean-squared error.
Show that this estimate is efficient.
Consider a new estimate given by , where is a constant. Find the value of that minimizes the mean-squared error for . Show that the mean-squared error of is less than that of . Is this result compatible with this previous part ?
Let the observations be of the form where and are statistically independent Gaussian random vectors. The vector has dimension ; the vectors and have dimension .
Derive the minimum mean-squared error estimate of , , from the relationship
Show that this estimate and the optimum linear estimate derived by the Orthogonality Principle are equal.
Find an expression for the mean-squared error when these estimates are used.
To illustrate the power of importance sampling, let's consider a somewhat nave example. Let have a zero-mean Laplacian distribution; we want to employ importance sampling techniques to estimate (despite the fact that we can calculate it easily). Let the density for be Laplacian having mean .
Find the weight that must be applied to each decision based on the variable .
Find the importance sampling gain. Show that this gain means that a fixed number of simulations are needed to achieve a given percentageestimation error (as defined by the coefficient of variation). Express this number as a function of thecriterion value for the coefficient of variation.
Now assume that the density for is Laplacian, but with mean . Optimize by finding the value that maximizes the importance sampling gain.
Suppose we consider an estimate of the parameter having the form , where denotes the vector of the observables and is a linear operator. The quantity is a constant. This estimate is not a linear function of the observables unless . We are interested in finding applications for which it is advantageous to allow . Estimates of this form we term "quasi-linear" .
Show that the optimum (minimum mean-squared error) quasi-linear estimate satisfies for all and where .
Find a general expression for the mean-squared error incurred by the optimum quasi-linear estimate.
Such estimates yield a smaller mean-squared error when the parameter has a nonzero mean. Let be a scalar parameter with mean . The observables comprise a vector having components given by , where are statistically independent Gaussian random variables [ ] independent of . Compute expressions for and . Verify that yields a smaller mean-squared error when .
Notification Switch
Would you like to follow the 'Statistical signal processing' conversation and receive update notifications?