<< Chapter < Page | Chapter >> Page > |
The number of length- observation sequences having a given type approximately equals . The probability that a given sequence has a given type approximately equals , which means that the probability a given sequence has a type not equal to the true distribution decays exponentially with the number of observations. Thus, whilethe coin flip sequences and are equally likely (assuming a fair coin), the second is more typical because its type is closer to the true distribution.
By far the most used technique for estimating the probability distribution of a continuous-valued random variable is the histogram ; more sophisticated techniques are discussed in Silverman . For real-valued data, subdivide the real line into intervals having widths , . These regions are called bins and they should encompass the range of values assumed by the data. For largevalues, the "edge bins" can extend to infinity to catch the overflows. Given observations of a stationary random sequence , , the histogram estimate is formed by simply forming a type from the number of these observations that fall into the bin and dividing by the binwidth . The histogram estimate resembles a rectangular approximation to the density. Unless the underlying density has the sameform (a rare event), the histogram estimate does not converge to the true density as the number of observations grows. Presumably, the value of the histogram at each bin convergesto the probability that the observations lie in that bin. To demonstrate this intuitive feeling, we compactly denote the histogram estimate by using indicator functions . An indicator function for the bin equals one if the observation lies in the bin and is zero otherwise. The estimate is simply the average of the indicator functions across theobservations. The expected value of is simply the probability that the observation lies in the bin. Thus, the expected value of each histogram value equals the integral of the actual density over the bin, showing thatthe histogram is an unbiased estimate of this integral. Convergence can be tested by computing the variance of theestimate. The variance of one bin in the histogram is given by To simplify this expression, the correlation between the observations must be specified. If the values arestatistically independent (we have white noise), each term in the sum becomes zero and the variance is given by . Thus, the variance tends to zero as and the histogram estimate is consistent, converging to . If the observations are not white, convergence becomes problematical. Assume, for example, that and are correlated in a first-order, geometric fashion. The variance does increase with this presumed correlation until, at the extreme ( ), the variance is a constant independent of ! In summary, if the observations are mutually correlated and the histogramestimate converges, the estimate converges to the proper value but more slowly than if the observations were white. Theestimate may not converge if the observations are heavily dependent from index to index. This type of dependencestructure occurs when the power spectrum of the observations is lowpass with an extremely low cutoff frequency.
Notification Switch
Would you like to follow the 'Statistical signal processing' conversation and receive update notifications?