<< Chapter < Page | Chapter >> Page > |
The sample average and the population mean
Consider the numerical average of the values in the sample . This is an observation of the sample average
The sample sum S n and the sample average A n are random variables. If another observation were made (another sample taken), the observed value of these quantities wouldprobably be different. Now S n and A n are functions of the random variables in the sampling process. As such, they have distributions related to the population distribution (the common distribution of the X i ). According to the central limit theorem, for any reasonable sized sample they should be approximatelynormally distributed. As the examples demonstrating the central limit theorem show, the sample size need not be large in many cases. Now if the population mean is μ and the population variance is σ 2 , then
so that
Herein lies the key to the usefulness of a large sample. The mean of the sample average A n is the same as the population mean, but the variance of the sample average is times the population variance. Thus, for large enough sample, the probability is high that the observed value of the sample average will be close to the population mean . The population standard deviation, as a measure of the variation is reduced by a factor .
Suppose a population has mean μ and variance σ 2 . A sample of size n is to be taken. There are complementary questions:
SOLUTION
Suppose the sample variance is known or can be approximated reasonably. If the sample size n is reasonably large, depending on the population distribution (as seen in the previous demonstrations), then A n is approximately .
We may use the MATLAB function norminv to calculate values of x for various p .
p = [0.8 0.9 0.95 0.98 0.99];x = norminv(0,1,(1+p)/2);
disp([p;x;x.^2]')
0.8000 1.2816 1.64240.9000 1.6449 2.7055
0.9500 1.9600 3.84150.9800 2.3263 5.4119
0.9900 2.5758 6.6349
For , . Use at least 385 or perhaps 400 because of uncertainty about the actual σ 2
The idea of a statistic
As a function of the random variables in the sampling process, the sample average is an example of a statistic.
Definition . A statistic is a function of the class which uses explicitly no unknown parameters of the population.
Notification Switch
Would you like to follow the 'Applied probability' conversation and receive update notifications?