<< Chapter < Page Chapter >> Page >

Statistics as functions of the sampling process

The random variable

W = 1 n i = 1 n ( X i - μ ) 2 , where μ = E [ X ]

is not a statistic, since it uses the unknown parameter μ . However, the following is a statistic.

V n * = 1 n i = 1 n ( X i - A n ) 2 = 1 n i = 1 n X i 2 - A n 2
Got questions? Get instant answers now!

It would appear that V n * might be a reasonable estimate of the population variance. However, the following result shows that a slight modification is desirable.

An estimator for the population variance

The statistic

V n = 1 n - 1 i = 1 n ( X i - A n ) 2

is an estimator for the population variance.

VERIFICATION

Consider the statistic

V n * = 1 n i = 1 n ( X i - A n ) 2 = 1 n i = 1 n X i 2 - A n 2

Noting that E [ X 2 ] = σ 2 + μ 2 , we use the last expression to show

E [ V n * ] = 1 n n ( σ 2 + μ 2 ) - σ 2 n + μ 2 = n - 1 n σ 2

The quantity has a bias in the average. If we consider

V n = n n - 1 V n * = 1 n - 1 i = 1 n ( X i - A n ) 2 , then E [ V n ] = n n - 1 n - 1 n σ 2 = σ 2

The quantity V n with 1 / ( n - 1 ) rather than 1 / n is often called the sample variance to distinguish it from the population variance. If the set of numbers

( t 1 , t 2 , , t N )

represent the complete set of values in a population of N members, the variance for the population would be given by

σ 2 = 1 N i = 1 N t i 2 - 1 N i = 1 N t i 2

Here we use 1 / N rather than 1 / ( N - 1 ) .

Got questions? Get instant answers now!

Since the statistic V n has mean value σ 2 , it seems a reasonable candidate for an estimator of the population variance. If we ask how good is it, we need to considerits variance. As a random variable, it has a variance. An evaluation similar to that for the mean, but more complicated in detail, shows that

Var [ V n ] = 1 n μ 4 - n - 3 n - 1 σ 4 where μ 4 = E [ ( X - μ ) 4 ]

For large n , Var [ V n ] is small, so that V n is a good large-sample estimator for σ 2 .

A sampling demonstration of the clt

Consider a population random variable X uniform [-1, 1]. Then E [ X ] = 0 and Var [ X ] = 1 / 3 . We take 100 samples of size 100, and determine the sample sums. This gives a sample of size 100 of the sample sum random variable S 100 , which has mean zero and variance 100/3.For each observed value of the sample sum random variable, we plot the fraction of observed sums less than or equal to that value. Thisyields an experimental distribution function for S 100 , which is compared with the distribution function for a random variable Y N ( 0 , 100 / 3 ) .

rand('seed',0) % Seeds random number generator for later comparison tappr % Approximation setupEnter matrix [a b] of x-range endpoints [-1 1]Enter number of x approximation points 100 Enter density as a function of t 0.5*(t<=1) Use row matrices X and PX as in the simple case qsample % Creates sample Enter row matrix of VALUES XEnter row matrix of PROBABILITIES PX Sample size n = 10000 % Master sample size 10,000Sample average ex = 0.003746 Approximate population mean E(X) = 1.561e-17Sample variance vx = 0.3344 Approximate population variance V(X) = 0.3333m = 100; a = reshape(T,m,m); % Forms 100 samples of size 100A = sum(a); % Matrix A of sample sums [t,f]= csort(A,ones(1,m)); % Sorts A and determines cumulative p = cumsum(f)/m; % fraction of elements<= each value pg = gaussian(0,100/3,t); % Gaussian dbn for sample sum valuesplot(t,p,'k-',t,pg,'k-.') % Comparative plot % Plotting details (see [link] )
Got questions? Get instant answers now!
Figure one is a graph of two plots, titled, Central limit theorem for sample sums. The horizontal axis is labeled, sample sum values, and the vertical axis is labeled, cumulative fraction. The values on the horizontal axis range from -15 to 20 in increments of 5. The values on the vertical axis range from 0 to 1 in increments of 0.1. There are two captions inside the graph. The first reads, X uniform on [-1 1], and the second reads, E[X] = 0 Var[X] = 1/3. The first plot is a smooth, dashed line, labeled gaussian. The second plot is a wavering, jagged solid line labeled experimental. Both plots follow generally the same shape. They begin in the bottom right at approximately (-12, 0) with a positive slope, and they move to the right, increasing at an increasing rate. At nearly the midpoint in the graph, approximately (0, 0.5), the graphs adjust and begin increasing at a decreasing rate, approaching the top-right corner of the graph while tapering off to a horizontal line. The gaussian, dashed line follows this path's description more accurately, while the solid experimental line seems to be closely fitted to the gaussian line's path with some imperfections causing it to waver jaggedly at a couple spots along the path. Figure one is a graph of two plots, titled, Central limit theorem for sample sums. The horizontal axis is labeled, sample sum values, and the vertical axis is labeled, cumulative fraction. The values on the horizontal axis range from -15 to 20 in increments of 5. The values on the vertical axis range from 0 to 1 in increments of 0.1. There are two captions inside the graph. The first reads, X uniform on [-1 1], and the second reads, E[X] = 0 Var[X] = 1/3. The first plot is a smooth, dashed line, labeled gaussian. The second plot is a wavering, jagged solid line labeled experimental. Both plots follow generally the same shape. They begin in the bottom right at approximately (-12, 0) with a positive slope, and they move to the right, increasing at an increasing rate. At nearly the midpoint in the graph, approximately (0, 0.5), the graphs adjust and begin increasing at a decreasing rate, approaching the top-right corner of the graph while tapering off to a horizontal line. The gaussian, dashed line follows this path's description more accurately, while the solid experimental line seems to be closely fitted to the gaussian line's path with some imperfections causing it to waver jaggedly at a couple spots along the path.
The central limit theorem for sample sums.

Questions & Answers

if three forces F1.f2 .f3 act at a point on a Cartesian plane in the daigram .....so if the question says write down the x and y components ..... I really don't understand
Syamthanda Reply
hey , can you please explain oxidation reaction & redox ?
Boitumelo Reply
hey , can you please explain oxidation reaction and redox ?
Boitumelo
for grade 12 or grade 11?
Sibulele
the value of V1 and V2
Tumelo Reply
advantages of electrons in a circuit
Rethabile Reply
we're do you find electromagnetism past papers
Ntombifuthi
what a normal force
Tholulwazi Reply
it is the force or component of the force that the surface exert on an object incontact with it and which acts perpendicular to the surface
Sihle
what is physics?
Petrus Reply
what is the half reaction of Potassium and chlorine
Anna Reply
how to calculate coefficient of static friction
Lisa Reply
how to calculate static friction
Lisa
How to calculate a current
Tumelo
how to calculate the magnitude of horizontal component of the applied force
Mogano
How to calculate force
Monambi
a structure of a thermocouple used to measure inner temperature
Anna Reply
a fixed gas of a mass is held at standard pressure temperature of 15 degrees Celsius .Calculate the temperature of the gas in Celsius if the pressure is changed to 2×10 to the power 4
Amahle Reply
How is energy being used in bonding?
Raymond Reply
what is acceleration
Syamthanda Reply
a rate of change in velocity of an object whith respect to time
Khuthadzo
how can we find the moment of torque of a circular object
Kidist
Acceleration is a rate of change in velocity.
Justice
t =r×f
Khuthadzo
how to calculate tension by substitution
Precious Reply
hi
Shongi
hi
Leago
use fnet method. how many obects are being calculated ?
Khuthadzo
khuthadzo hii
Hulisani
how to calculate acceleration and tension force
Lungile Reply
you use Fnet equals ma , newtoms second law formula
Masego
please help me with vectors in two dimensions
Mulaudzi Reply
how to calculate normal force
Mulaudzi
Got questions? Join the online conversation and get instant answers!
Jobilize.com Reply

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Applied probability. OpenStax CNX. Aug 31, 2009 Download for free at http://cnx.org/content/col10708/1.6
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Applied probability' conversation and receive update notifications?

Ask