<< Chapter < Page Chapter >> Page >

Before examining a general approximation procedure which has significant consequences for later treatments, we consider some illustrative examples.

Simple approximation to poisson

A random variable with the Poisson distribution is unbounded. However, for a given parameter value μ , the probability for k n , n sufficiently large, is negligible. Experimentindicates n = μ + 6 μ (i.e., six standard deviations beyond the mean) is a reasonable value for 5 μ 200 .

>>mu = [5 10 20 30 40 50 70 100 150 200];>>K = zeros(1,length(mu));>>p = zeros(1,length(mu));>>for i = 1:length(mu) K(i) = floor(mu(i)+ 6*sqrt(mu(i)));p(i) = cpoisson(mu(i),K(i)); end>>disp([mu;K;p*1e6]')5.0000 18.0000 5.4163 % Residual probabilities are 0.000001 10.0000 28.0000 2.2535 % times the numbers in the last column.20.0000 46.0000 0.4540 % K is the value of k needed to achieve 30.0000 62.0000 0.2140 % the residual shown.40.0000 77.0000 0.1354 50.0000 92.0000 0.066870.0000 120.0000 0.0359 100.0000 160.0000 0.0205150.0000 223.0000 0.0159 200.0000 284.0000 0.0133
Got questions? Get instant answers now!

An m-procedure for discrete approximation

If X is bounded, absolutely continuous with density functon f X , the m-procedure tappr sets up the distribution for an approximating simple random variable. An interval containing the range of X is divided into a specified number of equal subdivisions. The probability mass for each subinterval is assigned to the midpoint. If d x is the length of the subintervals, then the integral of the density function over the subintervalis approximated by f X ( t i ) d x . where t i is the midpoint. In effect, the graph of the density over the subinterval is approximated by a rectangle of length d x and height f X ( t i ) . Once the approximating simple distribution is established, calculations are carried out as for simple random variables.

A numerical example

Suppose f X ( t ) = 3 t 2 , 0 t 1 . Determine P ( 0 . 2 X 0 . 9 ) .

SOLUTION

In this case, an analytical solution is easy. F X ( t ) = t 3 on the interval [ 0 , 1 ] , so

P = 0 . 9 3 - 0 . 2 3 = 0 . 7210 . We use tappr as follows:

>>tappr Enter matrix [a b]of x-range endpoints [0 1] Enter number of x approximation points 200Enter density as a function of t 3*t.^2 Use row matrices X and PX as in the simple case>>M = (X>= 0.2)&(X<= 0.9);>>p = M*PX' p = 0.7210
Got questions? Get instant answers now!

Because of the regularity of the density and the number of approximation points, the result agrees quite well with the theoretical value.

The next example is a more complex one. In particular, the distribution is not bounded. However, it is easy to determine a bound beyond which the probability is negligible.

A graph of a Distribution figure. The x-axis displays the values for t ranging from 0-8 while the y-axis represents the values of u=F(t) ranging from 0-1. The plotted distribution rises gradually at first but at (4,0.3) the slope changes to be nearly vertical, and at around (5,0.95) the slope begins to plateau. A graph of a Distribution figure. The x-axis displays the values for t ranging from 0-8 while the y-axis represents the values of u=F(t) ranging from 0-1. The plotted distribution rises gradually at first but at (4,0.3) the slope changes to be nearly vertical, and at around (5,0.95) the slope begins to plateau.
Distribution function for [link] .

Radial tire mileage

The life (in miles) of a certain brand of radial tires may be represented by a random variable X with density

f X ( t ) = t 2 / a 3 for 0 t < a ( b / a ) e - k ( t - a ) for a t

where a = 40 , 000 , b = 20 / 3 , and k = 1 / 4000 . Determine P ( X 45 , 000 ) .

>>a = 40000;>>b = 20/3;>>k = 1/4000;>>% Test shows cutoff point of 80000 should be satisfactory>>tappr Enter matrix [a b]of x-range endpoints [0 80000] Enter number of x approximation points 80000/20Enter density as a function of t (t.^2/a^3).*(t<40000) + ... (b/a)*exp(k*(a-t)).*(t>= 40000) Use row matrices X and PX as in the simple case>>P = (X>= 45000)*PX' P = 0.1910 % Theoretical value = (2/3)exp(-5/4) = 0.191003>>cdbn Enter row matrix of VALUES XEnter row matrix of PROBABILITIES PX % See [link] for plot

In this case, we use a rather large number of approximation points. As a consequence, the results are quite accurate. In the single-variable case,designating a large number of approximating points usually causes no computer memory problem.

Got questions? Get instant answers now!

The general approximation procedure

We show now that any bounded real random variable may be approximated as closely as desired by a simple random variable (i.e., one having a finite set of possible values). For the unboundedcase, the approximation is close except in a portion of the range having arbitrarily small total probability.

We limit our discussion to the bounded case, in which the range of X is limited to a bounded interval I = [ a , b ] . Suppose I is partitioned into n subintervals by points t i , 1 i n - 1 , with a = t 0 and b = t n . Let M i = [ t i - 1 , t i ) be the i th subinterval, 1 i n - 1 and M n = [ t n - 1 , t n ] (see [link] ). Now random variable X may map into any point in the interval, and hence into any point in each subinterval M i . Let E i = X - 1 ( M i ) be the set of points mapped into M i by X . Then the E i form a partition of the basic space Ω . For the given subdivision, we form a simple random variable X s as follows. In each subinterval, pick a point s i , t i - 1 s i < t i . Consider the simple random variable X s = i = 1 n s i I E i .

Partition of the interval I including the range of X
Refinement of the partition by additional subdividion points.

This random variable is in canonical form. If ω E i , then X ( ω ) M i and X s ( ω ) = s i . Now the absolute value of the difference satisfies

| X ( ω ) - X s ( ω ) | < t i - t i - 1 the length of subinterval M i

Since this is true for each ω and the corresponding subinterval, we have the important fact

| X ( ω ) - X s ( ω ) | < maximum length of the M i

By making the subintervals small enough by increasing the number of subdivision points, we can make the difference as small as we please.

While the choice of the s i is arbitrary in each M i , the selection of s i = t i - 1 (the left-hand endpoint) leads to the property X s ( ω ) X ( ω ) ω . In this case, if we add subdivision points to decrease the size of some or all of the M i , the new simple approximation Y s satisfies

X s ( ω ) Y s ( ω ) X ( ω ) ω

To see this, consider t i * M i (see [link] ). M i is partitioned into M i ' M i ' ' and E i is partitioned into E i ' E i ' ' . X maps E i ' into M i ' and E i ' ' into M i ' ' . Y s maps E i ' into t i and maps E i ' ' into t i * > t i . X s maps both E i ' and E i ' ' into t i . Thus, the asserted inequality must hold for each ω By taking a sequence of partitions in which each succeeding partition refines the previous (i.e. adds subdivision points) in such a way that themaximum length of subinterval goes to zero, we may form a nondecreasing sequence of simple random variables X n which increase to X for each ω .

The latter result may be extended to random variables unbounded above. Simply let N th set of subdivision points extend from a to N , making the last subinterval [ N , ) . Subintervals from a to N are made increasingly shorter. The result is a nondecreasing sequence { X N : 1 N } of simple random variables, with X N ( ω ) X ( ω ) as N , for each ω Ω .

For probability calculations, we simply select an interval I large enough that the probability outside I is negligible and use a simple approximation over I .

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Applied probability. OpenStax CNX. Aug 31, 2009 Download for free at http://cnx.org/content/col10708/1.6
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Applied probability' conversation and receive update notifications?

Ask