<< Chapter < Page | Chapter >> Page > |
The graph of a continuous probability distribution is a curve. Probability is represented by area under the curve. We have already met this concept when we developed relative frequencies with histograms in Chapter 2 . The relative area for a range of values was the probability of drawing at random an observation in that group. Again with the Poisson distribution in Chapter 4 , the graph in Example 4.14 used boxes to represent the probability of specific values of the random variable. In this case, we were being a bit casual because the random variables of a Poisson distribution are discrete, whole numbers, and a box has width. Notice that the horizontal axis, the random variable x, purposefully did not mark the points along the axis. The probability of a specific value of a continuous random variable will be zero because the area under a point is zero. Probability is area.
The curve is called the probability density function (abbreviated as pdf ). We use the symbol f ( x ) to represent the curve. f ( x ) is the function that corresponds to the graph; we use the density function f ( x ) to draw the graph of the probability distribution.
Area under the curve is given by a different function called the cumulative distribution function (abbreviated as cdf ). The cumulative distribution function is used to evaluate probability as area. Mathematically, the cumulative probability density function is the integral of the pdf, and the probability between two values of a continuous random variable will be the integral of the pdf between these two values: the area under the curve between these values. Remember that the area under the pdf for all possible values of the random variable is one, certainty. Probability thus can be seen as the relative percent of certainty between the two values of interest.
We will find the area that represents probability by using geometry, formulas, technology, or probability tables. In general, integral calculus is needed to find the area under the curve for many probability density functions. When we use formulas to find the area in this textbook, the formulas were found by using the techniques of integral calculus.
There are many continuous probability distributions. When using a continuous probability distribution to model probability, the distribution used is selected to model and fit the particular situation in the best way.
Notification Switch
Would you like to follow the 'Introductory statistics' conversation and receive update notifications?