<< Chapter < Page Chapter >> Page >

In the margin of error formula, the sample proportions p ̂ and q ̂ are estimates of the unknown population proportions p and q . The estimated proportions p ̂ and q ̂ are used because p and q are not known. p ̂ and q ̂ are calculated from the data. p ̂ is the estimated proportion of successes. q ̂ is the estimated proportion of failures.

The confidence interval can only be used if the number of successes n p ̂ and the number of failures n q ̂ are both larger than 10.

For the normal distribution of proportions, the z-score formula is as follows.

If p ̂ ~ N ( p ̂ p ̂ q ̂ n ) then the z-score formula is z = p ̂ - p p ̂ q ̂ n

Suppose that a market research firm is hired to estimate the percent of adults living in a large city who have cell phones. 500 randomly selected adult residents in this city are surveyed to determine whether they have cell phones. Of the 500 people surveyed, 421 responded yes - they own cell phones. Using a 95% confidence level, compute a confidence interval estimate for the trueproportion of adults residents of this city who have cell phones.

    Solution

  • Start by discussing the assumptions and conditions that support your model.
  • The solution is step-by-step.

Assumptions and Conditions:

  • Randomization – It is stated in the problem that the respondents were selected randomly.
  • independence – when a random sample is taken from the population independence is plausable.
  • 10% condition – the sample size of 500 is less than 10% all adult residents in the large city.
  • Success/ Failure – p ̂ n>10 (1- p ̂ ) n>10 0.842(500) = 241>10 and 1-0.842)(500)=79>10 where p ̂ is the sample percentage and n is the sample size. Both products are larger than 10, so the success/ failure condition is met.
X = the number of people in the sample who have cell phones. X is binomial. X ~ B ( 500 , 421 500 ) .

To calculate the confidence interval, you must find p ̂ , q ̂ , and ME .

n = 500 x = the number of successes = 421

p ̂ = x n = 421 500 = 0.842

p ̂ = 0.842 is the sample proportion; this is the point estimate of the population proportion.

q ̂ = 1 - p ̂ = 1 - 0.842 = 0.158

Since CL = 0.95 , then α = 1 - CL = 1 - 0.95 = 0.05 α 2 = 0.025 .

Then z α 2 = z .025 = 1.96

ME = z α 2 p ̂ q ̂ n = 1.96 ( 0.842 ) ( 0.158 ) 500 = 0.032

p ̂ - ME = 0.842 - 0.032 = 0.81

p ̂ + ME = 0.842 + 0.032 = 0.874

The confidence interval for the true binomial population proportion is ( p ̂ - ME , p ̂ + ME ) = ( 0.810 , 0.874 ) .

Interpretation

We estimate with 95% confidence that between 81% and 87.4% of all adult residents of this city have cell phones.

Explanation of 95% confidence level

95% of the confidence intervals constructed in this way would contain the true value for the population proportion of all adult residents of this city who have cell phones.

For a class project, a political science student at a large university wants to estimate the percent of students that are registered voters. He surveys 500students and finds that 300 are registered voters. Compute a 90% confidence interval for the true percent of students that are registered voters and interpret the confidenceinterval.

  • Start by discussing the assumptions and conditions that support your model.
  • The solution is step-by-step.

Solution a

Assumptions and Conditions:
  • Randomization – no information is given in the problem about the sampling methodology.
  • independence – wsince we do not know how were respondents were chosen we do not know if responses are independent.
  • 10% condition – the sample size of 500 is less than 10% all adult residents in the large city.
  • Success/ Failure – p ̂ n>10 (1- p ̂ ) n>10 0.60(500) = 300>10 and 1-0.60)(500)=200>10 where p ̂ is the sample percentage and n is the sample size. Both products are larger than 10, so the success/ failure condition is met.(When assumptions and conditions are not met we note that in our work, but continue on with the problem.)
Let X = the number of people in the sample who have cell phones. X is binomial. X ~ B ( 500 , 421 500 ) . x = 300 and n = 500 .

p ̂ = x n = 300 500 = 0.600

q ̂ = 1 - p ̂ = 1 - 0.600 = 0.400

Since CL = 0.90 , then α = 1 - CL = 1 - 0.90 = 0.10 α 2 = 0.05 .

z α 2 = z .05 = 1.645

Use the the standard normal probability table to find z .05 . Remember that the area to the right of z .05 is 0.05 and the area to the left of z .05 is 0.95.

ME = z α 2 p ̂ q ̂ n = 1.645 ( 0.60 ) ( 0.40 ) 500 = 0.036

p ̂ - ME = 0.60 - 0.036 = 0.564

p ̂ + ME = 0.60 + 0.036 = 0.636

The confidence interval for the true binomial population proportion is ( p ̂ - ME , p ̂ + ME ) = ( 0.564 , 0.636 ) .

    Interpretation:

  • We estimate with 90% confidence that the true percent of all students that are registered voters is between 56.4% and 63.6%.
  • Alternate Wording: We estimate with 90% confidence that between 56.4% and 63.6% of ALL students are registered voters.

Explanation of 90% confidence level

90% of all confidence intervals constructed in this way contain the true value for the population percent of students that are registered voters.

Calculating the sample size n

If researchers desire a specific margin of error, then they can use the error bound formula to calculate the required sample size.

The error bound formula for a population proportion is

  • ME = z α 2 p ̂ q ̂ n
  • Solving for n gives you an equation for the sample size.
  • n = z α 2 2 p ̂ q ̂ ME 2

Suppose a mobile phone company wants to determine the current percentage of customers aged 50+ that use text messaging on their cell phone. How many customers aged 50+ should the company survey in order to be 90% confident that the estimated (sample) proportion is within 3 percentage points of the true population proportion of customers aged 50+ that use text messaging on their cell phone.

Solution

From the problem, we know that ME=0.03 (3%=0.03) and

z α 2 = z .05 = 1.645 because the confidence level is 90%

However, in order to find n , we need to know the estimated (sample) proportion p'. Remember that q'=1-p'. But, we do not know p' yet. Since we multiply p' and q' together, we make them both equal to 0.5 because p'q'= (.5)(.5)=.25 results in the largest possible product. (Try other products: (.6)(.4)=.24; (.3)(.7)=.21; (.2)(.8)=.16 and so on). The largest possible product gives us the largest n. This gives us a large enough sample so that we can be 90% confident that we are within 3 percentage points of the true population proportion. To calculate the sample size n, use the formula and make the substitutions.

n z 2 p ̂ q ̂ ME 2 gives n 1.645 2 (.5) (.5) .03 2 =751.7

Round the answer to the next higher value. The sample size should be 752 cell phone customers aged 50+ in order to be 90% confident that the estimated (sample) proportion is within 3 percentage points of the true population proportion of all customers aged 50+ that use text messaging on their cell phone.

**With contributions from Roberta Bloom.

Practice Key Terms 5

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics using spreadsheets. OpenStax CNX. Jan 05, 2016 Download for free at http://legacy.cnx.org/content/col11521/1.23
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?

Ask