<< Chapter < Page Chapter >> Page >

When performing hypotheses tests the appropriate assumptions and conditions need to be met in order for us to use the model.

For a hypothesis test of a single population mean μ and the population standard deviation is known, performing a z-test the following assumptions and conditions must be met.

  • Randomization Condition: The data must be sampled randomly. Is one of the good sampling methodologies discussed in the Sampling and Data chapter being used?
  • Independence Assumption: The sample values must be independent of each other. This means that the occurrence of one event has no influence on the next event. Usually, if we know that people or items were selected randomly we can assume that the independence assumption is met.
  • 10% Condition: When the sample is drawn without replacement (usually the case), the sample size, n, should be no more than 10% of the population.
  • Large Enough Sample Condition: The sample size must be sufficiently large. Although the Central Limit Theorem tells us that we can use a Normal model to think about the behavior of sample means when the sample size is large enough, it does not tell us how large that should be. If the population is very skewed, you will need a pretty large sample size to use the CLT, however if the population is unimodal and symmetric, even small samples are ok. So think about your sample size in terms of what you know about the population and decide whether the sample is large enough. In general a sample size of 30 is considered sufficient.

When working with numerical data and σ is unknown, performing a Student's-t distribution (often called a t-test), the assumptions of randomization, independence and the 10% condition must be met. In addition, with small sample sizes we cannot assume that that data follows a normal distribution so we need to check the nearly normally distributed condition. To check the nearly normal condition start by making a histogram or stemplot of the data, it is a good idea to make an outlier boxplot, too. If the sample is small, less than 15 then the data must be normally distributed. If the sample size is moderate, between 15 and 40, then a little skewing in the data will can be tolerated. With large sample sizes, more than 40, we are concerned about multiple peaks (modes) in the data and outliers. The data might not be approximately normal with either of these conditions and you may want to run the test both with and without the outliers to determine the extent of their effect. If there are multiple modes in the data it could be that there are two groups in the data that need to be separated.

When working with categorical data, construct a hypothesis test of a single population proportion p , the assumptions of randomization, independence and the 10% condition must be met. In addition, a new assumption, the success/ failure condition , must be checked. When working with proportions we need to be especially concerned about sample size when the proportion is close to zero or one. To check that the sample size is large enough, calculate the success by multiplying the null hypothesized percentage by the sample size and calculate failure by multiplying one minus the null hypothesized percentage by the sample size. If both of these products are larger than ten then the condition is met. H o : p = p o

You are meeting the conditions for a binomial distribution which are there are a certain number n of independent trials, the outcomes of anytrial are success or failure, and each trial has the same probability of a success p . The shape of the binomial distribution needs to be similar to the shape of the normal distribution. To ensure this, the quantities n p and n (1-p) must both be greater than ten n p > 10 and n ( 1-p ) > 10 . Then the binomial distribution of sample (estimated) proportion can be approximated by the normal distribution with μ = p and σ = p q n . Remember that q = 1 - p .

Practice Key Terms 4

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics using spreadsheets. OpenStax CNX. Jan 05, 2016 Download for free at http://legacy.cnx.org/content/col11521/1.23
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?

Ask