<< Chapter < Page | Chapter >> Page > |
When performing hypotheses tests the appropriate assumptions and conditions need to be met in order for us to use the model.
For a hypothesis test of a single population mean and the population standard deviation is known, performing a z-test the following assumptions and conditions must be met.
When working with numerical data and is unknown, performing a Student's-t distribution (often called a t-test), the assumptions of randomization, independence and the 10% condition must be met. In addition, with small sample sizes we cannot assume that that data follows a normal distribution so we need to check the nearly normally distributed condition. To check the nearly normal condition start by making a histogram or stemplot of the data, it is a good idea to make an outlier boxplot, too. If the sample is small, less than 15 then the data must be normally distributed. If the sample size is moderate, between 15 and 40, then a little skewing in the data will can be tolerated. With large sample sizes, more than 40, we are concerned about multiple peaks (modes) in the data and outliers. The data might not be approximately normal with either of these conditions and you may want to run the test both with and without the outliers to determine the extent of their effect. If there are multiple modes in the data it could be that there are two groups in the data that need to be separated.
When working with categorical data, construct a hypothesis test of a single population proportion , the assumptions of randomization, independence and the 10% condition must be met. In addition, a new assumption, the success/ failure condition , must be checked. When working with proportions we need to be especially concerned about sample size when the proportion is close to zero or one. To check that the sample size is large enough, calculate the success by multiplying the null hypothesized percentage by the sample size and calculate failure by multiplying one minus the null hypothesized percentage by the sample size. If both of these products are larger than ten then the condition is met. :
You are meeting the conditions for a binomial distribution which are there are a certain number of independent trials, the outcomes of anytrial are success or failure, and each trial has the same probability of a success . The shape of the binomial distribution needs to be similar to the shape of the normal distribution. To ensure this, the quantities and must both be greater than ten and . Then the binomial distribution of sample (estimated) proportion can be approximated by the normal distribution with and . Remember that .
Notification Switch
Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?