<< Chapter < Page | Chapter >> Page > |
Here are some facts about the F distribution.
Data from a fourth grade classroom in 1994 in a private K – 12 school in San Jose, CA.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets: Data for Fruitfly Fecundity. London: Chapman&Hall, 1994.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets. London: Chapman&Hall, 1994, pg. 50.
Hand, D.J., F. Daly, A.D. Lunn, K.J. McConway, and E. Ostrowski. A Handbook of Small Datasets. London: Chapman&Hall, 1994, pg. 118.
“MLB Standings – 2012.” Available online at http://espn.go.com/mlb/standings/_/year/2012.
Mackowiak, P. A., Wasserman, S. S., and Levine, M. M. (1992), "A Critical Appraisal of 98.6 Degrees F, the Upper Limit of the Normal Body Temperature, and Other Legacies of Carl Reinhold August Wunderlich," Journal of the American Medical Association , 268, 1578-1580.
The graph of the F distribution is always positive and skewed right, though the shape can be mounded or exponential depending on the combination of numerator and denominator degrees of freedom. The F statistic is the ratio of a measure of the variation in the group means to a similar measure of the variation within the groups. If the null hypothesis is correct, then the numerator should be small compared to the denominator. A small F statistic will result, and the area under the F curve to the right will be large, representing a large p -value. When the null hypothesis of equal group means is incorrect, then the numerator should be large compared to the denominator, giving a large F statistic and a small area (small p -value) to the right of the statistic under the F curve.
When the data have unequal group sizes (unbalanced data), then techniques from [link] need to be used for hand calculations. In the case of balanced data (the groups are the same size) however, simplified calculations based on group means and variances may be used. In practice, of course, software is usually employed in the analysis. As in any analysis, graphs of various sorts should be used in conjunction with numerical techniques. Always look at your data!
An F statistic can have what values?
What happens to the curves as the degrees of freedom for the numerator and the denominator get larger?
The curves approximate the normal distribution.
Use the following information to answer the next seven exercise. Four basketball teams took a random sample of players regarding how high each player can jump (in inches). The results are shown in [link] .
Team 1 | Team 2 | Team 3 | Team 4 | Team 5 |
---|---|---|---|---|
36 | 32 | 48 | 38 | 41 |
42 | 35 | 50 | 44 | 39 |
51 | 38 | 39 | 46 | 40 |
What is the df(num) ?
What is the df(denom) ?
ten
What are the Sum of Squares and Mean Squares Factors?
What are the Sum of Squares and Mean Squares Errors?
SS = 237.33; MS = 23.73
What is the F statistic?
What is the p -value?
0.1614
At the 5% significance level, is there a difference in the mean jump heights among the teams?
Use the following information to answer the next seven exercises. A video game developer is testing a new game on three different groups. Each group represents a different target market for the game. The developer collects scores from a random sample from each group. The results are shown in
[link]
Group A | Group B | Group C |
---|---|---|
101 | 151 | 101 |
108 | 149 | 109 |
98 | 160 | 198 |
107 | 112 | 186 |
111 | 126 | 160 |
What is the df(num) ?
two
What is the df(denom) ?
What are the SS between and MS between ?
SS = 5,700.4;
MS = 2,850.2
What are the SS within and MS within ?
What is the F Statistic?
3.6101
What is the p -value?
At the 10% significance level, are the scores among the different groups different?
Yes, there is enough evidence to show that the scores among the groups are statistically significant at the 10% level.
Use the following information to answer the next three exercises. Suppose a group is interested in determining whether teenagers obtain their drivers licenses at approximately the same average age across the country. Suppose that the following data are randomly collected from five teenagers in each region of the country. The numbers represent the age at which teenagers obtained their drivers licenses.
Northeast | South | West | Central | East | |
---|---|---|---|---|---|
16.3 | 16.9 | 16.4 | 16.2 | 17.1 | |
16.1 | 16.5 | 16.5 | 16.6 | 17.2 | |
16.4 | 16.4 | 16.6 | 16.5 | 16.6 | |
16.5 | 16.2 | 16.1 | 16.4 | 16.8 | |
________ | ________ | ________ | ________ | ________ | |
________ | ________ | ________ | ________ | ________ |
Enter the data into your calculator or computer.
p -value = ______
State the decisions and conclusions (in complete sentences) for the following preconceived levels of α .
α = 0.05
a. Decision: ____________________________
b. Conclusion: ____________________________
α = 0.01
a. Decision: ____________________________
b. Conclusion: ____________________________
Notification Switch
Would you like to follow the 'Introductory statistics' conversation and receive update notifications?