This module provides homework questions related to lessons on descriptive statistics. The original module by Dr. Barbara Illowsky and Susan Dean has been modified by Roberta Bloom. Some homework questions have been changed and/or added.
Twenty-five randomly selected students were asked the number of movies they watched the previous week. The results are as follows:
# of movies
Frequency
Relative Frequency
Cumulative Relative Frequency
0
5
1
9
2
6
3
4
4
1
Find the sample mean
Find the sample standard deviation,
Construct a histogram of the data.
Complete the columns of the chart.
Find the first quartile.
Find the median.
Find the third quartile.
Construct a box plot of the data.
What percent of the students saw fewer than three movies?
Find the 40th percentile.
Find the 90th percentile.
1.48
1.12
1
1
2
80%
1
3
The median age for U.S. blacks currently is 30.1 years; for U.S. whites it is 36.6 years. (Source: U.S. Census)
Based upon this information, give two reasons why the black median age could be lower than the white median age.
Does the lower median age for blacks necessarily mean that blacks die younger than whites? Why or why not?
How might it be possible for blacks and whites to die at approximately the same age, but for the median age for whites to be higher?
Forty randomly selected students were asked the number of pairs of sneakers they owned. Let X = the number of pairs of sneakers owned. The results are as follows:
X
Frequency
Relative Frequency
Cumulative Relative Frequency
1
2
2
5
3
8
4
12
5
12
7
1
Find the sample mean
Find the sample standard deviation,
Construct a histogram of the data.
Complete the columns of the chart.
Find the first quartile.
Find the median.
Find the third quartile.
Construct a box plot of the data.
What percent of the students owned at least five pairs?
Find the 40th percentile.
Find the 90th percentile.
3.78
1.29
3
4
5
32.5%
4
5
600 adult Americans were asked by telephone poll,
What do you think constitutes a middle-class income?The results are below. Also, include left endpoint, but not the right endpoint. (
Source: Time magazine; survey by Yankelovich Partners, Inc. )
"Not sure" answers were omitted from the results.
Salary ($)
Relative Frequency
<20,000
0.02
20,000 - 25,000
0.09
25,000 - 30,000
0.19
30,000 - 40,000
0.26
40,000 - 50,000
0.18
50,000 - 75,000
0.17
75,000 - 99,999
0.02
100,000+
0.01
What percent of the survey answered
"not sure"?
What percent think that middle-class is from $25,000 - $50,000 ?
Construct a histogram of the data
Should all bars have the same width, based on the data? Why or why not?
How should the<20,000
and the100,000+
intervals be handled? Why?
Find the 40th and 80th percentiles
Following are the published weights (in pounds) of all of the team members of the San Francisco 49ers from a previous year (Source: San Jose Mercury News).
177
205
210
210
232
205
185
185
178
210
206
212
184
174
185
242
188
212
215
247
241
223
220
260
245
259
278
270
280
295
275
285
290
272
273
280
285
286
200
215
185
230
250
241
190
260
250
302
265
290
276
228
265
Organize the data from smallest to largest value.
Find the median.
Find the first quartile.
Find the third quartile.
Construct a box plot of the data.
The middle 50% of the weights are from _______ to _______.
If our population were all professional football players, would the above data be a sample of weights or the population of weights? Why?
If our population were the San Francisco 49ers, would the above data be a sample of weights or the population of weights? Why?
Assume the population was the San Francisco 49ers. Find:
the population mean,
.
the population standard deviation,
.
the weight that is 2 standard deviations below the mean.
When Steve Young, quarterback, played football, he weighed 205 pounds. How many standard deviations above or below the mean was he?
That same year, the average weight for the Dallas Cowboys was 240.08 pounds with a standard deviation of 44.38 pounds. Emmit Smith weighed in at 209 pounds. With respect to his team, who was lighter, Smith or Young? How did you determine your answer?
Based on the shape of the data, what is the most appropriate measure of center for this data: mean, median, or mode? Explain.
Are there any outliers in the data? Use an appropriate numerical test involving the
IQR to identify outliers, if any, and clearly state your conclusion.
Are any data values further away than 2 standard deviations from the mean? Clearly state your conclusion and show numerical work to justify your answer.
241
205.5
272.5
205.5, 272.5
sample
population
236.34
37.50
161.34
0.84 std. dev. below the mean
Young
The mean is most appropriate. From the boxplot the data appear to be relatively symmetric. When the data are symmetric, it is appropriate to use the mean because it incorporates more information from the data. (If the data were skewed, then it would be more appropriate to use the median; but these data are not skewed.)
IQR = 272.5 – 202.5 = 67; Q1 – 1.5*IQR = 205.5 – 1.5(67) = 105; Q3 + 1.5*IQR = 272.5 + 1.5(67) = 373.
All weights are between 105 and 373. There are no outliers.
Mean – 2(standard deviation) = 240.08 – 2(44.38) = 151.32 ;
Mean + 2(standard deviation) = 240.08 + 2(44.38) = 328.84 ;All players' weights are between 2 standard deviations above and below the mean.
Receive real-time job alerts and never miss the right job again
Source:
OpenStax, Collaborative statistics homework book: custom version modified by r. bloom. OpenStax CNX. Dec 23, 2009 Download for free at http://legacy.cnx.org/content/col10619/1.2
Google Play and the Google Play logo are trademarks of Google Inc.
Notification Switch
Would you like to follow the 'Collaborative statistics homework book: custom version modified by r. bloom' conversation and receive update notifications?