<< Chapter < Page Chapter >> Page >

Construct the outlier boxplot for the exam scores of 13 students in your statistics class: 48, 55, 63, 68, 75, 76, 76, 78, 78, 81, 87, 89, 93. Write a complete description of the analysis of the exam score data. Remember that a complete description of numerical data includes comments on shape, center, and spread.

Minimum: 48, Ist Quartile: 68, Median: 76, 3rd Quartile: 81, Maximum:94) IQR = 13 with step 19.5 Lower Fence = 48.5 and Upper Fence = 100.5 Lower Adjacent Value = 55 Upper Adjacent Value = 93 Outlier at 48 *The distribution of statistics student’s exam scores is slightly skewed left or to the low values. A typical score on the test was 76 points, the median. The overall variability of the exam scores is 45 points with the middle half of the data having a spread of 13 points. These is one outlier at the low end of 48 points.

A survey of enrollment at 35 community colleges across the United States yielded the following figures ( source: Microsoft Bookshelf ):

  • 6414
  • 1550
  • 2109
  • 9350
  • 21828
  • 4300
  • 5944
  • 5722
  • 2825
  • 2044
  • 5481
  • 5200
  • 5853
  • 2750
  • 10012
  • 6357
  • 27000
  • 9414
  • 7681
  • 3200
  • 17500
  • 9200
  • 7380
  • 18314
  • 6557
  • 13713
  • 17768
  • 7493
  • 2771
  • 2861
  • 1263
  • 7285
  • 28165
  • 5080
  • 11622

  • Organize the data into a chart with five intervals of equal width. Label the two columns "Enrollment"and "Frequency."
  • Construct a histogram of the data.
  • If you were to build a new community college, which piece of information would be more valuable: the mode or the mean?
  • Calculate the sample mean.
  • Calculate the sample standard deviation.
  • A school with an enrollment of 8000 would be how many standard deviations away from the mean?

The median age of the U.S. population in 1980 was 30.0 years. In 1991, the median age was 33.1 years. ( Source: Bureau of the Census )

  • What does it mean for the median age to rise?
  • Give two reasons why the median age could rise.
  • For the median age to rise, is the actual number of children less in 1991 than it was in 1980? Why or why not?
  • Maybe

The following box plot shows the U.S. population for 1990, the latest available year. (Source: Bureau of the Census, 1990 Census)

A box plot with values from 0 to 105, with Q1 at 17, M at 33, and Q3 at 50.
  • Are there fewer or more children (age 17 and under) than senior citizens (age 65 and over)? How do you know?
  • 12.6% are age 65 and over. Approximately what percent of the population are of working age adults (above age 17 to age 65)?
  • more children
  • 62.4%

Try these multiple choice questions (exercises 24 - 30).

The next three questions refer to the following information. We are interested in the number of years students in a particular elementary statistics class have lived in California. The information in the following table is from the entire section.

Number of years Frequency
Total = 20
7 1
14 3
15 1
18 1
19 4
20 3
22 1
23 1
26 1
40 2
42 2

What is the IQR?

  • 8
  • 11
  • 15
  • 35

A

What is the mode?

  • 19
  • 19.5
  • 14 and 20
  • 22.65

A

Is this a sample or the entire population?

  • sample
  • entire population
  • neither

B

The next two questions refer to the following table. X = the number of days per week that 100 clients use a particular exercise facility.

x Frequency
0 3
1 12
2 33
3 28
4 11
5 9
6 4

The 80th percentile is:

  • 5
  • 80
  • 3
  • 4

D

The number that is 1.5 standard deviations BELOW the mean is approximately:

  • 0.7
  • 4.8
  • -2.8
  • Cannot be determined

A

The next two questions refer to the following histogram. Suppose one hundred eleven people who shopped in a special T-shirt store were asked the number of T-shirts they own costing more than $19 each.

A histogram showing the results of a survey.  Of 111 respondents, 5 own 1 t-shirt costing more than $19, 17 own 2, 23 own 3, 39 own 4, 25 own 5, 2 own 6, and no respondents own 7.

The percent of people that own at most three (3) T-shirts costing more than $19 each is approximately:

  • 21
  • 59
  • 41
  • Cannot be determined

C

If the data were collected by asking the first 111 people who entered the store, then the type of sampling is:

  • cluster
  • simple random
  • stratified
  • convenience

D

Below are the 2010 obesity rates by U.S. states and Washington, DC. ( Source: http://www.cdc.gov/obesity/data/adult.html) )

State Percent (%) State Percent (%)
Alabama 32.2 Montana 23.0
Alaska 24.5 Nebraska 26.9
Arizona 24.3 Nevada 22.4
Arkansas 30.1 New Hampshire 25.0
California 24.0 New Jersey 23.8
Colorado 21.0 New Mexico 25.1
Connecticut 22.5 New York 23.9
Delaware 28.0 North Carolina 27.8
Washington, DC 22.2 North Dakota 27.2
Florida 26.6 Ohio 29.2
Georgia 29.6 Oklahoma 30.4
Hawaii 22.7 Oregon 26.8
Idaho 26.5 Pennsylvania 28.6
Illinois 28.2 Rhode Island 25.5
Indiana 29.6 South Carolina 31.5
Iowa 28.4 South Dakota 27.3
Kansas 29.4 Tennessee 30.8
Kentucky 31.3 Texas 31.0
Louisiana 31.0 Utah 22.5
Maine 26.8 Vermont 23.2
Maryland 27.1 Virginia 26.0
Massachusetts 23.0 Washington 25.5
Michigan 30.9 West Virginia 32.5
Minnesota 24.8 Wisconsin 26.3
Mississippi 34.0 Wyoming 25.1
Missouri 30.5
  • Construct a bar graph of obesity rates of your state and the four states closest to your state. Hint: Label the x-axis with the states.
  • Use a random number generator to randomly pick 8 states. Construct a bar graph of the obesity rates of those 8 states.
  • Construct a bar graph for all the states beginning with the letter "A."
  • Construct a bar graph for all the states beginning with the letter "M."

Example solution for b using the random number generator for the Ti-84 Plus to generate a simple random sample of 8 states. Instructions are below.

  • Number the entries in the table 1 - 51 (Includes Washington, DC; Numbered vertically)
  • Press MATH
  • Arrow over to PRB
  • Press 5:randInt(
  • Enter 51,1,8)
Eight numbers are generated (use the right arrow key to scroll through the numbers). The numbers correspond to the numbered states (for this example: {47 21 9 23 51 13 25 4}. If any numbers are repeated, generate a different number by using 5:randInt(51,1)). Here, the states (and Washington DC) are {Arkansas, Washington DC, Idaho, Maryland, Michigan, Mississippi, Virginia, Wyoming}.Corresponding percents are {28.7 21.8 24.5 26 28.9 32.8 25 24.6}. A bar graph showing 8 states on the x-axis and corresponding obesity rates on the y-axis.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics using spreadsheets. OpenStax CNX. Jan 05, 2016 Download for free at http://legacy.cnx.org/content/col11521/1.23
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?

Ask