<< Chapter < Page Chapter >> Page >

The test statistic (t-score) is calculated as follows:

T-score

t = ( x 1 - x 2 ) - ( μ 1 - μ 2 ) ( S 1 ) 2 n 1 + ( S 2 ) 2 n 2

    Where:

  • s 1 and s 2 , the sample standard deviations, are estimates of σ 1 and σ 2 , respectively.
  • σ 1 and σ 2 are the unknown population standard deviations.
  • x 1 and x 2 are the sample means. μ 1 and μ 2 are the population means.

The degrees of freedom (df) is a somewhat complicated calculation. However, a computer or calculator calculates it easily. The dfs are not always a whole number. The test statisticcalculated above is approximated by the student's-t distribution with dfs as follows:

Degrees of freedom

df = [ ( s 1 ) 2 n 1 + ( s 2 ) 2 n 2 ] 2 1 n 1 1 · [ ( s 1 ) 2 n 1 ] 2 + 1 n 2 1 · [ ( s 2 ) 2 n 2 ] 2

When both sample sizes n 1 and n 2 are five or larger, the student's-t approximation is very good. Notice that the sample variances s 1 2 and s 2 2 are not pooled. (If the question comes up, do not pool the variances.)

It is not necessary tocompute this by hand. A calculatoror computer easily computes it.

Independent groups

The average amount of time boys and girls ages 7 through 11 spend playing sports each day is believed to be the same. Anexperiment is done, data is collected, resulting in the table below. Both populations have a normal distribution.

Sample Size Average Number of Hours Playing Sports Per Day Sample Standard Deviation
Girls 9 2 hours 0.75
Boys 16 3.2 hours 1.00

Is there a difference in the mean amount of time boys and girls ages 7 through 11 play sports each day? Test at the 5% level of significance.

The population standard deviations are not known. Let g be the subscript for girls and b be the subscript for boys. Then, μ g is the population mean for girls and μ b is the population mean for boys. This is a test of two independent groups , two population means .

Random variable : X g - X b = difference in the sample mean amount of time girls and boys play sports each day.

H o : μ g = μ b or μ g μ b = 0

H a : μ g μ b or μ g μ b 0

The words "the same" tell you H o has an "=". Since there are no other words to indicate H a , then assume "is different." This is a two-tailed test.

Distribution for the test: Use t df where df is calculated using the df formula for independent groups, two population means. Using a calculator, df is approximately 18.8462. Do not pool the variances.

Calculate the p-value using a student's-t distribution: p-value = 0.0054

Graph:

Normal distribution curve of the difference in the average amount of time girls and boys play sports all day with values of -1.2, 0, and 1.2 on the x-axis. Two vertical upward lines extend from points -1.2 and 1.2 to the curve. The 1/2(p-values) areas are indicated on either side of these values. Normal distribution curve of the difference in the average amount of time girls and boys play sports all day with values of -1.2, 0, and 1.2 on the x-axis. Two vertical upward lines extend from points -1.2 and 1.2 to the curve. The 1/2(p-values) areas are indicated on either side of these values.

This is a picture of the t-table indicating the point for a two-tail t-test with 18 degrees of freedom and the value between 0.02 and 0.01. At that point the t-value is between 2.552 and 2.878.

s g = 0.75

s b = 1

So, x g - x b = 2 - 3.2 = - 1.2

Half the p-value is below -1.2 andhalf is above 1.2.

Make a decision: Since α > p-value, reject H o .

This means you reject μ g = μ b . The means are different.

Conclusion: At the 5% level of significance, the sample data show there is sufficient evidence to conclude that the mean number of hours that girls and boys aged 7through 11 play sports per day is different (mean number of hours boys aged 7 through 11 play sports per day is greater than the mean number of hours played by girls OR the mean number of hours girls aged 7 through 11 play sports per day is greater than the mean number of hours played by boys).

A study is done by a community group in two neighboring colleges to determine which one graduates students with more math classes. College A samples11 graduates. Their average is 4 math classes with a standard deviation of 1.5 math classes. College B samples 9 graduates. Their average is 3.5 math classes with astandard deviation of 1 math class. The community group believes that a student who graduates from college A has taken more math classes, on the average. Both populations have a normal distribution. Test at a 1% significance level.Answer the following questions.

Is this a test of two means or two proportions?

two means

Are the populations standard deviations known or unknown?

unknown

Which distribution do you use to perform the test?

student's-t

What is the random variable?

X A - X B

What are the null and alternate hypothesis?

  • H o : μ A μ B
  • H a : μ A > μ B

Is this test right, left, or two tailed?

right

What is the p-value?

0.1928

Do you reject or not reject the null hypothesis?

Do not reject.

Conclusion:

At the 1% level of significance, from the sample data, there is not sufficient evidence to conclude that a student who graduates from college A hastaken more math classes, on the average, than a student who graduates from college B.

Practice Key Terms 3

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics using spreadsheets. OpenStax CNX. Jan 05, 2016 Download for free at http://legacy.cnx.org/content/col11521/1.23
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?

Ask