<< Chapter < Page Chapter >> Page >
This module provides a summary on Linear Regression and Correlation as a part of Collaborative Statistics collection (col10522) by Barbara Illowsky and Susan Dean.

Bivariate Data: Each data point has two values. The form is ( x , y ) .

Line of Best Fit or Least Squares Line (LSL): y ^ = a + bx

x = independent variable; y = dependent variable

Residual: Actual y value - predicted y value = y - y ^

    Correlation coefficient r:

  • Used to determine whether a line of best fit is good for prediction.
  • Between -1 and 1 inclusive. The closer r is to 1 or -1, the closer the original points are to a straight line.
  • If r is negative, the slope is negative. If r is positive, the slope is positive.
  • If r = 0 , then the line is horizontal.

Sum of Squared Errors (SSE): The smaller the SSE , the better the original set of points fits the line of best fit.

Outlier: A point that does not seem to fit the rest of the data.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics-parzen remix. OpenStax CNX. Jul 15, 2009 Download for free at http://legacy.cnx.org/content/col10732/1.2
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics-parzen remix' conversation and receive update notifications?

Ask