180 likes | 254 Views
Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B. Today’s Agenda. Review on scatterplot and linear regression Module 10 Activity 1 and 2 Return Exam 2
E N D
Stat 350 Lab SessionGSI: Yizao WangSection 016 Mon 2pm30-4pm MH 444-DSection 043 Wed 2pm30-4pm MH 444-B
Today’s Agenda • Review on scatterplot and linear regression • Module 10 Activity 1 and 2 • Return Exam 2 • Please log in with UMID for today’s qwizdom questions. Your participation will be graded based on the number of questions you answer.
Second Midterm Summary • 2nd midtern exam more difficult than the 1st • Mean scores:lab016 57.3/lab043 56.2 (all labs: 55.3) • Deadline for checking scores: Friday April 11thCome to Yizao for totaling errors. Otherwise ask your lecture instructor.
Scatterplots • A Scatterplot displays the relationship between two quantitative variables • X: explanatory/predictor/independent variable • Y: response or dependent variable • Things to look for in Scatterplots • Form: linear, curved, clusters,… • Direction: positive or negative • Strength: how close points are to the underlying form (weak, moderate, strong, etc. ) • Outliers or other deviations from this overall pattern
Scatterplots • - #1, #2, and #3 show linear relationship. • - #1 and #2 show positive direction, but #3 shows negative direction. • #2 shows stronger strength comparing to #1. #4 has a curved form. #5 has a clustered form. #6 has outliers.
Regression Model • Population version: yi = 0 + 1xi + iwhere the i are i.i.d. N (0, ), seen as measurement errors or noise.0 and 1 are parameters, fixed but unknown constants, called intercept and slope, respectively. • Sample version: ŷi = b0 + b1xib0and b1 are estimated intercept and slope respectively.
About Correlation • Correlation r (or R) (1 r 1) measures the strength of linearassociation between two variables and directionRoughly, |r| > 0.7 => strong • R2 (0 R2 1) measures the proportion of the variation in the response that can be explained by the linear regression of Y on XRoughly, R2 > 0.5 => strong
How to Use the Yellow Card • Make sure you understand (and know how to calculate) the following terms:SXY, SXX, SYYResidual e = observed y – predicted ySSTO, SSM (SSREG), SSE • The inference part of linear regression: HT, CI, PI…
Which variable is the response? Poverty Rate Teen Birth Rate
Which variable is the explanatory variable? Poverty Rate Teen Birth Rate
Yes or No Based on the scatterplot, does there appear to be a linear relationship between teen birth rate and poverty rate?
Think about the strength and direction of the relationship. Which of the following do you think is most reasonable for the value of the correlation coefficient? -0.6 0.2 0.7 None of the above
What is the estimated regression line? Predicted_TeenBrth = 15.674 + 2.025(PovPct) Predicted_TeenBrth = 2.025 + 15.674(PovPct) PovPct = 15.674 + 2.025(Predicted_TeenBrth) PovPct = 2.025 + 15.674(Predicted_TeenBrth)
What output could be used to help determine the value of the correlation coefficient? • Model Summary > R • Model Summary > R Square • Coefficients > Standardized Coefficients Beta • All of the above
What is the correct interpretation of r2? • 49.5% of teen birth rates can be accounted for by the poverty rate. • 49.5% of the variation in teen birth rate can be accounted for by the linear relationship between teen birth rate and poverty rate.
What is the predicted teen birth rate of Michigan (with a poverty rate of 12.2%)? • 24.705 • 40.380 • 193.25 • Can’t calculate this number
Output for Homework 11 • Generate regression outputs for homework 11: scatterplot and regression tables...