1 / 12

Understanding Sources of Variation in Housing Price Predictions

Reviewing housing price prediction models, study protocols, data collection methods, and key experimental definitions to make informed decisions. Evaluate and choose the preferred model based on residual errors and interpret the study conclusions.

Download Presentation

Understanding Sources of Variation in Housing Price Predictions

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Stat 513 – Day 3 Sources of Variation (Section 1.1)

  2. Last Time – Review ideas?

  3. Last Time • Lab 1 – predicting housing prices • Model 1: predicted price = $408,060, • Residuals SE = $240,380 • Model 2: predicted price = -59.37 + 0.2127 sqft • Residuals SE = $185,070 • Model 3: predicted price = 86.76 +.22 sqft if lake front 58.11 + 08 sqft if not lake front • Residuals SE = $49,480 • Which model do you prefer?

  4. Definition • The study protocol outlines how the study will be conducted, providing enough detail, so that someone else could carry out the same study under identical conditions. It is important to consider the research question when evaluating whether the study protocol will be appropriate.

  5. Data Collection • You will be given a slip of paper with 30 letters to memorize in order • You will have 20 seconds to look at the letters, then turn the paper over and write down as many as you can remember in order • Your score is the number of correct letters before the first mistake

  6. Definitions • The explanatory variable deliberately manipulated in an experiment is often called a treatment variable or factor. • For categorical explanatory variables, the different categories of the treatment variable are often called levels. • In experiments, the objects (or “subjects”) that we are measuring are often called experimental units rather than observational units. • The conditions we impose on the experimental units (here the levels of the treatment variable) are also called treatments. Each experimental unit is assigned to one treatment.

  7. Key Idea • The goal of random assignment is to reduce the chances of there being any confounding variables in the study. By creating groups that are expected to be similar with respect to variables (other than the treatment variable of interest) that may impact the response, random assignment attempts to eliminate confounding. • See Example 1.1 for more discussion.

  8. Definition • A study is double blind if (i) the subjects do not know which treatment condition they are in, and, (ii) the person evaluating the response variable does not know which treatment condition the subject is in If only one of the above conditions are true for a study, then the study is said to be single blind.

  9. Definition • Inclusion criteria are the set of characteristics that individuals must have in order to participate in a study.

  10. Study Conclusions • Were the experimental units randomly selected from the population of interest? • If so, then feel comfortable generalizing our conclusions to that population • Were the experimental units randomly assigned to the treatments? • If so, then feel comfortable drawing cause-and-effect conclusions (potentially)

  11. Study Conclusions

  12. For next time • Finish and submit Lab 1 • Uploading files • Be working on HW 1 • Begin reading Ch. 1

More Related