1 / 40

Exploratory Data Analysis with SAS and R

Learn SAS and R for guided data analysis, interactive exploration, and statistical modeling. Import data, run regression, create visualizations, and explore assumptions. Detailed explanation of SAS/Insight features. Discover the difference in approach between SAS and R.

rlyons
Download Presentation

Exploratory Data Analysis with SAS and R

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Exploratory Data Analysis with SAS and R Kristin Vanderbilt University of New Mexico

  2. Outline: • SAS • Introduction • SAS/Insight • Guided Data Analysis • R • Graphics capabilities

  3. SAS windows • Program editor • Log • Output

  4. Animal brain weight/body weight data

  5. Importing data into SAS

  6. SAS: Guided Data Analysis

  7. Explore data before doing regression Choose Summarize ….

  8. Normal probability plot of Brain_wt

  9. Histogram of Brain_Wt

  10. Select variables with which to make a matrix plot …

  11. Matrix Plot

  12. Run the regression Choose Analyze ….

  13. Explore Assumptions…..

  14. Outliers

  15. Leverage Points

  16. Other functions available under Guided Data Analysis …. • View and Edit data • Transform variables • Select and view specific observations • Query for subsets of data • Create a “journal” storing analysis results

  17. SAS/Insight • Tool for data exploration and analysis • Analyze univariate distributions, • Investigate multivariate distributions, and • Fit explanatory models using ANOVA and regression

  18. SAS/Insight: Interactive Data Analysis

  19. Create a box plot

  20. Brush a group of points

  21. Label selected points

  22. Run a regression ….

  23. Click on a variable and transform it

  24. Lots of statistics are available

  25. Cook’s Distance

  26. R: an Open Source alternative • There is an important difference in philosophy between S (and hence R) and the other main statistical systems. In S a statistical analysis is normally done as a series of steps, with intermediate results being stored in objects. Thus whereas SAS and SPSS will give copious output from a regression or discriminant analysis, R will give minimal output and store the results in a fit object for subsequent interrogation by further R functions.

  27. R provides a command line interpreter for a dialect of S

  28. Other graphical options in R: Copied from: http://www.r-project.org/

  29. Image and 3D plot of volcano From: http://www.r-project.org/

  30. References • http://www.r-project.org/ • http://cran.r-project.org/doc/contrib/Lemon-kickstart/index.html

More Related