1 / 17

D/RS 1013

D/RS 1013. Discriminant Analysis. Discriminant Analysis Overview. multivariate extension of the one-way ANOVA looks at differences between 2 or more groups goal is to discriminate between groups considers several predictor variables simultaneously. Discriminant - Overview.

abdalla
Download Presentation

D/RS 1013

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. D/RS 1013 Discriminant Analysis

  2. Discriminant Analysis Overview • multivariate extension of the one-way ANOVA • looks at differences between 2 or more groups • goal is to discriminate between groups • considers several predictor variables simultaneously

  3. Discriminant - Overview • provides a way to describe differences between groups in simple terms. • removes the redundancy among large numbers of variables by combining into a smaller number of Discriminant functions • can classify cases to groups when their group membership is unknown

  4. Overview (cont.) • tests the significance of differences between two or more groups • examines several predictor variables simultaneously • construct linear combination of these variables, forming a single composite variable called a discriminant function • basically MANOVA flipped upside down

  5. Discriminant parallels with MANOVA and Regression • Discriminant works the other way, predicting group membership by some kind of scores • The discriminant function takes the form: • D = d1z1 + d2z2 + ..... + dpzp

  6. Discriminant functions • Di = d1z1 + d2z2 + ..... + dpzp • where, D = scores on the discriminant function • d1 - dp = discriminant function weighting coefficients for each of p predictor variables • z1 - zp = standardized scores on the original p variables

  7. Unstandardized Functions • even more like regression equation • Di = a + d1x1 + d2x2 + ..... + dpxp • a = the discriminant function constant • d1 - dp = discriminant function weighting coefficients for each of p predictor variables • x1 - xp = raw scores on the original p variables

  8. Forming discriminant functions • discriminant function is formed to maximize the F value associated with the D • F = bg variance on D / wg variance on D • provides a function with the greatest discriminating power.

  9. Functions beyond the first • first function is one of many combinations of the p original predictor variables. • # of useful functions is p (# of original variables) or k-1 (k=# of groups being considered), whichever is smaller. • later functions maximize the separation between groups and are orthogonal with the preceding functions.

  10. First discriminant function (3 gps) Separates group 1 from groups 2 & 3

  11. Second function (3 gps) Separates group 3 from groups 1 & 2

  12. Both functions together Orthogonal = uncorrelated

  13. Confusion Matrix • assign cases to groups based on their discriminant function scores • assignments compared with actual group memberships • confusion matrix gives both overall accuracy of classification and the relative frequencies of various types of misclassification

  14. Confusion matrix: example • our proportion correct is (43 + 39)/100= .82 • by chance alone we would end up with .50 correct • if we evenly divided our group assignments • between A & B half in each group correct by chance • can consider prior probabilities, if known

  15. Cross Validation • hold back some of the data to test the model that emerges • gives good idea of the kind of predictive accuracy we can expect for another sample • small samples and several variables unlikely to replicate across samples

  16. Classification Functions • weights and constants used to calculate scores for each case • as many scores as there are groups for each case • assign to group that the case has the highest classification function score for

  17. Assumptions • assumes that all predictors follow a multivariate normal distribution • test is robust with respect to normality, in practice, lack of normality doesn't make much of a difference • especially with large n and moderate number of predictors

More Related