190 likes | 281 Views
METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES. Katarzyna Wojtaszek student number 1118676 CROSS. I will try to answer questions:. How can I estimate correlation matrix when I have data? What can I do if matrices are non-PD? Shrinking method
E N D
METHODS OF TRANSFORMING NON-POSITIVE DEFINITE CORRELATION MATRICES Katarzyna Wojtaszek student number 1118676 CROSS
I will try to answer questions: • How can I estimate correlation matrix when I have data? • What can I do if matrices are non-PD? Shrinking method Eigenvalues method Vines method • How can we calculate distances between original and transformed matrices? • Which method is the best? comparing conclusions
How can I estimate correlation matrix if I have data? I can estimate the correlation matrices from data as follows: 1.I can estimate each off-diagonal element separately
2.I can also estimate whole data together: with i=1,…,s ; j=1,…,n
What can I do when matrices are non-PD? We can use some methods for transforming these matrices to PD correlation matrices using: Shrinking method Eigenvalues method Vines method
How can we calculate distances between original and transformed matrices? There are many methods which we can use to calculate the distance between matrices . In my project I used formula:
1. SHRINKING METHOD • linear shrinking • Assumptions: • Rnxn is given non-PD pseudo correlation matrix • is arbitrary correlation matrix • Define: ([0,1]) =R+ (R* - R) is a pseudo correlation matrix.
Idea: find the smallest such that matrix will be PD. Since R is non-PD then the smallest eigenvalue of R is negative , so we have to choose such that will be positive. Hence: And 0 if - / (*-). So we find matrix which is PD matrix given non-PD matrix R.
non-linear shrinking Assumption: Rnxn is given non-PD pseudo correlation matrix Procedure: where f is strictly increasing odd function with f(0)=0 and >0.
Comparison of the linear and non-linear shrinking methods Non-linear shrinking Rnxn SET OF PD-MATRICES Linear shrinking In
2.THE EIGENVALUE METHOD. • Assumptions: • Rnxn non-PD pseudo correlation matrix • P -orthogonal matrix such that R=PDPT • D matrix which the eigenvalues of R on the diagonal • is some constant 0
Idea: Replaced negative values in matrix D by . We obtain: R*=PD*PT = where is a diagonal matrix with diagonal elements equal for i=1,2,…,n.
3.VINES METHOD. • Assumptions: • Rnxn pseudo correlation matrix • Idea: • First we have to check if our matrix is PD
If some (-1,1) we change the value V( ) (-1,1)) and recalculate partial correlation using: V( )=V( ) + We obtain new matrix , witch we have check again.
Example • Let say that we have matrix R4x4 Very useful is making graphical model 1 2 4 3
Which method is the best? Comparing. Using Matlab I chose randomly 500 non-PD matrices, transformed them and calculated the average distances between non-PD and PD matrices. This table shows us my results.
Conclusions: • The reason that the linear shrinking is very bad method is that we shrink all elements by the same relative amount • The eigenvalues method performes fast and gives very good results regardless matrices dimensions • For the non-linear shrinking method the best choice of the projection function are and