170 likes | 213 Views
HITs Implementation. Presented by the Amazingly Brilliant John Yankowski and the slightly less brilliant Larry Phillips. Eigen Values and Vectors. Av = λ v ( λ is the Eigenvalue) Each λ corresponds to one Eigenvector v.
E N D
HITs Implementation Presented by the Amazingly Brilliant John Yankowski and the slightly less brilliant Larry Phillips
Eigen Values and Vectors • Av = λv (λ is the Eigenvalue) • Each λ corresponds to one Eigenvector v I don’t know what this means, but Google seems to think its related to Eigen somehow.
The POWER Method!!!! • x(k+1) = Ax(k) • xk -> Dominant Eigenvector • Hey John, What about other methods??
Steps • Step 1 Initialize y(0) = e; e is a column vector of all ones • Step 2 take x(k) = Lt y(k-1) , y(k) = Lx(k) and simplify to get…
x(k) = Lt L x(k-1)y(k) = L Lt y(k-1) • Computes the dominant eigenvector for the matrices LT L (Authority matrix) and L LT (Hub Matrix)
Benefits of using the dominant eigenvectors of LTL and LLT • Incurs a small cost in comparison with using scores from all documents on Web • Only one document eigenvector needs to be computed: (LTL or LLT)
Authoritative and Hub Matrices • Authoritative means the links are to the website • Hub means the the links shoot out from the website
Mexican Hats? • Yes, Mexican hats. • We submit a query that results in pages 1 and 6, where 1 happens to point to 6
But Hey, What about Sombreros?? • Related nodes can be added to a limited extent to make the search more comprehensive
I need Mexican Hats! • The query results in Matrix L
MSPaint Matrices are Awesome! • From L, we can find the Authoritative and Hub Matrices.
HITs successfully refines the score by computing • Xi(k) = Σ yj(k-1) • Can be written as X(k) = LTy(k-1) which is the power method that will give you the dominate eigenvector
We have vectors, weee!!! • xT = (0 0 .3660 .1340 .5 0) • yT = (.3660 0 .2113 0 .2113 .2113) • Why John, Don’t those add up to 1? • Why yes they do, and thank you for asking. • These numbers give you the ranking for all your Mexican hat web pages. • Auth. Ranking = (6 3 5 1 2 10) • Hub Ranking = (1 3 6 10 2 5) Dangerously close to a Mexican hat, so we’ll count it
Bibliometricity • Yeah, it’s a big word, and we know it • Refers to two documents that are in-laws (related through association).
How does Bibliometricity apply to mexican hats? • LTL = Din + Ccit • LLT = Dout + Cref Mexican Hat in action
How does this apply to the real world? • http://www.teoma.com is a search engine that uses hits technology.