HITs Implementation

HITs Implementation Presented by the Amazingly Brilliant John Yankowski and the slightly less brilliant Larry Phillips

Eigen Values and Vectors • Av = λv (λ is the Eigenvalue) • Each λ corresponds to one Eigenvector v I don’t know what this means, but Google seems to think its related to Eigen somehow.

The POWER Method!!!! • x(k+1) = Ax(k) • xk -> Dominant Eigenvector • Hey John, What about other methods??

Computing the ultimate authority and hub scores x and y

Steps • Step 1 Initialize y(0) = e; e is a column vector of all ones • Step 2 take x(k) = Lt y(k-1) , y(k) = Lx(k) and simplify to get…

x(k) = Lt L x(k-1)y(k) = L Lt y(k-1) • Computes the dominant eigenvector for the matrices LT L (Authority matrix) and L LT (Hub Matrix)

Benefits of using the dominant eigenvectors of LTL and LLT • Incurs a small cost in comparison with using scores from all documents on Web • Only one document eigenvector needs to be computed: (LTL or LLT)

Authoritative and Hub Matrices • Authoritative means the links are to the website • Hub means the the links shoot out from the website

Mexican Hats? • Yes, Mexican hats. • We submit a query that results in pages 1 and 6, where 1 happens to point to 6

But Hey, What about Sombreros?? • Related nodes can be added to a limited extent to make the search more comprehensive

I need Mexican Hats! • The query results in Matrix L

MSPaint Matrices are Awesome! • From L, we can find the Authoritative and Hub Matrices.

HITs successfully refines the score by computing • Xi(k) = Σ yj(k-1) • Can be written as X(k) = LTy(k-1) which is the power method that will give you the dominate eigenvector

We have vectors, weee!!! • xT = (0 0 .3660 .1340 .5 0) • yT = (.3660 0 .2113 0 .2113 .2113) • Why John, Don’t those add up to 1? • Why yes they do, and thank you for asking. • These numbers give you the ranking for all your Mexican hat web pages. • Auth. Ranking = (6 3 5 1 2 10) • Hub Ranking = (1 3 6 10 2 5) Dangerously close to a Mexican hat, so we’ll count it

Bibliometricity • Yeah, it’s a big word, and we know it • Refers to two documents that are in-laws (related through association).

How does Bibliometricity apply to mexican hats? • LTL = Din + Ccit • LLT = Dout + Cref Mexican Hat in action

How does this apply to the real world? • http://www.teoma.com is a search engine that uses hits technology.

HITs Implementation

HITs Implementation

Presentation Transcript

HITS 2003

HITs Implementation

Hurricane Hits England

Meteorite hits Russia

Meteorite hits Russia

Library HITS

Freedoms Greatest Hits!

Earthquake Hits Japan

1 Million Hits

Library HITS

Depression Hits Texas

Hits

hits

Hits and Digits

Beyond Counting Hits:

Hurricane Hits England

Hurricane hits England

Flood hits London!

Die erfolgreichsten Hits

Hollywood Hits

Hurricane Earl hits

Buy Website Hits