Document ranking
Document ranking. Text-based Ranking (1° generation). Doc is a binary vector. Binary vector X,Y in {0,1} D Score: overlap measure. What ’ s wrong ?. Normalization. Dice coefficient (wrt avg #terms) : Jaccard coefficient (wrt possible terms) :. NO, triangular. OK, triangular.
591 views • 57 slides