40 likes | 218 Views
Web Information retrieval (Web IR). Handout #8: Probabilistic information retrieval. Ali Mohammad Zareh Bidoki ECE Department, Yazd University alizareh@yaduni.ac.ir. Outline. R(q,d)= P(d,|q). Okapi BM25 Algorithm. f ( qi,D ) is the occurrences of qi in the document D
E N D
Web Information retrieval (Web IR) Handout #8:Probabilistic information retrieval Ali Mohammad Zareh Bidoki ECE Department, Yazd University alizareh@yaduni.ac.ir
Outline • R(q,d)= P(d,|q)
Okapi BM25 Algorithm • f(qi,D) is the occurrences of qi in the document D • f(qi;Q) is the occurrences of qi in the query Q • |D|is the length of the document D (i.e., the number of words), and avgdl is the average document length • k1, k3 and b are free • parameters. Usually set k1 = 2.5, k3 = 0 and b = 0.8.