180 likes | 288 Views
Using WordNet and WSD in Conceptual Query Expansion. Jiuling Zhang 2009-03-04. Outline. Why perform query expansion? WordNet based Word Sense Disambiguation WordNet Word Sense Disambiguation Conceptual Query Expansion Experiments Conclusion Future work References.
E N D
Using WordNet and WSD in Conceptual Query Expansion Jiuling Zhang 2009-03-04
Outline • Why perform query expansion? • WordNet based Word Sense Disambiguation • WordNet • Word Sense Disambiguation • Conceptual Query Expansion • Experiments • Conclusion • Future work • References
Why perform query expansion? queries are baffled with: • Incompleteness lack of enough knowledge • Vagueness uncertainties inherent to natural languages: synonymous & polysemous • Spelling errors
Why perform query expansion? How to handle this problem? • Query expansion/reformulation with a thesaurus • Query expansion by automatic thesaurus generation • Spelling correction
Why perform query expansion? Examples & recent researches: • Qiu introduced query expansion based on concept[1] • Smeaton tried to expand weighting and word sense disambiguation techniques[2] • Hoeber manually constructed a concept network to expand[3]
WordNet • A large manually constructed comprehensive thesaurus developed at Princeton[4] • WordNet is organized into a network of synonyms(synsets) • A synset is basic element in WordNet and words of the same synset are exchangeable in some context
WordNet • Example: word vs. synset wn good -synsn Sense 1 good => advantage, vantage Sense 2 good, goodness => morality Sense 3 good, goodness => quality Sense 4 commodity, trade good, good => artifact, artifact
Word Sense Disambiguation • Word Sense Disambiguation supervised Word Sense Disambiguation: relies on a sense-tagged corpus unsupervised Word Sense Disambiguation: relies on a machine readable thesaurus instead of sense-tagged corpus[5] • Our method belongs to the latter one
Word Sense Disambiguation Components: • WordNet • WordNet modules: WordNet::SenseRelate::AllWords[6] WordNet::QueryData[7] Lingua::WordNet[8]
Word Sense Disambiguation • The adapted Lesk measure is employed. • Adapted Lesk algorithm is higher version of the Lesk method by counting the number of overlaps not only of glosses of synonymy but also of glosses of other related synsets, hyponymy, meronymy, troponymy e.g. [5]
Conceptual Query Expansion Procedure: • Preprocess • Perform WSD to query • Combine new terms to obtain new queries • Perform WSD to obtained queries • Compare synset array & select expanded queries
Conceptual Query Expansion The flow chart of query expansion procedure
Experiment • Short queries are avoided • Expanded queries are to the Google and evaluated by 10 persons • Precision@10 are recorded
Experiment • Results:
Conclusion • Propose a new concept based query expansion using WordNet and WSD • Experimental results show it can improve effectiveness
Future work • Apply the similar idea to sentences in documents to perform documents expansion • Employing Markov language model to modify newly generated queries[9] • Experiments on TREC Web Track collections
References [1] Qiu, Y., Frei, H.-P.: Concept based query expansion. In Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval. ACM Press, Pittsburgh, Pennsylvania, USA (1993) 160-169 [2] R. Richardson, AF Smeaton.: Using WordNet in a Knowledge-Based Approach to Information Retrieval. Proceedings of the BCS-IRSG Colloquium, Crewe(1995) [3] Hoeber, X.-D. Yang, and Y. Yao.: Conceptual query expansion. In Proceedings of the Atlantic Web Intelligence Conference (2005) [4] Miller, G., R. Beckwith, C. Fellbaum, D. Gross, and K. Miller.: Five papers on WordNet. CSL Report 43, Cognitive Science Laboratory, Princeton University(1990) [5] Patwardhan, S., Banerjee, S., Pedersen, T.: UMND1: Unsupervised Word Sense Disambiguation Using Contextual Semantic Relatedness. In the Proceedings of SemEval-2007: 4th International Workshop on Semantic Evaluations(2007) 390-393 [6] http://search.cpan.org/~jrennie/WordNet-SenseRelate-AllWords/ [7] http://search.cpan.org/~jrennie/WordNet-QueryData-1.47/ [8] http://search.cpan.org/~dbrian/Lingua-Wordnet-0.74/ [9] Shuang Liu, Fang Liu, Clement Yu Weiyi Meng.: An Effective Approach to Document Retrieval via Utilizing WordNet and Recognizing Phrases. In Proceedings of the 27th Annual International ACM/SIGIR Conference on Research and development in information retrieval, Sheffield, Yorkshire, UK(2004)