290 likes | 565 Views
WORDNET. THE WORDNET SYSTEM. Lexicographer files Code: Lexico files database Search Routines and Interfaces. SYNSETS. A list of synonymous words or collocations that are interchangable in some context Pointers that describe the relations between
E N D
THE WORDNET SYSTEM • Lexicographer files • Code: Lexico files database • Search Routines and Interfaces
SYNSETS • A list of synonymous words or collocations that are interchangable in some context • Pointers that describe the relations between different synsets • Lexical and Semantic relations recognized
SYNSETS contd…….. • NOUNS • VERBS • ADJECTIVES Clusters of head synsets and satellite synsets Clusters are antonymous pairs/triplets Pertainyms • ADVERBS derived from adjectives lexical pointer to root adjective Lexical Relations
LEXICAL RELATIONS • Synonymy • Polysemy • Hyponymy/Hypernomy • Meronymy/Holonomy • Antonymy • Troponomy
Definitions… • Synonymy: Different words - related senses of meanings • Polysemy: Same word - different senses of meanings • Troponomy: Verbs that denote manner elaboration of another verb
…more Definitions • Hyponymy/Hyperonymy: Generic or Universal – Specific or Particular (IS-A) • Meronymy/Holonomy: Part of (HAS-A) • Antonymy: Opposite sense
FILES • Database INDEX files (Unix) index.pos (Unix) pos .idx (Windows) • Database DATA files (Unix) data.pos (Unix) pos .dat (Windows) • Files of sentences illustrating the use of VERB *.vrb • Morphology Exception lists pos .exc
FILE FORMATS • INDEX FILE FORMAT lemma pos synset_cnt p_cnt [ptr_symbol...] sense_cnt tagsense_cnt synset_offset [synset_offset...] • DATA FILE FORMAT synset_offset lex_filenum ss_type w_cnt word lex_id [word lex_id...] p_cnt [ptr...] [frames...] | gloss ptr: Pointer_symbol synset_offset pos source/target Frames: f_cnt + f_num w_num [ + f_num w_num + ..]
Formats… • SENSE INDEX FORMAT sense_key synset_offset sense_number tag_cnt Sense-key:lemma%lex_sense Lex_sense: ss_type:lex_filenum:lex_id:head_word:head_id
MORPHY • Query Base form • Query is single word or collocation • Single word: Separating inflections from query strings - plural of nouns - forms of verbs - degree of adjectives • Collocation: - individual base forms - hyphens, spaces, period as delimiters
MORPHY AT WORK…. • NOUN grapes grape • VERB eating eat • ADJ faster fast • ADV greenish green Returns a base form OR NULL on every call eg: axes - 2 base forms (axe and axis)
INPUT SEARCHES -Synonyms - Ordered by estimated frequency - Coordinate terms - Hypernyms - Derivationally related forms - Sentence frames - Synonyms grouped by similarity
VARIOUS SEARCHES…. • Derivationally Related Forms: word forms that are morphologically related to searchstr • Co-ordinate Terms: words having the same hypernyms • Sentence Frames: illustrative sentences
DISPLAY OF SENSES • Sense n [{synset_offset}] [<lex_filename>] word1[#sense_number][, word2...] • Each synset is printed on one line • Each line of output is preceded by a marker (usually => ) then a synset • synset glosses in parentheses at the end of each synset
Contd……. • Indentation by spaces for different levels in hierarchy • Semantic tagging using Brown Corpus • Senses are ordered by decreasing frequency of use • Verb senses grouped by similarity of meaning
Overview for "swimming" • The noun "swimming" has 1 sense in WordNet. 1. swimming, swim -- (the act of swimming) • The verb "swim" has 2 senses in WordNet. 1. swim -- (travel through water; "We had to swim for 20 minutes to reach the shore"; "a big fish was swimming in the tank") 2. float, swim -- (be afloat; stay on a liquid surface; not sink) • The adjective "swimming" has 2 senses in WordNet. 1. liquid, swimming, watery -- (filled or brimming with tears; "swimming eyes"; "watery eyes"; "sorrow made the eyes of many grow liquid") 2. naiant, swimming -- (applied to a fish depicted horizontally)