160 likes | 202 Views
Russian Word Sketches. Khokhlova Maria St.Petersburg State University Institute for Linguistic Studies khokhlova.marie@gmail.com. Russian Web Corpus, S. Sharoff; 10 mln tokens (sample); Russian National Corpus isn’t available in the Sketch Engine; 2 Sketch grammars: “Classic” grammar;
E N D
Russian Word Sketches Khokhlova Maria St.Petersburg State University Institute for Linguistic Studies khokhlova.marie@gmail.com
Russian Web Corpus, S. Sharoff; • 10 mln tokens (sample); • Russian National Corpus isn’t available in the Sketch Engine; • 2 Sketch grammars: • “Classic” grammar; • V. Benko’s grammar
Verb X/X Verb 2:[tag="V.*"] [tag!="Z"&tag!="SENT"]{0,2} 1:[tag!="SENT"&tag!="Z"&tag!="S.*"&tag!="I"] 1:[tag!="SENT"&tag!="Z"&tag!="S.*"&tag!="I"] [tag!=","&tag!="SENT"]{0,2} 2:[tag="V.*"]
“Classic” Approach: precise results, less noise. • V.Benko’s Approach: word sketches are generated for any word, important in the case of mistakes in corpora.