770 likes | 1.33k Views
British National Corpus (BNC). An introduction to Punjaporn Pojanapunya February 17, 2011. Outline of the talk. What is the BNC? What is Xaira? How to use the BNC for: Language teaching and learning Research. What is the BNC?.
E N D
British National Corpus(BNC) An introduction to PunjapornPojanapunya February 17, 2011
Outline of the talk • What is the BNC? • What is Xaira? • How to use the BNC for: • Language teaching and learning • Research
What is the BNC? • A 100 million word collection of samples of British English from a wide range of sources (10% spoken, 90% written texts). • Available under licence; latest edition is BNC XML edition (2007)
The main uses of the BNC • Reference Book Publishing • Natural language processing • Language Teaching and Learning • Materials design • Classroom reference • Independent learning • Linguistic Research • BNC as source of real language use • BNC as benchmark http://www.natcorp.ox.ac.uk/corpus/index.xml
What is Xaira? • XML Aware Indexing and Retrieval Architecture • A text searching tool • Usable with any XML corpus • Provided free with the BNC XML Edition
File menu Word – the different word forms in the corpus Phrase – the multi-word phrase or single word form Addkey – words with additional keys such as POS codes Pattern – word patterns XML – specific XML start- or end- tags Query builder – a complex query CQL – commands in CQL, the language Xaira uses to represent its queries internally
View menu • How Xaira looks • How the solutions are displayed
Window menu • Manage the windows on the screen Help menu
Search results • Solutions • No solutions • Too many solutions dialogue box
Solutions • Page mode/ Line mode • Plain text / XML text • Scope of context • Reference (status bar)
‘It's an interesting idea. <s n="105"><c c5="PUQ">‘</c> <w c5="PNP" hw="it" pos="PRON“>It</w> <w c5="VBZ" hw="be" pos="VERB">‘s</w> <w c5="AT0" hw="an" pos="ART">an</w> <w c5="AJ0" hw="interesting" pos="ADJ">interesting</w> <w c5="NN1" hw="idea" pos="SUBST">idea</w> <c c5="PUN">.</c></s>
Use the BNC with Xaira Case studies • 1: She’ll (turn/ go) mad!! • 2: Men are handsome/ women are beautiful • Language teaching and learning • Materials design, classroom reference, independent learning • 3: Words in my corpus vs. ‘standard’ use • Research
Case study 1: She’ll (turn/ go) mad!! • Task: • comparing use of “Turn” and “Go” • Turn + adj. vs. Go + adj. • Language point: Semantic prosody • Xaira functions: • Open the BNC • New query – query builder (word query + Addkey) • Sort
Link type: • - Next • Not next • one-way • two-way
Go + adj. (a-z) • Turn + adj. (a-z) 2 keys: • Examples of ‘go’, then ‘turn’ • Adj. (a-z)
Two Three