1 / 33

A Sprinkling of Key Words

A Sprinkling of Key Words. Mike Scott Aston University June 30, 2010. Issues: Key words (KWs). Keyness Aboutness Distribution patterns of KWs. complex pattern. or simple. fractal?. Fractal.

Download Presentation

A Sprinkling of Key Words

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Sprinkling of Key Words Mike Scott Aston University June 30, 2010

  2. Issues: Key words (KWs) • Keyness • Aboutness • Distribution patterns of KWs

  3. complex pattern

  4. or simple

  5. fractal?

  6. Fractal • A fractal is "a rough or fragmented geometric shape that can be split into parts, each of which is (at least approximately) a reduced-size copy of the whole,"[1] a property called self-similarity • (Wikipedia) • [1] Mandelbrot, B.B. (1982). The Fractal Geometry of Nature. W.H. Freeman and Company.

  7. Keyness • aboutness • importance • a textual category

  8. aboutness • what the text is about • what the message is • what it all means • picture from mindreadersdictionary.com

  9. importance centrality

  10. Context • Claimp by Maya Goldblum • New Designers 07

  11. Impoverished context • Dandelion Light by Sunghwa Jang • New Designers 07

  12. Identification of KWs: criteria • simple verbatim repetition • no allowance for anaphora, synonymy, antonymy etc. • threshold • one word, or more than one?

  13. Corpus-bound or corpus-driven? • Machine-identified keyness is ideal for corpus-driven research • The researcher lets the PC suggest areas needing further chasing up • See recent work by McEnery, Baker, etc. and Nelia Scott 1998

  14. Research Questions • How are the KWs of Bleak House distributed? • Are the KWs of different kinds (nouns/verbs … character/place/style words) distributed differently? • Do the KWs of the chapters reflect the pattern of the whole text but on a smaller scale?

  15. Bleak House • published 1852-3 • (20 monthly instalments) • 350,000 words • Preface + 66 Chapters

  16. reference corpus • 9 million words • 52 novels, • 29 other 19th Century authors • 23 Dickens

  17. Procedures • download Bleak House (Gutenberg Project) • separate each chapter as a separate file • create a wordlist of the reference corpus • create a wordlist of the whole of Bleak House • create a batch of wordlists, one of each chapter of Bleak House ref. corpus BH

  18. KW Procedures • Compute KW list of the whole novel • Compute batch of KW lists, one for each chapter

  19. Overall Results • Over 300 positive KWs for the whole novel • About 70 negative KWs including God (half as frequent as in 19th C literature overall)

  20. Excel • spreadsheet constructed at the same time as the batch of KW files fewer characters in first chapters pronouns are sprinkled http:\\www.lexically.net\downloads\corpus_linguistics\Bleak_House.xls

  21. Chapter by Chapter • Average of 23 KWs per chapter – same settings, same reference corpus (19th C Lit.) • Per chapter: minimum 5, maximum 38.

  22. Chapter by chapter variation

  23. Global KWs

  24. Local KWs

  25. middling burstiness • verbs appears begins puts observes replies continues says considers etc.

  26. Preliminary findings • All chapters have KWs • Individual chapters differ considerably in their KWs • because KWs are not all global • Character KWs enter the novel gradually • Pronouns and verbs present in many sections but absent in many too • not much to do with aboutness • middling level of burstiness • KWs of different kinds are distributed differently

  27. Preliminary conclusion • KWs of the chapters do not simply reflect the pattern of the whole text but on a smaller scale • Keyness is not fractal

  28. References • Baker, P., Gabrielatos C., Khosravinik, M., Krzyzanowski, M., McEnery, T. & Wodak, R., 2008. A useful methodological synergy? Combining critical discourse analysis and corpus linguistics to examine discourses of refugees and asylum seekers in the UK press. Discourse & Society 19(3), 273-305. • McEnery, Tony, 2009. "Keywords and Moral Panics: Mary Whitehouse and Media Censorship". in Dawn Archer (ed.) What's in a Word-list? Investigating word frequency and keyword extraction. Farnham: Ashgate, 93-124. • Scott, M. Nelia, 1998, Normalisation and Readers' Expectations: A Study of Literary Translation with Reference to Lispector's A Hora da Estrela. Liverpool: Unpublished PhD thesis, University of Liverpool.

More Related