1 / 47

Applications of corpus analysis in EAP: research, learning, and teaching

Applications of corpus analysis in EAP: research, learning, and teaching. Martin Hewings The University of Birmingham & Cambridge University Press m.j.hewings@bham.ac.uk. Outline of talk. Corpus analysis in EAP research

dtann
Download Presentation

Applications of corpus analysis in EAP: research, learning, and teaching

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Applications of corpus analysis in EAP: research, learning, and teaching Martin Hewings The University of Birmingham & Cambridge University Press m.j.hewings@bham.ac.uk

  2. Outline of talk • Corpus analysis in EAP research • Students learning from corpora: Data-driven learning and an alternative • Teachers learning from corpora: Classroom applications

  3. Features of 31 JEAP ‘corpus’ papers • Paper types • 26 corpus analyses • Corpus content • 19 writing; 5 speech; two both; journal articles predominant; focus on single soc sci disciplines • Corpus types • mainly expert/ published • Focus of analysis • mainly particular lexical/ grammatical features

  4. Corpus research for EAP Typically… • written corpora • expert/ published corpora • particular (soc sci) disciplines • lexical/ grammatical focus

  5. Corpus research for EAP Typically… • written corpora • expert/ published corpora • particular (soc sci) disciplines • lexical/ grammatical focus The future? • more speech & CMC • more learner corpora • more science disciplines (& more comparison) • greater number of areas of investigation

  6. Corpora as resources for learners: data-driven learning (DDL)

  7. Corpora as resources for learners: data-driven learning (DDL)

  8. Corpora as resources for learners: data-driven learning (DDL) DDL: • exposes students to ‘target’ language forms • provides authentic examples • provides information beyond dictionary or grammar • encourages inductive learning • encourages learner autonomy

  9. Lee & Swales (2006): DDL “…an exceptional group of students – highly acculturated into the genres of their discourse communities, mostly on the way to their PhDs, eager to perfect their English, possessing of advanced computer skills, and perfectly comfortable with quantitative data.”

  10. DDL: some reservations • lack of evidence to link DDL to language improvement • are the outcomes worth the time, effort and money? • it doesn’t suit all students

  11. Selecting corpus data for students (as an alternative to DDL) • An example: MBA students’ use of ‘I’ • ‘Research article (RA) corpus’: 120,000 words • ‘MBA corpus’: essays, 22,000 words

  12. MBA corpuson TV or from magazine, I am in the opinion that service more consumption of fuels. I am almost certain that there world imports composition. I believe services commodities s composition. In the future I believe there will be a new osing a million dollars. So I believe services commodities wector. As a result of this, I can predict that there will mports appeared. After 1987, I do not think that there was a mports about one third. But I don't think it will grow so er. As a result, therefore, I expect that the countries more than other commodities. I expect service industry will ervices" and is intangible. I feel that the intangible veloped in the next future. I personally see the above idea world. But, before I go on, I should make a point. After ore detail analysis, I think I should take deeper consideratio a long term point of view, I suppose the composition of

  13. MBA corpus on TV or from magazine, I am in the opinion that service more consumption of fuels. I am almost certain that there world imports composition. I believe services commodities s composition. In the future I believe there will be a new osing a million dollars. So I believe services commodities wector. As a result of this, I can predict that there will mports appeared. After 1987, I do not think that there was a mports about one third. But I don't think it will grow so er. As a result, therefore, I expect that the countries more than other commodities. I expect service industry will ervices" and is intangible. I feel that the intangible veloped in the next future. I personally see the above idea world. But, before I go on, I should make a point. After ore detail analysis, I think I should take deeper consideratio a long term point of view, I suppose the composition of

  14. Journals: published writing RA corpusuring the estimation period. I also computed Patell's (1976, p. y. The question: When should I buy? has one logical answer:(SVR) metric (in all cases), I choose to present only the resulthe wall' statements such as 'I don't care how you do it, just doon environment. In addition, I examine several subhypotheses basSize Test. In this section, I first test the hypothesis of diffy perennial question: should I invest now or wait for the as a long way from reality: 'I just did not want to be part of aep asking themselves,'How do I know? What evidence is there?' Thy? By information technology I mean the hardware and software, cI were doing this what would I need?' Another useful heuristic red per ASR No. 190. That is, I test the hypothesis that inflatioe key questions such as: 'If I were doing this what would I needh domestically and globally. I will, therefore, focus more on th

  15. Journals: published writing RA corpusuring the estimation period. I also computed Patell's (1976, p. y. The question: When should I buy? has one logical answer:(SVR) metric (in all cases), I choose to present only the resulthe wall' statements such as 'I don't care how you do it, just doon environment. In addition, I examine several subhypotheses basSize Test. In this section, I first test the hypothesis of diffy perennial question: should I invest now or wait for the as a long way from reality: 'I just did not want to be part of aep asking themselves,'How do I know? What evidence is there?' Thy? By information technology I mean the hardware and software, cI were doing this what would I need?' Another useful heuristic red per ASR No. 190. That is, I test the hypothesis that inflatioe key questions such as: 'If I were doing this what would I needh domestically and globally. I will, therefore, focus more on th

  16. Journals: published writing RA corpusuring the estimation period. I also computed Patell's (1976, p. y. The question: When should I buy? has one logical answer:(SVR) metric (in all cases), I choose to present only the resulthe wall' statements such as 'I don't care how you do it, just doon environment. In addition, I examine several subhypotheses basSize Test. In this section, I first test the hypothesis of diffy perennial question: should I invest now or wait for theas a long way from reality: 'I just did not want to be part of aep asking themselves,'How do I know? What evidence is there?' Thy? By information technology I mean the hardware and software, cI were doing this what would I need?' Another useful heuristic red per ASR No. 190. That is, I test the hypothesis that inflatioe key questions such as: 'If I were doing this what would I needh domestically and globally. I will, therefore, focus more on th

  17. Teachers learning from corpora: checking intuitions

  18. Teachers learning from corpora: checking intuitions • it is [adjective] to-infinitive • it is [adjective] that Cambridge Corpus of Academic English (CCAE). About 400 million words of published academic written texts.

  19. Teachers learning from corpora: checking intuitions • it is [adjective] to-infinitive 48,170 • it is [adjective] that 24,115

  20. it is [adjective] to-infinitivecrucial difficult helpful important necessary possible safe straightforward> 4000 times < 500 times possible 7784 important 5019 difficult 4345 necessary 4103 straightforward 481 crucial 282 helpful 255 safe 194

  21. it is [adjective] thatclear interesting likely notable possible significant surprising true> 1000 times < 300 times clear 5284 possible 4116 likely 2561 true 1170 significant 257 surprising 251 interesting 235 notable 206

  22. it is true that • Itistruethathaving a theoretical foundation for what one is doing in the classroom is important,but it is at least equally important to transform that knowledge into activities that are simple,appealing to the students, and successful. • While itistruethatnational expenditure estimates are often larger than those of national income, this is not always the case.

  23. Teachers learning from corpora: checking intuitions What adverbs come before… ……. similar but not …….different? ……. different but not …..similar? ……. similar or ……… different?

  24. closely essentially radically rather reasonably roughly strikingly totally vastly

  25. closelyessentiallyradically rather reasonably roughly strikingly totally vastly

  26. closelyessentiallyradically rather reasonably roughly strikingly totally vastly

  27. ‘highly different’ and ‘almost similar’ Student writing errors (?) Their aims were also highly different. …the number of full-time and part-time jobs was almost similar.

  28. Their aims were also highly different. • Cambridge Academic English Corpus (400+ mill words of writing) 17 examples • Google Scholar26,300 examples • e.g. The advocates of Semantic Syntax and of Principles Parameters emphasize that their conceptualizations of grammatical theory are highly different.

  29. …the number of full-time and part-time jobs was almost similar. • Cambridge Academic English Corpus (400+ mill words of writing) 29 examples • Google Scholar 102, 000 examples • e.g. Comparing the charts of Figures 12 and 13 with those of Figures 7 and 6, respectively, we conclude that they are almost similar.

  30. From corpus research to teaching materials

  31. From corpus research to teaching materials: ‘on the surface’

  32. From corpus research to teaching materials: ‘below the surface’

  33. Corpus analysis in EAP: the future • In research… • In (data-driven) learning… • In teaching…

  34. Applications of corpus analysis in EAP: research, learning, and teaching Martin Hewings The University of Birmingham & Cambridge University Press m.j.hewings@bham.ac.uk

  35. Cambridge Academic English

  36. Open-access academic corpora include… • Corpus of Contemporary American English (COCA) (Academic) 120 million words • British Academic Written English (BAWE) 6.5 million words of good-standard student writing • Michigan Corpus of Academic Spoken English (MICASE) 1.8 million words • British Academic Spoken English (BASE) 1.6 million words

  37. Teachers learning from corpora: discovering new information

  38. Some nouns have a related adjective ending: • -ic base – basic (not basical) • -ical astrology – astrological (not astrologic) • -ic or –ical analysis – analytic/ analytical

  39. -ic or –ical ? ecological and geographical (rather than ecological and geographic) taxonomic and geographic (rather than taxonomic and geographical)

  40. technologic or technogical? • TV and other viewing media are very important in this most technologic era. …in this most technologictechnological era. • In the early IVF literature, patient experiences of infertility are outlined as negative, requiring a technologic solution. • Vast majority from ‘medicine and health’

More Related