1 / 29

The Computational Foundations of Language Arts

The Computational Foundations of Language Arts. Art Graesser , Zhiqiang Cai & Nia Dowell Psychology, Computer Science, & the Institute for Intelligent Systems. Overview. Analysis of text and discourse with Coh -Metrics

tacey
Download Presentation

The Computational Foundations of Language Arts

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Computational Foundations of Language Arts Art Graesser, ZhiqiangCai & Nia Dowell Psychology, Computer Science, & the Institute for Intelligent Systems

  2. Overview • Analysis of text and discourse with Coh-Metrics • Five primary measures of text complexity • Complexity profiles for different literary texts

  3. Text Difficulty Measures • Common measures • Flesch-Kincaid (Klare, 1975) • Degrees of Reading Power (Koslin, Zeno, Koslin, 1987) • Lexile scores (Stenner, 2006, MetaMetrix) • Typical factors • Word Familiarity • Word frequency in the language • Number of letters • Number of syllables • Sentence Length • Validated primarily with Cloze comprehension tests • Predictive of surface level comprehension

  4. Multilevel framework of discourse comprehension • Words • Syntax • Textbase • Explicit ideas (propositions) • Referential cohesion • Situation model • causal, intentional, temporal, spatial logical relationships • Connectives • Genre and rhetorical structure • Pragmatic communication • Graesser & McNamara (2010) • Graesser, Millis & Zwaan (1997) • Kintsch • Perfetti

  5. Measures from Coh-Metrix Text Graesser, McNamara, Louwerse, & Cai (2004)

  6. Preprocessing Syntax Analysis Lexical Analysis (Filters) (Tagger, Parser) (Lemmatizier, Stemmer) WordNet Word Difficulty Referential Cohesion Database Info LSA Wd Lists Sentence Complexity Causal Cohesion CELEX MRC Temporal Cohesion Syntax Features Lexical Features Spatial Cohesion Text Complexity Components Coh-Metrix Database

  7. Example Coh-Metrix Measures • Co-reference Cohesion • Noun and argument overlap • Stem overlap • (lemmas: run, runs, runner) • Latent semantic analysis (LSA) • Lexical diversity (type-token ratio) • Pronouns Word Measures • Number of syllables • Part of speech (noun, verb…) • Word frequency • Concreteness, imagery • Multiple meanings • Syntax • Structural complexity • Modifiers per noun-phrase • Words before main verb of main clause • Syntactic similarity between sentences • Situation Model Cohesion • Connectives & discourse markers • Causal and intentional verbs • Causal and intentional cohesion • Repetition in tense and aspect • Logical operators • and, or, therefore, if, then, not

  8. Analysis of TASA Texts • TASA - Touchstone Applied Science Associates • 37,651 texts • Represents texts a student would experience throughout K12. • Texts had mean of 288.6 words (SD = 25.4) • Most of the texts in the language arts, science, and social studies • Degrees of Reading Power (DRP) scores • Conducted a Principal Components Analysis

  9. Coh-Metrix Ease of Processing • Narrativity. Narrative text tells a story, with characters, events, places, and things that are familiar to the reader. • Referential cohesion. High cohesion text contains words and ideas that overlap across sentences and the entire text, forming threads that connect the textbase together for the reader. • Situation model cohesion. Causal, intentional, and temporal connectives help the reader to form a more coherent and deeper understanding of the text. • Syntactic simplicity. Sentences with few words and simple, familiar structures are easier to process and understand. • Word concreteness. Concrete words evoke mental images and are more meaningful to the reader than abstract words.

  10. Narrativity (Genre) NARRATIVE (e.g. stories, language arts) Familiar words Early age of acquisition Verbs Adverbs Pronouns Negations Intentional actions INFORMATIONAL (e.g., science, expository) Nouns Adjectives Longer noun-phrases Passives

  11. Z-scores on Five Components as a Function of DRP Grade Levels

  12. Z-scores for Language Arts versus Science and Social Studies

  13. Z-scores on Five Components as a Function of DRP Grade Levels for Language Arts

  14. Drama

  15. Midsummer Night’s Dream (grade 6-8)William Shakespeare Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  16. Romeo and Juliet (grade 9-10)William Shakespeare Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  17. Macbeth (grade 11-12)William Shakespeare Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  18. Death of a Salesman (grade 11-12) Arthur Miller Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  19. Glass Menagerie (grade 9-10)Tennessee Williams Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  20. Novels

  21. Adventures of Tom Sawyer (grade 6-8)Mark Twain Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  22. Little Women (grade 6-8)Louisa May Alcott Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  23. Grapes of Wrath (grade 9-10)John Steinbeck Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  24. Farenheit 451 (grade 9-10)Ray Bradbury Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  25. Jane Eyre(grade 9-10)Charlotte Brontë Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  26. As I Lay Dying (grade 11-12)William Faulkner Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  27. The Great Gatsby (grade 11-12)F. Scott Fitzgerald Percentile on Text Complexity NARRATIVITY REFERENTIAL COHESION SITUATION MODEL COHESION SYNTAX WORD ABSTRACTNESS EASY DIFFICULT high low high low high low simple complex concrete abstract

  28. Next Steps • Analyze K12 texts in different states • Gates Foundation and Core Standards • Structural equation modeling on 5 complexity components • Integrating LIWC with Coh-Metrix

  29. ?

More Related