CMC Data in Learning and Teaching (LETEC) Corpora. Thierry Chanier, Université Blaise Pascal, with Christophe Reffay, École Normale Supérieure de Cachan. International workshop: Building Corpora of Computer-Mediated Communication: Issues, Challenges, and Perspectives. Feb 14–15, 2013, TU Dortmund University
Contents • 1) LETEC and CMC • 2) Questions about CMC in learning situations and multimodality • 3) CMC and French linguistic networks
1 Rationale for LEarning & TEaching Corpora (LETEC) • A simple collection of learners’ online interaction data does not constitute a scientific object, as Kern and Warschauer (2004) emphasised in the language learning field: • “Researchers must carefully document the relationships among media choice, language usage, and communicative purpose, but they must also attend to the increasingly blurry line separating linguistic interaction and extra linguistic variables. […] Studies of linguistic interaction will likely need to account for a host of independent variables: the instructor's role as mediator, facilitator, or teacher …”
LETEC Components (diagram labels): CMC data • Educational scenario • Context • Public licence • Instantiation • Analysis • Private licence • Research protocol. LETEC = a structured entity containing all the elements resulting from an online learning situation, whose context is described by an educational scenario and a research protocol
LETEC Structure: XML (Mulce-struct), IMS-CP format
XML Mulce-struct: Instantiation subpart • Blog_act • Audio_act • …
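To make the idea of typed acts in the Instantiation subpart more concrete, here is a minimal Python sketch. Only the tag names Blog_act and Audio_act come from the slide. The `<instantiation>` wrapper, the `author` and `begin` attributes and the sample content are hypothetical and do not reproduce the actual Mulce-struct schema.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical fragment: only the tag names Blog_act and Audio_act appear on
# the slide; the <instantiation> wrapper and all attributes are assumptions.
SAMPLE = """
<instantiation>
  <Blog_act author="learner1" begin="2013-02-14T10:00:00">First post</Blog_act>
  <Audio_act author="tutor1" begin="2013-02-14T10:05:00">Hello everyone</Audio_act>
  <Audio_act author="learner1" begin="2013-02-14T10:05:30">Hi</Audio_act>
</instantiation>
"""


def count_acts_by_type(xml_text: str) -> Counter:
    """Count the direct children of the root element by tag name."""
    root = ET.fromstring(xml_text)
    return Counter(child.tag for child in root)


if __name__ == "__main__":
    for act_type, n in count_acts_by_type(SAMPLE).items():
        print(act_type, n)
```

Counting acts per type in this way is the kind of operation behind coverage figures such as the number of verbal acts per modality, although the real repository may compute them differently.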
Mulce online • Repository.Mulce.org: databank • Mulce.org: documentation • Metadata in OLAC and in CLARIN
Open access, ethics and licence • For usage: licence • For participants: informed consent form + anonymization process • Open Data: http://opendefinition.org/guide/
Mulce Repository coverage (learning situations) • 3D: text & audio chats, non-verbal • Text chat • Mail, forum, text chat • Text & audio chats, non-verbal • Blog, audio chat, non-verbal • Blog, audio chat, non-verbal • Blog, text & audio chats, non-verbal • Text chat & whiteboard
Mulce Repository coverage (May 2012) • 45 corpora • 36 000 verbal acts (audio, text chat, blog, email, forum) • 1 million tokens • 10 000 non-verbal acts
2 CMC / learning situations / Multimodality • Specific macro- or micro-structure for chat in learning situations? • Are multimodal environments part of CMC?
Text chat in learning situations vs Internet discussions
Internet discussions (Beißwenger et al., 2012): interaction words, short messages.
Discussions in learning situations (Sauro, 2009):
Monika9 Malmo: I'm thinking about what I know about the Swedish culture. It's in many ways not very different from the American culture I think.
Christie Penn: Monika9, be sure to use the zero artice.
Monika9 Malmo: What is the zero artice?
Christie Penn: It means no article.
Monika9 Malmo: Hmm, I asked my teacher what it means, but she said I had to ask you again. Can you explain what it means to me?
Christie Penn: That's okay. Don't worry about it for now. Can you write the next sentence?
Monika9 Malmo: Okay, I'll continue.
Monika9 Malmo: I think some Swedish literature is known in other countries.
Monika9 Malmo: Do you know anything about Swedish literature?
Christie Penn: Good. I don't know much I'm afraid. What can you tell me about it?
Monika9 Malmo: Ah, I think I know what you mean with zero article now. You meant I shouldn't write "The American culture", it should be only "American culture", right?
Text chat in learning situations • Longer turns • Opening and closing transactions • Few interaction words • Important macro and discourse structures (Corpus Favi: mulce.org:mce-favi-letec-all)
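The contrast drawn above (longer turns, fewer interaction words) can be checked on a chat log with two simple measures. The following Python sketch is only an illustration: the function names, the tokenizer and above all the tiny `INTERACTION_WORDS` set are assumptions made for the example, not the annotation categories used in the cited studies.

```python
import re

# Hypothetical, deliberately tiny list of "interaction words"; a real study
# would rely on an annotated category inventory rather than this set.
INTERACTION_WORDS = {"lol", "hmm", "ah", "okay", "ok", "hi", "bye"}


def tokenize(message: str) -> list[str]:
    """Lower-case the message and keep runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", message.lower())


def turn_stats(messages: list[str]) -> tuple[float, float]:
    """Return (mean tokens per message, share of interaction-word tokens)."""
    tokens_per_message = [tokenize(m) for m in messages]
    total = sum(len(tokens) for tokens in tokens_per_message)
    if not messages or total == 0:
        return 0.0, 0.0
    hits = sum(
        1
        for tokens in tokens_per_message
        for token in tokens
        if token in INTERACTION_WORDS
    )
    return total / len(messages), hits / total


if __name__ == "__main__":
    mean_len, share = turn_stats(["Okay, I'll continue.", "What is the zero article?"])
    print(mean_len, share)
```

Running the same two measures on a learning-situation corpus and on an Internet discussion corpus would give a first, rough quantitative view of the differences listed on the slide.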
Multimodality in Second Life • Modes: non-verbal, verbal • Not detailed here, see Wigham & Chanier, ReCALL 25(1) • Diagram labels: audio, text chat, radio transmission, private, public, proxemic transmission
Multimodality and CMC? "The element <posting> is the basic CMC-specific element in our schema. In CMC documents it represents the largest structural unit that can be assigned to one author and one point in time. The category posting is defined as a content unit that has been sent to the server 'en bloc'." TEI and CMC (Beißwenger et al., 2012). (LETEC corpus Archi21: archi21-slrefl-av-j2)
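The definition quoted above (one unit, one author, one point in time) maps naturally onto a small record type. The Python sketch below is a hedged illustration: the `<posting>` element name comes from the quotation, but the `<div>` wrapper, the `who` and `when` attribute names and the sample content are assumptions, not a reproduction of the cited schema.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass(frozen=True)
class Posting:
    """One content unit sent 'en bloc': one author, one point in time."""
    author: str
    timestamp: str
    text: str


# Hypothetical markup: attribute names and wrapper element are assumptions.
SAMPLE = """
<div>
  <posting who="A01" when="2013-02-14T10:00:05">What is the zero article?</posting>
  <posting who="A02" when="2013-02-14T10:00:20">It means no article.</posting>
</div>
"""


def read_postings(xml_text: str) -> list[Posting]:
    """Collect every <posting> element as a Posting record, in document order."""
    root = ET.fromstring(xml_text)
    return [
        Posting(p.get("who", ""), p.get("when", ""), (p.text or "").strip())
        for p in root.iter("posting")
    ]


if __name__ == "__main__":
    for posting in read_postings(SAMPLE):
        print(posting.author, posting.timestamp, posting.text)
```

A record with a single timestamp fits text chat well. An audio turn or a non-verbal act in a multimodal environment has a duration rather than a single sending time, which is one way to read the question mark in the slide title.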
3 Corpora Networks in France (EU, FR) • Consortiums: • IR Corpus-écrits • IR Corpus-oraux et multimodaux • IR Archives des ethnologues • IR CAHIER (philosophy & literature) • …
Forthcoming national project on language • Corpus de Référence du Français contemporain • Subpart on CMC in French • German counterparts shown on the slide: Digitales Wörterbuch der deutschen Sprache; Empirikom: Empirische Erforschung internetbasierter Kommunikation; Deutsches Referenzkorpus zur internetbasierten Kommunikation (DeRiK)
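SIG on CMC in French (nouv-com) • Sample SMS: "Salut s que <NOM_4> c dcd à ht 1 dvd pr sa cop ki e pa la 2main?" [roughly: "Hi, has <NOM_4> decided to buy a DVD for his girlfriend, who is not here tomorrow?"] • Genres: SMS, tweets, blogs, forums, chat, etc. • Sites: Grenoble, Montpellier, Paris, Clermont-Ferrand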
References • Contact: Thierry.chanier@univ-bpclermont.fr • Web site: http://lrlweb.univ-bpclermont.fr/spip.php?rubrique98 • All publications are in open archives: http://www.base-search.net > search Author: Chanier; all documents • Reffay, C., Betbeder, M.-L., Chanier, T. (2012). "Multimodal Learning and Teaching Corpora Exchange: Lessons learned in 5 years by the Mulce project". Special Issue on dataTEL: Datasets and Data Supported Learning in Technology-Enhanced Learning, International Journal of Technology Enhanced Learning (IJTEL), 4(1/2), pp. 11-30. DOI: 10.1504/IJTEL.2012.048310; http://edutice.archives-ouvertes.fr/edutice-00718392