10 likes | 117 Views
Season 1, 1 st Video (40 pairs). Modeling Situated Language Learning in Early Childhood via Hypernetworks. Thomas is a tank engine who lives on the island of Sodor. He has six small wheels, a short funnel. a short boiler and a short dome.
E N D
Season 1, 1stVideo (40 pairs) Modeling Situated Language Learning in Early Childhood via Hypernetworks Thomas is a tank engine who lives on the island of Sodor. He has six small wheels, a short funnel a short boiler and a short dome. Zhang, Byoung-Tak1,2, Lee, Eun Seok2, Heo, Min-Oh1, and Kang, Myounggu1 1 School of Computer Science & Engineering, Seoul National University 2 Interdisciplinary Program in Cognitive Science, Seoul National University {btzhang, eslee, mgkang, moheo}@bi.snu.ac.kr Season 1 10 videos (398 pairs) Season 1, 2ndVideo (40 pairs) The driver won't pick you, said Gordon. He wants strong engines. Edward was in the shed with the engines. They were all bigger than him and boasted about it. - The typical proportion (%) and SD of semantically well-structured sentences from 100 random sentences generation task with developmental learning and improvised learning. (By human scaling measurement) Background Simulation Experiment Results Human language is grounded, i.e. embodied and situated in the environment (Barsalou, 2008; Zwaan & Kaschak, 2008) Grounded language models rely on multimodal sensory data (Knoeferle & Crocker, 2006; Spivey, 2007; Yu et al. 2008) Language grounding requires a flexible modeling tool that can deal with high-dimensional data What kinds of grounded, linguistic “concept map” are cognitive-computationally plausible and child-learning friendly? How does the “multimodal” concept map of a young child evolve as being incrementally exposed to situated language environments? What kinds of cognitive learning algorithms can naturally simulate the situated language learning process? Can we use the situated concept-map building process for endowing AI agents with language learning capability? · · · · · · · · · · Mental images generated from the learned multimodal concept map (Data 2) · Dataset Sequence 1 2 3 4 5 6 7 8 9 10 Generated mental images per each word · Fat Research Questions · People - Excerpted sentences from earlier learning stages to later ones with evolving concept association · Went · Station Generated mental images by a sentence · Fat people went station Example sentences generated from the learned concept map(Data 1) · Method Evolution of the multimodal concept map (MCM) for Data 2 Key Ideas in This Work · 1. Define linguistic words and visual words 2. Learn incrementally a hypernetwork on the videos 3. Plot the multimodal concept map for a given query · • Hypernetwork structure (Zhang, 2008) as a multimodal concept map • Population coding as an incremental learning-friendly representation • Cartoon videos as a surrogate for situated child language corpora MCM for ‘Station’ after learning the 1st video MCM for ‘Station’ after learning the 2nd video Data Sets Data 1: Educational animation video scripts for children (text only) 33,910 sentences, 6,124 word types, 208,116 word tokens; 10 learning stages according to the recommended language levels of difficulty Data 2: Video of Thomas & Friends, Season 1, Numbers 1 - 10 398 pairs of the dialogue sentence and its corresponding image · · MCM for ‘Station’ after learning the videos 1-4 MCM for ‘Station’ after learning the videos 1-5 References · · · · · · · Barsalou, L. W. (2008). Grounded cognition, Ann. Rev. Psych., 59, 617-645. Griffiths, T., Steyvers, M., & Tenenbaum, J., (2007) Topics in semantic representation, Psych. Rev., 114(2),211–244. Knoeferle, P., & Crocker, M. W. (2006). The coordinated interplay of scene, utterance, and world knowledge: Evidence from eye-tracking, Cog. Sci., 30, 481-529. Spivey, J. M. (2007). The Continuity of Mind, Oxford Univ. Press. Yu, C, Smith, L. B., & Pereira, A. F. (2008). Grounding word learning in multimodal sensorimotor interaction, CogSci-2008, pp. 1017-1022. Zhang, B.-T. (2008). Hypernetworks: a molecular evolutionary architecture for cognitive learning and memory. IEEE Comp. Intell. Mag., 3(3), 49-63. Zwann, R. A. & Kaschak, M. P. (2008). Language in the brain, body, and world. Chap. 19, Cambridge Handbook of Situated Cognition. Biointelligence Lab, Seoul National University, Seoul 151-744, Korea (http://bi.snu.ac.kr)