200 likes | 212 Views
This presentation provides an introduction to the Multilingual Modeling (MM) research conducted at the Institute for Infocomm Research (I2R). It highlights the projects and applications that aim to develop robust and scalable multilingual modeling technologies to facilitate accurate information acquisition in different languages.
E N D
Multilingual Modeling (MM) Presentation to SMT Workshop 17 October 2006 ZHANG Min / AW Ai Ti Institute for Infocomm Research
Outline • Introduction to I2R • MM: Multilingual Modeling Research • Projects
Transforming Knowledge Into Prosperity Institute for Infocomm Research (I2R) - an introduction
A National Research Institute(funded by Singapore’s Ministry of Trade and Industry) BMRC SERC Science & Engineering Bio-medical Our Mission:To advance Infocomm Technology to benefit humanity and create prosperity for the country Size: 350 Scientists & Engineers
I²R Organization Structure(Research Divisions) • Communications Division • focuses on wireless and lightwave technologies to deliver solutions for mobile, broadband and multimedia communications. • Computing Division • applies computer science to enhance the quality of life through technologies which enable safer working environments, better healthcare, smarter homes and higher productivity tools. • Media Division • deals with the essential value chain of media, starting from the creation of the media leading to its processing and analysis, followed by the production, distribution and presentation of the final media product.
Media Division Core competencies: Language processing, machine translation, information extraction and retrieval Efficient coding, compression, transmission, authentication for audio, image & video Pattern recognition, computer vision, video understanding; speech processing • Data fusion & optimisation • Brain-machine interface; acoustic noise cancellation; wearable and pervasive computing Applications: Homeland security MPEG-x, JVT, H.26y, JPEG • Surveillance Biometrics • Proactive Healthcare Multimedia Communications & Transmission System Collaborators Advanced Recognition Technologies (Israel),Biometrics Engineering Research Center (Korea), Fraunhofer (Germany), IBM, Infineon Technologies AG (Germany), Institute of Computing Technology (China), New Jersey Institute of Technology (USA), NTT DoCoMo Inc. (Japan), Santa Clara University (USA), Zentek Technology, University of Tuebingen (Germany), Brain Science Institute – RIKEN (Japan), Columbia University (USA), PCS Security, Singapore Police Force, Singapore Sports Council, Tan Tock Seng Hospital Imagination 2Reality
Multilingual Modeling Research Objective: to develop robust and scalable multilingual modeling technologies to facilitate people acquiring accurate information in different languages within the shortest time. Problem: • reliance on manually-constructed resources • not satisfactory machine translation output Funding: • Singapore Gov, MNC, Industry
Multilingual Technology • Alignment: • Corpus Alignment and Bilingual/Multilingual Term Extraction from non-parallel corpus • Machine Translation (ML/Statistical/Linguistic): • Asian Language Machine Translation • Rule-based system with deep syntax and semantic information • Augmented with language specific statistical module • Multilingual Information Access • SMS Normalization and Translation • Phrase-based statistical MT • Machine Transliteration • Chinese, Japanese, Korean English • Named-Entity Translation
Machine Translation Development 1989: Initiate R&D in EnglishChinese MT 1990: Awarded S$2m IBM E-C MT project 1992: Develop inhouse EnglishX (Asian) MT 1993: Set up MT Service Unit 1997: AsiaRain Automated Translations (I2R Spin-off using MT technology) 2000: Commercialize MT technology (I2R start-up delivering MT technology) 2004: Enhance and Construct Lexical Resources, Machine Learning techniques in MT engine 2006: Multilingual Modeling
SMS Translation: Interactive TV • TIMM – Television Interactive with Multilingual Messaging • Create a multi-lingual communication platform • Remove communication barrier, enhance interactivity between TV viewers • Provide a new channel for advertisers and marketers to reach out to TV viewers through personalization
Conclusion Multilingual Modeling is Cool, Hot and enjoyable!
We offer high salary with comprehensive benefits. Flexible and excellent working environment. World class R&D. Substantial Funding. Growing-up opportunity. Study as a Ph.D/Internship or Work in I2R? Why not join and collaborate with us? Interesting? E-mail us at: {mzhang,aaiti}@i2r.a-star.edu.sg