Hammam Riza Agency for the Assessment and Application of Technology (BPPT). Indonesia Language Technology Research Community -ILTRC. AFNLP Board Meeting 2009 Indonesia Country Report. Activities in Language Technologies (2008-2009).
Activities in Language Technologies (2008-2009)
• PAN L10n project: UI and BPPT develop a POS tagger and machine translation using a 1M-word corpus (Penn Treebank and an Indonesian news corpus)
• Indonesian-English statistical MT with the Pharaoh/Moses decoder (using a 10-million-word parallel corpus) for the Bi-directional Multi-Domain Translation System (BPPT)
• ALR Workshop 2009
• A-STAR Consortium: Speech to Speech Translation System for 9 Asian languages (in cooperation with NICT, ETRI, CASIA, CDAC, IOIT, I2R and BPPT)
• Promotion of Local Computing Capacity for Local Languages (endangered languages): Ministry of Communication and Information, National Language Center
• Dictionary system: Indonesian-German and Indonesian-French, launched by the Institute of Technology Bandung
• Information Retrieval and Extraction at the University of Indonesia
Distribution of the 13 largest local languages
Number of local languages: 742 (Ethnologue)
[Chart: distribution by language; surviving label: 713 other languages ("bhs lainnya"), 45.4]
Activities in Text to Speech (TTS)
• TTS Indonesia: A-STAR Project, HMM-based (NICT-ATR), toward an Asian Networked Speech to Speech Translation System
• Intensive research and development on an "automatic prosody pattern extractor" using an Artificial Neural Network (ITB)
• Text to Speech system for the Indonesian language (ITB/UI)
• TTS for 5 local languages (Javanese, Sundanese, Balinese, Minang and Makassar) started by the National Language Center
Spoken Language Communication Group, Kyoto, Japan
Activities in Speech Recognition
• BPPT collaboration with Telkom on Speech Recognition and Summarization resulted in Indonesia Linux Voice Command (ILVC), further developed into LiSan and Perisalah (Transcription System)
• Speech corpora and speech recognition system for Bahasa Indonesia at the University of Indonesia (M. Adriani et al.) and Institut Teknologi Bandung (A. Arman et al.)
LiSan (Linux Voice Command), developed by O. Riandi and the BPPT team
• Voice command as a man-machine interface for accessibility is governed by Law No. 4/1997 on Handicapped Persons and Government Regulation (PP) No. 43 on the use of technology to improve the social welfare of handicapped persons.
• Use of Bahasa Indonesia as the "language of conduct"
Indonesia Transcription System (Transcriptor), June 2009
• System used in Cabinet meetings, endorsed by the State Secretary and the Office of the President