20 likes | 183 Views
Lexica and Corpora for Speech-to-Speech Translation Components IST-2001-32216 www.lc-star.com. Consortium Members. Project implementation Track I (duration 2 years) has as objectives: - specification and creation of large word lists
E N D
Lexica and Corpora for Speech-to-Speech Translation Components IST-2001-32216 www.lc-star.com Consortium Members
Project implementation • Track I (duration 2 years) has as objectives: - specification and creation of large word lists and lexica suited for flexible vocabulary speech recognition and high quality speech synthesis covering a wide range of domains • Track II (duration 3 years) has as objectives: • - investigation of speech centered translation • technologies focusing on requirements • concerning language resources (LR) • - specification and creation of the LR needed for • speech centered translation • - building a demonstrator for speech-to-speech • translation • - demonstration of language transfer • Outcome • lexica for recognition and synthesis for 13 languages • and text corpora for 3 languages will be specified • and created • quasi industrial standards for language resources • will be established • speech centered translation approaches are • investigated concerning their needs on language • resources • LR based transfer of speech-to-speech translation • is shown by a demonstrator translating within • 3 languages • the results will be disseminated via ELRA. • Objectives • improve human-to-human and man-machine • communication in multilingual environments • specify and create lexica and corpora needed for • transferring Speech-to-Speech Translation • (SST) components and speech centered translation • into selected languages • Focus • SST components targeted to be integrated into • speech driven interfaces embedded in mobile • appliances and network servers • creation of large lexica with phonetic, prosodic • and morpho-syntactic content, and bilingual aligned • text corpora for 3 languages Lexica for 13 languages: Turkish, Russian, Italian, Greek, Spanish, Catalan, German, Classical Arabic, US-English, Hebrew, Finnish, Mandarin, Slovenian Corpora for 3 languages: Catalan, US-English, Spanish The LC-STAR project is open for new participants. In case of interest, please contact Mrs. Ute Ziegenhain (e-mail: ute.ziegenhain@mchp.siemens.de)