1. Overview

Target languageaudio English wordscorr. totargetaudio WORD SEGMENTATION THROUGH CROSS-LINGUAL WORD-TO-PHONEME ALIGNMENT Felix Stahlberg, Tim Schlippe, Stephan Vogel, Tanja Schultz Phoneme recognition Cross-lingual alignment What‘s this used for Phoneme sequence Alignment Word segmentation parakeseusaesto Phoneme sequenceincludingwordboundaries 1. Overview 2. Cross-Lingual Alignment Word assignment / Pronunciationdictionary • Long-term Goals • Bootstrap speech technology for non-written and under-resourced languages • Given • Collect training data for ASR and MT systems rapidly and at low cost • Goal ofthisPaper • Segment phoneme sequences into word units using the written translations • Simulate phoneme recognition errors realistically • Compare our cross-lingual word segmentation method to monolingual ones, e.g. Adaptor Grammars (Johnson, 2008) IBM Model 3 Model 3P Pronunciation dictionary / Parallel corpus with “word labels” • Problem: Generative storydoes not fit word-to-phoneme alignment • Extendsgenerative storyof IBM Model 3 with additional steps • Uses GIZA++ alignmentstoinitialize Model 3P parameters • Thenour PISA alignment tool1)applies EM algorithm • Audio data • Their written translations in another language (e.g. English) • Pronunciation dictionary • Parallel corpus, language model Generative Story The PISA alignmenttoolisavailableathttp://pisa.googlecode.com/underthe GPL v3 Alignment 3. Experiments andResults Compare: Adaptor Grammars (Monolingual) GIZA++ word-to-phoneme alignments Model 3P • Cross-Lingual: • GIZA++, Model 3P Results Sentence: English • Experimental Setup • English-Spanish BTEC corpus (123k sentence pairs) • Phoneme recognition errors up to 25.3% were simulated using the confusion matrix of a Spanish phoneme recognizer trained on the Spanish portion of GlobalPhone (Schultz, 2002) Phoneme sequence: Spanish Audio: SLT 2012 – The Fourth IEEE Workshop on Spoken Language Technology

1. Overview

1. Overview

Presentation Transcript

Section 1: Overview

Block 1 Overview

Exam 1 Overview

Assignment #1 Overview

QSim 1 : Overview

1. Overview

1 . 1. Module Overview

Exam 1 overview

Section 1: Overview

1. Overview

Chapter 1 - Overview

1. Overview.

1. Legal Overview

1. Indonesia Overview

1. Overview

Overview(1)

Assignment 1 Overview

Lecture 1 Overview

Lecture 1: Overview

Unit 1 Overview