HIWIRE MEETING Trento, January 11-12, 2007

HIWIRE MEETING Trento, January 11-12, 2007
José C. Segura, Javier Ramírez

Schedule
• PEQ
• HAFE
• IS07 setup
• New improvements in robust VAD
  • Revised multiple observation LRT (MO-LRT)
  • Improve noise reduction and frame-dropping

PEQ
• Evaluation
  • AURORA2, AURORA3, AURORA4
  • Compared to HEQ
  • PEQ shows better performance on all databases
• Results using Loquendo recognizer
  • Improved results
  • Slight degradation on clean conditions

PEQ / HEQ comparative results

HAFE
• In collaboration with TUC-NTUA
• Released two C modules, integrated in HAFE V1.0
  • Basic Analysis
    • VAD (LTSD)
    • Wiener filter (optional)
    • Output: WAV / MFCC / FB
  • Post-Processing
    • PEQ (optional)
    • Regression computation (optional)
    • Frame-Dropping (optional)
    • CMS / CMVN (optional)

IS07 setup
• Prepared an HTK setup for evaluation on the HIWIRE database
  • Training scripts based on LORIA ones
  • Test scripts include MLLR adaptation with variable number of utterances
• Baseline results
  • Only for clean data
  • With and without adaptation

IS07 setup (without adaptation)

IS07 (with adaptation)

A review of MO-LRT VAD
• Multiple observation likelihood ratio test:
  • Given 2N+1 independent observations of the noisy speech
• Hypothesis test:
  • G0: All the observations in the buffer are non-speech
  • G1: All the observations in the buffer are noisy speech
• Gaussian model for the observations (the slide's equations did not survive extraction)
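The test reviewed above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes the complex Gaussian model of Sohn et al., in which each frequency bin contributes gamma*xi/(1+xi) - ln(1+xi) to the frame log-likelihood ratio, with gamma the a posteriori SNR and xi the a priori SNR. The function names, the way noise and speech variances are passed in, and the truncation of the window at the signal edges are all assumptions made for the example.

```python
import math


def frame_log_lr(power_spectrum, noise_var, speech_var):
    """Log-likelihood ratio ln p(x | speech+noise) / p(x | noise) for one frame.

    Assumes independent complex Gaussian bins (Sohn et al. style model).
    All three arguments are per-bin sequences of positive numbers.
    """
    total = 0.0
    for power, lam_n, lam_s in zip(power_spectrum, noise_var, speech_var):
        gamma = power / lam_n  # a posteriori SNR
        xi = lam_s / lam_n     # a priori SNR
        total += gamma * xi / (1.0 + xi) - math.log1p(xi)
    return total


def mo_lrt(frame_llrs, center, n):
    """Multiple-observation LRT statistic for the frame at index `center`.

    With independent observations, the joint log-likelihood ratio of
    G1 (all noisy speech) against G0 (all non-speech) is the sum of the
    per-frame log-likelihood ratios over the 2N+1 frame buffer.
    The buffer is truncated at the signal edges (an assumption).
    """
    lo = max(0, center - n)
    hi = min(len(frame_llrs), center + n + 1)
    return sum(frame_llrs[lo:hi])


def mo_lrt_vad(frame_llrs, n, threshold):
    """Speech/non-speech decision per frame: True where the statistic exceeds the threshold."""
    return [mo_lrt(frame_llrs, t, n) > threshold for t in range(len(frame_llrs))]
```

Because the statistic sums evidence over neighbouring frames, a single high-scoring frame also raises the decision for the frames next to it, which is the source of the hangover-like behaviour of the detector.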

Hangover analysis

Revised MO-LRT
• Given 2N+1 independent observations of the noisy speech
• All the possible hypotheses on the individual observations:
  • hk = 0 : xk = n
  • hk = 1 : xk = s + n
• Hypothesis subsets

Revised MO-LRT
• We assume that just a single speech-to-non-speech or non-speech-to-speech transition can occur in the hypothesis vector h
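The single-transition assumption shrinks the hypothesis space from 2^(2N+1) label vectors to 2(2N+1), which makes an exhaustive search cheap. The sketch below illustrates one plausible reading of the revised test and is not taken from the slides: it assumes that the hypothesis subsets are defined by the label of the central frame, and that the statistic is the difference between the best log-likelihood in each subset (a generalized likelihood ratio). Both choices, and all function names, are assumptions. The input is a list of per-frame log-likelihood ratios; the log-likelihood of the all-non-speech vector is common to every hypothesis and cancels.

```python
def single_transition_hypotheses(m):
    """All 0/1 label vectors of length m containing at most one transition."""
    hyps = set()
    for t in range(m + 1):
        hyps.add(tuple([0] * t + [1] * (m - t)))  # non-speech then speech
        hyps.add(tuple([1] * t + [0] * (m - t)))  # speech then non-speech
    return sorted(hyps)


def revised_mo_lrt(window_llrs):
    """Decision statistic for the central frame of a 2N+1 frame window.

    window_llrs[k] is ln p(x_k | h_k = 1) / p(x_k | h_k = 0).
    The log-likelihood of a label vector h, relative to the all-zero
    vector, is the sum of window_llrs[k] over the frames with h_k = 1.
    Returns (best score with central label 1) - (best score with central label 0).
    """
    m = len(window_llrs)
    if m % 2 == 0:
        raise ValueError("window must contain an odd number of frames (2N+1)")
    center = m // 2
    best = {0: float("-inf"), 1: float("-inf")}
    for h in single_transition_hypotheses(m):
        score = sum(llr for llr, label in zip(window_llrs, h) if label == 1)
        best[h[center]] = max(best[h[center]], score)
    return best[1] - best[0]
```

A positive value favours speech at the central frame. Unlike the plain sum over the window, frames on the far side of a detected transition do not contribute to the winning hypothesis.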

Compared to Sohn et al. VAD.

ROC curves in quiet noise conditions (stopped car and engine running) and close talking microphone.

ROC curves in high noise conditions (high speed over a good road) and distant talking microphone.
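ROC curves such as those described in the captions above are obtained by sweeping the decision threshold and recording the detection rates at each setting. The sketch below uses the generic convention of speech hit rate against false alarm rate; which quantities the original plots used is not recoverable from the transcript, and the function name and input layout are assumptions.

```python
def roc_points(scores, labels, thresholds):
    """Return (false_alarm_rate, speech_hit_rate) pairs in percent, one per threshold.

    scores: per-frame VAD statistic (higher means more speech-like).
    labels: per-frame reference, 1 for speech and 0 for non-speech.
    """
    n_speech = sum(labels)
    n_non_speech = len(labels) - n_speech
    if n_speech == 0 or n_non_speech == 0:
        raise ValueError("reference must contain both speech and non-speech frames")
    points = []
    for th in thresholds:
        decisions = [s > th for s in scores]
        hits = sum(1 for d, y in zip(decisions, labels) if d and y == 1)
        false_alarms = sum(1 for d, y in zip(decisions, labels) if d and y == 0)
        points.append((100.0 * false_alarms / n_non_speech,
                       100.0 * hits / n_speech))
    return points
```

Comparing two detectors then amounts to running both over the same labelled frames and plotting the resulting point sets on the same axes.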

Presented at ICASSP 2007:
• Javier Ramírez, José C. Segura, Juan M. Górriz, “Revised contextual LRT for voice activity detection”, ICASSP 2007.
Under review:
• Javier Ramírez, José C. Segura, Juan M. Górriz and Luz García, “Improved Voice Activity Detection Using Contextual Multiple Hypothesis Testing for Robust Speech Recognition”, IEEE Transactions on Audio, Speech and Language Processing.
