80 likes | 226 Views
Speech Projects at ITC irst Roberto Gretter ITC irst AT&T Shannon Labs, Florham Park, NJ, 6/27/2000. ITCirst Trento, Italy. About 100 researchers in 5 divisions division SSI: vision, speech, statistics, software system analysis speech: 16 researchers in 3 internal projects:
E N D
Speech Projects at ITC irst Roberto Gretter ITC irst AT&T Shannon Labs, Florham Park, NJ, 6/27/2000
ITCirstTrento, Italy • About 100 researchers in 5 divisions • division SSI: vision, speech, statistics, software system analysis • speech: 16 researchers in 3 internal projects: • MUNST - multilinguality, broadcast news, CORETEX • SHINE - recognition in car, acoustic localization • DITELO - telephone, dialogue
Accessing tourism information • Local tourism agency provides info database about accomodation, structures and services, localities, events, sports, leisure time, art, nature, transportation, … (see http://www.provincia.tn.it/apt go) • Mixed initiative dialogue, explicit confirmation, no barge-in, over 2000 words vocabulary, information presented using a Mixed Representation approach (template-based + deep generation) • First prototype for data collection • Bootstrap system handles accommodation and place names • Bootstrap grammars
<localita`> trento rovereto riva del garda cles <vorrei> vorrei dovrei voglio mi piacerebbe <stelle> ( a | con) ? (( 1|2|3|4|5 ) o ? )+ stelle <tipo> albergo campeggio residence affittacamere main_grammar Tourism information • Bootstrap grammars: <vorrei> andare in un <tipo>/TYPE <stelle>/STARS <vorrei> soggiornare vicino a <localita`> /CITY un <tipo>/TYPEa <localita`> /CITY <stelle>/STARS mi manda i dati <output>/CHAN al numero <tel>/FAX <vorrei> il <info>/INFO del <tipo>/TYPE <n>/NAME • Only 2 main grammars defined • Data collected without Wizard of Oz
Tourism information: acquisitions • Users provided with some task to accomplish. • Users: domain expert / non expert • Acquisitions using the first prototype (May 2000): 2 hours 37 min speech - about 220 dialogues • Preliminary evaluation on 34 dialogues
automatic routing agent automatic answering agent user human router human operator test-it test-de HMM-it 93.9 - HMM-de - 89.4 HMM-mix 94.3 89.5 Aurora (proposal) • Similar to HMIHY task: • call routing (How can I help you?) • natural language processing, information extraction, document analysis and retrieval, database access • German (DFKI), Italian (Offnet, IRST, CELI), Dutch (CTIT, Uni Twente), Spanish (DFE, Uni Barcelona) • Multilinguality in SpeeData • data-entry (Land-register)
Kataweb • One of the most important Italian WEB portals go • Requirement: Fast Porting of new speech and NL technologies to WEB portals (Dec 2000) • Access information via written queries: • Who invented the electrical light? • Which is the capital of Alaska? • Information extraction from news • The Bank of Japan decided … the president said ... • Spoken dialogue by phone • call center solution • trading on line by phone, about 500 stocks • Future developments are possible • extend domains • integrate speech and natural language • computer vision
Technology transfer to solution developers • What they are asking us, today • to include the recognizer in menu-based call centers • to build applications requiring dialogue • What they are going to ask us, tomorrow: • multilinguality • speaker verification • What we provide • Spinet server (recognizer, grammars, lexicon, ...) • API (c++, Java) • assistance in building prototypes/systems • assistance in verifying/improving systems