Spoken Language Interaction in Telecommunication at ENST/CNRS-LTCI
Gérard CHOLLET, Richard CROCE, Dijana PETROVSKA-DELACRETAZ, Marc SIGELLE, Pascal VAILLANT, François YVON
(chollet,croce,petrovsk,sigelle,vaillant)@tsi.enst.fr, yvon@infres.enst.fr
ENST/CNRS-LTCI, 46 rue Barrault, 75634 PARIS cedex 13
http://www.tsi.enst.fr/~chollet
Outline
• What is ENST/CNRS-LTCI?
• Research and application topics:
  • The SIROCCO project
  • The EUREKA !2340 MAJORDOME project
  • VoIP, VoiceXML, Human-Computer Interaction
• Perspectives
Our affiliations
• ENST: Ecole Nationale Supérieure des Télécommunications, http://www.enst.fr
• CNRS: Centre National de la Recherche Scientifique, http://www.cnrs.fr
• LTCI: Laboratoire de Traitement et Communication de l’Information
What is ENST? Ecole Nationale Supérieure des Télécommunications
• ranked among the ‘Grandes Ecoles d'Ingénieurs’
• graduates 250 state-certified engineers each year
• part of the ‘Groupement des Ecoles de Télécommunications’
GET: Groupement des Ecoles de Télécommunication
• ENST
• ENST-Bretagne in Brest
• Institut National des Télécommunications in Évry
• Eurecom in Sophia-Antipolis
• ENIC (Ecole Nouvelle d’Ingénieurs en Télécoms) in Lille
• Institut des Applications Avancées de l’Internet in Marseille
Academic departments within ENST
• COMELEC: Communications, Electronics, VLSI, …
• INFRES: Computer Science, Networking, NLP, …
• TSI: Signal and Image Processing, Speech, …
• EGSH: Economics, Management, Social Sciences, …
TSI Department: Signal and Image Processing
• "Image Processing and Understanding"
• "Statistical Signal Processing Applied to Communications"
• "Perception, Learning and Modelling"
  • Very low bit rate speech coding
  • Speech recognition, speaker verification
• "Coding"
  • Speech and sound compression
• "Audio, Acoustics and Waves"
  • Acoustic antennas, audio prostheses
SIROCCO project
Unlimited Vocabulary Speech Recognition
INRIA (IRISA and LORIA), LIA, IRIT, ENST-LTCI
http://www.irisa.fr/sirocco/
SIROCCO
• Unlimited vocabulary speech recognition system
• French lexicon (MathLex) with 64k words (AUF task)
• Feature extraction with Spro (G. Gravier)
• Context-dependent HMM phone models
• Word pronunciation graph
• Uses the CMU toolkit for language modeling
• Beam search for word hypotheses
• Rescoring of word hypotheses with A*
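
The two-pass idea in the last two bullets (a pruned first search, then rescoring of the surviving word hypotheses) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the toy lattice, the scores, the bigram table and the function names are invented and are not taken from SIROCCO. The second pass is a plain re-sort of an n-best list with a language-model score, which is simpler than the A* rescoring the slide mentions.

```python
# Toy word lattice: node -> list of (next_node, word, acoustic_log_score).
# Nodes are numbered in time order; all values are invented for illustration.
LATTICE = {
    0: [(1, "je", -1.0), (1, "j'ai", -1.4)],
    1: [(2, "veux", -0.7), (2, "vais", -0.9)],
    2: [(3, "lire", -0.5), (3, "dire", -1.5)],
    3: [],
}

# Hypothetical bigram log-probabilities used only in the second pass.
BIGRAM_LOG_PROB = {("je", "vais"): -0.1, ("je", "veux"): -0.6, ("vais", "lire"): -0.2}
UNSEEN_LOG_PROB = -1.0


def beam_search(lattice, start, final, beam_width):
    """First pass: return up to beam_width complete hypotheses, best first.

    Each hypothesis is a (log_score, words) pair.
    """
    active = {start: [(0.0, [])]}
    for node in sorted(lattice):  # node numbers follow time order
        # Pruning step: keep only the best beam_width hypotheses ending here.
        hyps = sorted(active.get(node, []), key=lambda h: h[0], reverse=True)
        hyps = hyps[:beam_width]
        active[node] = hyps
        for next_node, word, log_score in lattice[node]:
            for score, words in hyps:
                active.setdefault(next_node, []).append(
                    (score + log_score, words + [word])
                )
    return active.get(final, [])


def rescore(hypotheses, bigrams, unseen):
    """Second pass: add a bigram language-model score and re-sort."""
    rescored = []
    for score, words in hypotheses:
        lm = sum(bigrams.get(pair, unseen) for pair in zip(words, words[1:]))
        rescored.append((score + lm, words))
    return sorted(rescored, key=lambda h: h[0], reverse=True)


first_pass = beam_search(LATTICE, start=0, final=3, beam_width=2)
second_pass = rescore(first_pass, BIGRAM_LOG_PROB, UNSEEN_LOG_PROB)
print("first pass best: ", " ".join(first_pass[0][1]))
print("second pass best:", " ".join(second_pass[0][1]))
```

With these invented scores the first pass prefers "je veux lire", and the language-model pass promotes "je vais lire". The point of the split is that the cheap, pruned search limits the number of hypotheses the more expensive knowledge source has to examine.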
Holistique EDF
«MAJORDOME» Unified Messaging System
Eureka Project no. 2340
D. Bahu-Leyser, G. Chollet, R. Croce, K. Hallouli, J. Kharroubi, D. Kofman, L. Likforman, E. Matta-Sanchez, D. Petrovska, M. Sigelle, P. Vaillant, F. Yvon
Majordome’s Functionalities
[Diagram: E-mail, voice and fax messages feed into MAJORDOME]
MAJORDOME:
• Speaker verification
• Dialogue
• Routing
• Updating the agenda
• Automatic summary
Overview of Majordome
• Background tasks (server-side only):
  • sorting and filtering messages from different sources (E-mail, voice, fax, SMS, …);
  • extracting relevant information to report to the user (names of senders, subject, …).
• Dialogue with the user, over the phone or the Web:
  • the system presents the state of the mailbox, the type of each message, its sender and subject, and can summarize or read messages on request;
  • users access their mailbox, address book, time schedule, or URIs (Web addresses).
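
The background tasks above (sorting, filtering, and extracting sender and subject for a report) can be pictured with a small Python sketch. This is only an illustration under stated assumptions: the `Message` fields, the function names and the blocked-sender filter are hypothetical and do not describe Majordome's actual data model.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Message:
    """Hypothetical unified message record; field names are assumptions."""
    source: str   # e.g. "email", "voice", "fax", "sms"
    sender: str
    subject: str
    body: str = ""


def filter_messages(messages, blocked_senders):
    """Filtering: drop messages whose sender is on a block list."""
    return [m for m in messages if m.sender not in blocked_senders]


def sort_by_source(messages):
    """Sorting: group messages by the channel they arrived on."""
    boxes = defaultdict(list)
    for m in messages:
        boxes[m.source].append(m)
    return dict(boxes)


def mailbox_report(messages):
    """Extract sender and subject of each message for presentation to the user."""
    boxes = sort_by_source(messages)
    lines = [f"{len(messages)} message(s)"]
    for source in sorted(boxes):
        for m in boxes[source]:
            lines.append(f"{source}: from {m.sender}, subject: {m.subject}")
    return lines


inbox = [
    Message("voice", "Alice", "(voice message)"),
    Message("email", "Bob", "Meeting"),
    Message("email", "spam@example.org", "Win a prize"),
]
for line in mailbox_report(filter_messages(inbox, {"spam@example.org"})):
    print(line)
```

The resulting lines are the kind of text a dialogue front end could read out over the phone or display on the Web.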
Voice technology in Majordome
• Server-side background tasks: continuous speech recognition applied to voice messages upon reception
  • Detection of sender name and subject
• User interaction:
  • Speaker identification
  • Speech recognition (receiving users’ commands through voice interaction)
  • Text-to-speech synthesis (reading text summaries, E-mails or faxes)
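
The slides do not say how the speaker's identity is checked. As a hedged illustration, the Python sketch below uses a common textbook formulation of speaker verification: a log-likelihood ratio between a model of the claimed speaker and a background model, compared with a threshold. The single diagonal Gaussian per model, the feature values and the function names are all assumptions, not the Majordome implementation.

```python
import math


def gaussian_log_likelihood(frames, mean, var):
    """Average per-frame log-likelihood under one diagonal Gaussian."""
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, mean, var):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(frames)


def verify(frames, speaker_model, background_model, threshold=0.0):
    """Accept the identity claim if the log-likelihood ratio exceeds threshold.

    Each model is a (mean, variance) pair of equal-length lists.
    Returns (accepted, log_likelihood_ratio).
    """
    llr = (gaussian_log_likelihood(frames, *speaker_model)
           - gaussian_log_likelihood(frames, *background_model))
    return llr > threshold, llr


# Invented two-dimensional "feature vectors" for illustration only.
speaker_model = ([1.0, 1.0], [1.0, 1.0])
background_model = ([0.0, 0.0], [1.0, 1.0])

genuine_frames = [[1.0, 1.1], [0.9, 1.0]]
impostor_frames = [[0.0, -0.1], [0.1, 0.0]]

print(verify(genuine_frames, speaker_model, background_model))
print(verify(impostor_frames, speaker_model, background_model))
```

The threshold trades false acceptances against false rejections; a real system would tune it on held-out data and use far richer models than a single Gaussian.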
ENST-Paris Voice over IP Platform
[Network diagram. Labeled elements: Cisco Catalyst 6507; Unisphere ERX-700; 1 Gbps links, including an internal optical fibre link (FO Interne); room (Salle) C-234; VTHD; Intranet; Renater; Distance Learning Service; GK (gatekeeper); GW (gateway); IPVR; video server; PBX room with RTC/RNIS (PSTN/ISDN) access; videoconference; networks 192.168.111.0/11, 192.168.222.0/11 and 192.168.223.0/11.]
Majordome / NetCentrex project: is the called person here?
[Call-flow diagram. Labels: calling person; NetCentrex # and usual #; PABX/gateway ENST; call control server; application server; NetCentrex user called; usual user called; no response; IP-VR NetCentrex; recorder machine; vocal E-mail.]
Majordome / NetCentrex project: voice interactive call
[Call-flow diagram. Labels: calling person; NetCentrex # and usual #; PABX/gateway ENST; call control server; application server; NetCentrex user called; usual user called; no response; IP-VR NetCentrex; MAJORDOME.]
MAJORDOME:
• Speaker verification
• Dialogue
• Vocal e-mail
• Routing
• Updating the agenda
• Automatic summary
A framework: ALISP
Automatic Language Independent Speech Processing, with applications in speech coding, synthesis, recognition, speaker verification and language identification
Perspectives
• The application context of the Majordome project could be of interest to COST-278.
• The Majordome/NetCentrex platform could be made available to interested partners.
• The HTK, ISIP and SIROCCO software packages are available as freeware. One of them will be used on the NetCentrex platform.