190 likes | 698 Views
Jorge Gurlekian, Laura Colantoni, Humberto Torres Laboratorio de Investigaciones Sensoriales (LIS), CONICET - Universidad de Buenos Aires, Argentina. Antonio Rinc
E N D
1. Database for an Automatic Speech Recognition System for Argentine Spanish Prosodic Database for an Argentine Spanish Text to Speech System
3. 1) Linguistic definitions • Corpus • Prompt sheet • Dialectal areas2) Technical setup • Recording platform • Recording software3) Speakers • Age groups • Recruitment
5. Jorge Gurlekian
LIS, CONICET, Buenos Aires, ArgentinaHernán Rodríguez
Universidad Nacional de La Plata, Argentina Laura Colantoni
LIS, CONICET, Buenos Aires, Argentina
Humberto Torres
LIS, CONICET, Buenos Aires, Argentina
7. 1) Corpus • Training Texts • Selection of the syllables • 741 final sentences 2) Recording and labelling • Recording software • Software for labelling • ToBI for Argentine Spanish3) Population • Automatic loading of text files • Functions included into the database
10. Word accents: American English ToBI H* L* L*+H L+H* H+H* Argentine Spanish ToBI nH* nL* nL*+Hm Lm+nH* Hm+nH* plus: nH*+Lm Hm+nL* nH*+Hm Phrase accents: H- L-Boundary tones: American English ToBI Initial: %H Final: L% or H% Argentine Spanish ToBI Initial: n%L or n%H Final: nL% or nH%
14. • Spelling• Isolated words• Numerals• Directory assistance information• Sentences containing all the SAMPA symbols• Sentences with words used in software interaction• Sentences containing temporal expressions
15. Words (interaction with software) Sequence of 10 isolated digits Sheet number (6 digits) Telephone number (9-11 digits)Credit card number (14-16 digits)PIN code (6 digits) Spontaneous date, e.g. birthdayPrompted date Relative and general date expressionSentence (interaction with software) Isolated digits Spelling of surname Spelling of directory city name Spelling of real/artificial words
16.
Recording Platform:
Pentium PC computer with an AVM-ISDN-A1 board,
and an ISDN basic access (BRI) interface
Recording software:
Automatic recording system for telephone calls
17. Age groups:
• 16-30
• 31-45
• 46-60
Recruitment:
Universities, professors, students, friends and relatives