1 / 38

10 Innovative Speech Applications

10 Innovative Speech Applications. Spring SpeechTEK San Francisco February 23, 2007. Deborah A. Dahl, Ph.D. Conversational Technologies. Innovative Applications of Speech. Most people think of call center self service or dictation when they think of speech applications

idalee
Download Presentation

10 Innovative Speech Applications

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 10 Innovative Speech Applications Spring SpeechTEK San Francisco February 23, 2007 Deborah A. Dahl, Ph.D. Conversational Technologies

  2. Innovative Applications of Speech • Most people think of call center self service or dictation when they think of speech applications • There are many more ways to apply speech technology! • The applications we’ll talk about use • recognition • synthesis • other analyses of the speech signal • These applications range from research to commercial projects

  3. 1. MossTalk Words: The Problem • Over 1 million Americans have difficulty with language because of an injury to the parts of the of the brain that control language (aphasia) • Aphasia leads to social isolation and inability to work • Insurance only pays for a limited amount of speech therapy

  4. MossTalk Words: How Speech Helps • The user sees a picture and tries to say the word that corresponds to the picture • Cues are available to help the user remember the word • As soon as the user says the right word, the speech recognizer says “that’s right” • Advantages of the computer • Low cost • Available 24/7 • Consistent • User’s performance is automatically recorded

  5. MossTalk Words: Example

  6. MossTalk Words: More Information • http://www.mosstalk.com

  7. 2. English X-Change: The Problem • Millions of people in China want to learn English • There are fewer than 100,000 English teachers in China, concentrated in urban areas • But there are 225,000,000 English learners!

  8. English X-Change: How Speech Helps • The EX software program is simulation-based, interactive computer program for learning English • Many of the lessons require students to speak English • The EX program uses speech recognition to evaluate students’ pronunciation • The degree to which a student’s pronunciation of a word or phrase approaches correct native spoken English pronunciation can be adjusted • In one study, students who studied using the EX software program produced substantially (and significantly) higher test scores than did those who experienced traditional classroom instruction with trained native English speakers

  9. English X-Change: Example

  10. English X-Change: More Information • www.englishxchange.com

  11. 3. Compliance for Life: The Problem • People often don’t take their medications as directed • “The misuse or nonuse of prescribed medications is estimated to add nearly $200 billion a year to the cost of medical care.” (Reported by Jane Brody, Just What the Doctor Ordered? Not Exactly, The New York Times, May 9, 2006) • Non-compliance rates can be very high • for example hypertension non-compliance is estimated at 40% (avg.) • “simply forgot” is the most common reason

  12. Compliance for Life: How Speech Helps • Provides a phone- and web-based automated notification system to create, edit and cancel medication reminders • The phone interface allows users to manage reminders when the web isn’t available • Example of creating a reminder

  13. Compliance for Life: More Information • www.iReminder.com

  14. 4. Animated Speech: The Problem • Millions of preschool and elementary school children have language and speech disabilities • There is a shortage of skilled teachers and professionals to give them the one on one attention that they need

  15. Animated Speech: How Speech Helps • Applies animated agents • to produce accurate visible speech • facilitate face-to-face oral communication • Teaches vocabulary to children with language challenges • Instruction is always available to the child, 24 hours a day, 365 days a year • System has extreme patience, doesn’t become angry, tired, or bored

  16. Animated Speech: Example

  17. Animated Speech: More Information • http://www.animatedspeech.com/

  18. 5. Diagnosing Depression: The Problem • Depression is traditionally diagnosed by self-report • Some depressed individuals are reluctant to admit that their medication is ineffective

  19. Diagnosing Depression: How Speech Helps • Depression is correlated with specific vocal characteristics • These can be observed in speech recorded over the phone

  20. Diagnosing Depression: More Information • http://www.healthtechsys.com/ivr/ivrmain.html

  21. 6. Guided Speech: The Problem • Automatic speech recognition is widely used for telephone self-service, but it’s not always accurate • Human speech recognition is accurate, but humans are expensive and get bored with handling routine calls

  22. Guided Speech: How Speech Helps • Human guide in the background assists self service application to ensure completion • Use speech recognition with an operator backup • Agents are able to handle 4 calls- silently and simultaneously

  23. Guided Speech IVR Call

  24. Guided Speech: More Information • http://www.spoken.com

  25. 7. Model Talker: The Problem • People who have limited ability to speak can use TTS to speak their typed utterances • Concatenative TTS sounds much better than formant-based TTS • The number of TTS voices is limited, and there may not be an existing voice that a user likes

  26. Model Talker: How Speech Helps • Model Talker lets users record their own voice and generate a TTS voice from their own recordings • Example of a field-generated voice (generated by a user who downloaded Model Talker from the internet and recorded their voice on their own computer) and a example of this user’s actual voice

  27. Model Talker: More Information • http://www.modeltalker.com/

  28. 8. ASL Speech Recognition: The Problem • American Sign Language is the fourth most-used language in the United States • Currently, human ASL translators are frequently necessary to facilitate communication between deaf and hearing presenters and their audiences • Good ASL translators are in high demand and are not always available

  29. ASL Speech Recognition: How Speech Helps • Combine speech recognition and understanding with automatic ASL generation to translate from speech to sign language • Example of generated ASL:

  30. ASL Speech Recognition: More Information • http://asl.cs.depaul.edu/

  31. 9. VoiceBox in Car Navigation: The Problem • Current in-car navigations systems require multiple button presses to set destinations • In one test of Neverlost • the average time to set a destination for 25 first time users of was 4 minutes and 31 seconds • 315 button pushes • 5 testers dropped out and said they could not do it

  32. VoiceBox in-Car Navigation: How Speech Helps • For setting destinations, speech is much faster and less confusing • For the same task of setting a destination, the VoiceBox time average was 18 seconds for new users

  33. VoiceBox in-Car Navigation: Example

  34. VoiceBox: More Information • www.voicebox.com

  35. 10. Rex the Talking Pill Box: The Problem • Some patients can’t read or understand the instructions on their prescription bottles • illiteracy • low vision • cognitive limitations

  36. Rex Talking Pill Bottle: How Speech Helps • Pharmacist programs the bottle with the medication instructions • Or, user can record their own message • Users presses a button to hear instructions • Example:

  37. Rex Talking Pill Bottle: More Information • http://www.rxtalks.com/

  38. Summary • Speech technologies can be applied in many innovative ways • make up for expensive or unavailable human expertise (MossTalk, Timo Stories, English X-Change, Guided Speech) • help users who need assistance understanding or producing spoken language (ASL translator, Model Talk) remembering (Compliance for Life) or reading (Rex, Timo Stories) • systematically analyze voice quality in ways that most people can’t do (depression diagnosis)

More Related