1 / 22

A Speech Interface to Virtual Environment

A Speech Interface to Virtual Environment. Authors Scott McGlashan and Tomas Axling Swedish Institute of Computer Science. Presentation Agenda. Introduction The TALKING AGENT system DIVE SR/TTS Agent Modeling Framework Interaction Metaphor Reference Resolution Future Work Conclusion.

Download Presentation

A Speech Interface to Virtual Environment

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Speech Interface to Virtual Environment Authors Scott McGlashan and Tomas Axling Swedish Institute of Computer Science

  2. Presentation Agenda • Introduction • The TALKING AGENT system • DIVE • SR/TTS • Agent Modeling Framework • Interaction Metaphor • Reference Resolution • Future Work • Conclusion

  3. Purposes of this paper • Analyze the technical and design issues to combine a virtual world with a speech interface. • Describe system architecture of the TALKING AGENT system.

  4. Problems of Integration • Speech Recognition : Limited vocabulary to gain accuracy. • Language Understanding : Limited knowledge to maximize the understanding. • Interaction Metaphor : Who does the user talk to? (Above questions are discussed in detail in the authors’ last paper “Speech Interface to Virtual Reality”.)

  5. Innovation of this System • Combining intelligent agent and speech interface to carry out specialized functions in the VR World. • Functions have been implemented : • Transporting objects • Fetching objects • Painting objects • Increasing the size of objects

  6. System Architecture

  7. DIVE-Virtual Reality System • DIVE(Distribute Interactive Virtual Environment) is a multi-user virtual environment. • DIVE allow users and environment interact in real-time. • DIVE contains a database composed of hierarchically organized objects .

  8. DIME- DIVE Meeting Environment

  9. Speech Recognition • SR with limited pre-defined phrases promises good recognition performance. • Using grammar to set constraint to search space. • Using commercial SR-engine (Nuance).

  10. Agent Modeling Framework • High-level languages do not support complex symbolic computations. • Oz is well suited for this purpose. • Using ODI as interface between Oz and DIVE. • The parent agent consists basic functions. • We can define more specific agent by extend parent agent.

  11. Agent Modeling Framework

  12. Interaction Metaphor • Direct manipulation -Personal Presence. • Various metaphors for spoken interaction have been proposed. • Proxy • Divinity • Telekinesis • Interface Agent • This system adopt the Proxy metaphor.

  13. The DIVERSE System-Interface Agent

  14. Addressing Agent • Inside the user’s eye-sight • Dialogue initiated by clicking on the agent. • Outside the user’s eye-sight • Phone agent-First press the phone agent then connect to remote agent

  15. Feedback • Given speech input ,system should give the visual feedback to the user. • If the agent listening or not? • What is the feedback when talking to agent far away?

  16. Reference Resolution • Given some descriptions , the reference resolution engine maps them to object which user is referring to. • Considerations • Object focus. • Property Perception. • Discourse Modeling.

  17. Robust Interaction • When errors don’t matter • User can view the results and current them by direct manipulation. • Safety-critical applications • Confirm user command. • Clarifying incomplete or ambiguous commands.

  18. Future Work • Agent behavior should related to its previous action . • Add mental components. • Talking to agent by aura-driven . • Evaluate this system with realistic scenario. • Ex: virtual travel agency.

  19. Conclusions • Add a speech interface to VR-system. • Using constraint SR to achieve high accuracy. • Developing an appropriate metaphor. • The agents modeled in this system provide specific functions in the virtual world.

  20. Q & A

  21. Paper Source McGlashan, S Speech Interfaces to Virtual Reality in Proceedings of the Second Conference on the Military Applications of Synthetic Environments and Virtual Reality, Stockholm, Sweden, 1995.

More Related