360 likes | 469 Views
error handling – Higgins / Galatea. Dialogs on Dialogs Group July 2005. work by … . Gabriel Skantze ph.d. student KTH, Stockholm.
E N D
error handling – Higgins / Galatea Dialogs on Dialogs Group July 2005
work by … • Gabriel Skantzeph.d. studentKTH, Stockholm “I am doing research on spoken dialogue systems. More specifically, I am interested in studying miscommunication and error handling, but also in the representation and modelling of utterances and dialogue, as well as conducting experiments with users.“ • and co-authors: J. Edlund, D. House, R. Carlson
3 papers • Higgins Higgins – a spoken dialogue system for investigating error handling techniques, Edlund, Skantze, Carlson [2004] • Galatea GALATEA: A Discourse Modeller Supporting Concept-Level Error Handling in Spoken Dialog Systems, Skantze [2005] • Prosody & Clarifications The Effects of Prosodic Features on the Interpretation of Clarification Ellipses, Edlund, House, Skantze [2004]
1st paper • Higgins Higgins – a spoken dialogue system for investigating error handling techniques, Edlund, Skantze, Carlson [2004] • Galatea GALATEA: A Discourse Modeller Supporting Concept-Level Error Handling in Spoken Dialog Systems, Skantze [2005] • Prosody & Clarifications The Effects of Prosodic Features on the Interpretation of Clarification Ellipses, Edlund, House, Skantze [2004]
Higgins • practical goal of Higgins project • build a collaborative dialog system in which error handling ideas can be tested empirically • error handling issues, plus • incremental dialogue processing • on-line prosodic feature extraction • robust interpretation • flexible generation and output
domain • pedestrian city navigation and guiding • user gives system a destination • system guides user by giving verbal instructions • complex • large variety of error types • semantic structures can be quite complex • reference resolution • domain can be extended even further
architecture • follow-up from Adapt • everything is XML • domain objects • utterance semantics • discourse model • database content • system output (before surface) • 3D city model
research issues • early detection and correction • late detection • incrementality • error recovery
early detection and correction • KTH LVCSR – output likely to contain errors • robust interpretation Pickering: • some syntactic analysis is needed • e.g. relations between objects • but handles insertions and non-agreement phrases • humans - good at early detection (woz)
late detection and correction • discourse modeller (GALATEA) • joins several results from Pickering into a discourse model • adds grounding information • can be manipulated later • remove concepts which turn out not to be grounded
incrementality • end-pointers cause trouble • even more so in this domain better:
incrementality [2] • all components support incremental processing • several issues • when to barge in? (semantic content and prosody) • longer-than-utterance units: interpreter or dialog manager? • rapid and unobtrusive feedback: challenge for synthesis
error recovery • signaling non-understandings • decreased experience of task success • slower recovery • ask other task-related question
2nd paper • Higgins Higgins – a spoken dialogue system for investigating error handling techniques, Edlund, Skantze, Carlson [2004] • Galatea GALATEA: A Discourse Modeller Supporting Concept-Level Error Handling in Spoken Dialog Systems, Skantze [2005] • Prosody & Clarifications The Effects of Prosodic Features on the Interpretation of Clarification Ellipses, Edlund, House, Skantze [2004]
GALATEA • a discourse modeller for conversational spoken dialog systems • builds a discourse model (what has been said during the discourse) • resolution of ellipses & anaphora • tracks the grounding status • who said what when (plus confidence information) • can be used for concept-level error handling
should do grounding at concept level • explicit and implicit verification on whole utterance can be tedious and unnatural • 45% of clarifications in BNC are fragmentary / elliptical
should do grounding at concept level • Traum (1994) – utterance level computational model of grounding • Larsson (2002) – issue-level computational model of grounding in Issue-Based DM • Rieser (2004), Schlangen (2004):systems capable of fragmentary clarification requests, but models do not handle user reactions • systems should keep grounding information at the concept level • like RavenClaw?
semantic representation • rooted unordered trees of semantic concepts • nodes: attr-value pairs, objects, relations, properties
semantic representation • enhanced with “meta”-information • confidence • communicative acts • info is new / given
ellipsis resolution • transforms ellipsis into full propositions • rule based • ~10 rules • domain-specific
anaphora resolution • keeps a list of entities (talked about) • assigns ids • when given entities are added to the discourse, look up the antecedent • if found, unification (and move to the top of the entity list) • unification also allows entities to be referred to in new ways • how does this fare and compare?
grounding status • who added the concept? • in which turn? • how confident? • may be used by the action manager • for instance remove all items with high grounding status when referring to an entity
late error detection • discover inconsistencies in discourse model • look at grounding status to see where error may be • concept can be removed
future • methods for automatic tuning of strategy selection • extend to track confidence and grounding status at different levels • evaluate • how people respond to incorrect confirmations, and how can that information be used to update grounding status • error recovery after non-understandings • other domains
3rd paper • Higgins Higgins – a spoken dialogue system for investigating error handling techniques, Edlund, Skantze, Carlson [2004] • Galatea GALATEA: A Discourse Modeller Supporting Concept-Level Error Handling in Spoken Dialog Systems, Skantze [2005] • Prosody & Clarifications The Effects of Prosodic Features on the Interpretation of Clarification Ellipses, Edlund, House, Skantze [2004]
prosody in clarifications • effects of prosodic features on interpretation of elliptical clarifications • U: Further ahead on the right I see a red building… • S: Red (?) • vary prosodic features • study impact on user’s understanding of the system’s intention
motivation • long (whole utterance) confirmations are not good • tedious, unnatural • BNC corpus: 45% of clarifications are elliptical • short confirmations • make dialog more efficient by focusing on the actual problematic fragments • however • interpretation depends on context and prosody
3 readings • U: Further ahead on the right I see a red building… • S: Red (?) • Ok, red [all positive] • Do you really mean red? What do you mean by red? [positive perception, negative understanding] • Did you say red? [positive contact, negative perception]
stimuli • 3 test words [red, blue, yellow] • di-phone voice (MBROLA) • manipulated • peak position [mid, early, late / 100ms] • peak height [130Hz / 160 Hz] • vowel duration [normal, long / +100ms]
subjects + design • 8 speakers: 2f / 6m, 2nn / 6n • introduced to Higgins • listen to all 42 (only once); random order • 3 options • Okay, X • Did you really mean X? • Did you say X?
results • no effects for • color, subject, duration • significant effects for • peak position, peak height, & their interaction
results • Statement: early, low peak • Question: late, high peak • Clear division between “did you mean” and “did you say”
food for thought • how about English? • red • red? • red!? • how many ways can you say it?
conclusion • strong relationship between intonation and meaning • statement: early, low peak • question: late, high peak • clear division between “did you mean” and “did you say”