MAGIC Seen from the Perspective of RAGS

Kathleen R. McKeown
Department of Computer Science
Columbia University

MAGIC: Multimedia Abstract Generation of Intensive Care data
Collaborators: Steven Feiner, Desmond Jordan, Shimei Pan, James Shaw, Michelle Zhou, Kris Concepcion, Liz Chen, Jeanne Fromer

Scenario
Goal: provide post-operative information on bypass patients (CABG)
• Prior to completion of surgery and before transport to Cardiac Intensive Care Unit (ICU)
• Status needed for ICU nurse, cardiologist
• Time critical

Issues for Language Generation
• Conciseness: Coordinated speech and text that is brief but unambiguous
• Coordination with other media: Modify wording and speech to coordinate references with graphical highlighting
• Media specific tailoring:
  • Produce wording appropriate for spoken language
  • Use information from language generation to improve quality of synthesized speech

Status
• Implemented prototype showing coordination between media for limited input
• Text output for large numbers of input cases
• Undergoing evaluation *now* in ICU
• Runs on live data on a daily basis
• 5-10% error rate
• Continuing research on effects of LG information on prosody, partial results

Principles
• Early processes produce media independent representations
• Representations use partial orderings in order to make early commitments where possible and retain flexibility
• Both the speech and graphics content planner may add content and ordering constraints
• Constraints on later decisions may be added early on (e.g., lexical choice)

Data Server and Filter (conceptual)
• Input
  • 18:25 <drug> Drips Norepinephrine
  • 18:27 <drug> Drips Norepinephrine
  • 18:29 <drug> Misc. Magnesium Sulfate
  • 18:29 <surgery> Cardiac Defibrillated by surgeon
  • 18:33:11 100 (BP) 51 (HR)
  • 18:34:01 96 52
• Output
  • C-inanimate entity -> C-drug -> C-operating-room-medication -> C-Drip -> C-Norepinephrine
• Top-level categories
  • C-state, C-event, C-entity (abstract, physical, organization, math)
• Inferences
  • Hypotension: time, duration, drugs given
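The output line above reads as a path through a concept hierarchy, from the most general concept down to the specific drug. A minimal sketch of that idea follows. The `PARENT` table and the helper names `concept_path` and `is_a` are hypothetical, introduced only for illustration; the only content taken from the slide is the chain of concept names itself.

```python
# Hypothetical parent links that mirror the single chain shown on the slide.
# The real MAGIC ontology is not described here, so this table is an assumption.
PARENT = {
    "C-Norepinephrine": "C-Drip",
    "C-Drip": "C-operating-room-medication",
    "C-operating-room-medication": "C-drug",
    "C-drug": "C-inanimate entity",
}


def concept_path(concept):
    """Return the chain from the most general concept down to `concept`."""
    path = [concept]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return list(reversed(path))


def is_a(concept, ancestor):
    """True if `ancestor` appears on the path from the root to `concept`."""
    return ancestor in concept_path(concept)


if __name__ == "__main__":
    print(" -> ".join(concept_path("C-Norepinephrine")))
    print(is_a("C-Norepinephrine", "C-drug"))
```

A lookup like `is_a` is the kind of test a later planner could use to group all drips together, whatever the specific drug.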

General Content Planner - SOAP (Rhetorical, semantic, conceptual)
• Overview
• Demographics
• Name, Age, MRN, Gender, Doctor, Operation
• Medical history
• Lines
• Therapy
• Devices
• Detail View
• Drips (on leaving)
• Induction info
• Devices
• Lab report
• Timeline
• Inferences
• End values
• Conclusions

Speech Content Planner - Satisfying Conciseness
• Speech content planner groups information into sentences
  • Ms. Jones is an 80 year old, hypertensive diabetic female patient of Dr. Smith undergoing CABG.
  • Ms. Jones is an 80 year old, female patient of Dr. Smith undergoing CABG. She has a history of diabetes and hypertension.
• To satisfy the communicative goal of conciseness, it selects adjectives and prepositional phrases when possible.

Input to speech content planner - semantic propositions
• X is-a patient
• X has-property last name = Jones
• X has-property age = 80 years
• X has-property history = hypertension
• X has-property history = diabetes
• X has-property gender = female
• X has-property surgery = CABG
• X has-property doctor = Y
• Y has-property last name = Smith

Forming Sentence Structure (Rhetorical, semantic, lexical, syntactic)
• ((relation is-a) (arg1 ((item ((class name) (last-name “Jones”))))) (arg2 ((item ((class patient))))))
• ((relation is-a) (arg1 ((item ((class name) (last-name “Jones”))))) (arg2 ((item ((class patient)) (premod ((history hypertension)))))))

3 Types of Aggregation
• Hypotactic aggregation: Given a set of propositions, can one be realized as a modifier?
• Semantic aggregation: If a patient is on multiple drips and all devices, the patient has received massive cardiotonic therapy
• Paratactic aggregation: Combine related propositions using conjunction and apposition
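To make hypotactic and paratactic aggregation concrete, here is a minimal sketch that turns the semantic propositions from the earlier slide into either one concise sentence (history realized as premodifying adjectives) or two sentences (history conjoined in a separate clause). This is not the MAGIC implementation. The tuple encoding, the `ADJECTIVE` lexicon, the choice of title and pronoun from gender, and the function names are all assumptions made for illustration.

```python
# Propositions from the "Input to speech content planner" slide, encoded as
# (entity, property, value) tuples. The encoding itself is an assumption.
PROPS = [
    ("X", "is-a", "patient"),
    ("X", "last name", "Jones"),
    ("X", "age", "80 years"),
    ("X", "history", "hypertension"),
    ("X", "history", "diabetes"),
    ("X", "gender", "female"),
    ("X", "surgery", "CABG"),
    ("X", "doctor", "Y"),
    ("Y", "last name", "Smith"),
]

# Assumed mini-lexicon: which history nouns have an adjectival form.
ADJECTIVE = {"hypertension": "hypertensive", "diabetes": "diabetic"}


def values(entity, prop):
    """All values of `prop` for `entity`, in input order."""
    return [v for e, p, v in PROPS if e == entity and p == prop]


def fields(entity):
    """Collect the pieces both sentence variants need."""
    gender = values(entity, "gender")[0]
    age = values(entity, "age")[0].split()[0]
    return {
        "title": "Ms." if gender == "female" else "Mr.",
        "pronoun": "She" if gender == "female" else "He",
        "name": values(entity, "last name")[0],
        "age": age,
        # Crude article choice, sufficient for ages read aloud as numbers.
        "article": "an" if age.startswith("8") or age in ("11", "18") else "a",
        "gender": gender,
        "history": values(entity, "history"),
        "doctor": values(values(entity, "doctor")[0], "last name")[0],
        "surgery": values(entity, "surgery")[0],
    }


def hypotactic(entity="X"):
    """One sentence: history propositions become premodifying adjectives."""
    f = fields(entity)
    adjectives = " ".join(ADJECTIVE[h] for h in f["history"])
    return (f"{f['title']} {f['name']} is {f['article']} {f['age']} year old, "
            f"{adjectives} {f['gender']} patient of Dr. {f['doctor']} "
            f"undergoing {f['surgery']}.")


def paratactic(entity="X"):
    """Two sentences: history propositions are conjoined in their own clause."""
    f = fields(entity)
    first = (f"{f['title']} {f['name']} is {f['article']} {f['age']} year old, "
             f"{f['gender']} patient of Dr. {f['doctor']} "
             f"undergoing {f['surgery']}.")
    second = f"{f['pronoun']} has a history of {' and '.join(f['history'])}."
    return f"{first} {second}"


if __name__ == "__main__":
    print(hypotactic())
    print(paratactic())
```

The hypotactic variant is shorter because two propositions collapse into two adjectives, which is the conciseness trade-off the speech content planner slide describes.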

Coordination across media
• Temporal media
• Coordinate spoken references with highlighting of graphical references
• Requires negotiation of ordering and duration of media actions

Negotiating Ordering
• Spoken language generator has grammatical constraints on linear ordering
• Graphics generator has spatial constraints on layout
• Individual accounts of these constraints may result in an incoherent presentation

Ms. Jones is an 80 year old, diabetic, hypertensive female patient of Dr. Smith undergoing CABG.

Problems for Language Generation: Ordering
• When to provide an ordering over references?
  • produce a partial ordering after word choice
• How to select an ordering compatible with graphics?
  • produce several possibilities ordered by preference
• How to communicate orderings with graphics?
  • maintain a mapping between strings and semantic objects

Media Negotiation (Conceptual, Semantic, Document)
• Speech components produce candidate partial orders
  1. (< name age (* diabetes hypertension) gender surgeon operation) 10
  2. (< name age gender surgeon operation (* diabetes hypertension)) 5
  3. (< name age gender (* diabetes hypertension) surgeon operation) 4

Media Negotiation
• Graphics components produce candidate partial orders
  1. (di (highlight demographics) ((<m) (subhighlight (mrn age gender)) (subhighlight (medhistory)) (subhighlight (surgeon operation)))) 10
  2. (di (highlight demographics) (* (subhighlight (mrn age gender)) (subhighlight (medhistory)) (subhighlight (surgeon operation)))) 7
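One way to read the two candidate lists is as a search for the speech order and graphics order that are mutually compatible and jointly most preferred. The sketch below works under several assumptions that the slides do not state: the trailing numbers are preference values, a pair is scored by summing them, `<` and `<m` mean "in this sequence", `*` means "any order", a spoken order is compatible only if it never returns to a region after leaving it, and spoken items map to highlighted regions as in the hypothetical `REGION` table (with the spoken name assumed to fall in the mrn/age/gender region).

```python
from itertools import product

# Speech candidates from the slide. A tuple marks an unordered (*) group.
SPEECH = [
    (["name", "age", ("diabetes", "hypertension"), "gender", "surgeon", "operation"], 10),
    (["name", "age", "gender", "surgeon", "operation", ("diabetes", "hypertension")], 5),
    (["name", "age", "gender", ("diabetes", "hypertension"), "surgeon", "operation"], 4),
]

# Graphics candidates: (regions must be highlighted in this order?, regions, preference).
# The region names are shorthand for the three subhighlight groups on the slide.
GRAPHICS = [
    (True, ["demographics", "medhistory", "surgery"], 10),
    (False, ["demographics", "medhistory", "surgery"], 7),
]

# Assumed mapping from spoken items to highlighted regions.
REGION = {
    "name": "demographics", "age": "demographics", "gender": "demographics",
    "diabetes": "medhistory", "hypertension": "medhistory",
    "surgeon": "surgery", "operation": "surgery",
}


def region_sequence(speech_order):
    """Regions visited by a speech order, with consecutive repeats collapsed."""
    seq = []
    for step in speech_order:
        items = step if isinstance(step, tuple) else (step,)
        for item in items:
            region = REGION[item]
            if not seq or seq[-1] != region:
                seq.append(region)
    return seq


def compatible(speech_order, ordered, regions):
    """A speech order fits if no region is revisited and any required order holds."""
    seq = region_sequence(speech_order)
    if len(seq) != len(set(seq)):
        return False  # the highlight would have to jump back to an earlier region
    return seq == regions if ordered else set(seq) == set(regions)


def negotiate():
    """Return (score, speech index, graphics index) of the best compatible pair."""
    best = None
    pairs = product(enumerate(SPEECH, 1), enumerate(GRAPHICS, 1))
    for (s_idx, (order, s_pref)), (g_idx, (ordered, regions, g_pref)) in pairs:
        if compatible(order, ordered, regions):
            score = s_pref + g_pref
            if best is None or score > best[0]:
                best = (score, s_idx, g_idx)
    return best


if __name__ == "__main__":
    print(negotiate())
```

Under these assumptions the speech component's top choice is rejected because it mentions medical history between age and gender, which would split the demographics highlight, so a less preferred spoken order wins once graphics preferences are taken into account.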

CTS Architecture
[Diagram. Component labels: NLG System, Prosody Realizer, TTS, Prosody model, Machine Learning, Speech Corpus, Other Source, Prosodic Rules. Data labels: Input, Text + Annotated Structure, Text, Sound.]

Focus of Research (Rhetorical, Semantic, Syntactic, Prosodic)
• Build a prosody model for CTS using prosodic features (based on ToBI):
  • pitch accent, phrase accent, boundary tone, break index
• Features produced by LG
  • Syntactic structure, POS tags, Semantic boundaries, Concept
  • Informativeness, predictability (statistical models)
  • Abnormality, unexpectedness, sequential rhetorical relation

Mapping to RAGS
• Data filter - conceptual
• General Content Planner - rhetorical, semantic, conceptual
• Speech Content Planner - rhetorical, semantic plus constraints on lexicalization, syntax
• Lexical Chooser - semantic, lexical, syntactic
• Media Coordination - semantic, conceptual, document
• Syntactic Realization - semantic, syntactic
• Prosody Realization - rhetorical, semantic, syntactic, prosodic

Acknowledgments
This work was funded in part by
• DARPA
• NSF
• ONR
• New York State Center for Advanced Technology
• NLM
