Tiddler: Customised publishing based on XML profiles and XML ...

1. Tiddler: Customised publishing based on XML profiles and XML data sources Fran�ois Paradis, C�cile Paris, Anne-Marie Vercoustre, Stephen Wan, Ross Wilkinson, MingFang Wu

2. Outline Motivation Examples Current approaches Our approach How it works? Analysis and Conclusion

3. Motivation: Why customised publishing? Too much information: people want less information but more relevant to their need, knowledge, or task On different devices at different times: paper, Web, WAP (To build customer relationship)

4. Examples Customised Travel Guides Depending on who (preferences), where to go , when to go, Depending on when/where to use it Corporate brochures Depending on who you are and your current interest(s) Bring the India Lonely Planet Guide, and some glossy brochure from a travel agency. Bring CSIRO flyers. Bring the India Lonely Planet Guide, and some glossy brochure from a travel agency. Bring CSIRO flyers.

5. Current techniques Distinct versions of manually crafted documents: one for printing, one for the Web. No personalisation Word -> HTML; Latex -> HTML; HTML-> WAP Information Retrieval: personalisation through queries, synthesis of the results; no much coherence Document generation from database queries and different stylesheets: coherence but not high level semantic of the resulting document. Limited type of sources Document generation using NL techniques: relies on the availability of knowledge base in appropriate format Can be see as three generations of document generation. Many variants of this. Even in generation 1, there where tools for generating personalised documents, e.g. personalised maintenance manuals (SGML based)Can be see as three generations of document generation. Many variants of this. Even in generation 1, there where tools for generating personalised documents, e.g. personalised maintenance manuals (SGML based)

6. Tiddler approach Exploits both language generation and IR-document synthesis approaches: Coherence preserved Wide variety of data sources (including web pages) accessible Dynamically plan documents Customise information using user models Generate documents for multiple media types (Paper, Palm Pilots, Web browsers, Mobile Phones)

7. System Architecture

8. User Model Include preferences, information need, Context (device) historic Collected via a G.U.I. Interface Used to: customise information to user determine layout and content detail depending on media encapsulate some Users� Goal Goal is about information need Virtual Document Planner resolves goal using Planning techniques

9. Input: User Model Name: Zoe Medium: Palm Pilot Destination: Melbourne Date: 1 June-15 June 2001 Activities: Cycling, Opera, Major Mitchell Travel Information: Accommodation (backpacker)

10. XML Representation The model is so far very application dependant. We need to model: long term (user �preference�) short term: user query and context History The model is so far very application dependant. We need to model: long term (user �preference�) short term: user query and context History

11. Output: Palm Pilot Version

12. Output: Web Version

13. Virtual Document Planner:Overview 1 The Virtual Document Planner: uses Planning Techniques: Goal achieved by finding subgoals that satisfy it Subgoals are linked by rhetorical relations Subgoals satisfied by: other decomposable subgoals primitive subgoals Ask Cecile for examples of rhetorical relations.Ask Cecile for examples of rhetorical relations.

14. Virtual Document Planner:Overview 2 The Virtual Document Planner: produces a branching tree structure: Node = information need goal Nodes in branches = subgoals Nodes linked by rhetorical relations Subgoals and Goals represent: content selection presentation decisions

16. Virtual Document Planner: Sub-stages Three substages: The Content Planner The Presentation Planner The Surface Generator

17. Virtual Document Planner: Sub-stage 1 The Content Planner: uses Goal Planning produces a tree structure nodes = document content Branches = rhetorical relations that may be realised with discourse markers

18. Virtual Document Planner: Sub-stage 2 The Presentation Planner: Leaves of the tree = chosen content Leaves expanded with layout mark-up of document Mark-up depends on document organisation Customised for particular media type.

19. Virtual Document Planner: Sub-stage 3 The Surface Generator: Dependent on medium Content and layout mark-up are mapped to: text XML HTML WML Natural Language graphics pictures tables lists

20. Data Sources Norfolk technology: provides interface between: Virtual Document Planner Data sources Data Sources originate from: corporate data bases existing web pages of known layout (wrapping) Data Sources can be: static: Norfolk retrieves content in advance -> XML dynamic: Norfolk retrieves content as needed by Virtual Document Planner

21. Why are Dynamic Documents useful? A document can: be composed using most up-to-date information customise information to user tailor content to particular query tailored to a particular media

22. What are the limitations of current dynamic pages? Dynamic pages are often: statically planned with templates and stylesheets Templates grow exponentially in number as document becomes more flexible represented in program language code makes maintenance more difficult limited to filtering at document level for customisation required to maintain separate templates for different media

23. Conclusions (1) Tiddler Advantages: Easier to maintain because Documents use goal planning, not template based Document Rules not in a program language code Customisation filters and uses relevant information from parts of documents Information can be gathered from multiple sources Documents for different media are generated from the same document skeleton Only need to update the skeleton

24. Conclusions (2) Future Work: - Reasoning about the discourse to provide feedback/explanations - Dynamic and complex user model to deal with history of information delivery - Complex user model to build customer relationship The advantages of the use of discourse model, which is that it allows us to reason about the discourse, so it makes the feedback with users more meaningful. (we know why we generated a piece of discourse, and can explain it to the user too; we know that we've explained something to the user already, so don't need to explain it again, etc.) Or, use the discourse model to keep track of what the user has already been presented (so that we don't repeat information). The advantages of the use of discourse model, which is that it allows us to reason about the discourse, so it makes the feedback with users more meaningful. (we know why we generated a piece of discourse, and can explain it to the user too; we know that we've explained something to the user already, so don't need to explain it again, etc.) Or, use the discourse model to keep track of what the user has already been presented (so that we don't repeat information).

Tiddler: Customised publishing based on XML profiles and XML ...

Tiddler: Customised publishing based on XML profiles and XML ...

Presentation Transcript

Tiddler: Customised publishing based on XML profiles and XML data sources

Secure Publishing of XML Documents

Outside-In XML Publishing

XML Publishing

XML in Multi-channel Publishing

On Relational Support for XML Publishing

Information Interchange based on XML

XML-based Agent Platform

XML and LINQ to XML

Secure Publishing of XML Documents

On Views and XML

Workshop on XML-Based Library Applications

Workshop on XML-Based Library Applications

Workshop on XML-Based Library Applications 2. XML and Library Metadata

Lecture 13: XQuery XML Publishing, XML Storage

XML Publishing

CAIDA’s XML-based Web Publishing Environment

XML to XML through XML

XML and XML in DLESE

Workshop on XML-Based Library Applications

Lecture 12: XML Publishing, XML Storage

CAIDA’s XML-based Web Publishing Environment