180 likes | 364 Views
Overcoming the limitation of XML Documentation Type Definition XML schema generator. CS689 Hae-Soon Kwon 11/16/2000. Content. Research Problem Research Goal Literature Review Significance of the Study Research Design Data Analysis Schedule Facilities and Resources Deliverables.
E N D
Overcoming the limitation of XML Documentation Type DefinitionXML schema generator CS689 Hae-Soon Kwon 11/16/2000
Content • Research Problem • Research Goal • Literature Review • Significance of the Study • Research Design • Data Analysis • Schedule • Facilities and Resources • Deliverables
Research Problem • Extensible Markup Language(XML): developed by W3C to replace the conventional web languages. • XML documents originate well-formed ness and validation. • If the Document Type Definition file(DTD) is declared for that XML document then we can say it is valid. • DTD is a grammar to use the free parsers in that XML document.
Research Problem (continue) • Several weaknesses of XML DTD: - Difficulty of implementation - Not guarantee of validation for the document - No supporting by regular language grammar • To overcome the weaknesses: - W3C released XML Schema ( October 2000) - There are several researches and tools to generate DTD automatically but they have their own limitations. - New effort is required to provide a method for generating XML Schema.
Research Goal • To make an automatic XML Schema generator • To make valid XML document • To provide the convenience way to the XML users to create XML Schema
XML Schema-compared with DTD • XML Schema has prefix ‘xsd:’ associated with its type( ex) <xsd:element> </xsd:element> ) • XML distinguishes element type to element, complexType and simpleType and so on • Element value type is explained in more specific way (ex) <xsd:element name=“age” value=“xsd:integer”/> ) • For more information: W3C Schema Recommendation( http://www.w3.org/TR /xmlschema)
Basic technical term • XML document <person> <name> <first name> Chris </first name> <last name> Allen </last name> </name> <age> 25 </age> <gender> male </gender> </person> Next: XML DTD
Basic technical term • XML DTD <!ELEMENT person ( name, age, gender )> <!ELEMENT name ( first_name, last_name)> <!ELEMENT first_name (#PCDATA)> <!ELEMENT last_name (#PCDATA)> <!ELEMENT age (#PCDATA)> <!ELEMENT gender (#PCDATA)> Next: XML Schema
Basic technical term • XML Schema <xsd:schema xmlns:xsd=”http://www.w3.org/1999/XMLSchema”> <xsd:element name=”person” type=”person”/> <xsd:complexType name=”person”> <xsd:element name=”name” type=”name”/> <xsd:complexType name=”name”> <xsd:element name=”first_name” type =”xsd:string”/> <xsd:elementname=”last_name” type=”xsd:string”/> </xsd:complexType> <xsd:element name=”age” type=”xsd:integer”/> <xsd:element name=”gender”type=”xsd:string”/> </xsd:complexType>
Literature Review • DTD-Miner (Chung-Hue Moh, et al. 2000): automatic DTD generator • Using Regular Tree Automata as XML Schema ( Boris Chidlovskii, 2000): give a method to convert tree automata to DTD • DTD Creation from URL or Simple Text( Keith E. Shafer, 1995): give methods to communicate with clients and to handle errors • Using a Custom DTD( Web Design Group, 1999)
Significance of the Study • The study for automatic generating XML Schema is the requirement of current trends and this study may be a cornerstone in this field.
Research Design: Frame Work Browser for user XML document Schema file Schema generator module Schema file Web interface Interface module Union tree construction module Schema construction module Pre-processor module Pre-processor module If max repetition # is greater than 1 Document tree Extraction module If max repletion # = 1 Set of tags
Research Design: implementation • To communicate with users: HTML and CGI script • To generate trees and Schema: Java
Data Analysis • Schema generator will be tested to measure how correctly it works. • Both URL and text form XML documents will be used for input file. • Each line that is generated by the generator will be compared with original Schema by hand. • Every program implementation step will be tested and feedback.
Schedule Jan Mar May July Oct Algorithmdevelopment Web-related programming Schema generator module implementation Test and feedback
Facilities and Resources • Tree automata algorithm • DTD-Miner • Automatic DTD creating tool: Fred • DTD/Schema Editors/Tools • Miscellaneous
Deliverables • Automatic XML Schema generator • Its algorithm
Thank you… Question? Happy Thanksgiving