1 / 15

Possibility of XML I/O support in ROOT

Explore the possibility of XML I/O support in ROOT, with a focus on existing XML packages, implementation methods, challenges, and potential solutions. This article discusses the need for XML I/O in ROOT and proposes a method to introduce it without requiring manual I/O code from users. XML I/O in ROOT can facilitate data exchange between different packages and enable viewing and editing of XML files. The article also presents the problems faced in implementing XML I/O in ROOT and explores solutions using the TBuffer class modification.

jgifford
Download Presentation

Possibility of XML I/O support in ROOT

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Possibility of XML I/O support in ROOT S. Linev, GSI, Darmstadt ROOT Workshop 2004

  2. Content • XML and existing packages • Concept of XML I/O for ROOT • Possible implementations • Problems and questions • Conclusion XML I/O in ROOT. S.Linev, GSI

  3. eXtensible Markup Language (XML) • Tree like structure (not ROOT tree) of text tags • Each tag opened should be closed • Tag can include other tags, contain text, has attributes • In addition: DTD, XSLT, XML schema, namespaces, … <?xml version="1.0"?> <Example> <item1>item text</item1> <item2 id="001"> <subitem>subitem text</subitem> </item2> <item3 ref="001"/> </Example> Example values: item1 “item text“ item2 id="001" subitem “subitem text“ item3 ref="001" XML I/O in ROOT. S.Linev, GSI

  4. XML packages C/C++ based XML packages: • libxml (Gnome) http://xmlsoft.org • Xerces-C++ (Apache) http://xml.apache.org/xerces-C/ • expat (Mozilla) http://expat.sourceforge.net Benchmarks of XML packages: http://xmlbench.sourceforge.net XML I/O in ROOT. S.Linev, GSI

  5. Usage of libxml2 library Example of code to create XML file: xmlDocPtr fDoc = xmlNewDoc(0); xmlNodePtr fNode = xmlNewDocNode(fDoc, 0, (const xmlChar*) "Example", 0); xmlDocSetRootElement(fDoc, fNode); xmlNewTextChild(fNode, 0, (const xmlChar*) "item1",(const xmlChar*) "item text"); xmlNodePtr sub2 = xmlAddChild(fNode, xmlNewNode(0, (const xmlChar*) "item2")); xmlNewTextChild(sub2, 0, (const xmlChar*) "subitem", (const xmlChar*) "subitem text"); xmlNewProp(sub2, (const xmlChar*) "id", (const xmlChar*) "001"); xmlNodePtr sub3 = xmlAddChild(fNode, xmlNewNode(0, (const xmlChar*) "item3")); xmlNewProp(sub3, (const xmlChar*) "ref", (const xmlChar*) "001"); xmlSaveFormatFile("Example.xml", fDoc, 1); xmlFreeDoc(fDoc); XML I/O in ROOT. S.Linev, GSI

  6. XML and ROOT • XML as metadata storage place: configuration, parameters and geometry objects • XML files can be viewed and edited (with some restriction) with standard XML tools • Data exchange between different packages But currently: • There is no XML support in ROOT (yet) • Each new class requires its own XML streamer XML I/O in ROOT. S.Linev, GSI

  7. Motivation • ROOT has all class information in TStreamerInfo class with methods to serialize/deserialize objects • Why not implement similar mechanism for XML, not only for binary ROOT format? • Aim – introduce XML I/O in ROOT, where user should not write I/O code himself XML I/O in ROOT. S.Linev, GSI

  8. <TXmlExample ref="id0"> <fValue v="10"/> <fArray> <Double v="1.0"/> <Double v="10.0"/> <Double v="5.0"/> <Double v="7.0"/> <Double v="2.0"/> </fArray> <fStr TString="Hello"/> <fSelfPtr ptr="id0"/> </TXmlExample> array Object representation in XML object id class name class TXmlExample { public: Int_t fValue; Double_t fArray[5]; TString fStr; TXmlExample* fSelfPtr; ClassDef(TXmlExample, 1); }; basic type string (special case) pointer XML I/O in ROOT. S.Linev, GSI

  9. First Implementation • New class with two functions similar to TStreamerInfo::WriteBuffer() and TStreamerInfo::ReadBuffer()were implemented to serialize/deserialize objects to/from XML structures • Libxml2 library was used • Requires no any ROOT modifications XML I/O in ROOT. S.Linev, GSI

  10. Problems • Only relatively “simple” objects can be stored • Custom streamers are not supported • As a result, ROOT classes like histograms (TH1), containers (TObjArray) and many other can not be supported XML I/O in ROOT. S.Linev, GSI

  11. TBuffer class modification • Make six methods of TBuffer virtual: void WriteObject(const void *actualObjStart, TClass *actualClass); void* ReadObjectAny(const TClass* cast); Int_t CheckByteCount(UInt_t startpos, UInt_t bcnt, const TClass *clss); void SetByteCount(UInt_t cntpos, Bool_t packInVersion = kFALSE); Version_t ReadVersion(UInt_t *start = 0, UInt_t *bcnt = 0); UInt_t WriteVersion(const TClass *cl, Bool_t useBcnt = kFALSE); • Redefine these methods in new TXmlBuffer class to perform XML specific actions • To support “TFile-like” key organization, new TXmlFile and TXmlKey classes have been created XML I/O in ROOT. S.Linev, GSI

  12. TObjArray arr; arr.Add(new TNamed("name1", "title1")); arr.Add(new TNamed("name2", "title2")); arr.Add(new TNamed("name3", "title3")); TXmlFile file("test.xml","1xxox"); file.Write(&arr, "array"); XmlKey with name and setup info TObjArray tag with version Part of TObjArray streamer (writing): … fName.Streamer(b); nobjects = GetAbsLast()+1; b << nobjects; b << fLowerBound; … Now only version, later full class info Example with TObjArray <?xml version="1.0"?> <root> <XmlKey name="array" setup="1xxox"> <TObjArray version="3"> <XmlObject> <TNamed> <fName TString="name1"/> <fTitle TString="title1"/> </TNamed> </XmlObject> <XmlObject> <TNamed> <fName TString="name2"/> <fTitle TString="title2"/> </TNamed> </XmlObject> <XmlObject> <TNamed> <fName TString="name3"/> <fTitle TString="title3"/> </TNamed> </XmlObject> <XmlBlock size="9"> 0000 00 00 0300 00 00 00 </XmlBlock> </TObjArray> <XmlClasses> <TNamed version="1"/> </XmlClasses> </XmlKey> </root> <?xml version="1.0"?> <root> <XmlKey name="array" setup="1xxox"> <TObjArray version="3"> <UChar>0</UChar> <Int>3</Int> <Int>0</Int> <XmlObject> <TNamed> <fName TString="name1"/> <fTitle TString="title1"/> </TNamed> </XmlObject> <XmlObject> <TNamed> <fName TString="name2"/> <fTitle TString="title2"/> </TNamed> </XmlObject> <XmlObject> <TNamed> <fName TString="name3"/> <fTitle TString="title3"/> </TNamed> </XmlObject> </TObjArray> <XmlClasses> <TNamed version="1"/> </XmlClasses> </XmlKey> </root> XML I/O in ROOT. S.Linev, GSI

  13. Consequence of TBuffer modification • Most of ROOT classes can be stored • Users classes with custom streamers can be supported • Works, if reading and writing parts of custom streamer have similar sequence of I/O actions (normal situation) • Some classes like TTree & TClonesArray are not tested and may be not required to be stored in XML format • At worse case 10% lost of I/O performance Still not fully acceptable because: • this is just “hacking” of ROOT code • TXmlFile and TXmlKey repeats a lot of functionality of similar TFile and TKey classes XML I/O in ROOT. S.Linev, GSI

  14. Further investigations • Producing of DTD files for validation purposes • Using of XML namespaces to avoid names intersection • Extension of TFile and TKey logic on XML files (via abstract interfaces) • C++ code generator for XML I/O to access ROOT objects outside a ROOT environment • Support of different XML packages XML I/O in ROOT. S.Linev, GSI

  15. Conclusion • There is no general XML I/O in ROOT • Very limited solution possible without ROOT changing • With slight TBuffer modifications acceptable XML support in ROOT is possible • Further investigations required XML I/O in ROOT. S.Linev, GSI

More Related