CrossRef Deposit Schema 2.0 Bruce D. Rosenblum INERA INCORPORATED Innovative Software Solutions CrossRef Annual Meeting September 26, 2002
Presentation Overview • What's new in 2.0 • Hierarchy overview • Specific Issues
What's New in 2.0 • It’s more than journal articles • Books • Major reference works • Conference proceedings
DOI Assignment (version 0.3) • Journal • Article
DOI Assignment (version 2.0) • Journal • Article • Issue • Volume • Title • Book • Volume and/or Series • Content Item (e.g. chapters, sections, etc.) • Conference • Proceedings • Paper
Schema Design • Designed for linking • Based on: • Extensive review of actual citations • Publisher input • Only essential fields are required • Deposit Header • Container metadata (journal name, book title, etc.) • Publication date • DOI, URI
Schema vs. DTD • W3C XML Schema • Data validation • Parsers • Conformant • Xerces 2.0 • Semi-conformant • XML Spy • Turbo XML (XML Authority) 2.2.1.100 or later
0.3 vs. 2.0 Comparison • More Hierarchy • Less redundant data • Naming Conventions • Preserved 0.3 DTD names where appropriate • Used ONIX names for some new elements • See documentation for mapping tables
Conference Metadata
<event_metadata>
<!-- Official conference name, including number and subject -->
<conference_name>29th International Conference on Computer Graphics and Interactive Techniques</conference_name>
<!-- Annual conference slogan -->
<conference_theme>Interactive Techniques</conference_theme>
<!-- Popular or jargon name -->
<conference_acronym>SIGGRAPH 2002</conference_acronym>
<conference_number>2</conference_number>
<conference_date start_year="2002">2002</conference_date>
</event_metadata>
<proceedings_metadata language="en">
<!-- Proceedings cover page title -->
<proceedings_title>Proceedings of the twenty-ninth International Conference on Computer Graphics and Interactive Techniques - SIGGRAPH 2002</proceedings_title>
<!-- Subject matter of printed proceedings -->
<proceedings_subject>Computer Graphics</proceedings_subject>
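As a rough illustration of how a depositor might sanity-check such a fragment before submission, the sketch below parses the slide's `<event_metadata>` example with Python's standard `xml.etree.ElementTree` and lists each child element with its text. The helper name `summarize_event` is hypothetical, and the fragment is copied from the slide rather than from the schema documentation.

```python
import xml.etree.ElementTree as ET

# Fragment copied from the slide; not a complete deposit.
SAMPLE = """<event_metadata>
  <conference_name>29th International Conference on Computer Graphics and Interactive Techniques</conference_name>
  <conference_theme>Interactive Techniques</conference_theme>
  <conference_acronym>SIGGRAPH 2002</conference_acronym>
  <conference_number>2</conference_number>
  <conference_date start_year="2002">2002</conference_date>
</event_metadata>"""


def summarize_event(xml_text):
    """Return {element name: text} for the direct children of the root."""
    root = ET.fromstring(xml_text)
    return {child.tag: (child.text or "").strip() for child in root}


summary = summarize_event(SAMPLE)
for tag, text in summary.items():
    print(f"{tag}: {text}")
```

This only checks that the fragment is well-formed XML; validating against the W3C XML Schema itself would need a schema-aware parser.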
Data Submission Issues • Special Characters • Unicode • Combining characters • Non-breaking space (U+00A0) • Face Markup • <b>, <i>, <sup>, <sub>, etc. • Math Markup • Good-faith representation with Unicode and face markup
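The special-character points above can be made concrete with a small sketch. It uses Python's standard `unicodedata` module to show what a combining character and a non-breaking space look like at the code-point level. Whether a deposit should carry composed or decomposed forms is not stated on the slide, so the NFC normalization here is only an illustration, and the helper name `describe` is hypothetical.

```python
import unicodedata

NBSP = "\u00a0"  # non-breaking space


def describe(text):
    """List each character as (code point, Unicode name)."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "UNKNOWN")) for ch in text]


# 'e' followed by COMBINING ACUTE ACCENT: two code points, one visible glyph.
decomposed = "e\u0301"
# NFC normalization folds the pair into the single precomposed character.
composed = unicodedata.normalize("NFC", decomposed)

print(describe(decomposed))
print(describe(composed))
print(describe(NBSP))
```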
General Element Comments • <publisher_item> • <publication_date> • <timestamp> • <person_name> • <collection>
<publisher_item> • <identifier> • Public standard identifier • PII, SICI, DOI • <item_number> • Publisher internal identifier • Not based on public standard • item_number_type attribute • e.g. article_number in electronic-only publications
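A minimal sketch of the distinction between the two children of `<publisher_item>`, built with `xml.etree.ElementTree`. The `item_number_type` attribute and the `article_number` value come from the slide; the `id_type` attribute name, the child order, and both sample values are assumptions made for illustration and should be checked against the schema documentation.

```python
import xml.etree.ElementTree as ET


def build_publisher_item(item_number=None, item_number_type=None,
                         identifier=None, id_type=None):
    """Build a <publisher_item> with an optional internal number and public identifier."""
    item = ET.Element("publisher_item")
    if item_number is not None:
        # Publisher-internal identifier, not based on a public standard.
        attrs = {"item_number_type": item_number_type} if item_number_type else {}
        ET.SubElement(item, "item_number", attrs).text = item_number
    if identifier is not None:
        # Public standard identifier (PII, SICI, DOI). "id_type" is an assumed attribute name.
        attrs = {"id_type": id_type} if id_type else {}
        ET.SubElement(item, "identifier", attrs).text = identifier
    return item


# Hypothetical values for an electronic-only article.
item = build_publisher_item(item_number="e042", item_number_type="article_number",
                            identifier="S1234567802000123", id_type="pii")
print(ET.tostring(item, encoding="unicode"))
```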
<publication_date> • Assignable to item or item container • Journal: issue or article • Book: book or content item • Proceedings: proceedings or paper • Required if underlined • Allows unique online and print dates
<timestamp> • Integer version number • Formatted as date/time • <head> timestamp used if not in <doi_data>
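One way to read "integer version number formatted as date/time" is a value such as 20020926143000, which sorts numerically in the same order as the moments it encodes. The sketch below assumes that `YYYYMMDDhhmmss` layout, which the slide does not spell out, and models the fallback from `<doi_data>` to `<head>` with a hypothetical helper.

```python
from datetime import datetime


def make_timestamp(moment):
    """Encode a datetime as an integer, assuming a YYYYMMDDhhmmss layout."""
    return int(moment.strftime("%Y%m%d%H%M%S"))


def effective_timestamp(head_timestamp, doi_data_timestamp=None):
    """Use the <doi_data> timestamp if present, otherwise fall back to <head>."""
    return head_timestamp if doi_data_timestamp is None else doi_data_timestamp


first = make_timestamp(datetime(2002, 9, 26, 14, 30, 0))
later = make_timestamp(datetime(2002, 9, 27, 9, 0, 0))
print(first, later, later > first)
print(effective_timestamp(first))          # no <doi_data> timestamp: <head> value applies
print(effective_timestamp(first, later))   # <doi_data> timestamp takes precedence
```

Because the value is a plain integer, a later deposit can be recognized as newer with an ordinary numeric comparison.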
<person_name> • New <suffix> element • Do not include titles, e.g. "Dr.", "Prof." • Ambiguous names: <surname> only • <surname>Leonardo da Vinci</surname> • <surname>Prince Charles</surname> • Organizations: Use <organization>, not <surname>
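The naming rules above can be sketched as a small builder. `<surname>`, `<suffix>`, and `<organization>` are named on the slide; `<given_name>` and the element order are assumptions, and the title list is deliberately tiny and hypothetical.

```python
import xml.etree.ElementTree as ET

TITLES = {"dr", "dr.", "prof", "prof."}  # illustrative, not exhaustive


def strip_leading_titles(name):
    """Drop leading honorifics such as "Dr." or "Prof." from a name string."""
    words = name.split()
    while words and words[0].lower() in TITLES:
        words.pop(0)
    return " ".join(words)


def build_contributor(surname=None, given_name=None, suffix=None, organization=None):
    """Return <organization> for corporate authors, otherwise <person_name>."""
    if organization is not None:
        org = ET.Element("organization")
        org.text = organization
        return org
    person = ET.Element("person_name")
    given = strip_leading_titles(given_name or "")
    if given:
        ET.SubElement(person, "given_name").text = given  # assumed element name
    ET.SubElement(person, "surname").text = strip_leading_titles(surname or "")
    if suffix:
        ET.SubElement(person, "suffix").text = suffix
    return person


plain = ET.tostring(build_contributor("King", "Dr. Martin Luther", "Jr."), encoding="unicode")
ambiguous = ET.tostring(build_contributor("Leonardo da Vinci"), encoding="unicode")
corporate = ET.tostring(build_contributor(organization="World Health Organization"), encoding="unicode")
print(plain, ambiguous, corporate, sep="\n")
```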
<collection> • Designed for multiple resolution • Infinitely recursive • For future use
Content-Specific Elements • Books • <content_item> • <edition_number> • Conference Proceedings • Conference metadata • <conference_name> • <conference_number> • <conference_date>
<content_item> • Attributes • component_type: chapter, section, part, track, reference_entry, other • level_sequence_number • Indicates level of nesting • Reflects TOC layout • Flat schema model implies submission order if important
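Since the schema model is flat, a nested table of contents has to be emitted in reading order with the nesting depth carried in `level_sequence_number`. The sketch below shows one possible flattening. The nested tuple structure and the function names are hypothetical, while the attribute names and `component_type` values are taken from the slide.

```python
import xml.etree.ElementTree as ET


def flatten_toc(entries, level=1):
    """Yield (level, component_type, title) in TOC order from a nested list."""
    for component_type, title, children in entries:
        yield level, component_type, title
        yield from flatten_toc(children, level + 1)


def build_content_items(entries):
    """Return (title, <content_item>) pairs in submission order."""
    items = []
    for level, component_type, title in flatten_toc(entries):
        element = ET.Element("content_item",
                             component_type=component_type,
                             level_sequence_number=str(level))
        items.append((title, element))
    return items


# Hypothetical TOC: each entry is (component_type, title, children).
toc = [
    ("part", "Part I", [
        ("chapter", "Chapter 1", [("section", "Section 1.1", [])]),
        ("chapter", "Chapter 2", []),
    ]),
]

items = build_content_items(toc)
for title, element in items:
    print(element.get("level_sequence_number"), element.get("component_type"), title)
```

The depth-first traversal keeps the submission order identical to the printed TOC, which is what lets a reader of the flat list reconstruct the nesting.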
<edition_number> • Edition number of book • Update when entire book updated • Unresolved issue: • For online content with discrete updates, when do you update the edition number? • Possible solutions: • Register new DOI if chapter is significantly changed • Keep original DOI and make old versions accessible
<conference_date> • CDATA format: <conference_date>Jan. 15-17, 1997</conference_date> • Attribute format: <conference_date start_day="15" start_month="01" start_year="1997" end_day="17" end_month="01" end_year="1997"/> • CDATA and attribute: <conference_date start_day="15" start_month="01" start_year="1997" end_day="17" end_month="01" end_year="1997">Jan. 15-17, 1997</conference_date>
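The three forms can be produced from one helper, sketched here with `xml.etree.ElementTree`. The attribute names come from the slide; the two-digit zero padding of days and months is inferred from the `start_month="01"` example and is an assumption, as is the helper name `build_conference_date`.

```python
import xml.etree.ElementTree as ET
from datetime import date


def build_conference_date(start=None, end=None, display_text=None):
    """Build <conference_date> with date attributes, display text, or both."""
    attrs = {}
    for prefix, day in (("start", start), ("end", end)):
        if day is not None:
            attrs[f"{prefix}_day"] = f"{day.day:02d}"      # zero padding is assumed
            attrs[f"{prefix}_month"] = f"{day.month:02d}"
            attrs[f"{prefix}_year"] = str(day.year)
    element = ET.Element("conference_date", attrs)
    if display_text:
        element.text = display_text
    return element


text_only = build_conference_date(display_text="Jan. 15-17, 1997")
attrs_only = build_conference_date(date(1997, 1, 15), date(1997, 1, 17))
both = build_conference_date(date(1997, 1, 15), date(1997, 1, 17), "Jan. 15-17, 1997")

for element in (text_only, attrs_only, both):
    print(ET.tostring(element, encoding="unicode"))
```

Carrying both the attributes and the display text keeps the date machine-readable while preserving the wording printed in the proceedings.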
Resources • CrossRef Schema Documentation 2.0.5.pdf • and sample XML files • http://mddb2.crossref.org/doc/TechDoc.html • tech-support@crossref.org
Questions? Bruce D. Rosenblum INERA INCORPORATED +1 (617) 969 - 3053 brosenblum@inera.com www.inera.com