560 likes | 744 Views
Journals and Magazines and Books, Oh My! A Look at ACS' Use of NLM Tagsets. Dan O'Brien, ACS Publications Presented at JATS-Con, 1-Nov-2010. What We'll Cover. Intro ACS, Products, Processes Framework & terminology for discussing customizations ACS Pubs' Use of NLM Tagsets
E N D
American Chemical Society Journals and Magazines and Books, Oh My! A Look at ACS' Use of NLM Tagsets Dan O'Brien, ACS Publications Presented at JATS-Con, 1-Nov-2010
What We'll Cover • Intro • ACS, Products, Processes • Framework & terminology for discussing customizations • ACS Pubs' Use of NLM Tagsets • Overall Approach • Journals • Books • Magazine • Successes & Lessons Learned American Chemical Society
Character Introductions • ACS & ACS Pubs • Journals • Books • Magazine • Processes • Terminology American Chemical Society
Introductions: ACS • Professional membership organization • Chartered by U.S. Congress in 1876 • Non-profit • Over 160,000 members • ACS Publications Division ("ACS Pubs") • Journals • Magazine • Books • On a quest American Chemical Society
Introductions: ACS Journals • 40 peer-reviewed titles • 300,000 annual published pages • ~50% volume published weekly • Among highest ISI impact factors • "King" of publishing forest American Chemical Society
Introductions: Books • Symposium Series • Around 30 titles published annually • Around 25 chapters per book • Hard covers, rigid content format American Chemical Society
Introductions: C&EN Magazine • Chemical & Engineering News • Weekly Print & Web issues • Daily Online News • "BusinessWeek" for chemists • Flexible format, loose content definitions • More than meets the eye American Chemical Society
Introductions, cont. • Pressure for product innovation: Wicked Which of the West • NLM Tagsets – has the answers: Wizard of Oz American Chemical Society
Introductions: Processes • Journals & Books: • Standard scholarly publishing model • XML-first article/chapter based production • Automated Pre-Editing (Inera AutoRedact) • Technical Editing • Automated Post-Editing & Validations • Article ASAP publication (Journals) • Issue/Book publication (Journals & Books) • Magazine: • Staff writers vs. authors • Feature articles, Thematic issues • Story Online News? Issue? • Edit-to-Fit American Chemical Society
Introductions: Journal Process American Chemical Society
Introductions: Books Process American Chemical Society
Introductions: Magazine Process American Chemical Society
Terminology • Tag – a bit of XML markup: an element, attribute, etc. • Tag Definition – the coding (in DTD or XSD syntax) that declares the tag name and what its allowed to do. • Module – a way of logically organizing tag definitions, allowing reuse for multiple schemas. • Tagset – a collection of related tag definitions forming a complete vocabulary, usually stored within a set of interrelated modules • Schema – an application of a tagset to form a specific content model American Chemical Society
Tagset Module Module Tag definition A Schema (DTD, XSD, etc.) Tag definition B Module Module Tag definition C Tag definition D Module Module Module Module Schema (DTD, XSD, etc.) Tag definition dependencies Terminology American Chemical Society
Terminology – "Customization Levels" American Chemical Society
Terminology – "Customization Levels" American Chemical Society
Terminology – "Customization Levels" American Chemical Society
Terminology – "Customization Implementation Methods" American Chemical Society
Tagset Module Module Tag definition A Custom Schema (DTD, XSD, etc.) Tag definition B Module Module Tag definition C Tag definition D Module Module Module Module Public Schema (DTD, XSD, etc.) Tag definition dependencies Terminology – "Customization Implementation Methods" American Chemical Society
Tagset Module Tag definition A Custom Schema (DTD, XSD, etc.) Tag definition B Module Module Tag definition C Tag definition D Module Module Module Module Public Schema (DTD, XSD, etc.) Tag definition dependencies Terminology – "Customization Implementation Methods" American Chemical Society
Terminology – "Customization Profile" American Chemical Society
ACS Pubs' Use of NLM Tagsets – Overview & Approach • Leverage a public schema, or develop one from scratch? • If use a public schema, would customization be needed? (i.e., where on the "Customization Levels” spectrum?) • Product drivers !! • Process drivers !! • ACS Terminology !? • If customization would be needed: • How much customization was needed? (scoping) • What customizations are needed? (details) • How to implement the customizations? (i.e., where on the "Implementation Methods" spectrum?) American Chemical Society
ACS Journals' Use of NLM Tagsets • Production vs. Delivery • What we use and why • Customization Profile • Highlights of Customizations American Chemical Society
ACS Journal Production:What we use • Custom-built DTD based loosely on NLM Journal Archiving & Interchange v2.2 • ~2005, as NLM tagset was beginning to increase in prominence for STM publishing • Pre-2010: Monolithic tagset & schema used for editing, page composition, interchange with web delivery and 3rd parties • Late 2010: New version of tagset supporting multiple schema flavors: • "X" – External & Delivery Interchange • "P" – Internal Production • "L" – Page Layout American Chemical Society
ACS Journal v1.03 DTDs ACS Journal v1.03 Tagset Core tagset modules External/Interchange DTD ACS Journal Production:What we use American Chemical Society
ACS Journal v1.03 DTDs ACS Journal v1.03 Tagset Production-specific tagset features extend core modules Production DTD Core tagset modules External/Interchange DTD Overrides of tag definitions ACS Journal Production:What we use American Chemical Society
ACS Journal v1.03 DTDs ACS Journal v1.03 Tagset Page layout specific tagset features extend production-specific modules Layout DTD Production-specific tagset features extend core modules Production DTD Core tagset modules External/Interchange DTD Overrides of tag definitions ACS Journal Production:What we use American Chemical Society
ACS Journal Production: Why • No public tagset met the minimum requirements for • ACS Journal Product – without undesirable product limitations • ACS Journal Process – without increasing costs • Allowing ACS Pubs Terminology • Without significant staff training & documentation updates • Without risking rejection • NLM's Journal tagset came closest • Could have used massive extensions? • ACS Pubs Terminology pushed us into "Built From" American Chemical Society
ACS Journal Production: Customization Profile American Chemical Society
ACS Journal Production: Customizations – Terminology American Chemical Society
ACS Journal Production: Customizations – Process American Chemical Society
ACS Journal Production: Customizations – Product American Chemical Society
ACS Journal Production: Customizations – Product, cont. American Chemical Society
ACS Journal Delivery: What we use • Online delivery system: based on Literatum from Atypon • Literatum speaks "NLM Journal Archive & Interchange" • Common base tagset ≠ XML content compatibility • Differing schemas • Differing tagging expectations ...see Figure <xref rid="xfca3"/>. vs. ...see Figure <xref rid="xfca3">4</xref>. American Chemical Society
ACS Journal Delivery: What we use • Two-part content interface • Production system: "ACS-Delivery-Prep" (export ) • Delivery system: "ACS2NLM" lexer ( import) Both advantages & disadvantages + Insulates Production developers from Delivery intricacies + Delivery system tagging can evolve without Production - Occasional failure point - New products, production tagging changes = ACS2NLM lexer changes American Chemical Society
ACS Journal Delivery: Customization Profile American Chemical Society
ACS Books' Use of NLM Tagsets • What we use and why • Customization Profile • Highlights of Customizations American Chemical Society
ACS Books: What we use and why - Drivers • Delivery System: Leverage our new Literatum-based delivery platform. • Composition: Leverage Arbortext Publishing Engine for highly-automated XML-based page composition. • Like Journals: Don't re-invent the XML wheel. • Unlike Journals: Books had unique product characteristics of their own; different type of wheel. • Book + Chapter production: • Individual Chapter level: production editing and some composition • Whole Book level: final book composition, indexing • Delivery: combination of both book and chapter XML & PDF deliverables. American Chemical Society
ACS Books: What we use and why - Answers • Delivery System: • Literatum already supported an Extended version of NLM Book v2.3 • Production & Delivery could share a common tagset! • Composition: Extended NLM Book v2.3 fit the bill • Like Journals: • Extended NLM Book v2.3 had CALS table model • Many elements & structures were similar to ACS Journal tagset, easing adoption • Unlike Journals: Extended NLM Book v2.3 addressed almost all book-specific metadata & processing needs • Book + Chapter production: gap! Solution: Xinclude • Allows "link book to chapter" instead of "copy chapter into book" American Chemical Society
ACS Books: Customization Profile American Chemical Society
ACS Books: Customization Highlights • Addition of XInclude • Allows a chapter XML to be processed both as stand-alone document AND within context of entire book • Use of OASIS Table Model (instead of default XHTML Table model) • Addition of DocBook <index> Model • Addition of <book-series-meta> section (similar to <journal-meta>) American Chemical Society
Book DTD Book DTD Chapter XMLs Book XML Book XML <book-part>… <book> <book-series-meta>… <book-meta>… <body> <book-part>… <book-part>… <book-part>… <book> <book-series-meta>… <book-meta>… <body> <xi:include hef="ch1.xml"/> <xi:include hef="ch2.xml"/> <xi:include hef="ch3.xml"/> <book-part>… <book-part>… ACS Books: Customization Highlights - XInclude American Chemical Society
ACS C&EN Magazine's Use of NLM Tagsets • What we use and why • Customization Profile • Highlights of Customizations American Chemical Society
ACS Magazine: What we use and why • What: A customized version of the ACS Journal Tagset • (Which was "informed by" NLM Journal Tagset) • Drivers: • Ability to archive a "content of record" that is format independent • Ability to serve as technology-neutral "content interchange format" • Automated web delivery • External content syndication • Other contenders: DITA for Publications, DocBook, EPUB, PRISM, NewsML, American Chemical Society
ACS Magazine:Customization Profile American Chemical Society
ACS Magazine: Customization Highlights • Amorphous, modular content structures: XInclude • Same content produced as • Single article in print • Several distinct pages online • Web-only articles & article components • Blur between articles & subarticles • Graphics, tables, media have separate production lifecycles, joined later • Non-contiguous Pagination • Ads American Chemical Society
ACS Magazine: Customization Highlights • Flexible, recursive categorization model • Print/web name, internal code, source/type • "CO2 Sequestration" vs. "Carbon Dioxide Sequestration" • RSS feeds • Alternate topic-oriented TOCs • Special content constructs • Dek • Eyebrow • Pull quotes American Chemical Society
ACS Magazine: Customization Highlights American Chemical Society
ACS Pubs' Use of NLM Tagsets – Summary Tagset Lineage & Content Interchange Map American Chemical Society