220 likes | 356 Views
Enabling Academic Research: Office Add-ins. Alex Wade Director – Scholarly Communication Microsoft External Research. Interactive Data. Collaboration. Dynamic Documents. Reputation & Influence. Envisioning a New Era of Research Reporting. Imagine…
E N D
Enabling Academic Research:Office Add-ins Alex Wade Director – Scholarly Communication Microsoft External Research
Interactive Data Collaboration DynamicDocuments Reputation & Influence Envisioning a New Era of Research Reporting Imagine… • Live research reports that had multiple end-user ‘views’ and which could dynamically tailor their presentation to each user • An authoring environment that absorbs and encapsulates research workflows and outputs from the lab experiments • A report that can be dropped into an electronic lab workbench in order to reconstitute an entire experiment • A researcher working with multiple reports on a Surface and having the ability to mash up data and workflows across experiments • The ability to apply new analyses and visualizations and to perform new in silico experiments Reproducible Research
Office 2007 File Formats • Office Open XML (OOXML) • .docx, .xlsx, .pptx • ZIP compatible Open Packaging Convention (OPC) container • XML content, as well as other file formats
Programmer View of Open XML Files • ZIP Archive • Document Parts • XML Parts • Binary Parts • Typed (RFC 2616) • Relationships • Connections between parts • Content Type Stream • A specially-named stream • Defines mappings from part names to content types • Not itself a part, not URI addressable • Folder structure for convenience only
Developer Scenario: Content Inspection Remove confidential information, tracked changes or metadata from outbound documents: Remove macros, inappropriate language, or other content from inbound documents: Open XML Processing Open XML Processing
Development Scenario: Consuming Documents Create expense reports as spreadsheet documents, which are loaded into a back-end system on the server: Authoring environment(Microsoft Office, etc.) Open XML Processing Back-end system(LOB/CRM/etc.)
Development Scenario: Document Assembly Create sales reports from financial and forecast data stored in a CRM system: Web client or rich clientallows user to select orenter content criteria
Development Scenario: Custom XML Markup Tagging document content with custom semantics for processing by a back-end system. Authoring environment Open XML Processing
Custom XML Data Store Doc/Template • Custom-defined XML part • Stored separately from document body • Any XML can be stored • Document properties • WSS meta-data • Custom XML (with or without XML schema) • External applications can easily read or write the custom XML part • True separation of data and presentation Doc Parts XML External App
OpenXmlDeveloper.org • Formed by 40 companies to share developer information about the Office Open XML file formats • Articles with full source code for C#, VB, Java, XSLT • Forums for posting technical questions
Creative Commons Add-in for Office Intent: Insert Creative Commons licenses from within Office 2007 Services: Integrates with Creative Commons Web API to create new licenses Relationships: license information stored as RDF XML within the document OOXML Source code and binary: http://ccaddin2007.codeplex.com
Article Authoring Add-in for Word 2007 Services: repository deposit via SWORD Structure: Read, convert, and author NLM XML documents Relationships: ORE Resource Map creation Relationships: Citation lookup and reference management Structure: Client-side XML validation Binary (v.2 beta 3): http://research.microsoft.com/authoring/ This work is licensed under a Creative Commons Attribution 3.0 United States License.
Evolving Publishing Landscape • Scientific, Technical, and Medical publishing (STM) • Accelerating move to electronic publishing and consumption of journals • Semantics over presentation at authoring • Open Access publishing and repositories • High Energy Physics – arXiv.org • Biomedical – PubMed Central
PubMed Central • National Institute of Health’sNational Library of Medicine • PubMed Central repository • Archiving mandated by law as part of funding • Conversion usually done by publisher • NLM XML formats • Light on presentation, extensive metadata • Used in the publishing and archiving process, even outside of biomed • Article publishing and book formats
Key Add-in Functionality • Opening and saving of NLM XML files • Preserving metadata and semantics • Article templates • Simplify authoring experience • Package and convey requirements to authors • Metadata access and editing • Author view - simple • Editor view - extensive
Author and Editor Experience • For Authors - rely on native Word functionality/concepts as much as possible • Add-in and file formats should be invisible • Try to gather metadata from authors • Reduce data entry errors • Provide Editors and staff with full access to metadata, format concepts, and validation • Likely to know at least parts of the format or metadata in depth • Need full extent of the format presented in the UI
Word Building Blocks • Custom ribbon • Side Panel • InfoPath integration • Content control and custom elements • Format SDK and packaging functionality • MathML XSLT (two way)
Additional Capabilities • Direct upload/submission from Word using the Authoring Add-in through SWORD • Simplified experience for authors • Some data validation before submission • Data ready for use by intake infrastructure • Embedding data within articles and referencing data stored online based on ORE • Enable an additional dimension to the content
Status • Version 2 Beta 3 available for free download • http://research.microsoft.com/authoring/ • Great community participation and engagement • Valuable feedback from the National Library of Medicine, Hindawi Publishing, and other Beta participants
Chem4Word - Chemistry Drawing in Word Author/edit 1D and 2D chemistry. Change chemical layout styles. • Peter Murray-Rust • Joe Townsend • Jim Downing Intent: Recognizes chemical dictionary and ontology terms Relationships: Navigate and link referenced chemistry Data: Semantics stored in Chemistry Markup Language <?xmlversion="1.0" ?> <cmlversion="3" convention="org-synth-report" xmlns="http://www.xml-cml.org/schema"> <moleculeid="m1"> <atomArray> <atomid="a1" elementType="C" x2="-2.9149999618530273" y2="0.7699999809265137" /> <atomid="a2" elementType="C" x2="-1.5813208400249916" y2="1.5399999809265137" /> <atomid="a3" elementType="O" x2="-0.24764171819695613" y2="0.7699999809265134" /> <atomid="a4" elementType="O" x2="-1.5813208400249912" y2="3.0799999809265137" /> <atomid="a5" elementType="H" x2="-4.248679083681063" y2="1.5399999809265137" /> <atomid="a6" elementType="H" x2="-2.914999961853028" y2="-0.7700000190734864" /> <atomid="a7" elementType="H" x2="-4.248679083681063" y2="-1.907348645691087E-8" /> <atomid="a8" elementType="H" x2="1.0860374036310796" y2="1.5399999809265132" /> </atomArray> <bondArray> <bondatomRefs2="a1 a2" order="1" /> <bondatomRefs2="a2 a3" order="1" /> <bondatomRefs2="a2 a4" order="2" /> <bondatomRefs2="a1 a5" order="1" /> <bondatomRefs2="a1 a6" order="1" /> <bondatomRefs2="a1 a7" order="1" /> <bondatomRefs2="a3 a8" order="1" /> </bondArray> </molecule> </cml> Intelligence: Verifies validity of authored chemistry Available soon: http://research.microsoft.com/chem4word/