120 likes | 229 Views
Unlock Content ™. Cross Media Strategy I: Tools & Technologies. Gilbane San Francisco – April 10 th – 12 th Andrew Grygiel VP Marketing & Marketing Development. Topics. Important technologies, techniques and concepts XML XQuery Digital Product Creation. XML: The Fundamental Enabler.
E N D
Unlock Content™ Cross Media Strategy I: Tools & Technologies Gilbane San Francisco – April 10th – 12th Andrew GrygielVP Marketing & Marketing Development
Topics • Important technologies, techniques and concepts • XML • XQuery • Digital Product Creation
XML: The Fundamental Enabler • Separation of form and content • <title>, <link>, and <description> • Not <bold> and <italic> • Ideal repository / archive format • Open, standard, human-readable, presentation-neutral • Use markup to denote many different things • Metadata: <author>, <pub-date> • Structure: <section>, <caption>, <citation>, <footnote> • Entities: <person>, <place>, <thing> • Facts: relationships between entities • And much more: molecules, diseases, concepts, sentiment, parts-of-speech, coordinates, ...
XQuery • XML Query language (XQuery) • Designed specifically for querying XML • Kind of SQL for XML • Recently became an approved W3C standard • De-risk: use, acquisition from small suppliers • Reflects the XML bifurcation • RDBMS vendors: data-centric implementations • XML content server vendors: content-centric implementations • Poorly named: much more than a query language • Programming language for building XML-based web applications
Digital Product Creation: Traditional • Decide what you want to build • Round up content for it • From inside the organization • License from outside if necessary • Normalize that content • Get it to one DTD or schema • Build application / delivery system from scratch • See if it works App All fixed cost
Digital Product Creation: New • Round up all your content • Semi-normalize it • Create platform to accelerate new product creation • Decide what you want to build • License additional content if needed • Clean-up and enrich to enable the app • Build application / delivery system on the platform • See if it works App App App ●● Platform Use platform to reduce marginal cost of applications
Digital Product Creation Activities • Integrate • Of internal, licensed external, and possibly Internet content • Enrich • From basic clean-up to advanced entity, fact, concept, and sentiment extraction • Contextualize • Creation of content applications that don’t just provide information • But instead help a known role accomplish a known task
Enrich: Intelligence Demo Fact extraction (relationships between entities) <inxight:travel> <inxight:person confidence="100">President Nixon</inxight:person> went to <inxight:city confidence="100" latitude="39.928888" longitude="116.388333">Beijing</inxight:city> for a week early in <inxight:year confidence="100">1972</inxight:year>, meeting <inxight:person confidence="90" reference="person5">Mao</inxight:person> as well as <inxight:person confidence="90">Zhou</inxight:person>. </inxight:travel> Also have sentiment extraction, concept extraction, entity extraction, part-of-speech extraction, … (“a boy and his algorithm”)
Contextualize: Elsevier PathCONSULT For pathologists doing differential diagnosis
Resources • Querying XML • Melton and Buxton • Mark Logic CEO blog • http://marklogic.blogspot.com • Discovering XQuery blog • http://xquery.typepad.com • CMS Watch • www.cmswatch.com • Text Technologies, DBMS2 • www.texttechnologies.com • www.dbms2.com • XML and databases • http://www.rpbourret.com/xml/xmlanddatabases.htm