980 likes | 1.11k Views
Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides http://www.librarytechnology.org/ http://twitter.com/mbreeding. Cloud Computing: A New Generation of Technology Enables Deeper Collaboration. 05 Sept 2013.
E N D
Marshall Breeding Independent Consult, Author, Founder and Publisher, Library Technology Guides http://www.librarytechnology.org/ http://twitter.com/mbreeding Cloud Computing: A New Generation of Technology Enables Deeper Collaboration 05 Sept 2013 4th International Symposium on Information Management
Summary • Marshall Breeding will present a summary of the latest trends in library management systems and discovery services. Many of these new products, especially those based on cloud computing technologies, have a profound impact on the models of resource sharing available to libraries. Breeding will also review some of the major tech products and organizational trends that have transpired in recent times. On many fronts libraries are consolidating their resource sharing arrangements to form ever larger pools of resources available to their clients.
Library Technology Guides www.librarytechnology.org
Mergers and Acquisitions http://www.librarytechnology.org/automationhistory.pl
Library Journal Automation Marketplace • Published annually in April 1 issue • Based on data provided by each vendor • Focused primarily on North America • Context of global library automation market
LJ Automation Marketplace Annual Industry report published in Library Journal: • 2013: Rush to Innovate • 2012: Agents of Change • 2011: New Frontier: battle intensifies to win hearts, minds and tech dollars • 2010: New Models, Core Systems • 2009: Investing in the Future • 2008: Opportunity out of turmoil • 2007: An industry redefined • 2006: Reshuffling the deck • 2005: Gradual evolution • 2004: Migration down, innovation up • 2003: The competition heats up • 2002: Capturing the migrating customer
Library Technology Reports • Resource Sharing in Libraries: Concepts, Products, Technologies, and Trends • January 2013 • Vol 49, No. 1
Library Technology Reports • Supplementing your local collection through resource sharing is a smart way to ensure your library has the resources to satisfy the needs of your users. Marshall Breeding’s new Library Technology Report explores technologies and strategies for sharing resources, helping you streamline workflows and improve resource-sharing services by covering key strategies like interlibrary loan, consortial borrowing, document delivery, and shared collections. You’ll also learn about such trends and services as: • OCLC WorldCat Resource Sharing, and other systems that facilitate cooperative, reciprocal lending • System-to-system communications that allow integrated systems to interact with resource-sharing environments • Technical components that reliably automate patron requests, routing to suppliers with tools for tracking, reporting, and staff intervention as needed • Specialized applications that simplify document delivery, such as Ariel, Odyssey, or OCLC’s Article Exchange • How the NISO Circulation Interchange Protocol (NCIP) can enable borrowing among consortial libraries using separate integrated library systems • The Orbis Cascade Alliance consortium, examined using a case study
Cloud Computing for Libraries Book Image Publication Info: • Volume 11 in The Tech Set • Published by Neal-Schuman / ALA TechSource • ISBN: 781555707859 • http://www.neal-schuman.com/ccl
AppropriateAutomation Infrastructure • Current automation products out of step with current realities • Majority of library collection funds spent on electronic content • Majority of automation efforts support print activities • New discovery solutions help with access to e-content • Management of e-content continues with inadequate supporting infrastructure
Key Context: Libraries in Transition • Academic Shift from Print > Electronic • E-journal transition largely complete • Circulation of print collections slowing • E-books now in play (consultation > reading) • All libraries: • Need better tools for access to complex multi-format collections • Strong emphasis on digitizing local collections • Demands for enterprise integration and interoperability
Key Text: Changed expectations in metadata management • Moving away from individual record-by-record creation • Life cycle of metadata • Metadata follows the supply chain, improved and enhanced along the way as needed • Manage metadata in bulk when possible • E-book collections • Highly shared metadata • E-journal knowledge bases, e.g. • Great interest in moving toward semantic web and open linked data • Very little progress in linked data for operational systems • AACR2 > RDA • MARC > Bibframe (http://bibframe.org/)
Key Context: Technologies in transition • Client / Server > Web-based computing • Beyond Web 2.0 • Integration of social computing into core infrastructure • Local computing shifting to cloud platforms • Application Service Provider offerings standard • New expectations for multi-tenant software-as-a-service • Full spectrum of devices • full-scale / net book / tablet / mobile • Mobile the current focus, but is only one example of device and interface cycles
Fundamental technology shift • Mainframe computing • Client/Server • Web-based and Cloud Computing http://www.flickr.com/photos/carrick/61952845/ http://soacloudcomputing.blogspot.com/2008/10/cloud-computing.html http://www.javaworld.com/javaworld/jw-10-2001/jw-1019-jxta.html
Major trend in Information Technology Term “in the cloud” has devolved into marketing hype, but cloud computing in the form of multi-tenant software as a service offers libraries opportunities to break out of individual silos of automation and engage in widely shared cooperative systems Opportunities for libraries to leverage their combined efforts into large-scale systems with more end-user impact and organizational efficiencies Cloud Computing
Beyond “Cloudwashing” • Cloud as marketing hype • Cloud computing used very freely, tagged to almost any virtualized environment • Any arrangement where the library relies on some kind of remote hosting environment for major automation components • Includes almost any vendor-hosted product offering • Example: ASP now Software-as-a-Service
Cloud computing – characteristics • Web-based Interfaces • Externally hosted • Pricing: subscription or utility • Highly abstracted computing model • Provisioned on demand • Scaled according to variable needs • Elastic – consumption of resources can contract and expand according to demand
Budget Allocations Local Computing Cloud Computing • Server Purchase • Server Maintenance • Application software license • Data Center overhead • Energy costs • Facility costs • Annual Subscription • Measured Service? • Fixed fees • Factors • Hosting • Software Licenses • Optional modules
Multi Tennant SaaS is the modern approach One copy of the code base serves multiple sites Software functionality delivered entirely through Web interfaces No workstation clients Upgrades and fixes deployed universally Usually in small increments Software as a Service
SaaS provides opportunity for highly shared data models Bibliographic knowledgebase: one globally shared copy that serves all libraries Discovery indexes: article and object-level index for resource discovery E-resource knowledge bases: shared authoritative repository of e-journal holdings General opportunity to move away from library-by-library metadata management to globally shared workflows Data as a service
Key Context: Libraries in Transition • Academic Shift from Print > Electronic • E-journal transition largely complete • Circulation of print collections slowing • E-books now in play (consultation > reading) • Public: Emphasis on Customer Engagement • Increased pressure on physical facilities • Increased circulation of print collections • Dramatic increase in interest in e-books • All libraries: • Need better tools for access to complex multi-format collections • Strong emphasis on digitizing local collections • Demands for enterprise integration and interoperability
Reconceptualization of Automation • Current organization of functionality based on past assumptions • Possible new organizing principles • Fulfillment = Circulation + ILL + DCB + e-commerce • Resource management = Cataloging + Acquisitions + Serials + ERM • Customer Relationship Management = Reference + Circulation + ILL (public services) • Enterprise Resource Planning = Acquisitions + Collection Development
Open Systems • Achieving openness has risen as the key driver behind library technology strategies • Libraries need to do more with their data • Ability to improve customer experience and operational efficiencies • Demand for Interoperability • Open source – full access to internal program of the application • Open API’s – expose programmatic interfaces to data and functionality
Challenge: More integrated approach to information and service delivery • Library Web sites offer a menu of unconnected silos: • Books: Library OPAC (ILS online catalog module) • Search the Web site • Articles: Aggregated content products, e-journal collections • OpenURL linking services • E-journal finding aids (Often managed by link resolver) • Subject guides (e.g. SpringshareLibGuides) • Local digital collections • ETDs, photos, rich media collections • Metasearch engines • Discovery Services – often just another choice among many • All searched separately
ILS Data Online Catalog Search: Scope of Search • Books, Journals, and Media at the Title Level • Not in scope: • Articles • Book Chapters • Digital objects • Web site content • Etc. Search Results
Next-gen Catalogs or Discovery Interface (2002-2009) • Single search box • Query tools • Did you mean • Type-ahead • Relevance ranked results (for some content sources) • Faceted navigation • Enhanced visual displays • Cover art • Summaries, reviews, • Recommendation services
Discovery Interface search model ILS Data Digital Collections Search: Local Index ProQuest Search Results EBSCOhost Metasearch Engine … MLA Bibliography ABC-CLIO Real-time query and responses
Discovery Products http://www.librarytechnology.org/discovery.pl
Differentiation in Discovery • Products increasingly specialized between public and academic libraries • Public libraries: emphasis on engagement with physical collection + e-books • Academic libraries: concern for discovery of heterogeneous material types, especially books + articles + digital objects
Discovery from Local to Web-scale • Initial products focused on technology • AquaBrowser, Endeca,Primo, Encore, VuFind, • LIBERO Uno, Civica Sorcer, Axiell Arena • Mostly locally-installed software • Current phase is focused on pre-populated indexes that aim to deliver Web-scale discovery • Primo Central (Ex Libris) • Summon (Serials Solutions) • WorldCat Local (OCLC) • EBSCO Discovery Service (EBSCO) • Encore Synergy (no index, though)
Web-scale Index-based Discovery ILS Data (2009- present) Digital Collections Search: Web Site Content Institutional Repositories Aggregated Content packages Search Results Consolidated Index Open Access … E-Journals CustomerProfile Usage-generatedData Reference Sources Pre-built harvesting and indexing
Web-scale Search Problem ILS Data Digital Collections Search: Web Site Content Institutional Repositories Consolidated Index Aggregated Content packages Search Results … E-Journals Pre-built harvesting and indexing ??? Non Participating Content Sources Problem in how to deal with resources not provided to ingest into consolidated index
Populating Web-scale index with full text • Citations or structured metadata provide key data to power search & retrieval and faceted navigation • Indexing fulltext of content amplifies access • Every title, phrase, term becomes an access point • Important to understand depth indexing • Currency, dates covered, full-text or citation • Many other factors
Full-text Book indexing • HathiTrust: 11 million volumes, 5.3 million titles, 263,000 serial titles, 3.5 billion pages • HathiTrust in Discovery Indexes • Primo Central (Jan 20, 2012) [previously indexed only metadata] • EBSCO Discovery Service (Sept 8 2011) • WorldCat Local (Sept 7, 2011) • Summon (Mar 28, 2011)
Challenge for Relevancy • Technically feasible to index hundreds of millions or billions of records through Lucene or SOLR • Difficult to order records in ways that make sense • Many fairly equivalent candidates returned for any given query • Must rely on use-based and social factors to improve relevancy rankings
Challenges for Collection Coverage • To work effectively, discovery services need to cover comprehensively the body of content represented in library collections • What about publishers that do not participate? • Is content indexed at the citation or full-text level? • What are the restrictions for non-authenticated users? • How can libraries understand the differences in coverage among competing services?
Evaluating Index-based Discovery Services • Intense competition: how well the index covers the body of scholarly content stands as a key differentiator • Difficult to evaluate based on numbers of items indexed alone. • Important to ascertain now your library’s content packages are represented by the discovery service. • Important to know what items are indexed by citation and which are full text • Important to know whether the discovery service favors the content of any given publisher
Open Discovery Initiative • NISO Work Group to Develop Standards and Recommended Practices for Library Discovery Services Based on Indexed Search • Informal meeting called at ALA Annual 2011 • Co-Chaired by Marshall Breeding and Jenny Walker • Term: Dec 2011 – May 2013
Balance of Constituents Marshall Breeding, Vanderbilt UniversityJamene Brooks-Kieffer, Kansas State University Laura Morse, Harvard University Ken Varnum, University of Michigan Sara Brownmiller, University of Oregon Lucy Harrison, College Center for Library Automation (D2D liaison/observer) Michele Newberry Lettie Conrad, SAGE PublicationsBeth LaPensee, ITHAKA/JSTOR/PorticoJeff Lang, Thomson Reuters Linda Beebe, American Psychological AssocAaron Wood, Alexander Street Press Jenny Walker, Ex Libris GroupJohn Law, Serials SolutionsMichael Gorrell, EBSCO Information Services David Lindahl, University of Rochester (XC) Jeff Penka, OCLC (D2D liaison/observer)
The rise of e-books • Academic libraries: e-books included in aggregated content packages • E-books used primarily for research and consultation, not long reading • Public Libraries: Subscriptions to e-book services that provide an outsourced collection of loanable e-books • K-12 Schools, Colleges, Universities: interest in electronic textbooks
Integrating e-Books into Library Automation Infrastructure • Current approach involves mostly outsourced arrangements • Collections licensed wholesale from single provider • Hand-off to DRM and delivery systems of providers • Loading of MARC records into local catalog with linking mechanisms • No ability to see availability status of e-books from the library’s online catalog or discovery interface