400 likes | 493 Views
Katherine Skinner, Executive Director, Educopia Institute Christina Drummond, Research Associate Professor, University of North Texas. Using Informatics & Visualizations to Understand Digital Preservation Activity. CNI Fall Forum - 2013 Washington D.C. December 10, 2013.
E N D
Katherine Skinner, Executive Director, Educopia Institute Christina Drummond, Research Associate Professor, University of North Texas Using Informatics & Visualizationsto Understand Digital Preservation Activity CNI Fall Forum - 2013 Washington D.C. December 10, 2013
Data Visualization for Education Skinner and Drummond 2013 CNI Fall Forum - 2013
Skinner and Drummond 2013 CNI Fall Forum - 2013
Skinner and Drummond 2013 CNI Fall Forum - 2013
Digital Preservation How can analytics guide the development of a niche field?
Metrics 1) Activity Performance Academic Performance Publications Prestige Frequency Quantity Grants Dollar value Quantity Frequency Awards Collaborators • Success • Impact other projects, sectors • Seeding follow-on projects • Sustainability • Collaborators • Sector diversity • Geographic diversity 2) Organization Performance • Activity level • Leadership • Sector diversity • Program creation, hosting CNI Fall Forum - 2013 Skinner and Drummond 2013
Metrics 3) Composite State Performance 1) Activity Performance • Activity involvement • Organizational involvement • Success • Impact other projects, sectors • Seeding follow-on projects • Sustainability • Collaborators • Sector diversity • Geographic diversity 2) Organization Performance • Activity level • Leadership • Sector diversity • Program creation, hosting Skinner and Drummond 2013 CNI Fall Forum - 2013
Pilot Data Sources Reports: • National Forum on Archival Continuing Education …(2002) • Connecting the Archival Community (2002) • Electronic Records Agenda Report (2003) • Status of the Preservation of Electronic Records by State Archives (2004) • Survey of State Historical Records Advisory Boards (2006) • State of State Records…(2007) • NDIIPP Preserving our Digital Heritage… (2010) • NDIIPP States of Sustainability…(2012) • SERI Phase 1 (2012) • State Historical Records Advisory Boards…(2013) • NEH NCRR Evaluation Report (2013) Websites: • LC Digital Preservation Partners • Best Practices Exchange conferences • SAA, SERI, NASCIO, NAGARA, Legal Information Preservation Alliance, OCLC, LOCKSS, LC, Repository Exchange Surveys: • NASCIO and NAGARA members Grant awards: • IMLS, NEH, NHPRC, NSF (incl. DataNet), NIH, DOE, Mellon Foundation CNI Fall Forum - 2013
Data Collection Who: • Post-doctoral researchers • Digital preservation field experts Caveats: • Sector and organizational type taxonomies need to be refined. • Required publically accessible URL to validate data. • Disproportionately represents efforts with open collaborator lists. Aspire to openness CNI Fall Forum - 2013
Pilot Inclusion Criteria • Collaborative across institutions • Institutional repositories generally not included • Stated digital preservation aim • U.S. partners • Events: Hosts, planning committees CNI Fall Forum - 2013
Pilot Dataset • 3298 records • 211 unique collaborative activities • 1856 organizations • 1274 based in U.S.A. • including organizational subunits CNI Fall Forum - 2013
Activity Data • General info: Name, URL, short description • Activity type classification • Start and End Year • Geographic focus • Founding institution • Current host CNI Fall Forum - 2013
Collaborating Organization Data • Activity involvement • Organizational role • Primary contact (if known) • Address • Structure (e.g. nonprofit, academic) • Organizational Type (e.g. library, museum) • Thematic Focus (e.g. legal, health, aerospace) CNI Fall Forum - 2013
Data Interface 1/3 Excel: • Raw data store • Pivot tables, mockups, printable reports • Relational data in flat data structure • Relational database more ideal, more costly CNI Fall Forum - 2013
Activities within States CNI Fall Forum - 2013
Organizational Engagement CNI Fall Forum - 2013
Directory Function CNI Fall Forum - 2013
Geographic Questions • Concentration of activities, organizations • Under-representation Activity location = organizational collaborator locations CNI Fall Forum - 2013
Data Interface 2/3 • Web-based maps • SQL based, customizable • Ease-of-use factor – simple GUI, quick results CNI Fall Forum - 2013
http://cdb.io/1bf9zwU CNI Fall Forum - 2013
http://cdb.io/17vbRux CNI Fall Forum - 2013
Interactive Queries Dynamic filtering of views • For activities: • by state, NDIIPP funding, activity type, end date • For organizations: • by state, sector (structure), organization type, field CNI Fall Forum - 2013
Data Interface 3/3 • Combination of Excel and CartoDB functionality • Web-based, open data access • Free Risk of pilot data being overextended as final CNI Fall Forum - 2013
Organization Dashboard CNI Fall Forum - 2013
Activity Dashboard CNI Fall Forum - 2013
Engagement by Activity Characteristics CNI Fall Forum - 2013
Measures for Investigation • Geographic hubs • Under-represented areas • Sector, disciplinary engagement • Balance of activity offerings and organization engagement CNI Fall Forum - 2013
Which Sectors Dominate Efforts in an Area? CNI Fall Forum - 2013
State-based Collaboration Many Activities,Many Players Many Activities,Few Players • Increase Programmatic Opportunities Increase Outreach / Engagement Number of Activities Few Activities,Few Players Few Activities,Many Players Number of Involved Organizations CNI Fall Forum - 2013
Most activities fewest collaborators Fewest activities with most collaborators CNI Fall Forum - 2013
Should data driven dashboards support digital stewardship policy and program development?How? CNI Fall Forum 2013 Skinner and Drummond 2013
Who is best positioned to actas data stewards?As interface stewards? CNI Fall Forum 2013 Skinner and Drummond 2013
Which metrics or relationshipsare most useful to measure impact? CNI Fall Forum 2013 Skinner and Drummond 2013
Which metrics or relationshipsare most useful to identify opportunities? CNI Fall Forum 2013 Skinner and Drummond 2013
Which metrics or relationshipsare most useful to identify risks? CNI Fall Forum 2013 Skinner and Drummond 2013
Next phase Funding contingent • Activity lineage tree • Track impact on program development • Individual level data • Identify champions, creators • Transition from pilot to refined taxonomies, maintained data CNI Fall Forum - 2013