320 likes | 447 Views
A LOOMING CRISIS: MAINTAINING ACCESS TO ELECTRONIC RESEARCH PRODUCTS. Daphne Fautin University of Kansas Gail Kampmeier Illinois Natural History Survey. Electronic PEET Products. Project web pages Images Literature - publications, reports, field journals
E N D
A LOOMING CRISIS: MAINTAINING ACCESS TO ELECTRONIC RESEARCH PRODUCTS Daphne Fautin University of Kansas Gail Kampmeier Illinois Natural History Survey
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences and other molecular data • Character matrices & keys • Databases - data & structure
What Happens… • When project funding ceases • When project members disperse • When PIs retire, change research topics, move, or … Who will champion access to the electronic resources produced by PEETs, AToLs, BSIs, PBIs, …?
Fate of Our Electronic Resources Who should be responsible? • Institutions originally receiving project funding? • Funding agencies? • Those creating the resources? • Professional societies?
Issues • Who owns the products? (not an issue only for electronic media) • How can the products continue to be served? • How should the products best be preserved?
This is a global issue Among efforts to grapple with it is the 2005 National Science Board Report 05-40 www.nsf.gov/pubs/2005/nsb0540 (NPR this morning on electronic art and art museums)
Issues • Who owns the products? (not an issue only for electronic media) • How can the products continue to be served? • How should the products best be preserved?
Archiving • LIBRARIES have historically been the repository of scholarly output (= publications) • MUSEUMS have been custodians of specimens • Some other physical objects end up in TRADITIONAL ARCHIVES
Archiving • WHICH products should be preserved • HOW should they be preserved • WHERE should they be preserved Locally, supercomputers, electronic archives, etc. Metadata: retrieval requires excellent documentation Software versions: a practical challenge, not a technical one (remember Gene Stoermer!)
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences and other molecular data • Character matrices & keys • Databases - data & structure
Caveats: Pages Not Archived • Anything requiring interaction with the server • Forms, database-generated content • Javascript not resolving in true URLs • Server-side image maps • Pages with robot exclusion headers (robots.txt) • Orphan pages (no links into) • Unknown sites
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences and other molecular data • Character matrices & keys • Databases - data & structure
Images • Scanned • Resolution • Format standard: TIF? • Produced digitally • Format evolution of production software if not saved as flat TIF
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences and other molecular data • Character matrices & keys • Databases - data & structure
Literature, Reports, Field Journals... • Issues similar to images • Format evolution • Media migration • Metadata for retrieval • OCR for finding individual items • Solutions are library-like, requiring recurring infusions of • $$$ • Personnel • Migrate as formats evolve, versions change • Time • Digital lifetime determination
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences and other molecular data • Character matrices & keys • Databases - data & structure
A central archive – a library! Maintained by a Federal agency Gene sequences and other molecular data
Electronic PEET Products • Project web pages • Images • Literature - publications, reports, field journals • Gene sequences • Character matrices & keys • Databases - data & structure
Character Matrices & Keys • DELTA/INTKEY (example of standard in danger of format evolution) • Lucid (now in Version 3.4) • MacClade • PAUP • Hennig86 • MorphoBank • Others…
Relational Databases: Content & Structure • Archiving • Metadata essential for discovery • Convert to flat files • Software-independent format (e.g. comma delimited) • Lose relational structure – but relationships can be coded
Relational Databases: Content & Structure • Continued service • Version changes • High maintenance (some require professional DBA) • One size generally does not fit all – makes it difficult to pass on • Maintain also “front end” (required for queries) • scripting language: e.g. ColdFusion, PHP
TO MAINTAIN ACCESS TO ELECTRONIC RESEARCH PRODUCTS a SILVER BULLETorSILVER BUCKSHOT? Concentration of resources vs. discovery of new methods by diversification
Demonstrate value / usefulness Hits / citations Can be problematic for taxonomy / systematics Become part of large entity
the data portal for and legacy of www.gbif.org (currently the third-largest data provider with nearly 10 million records) www.iobis.org the main provider of marine data to
LIBRARIES have been custodians of scholarly knowledge A distributed resource PORTAL CONTRIBUTORS Maintaining functionality OBIS GBIF FishBase Consortium Individuals Institutions
Develop a clear technical and financial strategy; create policy for key issues consistent with the technical and financial strategy. The Foundation should actively engage with the community to ensure that community policies and priorities are established and then updated in a timely way. www.nsf.gov/pubs/2005/nsb0540
Recurring Challenges • $$$ • Personnel • Time • Format evolution / back compatibility • Metadata – complete, appropriate (controlled vocabulary) • Digital lifetime - determining what, if anything, should be truly discarded