Stacy Kowalczyk
Indiana University
August 30, 2006
Indiana University, Bloomington, IN
Workshop on Scholarly Databases

Introduction
• Stacy Kowalczyk
  • Associate Director for Projects and Services for the Digital Library Program
  • Ph.D. student at SLIS
• Reason for attending
  • Author disambiguation
  • Data preservation
  • Data provenance

Data
• Description
  • Digitized materials
  • Text, music, scores, film, scientific data
  • Metadata – structural, intellectual, administrative
• Purpose
  • General scholarly research
• Audience
  • General public access

Data Statistics
• 33 databases, from historic photographs to the full text of Victorian Women Authors
• Creator
• Creation dates
• Subject categories
• Many collection-specific data fields
• http://www.dlib.indiana.edu/collections/index.shtml
• http://www.dlib.indiana.edu/research/index.shtml

Data Formats
• Data formats / types - what type of data is stored and in what format?
• ASCII text
• XML
• RDBMS
• AIFF, Real Streaming, MP3/4
• TIFF, JPG, PDF

Data Management
• Database technology – Oracle, MySQL, and eXtensible Text Framework (XTF)
• Storage technology – Sun servers attached to the IU Mass Store
• Backup strategy – daily incremental backups with weekly full backups; 2 copies of data stored 100 miles apart

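As a rough illustration of the daily-incremental, weekly-full rotation named in the backup strategy, the sketch below decides which kind of backup falls on a given date and which backups would be needed for a restore. The choice of Sunday for the full backup and the function names `backup_type` and `restore_chain` are assumptions made for this example; the slides do not say how the schedule is actually implemented.

```python
from datetime import date, timedelta
from typing import List

# Assumption for illustration: the weekly full backup runs on Sunday.
# date.weekday() numbers Monday as 0, so Sunday is 6.
FULL_BACKUP_WEEKDAY = 6


def backup_type(day: date) -> str:
    """Return 'full' on the weekly full-backup day, otherwise 'incremental'."""
    return "full" if day.weekday() == FULL_BACKUP_WEEKDAY else "incremental"


def restore_chain(target: date) -> List[date]:
    """Return the backup dates needed to restore the state as of `target`:
    the most recent full backup, then every daily incremental after it."""
    days_since_full = (target.weekday() - FULL_BACKUP_WEEKDAY) % 7
    last_full = target - timedelta(days=days_since_full)
    return [last_full + timedelta(days=i) for i in range(days_since_full + 1)]


if __name__ == "__main__":
    workshop_day = date(2006, 8, 30)
    for day in restore_chain(workshop_day):
        print(day.isoformat(), backup_type(day))
```

With this kind of rotation a restore never needs more than seven backup sets, which is the usual trade-off between running a full backup every day and keeping only incrementals.
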
Organization
• Partners
  • The State of Indiana
  • CIC (the Big 10 +)
  • Digital Library Federation
  • Harvard
• Ownership - Indiana University
• Funding
  • NSF
  • NEH
  • LSTA
  • IMLS

Integration Challenges
• Managing the huge number of small files that belong to one logical object
• Finding better ways to create the links between delivery systems and access systems
• Maintaining metadata consistency

Integration Solutions
• Managing the files
  • METS - the Metadata Encoding and Transmission Standard
  • Oracle referential integrity
• Creating automated processes for linking

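To make the METS bullet concrete, here is a minimal sketch of how many small files can be tied to one logical object: each file is listed once in a `fileSec`, and a `structMap` points at the files in reading order. The element and attribute names come from the METS schema; the function `build_mets`, the object ID, the file IDs, and the example.org URLs are hypothetical and do not describe the Digital Library Program's actual METS profile.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def q(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets(object_id, label, pages):
    """Build a small METS tree for one logical object.

    `pages` is a list of (file_id, mime_type, url) tuples in reading order.
    """
    root = ET.Element(q("mets"), {"OBJID": object_id, "LABEL": label})

    # fileSec: the inventory of every file that belongs to the object.
    file_sec = ET.SubElement(root, q("fileSec"))
    file_grp = ET.SubElement(file_sec, q("fileGrp"), {"USE": "master"})

    # structMap: the logical or physical arrangement of those files.
    struct_map = ET.SubElement(root, q("structMap"), {"TYPE": "physical"})
    top_div = ET.SubElement(struct_map, q("div"), {"TYPE": "item", "LABEL": label})

    for order, (file_id, mime_type, url) in enumerate(pages, start=1):
        file_el = ET.SubElement(
            file_grp, q("file"), {"ID": file_id, "MIMETYPE": mime_type}
        )
        ET.SubElement(
            file_el, q("FLocat"), {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": url}
        )
        page_div = ET.SubElement(
            top_div, q("div"), {"TYPE": "page", "ORDER": str(order)}
        )
        # fptr links the structural division back to the file by its ID.
        ET.SubElement(page_div, q("fptr"), {"FILEID": file_id})

    return root


if __name__ == "__main__":
    # Hypothetical two-page object used only for illustration.
    example_pages = [
        ("F0001", "image/tiff", "http://example.org/obj1/0001.tif"),
        ("F0002", "image/tiff", "http://example.org/obj1/0002.tif"),
    ]
    tree = build_mets("obj1", "Example photograph album", example_pages)
    print(ET.tostring(tree, encoding="unicode"))
```

Because every `fptr` refers to a `file` by ID, a consistency check (or, in a relational store, a foreign-key constraint) can detect a structural entry that points at a missing file.
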
Research References
• www.dlib.indiana.edu