150 likes | 285 Views
Digitization Programmes. National Library of the Czech Republic Adolf Knoll adolf.knoll@nkp.cz. 4M. History. Digitization started in 1992-93 Routine production in 1996 Two national programmes: Memoriae Mundi Series Bohemica (direct scanning – manuscripts, old printed books, maps, …)
E N D
Digitization Programmes National Library of the Czech Republic Adolf Knoll adolf.knoll@nkp.cz 4M
History • Digitization started in 1992-93 • Routine production in 1996 • Two national programmes: • Memoriae Mundi Series Bohemica (direct scanning – manuscripts, old printed books, maps, …) • Kramerius (preservation microfilming + digitization – acid paper materials esp. newspapers and other periodicals 4M
Coverage • Limited access to rare materials • Endangered by frequent use • Endangered by deterioration of carriers, corrosion of writing and printing materials, fading colours, etc. • Special case: brittle acid paper • Need to preserve through offering copies 4M
National programmes • Parts of the Public Information Services of Libraries programme • Grant system under the Ministry of Culture since 2000 • Call for proposals, grant committees, the applicants can require up to 70% of cost • Open access + application of recommended standards 4M
MMSB Yearly: ca. 110,000 pages digitized Altogether: 400,000 pages MSS 140,000 pages old prints Several hundreds of maps 3 devices for direct digitization (cameras and camera scanners) Kramerius Yearly: up to 400,000 pages digitized and up to 1,000,000 filmed Altogether: Ca. 800,000 pages Sound recordings of interviews 2 microfilm scanners Production
High readability Standard platform Enhanced communication SGML and any applications thereof: TEI, Czech DOBM XML These features enable: Direct access Indexing of metadata in sophisticated access tools: ManuFreT (local) AiP SAFE (Internet) Re-export into production formats and conversion into other formats Metadata framework 4M
Visual data • Stored in graphic formats for: • preview (GIF, JPEG) • access (JPEG, limited TIFF/G4) • archival storage (JPEG, limited TIFF/G4) • Optimized for access • production processing (optimal parameters for JPEG) • access processing (on-the-fly DjVu conversion, DjVu) • application of multiresolution formats (MrSID) 4M
Access data: user side • transfer: • modem 56 kbit/s = 7 KB per 1 second • JPEG 2.1 MB = 300 seconds = 5 minutes • DjVu 140 KB = 20 seconds • well-comparable quality • decompression problems • standard, progressive, or successive display • multiresolutional display 4M
Access data: provider side • acceptable quality for established purpose • optimized resolution • optimized lossy compression quality factors • new methods • Mix Raster Content approach (image layers compressed separately and merged) • wavelet encoding scheme • JBIG-similar encoding black-and-white scheme • Multiresolution approach (several resolution stored in one image file) 4M
DCT lossless MRC + wavelet 4M
Administration Reference digital archive on-line Fast disk arrays for access Reference digital archive for off-line media (CD, mg. tapes) Reference archive – preservation microfilm Archiving of and access to reformatted documents
U1, U2, U3, …, Ux - data storage faciltities Grid CESNET2 U3 Virtual integrator international e.g. ECH:TOPICC, VICODI, European Digital Periodicals User(s) U2 Catalogues http://www.memoria.cz periodicals Digital library http://www.cdh.nkp.cz U1