210 likes | 351 Views
& Content Management. Digitisation. Services. 600 People – India. Services. Digitisation Services Bibliographic services Content Management Services. Digitisation Services. Full text capture of historic manuscripts 16 th Century Church Records 17 & 18 th century Census Records
E N D
& Content Management Digitisation Services
Services • Digitisation Services • Bibliographic services • Content Management Services
Digitisation Services • Full text capture of historic manuscripts • 16th Century Church Records • 17 & 18th century Census Records • 18th and 19th Century Life Event Records
Bibliographic Services • On site catalogue imaging Services • Retro Conversion of Catalogues • MARC21 • MODS • Finding Aids • EAD
Digitisation of Large volume of Print content • The UK parliamentary debates – The Hansard • 16th - 18th Century American Texts • Legal texts and publications • Historic Newspapers
Archival Newspaper Digitisation • A complete Solution for the historic newspaper Digitisation: • On-site / Off-site Scanning of Paper or microfilm • OCR and clean up • Article level Zoning • Quality Assurance • Hosting & Search solutions
Micrographic services Micrographic lab that can scan and print 16mm or 35 mm microfilms, Microfiche or aperture cards to 600 DPI Tiff images. Capable of scanning up to 40,000 frames / day
Reprographic services Scanning for Newspapers & large format documents Overhead non contact scanning with minimal damage to original pages Capable of scanning up to 10,000 broadsheet pages /day Colour scanning with 10,200 pixels Image Enhancement: Cropping, de-skew, de-speckle, Lighting corrections, histogram adjustment, Filter, Geometrical corrections.
Scanning From Microfilm… Advantages Sometimes only microfilm is available Lower cost for scanning
Scanning From Microfilm… Disadvantages Poor microfilming Process & material technology of 50s & 60s Poor Filming Methods
Scanning from Paper… Advantages Excellent image & Text quality
Scanning from Paper… Disadvantages Badly Stored/ damaged original paper copies. Higher cost of scanning
AEL uses third party software tools as well as own tools for article segmentation • Automatic zoning & article segmentation software is not perfect! • Manual correction of the segmentation is required for 20-40% of the articles.
OCR Problems Most archival Newspapers yield low OCR accuracies. Useless for OCR Poor OCR AEL offers manual OCR clean-up options. • Headline re-key • Proofread / re-key first few lines • Full page proofread
Customized search solutions for the digitised archive Madras
Article level Metadata • METS • ALTO • MODS • MIX • NewsML • Other metadata schemas
Newspaper Digitisation Process Conversion flow WEB Search Images from Paper & Microfilm Scanning Content Content Input Content formatting Zoning & article segmentation Database Server Content database Content Export OCR / Cleanup Quality Assurance OCR Text Images XML metadata Jpeg 2000 Images PDF Web Hosting