1 / 32

International Atomic Energy Agency International Nuclear Information System (INIS)

International Atomic Energy Agency International Nuclear Information System (INIS). DIGITISATION INIS Training Seminar 7-11 October 2013, Vienna, Austria Thomas Kalapurackal INIS Unit. WHAT IS DIGITISATION?. DIGITISING IS NOT PHOTOCOPYING….!

ramona
Download Presentation

International Atomic Energy Agency International Nuclear Information System (INIS)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. International Atomic Energy AgencyInternational Nuclear Information System (INIS) • DIGITISATION • INIS Training Seminar • 7-11 October 2013, Vienna, Austria • Thomas Kalapurackal • INIS Unit Thomas - INIS Training Seminar-7-11 Oct 2013

  2. WHAT IS DIGITISATION? DIGITISING IS NOT PHOTOCOPYING….! Process of converting paper docs, microfilm, microfiche etc.. Into electronic image files. Thomas - INIS Training Seminar-7-11 Oct 2013

  3. DIGITISATION HARD COPY/SOFT COPY DOCUMENTS FULL TEXT SEARCHABLE ELECTRONIC DOCUMENTS

  4. WHY IS DIGITISATION? It came as a most wonderful and welcome tool in hands of libraries, museums, archives, societies, publishers, and others for preserving billions of paper/analogue documents in digital format. • Retain the original look with a point of view of the future relevance. • Protecting from loss or danger • Effective, efficient and purposeful use • Knowledge Transfer to the next generation Thomas - INIS Training Seminar-7-11 Oct 2013

  5. In The Past……… Paper filing and capturing documents on film were common preservation methods……. Thomas - INIS Training Seminar-7-11 Oct 2013

  6. Time has changed…… And the information must be stored in such media that the storage is safe and the retrieval is quick. Thomas - INIS Training Seminar-7-11 Oct 2013

  7. DIGITISATION • THE AGE OF DIGITISATION HAS BEGUN! Thomas - INIS Training Seminar-7-11 Oct 2013

  8. HISTORY OF DIGITISATION….. The first image scanner developed for use with a computer was a drum scanner. It was built in 1957 at the US National Bureau of Standards by a team led by Russell A. Kirsch. Thomas - INIS Training Seminar-7-11 Oct 2013

  9. HISTORY OF DIGITISATION….. And the first image ever scanned on this machine was a 5 cm square photograph of Kirsch's then-three-month-old son, Walden. Thomas - INIS Training Seminar-7-11 Oct 2013

  10. HISTORY OF DIGITISATION In 1975 Ray Kurzweil invented the flat bed scanner. Kurzweil also was the inventor of text to speech technology Thomas - INIS Training Seminar-7-11 Oct 2013

  11. DOCUMENT SCANNER A flatbed scanner is usually composed of a glass pane (or platen), under which there is a bright light (often xenon or cold cathode fluorescent) which illuminates the pane, and a moving optical array in CCD scanning. Thomas - INIS Training Seminar-7-11 Oct 2013

  12. DOCUMENT SCANNER Leading Flat-bed Scanners in the Market are from: FUJITSU KODAK HP (HEWLETT PACKARD) CANON XEROX EPSON PANASONIC and many more….. Thomas - INIS Training Seminar-7-11 Oct 2013

  13. DIGITISATION VATICAN DIGITISED THE WHOLE LIBRARY COLLECTION RECENTLY…!! Thomas - INIS Training Seminar-7-11 Oct 2013

  14. At INIS……. INIS has two Colour Scanning Stations at present: FUJITSU (Serial No. fi-5750C) (72 page per minute A4 size) KODAK(Serial No. i1440) (75 page per minute A4 size) Thomas - INIS Training Seminar-7-11 Oct 2013

  15. At INIS Since its creation in 1970, INIS collects and disseminate the NCL Reports received from Member States and Intl. Organisations. In 1997 INIS replaced the microfiche-based production system with an imaging system to process and to disseminate all NCL Documents in electronic format. Thomas - INIS Training Seminar-7-11 Oct 2013

  16. Page size (A5 to A0) Single or double side? Quality of document Can we feed the document? Color or B/W? DOCUMENT PREPARATION Thomas - INIS Training Seminar-7-11 Oct 2013

  17. Resolution (300 dpi) Single Side/Double side Format (Tiff/pdf/jpg etc.) Resolution Page size (A5 to A0) Orientation (Portrait/Landscape) Source (Feeder/Flatbed) Brightness Contrast Noise Removal Deskew? SETTINGS Thomas - INIS Training Seminar-7-11 Oct 2013

  18. Deskew? Black Border? Noisy Image Positioning Crop? Clean-up? Rotate? Convert color? Page missing? Insert Page? Re-size? Delete, Split, Copy? QUALITY CONTROL & IMAGE ENHANCEMENT Thomas - INIS Training Seminar-7-11 Oct 2013

  19. Image Enhancement Skewed? Thomas - INIS Training Seminar-7-11 Oct 2013

  20. Image Enhancement • Noisy? Thomas - INIS Training Seminar-7-11 Oct 2013

  21. Image Enhancement Image Positioning: Thomas - INIS Training Seminar-7-11 Oct 2013

  22. Image Enhancement • Black Border? Thomas - INIS Training Seminar-7-11 Oct 2013

  23. Thomas - INIS Training Seminar-7-11 Oct 2013

  24. = Not readable !! = Repeat scanning for better result = OCR will not be perfect = 100% readable ! = Get the best image ! = OCR will be perfect ! VRS – AUTO BRIGHTNESS Without VRS = VRSAUTO-BRIGHTNESS Thomas - INIS Training Seminar-7-11 Oct 2013

  25. Manual settings may not give perfect results always! This one was highlighted with Orange Marker and it is not readable…! VRS – EDGE TRESHOLDING Dokumenten Stapel ohne VRS like, Thomas - INIS Training Seminar-7-11 Oct 2013

  26. Perfect Scanning • No missing texts • Better than Original Thomas - INIS Training Seminar-7-11 Oct 2013

  27. Original OCR Result l’ei’l7nologji Ohne VRS OCR Result technology it VRS -> VRS also gives excellent results in OCR Thomas - INIS Training Seminar-7-11 Oct 2013

  28. Important Features in PixEdit (Version 7.11.18) Thomas - INIS Training Seminar-7-11 Oct 2013

  29. WHEN SHOULD I SCANHARD COPY DOCUMENTS ? • The NCL document is not available in electronic format. • I have a scanner. • Refer to NCL Guidelines Section 4.1, “Scanning to PDF” Thomas - INIS Training Seminar-7-11 Oct 2013

  30. SCANNING DOCUMENTS • INIS STANDARD • 300 dots per inch (DPI) • 400 DPI for small characters • B/W: TIFF CCITT Group 4 or JBIG2 • Color Images: JPEG • PLEASE DO NOT SCAN B/W PAGES in 24 bit color depth • First priority: Scanning Quality Thomas - INIS Training Seminar-7-11 Oct 2013

  31. FIRST SCANNING TEST Send an e-mail with a small test document to INIS Your test document will be analyzed and INIS will tell you if you can continue to submit your NCL full text electronically If the scanning quality is not good enough, INIS will help you to find the best settings for your scanner Thomas - INIS Training Seminar-7-11 Oct 2013

  32. FUTURE People live longer now……. Media too….! THANK YOU….!! Thomas - INIS Training Seminar-7-11 Oct 2013

More Related