210 likes | 455 Views
Advanced OCR with OmniPage and FineReader. Overview. Optical character recognition Structural recognition Options Loading Zoning OCR Editing. Optical Character Recognition (OCR). OCR turns pictures of text into e-text Does well unless… The picture is fuzzy The contrast is poor
E N D
Overview • Optical character recognition • Structural recognition • Options • Loading • Zoning • OCR • Editing
Optical Character Recognition (OCR) • OCR turns pictures of text into e-text • Does well unless… • The picture is fuzzy • The contrast is poor • The font is unusual • The font is too small or too large • The material has unusual characters
Structural Recognition • Analyzes the layout of the page • Columns • Headings • Graphics • Tables • Usually does fairly well, unless the layout is non-standard
Programs that Run OCR • Programs for consumers • Kurzweil 1000, 3000 • OpenBook • Intel Reader • Many others… • Programs for production • ABBYY FineReader • Nuance OmniPage
Consumer Programs • Highly automated • Designed for individuals who have print disabilities • Are not good production tools • Do not provide flexibility • Do not allow much overriding • Interfaces not designed for editing
Production Programs in General • A good program for production allows you to… • Control the zones (areas or blocks of text and graphics) • Add, delete, change • Edit easily • Improve recognition
Preferred Programs • ABBYY FineReader • Relatively easy to learn • Fairly intuitive • Good structural recognition • Nuance OmniPage • Less intuitive but more accessible • Often does better with technical materials
Both Good Tools • If you can afford to have both, it’s nice, but not absolutely necessary. • If you have both, run a couple test pages through each to see which is doing better on a particular job.
Under the Hood • For best results with a program, set up your options before you begin! • Tools > Options
Lots of Languages • FineReader and OmniPage handle multiple languages. • For foreign language, turn on all the languages in the book. • It will recognize the diacritical marks. • Turn on what you need, but only what you need.
Math • If you are running OCR on math, try turning on Greek. • Greek will allow the program to recognize alphas, deltas, sigmas, etc.
Another Decision • Detect page orientation or not? • Does not always get it right • Try it if you have many pages turned
Considerations • You may or may not want to keep headers and footers. • I generally keep them to pull the page numbers. • You may want to keep the page breaks. • Retaining page breaks helps to maintain one-to-one page correspondence with the book.
Fitting Everything • In some cases, you may need to work with a custom paper size to fit everything onto one page. • This feature can be helpful when you are retaining everything on the page but not the layout.
Loading Files • “Open” • Opens saved program files • “Load” • Loads image files to process • Note that this same issue comes up with saving!
Wizards Are Evil… • Do not rely on the automation • Load the image file and choose the processes you want
Workspace • The program has three primary areas • Pages Pane • Either thumbnails or details • Allows simple navigation of pages • Image Pane • Your graphic • Text Pane • Area where the text from OCR will show
More Accessible • Both programs have a detail view. • Shows text instead of graphics • Detail view is more accessible for screen readers. • Otherwise, it is personal preference.
Two Ways to Save • To Save the program file to access later in the OCR program, choose File > Save • This saves your work file. • You save your converted file during the last phase of the processing.
Production Tips • Work with dual monitors • Check your computer and video card • Stretching an OCR program across two monitors is a HUGE time-saver! • Learn to use keyboard shortcuts. • They save tons of time!