1 / 24

Cold Fusion Verity Oracle Enterprise Search Acrobat PDF Index Assistant Comparison

Cold Fusion Verity Oracle Enterprise Search Acrobat PDF Index Assistant Comparison. Arden O. Weiss ardenweiss@ verizon .net. Scope of Presentation:. The Application using these Tools The Menu Structure Tool Pros/Cons Output Examples Conclusions. The Application Purposes:.

lorin
Download Presentation

Cold Fusion Verity Oracle Enterprise Search Acrobat PDF Index Assistant Comparison

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cold Fusion Verity Oracle Enterprise Search Acrobat PDF Index Assistant Comparison Arden O. Weissardenweiss@verizon.net Arden Weiss

  2. Scope of Presentation: • The Application using these Tools • The Menu Structure • Tool Pros/Cons • Output Examples • Conclusions Arden Weiss

  3. The Application Purposes: • A Cold Fusion Application that: • Organizes all archival Facilities, Goals andDocuments by source and content. • Assists users do archival research using SQL on fields and full-text search logic. • Reports on quantity of archival documents and types thereof. Arden Weiss

  4. The Application Attributes: • Searches the content of: • Archival PDF files • Several Oracle Databases • Email Archive • Other Public/shared Folders • Displays Search Results • Does bean-counting type reports • Access controlled by Database/App. Arden Weiss

  5. The Analyses Tree Structure: • Facilities (name/address/contact data) • Research Goals (descp/conflicts/dates) • Research Documents, Interviews, categorization data (detailed info) ____________________ Full text Verity Search includes all Oracle fields plus PDF documents. Arden Weiss

  6. The Cold Fusion Program Menu: Arden Weiss

  7. The CF Program In Action: • Accessing documents from the top-down. • Searching for Facilities. • Searching for Goals • Searching for Documents • Searching for words in Verity Index. Arden Weiss

  8. Cold Fusion Verity Pros/Cons: • Cold Fusion Verity Pros: • Search under Cold Fusion Program control. • Search speed and results display is fast. • Drilldown logic can be used to add specificity. • Output can be formatted as desired. • Output can be redirected to other data stores. • Index update under Cold Fusion Program control. • Cold Fusion Verity Cons: • Problems Indexing large data stores. • Context for search results not always obvious. Arden Weiss

  9. Oracle Search Pros/Cons: • Oracle Enterprise Search Pros: • Search can do a global search of LAN/WAN. • Indexing of large data stores is excellent. • Search speed is fast even for large data stores. • Output is displayed in Google-like display. • Search can be run external to/parallel w/CF App. • Oracle Enterprise Search Cons: • Search logic limited to simple queries. • Output can not be formatted as desired. • Output is displayed in Google-like display. • Search results export is manual via copy/paste. Arden Weiss

  10. PDF Index Assistant Pros/Cons: • Index Assistant Search Pros: • Indexing of large folders of PDF files works well. • Search speed is fast even for a big set of PDF files. • Results are displayed in a well-organized manner. • Results are displayed/highlighted in full context. • Search can be run external to/parallel w/CF App. • Index Assistant Search Cons: • Search is limited to contents of indexed PDF files. • All included files must be in PDF format. • Keeping the index current is a manual process. • Search results export is manual via copy/paste. Arden Weiss

  11. Oracle Search Screen: Arden Weiss

  12. Oracle Search Example Results: Results can be: - Grouped by: Source, Date, Author, File Format - Sorted by: Relevance, Date, Author, File Format, Title, Path, Language Arden Weiss

  13. Oracle Example Search Results: Matching Attribute Names Include (any or all): Author, Description, Headline 1 2 or 3, Host, Keywords, Language, Last Modified Date, Mimetype, Reference to Text, Subject, Title, Urldepth, Url Arden Weiss

  14. Loading Acrobat Index Builder: Arden Weiss

  15. Opening PDF Index (PDX) File: Arden Weiss

  16. Selecting Folders to Include: Rebuild recreates PDX file from Scratch – about8 min for 1471 PDF files. Build updates existing PDXfile (took seconds when changes were minimal). Arden Weiss

  17. Finding PDF Files to Include: This is a rebuild operation – 1st looks for files to include. Arden Weiss

  18. Building Acrobat PDX Index: Build (update) operation is faster than Rebuild operation. Arden Weiss

  19. Scheduled PDX Index Updates: • Use a catalog batch PDX file (.bpdx) to schedule when to automatically build, rebuild, update, and purge an index. • A BPDX text file contains a list of platform- dependent catalog index file paths and flags. • Use a scheduling application, such as Windows Scheduler, to display the BPDX file in Acrobat. • Acrobat re-creates the index according to the flags in the BPDX file. Arden Weiss

  20. Searching PDX Index (1 of 3): • On Acrobat’s Main Menu click on “Edit” then “Search” or press <Shift> <Ctrl> F to display the Search Window. • Click on Advanced Search Options link at Screen Bottom. • Click on Select Index at top of Window to display: Arden Weiss

  21. Searching PDX Index (2 of 3): • The Search Window then changes to show “Currently Selected Indexes” with excellent search options. • Enter criteria and Press the “Search” button to display results. Arden Weiss

  22. Searching PDX Index (3 of 3): • Search for “WEISS” in the “Currently Selected Indexes” -- whole words only checked. Results shown below. Arden Weiss

  23. Conclusions and Thoughts: • All three search technologies co-exist well. • Oracle Search is not PDF-centric and may be too broad a search function to easily control. • Oracle Search may be a good way to discover what missed being put into the CF Archive. • SQL Server may have functionality similar to Oracle Enterprise Search. • Acrobat Index Search gets you immediately closer to the real PDFs (Verity does not highlight search words displayed PDFs. • Verity and Acrobat are much cheaper dates. Arden Weiss

  24. Tha-Tha-That’s All Folks Arden Weiss

More Related