1 / 9

Seybold SF 2002

Seybold SF 2002. Mark Stephens (Managing Director). Who are IDRSolutions?. Established 1999 Based in United Kingdom, resellers in Australia and USA. Customers range from large multi-nationals to individuals. Focus – Systems integration & extracting content from pdf.

snana
Download Presentation

Seybold SF 2002

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Seybold SF 2002 Mark Stephens (Managing Director)

  2. Who are IDRSolutions? • Established 1999 • Based in United Kingdom, resellers in Australia and USA. • Customers range from large multi-nationals to individuals. • Focus – Systems integration & extracting content from pdf.

  3. Why extract data from pdf files? • Retrieve content from pdf files. • Extract data from legacy systems using printed output which can be easily converted into pdf. xml

  4. Extraction from pdf • Pdf files lack structure so the items on the page are not connected. • We develop algorithms to group the content from different types of page layout to meet customers’ requirements.

  5. What do we offer? Storypad • Enterprise – a high end extraction and repurposing tool. • Personal – a low-end extraction tool for pdf. • Customized – versions to suit specific requirements. • A newLGPL library for pdf. Cross-platform tools written in Java Native windows exe (dll ??)

  6. Java Pdf Extraction Decoder Access Library • Routines to read and parse pdf files • Extraction of raw and scaled/clipped images • Extraction of text fragments as XML • Font information converted to XML metadata • Location on page of objects • Page Rasterizer • Examples included • Active development • Free of all dependencies – ie Acrobat SDK • LGPL license –no license fee, full source code

  7. LGPL and Open Source • Open Source offers ONE way to keep costs down, improve flexibility and match user requirements. • Examples – itext, Zope, JBoss, MySQL, Linux, Xpdf, Ghostscript, Apache, Samba, GIMP….

  8. Free as in air, not beer • Access to the source code. • Right to modify the code. • No license fees required. • No limitations on usage. • Limited lock-in • Commercial support/development available. • No support. • Cannot be passed off as your own work. • Acquisition cost – time to understand the software, modify it to meet requirements, test, support.

  9. More details Visit our websites at www.idrsolutions.com www.jpedal.org Or come and see us…

More Related