90 likes | 206 Views
Seybold SF 2002. Mark Stephens (Managing Director). Who are IDRSolutions?. Established 1999 Based in United Kingdom, resellers in Australia and USA. Customers range from large multi-nationals to individuals. Focus – Systems integration & extracting content from pdf.
E N D
Seybold SF 2002 Mark Stephens (Managing Director)
Who are IDRSolutions? • Established 1999 • Based in United Kingdom, resellers in Australia and USA. • Customers range from large multi-nationals to individuals. • Focus – Systems integration & extracting content from pdf.
Why extract data from pdf files? • Retrieve content from pdf files. • Extract data from legacy systems using printed output which can be easily converted into pdf. xml
Extraction from pdf • Pdf files lack structure so the items on the page are not connected. • We develop algorithms to group the content from different types of page layout to meet customers’ requirements.
What do we offer? Storypad • Enterprise – a high end extraction and repurposing tool. • Personal – a low-end extraction tool for pdf. • Customized – versions to suit specific requirements. • A newLGPL library for pdf. Cross-platform tools written in Java Native windows exe (dll ??)
Java Pdf Extraction Decoder Access Library • Routines to read and parse pdf files • Extraction of raw and scaled/clipped images • Extraction of text fragments as XML • Font information converted to XML metadata • Location on page of objects • Page Rasterizer • Examples included • Active development • Free of all dependencies – ie Acrobat SDK • LGPL license –no license fee, full source code
LGPL and Open Source • Open Source offers ONE way to keep costs down, improve flexibility and match user requirements. • Examples – itext, Zope, JBoss, MySQL, Linux, Xpdf, Ghostscript, Apache, Samba, GIMP….
Free as in air, not beer • Access to the source code. • Right to modify the code. • No license fees required. • No limitations on usage. • Limited lock-in • Commercial support/development available. • No support. • Cannot be passed off as your own work. • Acquisition cost – time to understand the software, modify it to meet requirements, test, support.
More details Visit our websites at www.idrsolutions.com www.jpedal.org Or come and see us…