280 likes | 424 Views
Querying Graphics through Analysis and Recognition INRIA Lorraine. Research fields. Image processing and segmentation Structural pattern recognition Statistical pattern recognition Information spotting and retrieval In the context of the analysis and recognition of graphics-rich documents.
E N D
Querying Graphics through Analysis and Recognition INRIA Lorraine
Research fields • Image processing and segmentation • Structural pattern recognition • Statistical pattern recognition • Information spotting and retrieval • In the context of the analysis and recognition of graphics-rich documents
Scientific staff Suzanne Collin, Assist. Prof. UHP Philippe Dosch, Assist. Prof. Nancy 2 Bart Lamiroy, Assist. Prof. INPL/Mines Gérald Masini, CR CNRS Salvatore Tabbone, Assist. Prof. U. Nancy 2 Karl Tombre, Prof. INPL/Mines Laurent Wendling, Assist. Prof. UHP PhD students Sabine Barrat, CIFRE contract (pending) Thi Oanh Nguyen, joint supervision with IFI (Hanoi, Vietnam) Oriol Ramos Terrades, joint supervision with UAB, Barcelona (Spain) Jan Rendek, CIFRE France Télécom Jean-Pierre Salmon, FRESH (European project) Zhang Wan, joint supervision with City U. Hong Kong (pending) Daniel Zuwala, MESR grant • Administrative staff • Isabelle Herlich (part time) • Françoise Laurent (part time) • Technical staff • Yamina Smail, Epeires project • X, Fresh project (pending)
Main results 2004-05 • Hierarchical binarization
Main results 2004-05 Focus on symbol recognition – Symbol spotting combining Radon-based signature and structural approach
Main results 2004-05 Improvement of recognition rates through combination of shape descriptors Recognition rates by descriptors The set of images I
Main results 2004-05 Improvement of recognition rates through combination of shape descriptors Application : extraction of letters in heritage documents Ranking Recognition rates
Main results 2004-05 • Raster-to-vector conversion method based on random sampling and parametric fitting
Segmenting the skeleton RANVEC : Random sampling on pairs of vector points Extension of primitive as long as it fits arc or segment (linear regression)
Application domains/transfer • Electrical wiring diagrams in aeronautics FRESH project (FP6 STREP Aeronautics program)
Application domains/transfer • Cultural heritage documents ACI Madonne, FP6 STREP proposal QUIMERA-Doc submitted 9/05
QgarLib : library of C++ classes • QgarApps : applications • QgarGUI : user interface • qgar.org, APP • Refactoring to professional standards • Open architecture (XML) • 80,000 lines of C++ code (comments not counted) • 30 to 40 downloads of code per month • >10 documentation browses per day (robots excluded)
Positioning within INRIA • Fully within one of INRIA’s 7 challenges in strategic plan: Developing multimedia data and information processing • Regular partnership with Imadoc (research group at Irisa) • Joint contacts Texmex (Sym-C)/Qgar with industrial partner • Recent contacts with Lear on browsing of large image bases
Collaborations • National: informal consortium Nancy, Rennes, La Rochelle, Rouen, Tours, Lyon with several joint projects (ACI Madonne, RNTL past and submission, Techno-Vision Epeires, IST submission) and coordination of actions • CVC/UAB, Barcelona: long lasting relationship, associated team SymbolRec, joint PhD supervisions • City University Hong Kong: associated in Epeires, PAI submission accepted, joint PhD supervision • IFI, Vietnam: joint PhD supervision • University of Auckland (NZ), University of Bern, Carleton University (Canada)
Achievements, strengths, weaknesses • Leadership position at international level on graphics recognition • Announced in project and largely addressed: • Symbol recognition and spotting • Performance evaluation • Strong and adequate applicative backing • Improvement in number of PhD students • Still low on permanent workforce
Future work • Scalability of symbol recognition methods • Large number of models • Variations within the same shape class • Combining structural and statistical methods • Hierarchical approach
Future work • Complex symbols
Future work • Dynamic, on-the fly recognition and spotting: from model-based recognition to freehand recognition
Future work • Multi-modal indexation (text / graphics / image / video) in multimedia and document databases (collaborations with Texmex, Lear, …) • Interactivity with user (relevance feedback)
Future work • Performance evaluation • International symbol recognition contests 2003 & 2005 • Epeires • French Techno-Vision program • 4 universities, FT R&D, 1 company + foreign partners UAB & CityU • www.epeires.org • Future research challenges • Simple and non-biased metrics • Ground-truth/recognition output matching methods • Generation of large sets of training and benchmarking data using realistic image degradation models
Future work • Software : increase number of applications