470 likes | 600 Views
ISTORAMA: A Content-Based Image Search Engine and Hierarchical Triangulation of 3D Surfaces. Dr. Ioannis Kompatsiaris Centre for Research and Technology Hellas Informatics and Telematics Institute Thermi-Thessaloniki, Greece ikom@iti.gr. Outline. Introduction Istorama architecture
E N D
ISTORAMA: A Content-Based Image Search Engine andHierarchical Triangulation of 3D Surfaces. Dr. Ioannis Kompatsiaris Centre for Research and Technology Hellas Informatics and Telematics Institute Thermi-Thessaloniki, Greece ikom@iti.gr
Outline • Introduction • Istorama architecture • K-Means with Connectivity Constraint Algorithm (KMCC) • Demo • Object/model based coding • Adaptive Triangulation and Progressive transmission • Reduced pyramid - quincunx sampling • Experimental results • Conclusions
Need for efficient image search • Huge number of images or databases of images • Highly visual and graphical nature of the Web • Text descriptors are not always efficient • Greater flexibility with “content-based” access • Queries which are more natural to humans
Proposed approach • Usually a description, a “signature” or a set indexes is created for the whole image • Images usually contain different objects • Proposed approach: the image is first separated into objects (segmentation) • Descriptors are created for each object • The user can search for a specific object contained in images
ISTORAMA architecture World Wide Web User PHP Crawler - Spider Indexing - Retrieval Algorithms Data Base JDBC Java Data Base Connection Server
The K-Means with Connectivity Constraint Algorithm (KMCC) I • Based on K-Means algorithm • K-Means does not take into account spatial information • In KMCC, the spatial proximity of each region is also taken into account by defining a new spatial center and by integrating the K-Means with a component labeling procedure • Automatic correction of the number of regions K
The K-Means with Connectivity Constraint Algorithm (KMCC) II • Step1 K-Means is performed • Step2Spatial centers are calculated • Step3 Generalised distance • Step 4 Component labeling L connected regions
The K-Means with Connectivity Constraint Algorithm (KMCC) III
Object descriptors • Color, texture and spatial characteristics • Color: histogram, 8 bins • Spatial: (centroid), • Shape: area, eccentricity • where λ1, λ2are the two first eigenvalues
Experimental Results (Claire) Original sequence Frames 1-10 Segmentation Moving object Facial region
Experimental results (table-tennis) Original sequence Frames 1-10
Experimental results (table-tennis) Moving objects Segmentation
Experimental results (Akiyo+Foreman) Original sequence Frames 1-10 Facial region Original sequence Frames 1-10 Facial region
Conclusions • K-means with spatial proximity algorithm • Multiple features segmentation • Higher order segmentation • Correspondence of objects between consequent frames • Max-min criterion for automatic regularisation parameters
Future work • Use of texture • Indexing of video • Integration with text descriptors
Introduction • Triangular meshes of high quality are used in: • Computer Aided Design • 3D representation of objects (e.g. archaeological artifacts) • Animation and visual simulation • Entertainment (computer games) • Digital Terrain Modelling
Adaptive triangulation Compressionof finely detailed surfaces is necessary for: • computation • storage • transmission • display efficiency
Progressive transmission • Early, coarse approximations are refined though additional bits
Background • Vertices removal and retriangulation [Schroeder] [Cohen] • General mesh optimization process/function [Hoppe] • Multiresolution analysis (MRA) [Lounsbery] • Wavelets [Schroeder] [Gross] • Progressive transmission [Schroeder] [Hoppe] • Generalized triangle mesh representation [Deering]
Properties of the algorithm • Efficient compression of the wireframe information • Simplification of the wireframe by adaptive triangulation • Progressive transmission of the wireframe information • Prioritised transmission of the wireframe • Straightforward correspondence between successive scales
Input surfaces • Surface represented as a parametric function in the parametric space • determined by the position of a set of control points or nodes • It allows for arbitrary, possibly closed wire-frame surfaces to be defined.
Input surfaces • The filters are applied to the 2D parametric representation of the surface as though it were a 2D image with intensity equal to • Such surfaces include also: • depth images estimated from stereo pairs and • every surface that is homomorphic to a plane, cylinder or torus
Optimal filters • Optimal filters are determined by their Fourier transform: • where is the power spectral density. • Alternatively may be determined by the equation:
Optimum bit allocation • bits/vertex is assumed to be transmitted • bits/vertex are allocated to each level using • is the sum of error variances
Error prioritization • The prediction errors corresponding to all predicted vertices are calculated and sorted with the vertices corresponding to higher errors being put first on the list Lower Errors Higher Errors
Entropy estimation • Entropy coding is used • The number of bits needed for error transmission is the entropy of the errors • Using the quincunx sampling geometry at the receiver, there is no need to transmit the exact co-ordinates of the position of each transmitted vertex • The final cost of the transmission is the sum of the error entropy and the position entropy
Adaptive Triangulation Procedure • Synthesis stage of the QMVINT pyramid • The vertex along with the vertices used to predict it are added to the mesh • Handling of cracks • Triangulation of the next vertex
Experimental results • Original dense depth map and surface of the “Venus” data
Experimental results • 2569 vertices and 4006 triangles at level 2 MSE = 1.30
Experimental results • 7661 vertices and 11135 triangles at level 1 MSE = 1.30
Experimental results • 11416 vertices and 15827 triangles at level 0 MSE = 0.12
Conclusions • Hierarchical representation of 3D surfaces using 3D adaptive triangular wireframes • The variance of the error transmitted is minimised and therefore results to optimal compression of the wireframe information • It produces a hierarchy where coarse meshes are as similar to their finer versions as is possible
Conclusions • The triangulation algorithm is integrated with a bit allocation procedure • The number of nodes and triangles of the wireframe as well as the information needed for the transmission or storage of the wireframe are reduced simultaneously using a unified approach (QMVINT filtering) • Precise correspondence between triangles at each level is achieved
Future work • Expansion and application directly to 3D surfaces • Estimation of filters