200 likes | 359 Views
Generic Fourier Descriptor for Shape-based Image Retrieval Dengsheng Zhang, Guojun Lu Gippsland School of Comp. & Info Tech Monash University Churchill, VIC 3842 Australia dengsheng.zhang@infotech.monash.edu.au http://www.gscit.monash.edu.au/~dengs/. Outline. Motivations Problems
E N D
Generic Fourier Descriptor for Shape-based Image RetrievalDengsheng Zhang, Guojun LuGippsland School of Comp. & Info TechMonash UniversityChurchill, VIC 3842Australiadengsheng.zhang@infotech.monash.edu.auhttp://www.gscit.monash.edu.au/~dengs/
Outline • Motivations • Problems • Generic Fourier Descriptor (GFD) • Experimental Results • Conclusions
Motivations • Content-based Image Retrieval • Image description is important for image searching • Image description constitutes one of the key part of MPEG-7 • Shape is an important image feature along with color and texture • Effective and Efficient Shape Descriptor • good retrieval accuracy, compact features, general application, low computation complexity, robust retrieval performance and hierarchical coarse to fine representation
Fourier Descriptor • Obtained by applying Fourier transform on a shape signature, such as the central distance function r(t). No Contour Shape Signature Same Contour and Different content
Zernike Moments • Acquired by applying Zernike moment transform on a shape region in polar space. • Complex form • Does not allow finer resolution in radial direction • create a number of repetitions in each order of moment • Shape must be normalized into an unit disk
Generic Fourier Descriptor • Polar Transform • For an input image f(x, y), it is first transformed into polar image f(r, ): • Find R = max{ r( ) }
Generic Fourier Descriptor-II • Polar Raster Sampling Polar Grid Polar image Polar raster sampled image in Cartesian space
Generic Fourier Descriptor-III • Binary polar raster sampled shape images Polar raster sampling Polar raster sampling
Generic Fourier Descriptor-IIV • 2-D Fourier transform on polar raster sampled image f(r, ): where 0r<R and i = i(2/T) (0i<T); 0<R, 0<T. R and T are the radial frequency resolution and angular frequency resolution respectively. • The normalized Fourier coefficients are the GFD.
Generic Fourier Descriptor-V • Rotation invariant Fourier Fourier Polar raster sampled Polar raster sampled PF PF
Generic Fourier Descriptor-VI • Translation invariant due to using shape centroid as origin. • Scale normalization: • Due to f(x, y) is real, only a quarter of the transformed coefficients are distinct. The first 36 coefficients are selected as shape descriptor. • The similarity between two shapes are measured by the city block distance between the two set of GFDs.
Experiment • Datasets • MPEG-7 region shape database (CE-2) has been tested. CE-2 has been organized by MPEG-7 into six datasets to test a shape descriptor’s behaviors under different distortions. • Set A1 is for test of scale invariance. 100 shapes in Set A1 has been classified into 20 groups which are designated as queries. • Set A2 is for test of rotation invariance. 140 shapes in Set A2 has been classified into 20 groups which are designated as queries • Set A3 is for test of rotation/scaling invariance. • Set A4 is for test of robustness to perspective transform. 330 shapes in Set A4 has been classified into 30 groups which are designated as queries. • Set B consists of 2811 shapes from the whole database, it is for subjective test. 682 shapes in Set B have been manually classified into 10 groups by MPEG-7. • For the whole database, 651 shapes have been classified into 31 groups which can be used as queries.
Performance Measurement • Precision-Recall • For each query, the precision of the retrieval at each level of the recall is obtained. The result precision of retrieval is the average precision of all the query retrievals.
Results • Average Precision-Recall on Set A1 and A2 Rotation Invariance Test Scale Invariance Test
Results • Average Precision-Recall on Set A4 and CE-2 General Invariance Test Perspective Invariance Test
Average Precision-Recall on Set B Subjective Test
Results Set A1 Set A4
Set B CE-2
Conclusions • A new shape descriptor, generic Fourier descriptor (GFD) has been proposed. • It has been tested on MPEG-7 region shape database • Comparisons have been made between GFD and MPEG-7 shape descriptor ZMD. • Compared with ZMD, GFD has four advantages: • it captures spectral features in both radial and circular directions; • it is simpler to compute; • it is more robust and perceptually meaningful; • the physical meaning of each feature is clearer. • The proposed GFD satisfies all the six requirements set by MPEG-7 for shape representation .