Shape-Adaptive DCT for coding arbitrarily shaped objects

Shape-Adaptive DCT for coding arbitrarily shaped objects SrikanthRajagopalan ShantanuKurhekar Stanford University EE 398A Final Project

Overview • Motivation • SA-DCT • Implementation • Results • Analysis • Conclusion • Extension • References

Motivation • DCT – block transform • Edges not encoded differently • Can lead to blocking/ringing effect at edges • Edge distortion • Need to preserve edges or shape perfectly

SA-DCT • Boundary blocks encoded using shape-adaptive transform • In edge block, pixels belonging to background are ignored • Pixels belonging to foreground preserved by shifting to top-left • Overhead info (object mask, block type) required to be transmitted to decoder

Overall Algorithm 1. Divide image into blocks 2. Classify blocks : foreground, background or boundary 3. For foreground blocks, apply DCT 4. For boundary blocks, apply SA-DCT 5. Compress object mask and block type(overhead) 6. Entropy encode coefficients 7. Transmit coefficients and overhead 8. Reverse process at receiver to reconstruct image

SA-DCT implementation

Types of edge blocks

Overhead info • Edge mask: • 1 : boundary pixels • 0 : otherwise • Block type: • 0 : background blocks • 1 : boundary blocks • : foreground blocks • Only edge mask of boundary blocks and block • type are sent

Overhead info • Compressed using RLE and entropy encoded • Used at receiver to get object position in image • For image with large no of boundary blocks > overhead is larger • > more out-of-boundary pixels dropped by SA-DCT

Original image – Pyramid of Giza

Boundary image

Block-type classification

Up-down classification

Left-right classification

Reconstructed images SA-DCT DCT

Closer look at edges SA-DCT DCT

Original image - Pentagon

Reconstructed images SA-DCT DCT

Original image – Tower of Pisa

Reconstructed images DCT SA-DCT

Rate -Distortion Performance Pentagon Pyramid Pisa • Rate for SA-DCT always better • PSNR comparable or higher at high Q step-size • PSNR lower at low Q step-size

Analysis • Higher Q step-size : SADCT better • Comparable PSNR, much better rate • No ringing effect at edges in SADCT • Edges are preserved • Lower Q step-size : DCT better • Preserves whole image • Blocking effect negligible • SADCT has lower PSNR since out-of-boundary pixels ignored

Very high quantization DCT SA-DCT

Conclusion • Use SADCT: • Low bit rate scenario • Maintain object shape • Background unimportant • Use DCT: • Overall image quality • Background important

Extension: Lagrangian Cost • Lagrangian cost used to decide whether SA-DCT or DCT should be applied on edge block • Block with large number of object pixels : DCT • Block with less number of object pixels : SA-DCT • Cost function : • J = D + λ R • where λ = 0.2 * q ^ 2

Extension: Lagrangian Cost Example SA-DCT DCT SA-DCT

References • T. Sikora and B. Makai, “Shape-adaptive DCT for generic coding of video,” IEEE Transactions on Circuits and Systems for Video Technology (CSVT), Feb. 1995 • W. Ng and Z. Lin, “A New Shape-Adaptive DCT for Coding Arbitrarily Shaped Image Segments,” IEEE, 2000 • G. K. Wallace, DEC, “The JPEG Still Picture Compression Standard,” IEEE Transactions on Consumer Electronics, December 1991

Thank You!

Shape-Adaptive DCT for coding arbitrarily shaped objects

Shape-Adaptive DCT for coding arbitrarily shaped objects

Presentation Transcript

Coding ADO.NET Objects: Connection, Command, DataReader

Coding ADO.NET Objects: Connection, Command, DataReader

DCT

Adaptive Detection of Arbitrarily Shaped Ultrashort Quantum Light States

Adaptive Huffman Coding

Shape-Adaptive Transforms for Coding Arbitrarily Shaped Objects and Interactive Streaming

Direction-Adaptive Partitioned Block Transform for Color Image Coding

An Extension to the Dynamic Window Approach for arbitrarily shaped Robots

ADAPTIVE LEARNING OBJECTS SEQUENCING FOR COMPETENCE-BASED LEARNING

A NEW SHAPE-ADAPTIVE DCT FOR CODING OF ARBITRARILY SHAPED IMAGE SEGMENTS

DCT

Dynamically Parameterized Architectures for Power Aware Video Coding: Motion Estimation and DCT

Adaptive Huffman coding

Operationally Optimal VERTEX-BASED SHAPE CODING

Coding ADO.Net DataSet Objects

A Computational Approach to Simulate Light Diffusion in Arbitrarily Shaped Objects

DCT

Adaptive Linear Prediction Lossless Image Coding

A Computational Approach to Simulate Light Diffusion in Arbitrarily Shaped Objects

Coding ADO.NET Objects: Connection, Command, DataReader