Vidwav Wavelet Video Coding Specifications

Vidwav Wavelet Video Coding Specifications Student: Chia-Yang TsaiAdvisor: Prof. Hsueh-Ming HangInstitute of Electronics, NCTU

Outline • Framework • Main modules • Motion • Temporal transform • Spatial transform • Entropy coding • Bitstream formation • Additional modules • Base layer • In-band coding • Wavelet ringing reduction • Conclusions

References • Microsoft Research Asia + ENST + INRIA , “Vidwav Wavelet Video Coding Specifications”, ISO/IEC JTC1/SC29/WG11, M12339, Poznan, July. 2005.

Framework

T+2D (Pre-spatial decomposition is void)

2D+t

MotionME/MCModes/MV coding

ME/MC • Adopted from H.264/AVC. • Performed by minimizing the Lagrangian cost function • J= D + λR

MV Coding • Median prediction • For squre MC block size • 16x16, 8x8, 4x4 • (ABC) or (ABD) • Directional segmentation prediction • If predictor is not available, use median prediction 8x16 16x8 8x4 4x8

Temporal TransformMotion aligned temporal filteringPrediction stepUpdate step

Split MAP MAU MAU MAP Merge Motion Aligned Temporal Filtering • Lifting structure • MAP (motion aligned prediction) • MAU (motion aligned update)

Motion Aligned Temporal Filtering(Cont.) • Temporal lifting steps of 5/3 biorthogonal wavelet

Prediction • Bidirectional connection • Uni-directional connection When left-unidirectional connected When right-unidirectional connected

Prediction (Cont.) • OBMC

Update • Generated from H-frames through MC • Clipping

Spatial Transform

Structure • Decomposition structure • Description S<>S<>E<>E<>E<>E<>E<>E<>E<> S<> S<> S<> E<> E<> E<> E<>E<> E<> E<> S<> E<> E<> E<> E<> S<> E<> E<> E<> E<> S<> E<> E<> E<> E<>

Entropy Coding3D-EBCOTBitstream

EBCOT • Embedded Block Coding coding pass Code block Bitplanes  Fractional bitplanes

Coding Pass Operation Significant Propagation Pass Magnitude Refinement Pass Previous bit-plane Normalization Pass ZC ZC ZC SC MR Insignificant sample SC Significant sample In previous bit-plane ZC Current bit-plane Significant sample In current bit-plane

Bitstream • N of total bitplanes in the block • The coded bitstream of the block, which consists of 3N-2 segments, each segment corresponding to the output bitstream of one coding pass. • The length of each bitstream segment and/or the position of end-point of each coding pass. • The R-D slope information at the end of each coding pass.

Bitstream FormationSyntaxBitstream selection

Syntax • Global • GOP • Group of picture for MCTF • Layer • For quick bitstream truncation • Packet • One component (Y, U, or V) of one temporal sunbbad • Subband • 3D subband • Block • Block of 3D EBCOT • Pass • Coding pass

Bitstream Selection • Optimized Truncation Block n+1 Block n RD slope iteration m iteration m-1 Coding pass index Rn Rn+1 ΣRi <= bitrate constrain

Base Layer

Embedded Base Layer Codec

Motion Information Prediction • BASESKIP mode • Use base-layer motion vectors as motion predictors Macroblock partitions mode map rule to generate the predictors for higher spatial resolution

Hierarchical B-Pictures • Provide extra- temporal resolutions

In-Band CodingLeaky motion compensationMode-based temporal filtering

In-Band MCTF • The forming of different quality reference of LL • Low quality reference as IP_DIR • High quality reference as IP_LBS

In-Band MCTF • Leaky motion compensation • leaky factor • Attenuate the prediction based on the unknown information at the decoder • make a good trade-off between drifting errors and coding efficiency

In-Band MCTF • Mode-based temporal filtering • Mode I: Low quality reference • Mode 2: High quality reference • Mode is selected by RD cost

Wavelet Ringing Reduction

Description • Purpose • Reduce artifact after EDWT • Similar like the de-blocking filter in DCT based coding • After experiments,

Conclusions

Wavelet Based SVC • Advantages • Nature for multi-resolution scalability • Open-loop prediction structure • Provides elegant SNR scalability without impairing full exploitation of spatial-temporal correlation • Simplifies the R-D model of the bitstreams. • Facilitates the bitstream truncation • each subband is independent with other subbands

Wavelet Based SVC • Disadvantages • Decomposition modes (coding modes) selection • Texture & side information trade off • Intra-prediction • Badly-matched blocks • Downsampling filter problems

Thanks for your attention!Any questions?

Vidwav Wavelet Video Coding Specifications

Vidwav Wavelet Video Coding Specifications

Presentation Transcript

Distributed Video Coding

Video Coding

Scalable Video Coding

Embedded Zerotree Wavelet - An Image Coding Algorithm

Wavelet Based Image Coding

Wavelet-based Region-of-Interest (RoI) Video Coding

Scalable Video Coding with Wavelet-Based Approaches

Video coding

Fully Scalable Multiview Wavelet Video Coding

Video Coding

Fast Lossless Multi-Resolution Motion Estimation for Scalable Wavelet Video Coding

3-D Direction Aligned Wavelet Transform for Scalable Video Coding

Basics of Video Coding and H.263 Video Coding

Motion-Compensated Lifted Wavelet transform for video coding

Video coding

Wavelet Transform and SPIHT Coding

Embedded Zero Tree Wavelet Coding

Video Coding Concept

Video Coding Concept

Distributed Video Coding

Video Coding Standards

Distributed Video Coding