380 likes | 667 Views
Vidwav Wavelet Video Coding Specifications. Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang Institute of Electronics, NCTU. Outline. Framework Main modules Motion Temporal transform Spatial transform Entropy coding Bitstream formation Additional modules Base layer In-band coding
E N D
Vidwav Wavelet Video Coding Specifications Student: Chia-Yang TsaiAdvisor: Prof. Hsueh-Ming HangInstitute of Electronics, NCTU
Outline • Framework • Main modules • Motion • Temporal transform • Spatial transform • Entropy coding • Bitstream formation • Additional modules • Base layer • In-band coding • Wavelet ringing reduction • Conclusions
References • Microsoft Research Asia + ENST + INRIA , “Vidwav Wavelet Video Coding Specifications”, ISO/IEC JTC1/SC29/WG11, M12339, Poznan, July. 2005.
T+2D (Pre-spatial decomposition is void)
ME/MC • Adopted from H.264/AVC. • Performed by minimizing the Lagrangian cost function • J= D + λR
MV Coding • Median prediction • For squre MC block size • 16x16, 8x8, 4x4 • (ABC) or (ABD) • Directional segmentation prediction • If predictor is not available, use median prediction 8x16 16x8 8x4 4x8
Temporal TransformMotion aligned temporal filteringPrediction stepUpdate step
Split MAP MAU MAU MAP Merge Motion Aligned Temporal Filtering • Lifting structure • MAP (motion aligned prediction) • MAU (motion aligned update)
Motion Aligned Temporal Filtering(Cont.) • Temporal lifting steps of 5/3 biorthogonal wavelet
Prediction • Bidirectional connection • Uni-directional connection When left-unidirectional connected When right-unidirectional connected
Prediction (Cont.) • OBMC
Update • Generated from H-frames through MC • Clipping
Structure • Decomposition structure • Description S<>S<>E<>E<>E<>E<>E<>E<>E<> S<> S<> S<> E<> E<> E<> E<>E<> E<> E<> S<> E<> E<> E<> E<> S<> E<> E<> E<> E<> S<> E<> E<> E<> E<>
EBCOT • Embedded Block Coding coding pass Code block Bitplanes Fractional bitplanes
Coding Pass Operation Significant Propagation Pass Magnitude Refinement Pass Previous bit-plane Normalization Pass ZC ZC ZC SC MR Insignificant sample SC Significant sample In previous bit-plane ZC Current bit-plane Significant sample In current bit-plane
Bitstream • N of total bitplanes in the block • The coded bitstream of the block, which consists of 3N-2 segments, each segment corresponding to the output bitstream of one coding pass. • The length of each bitstream segment and/or the position of end-point of each coding pass. • The R-D slope information at the end of each coding pass.
Syntax • Global • GOP • Group of picture for MCTF • Layer • For quick bitstream truncation • Packet • One component (Y, U, or V) of one temporal sunbbad • Subband • 3D subband • Block • Block of 3D EBCOT • Pass • Coding pass
Bitstream Selection • Optimized Truncation Block n+1 Block n RD slope iteration m iteration m-1 Coding pass index Rn Rn+1 ΣRi <= bitrate constrain
Motion Information Prediction • BASESKIP mode • Use base-layer motion vectors as motion predictors Macroblock partitions mode map rule to generate the predictors for higher spatial resolution
Hierarchical B-Pictures • Provide extra- temporal resolutions
In-Band CodingLeaky motion compensationMode-based temporal filtering
In-Band MCTF • The forming of different quality reference of LL • Low quality reference as IP_DIR • High quality reference as IP_LBS
In-Band MCTF • Leaky motion compensation • leaky factor • Attenuate the prediction based on the unknown information at the decoder • make a good trade-off between drifting errors and coding efficiency
In-Band MCTF • Mode-based temporal filtering • Mode I: Low quality reference • Mode 2: High quality reference • Mode is selected by RD cost
Description • Purpose • Reduce artifact after EDWT • Similar like the de-blocking filter in DCT based coding • After experiments,
Wavelet Based SVC • Advantages • Nature for multi-resolution scalability • Open-loop prediction structure • Provides elegant SNR scalability without impairing full exploitation of spatial-temporal correlation • Simplifies the R-D model of the bitstreams. • Facilitates the bitstream truncation • each subband is independent with other subbands
Wavelet Based SVC • Disadvantages • Decomposition modes (coding modes) selection • Texture & side information trade off • Intra-prediction • Badly-matched blocks • Downsampling filter problems