410 likes | 775 Views
MCTF in Current Scalable Video Coding Schemes. Student: Chia-Yang Tsai Advisor: Prof. Hsueh-Ming Hang Institute of Electronics, NCTU. Outline. Overview MCTF in Interframe Wavelet MCTF in JSVM Comparison References. Outline. Overview Scalable Video Coding MCTF in Interframe Wavelet
E N D
MCTF in Current Scalable Video Coding Schemes Student: Chia-Yang TsaiAdvisor: Prof. Hsueh-Ming HangInstitute of Electronics, NCTU
Outline • Overview • MCTF in Interframe Wavelet • MCTF in JSVM • Comparison • References
Outline • Overview • Scalable Video Coding • MCTF in Interframe Wavelet • MCTF in JSVM • Comparison • References
Scalable Video Coding • Ability to adjust • Different client requirements • Scalabilities • Rate/SNR • Spatial • Temporal
MCTF MCTF = Motion Compensated Temporal Filtering
Rate/SNR Scalability • Progressive approximation 300kbps PSNR=32.2 dB 500kbps PSNR=34.6 dB 1000kbps PSNR=38.2 dB GOP Header Motion Info. Image Data
Wavelet decomposition provides spatial scalability Bit-plane Coder Spatial Scalability
Temporal Scalability 30Hz Video Sequence 15Hz Video Sequence H H H H H H H H1 7.5Hz Video Sequence H2 H2 H2 H2 3.25Hz Video Sequence H3 H3 H4 L
Scalable Video Coding • History HHI (AVC-based) JSVM UNSW (wavelet) RPI (wavelet) MSRA (wavelet) 2005 2004.3 2004.7
Approaches An AVC/H.264-based approach (also DCT-based)
Approaches A wavelet-based approach with t+2D structure.
Approaches A wavelet-based approach with 2D+t structure
Lifting Scheme • 5/3 lifting scheme
Outline • Overview • MCTF in Interframe Wavelet • Barbell lifting • In-band MCTF • Base-layer structure • MCTF in JSVM • Comparison • References
Barbell Lifting Scheme • Purpose: • Improve the accuracy of motion field. • Methods: • Take (5,3) wavelet kernel. • Use “barbell function” to generate prediction /update values.
Barbell Lifting Scheme Prediction Stage Update Stage
In-Band MCTF • Purpose: • Improve coding performance with spatial scalability • Methods: • Leaky motion compensation • Mode-based temporal filtering
In-Band MCTF • The forming of different quality reference of LL • Low quality reference as IP_DIR • High quality reference as IP_LBS
In-Band MCTF • Leaky motion compensation • leaky factor • Attenuate the prediction based on the unknown information at the decoder • make a good trade-off between drifting errors and coding efficiency
In-Band MCTF • Mode-based temporal filtering • Mode I: Low quality reference • Mode 2: High quality reference • Mode is selected by RD cost
Base-Layer Structure • Purpose: • Coding efficiency improvement in low rates • AVC compatible • Methods: • Insert AVC encoding module into MCTF
Base-Layer Structure Encoder Decoder
Outline • Overview • MCTF in Interframe Wavelet • MCTF in JSVM • Base layer structure • Inter-layer prediction • Adaptive prediction/update steps • Comparison • References
Base Layer Structure • Purpose • Coding efficiency improvement in low rates • Compatibility to AVC • Methods • Unrestricted MCTF (UMCTF) • Hierarchical B pictures
Base Layer Structure • UMCTF • Update step is omitted. • Hierarchical B pictures • Fully compatible to AVC Main profile • Non-dyadic decomposition is available
Inter-Layer Prediction • Purpose • Reduce redundancy between layers • Methods • Inter-layer texture prediction • Inter-layer motion prediction
Adaptive Prediction/Update Steps • Purpose: • Delay (Memory)control • Method: • Sub-partitioning of GOP
Outline • Overview • MCTF in Interframe Wavelet video • MCTF in JSVM • Comparison • Cons and pros • Experimental results • References
Wavelet Based SVC • Key features • 3D wavelet decomposition • Open-loop prediction structure • Spatial-temporal resolution scalability • SNR scalability
Wavelet Based SVC • Advantages • Nature for multi-resolution scalability • Open-loop prediction structure • Provides elegant SNR scalability without impairing full exploitation of spatial-temporal correlation • Simplifies the R-D model of the bitstreams. • Facilitates the bitstream truncation • each subband is independent with other subbands
Wavelet Based SVC • Disadvantages • Decomposition modes (coding modes) selection • Texture & side information trade off • Intra-prediction • Badly-matched blocks • Downsampling filter problems
AVC Based SVC • Key features • MCTF/Hierarchical B structure for temporal scalability • Hierarchical B structure with close-loop structure for base layer • Multiple spatial layers for spatial scalability • Multiple FGS layers at each spatial resolution for SNR scalability • DCT coding of all the frames
AVC Based SVC • Advantages • All the RDO and intra-prediction can be used. • It guarantees the quality of the first testing point. • MPEG filter for low resolution video • the target low resolution video is visually good.
AVC Based SVC • Disadvantages • Redundancy between spatial layers
References [1] “Draft of joint scalable video model (JSVM)3.0 reference encoding algorithm description”, ISO/IEC JTC1/SC29/WG11, N7311, Poznan, July 2005. [2] D. Zhang, J. Xu, H. Xiong, and F. Wu, “Improvement for in-band video coding with spatial scalability”, ISO/IEC JTC1/SC29/WG11, M11681, HongKong, Jan. 2005. [3] V. Bottreau, G. Pau, and J. Xu, “Vidwav evaluation software manual”, ISO/IEC JTC1/SC29/WG11, M12176, Poznan, July. 2005. [4] X. Ji, J. Xu, D. Zhao, and F. Wu, “Responses of CE1d: base- layer”, ISO/IEC JTC1/SC29/WG11, M11127, Redmond, July 2004. [5] R. Xiong, J. Xu, and F. Wu, “Coding performance comparison between MSRA wavelet video coding and JSVM”, ISO/IEC JTC1/SC29/WG11, M11975, Busan, April 2005. [6] R. Xiong, J. Xu, and F. Wu, “Response to VidWav EE1”, ISO/IEC JTC1/SC29/WG11, M12286, Poznan, July 2005. [7] J. Reichel, K. Hanke and B. Popescu, “Scalable Video Model V1.0”, ISO/IEC JTC1/SC29/WG11, N6372, Munich, March 2004.