My Work • Design MPEG Surround Encoding Algorithms • Subset coding mode • Parameter band stride • Parameter slots • Adaptive smoothing • Prediction mode for R-TTT box • Implementation in the Reference Software
Outline • MPEG Surround Introduction • Proposed Algorithms • Experimental Results • Demo
Outline • MPEG Surround Introduction • Spatial Hearing • MPEG Surround Encoder • MPEG Surround Decoder
Spatial Hearing • Three parameters describe how humans locate a sound source in the horizontal plane • Interaural Level Difference (ILD) • Interaural Time Difference (ITD) • Interaural Coherence (IC)
MPEG Surround • Low-bitrate parametric coding technology for multi-channel audio signals • Backward compatibility with stereo equipment • Standardization • Call for Proposals (CfP) on Spatial Audio Coding (SAC) in March 2004 • Finalized in July 2006 (ISO/IEC 23003-1)
MPEG Surround Encoder • Capture the spatial image of multi-channel audio • Generate a mono/stereo downmix
MPEG Surround Decoder • Synthesizes the multi-channel output signal • Backward compatibility
Filter Banks • 2 stages
Downmix and Parameter Extraction • Two elementary blocks are combined into hierarchical structures • R-OTT box • R-TTT box
Parameter Sets and Bands • Parameter sets: grouping of time slots • Parameter bands: grouping of subbands
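The two groupings can be pictured with a small sketch that sums per-slot, per-subband energies into parameter sets and parameter bands. The function name and the boundary lists are hypothetical and chosen only for illustration; they are not the band tables of the standard.

```python
def group_energies(energy, band_edges, set_edges):
    """Sum a time/frequency energy grid into parameter sets and bands.

    energy[t][k] is the energy in time slot t and subband k.
    band_edges and set_edges are ascending boundaries (assumed values),
    e.g. [0, 1, 4] describes the bands [0, 1) and [1, 4).
    Returns grouped[s][b] for parameter set s and parameter band b.
    """
    grouped = []
    for s0, s1 in zip(set_edges, set_edges[1:]):
        row = []
        for b0, b1 in zip(band_edges, band_edges[1:]):
            row.append(sum(energy[t][k]
                           for t in range(s0, s1)
                           for k in range(b0, b1)))
        grouped.append(row)
    return grouped
```

Coarser boundaries give fewer parameters to transmit per frame, at the cost of time or frequency resolution.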
R-OTT Box • Create a mono downmix from a stereo input • Extract relevant spatial parameters • Channel Level Difference (CLD) • Inter-Channel Coherence (ICC)
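A minimal sketch of what an R-OTT box computes for one parameter band, assuming real-valued samples and a plain averaging downmix. The actual encoder works on complex hybrid filter bank samples and its downmix rule is not given on the slide, so `r_ott` and its details are illustrative assumptions.

```python
import math

def r_ott(x1, x2, eps=1e-12):
    """Return (downmix, cld_db, icc) for two equally long real signals."""
    e1 = sum(a * a for a in x1)                  # energy of channel 1
    e2 = sum(b * b for b in x2)                  # energy of channel 2
    cross = sum(a * b for a, b in zip(x1, x2))   # cross-correlation at lag 0
    # CLD: energy ratio of the two channels in dB.
    cld_db = 10.0 * math.log10((e1 + eps) / (e2 + eps))
    # ICC: normalized cross-correlation, in [-1, 1] for real signals.
    icc = cross / math.sqrt((e1 + eps) * (e2 + eps))
    # Assumed downmix: simple average of the two channels.
    downmix = [0.5 * (a + b) for a, b in zip(x1, x2)]
    return downmix, cld_db, icc
```

For example, a channel pair where one signal is exactly twice the other yields a CLD of about 6 dB and an ICC of 1.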
OTT Box • Synthesizes two output channels from a mono downmix and the spatial parameters
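The level part of this synthesis can be sketched as follows: a CLD in dB is turned into two gains whose squares sum to one, so the energy of the mono signal is split between the outputs in the transmitted ratio. This sketch deliberately ignores the ICC and the decorrelator that a real OTT box uses; `ott_upmix` is a hypothetical name.

```python
import math

def ott_upmix(mono, cld_db):
    """Split a mono signal into two channels according to a CLD (in dB)."""
    ratio = 10.0 ** (cld_db / 10.0)        # linear energy ratio e1 / e2
    g1 = math.sqrt(ratio / (1.0 + ratio))  # gain for output channel 1
    g2 = math.sqrt(1.0 / (1.0 + ratio))    # gain for output channel 2
    return [g1 * m for m in mono], [g2 * m for m in mono]
```

With a CLD of 0 dB both gains equal the square root of 0.5, so the two outputs carry equal energy.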
R-TTT Box (1/2) • Create a stereo downmix from three input channels • Two ways to reconstruct the third signal • Prediction mode: linear combination • Energy mode: preserving energies
R-TTT Box (2/2) • Prediction mode: 2 Channel Prediction Coefficients (CPCs) and 1 ICC • Energy mode: 2 CLDs
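The "linear combination" idea behind the prediction mode can be illustrated with an ordinary least-squares fit: find two coefficients so that a weighted sum of the two downmix channels approximates the third signal. This is a generic sketch for real-valued signals; the exact CPC definition, downmix weights, and quantization of the standard are not reproduced here, and `estimate_cpcs` is a hypothetical helper.

```python
def estimate_cpcs(l0, r0, target):
    """Least-squares c1, c2 such that c1*l0 + c2*r0 approximates target."""
    a11 = sum(x * x for x in l0)
    a22 = sum(y * y for y in r0)
    a12 = sum(x * y for x, y in zip(l0, r0))
    b1 = sum(x * t for x, t in zip(l0, target))
    b2 = sum(y * t for y, t in zip(r0, target))
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        # Downmix channels are (nearly) linearly dependent.
        raise ValueError("normal equations are singular")
    # Solve the 2x2 normal equations with Cramer's rule.
    c1 = (b1 * a22 - b2 * a12) / det
    c2 = (b2 * a11 - b1 * a12) / det
    return c1, c2
```

Whatever part of the target the two coefficients cannot predict is the prediction error, which corresponds to the residual signal mentioned for the TTT box.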
TTT Box • Prediction mode: • With residual signal: 2 CPCs • Without residual signal: use the ICC to compensate for the energy loss • Energy mode: • Energy reconstruction
Entropy Coding Schemes • Differential coding • Huffman tables
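Differential coding can be sketched as below: quantized parameter indices are replaced by differences to their predecessor (along frequency, or along time against the previous parameter set), which concentrates values around zero so that a Huffman table can code them with short codewords. The function names are illustrative, and the Huffman stage itself is omitted.

```python
def diff_encode(indices):
    """Replace each index by its difference to the previous one."""
    deltas, prev = [], 0
    for value in indices:
        deltas.append(value - prev)
        prev = value
    return deltas

def diff_decode(deltas):
    """Invert diff_encode by accumulating the differences."""
    indices, acc = [], 0
    for delta in deltas:
        acc += delta
        indices.append(acc)
    return indices
```

The first value is sent as a difference to zero here; a real bitstream would typically code that first value separately.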
Outline • Proposed Algorithms • Subset coding mode • Parameter band stride • Parameter slots • Adaptive smoothing • Prediction mode for R-TTT box
New Encoder Structure • Additional modules:
Reference Software Encoder • Parameter sets = 1 • Parameter bands = 20 • Tree structure: 5151, 5152, 525 • Time slots: 16, 32 • Fine quantization • Differential coding in time/frequency, PCM + 1D Huffman
Subset coding mode • 4 coding modes for each parameter subset: • Default • Keep • Interpolation • Lossless • The reference software implements only the Lossless mode
Subset coding mode • Flow chart • Search each mode for the least error • Compare the error to a threshold • Exploits temporal redundancy
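The decision described by the flow chart might look roughly like the sketch below. It assumes that Default reuses fixed default values, Keep repeats the previous parameter set, and Interpolation averages the previous and next sets; the error measure (mean squared error), the threshold, and the function name are all assumptions, since the flow chart itself is not reproduced in the text.

```python
def choose_mode(current, previous, nxt, default, threshold):
    """Pick the cheapest coding mode whose error stays under threshold."""
    def mse(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

    # Candidate reconstructions that cost no parameter bits (assumed).
    candidates = {
        "default": default,
        "keep": previous,
        "interpolation": [(p + n) / 2.0 for p, n in zip(previous, nxt)],
    }
    best = min(candidates, key=lambda m: mse(current, candidates[m]))
    if mse(current, candidates[best]) <= threshold:
        return best
    # No bit-free mode is accurate enough: transmit the values.
    return "lossless"
```

A larger threshold trades parameter accuracy for bitrate, because more subsets fall back to a mode that transmits no values.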
Experimental Results • Only the Lossless mode costs bits • The bitrate can be estimated by :
Experimental Results • Comparisons:
Experimental Results • Two phenomena: • All theoretical results > experimental results • The number of parameter sets affects both the theoretical and the experimental results