Selected Papers from ICIP 2004

Selected Papers from ICIP 2004 Presented by Peter

Secure Media Streaming & Secure Adaptation for Non-scalable Video (Invited Paper) John G.Apostolopoulos Hewlett-Packard Labs, Palo Alto, CA

Summary • Targets • Adapting the media for the time-varying available network bandwidth for non-scalable video • Protect the security of the media • Media data is encrypted while R-D information is unencrypted • Each P-frame is encoded into one packet • Some P-frame is more important than others that give less distortion for when being dropped

Total Distortion Left: Carphone, Foreman Right: MrthrDhtr, Salesman

Secure Adaptive Streaming using a Secure-Media Rate-Distortion Hint Track (SM-RDHT) • SM-RDHT for untrusted streaming server • RD information embedded in hint track • Media itself is encrypted

SM-RDHT

Secure Transcoding at a Mid-Network Node using Secure Scalable Packets • H.264 video is packetized into secure scalable packets • Unencrypted packet headers provide R-D information – the importance of each packet • Mid-network transcoder can perform R-D optimized adaptation across multiple packets of a single stream or across packets of multiple different strems • 1 byte R-D information for each packet (one frame)

Secure Transcoding at a Mid-Network Node using Secure Scalable Packets

Comments • Packet size is limited to one frame that reduce the flexibility • Distortion highly depends on method of error concealment methods used at the decoder that is unknown at the streaming server and Mid-Network nodes

Discussions

Simple AVC-Based Codecs with Spatial Scalability R. Lange, Ł. Błaszak and M. Domański Poznań University of Tachnology, Poland

Summary • Spatial scalability for AVC-Based Codecs • Base layer is fully AVC-compliant • Prediction using the interpolation based layer and previous frame of enhancement layer • Improved motion vector encoding • Codec complexity is comparable to the complexity of a pair of codec used for simulcast coding of two layer • Proposed to the AVC standard

To Layers Structure

Improved Encoding of Motion Vectors in the Enhancement Layer • Optimum motion vectors are used in both layers • Previous proposals used motion vectors in the same layer to predict the current MV • Improved version includes up-scaled base layer MV to predict current MV • Prediction residuals are encoded using CABAC • Directional prediction is used as AVC standard for 8x16 blocks

Improved results from MV Predcition

Prediction in the Enhancement Layer • Two additional reference frames • Interpolated from decoded current base-layer frame • Average of latter and last temporal reference frame • Scalable coding is efficient if temporal prediction and base layer prediction are mixed with substantial prob. For each mode. • Edge-adaptive interpolation technique is used that improve the performance by 1dB

Experimental Results

Comments • The proposed scalable codec is not compliant with AVC • Only 2 layers are presented, Not Finegranularity scalable in spatial resolution • Coding efficiency improvement is small comparing to simulcast

Discussions

Mode Mapping Method for H.264/AVC Spatial Downscaling Transcoding P. Zhang, Y. Lu, Q. Huang and W. Gao Chinese Academy of Science, Microsoft Research Asia

Summary • Focused only on mode decision part • Cascaded Pixel-Domain transcoder (CPDT) • Mode mapping only for 16x16 predict mode for I frames and 8x8 prediction mode for P frames • Save about 50% time cost • High correlation between the four modes of original MB and the corresponding MB at half resolution • Two modes are proposed: • Simple Mode (SimMap) • Use motion vectors information (MapMV)

Simple Mode Mapping Method (SimMap) • For I frames • If more than 1 MB use I4x4 mode => use I4x4 in downsized frame • Otherwise use I16x16 • For P frames • If all MB are I16x16 => I16x16 • If more than 1MB are intra mode => I4x4 • If all are P16x16 or skip mode => P16x16, otherwise P8x8 is selected • 4 sub-modes are decided by direct mapping

Mode Mapping with Motion Vector (MapMV) • For the SimMap, the P16x8 and P8x16 are not utilized • When P8x8 is selected from SimMap: • Compute the distance between all MVs • If all distance < Th, => P16x16 • If D(MV1, MV3) and D(MV2,MV4) <Th, => P16x8 • If D(MV1, MV2) and D(MV3,MV4) <Th => P8x16 • Else P8x8

Comments • PSNR loss more than 2dB at low bit rate

Discussions

A New Rate Control Scheme for H.264 Video Coding P. Yin and J. Boyce Corporate Research, Thomson Inc. Princeton

Summary • A new constant bit rate control method based on TMN8 • Use simple preprocessing to achieve the target bit rate • Better target bit rate, bit allocation, buffer management • Adoption of virtual frame skipping • Frame level and MB level rate control • Simulations show that the method meet the target bitrate even for scene changes and scene transitions

Preprocessing Stage • Chicken and egg problem in RDO: • Quantization parameter QP is needed for RDO such as mode decision • Residue signal is needed to determine the QP, i.e. mode has to be decided to obtain the QP • For I frames, residue signals are estimated using original pixels • For P frames, rate constrained 16x16, 1 reference frame ME is performed to obtain the estimated residue • Average QP of previously coded picture is used the determine the 

Frame-layer rate control • Determine the frame QPf • Use GOP layer rate control • GOP length = 1s • Constraint the number of bit allocated for GOP to prevent buffer overflow • Unused bits are distributed over several following GOPs • Allocate more target bits for P pictures at the beginning of GOP for better references • Virtual buffer level is used to prevent QP deceases very quickly

MB-layer Rate Control • For I picture, a higher distortion is given to MBs with less detail • For P picture, a higher distortion is given to the MBs with more residue errors • Better perceptual quality is maintained for I picture and can be propagated to following P pictures • QP variation within a frame is limited to QPf 2

Virtual Frame Skipping • After a frame is encoded, if buffer level > 90% of total buffer, next frame is virtual skipped until buffer level is less than 90% • Virtual skipped is achieved by code every MB in P picture to skip mode • Increase the frame QP by 2 after a frame is skipped

Comments • An implemented rate control system

Discussions

Video Encoder Complexity Reduction by Estimating Skip Mode Distortion I. Richardson and Y. Zhao The Robert Gordon University

Summary • Predicts MBs that are likely to be skipped by the encoder • By estimating the increase in distortion due not skipping • Complexity reduced as motion estimation, FDCT, quantization and VLC is skipped for a skipped MB

Macroblock Distortion and Skip Prediction • SAE is used as distortion measure: • The difference between the SAEskip and SAEnoskip is used to determine whether a MB will be skipped ot not • SAEnoskip is not available at the encoder, estimated by SAEnoskip of previous frames

Comments

Discussions

An Improved Rate-Quantization Model for Rate Control in Real-Time Video Encoding B. Xie and W. Zeng PacketVideo Corporation, San Diego University of Missouri-Columbia

Rate-Distortion Optimized Video Coding with Stopping Rules: Quality and Complexity M. Moecke and R. Seara Federal University of Santa Catarina

Selected Papers from ICIP 2004

Selected Papers from ICIP 2004

Presentation Transcript

Papers underway from Toronto

Selected Capital Market deals 2004-2013

Selected Issues from BEPS

Selected results from ICHEP

ICIP Screenshots

Selected Papers in CCGrid2004

Highlights From Selected NSSE Studies

Selected Papers from COLING 2004

Notes from First Papers

ECML/PKDD 2007 Selected Papers – Association Mining

Headlines from Iraqi Papers

Diagrams from Past Papers

Selected highlights from LHCb

Selected Topics from Philippians

Selected Results from PHENIX

Selected topics from

Papers from Week 1

Paper Line Copy Papers from Progression Papers Online

Selected Questions from Minute Papers

Selected Topics from Philippians

Selected Answers from Homework