1 / 25

Hierarchical Prediction Structures in H.264/AVC

Hierarchical Prediction Structures in H.264/AVC. Kai-Chao Yang. Outline. Analysis of Hierarchical B Pictures and MCTF ICME 2006 Multiple Description Video Coding using Hierarchical B Pictures ICME 2007 Rate-Distortion Optimization for Fast Hierarchical Picture Transcoding ISCAS 2006

cosima
Télécharger la présentation

Hierarchical Prediction Structures in H.264/AVC

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Hierarchical Prediction Structures in H.264/AVC Kai-Chao Yang

  2. Outline • Analysis of Hierarchical B Pictures and MCTF ICME 2006 • Multiple Description Video Coding using Hierarchical B Pictures ICME 2007 • Rate-Distortion Optimization for Fast Hierarchical Picture Transcoding ISCAS 2006 • All Related Researches

  3. Analysis of Hierarchical B Pictures and MCTF Heiko Schwarz, Detlev Marpe, and Thomas Wiegand ICME 2006

  4. Hierarchical B-Pictures (1/2) • Key pictures • Hierarchical prediction structures • Dyadic structure • Non-dyadic structure GOP GOP GOP Hierarchical prediction Hierarchical prediction Hierarchical prediction … IDR I/P I/P I/P … … … … … … … …

  5. Hierarchical B-Pictures (2/2) • Coding delay • Minimum coding delay = hierarchy levels – 1 • Memory requirement • Maximum decoded picture buffer (DPB): 16 • Reference picture buffering type • Sliding window • Adaptive memory control • Memory management control operation (MMCO) • 0: End MMCO loop • 1: mark a Short-term frame as “Unused” • 2: mark a Long-term frame as “Unused” • 3: assign a Long-term index to a frame • 4: specify the maximum Long-term frame index • 5: reset • Minimum DPB size = hierarchy levels Coding order 0 5 4 3 2 1 Frame buffer Short-term frames Long-term frames 0 1 2 … … N-2 N-1 N New Old replace Thomas Wiegand, “Joint Committee Draft (CD),” Joint Video Team, JVT-C167, 6-10 May, 2002

  6. Coding Efficiency of Hierarchical B-Pictures • QPk = QPk-1 + (k=1 ? 4:1) • Problem : PSNR fluctuations High spatial detail and slow regular motion Fast and complex motion

  7. Visual Quality • Comparison of visual quality • Finer detailed regions of the background using larger GOP sizes. IBBP GOP 16

  8. MCTF Versus Hierarchical B-Pictures • Drawbacks of MCTF • Open-loop encoder control • Significant cost in update stage

  9. Multiple Description Video Coding using Hierarchical B Pictures Minglei Liu and Ce Zhu ICME 2007

  10. Concept of Multiple Description Coding • Multiple bit-streams are generated from one source signal and transmitted over separate channels Decoded signal from S1 Decoder 1 Decoded signal from S1 and S2 MDC encoder S1 Source signal Channel 1 Decoder 2 Channel 2 S2 Decoded signal from S2 Decoder 3 MDC decoder

  11. The proposed architecture for MDC • GOP size = 8 • Two output streams (S1, S2) are generated GOP S1 i i+8 GOP S2 i+1 i+9 Combination … … i+1 i i+2 i+3 i+4 i+5 i+6 i+7 i+8 i+9

  12. Coding Efficiency (1/2) • Improvement of coding efficiency • Increasing QP values for higher layers • Transmitting MVs only for higher layers • Skipping frames at higher layers

  13. Coding Efficiency (2/2) Max. QP = 51 for highest level Side distortion Side distortion Central distortion Central distortion

  14. Rate-Distortion Optimization for Fast Hierarchical Picture Transcoding Huifeng Shen, Xiaoyan Sun, Feng Wu, and Shipeng Li ISCAS 2006

  15. Rate Reduction Transcoding (1/3) • Cascaded pixel-domain transcoding structure • Fully decoding the original signal, and then re-encoding it A. Vetro, C. Christopoulos, and H. Sun, "Video transcoding architectures and techniques: an overview", IEEE Signal processing magazine, March 2003.

  16. Rate Reduction Transcoding (2/3) • Open-loop transcoding in coded domain • Partially decoding the original signal and re-quantizing DCT coefficients • drift A. Vetro, C. Christopoulos, and H. Sun, "Video transcoding architectures and techniques: an overview", IEEE Signal processing magazine, March 2003.

  17. Rate Reduction Transcoding (3/3) • Closed-loop transcoding with drift compensation • Partially decoding the original signal, and then compensating the re-quantized drift data A. Vetro, C. Christopoulos, and H. Sun, "Video transcoding architectures and techniques: an overview", IEEE Signal processing magazine, March 2003.

  18. Hierarchical B Pictures Transcoding • Open-loop transcoding method can be used • Motion information is unchanged; DCT coefficients are truncated, re-quantized, or partially discarded • Drift inside a GOP will not propagate to other GOPs • However, motions are more important in hierarchical B-pictures structure • At low bit-rate, most bits are spent on motion information • Proposed RDO model – combination of texture RDO and motion RDO

  19. Traditional Rate-Distortion Model • RD model • S = (S1, …, Sk) denotes k MBs • I = (I1, …, Ik) denotes k coding parameters of S • Fully decoding and re-encoding is needed!

  20. Proposed Rate-Distortion Model (1/4) • Proposed RD model • Claim • Rtexture: rate spent in coding quantized DCT coefficients • Rmotion: rate spent in coding MB modes, block modes, and MVs • Dtexture: distortion caused by downscaled texture with unchanged MVs • Dmotion: distortion caused by motion adjustment relative to the unchanged motion case

  21. Proposed Rate-Distortion Model (2/4) • Texture RDO model • To minimize the RD function,  • Let    N.Kamaci, Y. Altunbasak, and R.M. Mersereau, "Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models", IEEE Trans. on CSVT, Vol 15, No. 8, Aug. 2005. 2.54 -5.35

  22. Proposed Rate-Distortion Model (3/4) • Motion RDO model • Rmotioncan be easily computed, but Dmotion is unknow • Dmotion can be approximated by mv mean-square error A. Secker and D. Taubman, "Highly scalable video compression with scalable motion coding", IEEE Trans. on Image Processing, Vol. 13, No.8, August 2004.

  23. Proposed Rate-Distortion Model (4/4) • Motion adjustment • Original • Adjustment … …

  24. Simulation results

  25. All related researches • Rate control optimization • Bit allocation • Trade-off between coding efficiency and delay • Multi-view • Temporal scalable coding in SVC • Elimination of PSNR fluctuation? • More efficient hierarchical structures?

More Related