Comparing Gamma and Weibull as Frame Size Distributions for High Efficient Video Coding

Similar documents
Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)

How To Improve Performance Of H.264/Avc With High Efficiency Video Coding (Hevc)

Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder

Study and Implementation of Video Compression standards (H.264/AVC, Dirac)

Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)

WHITE PAPER. H.264/AVC Encode Technology V0.8.0

H.264/MPEG-4 AVC Video Compression Tutorial

Parametric Comparison of H.264 with Existing Video Standards

Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201

Video compression: Performance of available codec software

Figure 1: Relation between codec, data containers and compression algorithms.

Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm

Microsoft Lync. Unified Communication Specification for H.264 AVC and SVC UCConfig Modes V 1.1

THE EMERGING JVT/H.26L VIDEO CODING STANDARD

Complexity-rate-distortion Evaluation of Video Encoding for Cloud Media Computing

THE PRIMARY goal of most digital video coding standards

Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet

Efficient Stream-Reassembling for Video Conferencing Applications using Tiles in HEVC

Comparison of the Coding Efficiency of Video Coding Standards Including High Efficiency Video Coding (HEVC)

Wireless Ultrasound Video Transmission for Stroke Risk Assessment: Quality Metrics and System Design

Video Coding Technologies and Standards: Now and Beyond

IMPACT OF COMPRESSION ON THE VIDEO QUALITY

Multihypothesis Prediction using Decoder Side Motion Vector Derivation in Inter Frame Video Coding


Peter Eisert, Thomas Wiegand and Bernd Girod. University of Erlangen-Nuremberg. Cauerstrasse 7, Erlangen, Germany

Video Coding Standards and Scalable Coding

Introduction to image coding

Multiple Description Coding (MDC) and Scalable Coding (SC) for Multimedia

A Tutorial on Image/Video Coding Standards

Overview: Video Coding Standards

Using AVC/H.264 and H.265 expertise to boost MPEG-2 efficiency and make the 6-in-6 concept a reality

Rate-Constrained Coder Control and Comparison of Video Coding Standards

Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding

Quality Optimal Policy for H.264 Scalable Video Scheduling in Broadband Multimedia Wireless Networks

Overview of the Scalable Video Coding Extension of the H.264/AVC Standard

Traffic Prioritization of H.264/SVC Video over e Ad Hoc Wireless Networks

H 261. Video Compression 1: H 261 Multimedia Systems (Module 4 Lesson 2) H 261 Coding Basics. Sources: Summary:

Internet Video Streaming and Cloud-based Multimedia Applications. Outline

For Articulation Purpose Only

MISB EG Engineering Guideline. 14 May H.264 / AVC Coding and Multiplexing. 1 Scope. 2 References

Enhanced Prioritization for Video Streaming over Wireless Home Networks with IEEE e

How To Improve Performance Of The H264 Video Codec On A Video Card With A Motion Estimation Algorithm

Video Coding Standards. Yao Wang Polytechnic University, Brooklyn, NY11201

White paper. H.264 video compression standard. New possibilities within video surveillance.

Video Network Traffic and Quality Comparison of VP8 and H.264 SVC

MPEG Unified Speech and Audio Coding Enabling Efficient Coding of both Speech and Music

GPU Compute accelerated HEVC decoder on ARM Mali TM -T600 GPUs

The H.264/MPEG-4 Advanced Video Coding (AVC) Standard

Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network

302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 2, FEBRUARY 2009

Standard encoding protocols for image and video coding

Alberto Corrales-García, Rafael Rodríguez-Sánchez, José Luis Martínez, Gerardo Fernández-Escribano, José M. Claver and José Luis Sánchez

JPEG Image Compression by Using DCT

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Comparative Assessment of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders for Low-Delay Video Applications

Image Compression through DCT and Huffman Coding Technique

EQUITABLE QUALITY VIDEO STREAMING OVER DSL. Barry Crabtree, Mike Nilsson, Pat Mulroy and Steve Appleby

Probability Interval Partitioning Entropy Codes

H.264/MPEG-4 Advanced Video Coding Alexander Hermans

2007 IEEE International Conference on Signal Processing and Communications November 24-27, 2007, Dubai, United Arab Emirates

Fast entropy based CABAC rate estimation for mode decision in HEVC

CM0340 SOLNS. Do not turn this page over until instructed to do so by the Senior Invigilator.

White paper. An explanation of video compression techniques.

Introduzione alle Biblioteche Digitali Audio/Video

MPEG-1 and MPEG-2 Digital Video Coding Standards

AUDIO CODING: BASICS AND STATE OF THE ART

THE High Efficiency Video Coding (HEVC) standard is

DCT-JPEG Image Coding Based on GPU

Intra-Prediction Mode Decision for H.264 in Two Steps Song-Hak Ri, Joern Ostermann

Fast Hybrid Simulation for Accurate Decoded Video Quality Assessment on MPSoC Platforms with Resource Constraints

X264: A HIGH PERFORMANCE H.264/AVC ENCODER. Loren Merritt and Rahul Vanam*

Audio Coding Algorithm for One-Segment Broadcasting

We are presenting a wavelet based video conferencing system. Openphone. Dirac Wavelet based video codec

MPEG Digital Video Coding Standards

QOS Requirements and Service Level Agreements. LECTURE 4 Lecturer: Associate Professor A.S. Eremenko

Video Compression Trends & OUTLOOK. Andreas Pentrelli Solution Consultant

NTT DOCOMO Technical Journal. Standardization of HEVC Video Coding Scheme Reducing Video Traffic by Half and Perspective for Mobile Services

EFFICIENT QUANTIZATION PARAMETER ESTIMATION IN HEVC BASED ON ρ-domain

Compressing Moving Images. Compression and File Formats updated to include HTML5 video tag. The DV standard. Why and where to compress

A Look at Emerging Standards in Video Security Systems. Chris Adesanya Panasonic Network Systems Company

Video Encryption Exploiting Non-Standard 3D Data Arrangements. Stefan A. Kramatsch, Herbert Stögner, and Andreas Uhl

How To Test Video Quality On A Network With H.264 Sv (H264)

Transform-domain Wyner-Ziv Codec for Video

Statistical Modeling of Huffman Tables Coding

ATSC Standard: 3D-TV Terrestrial Broadcasting, Part 2 Service Compatible Hybrid Coding Using Real-Time Delivery

Copyright 2008 IEEE. Reprinted from IEEE Transactions on Multimedia 10, no. 8 (December 2008):

STUDY OF MUTUAL INFORMATION IN PERCEPTUAL CODING WITH APPLICATION FOR LOW BIT-RATE COMPRESSION

VIDEOTELEPHONY AND VIDEOCONFERENCE OVER ISDN

An Introduction to Ultra HDTV and HEVC

IMPROVING QUALITY OF VIDEOS IN VIDEO STREAMING USING FRAMEWORK IN THE CLOUD

Digital Audio and Video Data

Understanding Compression Technologies for HD and Megapixel Surveillance

Parallelization of video compressing with FFmpeg and OpenMP in supercomputing environment

REMOTE RENDERING OF COMPUTER GAMES

Evaluating the Impact of Network Performance on Video Streaming Quality for Categorised Video Content

Efficient Data Recovery scheme in PTS-Based OFDM systems with MATRIX Formulation

INTERNATIONAL TELECOMMUNICATION UNION 42!.3-)33)/. /&./.4%,%0(/.% 3)'.!,3

Comparison of Video Compression Standards

MULTI-RESOLUTION PERCEPTUAL ENCODING FOR INTERACTIVE IMAGE SHARING IN REMOTE TELE-DIAGNOSTICS. Javed I. Khan and D. Y. Y. Yun

Transcription:

http://dx.doi.org/10.5755/j01.eee.0.8.8439 ELEKTRONIKA IR ELEKTROTECHNIKA, ISSN 139-115, VOL. 0, NO. 8, 014 Comparing Gamma and Weibull as Frame Size Distributions for High Efficient Video Coding C. Perra 1 1 Department of Electric and Electronic Engineering, University of Cagliari, Piazza D'Armi, IT-0913 Cagliari, Italy cperra@diee.unica.it 1 Abstract Digital video is one of the major traffic components in communication networks. Modelling the frame size of encoded video data is a preliminary step in the research and development of synthetic video data generators enabling a thorough analysis of video architecture systems that are often difficult to perform with real digital video data. In this paper, a statistical analysis of frame sizes of High Efficient Video Coding (HEVC) video generated at bit rates of interest for high quality Full HD video applications is performed. The selection of potential distributions for modelling the HEVC frame size distribution is based on the results from the modelling of H.64 frame size distribution. Experimental results show that the Gamma distribution has a better fit, to the HEVC frame size distribution, than the Weibull distribution. Index Terms Video modelling, video traffic generator, HEVC, Full HD. I. INTRODUCTION Video coding standards have mainly gone through the development of ITU-T and ISO/IEC standards. The ITU-T produced H.6x standards [1], ISO/IEC produced MPEG-1 [] and MPEG-4 Visual [3], and the two organizations jointly produced the H.6/MPEG- Video [4], H.64/MPEG-4 Advanced Video Coding (AVC) [5], and High Efficiency Video Coding (HEVC) [6]. A video sequence is composed by group of frames (GOP). Each GOP is a sequence of frames encoded as: intra frame (I frame), if only spatial redundancy is exploited for the frame compression; prediction (P frame) from previous frames, if both temporal and spatial redundancy are exploited for the frame compression; bidirectional ( frame) prediction when the temporal prediction is computed from previous and future frames. In most video encoders a bit rate control is employed for satisfying bitrate constraints given by the communication channels. Such rate control can target a constant bitrate (CR) video coding or a variable bit rate (VR) video coding. VR is the preferred choice for applications requiring a constant quality service such as digital television broadcasting, DVDs, and lue-ray disks. Research and development of services related to digital Manuscript received January 6, 014; accepted August 5, 014. The author gratefully acknowledges Sardinia Regional Government for the financial support (P.O.R. Sardegna F.S.E. Operational Programme of the Autonomous Region of Sardinia, European Social Fund 007-013 - Axis IV Human Resources, Objective l.3, Line of Activity l.3.1). video often employ synthetic video traffic generation for analysing the behaviour and performance of a given video service with respect to the coding process, the rate control process, and the multiplexing and transmission process. A video traffic model should be strictly related to the video coding standard that will be used for coding digital video, capturing the characteristic of the video sequence. itrate, GOP structure (the sequence order of I,, P frames in a GOP), frame size, frame size distribution, frame sizes dependences (short range dependencies, long range dependencies, autocorrelation), intra-gop correlation, inter- GOP correlation [7] are some of the characteristics of major interest. A traffic model for MPEG-4 and H.64/MPEG4 AVC video traces has been proposed in [7]. Inter-GOP and intra- GOP correlation in compressed VR (variable bit rate) sequences have been addressed by incorporating waveletdomain analysis into time-domain modelling. The proposed model has been developed at frame-size level, allowing for the analysis of loss ratio for each type of frames. A statistical analysis of H.64 video frame size distribution has been presented in [8]. Frame sizes of H.64/MPEG4 AVC encoded video have been modelled with well-known statistical distributions. The main results have shown that the Gamma and Weibull distributions give appropriate statistical distributions of video frame sizes. A video traffic modelling tool for simulation-based performance evaluation studies [9] consider different models for traffic correlations based on the M/G/ process, able to exhibit both short range dependence (SRD) and long range dependence (LRD). A model for controlled VR video traffic reflecting different properties of video traffic related to the content, encoder, and rate controller is proposed in [10]. The model is based on the interaction of video encoder and encoded bit stream by the means of a rate controller. The model parameters depend on the encoding parameters, on the rate control parameters, and on the properties of video content. This model exhibits into the synthetic generated video traffic both LRD and SRD dependences and also some properties related to the content and the encoding parameters. Traffic modelling of VR video is necessary for performance assessment of a network design and for the creation of synthetic loads to be used for benchmarking a network. A recent classification and survey of VR video 44

traffic models [11] provides a review of the state of the art and a classification and comparison among some representative video traffic model. The objective of this paper is to analyse the statistical behaviour of frame sizes in HEVC video generated at bit rates of interest for high definition video applications to be used as a basis for simulation studies of digital television services. The paper is organized as follows. Section II briefly reviews the video coding architecture. The statistical analysis is presented in Section III. Section IV concludes the paper. II. HEVC VIDEO CODING A digital video sequence is composed of video scenes. A video scene is composed of frames (pictures). Each frame is encoded with different algorithms. The main frame types are named I, P, and. I frames are encoded exploiting spatial redundancy for the compression. The encoding of an I frame is self-contained meaning that no other information is required for the reconstruction (decoding) of that frame. P frames are encoded exploiting both spatial and temporal correlations. The encoding of P frames requires the knowledge of previously (reference) encoded frames and the decoder must save all reference frames necessary for decoding the target P frame. The encoding of frames requires the knowledge of previous and future frames (which must be encoded in advance) and the decoder must save all past and future reference frames necessary for decoding the target frame. In general, in MPEG international standards a frame is partitioned into slices and the encoding is performed at slice level, hence the encoding is referred to I-slices, -slices, P- slices. Video encoding rate control is designed for satisfying requirement such as encoding at a given bitrate, encoding at given target quality, encoding at the highest possible quality with a constraint of a target bitrate, encoding at a given bitrate with a constraint of a maximum buffer size. In constant bit rate encoding, the encoders dynamically adjust encoding parameters in order to minimize the difference between the actual bitrate and the target bitrate. Most of the algorithms are based on the adaptation of the quantization parameters at one or more of the following video levels: scene, GOP, frame, slice, macroblock, block. The encoded bit streams while satisfying the CR constraint presents a highly variable video quality. CR is not the preferred choice in digital video entertainment application. VR video encoding can be controlled or uncontrolled. In controlled encoding, the bit rate of the encoded bit stream is controlled by a VR rate controller according to feedback signals from encoding results and coding complexity of video source. The rate controller imposes some constraints on the degree of variability allowed in the bit rate. In uncontrolled encoding, the video pictures are encoded with an almost constant quantization (QP) to provide a relative constant quality for encoded video regardless of coding complexity of video source. Novel service, the increased popularity of high-definition (HD) video, a possible future adoption of video format with resolutions higher than HD have fostered the development of video coding standard improving the capacity of current standards such as H.64/MPEG-4 AVC. Network traffic caused by video applications for mobile devices, or for video-on-demand services, is very demanding for high quality/high resolution videos. The recent High Efficiency Video Coding (HEVC) standard is an ITU-T and ISO/IEC joint project activity developed under a partnership named Joint Collaborative Team on Video Coding (JVT-VC). HEVC is mainly focused into two key points: support higher video resolution and augment the use of parallel processing architectures. Fig. 1. HEVC Video encoder [1]. 45

I POC (Picture Order Count) 0 1 3 4 5 6 7 8 Decoding order 0 4 3 5 7 6 8 1 RPS (Reference Picture Set) --- [0,] [0,4] [,4] [0,8] [4,6] [4,8] [6,8] [0] Fig.. Temporal prediction structure and the POC values, decoding order, and RPS content for each picture. As in the previews ITU-T and ISO/IEC, the HEVC standardizes only the bitstream structure and syntax of the decoder. HEVC video coding layer makes use of the same hybrid approach employed by previous video standard since H.61. Figure 1 shows a diagram of a hybrid encoder for creating a bistream compliant with HEVC. A coding algorithm for producing an HEVC bitstream operates as follows. Each image is decomposed into block units by a coding tree unit (CTU) process. Each block is transmitted to the encoder. The first image of a video sequence is encoded exploiting intra-picture prediction. The other images are encoded exploiting inter-picture prediction defining motion vector (MV) highlighting temporal redundancies between frames. The signal difference between the original picture and the corresponding intra- or inter- prediction is transformed by the mean of linear spatial transform. Transformed coefficients are then scaled and quantized. The quantized coefficients are then encoded (entropy coding) by a context-adaptive binary arithmetic coding (CAAC) process. Finally, the encoded data is transmitted together with the other prediction information. III. STATISTICAL ANALYSIS Test video sequences have been chosen from [13], where several contributions of video content that are most relevant for determining the effectiveness of consumer video processing applications and quality measurement algorithms are provided. The selection of test video sequences has been oriented by the following constraints: YUV 4::0 sampling, full HD 190 1080 resolution, 60 Hz frame rate. The cyclic GOP structure used repeatedly throughout the test video sequence is shown in Fig.. This coding structure is of size 8 (8 slices). This prediction structure is characterized by a structural delay of eight pictures and can provide an improved coding efficiency compared to IP coding [14]. Video sequences have been encoded at the following bitrates: 1,, 4, 6, 8, and 10 Mbps. Figure 3 shows the average PSNR at different bitrates. PSNR (Y) is the average for all the frames, PSNR (Y, only I frame) is the average for the I frames, PSNR (Y, only frame) is the average for the frames. I frames have a higher PSNR than frames. Statistical analysis has been done for each prediction layer. Referring to Fig. : layer 0 (L0) is composed of frames with POC 0 and 8; layer 1 (L1) is composed of frames with POC 4; layer (L) is composed of frames with picture order code (POC) and 6; layer 3 (L3) is composed of frames with POC 1, 3, 5, and 7. For each layer, frame size histograms have been analysed in order to empirically determine a set of statistical distribution to be used as a starting point for fitting to the data sets. An example of the frame size, for 60 consecutive HEVC encoded frames, is shown in Fig. 4. Gamma and Weibull distribution have been selected for the fitting analysis, on the basis of the results obtained when modelling H.64 frame sizes [8]. The parameters of the Gamma and Weibull distributions have been computed using the maximum likelihood estimation (MLE) method. Figure 5 shows an example of a L3 frame sizes histogram, for a video sequence encoded by HEVC at 6 Mbps. The goodness of the fit has been tested using the Pearson's chi-square test. The value of the test is where n (o i ei ), (1) i 1 ei is the Pearson's cumulative test statistic, oi is an is an expected frequency, n is the observed frequency, ei size of the histogram domain. Table I shows the results for each layer for the dataset encoded at 6 Mbps. Numerical results for the other experimental bitrates are similar to the results in Table I. Gamma distribution has shown better fit than Weibull distribution, for each layer and for each bitrate. TALE I. CHI-SQUARE TEST RESULTS FOR EACH LAYER AT 6 MPS. Gamma Weibull L0 191 61 L1 943 1400 L 53 547 L3 40 191 46

Fig. 3. Average PSNR at different bitrates. PSNR (Y) is the average for all the frames, PSNR (Y, only I frame) is the average for the I frames, PSNR (Y, only frame) is the average for the frames. I frames have a higher PSNR than frames. Fig. 4. Example of one second of HEVC encoded frame size at 6 Mbps of a Full HD video (190 108), 60 Hz, YUV 4::0. 10 9 8 7 6 counts 5 4 3 1 0 0 0.5 1 1.5 bins.5 3 x 10 4 Fig. 5. Example of histogram for L3 frame sizes of test video sequences encoded by HEVC at 6 Mbps. IV. CONCLUSIONS Defining the statistical properties of a VR video traffic and relying upon a video traffic model able to produce simulated traffic data having the desired statistical properties enables for an accurate study of the performance of communication networks. In this paper, the statistical properties of frame sizes of HD video sequences encoded by the HEVC standard have been analysed and the Gamma and Weibull distributions 47

have been evaluated and compared as fitting distributions. HEVC videos, selected from a dataset of sports video content-types, have been generated at bit rates of interest for high definition video applications. For a GOP structure of interest in digital video applications the analysis has been conducted at different prediction layers. Experimental results have shown that the Gamma distribution is more appropriate than the Weibull distribution for modelling the statistical distribution of HEVC encoded frame. REFERENCES [1] Video Coding for Low it Rate Communication, ITU-T Rec. H.63, 1995. [] Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to About 1.5 Mbit/s Part : Video, ISO/IEC 1117- (MPEG-1), ISO/IEC JTC 1, 1993. [3] Coding of Audio-Visual Objects Part : Visual, ISO/IEC 14496- (MPEG-4 Visual version 1), ISO/IEC JTC 1, 1999. [4] Generic Coding of Moving Pictures and Associated Audio Information Part : Video, ITU-T Rec. H.6 and ISO/IEC 13818- (MPEG Video), ITU-T and ISO/IEC JTC 1, 1994. [5] Advanced Video Coding for Generic Audio-Visual Services, ITU-T Rec. H.64 and ISO/IEC 14496-10 (AVC), ITU-T and ISO/IEC JTC 1, 003. [6] G. J. Sullivan, J. Ohm, W. Han, T. Wiegand, A. High, E. Video, C. Hevc, Overview of the high efficiency video coding (HEVC) standard, IEEE Trans. Circuits And Systems For Video Technology, vol., no. 1, pp. 1649 1668, 01. [Online]. Available: http://dx.doi.org/10.1109/tcsvt.01.1191 [7] M. D. M. Dai, Y. Z. Y. Zhang, D. Loguinov, A unified traffic model for MPEG-4 and H. 64 video traces, IEEE Trans. Multimedia, vol. 11, no. 5, pp. 1010 103, 009. [Online]. Available: http://dx.doi.org/10.1109/tmm.009.0180 [8] K. Salah, F. Al-Haidari, M. H. Omar, A. Chaudhry, Statistical analysis of H.64 video frame size distribution, IET Communications, vol. 5, no. 14, pp. 1978 1986, 011. [Online]. Available: http://dx.doi.org/10.1049/iet-com.010.0868 [9] M. E. Sousa-Vieira, A video traffic modelling tool for simulationbased performance evaluation studies, in 4th Int. Conf. Signal Processing and Communication Systems, 010. [10] M. Rezaei, I. ouazizi, M. Gabbouj, A model for controlled VR video traffic, in IEEE Int. Conf. Signal Processing and Communications (ICSPC), 007. [11] S. Tanwir, H. Perros, A survey of VR video traffic models, IEEE Communications Surveys & Tutorials, no. 99, pp. 1 5, 013. [1] G. J. Sullivan, J. R. Ohm, Woo-Jin Han, T. Wiegand, Overview of the high efficiency video coding (HEVC) standard, 01. [13] The Consumer Digital Library, 010. [Online]. Available: http://www.cdvl.org [14] H. Schwarz, G. Sullivan, T. Tan, T. Wiegand, Comparison of the coding efficiency of video coding standards including high efficiency video coding (HEVC), IEEE Trans. Circuits And Systems For Video Technology, vol., no. 1, pp. 1669 1684, 01. [Online]. Available: http://dx.doi.org/10.1109/tcsvt.01.119 48