Adaptive Block Truncation Filter for MVC Depth Image Enhancement
|
|
|
- Bethany Henderson
- 10 years ago
- Views:
Transcription
1 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) Adaptive Block Truncation Filter for MVC Depth Image Enhancement Xuyuan Xu 1, Lai-Man Po 1, Chun-Ho Cheung 2, Litong Feng 1, Kwok-Wai Cheung 3, Chi-Wang Ting 1, Ka-Ho Ng 1 1. Department of Electronic Engineering, City University of Hong Kong, Hong Kong SAR, China 2. Department of Information Systems, City University of Hong Kong, Hong Kong SAR, China 3. Department of Computer Science, Chu Hai College of Higher Education, Hong Kong SAR, China ABSTRACT In Multiview Video plus Depth (MVD) format, virtual views are generated from decoded texture videos with decoded depth images through Depth Image based Rendering (DIBR). 3DV-ATM is a reference model for H.264/AVC based Multiview Video Coding (MVC) and aims at achieving high coding efficiency for 3D video in MVD format. Depth images are first downsampled then coded by 3DV-ATM. However, sharp object boundary characteristic of depth images does not well match with the transform coding of 3DV-ATM. Depth boundaries are often blurred with ringing artifacts in the decoded depth images that result in noticeable artifacts in synthesized views. This paper presents a low complexity adaptive block truncation filter to recover the sharp object boundaries of depth images using adaptive block repositioning and expansion for increasing the depth values refinement accuracy. This new approach is very efficient and can avoid false depth boundary refinement when block boundaries lie around the depth edge regions and ensure sufficient information within the processing block for depth layers classification. Experimental results show that sharp depth edges can be recovered using the proposed filter and boundary artifacts in the synthesized views can be removed. The proposed method can provide improvement up to 3.25dB in the depth map enhancement and bitrate reduction of 3.06% in the synthesized views. Index Terms Multiview Video plus Depth (MVD), Depth Image based Rendering (DIBR), 3DV-ATM, Multiview Video Coding (MVC), depth image compression, depth image filter. 1. INTRODUCTION In the last two decades, 3D video (3DV) technology advances significantly. It evolves from the simple stereoscopic system to more realistic multiview video systems, such as autostereoscopic 3DV displays and freeview TV [1, 2]. However, realization of multiview systems using large number of texture video views is not efficient. Depth-enhanced video format such as multiview video plus depth (MVD) format has attracted much attention due to its high efficiency and compatibility with conventional multiview video systems. Instead of coding numerous views, two or three texture views with their corresponding depth images are used in Multiview Video Coding (MVC) standard. Virtual views can be generated by specifying the preference viewing angles through the Depth Image based Rendering (DIBR) [3]. There are two reference test models for 3DV codecs, 3DV-ATM [4] and 3DV-HTM [5], which are based on H.264/AVC and H.265/HEVC, respectively. In 3DV-ATM, depth images are downsampled by half in both horizontal and vertical directions using the MPEG 13- tap downsampling filter [6]. After that, texture videos with reduced resolution depth videos are encoded using 3DV- ATM coding scheme. In decoding, bilinear filter is used to upsample the decoded depth images into original resolution. Coding techniques in 3DV-ATM not only include conventional intra-view prediction and inter-view prediction for texture and depth videos, but also contain new coding techniques favorable for higher compression ratio, for example, view synthesis prediction, in-loop joint inter-view depth filter, depth range based weighted prediction for depth coding, etc. The characteristic of depth image, however, is different from that of texture image. Depth image is a piecewise smooth image that has large homogeneous regions within objects and sharp depth changes at object boundaries, as shown in Fig. 1(a). H.264/AVC based coding method in 3DV-ATM, however, does not handle well at sharp object boundaries, which are always blurred by subsampling or transform coding even with ringing artifact (depth inconsistencies around boundaries) in the decoded depth images as shown in Fig. 1(b). This results in annoying artifacts in the synthesized views, as shown in Fig. 1(d). Fig. 1 (a) Original depth image (b) Compressed depth image with QP=32 (c) Synthesized view from uncompressed texture and depth (d) Synthesized view from compressed texture and depth. (a) (b) (c) Fig. 2 (a) Decoded depth with edge blocks (b) Cropped rectangular region of the decoded depth (c) The refined block A position. To tackle this problem, many depth map enhancement filters have been proposed. They are generally classified as pixelbased or block-based depth map filters. In pixel-based depth map filter, bilateral filter [7] and clustering of depth layer [8] are the most popular approaches. In [7], a depth adaptive joint bilateral was proposed. It takes the advantages of smoothing regions inside object boundary and preserving sharp edges. But the proper selection of filter parameters has great impact on sharp edge recovery. In the depth clustering /14/$ IEEE 544
2 filter and depth reconstruction filter [8], sharp edge can be retained through the depth layer classification. However, pixel-based depth map filter may create inconsistent depth values around the depth edge regions and generate ringing artifacts. In addition, this depth inconsistency causes texture distortion in synthesized views. In general, the block-based depth map filter can avoid inconsistent depth values and remove ringing artifacts. In [9, 10], a depth map postprocessing filter is designed to minimize the compression artifacts by analyzing the histogram inside each block. Unfortunately, layer classification fails in the situation that the boundaries of the blocks lying around the depth edge regions due to the lack of information of all the depth layers. In addition, the conventional block-based depth map filters divide the depth image into non-overlapped blocks. Only blocks containing large depth discontinuity edges, like blocks with white boundary as shown in Fig. 2(a), are filtered. However, some of these block boundaries located closer to edge regions are similar to the block A shown in Fig. 2(b). In this case, sharp edges are difficult to be recovered due to insufficient depth layer information within the block. For example, the block A of Fig. 2(b) contains foreground information and small part of the smooth edges. It is because the smooth edge regions only take very small percentages of the block A. They will be treated as noise and refined as the foreground regions. The lack of information of background regions makes it difficult to recover the original sharp edge. Sometimes it even degrades the quality of synthesized views in both objective and subjective evaluations that cause the overall performance to degrade after applying the block filter. If these blocks can be placed in the correct position, such as the center of block A is located at the vertical edge regions as shown in Fig. 2(c), the refinement of depth values has reliable reference since this center positioning provides sufficient depth values for each of the depth layers. Smooth edge regions of block A can be easily treated as noise and refined using the correct background or foreground depth layer information to recover its original depth values. In this paper, a block-based filter with adaptive block positioning and expansion is proposed to sharpen the compressed smooth depth edges. Details of proposed filter are presented in section 2. Experimental results are shown in section 3, and a conclusion is drawn in section ADAPTIVE BLOCK TRUNCATION FILTER Depth image exhibits a high degree of spatial correlation except at object boundaries [11,12]. Object boundaries are more sensitive to coding errors compared to smooth and flat regions [12]. The proposed filter aims to enhance depth edge regions and recover the original sharp edges from blurred edges caused by depth image subsampling or compression. The proposed filter includes three steps: (1) Edge Pixel Detection, (2) Edge Block Repositioning and Expansion, and (3) Depth Refinement and Layer based Smoothing. Details of each step are described in following subsections EDGE PIXEL DETECTION Sharp depth edge pixels are detected by the horizontal and vertical gradients of depth image defined as below: =, (1) =. (2) If horizontal or vertical gradients are larger than a threshold ( ) (i.e. > or > ), depth pixel at position (x, y) is treated as sharp edge pixels, i.e. the detected edge pixels in white color as shown in Fig. 3(a). Since the view synthesis of ATM only has horizontal disparities, in our experiment, a 1-D kernel, (-1, 1), is used to calculate both horizontal and vertical gradients and D T is set adaptively based on the hole s size created by the depth value difference between two adjacent pixels to the virtual view. The location of the virtual view is the nearer neighborhood of the input view. The threshold D T is calculated as: = h 1 1 ( 1 (3) ) where t c and f are the baseline distance and the focal length, respectively. z n and z f represent the nearest distance and the farthest distance in the scene. h is the hole size and set to 2. In 3DV-ATM, the camera parameters are encoded and transmitted to the decoder. Thus, using the adaptive threshold D T, no extra data is needed to transfer to decoder. Also, no modification is required for the encoder. (a) (b) Fig. 3 (a) Example of edge pixels that detected from a decoded depth image (b) A depth image is divided into non-overlapped blocks. (a) (b) (c) Fig. 4 (a) Detected depth edge blocks (b) Depth edge blocks after repositioning (c) Depth edge blocks after repositioning and expansion ADAPTIVE BLOCK POSITIONING AND EXPANSION After edge pixel detection, a depth image is divided into M M non-overlapped blocks as shown in Fig. 3(b). Block size is set adaptively based on the image resolution as = 2 ( ), (4) where W is the width of the depth image, a is the constant (a = 125) and the round( ) function rounds the input value to its nearest integer value. The non-overlapped blocks that contain edge pixels are considered as edge blocks as shown in Fig. 2(a) and only these edge blocks will be processed. In order to avoid the boundaries of these edge blocks lying too close to the smooth depth edge regions as the problem 545
3 described in the introduction, the positions and sizes of these edge blocks are adjusted before depth refinement. Basically, the block repositioning aims at locating most of the depth edge pixels at the block s center. This repositioning can be efficiently realized by using the average of edge pixels positions within the processing block as the block s center. Then, the top-left position of the repositioned edge block can be determined by =(, ) =, (5) where P i is depth pixel s position (x i, y i ), represents the set of depth edge pixels inside the processing block, N is the total number of depth edge pixels inside the M M block, and Q=(M/2, M/2). In Eq. (5), the subtraction of Q is used to get the xy-coordinate of the top-left position of the block. Fig. 4 shows examples of edge block locations before and after repositioning. Center of the edge blocks can be located to vertical depth edges after this simple process as shown in Fig. 4(b). However, this repositioning may cause some depth edge pixels that lie inside the edge block before reposition to outside region of the block as indicated in the circle region in Fig. 4(b). In order to tackle this problem, size of the edge block is also needed to adaptively expand for covering all the missing depth edge pixels. This block expansion process can be realized by using the minima of horizontal and vertical coordinates between the edge pixels inside the block before repositioning and the repositioned coordinates as the new top-left position of the expanded block. Thus, the top-left position of the expanded edge block can be expressed as = (, ) = min,min { }, min,min { } (6) where the function min(a, b) is used to find the minimum value between a and b. The expanded block size can be respectively determined by the maxima of horizontal and vertical distances from to its lower-right block coordinate (x B +M, y B +M) of the repositioned block and from to maxima edge pixel coordinates inside the block before repositioning. Thus, and can be expressed as =max +, max { } (7) =max +, max { } (8) where the function max(a, b) is used to find the maximum value between a and b. Fig. 4(c) shows the edge blocks after the adaptive block expansion. By using these repositioned and expanded edge blocks for depth values refinement, the refinement accuracy could be significantly improved DEPTH REFINEMENT After the edge block repositioning and expansion, a modified block truncation method is used to refine the depth values. It first classified the processing block into two layers, foreground (R F ) and background region (R B ) by comparing depth values with the mean depth value (D m ) of the block as: = {(, ) (, ) }, (9) = {(, ) (, ) < }, (10) where (, ) is the depth value at location (x, y). After the layer classification, mean values of foreground region and background region are calculated and denoted as and. Depth values of detected edge pixels are refined by selecting the nearer value of and as follow: (, ) =, (, ) (, ), (11), h where (, ) is the refined depth value. With this depth refinement, the blurred depth edges can be recovered. Since block size is set to a relative small value (eg: 8 8 for image with resolution of ) and the refined block is the edge regions, it has very high chance of having only two layers and makes the block truncation very efficient. To further remove the ringing artifacts, a layer based smooth filter is used. A 3 3 average filter is independently applied to the foreground region (R F ) and background region (R B ). Sharp edges can be maintained and the noise around the edges regions due to compression can be minimized. 3. EXPERIMENTAL RESULTS Experiment is conducted according to codec configuration (EHP profile) of 3DV-ATM 5.0 common testing conditions [13]. The proposed adaptive block truncation filter (ABTF) is implemented in the decoder and applied to the decoded depth images without dilation filter since our purpose is to recover depth images from compression. Results from ABTF are compared with pixel-based filter using adaptive joint bilateral filter (AJBF) [7] and block-based filter using adaptive sharpening filter (ASF) [8]. Three texture views plus three depth views in MVD format are encoded and decoded by the 3DV-ATM. The decoded depth images are enhanced by AJBF, ASF and ABTF and compared with the original depth image without downsampling by Peak signalto-noise ratio (PSNR). The decoded texture videos with the enhanced depth videos are used to generate the virtual views by 3DV-HTM 4.0 rendering reference software according to common test conditions [13]. Six virtual views between three cameras (evenly distributed) are synthesized for evaluation. The virtual views generated from original texture videos with original depth videos are used as the reference views for objective performance by using PSNR. The cropped original, compressed with different QP values, and enhanced depth images of Undo_dancer are shown in Fig. 5 and Fig. 6. Their corresponding synthesized views are shown in Fig. 7, Fig. 8. The large distortions around the object boundaries due to subsampling and transform coding are easily observed in the decoded depth as shown in Fig. 5(b). Depth images enhanced by AJBF and ASF still have blurred edges around depth transitional regions as in Fig. 5 (b) and (c). In view synthesis, the blurred edges result in the annoying boundary artifacts as in Fig. 7. Both AJBF and ASF are based on bilateral filter and try to recover the compressed depth image. Depth images enhanced by ASF are better than AJBF, since the ASF optimizes the parameter for each of the blocks adaptively while AJBF uses a fixed 546
4 parameter setting. In addition, ringing artifacts are reduced in ASF since it is a block-based depth map filter. From Fig. 5(e), the proposed filter can recover sharp depth edges and the filtered depth image is more similar to its original depth images. By comparing the results of synthesized views from AJBF and ASF, Fig. 7(e) show the proposed filter can significantly reduce the boundary artifacts caused by the blurred depth edges. For larger QP (Quantization Parameter) value, there are more distortions in the edges regions, which increase the difficulty in recovering sharp edges. ABTF can also recover the sharp edges when the QP value is large, and the synthesized boundary artifacts can be removed since the sharp depth edges can be recovered as Fig. 6 (d). (e) Fig. 5 (a) Original depth image of 148 th frame view 5 of Undo_dancer, (b) Compressed depth image (QP=31) (c) Depth image with AJBF (d) Depth image with ASF and (e) Depth image with ABTF. Fig. 6 (a) Compressed depth image (QP=41) of 148 th frame view 5 of Undo_dancer, (b) Depth image with AJBF (c) Depth image with ASF and (d) Depth image with ABTF. (e) Fig. 7 Synthesized view of 148 th frame view 3 of Undo_dancer based on (a) Original texture and depth (b) Compressed texture and depth (QP=31) (c) Compressed texture and depth using AJBF (d) Compressed texture and depth using ASF (e) Compressed texture and depth using ABTF. Fig. 8 Synthesized view of 148 th frame view 3 of Undo_dancer based on (a) Compressed texture and depth (QP=41) (b) Compressed texture and depth using AJBF (c) Compressed texture and depth using ASF (d) Compressed texture and depth using ABTF. Table 1 (a) presents the objective evaluation improvement of decoded depth images and it shows the enhanced depth images by AJBF and ASF have significant improvement in terms of PSNR. Results of AJBF attain lower PSNR compared with those of ASF. However, the enhanced depth images by AJBF and ASF still contain blurred depth edges that result in annoying boundaries artifacts in the synthesized views. Thus, the PSNR of synthesized views through AJBF and ASF are very similar to those of ATM anchor as in Table 2. Results show that depth image filtered by ABTF and the synthesized texture views are more similar to the reference views compared with AJBF and ASF. ABTF has improvement up to 3.25 db in the depth image and bitrate reduction of 3.06% in the synthesized views. This exists in Undo_dancer sequence, in which depth distances between background and foreground are large and its depth images satisfy our assumption that depth image exhibits a high degree of spatial correlation except at object boundaries. In addition, the proposed ABTF has a low complexity as tabulated in Table 2 (b). There is only around 7% increase in runtime when comparing with the decoding time of the 3DV-ATM 5.0. However, the ABTF can achieve significant improvement for the decoded depth image and synthesized view. AJBF ASF ABTF Time Ration Poznan_hall Poznan_hall Poznan_street Poznan_street Undo_dancer Undo_dancer GT_FLY GT_FLY Newspaper Newspaper Kendo Kendo Balloons Balloons Average Average (a) (b) Table 1 (a) PSNR improvement of depth map compared with ATM 5.0 anchor (b) Decoding complexity of ABTF compared with ATM 5.0 anchor. AJBF ASF ABTF Poznan_hall Poznan_street Undo_dancer GT_FLY Newspaper Kendo Balloons Average Table 2 PSNR improvement of synthesized view compared with ATM 5.0 anchor. 6. CONCLUSION As transform coding based H.264/AVC and H.265/HEVC were adopted in recent 3DV coding standards using MVD formats, sharp depth edges of the decoded depth images are often blurred or even with ringing artifacts, which results annoying boundary artifacts in the synthesized views. To recover sharp depth edges from decoded depth images, an adaptive block truncation filter with low complexity is proposed. It uses a very effective depth edge block repositioning and expansion method to achieve higher quality depth values refinement. It is because the adaptive block positioning and expansion can increase the sharp depth edges recoverability since it contains sufficient information of each depth layer inside the block. Experimental results show sharp depth edges can be recovered and the boundary artifacts in the synthesized views can also be removed. In addition, the objective evaluation also has improvement in terms of PSNR. 547
5 7. REFERENCE [1] K. Müller, P. Merkle, T. Wiegand, 3-D Video Representation Using Depth Maps, Proc. IEEE, vol.99, no.4, pp , Apr [2] M. Tanimoto, M.P. Tehrani, T. Fujii, T. Yendo, Free- Viewpoint TV, IEEE Signal Processing Mag., vol. 28, no.1, pp , Jan [3] P. Kauff, N. Atzpadin, C. Fehn, M. Müller, O. Schreer, A. Smolic, and R. Tanger, B, Depth map creation and image based rendering for advanced 3DTV services providing interoperability and scalability, Signal Process.: Image Commun., Special Issue on 3DTV, vol. 22, no. 2, pp , Feb [4] ISO/IEC JTC1/SC29/WG11, Tentative test model for AVCbased 3D video coding (3DV-TM), Doc. M22842, Switzerland, Nov [5] ISO/IEC JTC1/SC 29/WG 11, Test Model under Consideration for HEVC based 3D video coding, Doc. M12350, Nov [6] Y. Chen; S. Liu; Y. K. Wang, M. M. Hannuksela, H. Li, M. Gabbouj, "Low-complexity asymmetric multiview video coding," Proc. IEEE Int. Conf. Multimedia and Expo (ICME), pp.773,776, Apr [7] D.V.S.X. De Silva, W.A.C. Fernando, H. Kodikaraarachchi, S. T. Worall, A. M. Kondoz, "Improved depth map filtering for 3D-TV systems", Proc. IEEE Int. Conf. Consumer Electronics (ICCE ), pp , Jan [8] K.J. Oh, S.Y. Yea, A. Vetro, Y.S. Ho, Depth Reconstruction Filter and Down/Up Sampling for Depth Coding in 3-D Video, IEEE Signal Process. Lett., vol.16, no.9, pp , Sept [9] D.V.S.X. De Silva, W.A.C. Fernando, H. Kodikaraarachchi, S.T. Worall, A.M. Kondoz, Adaptive sharpening of depth maps for 3D-TV, Electron. Lett., vol.46, no.23, pp , Nov [10] D.V.S.X. De Silva, W.A.C. Fernando, H. Kodikaraarachchi, S.T. Worall, A.M. Kodoz, A Depth Map Post-Processing Framework for 3D-TV Systems based on Compression Artifact Analysis, IEEE J. Sel. Topics Signal Process., accepted for publication, [11] E. Ekmekcioglu, M. Mrak, S. Worrall, and A. Kondoz, Utilisation of Edge Adaptive Upsampling in Compression of Depth Map Videos for Enhanced Free-viewpoint Rendering, Proc. IEEE Int. Conf. Image Processing (ICIP), pp , Nov [12] K.J. Oh, A. Vetro, Y.S. Ho, Depth Coding Using a Boundary Reconstruction Filter for 3-D Video Systems, IEEE Tans. Circuits Syst. Video Technol., vol.21, no.3, pp , Mar [13] ISO/IEC JTC1/SC29/WG11, Common Test Conditions for 3DV experimentation, Doc. N12745, Geneva, Switzerland, May
Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding
JOURNAL OF ELECTRONIC SCIENCE AND TECHNOLOGY, VOL. 13, NO. 2, JUNE 2015 97 Efficient Coding Unit and Prediction Unit Decision Algorithm for Multiview Video Coding Wei-Hsiang Chang, Mei-Juan Chen, Gwo-Long
The Open University s repository of research publications and other research outputs
Open Research Online The Open University s repository of research publications and other research outputs Disocclusion Hole-Filling in DIBR-Synthesized Images using Multi-Scale Template Matching Conference
Quality improving techniques for free-viewpoint DIBR
Quality improving techniques for free-viewpoint DIBR Luat Do a, Sveta Zinger a and Peter H.N. de With a,b a Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, Netherlands b Cyclomedia
Quality Estimation for Scalable Video Codec. Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden)
Quality Estimation for Scalable Video Codec Presented by Ann Ukhanova (DTU Fotonik, Denmark) Kashaf Mazhar (KTH, Sweden) Purpose of scalable video coding Multiple video streams are needed for heterogeneous
Performance Analysis and Comparison of JM 15.1 and Intel IPP H.264 Encoder and Decoder
Performance Analysis and Comparison of 15.1 and H.264 Encoder and Decoder K.V.Suchethan Swaroop and K.R.Rao, IEEE Fellow Department of Electrical Engineering, University of Texas at Arlington Arlington,
View Sequence Coding using Warping-based Image Alignment for Multi-view Video
View Sequence Coding using Warping-based mage Alignment for Multi-view Video Yanwei Liu, Qingming Huang,, Wen Gao 3 nstitute of Computing Technology, Chinese Academy of Science, Beijing, China Graduate
Study and Implementation of Video Compression Standards (H.264/AVC and Dirac)
Project Proposal Study and Implementation of Video Compression Standards (H.264/AVC and Dirac) Sumedha Phatak-1000731131- [email protected] Objective: A study, implementation and comparison of
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4
REPRESENTATION, CODING AND INTERACTIVE RENDERING OF HIGH- RESOLUTION PANORAMIC IMAGES AND VIDEO USING MPEG-4 S. Heymann, A. Smolic, K. Mueller, Y. Guo, J. Rurainsky, P. Eisert, T. Wiegand Fraunhofer Institute
Hole-Filling Method Using Depth Based In-Painting For View Synthesis in Free Viewpoint Television (FTV) and 3D Video
MISUBISHI ELECRIC RESEARCH LABORAORIES http://www.merl.com Hole-Filling Method Using Depth Based In-Painting For View Synthesis in Free Viewpoint elevision (FV) and 3D Video Kwan-Jung Oh, Sehoon Yea, Yo-Sung
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference
Very Low Frame-Rate Video Streaming For Face-to-Face Teleconference Jue Wang, Michael F. Cohen Department of Electrical Engineering, University of Washington Microsoft Research Abstract Providing the best
Combating Anti-forensics of Jpeg Compression
IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 3, November 212 ISSN (Online): 1694-814 www.ijcsi.org 454 Combating Anti-forensics of Jpeg Compression Zhenxing Qian 1, Xinpeng
Color Segmentation Based Depth Image Filtering
Color Segmentation Based Depth Image Filtering Michael Schmeing and Xiaoyi Jiang Department of Computer Science, University of Münster Einsteinstraße 62, 48149 Münster, Germany, {m.schmeing xjiang}@uni-muenster.de
Efficient Prediction Structures for Multi-view Video Coding
Efficient Prediction Structures for Multi-view Video Coding Philipp Merkle, Member, IEEE, Aljoscha Smolic, Karsten Müller, Senior Member, IEEE, and Thomas Wiegand, Member, IEEE Abstract An experimental
Video compression: Performance of available codec software
Video compression: Performance of available codec software Introduction. Digital Video A digital video is a collection of images presented sequentially to produce the effect of continuous motion. It takes
Parametric Comparison of H.264 with Existing Video Standards
Parametric Comparison of H.264 with Existing Video Standards Sumit Bhardwaj Department of Electronics and Communication Engineering Amity School of Engineering, Noida, Uttar Pradesh,INDIA Jyoti Bhardwaj
A Novel Hole filling method based on Projection onto Convex Set in DIBR
3rd International Conference on Multimedia Technology ICMT 2013) A Novel Hole filling method based on Projection onto Convex Set in DIBR Weiquan Wang1 and Yingyun Yang2 and Qian Liang3 Abstract. Depth
Multihypothesis Prediction using Decoder Side Motion Vector Derivation in Inter Frame Video Coding
Multihypothesis Prediction using Decoder Side Motion Vector Derivation in Inter Frame Video Coding Steffen Kamp, Johannes Ballé, and Mathias Wien Institut für Nachrichtentechnik, RWTH Aachen University,
Study and Implementation of Video Compression standards (H.264/AVC, Dirac)
Study and Implementation of Video Compression standards (H.264/AVC, Dirac) EE 5359-Multimedia Processing- Spring 2012 Dr. K.R Rao By: Sumedha Phatak(1000731131) Objective A study, implementation and comparison
How To Improve Performance Of The H264 Video Codec On A Video Card With A Motion Estimation Algorithm
Implementation of H.264 Video Codec for Block Matching Algorithms Vivek Sinha 1, Dr. K. S. Geetha 2 1 Student of Master of Technology, Communication Systems, Department of ECE, R.V. College of Engineering,
Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur
Module 8 VIDEO CODING STANDARDS Version ECE IIT, Kharagpur Lesson H. andh.3 Standards Version ECE IIT, Kharagpur Lesson Objectives At the end of this lesson the students should be able to :. State the
JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions
Edith Cowan University Research Online ECU Publications Pre. JPEG compression of monochrome D-barcode images using DCT coefficient distributions Keng Teong Tan Hong Kong Baptist University Douglas Chai
Video Coding Technologies and Standards: Now and Beyond
Hitachi Review Vol. 55 (Mar. 2006) 11 Video Coding Technologies and Standards: Now and Beyond Tomokazu Murakami Hiroaki Ito Muneaki Yamaguchi Yuichiro Nakaya, Ph.D. OVERVIEW: Video coding technology compresses
Tracking Moving Objects In Video Sequences Yiwei Wang, Robert E. Van Dyck, and John F. Doherty Department of Electrical Engineering The Pennsylvania State University University Park, PA16802 Abstract{Object
Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet
DICTA2002: Digital Image Computing Techniques and Applications, 21--22 January 2002, Melbourne, Australia Bandwidth Adaptation for MPEG-4 Video Streaming over the Internet K. Ramkishor James. P. Mammen
Adaptive Image Warping Using Disparity Map-Based Rendering For Hole Prevention In 3D View Synthesis
Adaptive Image Warping Using Disparity Map-Based Rendering For Hole Prevention In 3D View Synthesis Yoganandhan.S Lakshmanan.V VinithaSubashini.B Department Of Computer Science and Engineering [email protected]
302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 2, FEBRUARY 2009
302 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 2, FEBRUARY 2009 Transactions Letters Fast Inter-Mode Decision in an H.264/AVC Encoder Using Mode and Lagrangian Cost Correlation
Content Adaptation for Virtual Office Environment Using Scalable Video Coding
Content Adaptation for Virtual Office Environment Using Scalable Video Coding C. T. E. R. Hewage, [1] H. Kodikara Arachchi, [1] T. Masterton, [2] A. C. Yu, [1] H. Uzuner, [1] S. Dogan, [1] and A. M. Kondoz
REDUCED COMPUTATION USING ADAPTIVE SEARCH WINDOW SIZE FOR H.264 MULTI-FRAME MOTION ESTIMATION
REDUCED COMPUTATION USING ADAPTIVE SEARCH WINDOW SIZE FOR H.264 MULTI-FRAME MOTION ESTIMATION Liang-Ming Ji and Wan-Chi. Siu Department of Electronic and Information Engineering, Hong Kong Polytechnic
Complexity-rate-distortion Evaluation of Video Encoding for Cloud Media Computing
Complexity-rate-distortion Evaluation of Video Encoding for Cloud Media Computing Ming Yang, Jianfei Cai, Yonggang Wen and Chuan Heng Foh School of Computer Engineering, Nanyang Technological University,
A Novel Method to Improve Resolution of Satellite Images Using DWT and Interpolation
A Novel Method to Improve Resolution of Satellite Images Using DWT and Interpolation S.VENKATA RAMANA ¹, S. NARAYANA REDDY ² M.Tech student, Department of ECE, SVU college of Engineering, Tirupati, 517502,
Internet Video Streaming and Cloud-based Multimedia Applications. Outline
Internet Video Streaming and Cloud-based Multimedia Applications Yifeng He, [email protected] Ling Guan, [email protected] 1 Outline Internet video streaming Overview Video coding Approaches for video
Redundant Wavelet Transform Based Image Super Resolution
Redundant Wavelet Transform Based Image Super Resolution Arti Sharma, Prof. Preety D Swami Department of Electronics &Telecommunication Samrat Ashok Technological Institute Vidisha Department of Electronics
Key Terms 2D-to-3D-conversion, depth image-based rendering, hole-filling, image warping, optimization.
Disparity-Map-Based Rendering for Hole Prevention Mr. S. K. Somasundaram, Department of Computer Science & Engineering, Associate Professor, PSNA College of Engineering & Technology, Anna University,Chennai,
2695 P a g e. IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India
Integrity Preservation and Privacy Protection for Digital Medical Images M.Krishna Rani Dr.S.Bhargavi IV Semester M.Tech (DCN) SJCIT Chickballapur Karnataka India Abstract- In medical treatments, the integrity
Low-resolution Character Recognition by Video-based Super-resolution
2009 10th International Conference on Document Analysis and Recognition Low-resolution Character Recognition by Video-based Super-resolution Ataru Ohkura 1, Daisuke Deguchi 1, Tomokazu Takahashi 2, Ichiro
SSIM Technique for Comparison of Images
SSIM Technique for Comparison of Images Anil Wadhokar 1, Krupanshu Sakharikar 2, Sunil Wadhokar 3, Geeta Salunke 4 P.G. Student, Department of E&TC, GSMCOE Engineering College, Pune, Maharashtra, India
Vision based Vehicle Tracking using a high angle camera
Vision based Vehicle Tracking using a high angle camera Raúl Ignacio Ramos García Dule Shu [email protected] [email protected] Abstract A vehicle tracking and grouping algorithm is presented in this work
Figure 1: Relation between codec, data containers and compression algorithms.
Video Compression Djordje Mitrovic University of Edinburgh This document deals with the issues of video compression. The algorithm, which is used by the MPEG standards, will be elucidated upon in order
SUBJECTIVE EVALUATION OF HEVC INTRA CODING FOR STILL IMAGE COMPRESSION
Proceedings of Seventh International Workshop on Video Processing and Quality Metrics for Consumer Electronics January 30-February 1, 2013, Scottsdale, Arizona SUBJECTIVE EVALUATION OF HEVC INTRA CODING
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences
Detection and Restoration of Vertical Non-linear Scratches in Digitized Film Sequences Byoung-moon You 1, Kyung-tack Jung 2, Sang-kook Kim 2, and Doo-sung Hwang 3 1 L&Y Vision Technologies, Inc., Daejeon,
How To Improve Performance Of H.264/Avc With High Efficiency Video Coding (Hevc)
Evaluation of performance and complexity comparison for coding standards HEVC vs. H.264/AVC Zoran M. Milicevic and Zoran S. Bojkovic Abstract In order to compare the performance and complexity without
Image Authentication Scheme using Digital Signature and Digital Watermarking
www..org 59 Image Authentication Scheme using Digital Signature and Digital Watermarking Seyed Mohammad Mousavi Industrial Management Institute, Tehran, Iran Abstract Usual digital signature schemes for
BLOCK-MATCHING motion estimation is inextricably
16 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7, NO. 1, FEBRUARY 2005 Novel Cross-Diamond-Hexagonal Search Algorithms for Fast Block Motion Estimation Chun-Ho Cheung, Member, IEEE, and Lai-Man Po, Member, IEEE
AUDIO CODING: BASICS AND STATE OF THE ART
AUDIO CODING: BASICS AND STATE OF THE ART PACS REFERENCE: 43.75.CD Brandenburg, Karlheinz Fraunhofer Institut Integrierte Schaltungen, Arbeitsgruppe Elektronische Medientechnolgie Am Helmholtzring 1 98603
Performance Verification of Super-Resolution Image Reconstruction
Performance Verification of Super-Resolution Image Reconstruction Masaki Sugie Department of Information Science, Kogakuin University Tokyo, Japan Email: [email protected] Seiichi Gohshi Department
Introduction to image coding
Introduction to image coding Image coding aims at reducing amount of data required for image representation, storage or transmission. This is achieved by removing redundant data from an image, i.e. by
Region of Interest Encoding in Video Conference Systems
Region of Interest Encoding in Video Conference Systems Christopher Bulla and Christian Feldmann Institut für Nachrichtentechnik RWTH Aachen University Aachen, GERMANY {bulla,feldmann}@ient.rwth-aachen.de
Reversible Data Hiding for Security Applications
Reversible Data Hiding for Security Applications Baig Firdous Sulthana, M.Tech Student (DECS), Gudlavalleru engineering college, Gudlavalleru, Krishna (District), Andhra Pradesh (state), PIN-521356 S.
Region of Interest Access with Three-Dimensional SBHP Algorithm CIPR Technical Report TR-2006-1
Region of Interest Access with Three-Dimensional SBHP Algorithm CIPR Technical Report TR-2006-1 Ying Liu and William A. Pearlman January 2006 Center for Image Processing Research Rensselaer Polytechnic
WHITE PAPER. H.264/AVC Encode Technology V0.8.0
WHITE PAPER H.264/AVC Encode Technology V0.8.0 H.264/AVC Standard Overview H.264/AVC standard was published by the JVT group, which was co-founded by ITU-T VCEG and ISO/IEC MPEG, in 2003. By adopting new
PageX: An Integrated Document Processing and Management Software for Digital Libraries
PageX: An Integrated Document Processing and Management Software for Digital Libraries Hanchuan Peng, Zheru Chi, Wanchi Siu, and David Dagan Feng Department of Electronic & Information Engineering The
A Dynamic Approach to Extract Texts and Captions from Videos
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm
Video Authentication for H.264/AVC using Digital Signature Standard and Secure Hash Algorithm Nandakishore Ramaswamy Qualcomm Inc 5775 Morehouse Dr, Sam Diego, CA 92122. USA [email protected] K.
ESE498. Intruder Detection System
0 Washington University in St. Louis School of Engineering and Applied Science Electrical and Systems Engineering Department ESE498 Intruder Detection System By Allen Chiang, Jonathan Chu, Siwei Su Supervisor
Template-based Eye and Mouth Detection for 3D Video Conferencing
Template-based Eye and Mouth Detection for 3D Video Conferencing Jürgen Rurainsky and Peter Eisert Fraunhofer Institute for Telecommunications - Heinrich-Hertz-Institute, Image Processing Department, Einsteinufer
Super-resolution method based on edge feature for high resolution imaging
Science Journal of Circuits, Systems and Signal Processing 2014; 3(6-1): 24-29 Published online December 26, 2014 (http://www.sciencepublishinggroup.com/j/cssp) doi: 10.11648/j.cssp.s.2014030601.14 ISSN:
Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network
Complexity-bounded Power Control in Video Transmission over a CDMA Wireless Network Xiaoan Lu, David Goodman, Yao Wang, and Elza Erkip Electrical and Computer Engineering, Polytechnic University, Brooklyn,
A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA. Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo. University of Padova, Italy
A NEW SUPER RESOLUTION TECHNIQUE FOR RANGE DATA Valeria Garro, Pietro Zanuttigh, Guido M. Cortelazzo University of Padova, Italy ABSTRACT Current Time-of-Flight matrix sensors allow for the acquisition
A Prediction-Based Transcoding System for Video Conference in Cloud Computing
A Prediction-Based Transcoding System for Video Conference in Cloud Computing Yongquan Chen 1 Abstract. We design a transcoding system that can provide dynamic transcoding services for various types of
Audio Coding Algorithm for One-Segment Broadcasting
Audio Coding Algorithm for One-Segment Broadcasting V Masanao Suzuki V Yasuji Ota V Takashi Itoh (Manuscript received November 29, 2007) With the recent progress in coding technologies, a more efficient
Kapitel 12. 3D Television Based on a Stereoscopic View Synthesis Approach
Kapitel 12 3D Television Based on a Stereoscopic View Synthesis Approach DIBR (Depth-Image-Based Rendering) approach 3D content generation DIBR from non-video-rate depth stream Autostereoscopic displays
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS
COMPARISON OF OBJECT BASED AND PIXEL BASED CLASSIFICATION OF HIGH RESOLUTION SATELLITE IMAGES USING ARTIFICIAL NEURAL NETWORKS B.K. Mohan and S. N. Ladha Centre for Studies in Resources Engineering IIT
Image Compression through DCT and Huffman Coding Technique
International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2015 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Rahul
Multiple Description Coding (MDC) and Scalable Coding (SC) for Multimedia
Multiple Description Coding (MDC) and Scalable Coding (SC) for Multimedia Gürkan Gür PhD. Candidate e-mail: [email protected] Dept. Of Computer Eng. Boğaziçi University Istanbul/TR ( Currenty@UNITN)
Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I
Text Localization & Segmentation in Images, Web Pages and Videos Media Mining I Multimedia Computing, Universität Augsburg [email protected] www.multimedia-computing.{de,org} PSNR_Y
To determine vertical angular frequency, we need to express vertical viewing angle in terms of and. 2tan. (degree). (1 pt)
Polytechnic University, Dept. Electrical and Computer Engineering EL6123 --- Video Processing, S12 (Prof. Yao Wang) Solution to Midterm Exam Closed Book, 1 sheet of notes (double sided) allowed 1. (5 pt)
A Learning Based Method for Super-Resolution of Low Resolution Images
A Learning Based Method for Super-Resolution of Low Resolution Images Emre Ugur June 1, 2004 [email protected] Abstract The main objective of this project is the study of a learning based method
THE EMERGING JVT/H.26L VIDEO CODING STANDARD
THE EMERGING JVT/H.26L VIDEO CODING STANDARD H. Schwarz and T. Wiegand Heinrich Hertz Institute, Germany ABSTRACT JVT/H.26L is a current project of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC
Comparative Assessment of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders for Low-Delay Video Applications
Comparative Assessment of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC Encoders for Low-Delay Video Applications Dan Grois* a, Detlev Marpe a, Tung Nguyen a, and Ofer Hadar b a Image Processing Department,
IN SERVICE IMAGE MONITORING USING PERCEPTUAL OBJECTIVE QUALITY METRICS
Journal of ELECTRICAL ENGINEERING, VOL. 54, NO. 9-1, 23, 237 243 IN SERVICE IMAGE MONITORING USING PERCEPTUAL OBJECTIVE QUALITY METRICS Tubagus Maulana Kusuma Hans-Jürgen Zepernick User-oriented image
NEIGHBORHOOD REGRESSION FOR EDGE-PRESERVING IMAGE SUPER-RESOLUTION. Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo
NEIGHBORHOOD REGRESSION FOR EDGE-PRESERVING IMAGE SUPER-RESOLUTION Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo Institute of Computer Science and Technology, Peking University, Beijing, P.R.China,
IMPACT OF COMPRESSION ON THE VIDEO QUALITY
IMPACT OF COMPRESSION ON THE VIDEO QUALITY Miroslav UHRINA 1, Jan HLUBIK 1, Martin VACULIK 1 1 Department Department of Telecommunications and Multimedia, Faculty of Electrical Engineering, University
Video Coding Basics. Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected]
Video Coding Basics Yao Wang Polytechnic University, Brooklyn, NY11201 [email protected] Outline Motivation for video coding Basic ideas in video coding Block diagram of a typical video codec Different
Analyzing Facial Expressions for Virtual Conferencing
IEEE Computer Graphics & Applications, pp. 70-78, September 1998. Analyzing Facial Expressions for Virtual Conferencing Peter Eisert and Bernd Girod Telecommunications Laboratory, University of Erlangen,
An Efficient Compression of Strongly Encrypted Images using Error Prediction, AES and Run Length Coding
An Efficient Compression of Strongly Encrypted Images using Error Prediction, AES and Run Length Coding Stebin Sunny 1, Chinju Jacob 2, Justin Jose T 3 1 Final Year M. Tech. (Cyber Security), KMP College
White paper. H.264 video compression standard. New possibilities within video surveillance.
White paper H.264 video compression standard. New possibilities within video surveillance. Table of contents 1. Introduction 3 2. Development of H.264 3 3. How video compression works 4 4. H.264 profiles
EECS 556 Image Processing W 09. Interpolation. Interpolation techniques B splines
EECS 556 Image Processing W 09 Interpolation Interpolation techniques B splines What is image processing? Image processing is the application of 2D signal processing methods to images Image representation
Mobile video streaming and sharing in social network using cloud by the utilization of wireless link capacity
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 7 July, 2014 Page No. 7247-7252 Mobile video streaming and sharing in social network using cloud by
LIST OF CONTENTS CHAPTER CONTENT PAGE DECLARATION DEDICATION ACKNOWLEDGEMENTS ABSTRACT ABSTRAK
vii LIST OF CONTENTS CHAPTER CONTENT PAGE DECLARATION DEDICATION ACKNOWLEDGEMENTS ABSTRACT ABSTRAK LIST OF CONTENTS LIST OF TABLES LIST OF FIGURES LIST OF NOTATIONS LIST OF ABBREVIATIONS LIST OF APPENDICES
A QoE Based Video Adaptation Algorithm for Video Conference
Journal of Computational Information Systems 10: 24 (2014) 10747 10754 Available at http://www.jofcis.com A QoE Based Video Adaptation Algorithm for Video Conference Jianfeng DENG 1,2,, Ling ZHANG 1 1
MPEG-1 and MPEG-2 Digital Video Coding Standards
Please note that the page has been produced based on text and image material from a book in [sik] and may be subject to copyright restrictions from McGraw Hill Publishing Company. MPEG-1 and MPEG-2 Digital
PERFORMANCE ANALYSIS OF HIGH RESOLUTION IMAGES USING INTERPOLATION TECHNIQUES IN MULTIMEDIA COMMUNICATION SYSTEM
PERFORMANCE ANALYSIS OF HIGH RESOLUTION IMAGES USING INTERPOLATION TECHNIQUES IN MULTIMEDIA COMMUNICATION SYSTEM Apurva Sinha 1, Mukesh kumar 2, A.K. Jaiswal 3, Rohini Saxena 4 Department of Electronics
Face Recognition in Low-resolution Images by Using Local Zernike Moments
Proceedings of the International Conference on Machine Vision and Machine Learning Prague, Czech Republic, August14-15, 014 Paper No. 15 Face Recognition in Low-resolution Images by Using Local Zernie
