Fast Rate-distortion Optimized Mode Decision of H.264/AVC Video Coding Standard

Fast Rate-distortion Optimized Mode Decision of H.264/AVC Video Coding Standard PDF Author: Mohammed Golam Sarwer
Publisher:
ISBN:
Category : Coding theory
Languages : en
Pages : 206

Get Book Here

Book Description

Fast Rate-distortion Optimized Mode Decision of H.264/AVC Video Coding Standard

Fast Rate-distortion Optimized Mode Decision of H.264/AVC Video Coding Standard PDF Author: Mohammed Golam Sarwer
Publisher:
ISBN:
Category : Coding theory
Languages : en
Pages : 206

Get Book Here

Book Description


Implementation of a Fast Inter-prediction Mode Decision in H.264/AVC Video Encoder

Implementation of a Fast Inter-prediction Mode Decision in H.264/AVC Video Encoder PDF Author: Amruta Kiran Kulkarni
Publisher:
ISBN:
Category :
Languages : en
Pages :

Get Book Here

Book Description
H.264/MPEG-4 Part 10 or AVC (advanced video coding) is currently one of the most widely used industry standards for video compression. There are several video codec solutions, both software and hardware, available in the market for H.264. This video compression technology is primarily used in applications such as video conferencing, mobile TV, blu-ray discs, digital television and internet video streaming. This thesis uses the JM 17.2 reference software [15], which is available for all users and can be downloaded from http://iphome.hhi.de/suehring/tml. The software is mainly used for educational purposes; it also includes the reference software manual which has information about installation, compilation and usage. In real time applications such as video streaming and video conferencing it is important that the video encoding/decoding is fast. It is known, that most of the complexity lies in the H.264 encoder, specifically the motion estimation (ME) and mode decision process introduces high computational complexity and takes a lot of CPU (central processing unit) usage. The mode decision process is complex because of variable block sizes (16X16 to 4x4) motion estimation and half and quarter pixel motion compensations. Hence, the objective of this thesis is to reduce the encoding time while maintaining the same quality and efficiency of compression. The Fast adaptive termination (FAT) [30] algorithm is used in the mode decision and motion estimation process. Based on the rate-distortion (RD) cost characteristics all the inter modes are classified as either skip modes or non-skip modes. In order to select the best mode for any macroblock, the minimum RD cost of these two modes is predicted. Further, for skip mode, an early-skip mode detection test is proposed; for non-skip mode a three-stage scheme is proposed to speed up the mode decision process. Experimental results demonstrate that the proposed technique has good robustness in coding efficiency with different quantization parameters (QP) and various video sequences. It is able to achieve encoding time saving by 47.6% and loss of only 0.01% decrease in structural similarity index matrix (SSIM) with negligible degradation in peak signal to noise ratio (PSNR) and acceptable increase in bit rate.

Advances in Multimedia Information Processing - PCM 2007

Advances in Multimedia Information Processing - PCM 2007 PDF Author: Horace H. S. Ip
Publisher: Springer Science & Business Media
ISBN: 3540772545
Category : Computers
Languages : en
Pages : 853

Get Book Here

Book Description
This book constitutes the refereed proceedings of the 8th Pacific Rim Conference on Multimedia, PCM 2007, held in Hong Kong, China, in December 2007. The 73 revised full papers and 21 revised posters presented were carefully reviewed and selected from 247 submissions. The papers are organized in topical sections on image classification and retrieval, the AVS china national standard - technology, applications and products, human face and action recognition, and many more topics.

High Efficiency Video Coding and Other Emerging Standards

High Efficiency Video Coding and Other Emerging Standards PDF Author: K.R. Rao
Publisher: CRC Press
ISBN: 1000794636
Category : Technology & Engineering
Languages : en
Pages : 319

Get Book Here

Book Description
High Efficiency Video Coding and Other Emerging Standards provides an overview of high efficiency video coding (HEVC) and all its extensions and profiles. There are nearly 300 projects and problems included, and about 400 references related to HEVC alone. Next generation video coding (NGVC) beyond HEVC is also described. Other video coding standards such as AVS2, DAALA, THOR, VP9 (Google), DIRAC, VC1, and AV1 are addressed, and image coding standards such as JPEG, JPEG-LS, JPEG2000, JPEG XR, JPEG XS, JPEG XT and JPEG-Pleno are also listed.Understanding of these standards and their implementation is facilitated by overview papers, standards documents, reference software, software manuals, test sequences, source codes, tutorials, keynote speakers, panel discussions, reflector and ftp/web sites – all in the public domain. Access to these categories is also provided.

Complexity-Aware High Efficiency Video Coding

Complexity-Aware High Efficiency Video Coding PDF Author: Guilherme Corrêa
Publisher: Springer
ISBN: 3319257781
Category : Technology & Engineering
Languages : en
Pages : 246

Get Book Here

Book Description
This book discusses computational complexity of High Efficiency Video Coding (HEVC) encoders with coverage extending from the analysis of HEVC compression efficiency and computational complexity to the reduction and scaling of its encoding complexity. After an introduction to the topic and a review of the state-of-the-art research in the field, the authors provide a detailed analysis of the HEVC encoding tools compression efficiency and computational complexity. Readers will benefit from a set of algorithms for scaling the computational complexity of HEVC encoders, all of which take advantage from the flexibility of the frame partitioning structures allowed by the standard. The authors also provide a set of early termination methods based on data mining and machine learning techniques, which are able to reduce the computational complexity required to find the best frame partitioning structures. The applicability of the proposed methods is finally exemplified with an encoding time control system that employs the best complexity reduction and scaling methods presented throughout the book. The methods presented in this book are especially useful in power-constrained, portable multimedia devices to reduce energy consumption and to extend battery life. They can also be applied to portable and non-portable multimedia devices operating in real time with limited computational resources.

The Era of Interactive Media

The Era of Interactive Media PDF Author: Jesse S. Jin
Publisher: Springer Science & Business Media
ISBN: 1461435013
Category : Computers
Languages : en
Pages : 650

Get Book Here

Book Description
Interactive Media is a new research field and a landmark in multimedia development. The Era of Interactive Media is an edited volume contributed from world experts working in academia, research institutions and industry. The Era of Interactive Media focuses mainly on Interactive Media and its various applications. This book also covers multimedia analysis and retrieval; multimedia security rights and management; multimedia compression and optimization; multimedia communication and networking; and multimedia systems and applications. The Era of Interactive Media is designed for a professional audience composed of practitioners and researchers working in the field of multimedia. Advanced-level students in computer science and electrical engineering will also find this book useful as a secondary text or reference.

Rate Distortion Optimization for Interprediction in H.264/AVC Video Coding

Rate Distortion Optimization for Interprediction in H.264/AVC Video Coding PDF Author: Jonathan Patrick Skeans
Publisher:
ISBN:
Category : Rate distortion theory
Languages : en
Pages : 59

Get Book Here

Book Description
Part 10 of MPEG-4 describes the Advanced Video Coding (AVC) method widely known as H.264. H.264 is the product of a collaborative effort known as the Joint Video Team(JVT). The final draft of the standard was completed in May of 2003 and since then H.264 has become one of the most commonly used formats for compression [1]. H.264, unlike previous standards, describes a myriad of coding options that involve variable block size inter prediction methods, nine different intra prediction modes, multi frame prediction and B frame prediction. There are a huge number of options for coding that will tend to generate a different number of coded bits and different reconstruction quality. A video encoder is challenged to minimize coded bitrate and maximize quality. However, choosing the coding mode of a macroblock to achieve this is a difficult problem due to the large number of coding combinations and parameters. Rate Distortion Optimization is an effective technique for choosing the 'best' coding mode for a macroblock. This thesis presents two features of an H.264 encoder, multi frame prediction and B frame prediction. Additionally, a Rate Distortion Optimization scheme is implemented with the features to improve overall performance of the encoder.

Effective Video Coding for Multimedia Applications

Effective Video Coding for Multimedia Applications PDF Author: Sudhakar Radhakrishnan
Publisher: BoD – Books on Demand
ISBN: 953307177X
Category : Computers
Languages : en
Pages : 270

Get Book Here

Book Description
Information has become one of the most valuable assets in the modern era. Within the last 5-10 years, the demand for multimedia applications has increased enormously. Like many other recent developments, the materialization of image and video encoding is due to the contribution from major areas like good network access, good amount of fast processors e.t.c. Many standardization procedures were carrried out for the development of image and video coding. The advancement of computer storage technology continues at a rapid pace as a means of reducing storage requirements of an image and video as most situation warrants. Thus, the science of digital video compression/coding has emerged. This storage capacity seems to be more impressive when it is realized that the intent is to deliver very high quality video to the end user with as few visible artifacts as possible. Current methods of video compression such as Moving Pictures Experts Group (MPEG) standard provide good performance in terms of retaining video quality while reducing the storage requirements. Many books are available for video coding fundamentals.This book is the research outcome of various Researchers and Professors who have contributed a might in this field. This book suits researchers doing their research in the area of video coding.The understanding of fundamentals of video coding is essential for the reader before reading this book. The book revolves around three different challenges namely (i) Coding strategies (coding efficiency and computational complexity), (ii) Video compression and (iii) Error resilience. The complete efficient video system depends upon source coding, proper inter and intra frame coding, emerging newer transform, quantization techniques and proper error concealment.The book gives the solution of all the challenges and is available in different sections.

Visual Content Processing and Representation

Visual Content Processing and Representation PDF Author: Luigi Atzori
Publisher: Springer Science & Business Media
ISBN: 3540335781
Category : Computers
Languages : en
Pages : 233

Get Book Here

Book Description
This book constitutes the thoroughly refereed postproceedings of the 9th International Workshop on Visual Content Processing and Representation, VLBV 2005. The 28 revised full papers presented together with 4 panel summaries were selected from 85 submissions during two rounds of reviewing and revision. The papers address all current issues in visual content processing techniques such as video and image analysis, representation and coding, communications and delivery, consumption, synthesis, protection, and adaptation.

Reducing the Compexity of Inter-prediction Mode Decision for High Effeciency Video Codec

Reducing the Compexity of Inter-prediction Mode Decision for High Effeciency Video Codec PDF Author: Kushal Shah
Publisher:
ISBN:
Category :
Languages : en
Pages : 77

Get Book Here

Book Description
The High Efficiency Video Coding (HEVC) standard is the latest joint video project of the International Telecommunication Unit (ITU-T) Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG) standardization organizations, working together in a partnership known as the Joint Collaborative Team on Video Coding (JCT-VC). While the HEVC is based on the same architecture of the widely used H.264/AVC (Advance Video Coding) standard [8], it includes many new coding tools, and almost all the encoder blocks are optimized with respect to their counterparts in the H.264/AVC standard. This allows the new standard to achieve up to 50% bitrate reduction compared to its predecessor with the same visual quality at the cost of increased complexity [1]. Like H.264/AVC, mode decisions with Motion Estimation (ME) remain among the most time-consuming computations in HEVC. In an inter-prediction mode decision, a fullsearch algorithm searches for every possible block size and refines the results from integer-pel to quarter-pel resolution. Thus, a full-search algorithm guarantees the highest level of compression performance. However, the considerable computational complexity for a mode decision decreases the encoding speed. In this thesis a fast adaptive termination [20] algorithm is proposed that terminates early the mode decision in inter-prediction for HEVC. Based on Rate Distortion (RD) cost, all the inter prediction modes are classified as skip or non-skip modes, and to select the best mode minimum RD cost of these two modes are predicted. For skip mode, the mode decision is predicted in early stage while in non-skip mode different stages are proposed to speed-up the mode decision. Experimental results based on several video test sequences suggest a decrease of about 25%-40% in encoding time is achieved with implementation of the Fast Adaptive Termination algorithm for interprediction mode decision with negligible degradation in peak signal to noise ratio (PSNR). Metrics such as BD-bitrate (Bjøntegaard Delta bitrate), BD-PSNR (Bjøntegaard Delta Peak Signal to Noise Ratio), SSIM (Structural Similarity) and computational complexity are also used.