Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System

碩士 === 國立臺灣大學 === 電子工程學研究所 === 96 === Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display. By special multiview displays, different views are projected to different eyes of viewers. As the displa...

Full description

Bibliographic Details
Main Authors: Pei-Kuei Tsung, 叢培貴
Other Authors: Liang-Gee Chen
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/05178940604233896905
id ndltd-TW-096NTU05428118
record_format oai_dc
spelling ndltd-TW-096NTU054281182016-05-11T04:17:10Z http://ndltd.ncl.edu.tw/handle/05178940604233896905 Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System 高畫質多視角立體視訊編碼系統預測核心之演算法與硬體架構設計 Pei-Kuei Tsung 叢培貴 碩士 國立臺灣大學 電子工程學研究所 96 Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display. By special multiview displays, different views are projected to different eyes of viewers. As the display technology growing, more and more related applications, like 3D-TV and free-viewpoint TV (FTV) are closer and closer to be realized. Further, the requirement of high quality video is emerged in these years. The high definition (HD) video specifications, like 1920x1080 pixels and 1280x720 pixels, are strongly recommend for the advanced video applications including the multiview video applications. In order to make the multiview applications practicable, an efficient multiview video coding (MVC) scheme is needed. As the reference software and research platform, the joint multiview video model (JMVM) is released by the MPEG 3DAV Group. In the JMVM, the H.264/AVC is adopted to the base layer. Moreover, the hybrid motion and disparity compensated prediction is used to further enhance the rate-distortion performance. These constraints are raising the difficulty of the real-time MVC algorithm and architecture implementation than typical H.264 In this thesis, content-aware prediction algorithm with inter-view mode decision for MVC is proposed first. After analyzing and reusing the motion information from the neighboring views, the computational complexity in a MVC prediction core can be reduced to 98.4--99.1\% for ME in most view channels with negligible quality loss of 0.03--0.06 dB in PSNR. Compared with simulcast coding, the proposed algorithm provides coding gain of 0.09--1.44 dB with only 51.4--64.1\% computational complexity. It indicates that the computational redundancy is effectively removed. Second, hardware-oriented algorithm and architecture analysis and implementation are introduced. A system bandwidth analysis scheme of MVC with precedence constraint is proposed at first in this chapter. By adopting the precedence constraint concept from the graph theory, the bandwidth problem in MVC can be solved by selecting the most suitable data-reuse scheme. Then, the proposed MVC motion estimation algorithm is modified and combined with the hardware-oriented predictor-based motion estimation algorithm for the general H.264 encoder. After that, a complete hardware-oriented solution for the prediction core in both single view and multiview video encoder is proposed. Then, by improving the motion vector prediction scheme, the hardware resource requirement on the multiview video coding prediction engine can be further reduced even in high definition or Super-HD cases. With the proposed hardware-oriented algorithm, the on-chip memory requirement is reduced to 18.3--20.4\% and the system bandwidth is reduced to 53.2--95.8\% comparing with the level-C and level-C+ data-reuse scheme. Based on the proposed algorithm and architecture, a "High Definition Multiview Video and Super-HD H.264 Video Integer Motion Estimation Accelerator" single chip design with the largest resolution super-HD 4096x2160 pixels, single view; 1920x1080 pixels, stereo view; and 1280x720 pixels, multiview, are introduced in the end of this thesis. Liang-Gee Chen 陳良基 2008 學位論文 ; thesis 79 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 電子工程學研究所 === 96 === Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display. By special multiview displays, different views are projected to different eyes of viewers. As the display technology growing, more and more related applications, like 3D-TV and free-viewpoint TV (FTV) are closer and closer to be realized. Further, the requirement of high quality video is emerged in these years. The high definition (HD) video specifications, like 1920x1080 pixels and 1280x720 pixels, are strongly recommend for the advanced video applications including the multiview video applications. In order to make the multiview applications practicable, an efficient multiview video coding (MVC) scheme is needed. As the reference software and research platform, the joint multiview video model (JMVM) is released by the MPEG 3DAV Group. In the JMVM, the H.264/AVC is adopted to the base layer. Moreover, the hybrid motion and disparity compensated prediction is used to further enhance the rate-distortion performance. These constraints are raising the difficulty of the real-time MVC algorithm and architecture implementation than typical H.264 In this thesis, content-aware prediction algorithm with inter-view mode decision for MVC is proposed first. After analyzing and reusing the motion information from the neighboring views, the computational complexity in a MVC prediction core can be reduced to 98.4--99.1\% for ME in most view channels with negligible quality loss of 0.03--0.06 dB in PSNR. Compared with simulcast coding, the proposed algorithm provides coding gain of 0.09--1.44 dB with only 51.4--64.1\% computational complexity. It indicates that the computational redundancy is effectively removed. Second, hardware-oriented algorithm and architecture analysis and implementation are introduced. A system bandwidth analysis scheme of MVC with precedence constraint is proposed at first in this chapter. By adopting the precedence constraint concept from the graph theory, the bandwidth problem in MVC can be solved by selecting the most suitable data-reuse scheme. Then, the proposed MVC motion estimation algorithm is modified and combined with the hardware-oriented predictor-based motion estimation algorithm for the general H.264 encoder. After that, a complete hardware-oriented solution for the prediction core in both single view and multiview video encoder is proposed. Then, by improving the motion vector prediction scheme, the hardware resource requirement on the multiview video coding prediction engine can be further reduced even in high definition or Super-HD cases. With the proposed hardware-oriented algorithm, the on-chip memory requirement is reduced to 18.3--20.4\% and the system bandwidth is reduced to 53.2--95.8\% comparing with the level-C and level-C+ data-reuse scheme. Based on the proposed algorithm and architecture, a "High Definition Multiview Video and Super-HD H.264 Video Integer Motion Estimation Accelerator" single chip design with the largest resolution super-HD 4096x2160 pixels, single view; 1920x1080 pixels, stereo view; and 1280x720 pixels, multiview, are introduced in the end of this thesis.
author2 Liang-Gee Chen
author_facet Liang-Gee Chen
Pei-Kuei Tsung
叢培貴
author Pei-Kuei Tsung
叢培貴
spellingShingle Pei-Kuei Tsung
叢培貴
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
author_sort Pei-Kuei Tsung
title Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
title_short Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
title_full Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
title_fullStr Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
title_full_unstemmed Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
title_sort algorithm and architecture design of prediction core in high definition stereo and multiview video coding system
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/05178940604233896905
work_keys_str_mv AT peikueitsung algorithmandarchitecturedesignofpredictioncoreinhighdefinitionstereoandmultiviewvideocodingsystem
AT cóngpéiguì algorithmandarchitecturedesignofpredictioncoreinhighdefinitionstereoandmultiviewvideocodingsystem
AT peikueitsung gāohuàzhìduōshìjiǎolìtǐshìxùnbiānmǎxìtǒngyùcèhéxīnzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì
AT cóngpéiguì gāohuàzhìduōshìjiǎolìtǐshìxùnbiānmǎxìtǒngyùcèhéxīnzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì
_version_ 1718265677408632832