Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System
碩士 === 國立臺灣大學 === 電子工程學研究所 === 96 === Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display. By special multiview displays, different views are projected to different eyes of viewers. As the displa...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/05178940604233896905 |
id |
ndltd-TW-096NTU05428118 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096NTU054281182016-05-11T04:17:10Z http://ndltd.ncl.edu.tw/handle/05178940604233896905 Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System 高畫質多視角立體視訊編碼系統預測核心之演算法與硬體架構設計 Pei-Kuei Tsung 叢培貴 碩士 國立臺灣大學 電子工程學研究所 96 Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display. By special multiview displays, different views are projected to different eyes of viewers. As the display technology growing, more and more related applications, like 3D-TV and free-viewpoint TV (FTV) are closer and closer to be realized. Further, the requirement of high quality video is emerged in these years. The high definition (HD) video specifications, like 1920x1080 pixels and 1280x720 pixels, are strongly recommend for the advanced video applications including the multiview video applications. In order to make the multiview applications practicable, an efficient multiview video coding (MVC) scheme is needed. As the reference software and research platform, the joint multiview video model (JMVM) is released by the MPEG 3DAV Group. In the JMVM, the H.264/AVC is adopted to the base layer. Moreover, the hybrid motion and disparity compensated prediction is used to further enhance the rate-distortion performance. These constraints are raising the difficulty of the real-time MVC algorithm and architecture implementation than typical H.264 In this thesis, content-aware prediction algorithm with inter-view mode decision for MVC is proposed first. After analyzing and reusing the motion information from the neighboring views, the computational complexity in a MVC prediction core can be reduced to 98.4--99.1\% for ME in most view channels with negligible quality loss of 0.03--0.06 dB in PSNR. Compared with simulcast coding, the proposed algorithm provides coding gain of 0.09--1.44 dB with only 51.4--64.1\% computational complexity. It indicates that the computational redundancy is effectively removed. Second, hardware-oriented algorithm and architecture analysis and implementation are introduced. A system bandwidth analysis scheme of MVC with precedence constraint is proposed at first in this chapter. By adopting the precedence constraint concept from the graph theory, the bandwidth problem in MVC can be solved by selecting the most suitable data-reuse scheme. Then, the proposed MVC motion estimation algorithm is modified and combined with the hardware-oriented predictor-based motion estimation algorithm for the general H.264 encoder. After that, a complete hardware-oriented solution for the prediction core in both single view and multiview video encoder is proposed. Then, by improving the motion vector prediction scheme, the hardware resource requirement on the multiview video coding prediction engine can be further reduced even in high definition or Super-HD cases. With the proposed hardware-oriented algorithm, the on-chip memory requirement is reduced to 18.3--20.4\% and the system bandwidth is reduced to 53.2--95.8\% comparing with the level-C and level-C+ data-reuse scheme. Based on the proposed algorithm and architecture, a "High Definition Multiview Video and Super-HD H.264 Video Integer Motion Estimation Accelerator" single chip design with the largest resolution super-HD 4096x2160 pixels, single view; 1920x1080 pixels, stereo view; and 1280x720 pixels, multiview, are introduced in the end of this thesis. Liang-Gee Chen 陳良基 2008 學位論文 ; thesis 79 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 電子工程學研究所 === 96 === Multiview and stereo video can bring the viewers a 3D and real perceptual experience by transmitting different video sequences simultaneously on the display.
By special multiview displays, different views are projected to different eyes of viewers. As the display technology growing, more and more related
applications, like 3D-TV and free-viewpoint TV (FTV) are closer and closer to be realized. Further, the requirement of high quality video is emerged in these
years. The high definition (HD) video specifications, like 1920x1080 pixels and 1280x720 pixels, are strongly recommend for the advanced video applications
including the multiview video applications. In order to make the multiview applications practicable, an efficient multiview video coding (MVC) scheme is
needed. As the reference software and research platform, the joint multiview video model (JMVM) is released by the MPEG 3DAV Group. In the JMVM, the H.264/AVC
is adopted to the base layer. Moreover, the hybrid motion and disparity compensated prediction is used to further enhance the rate-distortion performance.
These constraints are raising the difficulty of the real-time MVC algorithm and architecture implementation than typical H.264
In this thesis, content-aware prediction algorithm with inter-view mode decision for MVC is proposed first. After analyzing and reusing the motion
information from the neighboring views, the computational complexity in a MVC prediction core can be reduced to 98.4--99.1\%
for ME in most view channels with negligible quality loss of 0.03--0.06 dB in PSNR. Compared with simulcast coding, the proposed
algorithm provides coding gain of 0.09--1.44 dB with only 51.4--64.1\% computational complexity. It indicates that the computational redundancy is effectively
removed. Second, hardware-oriented
algorithm and architecture analysis and implementation are introduced. A system bandwidth analysis scheme of MVC with precedence constraint is proposed at first in this
chapter. By adopting the precedence constraint concept from the graph theory, the bandwidth problem in MVC can be solved by selecting the most suitable
data-reuse scheme. Then, the proposed MVC motion estimation algorithm is modified and combined with the hardware-oriented predictor-based motion estimation
algorithm for the general H.264 encoder. After that, a complete hardware-oriented solution for the prediction core in both single view and multiview video
encoder is proposed. Then, by improving the motion vector prediction scheme, the hardware resource requirement on the multiview video coding prediction engine
can be further reduced even in high definition or Super-HD cases. With the proposed hardware-oriented algorithm, the on-chip memory requirement is reduced to 18.3--20.4\% and the system bandwidth is reduced to 53.2--95.8\% comparing with the level-C and level-C+ data-reuse scheme.
Based on the proposed algorithm and architecture, a "High Definition Multiview Video and Super-HD H.264 Video Integer Motion Estimation Accelerator"
single chip design with the largest resolution super-HD 4096x2160 pixels, single view; 1920x1080 pixels, stereo view; and 1280x720 pixels, multiview,
are introduced in the end of this thesis.
|
author2 |
Liang-Gee Chen |
author_facet |
Liang-Gee Chen Pei-Kuei Tsung 叢培貴 |
author |
Pei-Kuei Tsung 叢培貴 |
spellingShingle |
Pei-Kuei Tsung 叢培貴 Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
author_sort |
Pei-Kuei Tsung |
title |
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
title_short |
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
title_full |
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
title_fullStr |
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
title_full_unstemmed |
Algorithm and Architecture Design of Prediction Core in High Definition Stereo and Multiview Video Coding System |
title_sort |
algorithm and architecture design of prediction core in high definition stereo and multiview video coding system |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/05178940604233896905 |
work_keys_str_mv |
AT peikueitsung algorithmandarchitecturedesignofpredictioncoreinhighdefinitionstereoandmultiviewvideocodingsystem AT cóngpéiguì algorithmandarchitecturedesignofpredictioncoreinhighdefinitionstereoandmultiviewvideocodingsystem AT peikueitsung gāohuàzhìduōshìjiǎolìtǐshìxùnbiānmǎxìtǒngyùcèhéxīnzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì AT cóngpéiguì gāohuàzhìduōshìjiǎolìtǐshìxùnbiānmǎxìtǒngyùcèhéxīnzhīyǎnsuànfǎyǔyìngtǐjiàgòushèjì |
_version_ |
1718265677408632832 |