Video Processing for MPEG-4 Encoding Systems

Ph.D. === National Taiwan University === Graduate Institute of Electrical Engineering === 89 === Abstract This dissertation presents three video processing techniques that are essential for real-time MPEG-4 encoding systems. Content-based interactivity is one of the most important features of MPEG-4 visual coding; however, the...

Bibliographic Details
Main Authors:	Shyh-Yih Ma, 馬仕毅
Other Authors:	Liang-Gee Chen
Format:	Others
Language:	en_US
Published:	2001
Online Access:	http://ndltd.ncl.edu.tw/handle/14127110273531076387

id	ndltd-TW-089NTU00442140
record_format	oai_dc
spelling	ndltd-TW-089NTU00442140 2016-07-04T04:17:06Z http://ndltd.ncl.edu.tw/handle/14127110273531076387 Video Processing for MPEG-4 Encoding Systems MPEG-4編碼系統視訊處理之研究 Shyh-Yih Ma 馬仕毅 Ph.D. National Taiwan University Graduate Institute of Electrical Engineering 89 Liang-Gee Chen 陳良基 2001 學位論文 ; thesis 93 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	Ph.D. === National Taiwan University === Graduate Institute of Electrical Engineering === 89 === Abstract This dissertation presents three video processing techniques that are essential for real-time MPEG-4 encoding systems. Content-based interactivity is one of the most important features of MPEG-4 visual coding; however, automatic generation of the shape information is still an open research problem. In the first part of this dissertation, an efficient moving object segmentation algorithm is proposed to generate the shape information automatically. A background registration technique is used to construct a reliable background image from the accumulated frame difference information. The moving object region is then separated from the background region by comparing the current frame with the constructed background image. Finally, a post-processing step is applied to the resulting object mask to remove noise regions and smooth the object boundary. When object shadows appear in the background region, a pre-processing gradient filter is applied to the input image to reduce the shadow effect. The experimental results demonstrate very good segmentation quality and high execution speed, making the algorithm well suited to real-time MPEG-4 VOP generation. Motion estimation is the most computationally intensive part of a video encoding system. In the second part of the dissertation, an efficient motion estimation algorithm is presented for real-time implementations of an MPEG-4 encoder on multimedia processors. Because the correlation between neighboring motion vectors is strong, the motion vector predictor position is used as the starting point of the search. A line search pattern is used to reduce memory access and to exploit the special multimedia processor instructions for SAD calculations. Experimental results show that the performance of the proposed predictive line search (PLS) is very close to that of the full search algorithm. Compared with the well-known diamond search fast algorithm, the predictive line search shows better performance and robustness, especially for high-motion sequences. To exploit the system-on-a-chip concept, the last part of the dissertation reports a CMOS active pixel sensor (APS) camera chip with direct frame difference output. This chip combines an important signal processing function for video segmentation with the image sensing circuit to form a smart camera system. The proposed APS cell circuit has in-pixel storage for the previous frame's image data, so the current frame and the previous frame can be read out simultaneously in differential mode. The signal swing of the pixel circuit is maximized for low supply voltage operation. Experimental results show good image quality and correct signal processing results. These techniques can greatly help the design and implementation of real-time MPEG-4 applications.
author2	Liang-Gee Chen
author_facet	Liang-Gee Chen Shyh-Yih Ma 馬仕毅
author	Shyh-Yih Ma 馬仕毅
spellingShingle	Shyh-Yih Ma 馬仕毅 Video Processing for MPEG-4 Encoding Systems
author_sort	Shyh-Yih Ma
title	Video Processing for MPEG-4 Encoding Systems
title_short	Video Processing for MPEG-4 Encoding Systems
title_full	Video Processing for MPEG-4 Encoding Systems
title_fullStr	Video Processing for MPEG-4 Encoding Systems
title_full_unstemmed	Video Processing for MPEG-4 Encoding Systems
title_sort	video processing for mpeg-4 encoding systems
publishDate	2001
url	http://ndltd.ncl.edu.tw/handle/14127110273531076387
work_keys_str_mv	AT shyhyihma videoprocessingformpeg4encodingsystems AT mǎshìyì videoprocessingformpeg4encodingsystems AT shyhyihma mpeg4biānmǎxìtǒngshìxùnchùlǐzhīyánjiū AT mǎshìyì mpeg4biānmǎxìtǒngshìxùnchùlǐzhīyánjiū
_version_	1718334284641599488

Similar Items