Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System

碩士 === 臺灣大學 === 電子工程學研究所 === 95 === 3D graphics technology, which is developed since 1960s, is widely used in animations, games, and user interfaces. For real-time graphics applications, Graphics Processing Units (GPUs) are now mainly designed for the desk-top environments. In recent years, there ar...

Full description

Bibliographic Details
Main Authors: Yu-Cheng Lin, 林昱呈
Other Authors: Shao-Yi Chien
Format: Others
Language:en_US
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/94388485679664874646
id ndltd-TW-095NTU05428125
record_format oai_dc
spelling ndltd-TW-095NTU054281252015-10-13T13:55:54Z http://ndltd.ncl.edu.tw/handle/94388485679664874646 Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System 適用於三維繪圖系統之頂點與像素通用著色處理器之硬體架構設計與實現 Yu-Cheng Lin 林昱呈 碩士 臺灣大學 電子工程學研究所 95 3D graphics technology, which is developed since 1960s, is widely used in animations, games, and user interfaces. For real-time graphics applications, Graphics Processing Units (GPUs) are now mainly designed for the desk-top environments. In recent years, there are two important migrations in graphics accelerators. The first one is that the fixed-function pipeline in the early days is now gradually replaced by the programmable pipeline, shader pipeline. The shader pipeline provides the artists and programmers freedom to program the GPU, and extraordinary graphic effects are emerging in an endless stream. The second important migration is that graphics accelerators for mobile devices become more and more important. Powerful graphics functions are going to be integrated in hand-held devices to provide users better user interface and portable gaming environments. The limited resources on a mobile devices, including hardware resource and energy resource, cause the major drawback to provide 3D graphic capability on the handheld devices. Several low-power low-cost solutions have been proposed in these years with low performance. A more efficient solution, where the computing, memory, and power resources should be effectively allocated, is still required. In this thesis, low-power cost-efficient yet high performance universal vertex/pixel shaders, which are used to replace the vertex shader and the pixel shader in the traditional programmable pipeline, are proposed. There are three major contributions in hardware architecture in this thesis. First, the universal vertex/pixel shader, which unifies the functions of the vertex shader as well as the pixel shader and has the ability to make adaptive execution-time resource allocation based on the different scenarios, is proposed to solve the load-imbalance problems. Second, the configurable memory array (CMA) can be used as input/output vertex cache and can change the configurations dynamically to keep the memory usage efficiently for different applications. Finally, many low power design techniques are also proposed. The main low power techniques applied are early rejection after transformation (ERAT) and gated clock. The ERAT technique analyzes the contents of transformed primitives to avoid redundant lighting computation in order to reduce power consumption of the shaders. Instruction level gated clock can be achieved from the operation (OP) and the active vector codes. The clock of those data registers of the un-issued PEs are gated for saving dynamic power. The unused vector pipeline would be turned off and gated to save power. The proposed design techniques are verified by real implementation. Implementation results show that over 40 percent processing time could be saved with all the architecture advantages mentioned above. The prototype chip is fabricated by UMC 90nm technology. The die size is 3.500×3.500mm2. It is capable of processing 200 mega vertices per second and 200 mega pixels per second, which is equivalent to 6.4 giga floating point operations per second. The power consumption is 10.75mW in the worst case when the chip works at 200MHz. Shao-Yi Chien 簡韶逸 2007 學位論文 ; thesis 78 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 臺灣大學 === 電子工程學研究所 === 95 === 3D graphics technology, which is developed since 1960s, is widely used in animations, games, and user interfaces. For real-time graphics applications, Graphics Processing Units (GPUs) are now mainly designed for the desk-top environments. In recent years, there are two important migrations in graphics accelerators. The first one is that the fixed-function pipeline in the early days is now gradually replaced by the programmable pipeline, shader pipeline. The shader pipeline provides the artists and programmers freedom to program the GPU, and extraordinary graphic effects are emerging in an endless stream. The second important migration is that graphics accelerators for mobile devices become more and more important. Powerful graphics functions are going to be integrated in hand-held devices to provide users better user interface and portable gaming environments. The limited resources on a mobile devices, including hardware resource and energy resource, cause the major drawback to provide 3D graphic capability on the handheld devices. Several low-power low-cost solutions have been proposed in these years with low performance. A more efficient solution, where the computing, memory, and power resources should be effectively allocated, is still required. In this thesis, low-power cost-efficient yet high performance universal vertex/pixel shaders, which are used to replace the vertex shader and the pixel shader in the traditional programmable pipeline, are proposed. There are three major contributions in hardware architecture in this thesis. First, the universal vertex/pixel shader, which unifies the functions of the vertex shader as well as the pixel shader and has the ability to make adaptive execution-time resource allocation based on the different scenarios, is proposed to solve the load-imbalance problems. Second, the configurable memory array (CMA) can be used as input/output vertex cache and can change the configurations dynamically to keep the memory usage efficiently for different applications. Finally, many low power design techniques are also proposed. The main low power techniques applied are early rejection after transformation (ERAT) and gated clock. The ERAT technique analyzes the contents of transformed primitives to avoid redundant lighting computation in order to reduce power consumption of the shaders. Instruction level gated clock can be achieved from the operation (OP) and the active vector codes. The clock of those data registers of the un-issued PEs are gated for saving dynamic power. The unused vector pipeline would be turned off and gated to save power. The proposed design techniques are verified by real implementation. Implementation results show that over 40 percent processing time could be saved with all the architecture advantages mentioned above. The prototype chip is fabricated by UMC 90nm technology. The die size is 3.500×3.500mm2. It is capable of processing 200 mega vertices per second and 200 mega pixels per second, which is equivalent to 6.4 giga floating point operations per second. The power consumption is 10.75mW in the worst case when the chip works at 200MHz.
author2 Shao-Yi Chien
author_facet Shao-Yi Chien
Yu-Cheng Lin
林昱呈
author Yu-Cheng Lin
林昱呈
spellingShingle Yu-Cheng Lin
林昱呈
Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
author_sort Yu-Cheng Lin
title Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
title_short Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
title_full Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
title_fullStr Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
title_full_unstemmed Hardware Architecture Design and Implementation ofUniversal Vertex/Pixel Shader for 3D Graphics System
title_sort hardware architecture design and implementation ofuniversal vertex/pixel shader for 3d graphics system
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/94388485679664874646
work_keys_str_mv AT yuchenglin hardwarearchitecturedesignandimplementationofuniversalvertexpixelshaderfor3dgraphicssystem
AT línyùchéng hardwarearchitecturedesignandimplementationofuniversalvertexpixelshaderfor3dgraphicssystem
AT yuchenglin shìyòngyúsānwéihuìtúxìtǒngzhīdǐngdiǎnyǔxiàngsùtōngyòngzhesèchùlǐqìzhīyìngtǐjiàgòushèjìyǔshíxiàn
AT línyùchéng shìyòngyúsānwéihuìtúxìtǒngzhīdǐngdiǎnyǔxiàngsùtōngyòngzhesèchùlǐqìzhīyìngtǐjiàgòushèjìyǔshíxiàn
_version_ 1717745217334935552