Orchestrating thread scheduling and cache management to improve memory system throughput in throughput processors
Throughput processors such as GPUs continue to provide higher peak arithmetic capability. Designing a high throughput memory system to keep the computational units busy is very challenging. Future throughput processors must continue to exploit data locality and utilize the on-chip and off-chip resou...
Main Author: | |
---|---|
Format: | Others |
Language: | en |
Published: |
2014
|
Subjects: | |
Online Access: | http://hdl.handle.net/2152/25098 |