Accelerating Dense Linear Algebra for GPUs, Multicores and Hybrid Architectures: an Autotuned and Algorithmic Approach
Dense linear algebra (DLA) is one of the seven most important kernels in high-performance computing. The introduction of new machines from vendors gives us opportunities to optimize DLA libraries for those machines and thus exploit their power. Unfortunately, the optimization phase is not straigh...
Main Author: | |
---|---|
Format: | Others |
Published: | Trace: Tennessee Research and Creative Exchange, 2010 |
Subjects: | |
Online Access: | http://trace.tennessee.edu/utk_gradthes/734 |