PLASMA
Parallel Linear Algebra Software for Multicore Architectures
|
Core BLAS and Auxiliary functions. Standard BLAS and LAPACK auxiliary routines are grouped by amount of work into Level 1, 2, 3. More...
Modules | |
Level 0: element operations, O(1) work | |
Operations on single elements. | |
Level 1: vectors operations, O(n) work | |
Vector operations that perform \( O(n) \) work on \( O(n) \) data. These are memory bound, since every operation requires a memory read or write. | |
Level 2: matrix-vector operations, O(n^2) work | |
Matrix operations that perform \( O(n^2) \) work on \( O(n^2) \) data. These are memory bound, since every operation requires a memory read or write. | |
Level 3: matrix-matrix operations, O(n^3) work | |
Matrix-matrix operations that perform \( O(n^3) \) work on \( O(n^2) \) data. These benefit from cache reuse, since many operations can be performed for every read from main memory. | |
Householder reflectors | |
Precision conversion | |
Matrix norms | |
Linear system solvers | |
Core BLAS and Auxiliary functions. Standard BLAS and LAPACK auxiliary routines are grouped by amount of work into Level 1, 2, 3.