PLASMA
Parallel Linear Algebra Software for Multicore Architectures
Core BLAS and Auxiliary (single core)

Core BLAS and Auxiliary functions. Standard BLAS and LAPACK auxiliary routines are grouped by amount of work into Level 1, 2, 3.

Modules

 Level 0: element operations, O(1) work
 Operations on single elements.
 
 Level 1: vector operations, O(n) work
 Vector operations that perform \( O(n) \) work on \( O(n) \) data. These are memory bound, since every operation requires a memory read or write.
 
 Level 2: matrix-vector operations, O(n^2) work
 Matrix-vector operations that perform \( O(n^2) \) work on \( O(n^2) \) data. These are memory bound, since every operation requires a memory read or write.
 
 Level 3: matrix-matrix operations, O(n^3) work
 Matrix-matrix operations that perform \( O(n^3) \) work on \( O(n^2) \) data. These benefit from cache reuse, since many operations can be performed for every read from main memory.
 
 Householder reflectors
 
 Precision conversion
 
 Matrix norms
 
 Linear system solvers
 

Detailed Description

Core BLAS and Auxiliary functions, each running on a single core. Standard BLAS and LAPACK auxiliary routines are grouped by amount of work into Levels 1, 2, and 3; operations on single elements, with O(1) work, form Level 0.
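
The difference in work between the levels can be sketched with three plain C loops. This is only an illustration of the work-to-data ratios described above, not PLASMA code: the names sketch_axpy, sketch_gemv, and sketch_gemm are hypothetical, and the square n-by-n column-major layout is an assumption made for brevity.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical Level 1 sketch: y = alpha*x + y.
 * O(n) work on O(n) data: each element is read once and used once. */
void sketch_axpy(size_t n, double alpha, const double *x, double *y)
{
    assert(x != NULL && y != NULL);
    for (size_t i = 0; i < n; ++i)
        y[i] += alpha * x[i];
}

/* Hypothetical Level 2 sketch: y = A*x + y, with A n-by-n, column-major.
 * O(n^2) work on O(n^2) data: each element of A is still used only once. */
void sketch_gemv(size_t n, const double *A, const double *x, double *y)
{
    assert(A != NULL && x != NULL && y != NULL);
    for (size_t j = 0; j < n; ++j)
        for (size_t i = 0; i < n; ++i)
            y[i] += A[i + j * n] * x[j];
}

/* Hypothetical Level 3 sketch: C = A*B + C, all n-by-n, column-major.
 * O(n^3) work on O(n^2) data: each element of A and B is reused n times,
 * which is what makes cache reuse possible. */
void sketch_gemm(size_t n, const double *A, const double *B, double *C)
{
    assert(A != NULL && B != NULL && C != NULL);
    for (size_t j = 0; j < n; ++j) {
        for (size_t k = 0; k < n; ++k) {
            double b = B[k + j * n];            /* loaded once, used n times */
            for (size_t i = 0; i < n; ++i)
                C[i + j * n] += A[i + k * n] * b;
        }
    }
}
```

In these sketches the Level 1 and Level 2 loops perform a constant number of floating-point operations per matrix or vector element touched, while the Level 3 loop performs about 2n^3 operations on 3n^2 elements. That ratio growing with n is why Level 3 routines can benefit from cache reuse; an optimized implementation would additionally block the loops, which is omitted here.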