PLASMA
Parallel Linear Algebra Software for Multicore Architectures
|
Functions | |
int | plasma_cungqr (int m, int n, int k, plasma_complex32_t *pA, int lda, plasma_desc_t T, plasma_complex32_t *pQ, int ldq) |
void | plasma_omp_cungqr (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_dorgqr (int m, int n, int k, double *pA, int lda, plasma_desc_t T, double *pQ, int ldq) |
void | plasma_omp_dorgqr (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_sorgqr (int m, int n, int k, float *pA, int lda, plasma_desc_t T, float *pQ, int ldq) |
void | plasma_omp_sorgqr (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_zungqr (int m, int n, int k, plasma_complex64_t *pA, int lda, plasma_desc_t T, plasma_complex64_t *pQ, int ldq) |
void | plasma_omp_zungqr (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int plasma_cungqr | ( | int | m, |
int | n, | ||
int | k, | ||
plasma_complex32_t * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
plasma_complex32_t * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal columns, which is defined as the first n columns of a product of the elementary reflectors returned by plasma_cgeqrf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. m >= n >= 0. |
[in] | k | The number of columns of elementary tile reflectors whose product defines the matrix Q. n >= k >= 0. |
[in] | pA | Details of the QR factorization of the original matrix A as returned by plasma_cgeqrf, where the k first columns are the reflectors. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_cgeqrf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_cungqr | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_cungqr(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_cgeqrf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_dorgqr | ( | int | m, |
int | n, | ||
int | k, | ||
double * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
double * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal columns, which is defined as the first n columns of a product of the elementary reflectors returned by plasma_dgeqrf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. m >= n >= 0. |
[in] | k | The number of columns of elementary tile reflectors whose product defines the matrix Q. n >= k >= 0. |
[in] | pA | Details of the QR factorization of the original matrix A as returned by plasma_dgeqrf, where the k first columns are the reflectors. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_dgeqrf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_dorgqr | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_dorgqr(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_dgeqrf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_sorgqr | ( | int | m, |
int | n, | ||
int | k, | ||
float * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
float * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal columns, which is defined as the first n columns of a product of the elementary reflectors returned by plasma_sgeqrf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. m >= n >= 0. |
[in] | k | The number of columns of elementary tile reflectors whose product defines the matrix Q. n >= k >= 0. |
[in] | pA | Details of the QR factorization of the original matrix A as returned by plasma_sgeqrf, where the k first columns are the reflectors. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_sgeqrf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_sorgqr | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_sorgqr(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_sgeqrf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_zungqr | ( | int | m, |
int | n, | ||
int | k, | ||
plasma_complex64_t * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
plasma_complex64_t * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal columns, which is defined as the first n columns of a product of the elementary reflectors returned by plasma_zgeqrf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. m >= n >= 0. |
[in] | k | The number of columns of elementary tile reflectors whose product defines the matrix Q. n >= k >= 0. |
[in] | pA | Details of the QR factorization of the original matrix A as returned by plasma_zgeqrf, where the k first columns are the reflectors. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_zgeqrf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_zungqr | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_zungqr(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_zgeqrf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |