PLASMA
Parallel Linear Algebra Software for Multicore Architectures
|
Functions | |
int | plasma_cunglq (int m, int n, int k, plasma_complex32_t *pA, int lda, plasma_desc_t T, plasma_complex32_t *pQ, int ldq) |
void | plasma_omp_cunglq (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_dorglq (int m, int n, int k, double *pA, int lda, plasma_desc_t T, double *pQ, int ldq) |
void | plasma_omp_dorglq (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_sorglq (int m, int n, int k, float *pA, int lda, plasma_desc_t T, float *pQ, int ldq) |
void | plasma_omp_sorglq (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int | plasma_zunglq (int m, int n, int k, plasma_complex64_t *pA, int lda, plasma_desc_t T, plasma_complex64_t *pQ, int ldq) |
void | plasma_omp_zunglq (plasma_desc_t A, plasma_desc_t T, plasma_desc_t Q, plasma_workspace_t work, plasma_sequence_t *sequence, plasma_request_t *request) |
int plasma_cunglq | ( | int | m, |
int | n, | ||
int | k, | ||
plasma_complex32_t * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
plasma_complex32_t * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal rows, which is defined as the first m rows of a product of the elementary reflectors returned by plasma_cgelqf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. n >= m. |
[in] | k | The number of rows of elementary tile reflectors whose product defines the matrix Q. m >= k >= 0. |
[in] | pA | Details of the LQ factorization of the original matrix A as returned by plasma_cgelqf. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_cgelqf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_cunglq | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_cunglq(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_cgelqf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_dorglq | ( | int | m, |
int | n, | ||
int | k, | ||
double * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
double * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal rows, which is defined as the first m rows of a product of the elementary reflectors returned by plasma_dgelqf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. n >= m. |
[in] | k | The number of rows of elementary tile reflectors whose product defines the matrix Q. m >= k >= 0. |
[in] | pA | Details of the LQ factorization of the original matrix A as returned by plasma_dgelqf. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_dgelqf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_dorglq | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_dorglq(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_dgelqf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_sorglq | ( | int | m, |
int | n, | ||
int | k, | ||
float * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
float * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal rows, which is defined as the first m rows of a product of the elementary reflectors returned by plasma_sgelqf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. n >= m. |
[in] | k | The number of rows of elementary tile reflectors whose product defines the matrix Q. m >= k >= 0. |
[in] | pA | Details of the LQ factorization of the original matrix A as returned by plasma_sgelqf. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_sgelqf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_sorglq | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_sorglq(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_sgelqf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |
int plasma_zunglq | ( | int | m, |
int | n, | ||
int | k, | ||
plasma_complex64_t * | pA, | ||
int | lda, | ||
plasma_desc_t | T, | ||
plasma_complex64_t * | pQ, | ||
int | ldq | ||
) |
Generates an m-by-n matrix Q with orthonormal rows, which is defined as the first m rows of a product of the elementary reflectors returned by plasma_zgelqf.
[in] | m | The number of rows of the matrix Q. m >= 0. |
[in] | n | The number of columns of the matrix Q. n >= m. |
[in] | k | The number of rows of elementary tile reflectors whose product defines the matrix Q. m >= k >= 0. |
[in] | pA | Details of the LQ factorization of the original matrix A as returned by plasma_zgelqf. |
[in] | lda | The leading dimension of the array A. lda >= max(1,m). |
[in] | T | Auxiliary factorization data, computed by plasma_zgelqf. |
[out] | pQ | On exit, pointer to the m-by-n matrix Q. |
[in] | ldq | The leading dimension of the array Q. ldq >= max(1,m). |
PlasmaSuccess | successful exit |
< | 0 if -i, the i-th argument had an illegal value |
void plasma_omp_zunglq | ( | plasma_desc_t | A, |
plasma_desc_t | T, | ||
plasma_desc_t | Q, | ||
plasma_workspace_t | work, | ||
plasma_sequence_t * | sequence, | ||
plasma_request_t * | request | ||
) |
Non-blocking tile version of plasma_zunglq(). May return before the computation is finished. Allows for pipelining of operations at runtime.
[in] | A | Descriptor of matrix A. A is stored in the tile layout. |
[in] | T | Descriptor of matrix T. Auxiliary factorization data, computed by plasma_zgelqf. |
[out] | Q | Descriptor of matrix Q. On exit, matrix Q stored in the tile layout. |
[in] | work | Workspace for the auxiliary arrays needed by some coreblas kernels. For multiplication by Q contains preallocated space for work arrays. Allocated by the plasma_workspace_create function. |
[in] | sequence | Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). |
[out] | request | Identifies this function call (for exception handling purposes). |
void | Errors are returned by setting sequence->status and request->status to error values. The sequence->status and request->status should never be set to PlasmaSuccess (the initial values) since another async call may be setting a failure value at the same time. |