PLASMA
Parallel Linear Algebra Software for Multicore Architectures

\( C = \alpha A A^H + \beta C \) where \( C \) is Hermitian

Functions

int plasma_cherk (plasma_enum_t uplo, plasma_enum_t trans, int n, int k, float alpha, plasma_complex32_t *pA, int lda, float beta, plasma_complex32_t *pC, int ldc)
 
void plasma_omp_cherk (plasma_enum_t uplo, plasma_enum_t trans, float alpha, plasma_desc_t A, float beta, plasma_desc_t C, plasma_sequence_t *sequence, plasma_request_t *request)
 
int plasma_zherk (plasma_enum_t uplo, plasma_enum_t trans, int n, int k, double alpha, plasma_complex64_t *pA, int lda, double beta, plasma_complex64_t *pC, int ldc)
 
void plasma_omp_zherk (plasma_enum_t uplo, plasma_enum_t trans, double alpha, plasma_desc_t A, double beta, plasma_desc_t C, plasma_sequence_t *sequence, plasma_request_t *request)
 

Detailed Description

\( C = \alpha A A^H + \beta C \) where \( C \) is Hermitian

Function Documentation

int plasma_cherk ( plasma_enum_t  uplo,
plasma_enum_t  trans,
int  n,
int  k,
float  alpha,
plasma_complex32_t *  pA,
int  lda,
float  beta,
plasma_complex32_t *  pC,
int  ldc 
)

Performs one of the Hermitian rank-k operations

\[ C = \alpha A \times A^H + \beta C, \]

or

\[ C = \alpha A^H \times A + \beta C, \]

where alpha and beta are real scalars, C is an n-by-n Hermitian matrix, and A is an n-by-k matrix in the first case and a k-by-n matrix in the second case.
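
The two operations can be spelled out as a naive triple loop. The sketch below is a plain C99 reference written for illustration, not PLASMA's tiled implementation. `ref_cherk` and the `REF_*` enums are hypothetical names standing in for the routine and for PlasmaUpper/PlasmaLower and PlasmaNoTrans/PlasmaConjTrans. It assumes column-major storage, and forcing the diagonal to be real is an assumption carried over from the reference BLAS herk convention.

```c
#include <complex.h>
#include <stddef.h>

/* Hypothetical stand-ins for the PLASMA enums; not the library's values. */
enum ref_uplo  { REF_UPPER, REF_LOWER };
enum ref_trans { REF_NOTRANS, REF_CONJTRANS };

/* Naive column-major reference for
 *   C = alpha * A * A^H + beta * C   (REF_NOTRANS,   A is n-by-k)
 *   C = alpha * A^H * A + beta * C   (REF_CONJTRANS, A is k-by-n)
 * Only the triangle of C selected by uplo is read and written. */
static void ref_cherk(enum ref_uplo uplo, enum ref_trans trans,
                      int n, int k, float alpha,
                      const float complex *A, int lda,
                      float beta, float complex *C, int ldc)
{
    for (int j = 0; j < n; j++) {
        int ibeg = (uplo == REF_UPPER) ? 0 : j;
        int iend = (uplo == REF_UPPER) ? j : n - 1;
        for (int i = ibeg; i <= iend; i++) {
            float complex sum = 0.0f;
            for (int l = 0; l < k; l++) {
                if (trans == REF_NOTRANS) {
                    /* (A A^H)(i,j) = sum_l A(i,l) * conj(A(j,l)) */
                    sum += A[i + (size_t)l * lda]
                         * conjf(A[j + (size_t)l * lda]);
                } else {
                    /* (A^H A)(i,j) = sum_l conj(A(l,i)) * A(l,j) */
                    sum += conjf(A[l + (size_t)i * lda])
                         * A[l + (size_t)j * lda];
                }
            }
            float complex c = alpha * sum + beta * C[i + (size_t)j * ldc];
            /* The diagonal of a Hermitian matrix is real (assumed
             * convention: drop any imaginary rounding residue). */
            if (i == j)
                c = crealf(c);
            C[i + (size_t)j * ldc] = c;
        }
    }
}
```

Because alpha and beta are real and only one triangle is stored, the other triangle of the array is never touched; a caller that needs the full matrix would mirror the stored triangle with conjugation.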

Parameters
[in]uplo
  • PlasmaUpper: Upper triangle of C is stored;
  • PlasmaLower: Lower triangle of C is stored.
[in]trans
  • PlasmaNoTrans:

    \[ C = \alpha A \times A^H + \beta C; \]

  • PlasmaConjTrans:

    \[ C = \alpha A^H \times A + \beta C. \]

[in]n: The order of the matrix C. n >= 0.
[in]k: If trans = PlasmaNoTrans, the number of columns of the matrix A; if trans = PlasmaConjTrans, the number of rows of the matrix A.
[in]alpha: The scalar alpha.
[in]pA: A is an lda-by-ka matrix. If trans = PlasmaNoTrans, ka = k; if trans = PlasmaConjTrans, ka = n.
[in]lda: The leading dimension of the array A. If trans = PlasmaNoTrans, lda >= max(1, n); if trans = PlasmaConjTrans, lda >= max(1, k).
[in]beta: The scalar beta.
[in,out]pC: C is an ldc-by-n matrix. On exit, the uplo part of the matrix is overwritten by the uplo part of the updated matrix.
[in]ldc: The leading dimension of the array C. ldc >= max(1, n).
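
The dimension constraints listed above can be checked before making the call. The helper below is a minimal sketch: `herk_dims_ok` is a hypothetical name, not part of PLASMA, and it tests only the constraints stated in this parameter list (n, lda, ldc). It makes no claim about how the library itself validates or reports bad arguments.

```c
#include <stdbool.h>

static int imax(int a, int b) { return a > b ? a : b; }

/* Hypothetical pre-check of the documented constraints.
 * conj_trans is false for PlasmaNoTrans and true for PlasmaConjTrans. */
static bool herk_dims_ok(bool conj_trans, int n, int k, int lda, int ldc)
{
    /* n >= 0 */
    if (n < 0)
        return false;

    /* The stored A array has n rows for PlasmaNoTrans and k rows for
     * PlasmaConjTrans, so lda must be at least max(1, that row count). */
    int a_rows = conj_trans ? k : n;
    if (lda < imax(1, a_rows))
        return false;

    /* ldc >= max(1, n) */
    if (ldc < imax(1, n))
        return false;

    return true;
}
```

For example, with n = 4 and k = 2, lda = 2 is acceptable for PlasmaConjTrans (A is stored as 2-by-4) but too small for PlasmaNoTrans (A is stored as 4-by-2).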
Return values
PlasmaSuccess: successful exit
See also
plasma_omp_cherk
plasma_cherk
void plasma_omp_cherk ( plasma_enum_t  uplo,
plasma_enum_t  trans,
float  alpha,
plasma_desc_t  A,
float  beta,
plasma_desc_t  C,
plasma_sequence_t *  sequence,
plasma_request_t *  request 
)

Performs a Hermitian rank-k update. Non-blocking tile version of plasma_cherk(): it may return before the computation is finished. Operates on matrices stored by tiles; all matrices are passed through descriptors, and all dimensions are taken from the descriptors. Allows for pipelining of operations at runtime.

Parameters
[in]uplo
  • PlasmaUpper: Upper triangle of C is stored;
  • PlasmaLower: Lower triangle of C is stored.
[in]trans
  • PlasmaNoTrans:

    \[ C = \alpha A \times A^H + \beta C; \]

  • PlasmaConjTrans:

    \[ C = \alpha A^H \times A + \beta C. \]

[in]alpha: The scalar alpha.
[in]A: Descriptor of matrix A.
[in]beta: The scalar beta.
[in,out]C: Descriptor of matrix C.
[in]sequence: Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). Check sequence->status for errors.
[out]request: Identifies this function call (for exception handling purposes).
Return values
void: Errors are returned by setting sequence->status and request->status to error values. Neither status should ever be set back to PlasmaSuccess (the initial value), since another asynchronous call may be setting a failure value at the same time.
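
The status convention described here can be modelled in a few lines. Everything in the sketch below is a hypothetical stand-in: `demo_sequence_t`, `demo_request_t`, `DEMO_SUCCESS`, `demo_fail` and `demo_task` are illustration-only names, not PLASMA types or functions. The sketch is single-threaded and ignores atomicity, and skipping work after an earlier failure is an illustrative choice rather than documented behaviour.

```c
/* Hypothetical stand-ins for the library's sequence and request types. */
enum { DEMO_SUCCESS = 0 };
typedef struct { int status; } demo_sequence_t;
typedef struct { int status; } demo_request_t;

/* Record a failure. Status fields only ever move from the initial
 * success value to an error code; nothing resets them to success. */
static void demo_fail(demo_sequence_t *seq, demo_request_t *req, int err)
{
    req->status = err;
    seq->status = err;
}

/* A stand-in for one asynchronous call in a sequence. */
static void demo_task(int input_ok, demo_sequence_t *seq, demo_request_t *req)
{
    /* Illustrative choice: do nothing if the sequence already failed. */
    if (seq->status != DEMO_SUCCESS)
        return;

    if (!input_ok) {
        demo_fail(seq, req, -1);
        return;
    }

    /* ... the real work would go here. On success, both status fields
     * are deliberately left untouched rather than written. */
}
```

A caller would initialise both status fields to the success value once, issue its calls, and inspect the sequence status only after the whole sequence has completed.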
See also
plasma_cherk
plasma_omp_cherk
plasma_omp_dherk
plasma_omp_sherk
int plasma_zherk ( plasma_enum_t  uplo,
plasma_enum_t  trans,
int  n,
int  k,
double  alpha,
plasma_complex64_t *  pA,
int  lda,
double  beta,
plasma_complex64_t *  pC,
int  ldc 
)

Performs one of the Hermitian rank-k operations

\[ C = \alpha A \times A^H + \beta C, \]

or

\[ C = \alpha A^H \times A + \beta C, \]

where alpha and beta are real scalars, C is an n-by-n Hermitian matrix, and A is an n-by-k matrix in the first case and a k-by-n matrix in the second case.
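
As a small worked instance of the first operation (the values are chosen here purely for illustration), take n = 2, k = 1, alpha = 1, beta = 0 and

\[ A = \begin{pmatrix} 1 \\ i \end{pmatrix}, \qquad A^H = \begin{pmatrix} 1 & -i \end{pmatrix}, \qquad C = A \times A^H = \begin{pmatrix} 1 & -i \\ i & 1 \end{pmatrix}. \]

The result is Hermitian: its diagonal is real and \( C_{21} = \overline{C_{12}} \). With the plain transpose instead of the conjugate transpose, the same A would give

\[ A \times A^T = \begin{pmatrix} 1 & i \\ i & -1 \end{pmatrix}, \]

which is symmetric but not Hermitian, so the conjugation in \( A^H \) is what guarantees a Hermitian C.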

Parameters
[in]uplo
  • PlasmaUpper: Upper triangle of C is stored;
  • PlasmaLower: Lower triangle of C is stored.
[in]trans
  • PlasmaNoTrans:

    \[ C = \alpha A \times A^H + \beta C; \]

  • PlasmaConjTrans:

    \[ C = \alpha A^H \times A + \beta C. \]

[in]n: The order of the matrix C. n >= 0.
[in]k: If trans = PlasmaNoTrans, the number of columns of the matrix A; if trans = PlasmaConjTrans, the number of rows of the matrix A.
[in]alpha: The scalar alpha.
[in]pA: A is an lda-by-ka matrix. If trans = PlasmaNoTrans, ka = k; if trans = PlasmaConjTrans, ka = n.
[in]lda: The leading dimension of the array A. If trans = PlasmaNoTrans, lda >= max(1, n); if trans = PlasmaConjTrans, lda >= max(1, k).
[in]beta: The scalar beta.
[in,out]pC: C is an ldc-by-n matrix. On exit, the uplo part of the matrix is overwritten by the uplo part of the updated matrix.
[in]ldc: The leading dimension of the array C. ldc >= max(1, n).
Return values
PlasmaSuccess: successful exit
See also
plasma_omp_zherk
plasma_cherk
void plasma_omp_zherk ( plasma_enum_t  uplo,
plasma_enum_t  trans,
double  alpha,
plasma_desc_t  A,
double  beta,
plasma_desc_t  C,
plasma_sequence_t *  sequence,
plasma_request_t *  request 
)

Performs a Hermitian rank-k update. Non-blocking tile version of plasma_zherk(): it may return before the computation is finished. Operates on matrices stored by tiles; all matrices are passed through descriptors, and all dimensions are taken from the descriptors. Allows for pipelining of operations at runtime.
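
To give a feel for what "stored by tiles" implies for the Hermitian matrix C, the sketch below counts the tiles that intersect the stored triangle. It is a simplified model only: the square tile size `nb` and the helper names are assumptions made for this example, and the actual layout is whatever the plasma_desc_t descriptor defines.

```c
/* Number of tiles needed to cover n elements with tiles of size nb. */
static int tiles_along(int n, int nb)
{
    return (n + nb - 1) / nb;
}

/* Number of rows in tile row m; the last tile row may be partial. */
static int tile_rows(int m, int n, int nb)
{
    int nt = tiles_along(n, nb);
    return (m == nt - 1) ? n - m * nb : nb;
}

/* Tiles of an n-by-n matrix that intersect the stored triangle.
 * With square tiles, tile (m, j) intersects the lower triangle when
 * m >= j and the upper triangle when m <= j, so the count is the same
 * for both: nt * (nt + 1) / 2. */
static int stored_tiles(int n, int nb)
{
    int nt = tiles_along(n, nb);
    return nt * (nt + 1) / 2;
}
```

With n = 10 and nb = 4 this model has 3 tile rows, the last holding 2 rows, and 6 of the 9 tiles intersect the stored triangle. Each such tile is an independent unit of work, which is what makes runtime pipelining across calls possible.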

Parameters
[in]uplo
  • PlasmaUpper: Upper triangle of C is stored;
  • PlasmaLower: Lower triangle of C is stored.
[in]trans
  • PlasmaNoTrans:

    \[ C = \alpha A \times A^H + \beta C; \]

  • PlasmaConjTrans:

    \[ C = \alpha A^H \times A + \beta C. \]

[in]alpha: The scalar alpha.
[in]A: Descriptor of matrix A.
[in]beta: The scalar beta.
[in,out]C: Descriptor of matrix C.
[in]sequence: Identifies the sequence of function calls that this call belongs to (for completion checks and exception handling purposes). Check sequence->status for errors.
[out]request: Identifies this function call (for exception handling purposes).
Return values
void: Errors are returned by setting sequence->status and request->status to error values. Neither status should ever be set back to PlasmaSuccess (the initial value), since another asynchronous call may be setting a failure value at the same time.
See also
plasma_zherk
plasma_omp_zherk
plasma_omp_cherk
plasma_omp_dherk
plasma_omp_sherk