mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-06 12:20:52 +01:00
`kernel_micro_gemm` generated using BRGEMM:
```
template <bool accum>
inline void kernel_micro_gemm(
const half* __restrict__ A,
const half* __restrict__ B,
float* __restrict__ C,
int64_t M,
int64_t N,
int64_t K,
int64_t lda,
int64_t ldb,
int64_t ldc
) {
at::native::cpublas::brgemm(
M, N, K,
lda, ldb, ldc,
1.f, accum ? 1.f : 0.f,
A,
B,
C);
}
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/136255
Approved by: https://github.com/jgong5, https://github.com/jansel
|
||
|---|---|---|
| .. | ||
| amp | ||
| __init__.py | ||