mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-06 00:20:18 +01:00
Summary: * Further simplify `ATen/native/cuda/Blas.cpp` by moving `_scaled_mm`, `_scaled_mm_v2` and supporting methods to a new file, `ATen/native/cuda/ScaledBlas.cpp` Test Plan: ``` pytest -svv test/test_matmul_cuda.py pytest -svv test/test_scaled_matmul_cuda.py ``` Reviewers: Subscribers: Tasks: Tags: Signed-off-by: Simon Layton <simonlayton@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/166314 Approved by: https://github.com/eqy ghstack dependencies: #166313 |
||
|---|---|---|
| .. | ||
| conda | ||
| src | ||
| tools | ||
| CMakeLists.txt | ||