pytorch/docs/source/notes
Andres Lugo 5f5f508aa8 [ROCm] Ck backend UX refactor (#152951)
Refactors how CK GEMMs and SDPA are enabled and disabled.

- Adds the USE_ROCM_CK_GEMM compile flag for enabling CK GEMMs.
  - USE_ROCM_CK_GEMM is set to True by default on Linux.
- Renames USE_CK_FLASH_ATTENTION to USE_ROCM_CK_SDPA.
  - USE_ROCM_CK_SDPA is set to False by default.
  - USE_CK_FLASH_ATTENTION still works for now, but will be deprecated in a future release.
- Prevents these CK libraries from being used unless PyTorch has been built specifically with the functionality AND is running on a system architecture that supports it.
  - The getters for these library backends also perform validity checks in case the user changed the backend through an environment variable. If the selection is invalid (i.e. either of the conditions above is not met), the backend is set to the current non-CK default.
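
The fallback behaviour of the getters can be sketched roughly as follows. This is only an illustration of the rule stated above, not the code from the PR: CkSupport, resolve_backend, and the "default" backend name are hypothetical and do not exist in PyTorch.

```python
from dataclasses import dataclass

# Hypothetical model of the validity check; none of these names are PyTorch APIs.
CK = "ck"


@dataclass(frozen=True)
class CkSupport:
    built_with_ck: bool   # assumption: build enabled USE_ROCM_CK_GEMM / USE_ROCM_CK_SDPA
    arch_supported: bool  # assumption: the running GPU architecture supports CK


def resolve_backend(requested: str, support: CkSupport, non_ck_default: str) -> str:
    """Return the backend to use, falling back when CK was requested but is unusable."""
    if requested != CK:
        # Non-CK selections are passed through unchanged in this sketch.
        return requested
    if support.built_with_ck and support.arch_supported:
        return CK
    # CK was requested (e.g. via an environment variable) but one condition fails.
    return non_ck_default
```

For example, resolve_backend("ck", CkSupport(built_with_ck=True, arch_supported=False), "default") yields the non-CK default, because both conditions must hold for CK to be selected.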

Pull Request resolved: https://github.com/pytorch/pytorch/pull/152951
Approved by: https://github.com/eqy, https://github.com/jeffdaily, https://github.com/m-gallus

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
Co-authored-by: Jithun Nair <jithun.nair@amd.com>
Co-authored-by: Jane (Yuan) Xu <31798555+janeyx99@users.noreply.github.com>
2025-08-08 18:40:17 +00:00
amp_examples.rst Update document for autocast on CPU (#135299) 2024-09-13 09:11:47 +00:00
autograd.rst [doc] Add documentation for division by zero behavior in autograd (#155987) 2025-06-16 19:02:12 +00:00
broadcasting.rst
cpu_threading_torchscript_inference.rst [3/n] Remove references to TorchScript in PyTorch docs (#158315) 2025-07-15 21:14:18 +00:00
cuda.rst Update warning of TF32 (#158209) 2025-07-16 01:28:50 +00:00
custom_operators.rst Redirect the custom ops landing page :D (#139634) 2024-11-04 22:25:15 +00:00
ddp.rst Update DDP dynamo debug docs (#118295) 2024-01-29 14:58:26 +00:00
extending.func.rst
extending.rst [autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005) 2025-05-09 16:36:57 +00:00
faq.rst
get_start_xpu.rst fix link for tutorial of inductor on windows (#159853) 2025-08-05 18:37:47 +00:00
gradcheck.rst [BE] fix typos in docs/ (#156080) 2025-06-21 02:47:32 +00:00
hip.rst [ROCm] Ck backend UX refactor (#152951) 2025-08-08 18:40:17 +00:00
large_scale_deployments.rst [3/n] Remove references to TorchScript in PyTorch docs (#158315) 2025-07-15 21:14:18 +00:00
libtorch_stable_abi.md Address richard's comments on libtorch_stable_abi note (#156324) 2025-06-27 19:19:12 +00:00
mkldnn.rst Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520) 2025-07-17 08:57:34 +00:00
modules.rst Fix to modules.rst: indent line with activation functions (#139667) 2024-11-08 01:12:52 +00:00
mps.rst
multiprocessing.rst [BE] fix typos in docs/ (#156080) 2025-06-21 02:47:32 +00:00
numerical_accuracy.rst Update warning of TF32 (#158209) 2025-07-16 01:28:50 +00:00
out.rst add Out Notes (#151306) 2025-04-24 20:25:09 +00:00
randomness.rst Fix typo in Reproducibility docs (#141341) 2024-11-26 16:53:26 +00:00
serialization.rst Delete sections referencing torchscript in serialization docs (#156648) 2025-06-25 23:41:24 +00:00
windows.rst Removing conda references from PyTorch Docs (#152702) 2025-05-20 20:33:28 +00:00