pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00


Andres Lugo 5f5f508aa8 [ROCm] Ck backend UX refactor (#152951)	2025-08-08 18:40:17 +00:00

Refactors how the enablement/disablement of CK Gemms and SDPA works.

- Adds USE_ROCM_CK_GEMM compile flag for enabling CK gemms.
  - USE_ROCM_CK_GEMM is set to True by default on Linux
- Updates USE_CK_FLASH_ATTENTION to USE_ROCM_CK_SDPA.
  - USE_ROCM_CK_SDPA is set to False by default
  - (USE_CK_FLASH_ATTENTION still works for now, but will be deprecated in a future release)
- Prevents these CK libraries from being used unless pytorch has been built specifically with the functionality AND is running on a system architecture that supports it.
  - the getters for these library backends will also do some validity checking in case the user used an environment variable to change the backend. If invalid, (i.e. one of the cases mentioned above is false) the backend will be set as the current non-CK default

Pull Request resolved: https://github.com/pytorch/pytorch/pull/152951
Approved by: https://github.com/eqy, https://github.com/jeffdaily, https://github.com/m-gallus

Co-authored-by: Jeff Daily <jeff.daily@amd.com>
Co-authored-by: Jithun Nair <jithun.nair@amd.com>
Co-authored-by: Jane (Yuan) Xu <31798555+janeyx99@users.noreply.github.com>
amp_examples.rst	Update document for autocast on CPU (#135299)	2024-09-13 09:11:47 +00:00
autograd.rst	[doc] Add documentation for division by zero behavior in autograd (#155987)	2025-06-16 19:02:12 +00:00
broadcasting.rst
cpu_threading_torchscript_inference.rst	[3/n] Remove references to TorchScript in PyTorch docs (#158315)	2025-07-15 21:14:18 +00:00
cuda.rst	Update warning of TF32 (#158209)	2025-07-16 01:28:50 +00:00
custom_operators.rst	Redirect the custom ops landing page :D (#139634)	2024-11-04 22:25:15 +00:00
ddp.rst	Update DDP dynamo debug docs (#118295)	2024-01-29 14:58:26 +00:00
extending.func.rst
extending.rst	[autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005)	2025-05-09 16:36:57 +00:00
faq.rst
get_start_xpu.rst	fix link for tutorial of inductor on windows (#159853)	2025-08-05 18:37:47 +00:00
gradcheck.rst	[BE] fix typos in docs/ (#156080)	2025-06-21 02:47:32 +00:00
hip.rst	[ROCm] Ck backend UX refactor (#152951)	2025-08-08 18:40:17 +00:00
large_scale_deployments.rst	[3/n] Remove references to TorchScript in PyTorch docs (#158315)	2025-07-15 21:14:18 +00:00
libtorch_stable_abi.md	Address richard's comments on libtorch_stable_abi note (#156324)	2025-06-27 19:19:12 +00:00
mkldnn.rst	Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520)	2025-07-17 08:57:34 +00:00
modules.rst	Fix to modules.rst: indent line with activation functions (#139667)	2024-11-08 01:12:52 +00:00
mps.rst
multiprocessing.rst	[BE] fix typos in docs/ (#156080)	2025-06-21 02:47:32 +00:00
numerical_accuracy.rst	Update warning of TF32 (#158209)	2025-07-16 01:28:50 +00:00
out.rst	add Out Notes (#151306)	2025-04-24 20:25:09 +00:00
randomness.rst	Fix typo in Reproducibility docs (#141341)	2024-11-26 16:53:26 +00:00
serialization.rst	Delete sections referencing torchscript in serialization docs (#156648)	2025-06-25 23:41:24 +00:00
windows.rst	Removing conda references from PyTorch Docs (#152702)	2025-05-20 20:33:28 +00:00