pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Joel Schlosser	316c188a5e	Remove torch.functional entries from the doc ignore list (#158581 ) Options to address the "undocumented python objects": 1. Reference the functions in the .rst via the `torch.functional` namespace. Note that this changes the generated doc filenames / locations for most of these functions! 2. Document these functions by referencing them from the `torch.` namespace instead, in line with common usage. This would also require setting the `__module__` for these functions and moving entries from `torch.functional`'s `__all__` -> `torch`'s `__all__`, which is BC-breaking. 3. Update the .rst files to also document the `torch.functional` forms of these functions, duplicating docs. This PR takes option (3) above and: * Removes all 20 `torch.functional` entries from the doc ignore list * Removes `torch.functional.align_tensors()` entirely, since we don't want to document it. * This is technically BC-breaking, although the previous impl simply errored out. This change could be moved to a separate isolated PR for safety. * Introduces `torch.aliases.md` as a hidden page for the `torch.functional` aliases to the `torch` analogue functions Pull Request resolved: https://github.com/pytorch/pytorch/pull/158581 Approved by: https://github.com/janeyx99	2025-07-25 17:19:01 +00:00
PyTorch MergeBot	c8316d0e79	Revert "[BE] Remove torch deploy \| remove torch deploy specific files (#158290 )" This reverts commit `6ed2cb6ccd`. Reverted https://github.com/pytorch/pytorch/pull/158290 on behalf of https://github.com/ZainRizvi due to Reverting as per offline discussion to fix internal breaks. @PaliC will reland this as a codev diff. Instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/158288#issuecomment-3119037960))	2025-07-25 16:09:39 +00:00
PyTorch MergeBot	a9f6770edd	Revert "[BE] Remove __reduce_deploy__ (#158291 )" This reverts commit `9c68c4d08f`. Reverted https://github.com/pytorch/pytorch/pull/158291 on behalf of https://github.com/ZainRizvi due to Reverting as per offline discussion to fix internal breaks. @PaliC will reland this as a codev diff. Instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/158288#issuecomment-3119037960))	2025-07-25 16:09:39 +00:00
Jeff Daily	9b29166f57	[ROCm] add flag torch.backends.miopen.immediate (#158951 ) The MIOpen integration has changed over the years. In the past, the MIOpen default for benchmark was True and if it were set to False it would use MIOpen Immediate Mode. But with #145294 the MIOpen benchmark default changed to False and to activate immediate mode you would set the deterministic flag to True. This has proved too restrictive because benchmark and deterministic flags are independent from immediate mode. Thus, immediate mode needs its own flag. Though MIOpen still masquerades behind torch.backends.cudnn and its flags, it seemed inappropriate to add an miopen-exclusive flag to the set of cudnn flags. This PR adds the first miopen-only flag to control its immediate mode. Pull Request resolved: https://github.com/pytorch/pytorch/pull/158951 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-07-25 04:01:51 +00:00
Xuehai Pan	f5e2de928b	[BE] fix remaining flake8 v7 warnings (#159044 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/159044 Approved by: https://github.com/Skylion007 ghstack dependencies: #159043	2025-07-25 02:56:34 +00:00
Ti-Tai Wang	da35562bba	[ONNX] Filter out torchscript sentences (#158850 ) Fixes #157300 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158850 Approved by: https://github.com/justinchuby, https://github.com/svekars	2025-07-24 20:59:06 +00:00
Wei (Will) Feng	693197eed6	[doc] remove FSDP1 developer note (#158991 ) this resolve pytorch doc audit - we remove fsdp1 doc and promote fsdp2 https://docs.pytorch.org/tutorials/intermediate/FSDP_tutorial.html Pull Request resolved: https://github.com/pytorch/pytorch/pull/158991 Approved by: https://github.com/svekars, https://github.com/mori360 ghstack dependencies: #158989	2025-07-24 08:21:54 +00:00
Wei (Will) Feng	68349118b5	[doc] add weifengpy to torch distributed pocs (#158989 ) <img width="415" height="355" alt="Screenshot 2025-07-23 at 16 02 12" src="https://github.com/user-attachments/assets/35b6bb45-d5ed-4d74-8369-e8e66aaa2618" /> Pull Request resolved: https://github.com/pytorch/pytorch/pull/158989 Approved by: https://github.com/mori360	2025-07-24 04:42:33 +00:00
Mikayla Gawarecki	7f649ed4f8	Add basic torch.hash_tensor op (#154149 ) Added `torch.hash_tensor` reduction function with a `mode` argument that defaults to reduction with xor. - The hash is always uint64. - Integers will be casted to uint64 before performing the xor_sum reduction - Floats will be upcasted to double and then bitcasted to uint64 before performing the xor_sum reduction Pull Request resolved: https://github.com/pytorch/pytorch/pull/154149 Approved by: https://github.com/albanD	2025-07-23 22:28:03 +00:00
fduwjj	82f8e04f27	Update distributed maintainers (#158900 ) I maintain couple components of distributed like devicemesh, c10d and PGNCCL, gloo, etc. Can I be marked not as emeritus? Thanks! Pull Request resolved: https://github.com/pytorch/pytorch/pull/158900 Approved by: https://github.com/albanD	2025-07-23 21:53:27 +00:00
PaliC	9c68c4d08f	[BE] Remove __reduce_deploy__ (#158291 ) This PR removes the integration point torch.fx had with torch::deploy (and another minor change). Note: This PR has some broken mypy errors, but I believe those should have been in the code base beforehand, and should be fixed in a separate PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/158291 Approved by: https://github.com/albanD ghstack dependencies: #158288, #158290	2025-07-23 20:27:28 +00:00
PaliC	6ed2cb6ccd	[BE] Remove torch deploy \| remove torch deploy specific files (#158290 ) This PR removes specific files found in pytorch which are only used for torch::deploy. This is mostly testing code and a debugger. Pull Request resolved: https://github.com/pytorch/pytorch/pull/158290 Approved by: https://github.com/albanD ghstack dependencies: #158288	2025-07-23 20:27:28 +00:00
drisspg	691736ae07	Add kernel options to flex docs (#158875 ) Fixes https://github.com/pytorch/pytorch/issues/158741 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158875 Approved by: https://github.com/BoyuanFeng, https://github.com/albanD	2025-07-23 19:05:19 +00:00
Panagiotis Kourdis	fd47401536	[doc] Updates to distributed.md for XCCL backend (#155834 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155834 Approved by: https://github.com/guangyey, https://github.com/AlannaBurke, https://github.com/d4l3k Co-authored-by: Yu, Guangye <106960996+guangyey@users.noreply.github.com>	2025-07-22 21:01:43 +00:00
PyTorch MergeBot	6341311333	Revert "Add unified memory APIs for torch.accelerator (#152932 )" This reverts commit `2ad5c25cfc`. Reverted https://github.com/pytorch/pytorch/pull/152932 on behalf of https://github.com/ZainRizvi due to Very sorry but this is still breaking internally. @albanD would you be able to help get this past the finish line? D78496124 has more details on the failure and the workaround might be to do something like what's in D78684669. To validate the fixes internally, you can follow the instructions here to ghimport the changes: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/138222#issuecomment-3100195370))	2025-07-22 01:01:41 +00:00
PyTorch MergeBot	4c18e85300	Revert "[BE] Remove torch deploy \| remove torch deploy specific files (#158290 )" This reverts commit `a6de309ca1`. Reverted https://github.com/pytorch/pytorch/pull/158290 on behalf of https://github.com/ZainRizvi due to Sorry but this is breaking internally, see D78496147 for details. To validate your fixes internally, you can follow the instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/158288#issuecomment-3099826158))	2025-07-21 23:17:39 +00:00
PyTorch MergeBot	920f26c761	Revert "[BE] Remove __reduce_deploy__ (#158291 )" This reverts commit `0b9fb91f17`. Reverted https://github.com/pytorch/pytorch/pull/158291 on behalf of https://github.com/ZainRizvi due to Sorry but this is breaking internally, see D78496147 for details. To validate your fixes internally, you can follow the instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/158288#issuecomment-3099826158))	2025-07-21 23:17:38 +00:00
Jane Xu	7cc5d03dfc	Document the rest of the specific optimizer module APIs (#158669 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/158669 Approved by: https://github.com/albanD ghstack dependencies: #158483	2025-07-19 07:27:15 +00:00
Jane Xu	f73594164a	[BE] document Adadelta and Adagrad APIs properly (#158483 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/158483 Approved by: https://github.com/albanD	2025-07-19 07:27:15 +00:00
Svetlana Karslioglu	79e49efadd	Pull latest Sphinx theme (#158595 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/158595 Approved by: https://github.com/albanD	2025-07-18 18:46:47 +00:00
PyTorch MergeBot	9a7c2f1f64	Revert "Add torch compile force disable caches alias (#158072 )" This reverts commit `2ecf083b72`. Reverted https://github.com/pytorch/pytorch/pull/158072 on behalf of https://github.com/jeffdaily due to fails on rocm, signal ignored while rocm was unstable ([comment](https://github.com/pytorch/pytorch/pull/158072#issuecomment-3086740829))	2025-07-18 04:58:24 +00:00
angelayi	66c9bc5062	[export] Add runnable code to export docs (#158506 ) Preview: https://docs-preview.pytorch.org/pytorch/pytorch/158506/export.html Yay I can add runnable code to export docs now Also moved export API reference to a different file. With these changes, we can start to consolidate the [export tutorial](https://docs.pytorch.org/tutorials/intermediate/torch_export_tutorial.html) with the docs on pytorch docs. We just need to move the section on DDE and 0/1 specialization, and then I think we can delete the export tutorial. Pull Request resolved: https://github.com/pytorch/pytorch/pull/158506 Approved by: https://github.com/pianpwk, https://github.com/svekars	2025-07-17 20:15:22 +00:00
Oguz Ulgen	2ecf083b72	Add torch compile force disable caches alias (#158072 ) Bunch of people keep thinking current alias only disables inductor cache because it has the name inductor in it. lets globalize the name Pull Request resolved: https://github.com/pytorch/pytorch/pull/158072 Approved by: https://github.com/ezyang	2025-07-17 15:40:36 +00:00
Jiang, Yanbing	f4d8bc46c7	Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520 ) ### Description This PR is to enable TF32 as fp32 internal precision for matmul/linear/conv in `mkldnn backend`. Since we have refined fp32 precision API in https://github.com/pytorch/pytorch/pull/125888, we can easily extend the API to support TF32 for `mkldnn backend`. ``` torch.backends.mkldnn.matmul.fp32_precision = 'tf32' torch.backends.mkldnn.conv.fp32_precision = "tf32" ``` Related kernel update and UTs update are done. And the wrapper `bf32_on_and _off` is updated to `reduced_f32_on_and_off`, and it can run tests 3 times, one is reduced_f32 OFF, the other two are reduced_f32 ON (including `bf32 ON` and `tf32 ON`). Pull Request resolved: https://github.com/pytorch/pytorch/pull/157520 Approved by: https://github.com/mingfeima, https://github.com/jansel	2025-07-17 08:57:34 +00:00
PaliC	0b9fb91f17	[BE] Remove __reduce_deploy__ (#158291 ) This PR removes the integration point torch.fx had with torch::deploy (and another minor change). Note: This PR has some broken mypy errors, but I believe those should have been in the code base beforehand, and should be fixed in a separate PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/158291 Approved by: https://github.com/albanD ghstack dependencies: #158288, #158290	2025-07-17 05:56:26 +00:00
PaliC	a6de309ca1	[BE] Remove torch deploy \| remove torch deploy specific files (#158290 ) This PR removes specific files found in pytorch which are only used for torch::deploy. This is mostly testing code and a debugger. Pull Request resolved: https://github.com/pytorch/pytorch/pull/158290 Approved by: https://github.com/albanD ghstack dependencies: #158288	2025-07-17 05:56:18 +00:00
Yu, Guangye	2ad5c25cfc	Add unified memory APIs for torch.accelerator (#152932 ) # Motivation The following API will be put under torch.accelerator - empty_cache - max_memory_allocated - max_memory_reserved - memory_allocated - memory_reserved - memory_stats - reset_accumulated_memory_stats - reset_peak_memory_stats Pull Request resolved: https://github.com/pytorch/pytorch/pull/152932 Approved by: https://github.com/albanD ghstack dependencies: #138222	2025-07-17 01:56:01 +00:00
Yiming Zhou	a9ee4250d5	[4/n] Remove references to TorchScript in PyTorch docs (#158317 ) Summary: jit.rst Test Plan: CI Rollback Plan: Differential Revision: D78309840 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158317 Approved by: https://github.com/svekars, https://github.com/zhxchen17	2025-07-16 20:01:34 +00:00
angelayi	1cc62c2cb9	[export] Update docs (#157750 ) Preview: https://docs-preview.pytorch.org/pytorch/pytorch/157750/export.html Changes: * Rename draft_export.md -> export.draft_export.md for consistency. * Removed non-strict section in export, instead pointed to programming model doc. * Extended "Expressing Dynamism" section to include Dim hints, ShapeCollection, and AdditionalInputs. * Removed Specialization section in favor of programming model doc * Added pt2 archive doc * Cleaned up sidebar Pull Request resolved: https://github.com/pytorch/pytorch/pull/157750 Approved by: https://github.com/pianpwk	2025-07-16 19:53:12 +00:00
Jiang, Yanbing	900fba4c07	Update warning of TF32 (#158209 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/158209 Approved by: https://github.com/jansel	2025-07-16 01:28:50 +00:00
Yiming Zhou	05dfd312cf	[3/n] Remove references to TorchScript in PyTorch docs (#158315 ) Summary: - cpp_index.rst - fx.md - jit_builtin_functions.rst - jit_python_reference.md - jit_unsupported.md cpu_threading large_scale_deployment Test Plan: CI Rollback Plan: Differential Revision: D78309320 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158315 Approved by: https://github.com/svekars, https://github.com/zhxchen17	2025-07-15 21:14:18 +00:00
Yiming Zhou	0640cfa38c	[2/n] Remove references to TorchScript in PyTorch docs (#158306 ) Summary: Removed jit_language_reference.md Test Plan: CI Rollback Plan: Differential Revision: D78308133 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158306 Approved by: https://github.com/svekars, https://github.com/zhxchen17	2025-07-15 20:57:23 +00:00
Yiming Zhou	19625daf88	[1/n] Remove references to TorchScript in PyTorch docs (#158305 ) Summary: Removed jit_language_reference_v2.md Test Plan: CI Rollback Plan: Differential Revision: D78308009 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158305 Approved by: https://github.com/jingsh, https://github.com/svekars	2025-07-15 20:16:53 +00:00
Ti-Tai Wang	5606c516fd	[ONNX] Remove legacy Dort (#158258 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/158258 Approved by: https://github.com/justinchuby, https://github.com/malfet	2025-07-15 19:14:06 +00:00
Jason Ansel	31326a9ad7	Fix typo in torch.set_float32_matmul_precision docs (#158191 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/158191 Approved by: https://github.com/Skylion007, https://github.com/malfet	2025-07-12 18:23:11 +00:00
Ti-Tai Wang	2eff14c445	[ONNX] Delete torch.onnx.dynamo_export (#158130 ) It's deprecated since torch==2.7. Pull Request resolved: https://github.com/pytorch/pytorch/pull/158130 Approved by: https://github.com/justinchuby	2025-07-12 02:30:47 +00:00
Tristan Rice	0d77364ee3	dist2: cleanup non-option methods on PG (missing, timeouts) (#158123 ) This updates the ProcessGroup.* API to include timeouts on all non-option based overloaded methods. This also adds 2 missing ones `alltoall_base` and `barrier`. Following design in: https://docs.google.com/document/d/13R-1t_yESTvmAjcCN-wQjQQadIEu0JNIdS65uZawZzY/edit?tab=t.0#heading=h.3ctbqqopzc89 Test plan: ``` pytest test/distributed/test_dist2.py ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/158123 Approved by: https://github.com/Skylion007, https://github.com/fduwjj	2025-07-12 00:06:37 +00:00
Shivam Raikundalia	11d6ad8b2e	[Docs] Update PT2 Profiler Torch-Compiled Region Image (#158066 ) Summary: In Pytorch 2.5 we added source code attribution to PT2 traces. Each Torch-Compiled Region will now have its frame id and frame compile id associated with it. Update the image in the doc and add a description of this in the doc itself Test Plan: {F1980179183} Rollback Plan: Differential Revision: D78118228 Pull Request resolved: https://github.com/pytorch/pytorch/pull/158066 Approved by: https://github.com/aaronenyeshi	2025-07-11 07:56:45 +00:00
zeshengzong	b4fc42ca80	Add `torch.segment_reduce` docs (#154352 ) Fixes #153138 ## Test Result ![image](https://github.com/user-attachments/assets/62346d62-d048-4259-906b-f8261e10b4cc) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154352 Approved by: https://github.com/albanD	2025-07-11 06:16:38 +00:00
Jerry Zhang	11a86ad2fa	Remove pytorch quant docs since we are moving to torchao (#157766 ) Summary: att Test Plan: doc page generated from CI Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/157766 Approved by: https://github.com/Skylion007	2025-07-11 03:21:47 +00:00
Howard Huang	8532033679	RPC tutorial audit (#157938 ) Fix [T228333894](https://www.internalfb.com/intern/tasks/?t=228333894) Pull Request resolved: https://github.com/pytorch/pytorch/pull/157938 Approved by: https://github.com/AlannaBurke	2025-07-10 14:15:37 +00:00
Dmitry Rogozhkin	b146ca74f0	docs: add get_default_backend_for_device to distributed documentation (#156783 ) `torch.distributed.get_default_backend_for_device()` API was added to torch 2.6, but is still missing in distributed documentation. This commit addresses the gap. CC: @guangyey, @EikanWang Pull Request resolved: https://github.com/pytorch/pytorch/pull/156783 Approved by: https://github.com/guangyey, https://github.com/malfet	2025-07-10 05:11:30 +00:00
Tristan Rice	ed051c3084	torch.distributed: add initial _dist2 prototype API (#157841 ) This adds the initial dist2 API as proposed in https://docs.google.com/document/d/13R-1t_yESTvmAjcCN-wQjQQadIEu0JNIdS65uZawZzY/edit?tab=t.0#heading=h.3ctbqqopzc89 This is a WIP experimental API and is a sandbox for a number of new features and quality of life improvements/changes to c10d. Test plan: ``` pytest test/distributed/test_dist2.py ``` Docs ``` cd docs make html ``` ![Screenshot 2025-07-08 at 13-39-23 Object Oriented Distributed API - torch distributed _dist2 — PyTorch main documentation](https://github.com/user-attachments/assets/9c03a7ec-09e5-42b9-8478-1ec28bc2b6bd) Pull Request resolved: https://github.com/pytorch/pytorch/pull/157841 Approved by: https://github.com/fduwjj	2025-07-09 23:40:43 +00:00
Dhia-naouali	eaf32fffb7	fixed a tiny typo in torch.compiler.md (#157462 ) Fixes #157444 there was a typo in [docs/source/torch.compiler.md](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler.md) : see -> seen Pull Request resolved: https://github.com/pytorch/pytorch/pull/157462 Approved by: https://github.com/Skylion007, https://github.com/svekars	2025-07-02 19:15:15 +00:00
Ti-Tai Wang	c174f3a6a5	[ONNX] Delete deprecated tutorial page link (#157310 ) Related to https://github.com/pytorch/tutorials/issues/3420 Pull Request resolved: https://github.com/pytorch/pytorch/pull/157310 Approved by: https://github.com/justinchuby	2025-07-01 01:18:26 +00:00
Saiteja Samudrala	2796f31b5e	[DCP] OSS Zero Overhead Checkpointing Implementation (#156207 ) Summary: This diff updates DCP driver code/APIs to support Zero Overhead Checkpointing Test Plan: Test with TorchTitan on this PR: https://github.com/pytorch/torchtitan/pull/1287 Differential Revision: D72391401 Pull Request resolved: https://github.com/pytorch/pytorch/pull/156207 Approved by: https://github.com/teja-rao	2025-06-29 03:19:48 +00:00
Justin Chu	5692cbb818	[ONNX] Delete symbolic caffe2 (#157102 ) Caffe2 is removed from pytorch. This is a clean up. Pull Request resolved: https://github.com/pytorch/pytorch/pull/157102 Approved by: https://github.com/titaiwangms, https://github.com/cyyever	2025-06-28 05:22:02 +00:00
Jane Xu	4048a144ab	Address richard's comments on libtorch_stable_abi note (#156324 ) Followups from #155984 Pull Request resolved: https://github.com/pytorch/pytorch/pull/156324 Approved by: https://github.com/zou3519	2025-06-27 19:19:12 +00:00
Svetlana Karslioglu	2860f5c4f5	Remove mentioning of TorchScript in Export doc (#156969 ) Remove mentioning of TorchScript Pull Request resolved: https://github.com/pytorch/pytorch/pull/156969 Approved by: https://github.com/angelayi Co-authored-by: Angela Yi <yiangela7@gmail.com>	2025-06-27 17:59:15 +00:00
rzou	aa2d54148d	Add AOTDispatcher config to set backward autocast behavior (#156356 ) This PR adds a new config `backward_pass_autocast`, to set the backward autocast behavior. It does not change the existing behavior. The reason why we need this is that torch.compile acquires a forward and backward graph at the time of the forward pass. This means that implemented naively, if there are any context managers active outside the call to torch.compile, the backward graph will also get the behaviors from those context managers. This PR gives users a way to tweak the autocast behavior of the backward pass. Please see torch._functorch.config for the options to the `backward_pass_autocast` config. Pull Request resolved: https://github.com/pytorch/pytorch/pull/156356 Approved by: https://github.com/bdhirsh ghstack dependencies: #155354	2025-06-27 14:58:58 +00:00
Vasiliy Kuznetsov	414ad47045	revamp dtype documentation for 2025 (#156087 ) The dtype documentation has not been updated in awhile, let's do a revamp. 1. combine the duplicated docs for dtypes from `tensors.rst` and `tensor_attributes.rst` to live in `tensor_attributes.rst`, and link to that page from `tensors.rst` 2. split the dtype table into floating point and integer dtypes 3. add the definition of shell dtype 4. add the float8 and MX dtypes as shell dtypes to the dtype table 5. remove legacy quantized dtypes from the table 6. add the definition of various dtype suffixes ("fn", etc) Pull Request resolved: https://github.com/pytorch/pytorch/pull/156087 Approved by: https://github.com/albanD	2025-06-27 13:10:23 +00:00
Geevarghese George	7f6e7103a3	Convert to markdown: jit_python_reference.rst, jit_unsupported.rst, jit_utils.rst, library.rst (#155404 ) Fixes #155024 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155404 Approved by: https://github.com/svekars	2025-06-26 21:09:46 +00:00
haozhe.zhu	53e0b9c393	refine fp32 precision api (#125888 ) Based on the [conversation](https://github.com/pytorch/pytorch/issues/121791), we plan to drop the "highest, high, medium" to represent fp32 internal computation data types . Instead, we will directly use the algorithm to represent it. ### Design Choice: Directly use algorithms name like "TF32", "BF16". #### Pros - The names are more informative. 'tf32' is more informative than a simple "high". - Easier to extend new algorithm like `tf32x3` #### Cons - "HIGHEST, HIGH, MEDIUM" indicated the relative precision between different algorithms. However, we can have more documents to discuss them. ### We provide a layered structure for backends/operators. ('f32' is short for 'fp32_precision') ![image](https://github.com/user-attachments/assets/f89143e5-d6a1-4865-9351-9a50439f5067) ### We provide 3 fp32 compute precision can be set: - "ieee": Not allowed to use any other internal computation data types . - "tf32": Allowed to use tf32 as internal computation data types. - "bf16": Allowed to use bf16 as internal computation data types. - "none": Precision's are not set. Can be override by its father node. ### Overriding Precision Settings Child node can be override by its father node if it is set to default. For current default settings: ``` backend = generic, op = all, precision setting = none backend = cuda, op = all, precision setting = none backend = cuda, op = conv, precision setting = tf32 backend = cuda, op = rnn, precision setting = tf32 backend = cuda, op = matmul, precision setting = none backend = matmul, op = all, precision setting = none backend = matmul, op = conv, precision setting = none backend = matmul, op = rnn, precision setting = none backend = matmul, op = matmul, precision setting = none ``` - If the user set `torch.backends.mkldnn.fp32_precision="bf16"`, his child nodes `torch.backends.mkldnn.matmul.fp32_precision` / `torch.backends.mkldnn.conv.fp32_precision` / `torch.backends.mkldnn.rnn.fp32_precision` will also be override to "bf16". - If the user set `torch.backends.fp32_precision="bf16"`, `torch.backends.mkldnn.fp32_precision` and his child nodes will also we override to "bf16". ### Backward Compatible Since new API allow user to have more fine-grained control. There will be some conflict. For example, previous `torch.backends.cudnn.allow_tf32` are not enough to represent the status for `torch.backends.cudnn.rnn.fp32_precision="ieee"` and `torch.backends.cudnn.conv.fp32_precision="tf32"`. Therefore, our goal for backward compatible is - If the user only uses previous APIs, it will work as previous expectations. - If the user use new API to change the status to an un-representable status for old API, and try to access the status by old API. We will raise Runtime Error and point the document for user. ### Test Plan ``` python test/test_cuda.py -k test_fp32_precision_with_tf32 python test/test_cuda.py -k test_fp32_precision_with_float32_matmul_precision python test/test_cuda.py -k test_invalid_status_for_legacy_api python test/test_mkldnn.py -k test_mlkdnn_get_set python test/test_mkldnn.py -k test_generic_precision python test/test_mkldnn.py -k test_invalid python test/test_mkldnn.py -k test_default_use_parent ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/125888 Approved by: https://github.com/jgong5, https://github.com/albanD Co-authored-by: Jiang, Yanbing <yanbing.jiang@intel.com>	2025-06-26 10:32:20 +00:00
Mikayla Gawarecki	2c6324a1eb	Delete sections referencing torchscript in serialization docs (#156648 ) Address [T228333890](https://www.internalfb.com/intern/tasks/?t=228333890) Pull Request resolved: https://github.com/pytorch/pytorch/pull/156648 Approved by: https://github.com/svekars	2025-06-25 23:41:24 +00:00
albanD	a25d1443fa	Mark TorchServe as all emeritus (#156865 ) As per title and to follow the broader tutorial cleanup work. Pull Request resolved: https://github.com/pytorch/pytorch/pull/156865 Approved by: https://github.com/svekars, https://github.com/malfet, https://github.com/seemethere	2025-06-25 23:34:57 +00:00
Yu, Guangye	9b498d3bb2	Update docs for torch.device (#156686 ) # Motivation Update the doc, to make `torch.device`'s constructor officially support the following methods: - A device string, which is a string representation of the device type and optionally the device ordinal. - A device type and a device ordinal. - A device ordinal, which is treated as the current accelerator type. Pull Request resolved: https://github.com/pytorch/pytorch/pull/156686 Approved by: https://github.com/albanD	2025-06-25 02:12:36 +00:00
PyTorch MergeBot	6459a5c7a9	Revert "Add unified memory APIs for torch.accelerator (#152932 )" This reverts commit `35e44067c4`. Reverted https://github.com/pytorch/pytorch/pull/152932 on behalf of https://github.com/Camyll due to internal build failures ([comment](https://github.com/pytorch/pytorch/pull/138222#issuecomment-3002206756))	2025-06-25 00:11:35 +00:00
Yu, Guangye	35e44067c4	Add unified memory APIs for torch.accelerator (#152932 ) # Motivation The following API will be put under torch.accelerator - empty_cache - max_memory_allocated - max_memory_reserved - memory_allocated - memory_reserved - memory_stats - reset_accumulated_memory_stats - reset_peak_memory_stats Pull Request resolved: https://github.com/pytorch/pytorch/pull/152932 Approved by: https://github.com/albanD ghstack dependencies: #138222	2025-06-24 07:57:48 +00:00
Edward Z. Yang	17eb649d55	Implement guard collectives (optimized version) (#156562 ) This is a remix of https://github.com/pytorch/pytorch/pull/155558 Instead of mediating guard collective via a config option, in this one it's done via a `set_stance` like API. The motivation is that checking for the config value on entry on torch.compile is apparently quite expensive, according to functorch_maml_omniglot. So this makes it a bit cheaper. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/156562 Approved by: https://github.com/Microve	2025-06-24 04:59:49 +00:00
Animesh Jain	fab85fc5f9	[compile][hierarchical compilation] Release nested_compile_region API (#156449 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/156449 Approved by: https://github.com/zou3519, https://github.com/jansel	2025-06-21 15:14:59 +00:00
Xuehai Pan	2ccfd14e23	[BE] fix typos in docs/ (#156080 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/156080 Approved by: https://github.com/cyyever, https://github.com/albanD	2025-06-21 02:47:32 +00:00
Justin Chu	fbbab794ef	[ONNX] Implement Attention-23 (#156431 ) Implement Attention-23 using sdpa and flexattention. - I used copilot for this. - Also updated the conversion logic to remove trailing None inputs. @gramalingam @kunal-vaishnavi @titaiwangms Pull Request resolved: https://github.com/pytorch/pytorch/pull/156431 Approved by: https://github.com/titaiwangms Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-06-20 23:54:57 +00:00
Bhagirath Mehta	de1930a429	Add ONNX dynamo metadata documentation (#155816 ) Describe auto-generated metadata when calling torch.onnx.export Pull Request resolved: https://github.com/pytorch/pytorch/pull/155816 Approved by: https://github.com/justinchuby, https://github.com/titaiwangms Co-authored-by: Justin Chu <justinchuby@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-06-20 20:12:22 +00:00
sekyondaMeta	18e4c461fb	Update index.md (#155143 ) Related to: https://github.com/pytorch/pytorch/issues/152134 Update to index.md to add language for Stable and Unstable Pull Request resolved: https://github.com/pytorch/pytorch/pull/155143 Approved by: https://github.com/AlannaBurke, https://github.com/atalman Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-20 18:53:32 +00:00
windsonsea	9944cd0949	Convert to markdown: quantization-accuracy-debugging.rst, quantization-backend-configuration.rst, quantization-support.rst, random.rst (#155520 ) Related to #155032 - ✅ quantization-accuracy-debugging.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-accuracy-debugging.html) vs [main](https://docs.pytorch.org/docs/main/quantization-accuracy-debugging.html) - ✅ quantization-backend-configuration.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-backend-configuration.html) vs [main](https://docs.pytorch.org/docs/main/quantization-backend-configuration.html) - ✅ quantization-support.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-support.html) vs [main](https://docs.pytorch.org/docs/main/quantization-support.html) - ✅ random.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/random.html) vs [main](https://docs.pytorch.org/docs/main/random.html) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155520 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-18 18:46:04 +00:00
nirajkamalk	202d2ae53a	Convert rst to md: rpc.rst, signal.rst, size.rst, special.rst (#155430 ) Fixes #155033 - [x] [rpc.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/rpc.rst) - [x] [signal.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/signal.rst) - [x] [size.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/size.rst) - [sparse.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/sparse.rst) fixed in #155438 due to large size. - [x] [special.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/special.rst) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155430 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-18 01:27:04 +00:00
Jane Xu	e8bfce9a43	Document how to use stack-based APIs with StableIValue (#155984 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155984 Approved by: https://github.com/albanD, https://github.com/zou3519	2025-06-18 01:10:23 +00:00
PyTorch MergeBot	fa4f07b5b8	Revert "[Docs] Convert to markdown to fix 155032 (#155520 )" This reverts commit `cd66ff8030`. Reverted https://github.com/pytorch/pytorch/pull/155520 on behalf of https://github.com/atalman due to breaks multiple test_quantization.py::TestQuantizationDocs::test_quantization_ ([comment](https://github.com/pytorch/pytorch/pull/155520#issuecomment-2981996091))	2025-06-17 22:22:50 +00:00
windsonsea	cd66ff8030	[Docs] Convert to markdown to fix 155032 (#155520 ) Fix #155032 - ✅ quantization-accuracy-debugging.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-accuracy-debugging.html) vs [main](https://docs.pytorch.org/docs/main/quantization-accuracy-debugging.html) - ✅ quantization-backend-configuration.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-backend-configuration.html) vs [main](https://docs.pytorch.org/docs/main/quantization-backend-configuration.html) - ✅ quantization-support.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization-support.html) vs [main](https://docs.pytorch.org/docs/main/quantization-support.html) - ✅ quantization.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/quantization.html) vs [main](https://docs.pytorch.org/docs/main/quantization.html) - ✅ random.rst: [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155520/random.html) vs [main](https://docs.pytorch.org/docs/main/random.html) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155520 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-17 20:29:45 +00:00
Svetlana Karslioglu	fc5ae12293	Fix issue with right-nav (#156119 ) Enable on page right nav. For autosummary, we need to set `"show_toc_level": 2` so that navigation is enabled. Example: * Main: https://docs.pytorch.org/docs/main/special.html - right nav (under On this page) is empty. * Preview: https://docs-preview.pytorch.org/pytorch/pytorch/156119/special.html - right nav (under On this page) has a all the object listed <img width="1125" alt="Screenshot 2025-06-16 at 2 48 16 PM" src="https://github.com/user-attachments/assets/0790bb72-5997-4542-9847-0a89be4598c0" /> vs <img width="1030" alt="Screenshot 2025-06-16 at 2 48 55 PM" src="https://github.com/user-attachments/assets/4897c49c-044d-4bea-a8cd-490c90cca2b0" /> Pull Request resolved: https://github.com/pytorch/pytorch/pull/156119 Approved by: https://github.com/albanD	2025-06-17 18:09:51 +00:00
windsonsea	7fcad0231c	[Docs] Convert to markdown to fix 155025 (#155789 ) Related to #155025 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155789 Approved by: https://github.com/svekars	2025-06-17 15:08:14 +00:00
Wei (Will) Feng	4a8f5e752b	[FSDP2] explain user contract for fully_shard (#156070 ) <img width="896" alt="Screenshot 2025-06-16 at 1 36 00 AM" src="https://github.com/user-attachments/assets/7cdea256-2454-49c7-8b32-24549a13134d" /> Pull Request resolved: https://github.com/pytorch/pytorch/pull/156070 Approved by: https://github.com/mori360	2025-06-17 10:03:19 +00:00
Justin Silver	008345be9d	Fix #155018 (convert distributed rst to markdown) (#155528 ) Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) Fixes #155018 Docs comparison (check out the 'new' whenever docs build) 1. distributed.checkpoint ([old](https://docs.pytorch.org/docs/main/distributed.checkpoint.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155528/distributed.checkpoint.html)) 2. distributed.elastic ([old](https://docs.pytorch.org/docs/main/distributed.elastic.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155528/distributed.elastic.html)) 3. distributed.fsdp.fully_shard ([old](https://docs.pytorch.org/docs/main/distributed.fsdp.fully_shard.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155528/distributed.fsdp.fully_shard.html)) 4. distributed.optim ([old](https://docs.pytorch.org/docs/main/distributed.optim.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155528/distributed.optim.html)) 5. distributed.pipelining ([old](https://docs.pytorch.org/docs/main/distributed.pipelining.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155528/distributed.pipelining.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155528 Approved by: https://github.com/wz337, https://github.com/svekars	2025-06-16 20:46:09 +00:00
Julian De la Barrera Brandner	2dc1627451	[doc] Add documentation for division by zero behavior in autograd (#155987 ) Fixes #128796 This PR adds documentation about the behavior of division by zero operations in PyTorch's autograd system. The documentation explains: 1. How division by zero produces `inf` values following IEEE-754 floating point arithmetic 2. How autograd handles these cases and why masking after division can lead to `nan` gradients 3. Provides concrete examples showing the issue 4. Recommends two solutions: - Masking before division - Using MaskedTensor (experimental API) The documentation is added to the autograd notes section, making it easily discoverable for users who encounter this common issue. This addresses the original issue #128796 which requested better documentation of this behavior to help users avoid common pitfalls when dealing with division by zero in their models. dditional changes: - Fixed formatting consistency by replacing curly apostrophes with straight apostrophes in the existing documentation Pull Request resolved: https://github.com/pytorch/pytorch/pull/155987 Approved by: https://github.com/soulitzer Co-authored-by: sekyondaMeta <127536312+sekyondaMeta@users.noreply.github.com>	2025-06-16 19:02:12 +00:00
windsonsea	fbd88ae2b5	Convert to markdown: checkpoint.rst (#156009 ) Related to #155014 Use two commits to have a try. ```bash 1800 git mv docs/source/checkpoint.rst docs/source/checkpoint.md 1802 git commit -m "[Docs] Rename checkpoint.rst" 1803 git push origin ckpoint # update the markdown file 1805 git add . 1806 git commit -m "modify checkpoint.md" 1807 git push origin ckpoint ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/156009 Approved by: https://github.com/svekars	2025-06-16 17:48:23 +00:00
windsonsea	a10024d7de	Convert complex_numbers.rst to markdown (#156039 ) Related to #155014 Have a try by following https://github.com/pytorch/pytorch/pull/155899#issuecomment-2974715750 Pull Request resolved: https://github.com/pytorch/pytorch/pull/156039 Approved by: https://github.com/svekars	2025-06-16 17:24:37 +00:00
Dhia naouali	c620d0b5c7	convert: rst to myst pr2/2 (#155911 ) Fixes #155038 parent [PR](https://github.com/pytorch/pytorch/pull/155375) (made two PRs to pass sanity check) this PR converts the following three .rst files with the mentioned referenced in each file - [torch.compiler_faq](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler_faq.rst) - torch.compiler_troubleshooting - nonsupported_numpy_feats - torchdynamo_fine_grain_tracing - [torch.compiler_fine_grain_apis](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler_fine_grain_apis.rst) - None - [torch.compiler_get_started](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler_get_started.rst) - torch.compiler_overview - torch.compiler_api - torchdynamo_fine_grain_tracing I made the suggested edits by the maintainers as commented in the parent PR (used git mv on all files, yet it still appeared as delete-create action) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155911 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-16 00:44:44 +00:00
Animesh Jain	54976bca10	[dynamo] Provide helper functions for guard filter hook (#155083 ) Collection of ready-made guard filters. One issue is that they are not composable - `filter1(filter2(guard))`. On the other hand, they are easy to use. Pull Request resolved: https://github.com/pytorch/pytorch/pull/155083 Approved by: https://github.com/zhxchen17, https://github.com/jansel	2025-06-15 17:49:36 +00:00
Edgar Romo Montiel	ca3cabd24a	Convert to markdown: named_tensor.rst, nested.rst, nn.attention.bias.rst, nn.attention.experimental.rst, nn.attention.flex_attention.rst #155028 (#155696 ) Fixes #155028 This pull request updates the documentation by transitioning from .rst to .md format. It introduces new Markdown files for the documentation of named_tensor, nested, nn.attention.bias, nn.attention.experimental, and nn.attention.flex_attention Pull Request resolved: https://github.com/pytorch/pytorch/pull/155696 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-14 03:32:00 +00:00
dggaytan	3003c681ef	Converting .rst files to .md files (#155377 ) Fixes #155036 This pull request updates the documentation for several modules by transitioning from .rst to .md format, improving readability and usability. It introduces new Markdown files for the documentation of torch.ao.ns._numeric_suite, torch.ao.ns._numeric_suite_fx, AOTInductor, AOTInductor Minifier, and the torch.compiler API Pull Request resolved: https://github.com/pytorch/pytorch/pull/155377 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-13 22:54:27 +00:00
ggsmith842	799443605b	Convert to markdown: distributed.tensor.parallel.rst, distributed.tensor.rst, distributions.rst, dlpack.rst (#155297 ) Fixes #155019 ## Description Convert to markdown: distributed.tensor.parallel.rst, distributed.tensor.rst, distributions.rst, dlpack.rst ## Checklist - [X] dlpack.rst converted to dlpack.md --> [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155297/dlpack.html) - [X] distributions.rst converted to distributions.md --> [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155297/distributions.html) - [X] distributed.tensor.rst converted to distributed.tensor.md --> [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155297/distributed.tensor.html) - [X] distributed.tensor.parallel.rst converted to distributed.tensor.parallel.md --> [Preview](https://docs-preview.pytorch.org/pytorch/pytorch/155297/distributed.tensor.parallel.html) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155297 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-13 22:08:37 +00:00
Runtian (Rachel) Li	049dc48d1e	fix code chunk indentation for `jit_language_reference_v2.md` (#155937 ) Fixes https://github.com/pytorch/pytorch/issues/155023 Related PR: #155781 Description: As discussed, this PR is a follow-up update for `jit_language_reference_v2.md` by deleting the code chunk indentation. Checklist: - [x] The issue being fixed is referenced above (Fixes https://github.com/pytorch/pytorch/issues/155023) - [x] Only one issue is addressed in this pull request - [x] Labels from the issue that this PR is fixing are added to this pull request - [x] No unnecessary issues are included into this pull request. @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label "module: docs" Pull Request resolved: https://github.com/pytorch/pytorch/pull/155937 Approved by: https://github.com/jingsh, https://github.com/svekars	2025-06-13 21:05:23 +00:00
GdoongMathew	731351bb4a	Convert rst to markdown - optim.rst #155031 (#155813 ) Fixes #155031 ![image](https://github.com/user-attachments/assets/36507ca1-eb1e-4358-9e66-ce25ec8a2be1) @pytorchbot label "docathon-h1-2025" "module: docs" "topic: not user facing" "topic: docs" Pull Request resolved: https://github.com/pytorch/pytorch/pull/155813 Approved by: https://github.com/AlannaBurke	2025-06-13 21:03:39 +00:00
windsonsea	7d1b3f599d	[Docs] Convert to markdown cond.rst, config_mod.rst (#155653 ) Related to #155014 Only included 2 files in this PR: - cond.rst - config_mod.rst Pull Request resolved: https://github.com/pytorch/pytorch/pull/155653 Approved by: https://github.com/svekars	2025-06-13 20:58:57 +00:00
Justin Silver	f3e6c8e834	Fix #155016 for Docathon - convert rst to markdown (#155198 ) Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) One note is that "Created On" and "Last Updated On" banner doesn't show in the markdown files... I'm not sure if that's just an artifact of my local build though. Fixes #155016 Docs comparison (check out the 'new' whenever docs build) 1. cuda ([old](https://docs.pytorch.org/docs/main/cuda.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155198/cuda.html)) 2. cuda.tunable ([old](https://docs.pytorch.org/docs/main/cuda.tunable.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155198/cuda.tunable.html)) 3. leave cudnn_persistent_rnn.rst as is because it's reused in docstrings 4. cudnn_rnn_determinism.rst as is because it's reused in docstrings. 5. data ([old](https://docs.pytorch.org/docs/main/data.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155198/data.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155198 Approved by: https://github.com/albanD, https://github.com/svekars	2025-06-13 20:24:34 +00:00
Ankita George	bf798a2f01	Change _hfstorage to hfstorage (#155837 ) Summary: Change HF classes to not have an underscore, there-by making them public, we will add documentation to them following this Test Plan: ensure existing tests pass Rollback Plan: Differential Revision: D76364024 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155837 Approved by: https://github.com/saumishr	2025-06-13 20:19:51 +00:00
Dhia-naouali	c5d00e150a	convert: rst to myst pr 1/2 (#155840 ) Fixes #155038 parent [PR](https://github.com/pytorch/pytorch/pull/155375) (made two PRs to pass sanity check) this PR converts the following two .rst files - [torch.compiler_dynamo_overview](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler_dynamo_overview.rst) - [torch.compiler_fake_tensor](https://github.com/pytorch/pytorch/blob/main/docs/source/torch.compiler_fake_tensor.rst) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155840 Approved by: https://github.com/sekyondaMeta	2025-06-13 18:02:28 +00:00
Runtian (Rachel) Li	093aaccae2	convert `jit_language_reference_v2.rst` to `jit_language_reference_v2.md` (#155781 ) Fixes https://github.com/pytorch/pytorch/issues/155023 Description: converted `jit_language_reference_v2.rst` to `jit_language_reference_v2.md` I indented the code blocks to minimize the file difference to pass the sanity check for no more than 2000 lines of change. I will submit another PR to fix the indentation after this PR is merged. Checklist: - [x] The issue being fixed is referenced above (Fixes https://github.com/pytorch/pytorch/issues/155023) - [x] Only one issue is addressed in this pull request - [x] Labels from the issue that this PR is fixing are added to this pull request - [x] No unnecessary issues are included into this pull request. @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label module: docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/155781 Approved by: https://github.com/svekars	2025-06-13 17:33:10 +00:00
ZhaoqiongZ	3d595fd559	update get start xpu (#151886 ) update link and product name add print to print ```torch.xpu.is_available()``` result in code snippet for user not using command python Pull Request resolved: https://github.com/pytorch/pytorch/pull/151886 Approved by: https://github.com/guangyey, https://github.com/AlannaBurke Co-authored-by: Yu, Guangye <106960996+guangyey@users.noreply.github.com>	2025-06-13 07:46:13 +00:00
Justin Silver	70bb34929a	Convert to .md: draft_export.rst, export.ir_spec.rst, fft.rst (#155567 ) Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) Fixes #155020. This PR is split into 3 to pass sanity check. Docs comparison (check out the 'new' whenever docs build) 1. draft_export ([old](https://docs.pytorch.org/docs/main/draft_export.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155567/draft_export.html)) 2. export.ir_spec ([old](https://docs.pytorch.org/docs/main/export.ir_spec.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155567/export.ir_spec.html)) 3. fft ([old](https://docs.pytorch.org/docs/main/fft.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155567/fft.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155567 Approved by: https://github.com/svekars	2025-06-13 05:19:43 +00:00
GdoongMathew	2ba930d4ce	Convert rst to markdown - profiler.rst #155031 (#155559 ) Fixes https://github.com/pytorch/pytorch/issues/155031 * [profiler.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/profiler.rst) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155559 Approved by: https://github.com/svekars	2025-06-13 05:02:54 +00:00
Runtian (Rachel) Li	e8b3dfa7c0	convert jit_language_reference.rst to jit_language_reference.md (#155633 ) Part of changes https://github.com/pytorch/pytorch/issues/155023 (parent PR https://github.com/pytorch/pytorch/pull/155429) - converted jit_language_reference.rst to jit_language_reference.md @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label module: docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/155633 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-13 04:58:28 +00:00
Runtian (Rachel) Li	3f65e38b73	Convert hub.rst to hub.md (#155483 ) Part of changes https://github.com/pytorch/pytorch/issues/155023 (parent PR https://github.com/pytorch/pytorch/pull/155429) @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label module: docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/155483 Approved by: https://github.com/svekars	2025-06-13 04:39:55 +00:00
Justin Silver	e085012335	Fix #155020 - rst2markdown for export.rst (split PR) (#155753 ) Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) Fixes #155020. This PR is split into 3 to pass sanity check. This is the 3rd one. Docs comparison (check out the 'new' whenever docs build) 1. export ([old](https://docs.pytorch.org/docs/main/export.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155567/export.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155753 Approved by: https://github.com/sekyondaMeta	2025-06-12 19:30:52 +00:00
Grant Smith	7986c0dba6	rename distributed.rst to md (#155767 ) Fixes #155019 For sanity checks, split PR to have this one only include distributed.rst -> distributed.md Preview -> [distributed.md](https://docs-preview.pytorch.org/pytorch/pytorch/155767/distributed.html) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155767 Approved by: https://github.com/sekyondaMeta	2025-06-12 18:42:15 +00:00
Runtian (Rachel) Li	9df2e8020f	fix code indentation for fx.md (#155764 ) Fixes https://github.com/pytorch/pytorch/issues/155023 Related PR: #155482 Description: As discussed here https://github.com/pytorch/pytorch/pull/155482#pullrequestreview-2918032289, I removed indentation for python code blocks as a follow-up modification for fx.md Checklist: - [x] The issue being fixed is referenced above (Fixes https://github.com/pytorch/pytorch/issues/155023) - [x] Only one issue is addressed in this pull request - [x] Labels from the issue that this PR is fixing are added to this pull request - [x] No unnecessary issues are included into this pull request. @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label module: docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/155764 Approved by: https://github.com/svekars	2025-06-12 16:02:33 +00:00
Kazuaki Ishizaki	b00b641ff1	[Docs] Convert to markdown: accelerator.rst, amp.rst, autograd.rst, backends.rst, benchmark_utils.rst (#155762 ) Fixes #155013 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155762 Approved by: https://github.com/svekars	2025-06-12 02:55:06 +00:00
Justin Silver	cf9878d7a2	Fix #155022 rst to markdown conversion (#155540 ) Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) Fixes #155022 Docs comparison (check out the 'new' whenever docs build) 1. func.ux_limitations ([old](https://docs.pytorch.org/docs/main/func.ux_limitations.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155540/func.ux_limitations.html)) 2. func.whirlwind_tour ([old](https://docs.pytorch.org/docs/main/func.whirlwind_tour.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155540/func.whirlwind_tour.html)) 3. future_mod ([old](https://docs.pytorch.org/docs/main/future_mod.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155540/future_mod.html)) 4. futures ([old](https://docs.pytorch.org/docs/main/futures.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155540/futures.html)) 5. fx.experimental ([old](https://docs.pytorch.org/docs/main/fx.experimental.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155540/fx.experimental.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155540 Approved by: https://github.com/AlannaBurke, https://github.com/svekars	2025-06-12 00:21:22 +00:00
jafraustro	1b032384b1	Convert rst files to md (#155369 ) Fixes #155021 Fixes #155158 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155369 Approved by: https://github.com/svekars, https://github.com/malfet	2025-06-11 23:00:52 +00:00
loganthomas	458cc7213b	DOC: Convert to markdown: mobile_optimizer.rst, model_zoo.rst, module_tracker.rst, monitor.rst, mps_environment_variables.rst (#155702 ) Fixes #155026 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155702 Approved by: https://github.com/sekyondaMeta, https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-11 22:16:04 +00:00
Justin Silver	b7a73a2cdb	Convert to markdown: export.programming_model.rst (#155659 ) Converts only export.programming_model.rst to markdown Used [rst2myst tool](https://rst-to-myst.readthedocs.io/en/latest/) Fixes #155020, but split into a second PR to pass sanity check Docs comparison (check out the 'new' whenever docs build) 1. export.programming_model ([old](https://docs.pytorch.org/docs/main/export.programming_model.html) vs. [new](https://docs-preview.pytorch.org/pytorch/pytorch/155659/export.programming_model.html)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155659 Approved by: https://github.com/sekyondaMeta	2025-06-11 20:23:46 +00:00
Kazuaki Ishizaki	2002e3a311	[Docs] Convert to markdown: torch.compiler_transformations.rst, torch.compiler.config.rst (#155347 ) Part of changes #155040 (parent PR #155120) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155347 Approved by: https://github.com/svekars	2025-06-11 18:55:30 +00:00
Runtian (Rachel) Li	925fbfca27	Convert fx.rst to fx.md (#155482 ) Part of changes #155023 (parent PR #155429) @pytorchbot label "topic: docs" @pytorchbot label "topic: not user facing" @pytorchbot label docathon-h1-2025 @pytorchbot label module: docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/155482 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-11 18:46:35 +00:00
GdoongMathew	14f3639e09	Convert to .md: onnx_verification.rst, onnx.rst, package.rst, (#155556 ) Fixes https://github.com/pytorch/pytorch/issues/155031 * [onnx_verification.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_verification.rst) * [onnx.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx.rst) * [package.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/package.rst) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155556 Approved by: https://github.com/AlannaBurke, https://github.com/sekyondaMeta	2025-06-10 21:40:40 +00:00
nirajkamalk	ae0f1f8984	Convert to markdown onnx rst (#155228 ) Fixes #155030 Converts the following files to MyST markdown and ensure that the doc tests are green: - [x] [onnx_dynamo_onnxruntime_backend.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_dynamo_onnxruntime_backend.rst) - [x] [onnx_dynamo.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_dynamo.rst) - [x] [onnx_ops.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_ops.rst) - [onnx_torchscript_supported_aten_ops.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_torchscript_supported_aten_ops.rst) - not changed as it is autogenerated - [onnx_torchscript.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/onnx_torchscript.rst) - fixed in #155390 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155228 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-10 21:33:07 +00:00
Alberto A. Gallegos	8a396c5635	DOC: Convert to markdown: torch.compiler_best_practices_for_backends.rst, torch.compiler_cudagraph_trees.rst, torch.compiler_custom_backends.rst, torch.compiler_dynamic_shapes.rst, torch.compiler_dynamo_deepdive.rst (#155137 ) Fixes #155037 [torch.compiler_best_practices_for_backends.rst](https://github.com/pytorch/pytorch/tree/main/docs/source/torch.compiler_best_practices_for_backends.rst) shows error 404 cc @svekars @sekyondaMeta @AlannaBurke Pull Request resolved: https://github.com/pytorch/pytorch/pull/155137 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-10 20:51:05 +00:00
jafraustro	01b8f5e685	Convert to markdown: testing.rst, threading_environment_variables.rst, torch_cuda_memory.rst, torch_environment_variables.rst, torch_nccl_environment_variables.rst (#155523 ) Fixes #155035 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155523 Approved by: https://github.com/AlannaBurke, https://github.com/svekars	2025-06-10 20:38:36 +00:00
Kazuaki Ishizaki	08d15d3ec1	[Docs] Convert to markdown: torch.compiler_troubleshooting.rst (#155514 ) Part of changes #155040 (parent PR #155120) Follow-up of #155351. I split the changes of `torch.compiler_troubleshooting.rst ` into #155351 and this PR due to the 2000-line limit in one PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/155514 Approved by: https://github.com/svekars	2025-06-10 15:41:31 +00:00
Vivek Nayak	75f258dd1f	Fix spelling mistake (#155495 ) Summary: Change "primtivies" to "primitives". Test Plan: n/a Rollback Plan: Differential Revision: D76229938 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155495 Approved by: https://github.com/angelayi, https://github.com/cyyever	2025-06-10 09:06:58 +00:00
Sahdev Zala	f34335bf33	Convert compiler rst files to markdown (#155335 ) Convert following compiler rst files to md file. torch.compiler_inductor_profiling.rst torch.compiler_ir.rst torch.compiler_nn_module.rst torch.compiler_performance_dashboard.rst torch.compiler_profiling_torch_compile.rst Fixes #155039 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155335 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-10 01:12:11 +00:00
Kazuaki Ishizaki	5df3bf13ec	[Docs] Convert to markdown: torch.compiler_troubleshooting.rst (#155351 ) Part of changes #155040 (parent PR #155120) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155351 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-09 23:18:31 +00:00
Qasim Khan	82e6475d92	Add doc for missing functions for torch.special module (#155074 ) Fixes #132178 Added all the missing functions that had a docstring but were not present in the documentation Pull Request resolved: https://github.com/pytorch/pytorch/pull/155074 Approved by: https://github.com/albanD	2025-06-09 22:28:26 +00:00
Parag Ekbote	2908c10259	Document the default garbage_collection_threshold value and improve the organization of cuda docs (#155341 ) Fixes #150917 As mentioned in the issue, I've updated the documentation of `garbage_collection_threshold`and improved the organization. Could you please review? Pull Request resolved: https://github.com/pytorch/pytorch/pull/155341 Approved by: https://github.com/AlannaBurke, https://github.com/ngimel	2025-06-08 22:09:35 +00:00
Abhinav Tharamel	d41f62b7a0	Fix/issue #155027 (#155252 ) Fixes #155027 Converted RST files to Markdown Pull Request resolved: https://github.com/pytorch/pytorch/pull/155252 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-08 21:17:31 +00:00
Yuki Kobayashi	11bc29856d	Fix some incorrect reST markups in the document (#154831 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154831 Approved by: https://github.com/cyyever, https://github.com/Skylion007	2025-06-07 19:09:46 +00:00
Francisco R Castro Garcia	cd82096973	DOC: Convert to markdown: ddp_comm_hooks.rst, debugging_environment_variables.rst, deploy.rst, deterministic.rst, distributed.algorithms.join.rst (#155298 ) Fixes #155017 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155298 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-06 22:44:50 +00:00
Kazuaki Ishizaki	c95705dac2	[Docs] Convert to markdown: torch.compiler_troubleshooting_old.rst, torch.compiler.rst (#155348 ) Part of changes #155040 (parent PR #155120) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155348 Approved by: https://github.com/svekars	2025-06-06 21:26:24 +00:00
loganthomas	4f5b34427b	DOC: Convert to markdown: torch.overrides.rst, type_info.rst, utils.rst, xpu.rst (#155088 ) Fixes #155041 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155088 Approved by: https://github.com/svekars Co-authored-by: Svetlana Karslioglu <svekars@meta.com>	2025-06-06 20:16:13 +00:00
Joel Schlosser	5e93abe3c0	Address docs for clip_grad functions (#155125 ) This PR takes the opinionated stance that `torch.nn.utils.<func>` should be the preferred API over `torch.nn.utils.clip_grad.<func>`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/155125 Approved by: https://github.com/albanD, https://github.com/mikaylagawarecki, https://github.com/janeyx99	2025-06-05 19:22:09 +00:00
Jane Xu	2f3f8339ec	[BE] Document device memory apis in correct module (#155126 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155126 Approved by: https://github.com/msaroufim, https://github.com/Skylion007	2025-06-05 15:16:48 +00:00
Justin Chu	3e57de1251	[ONNX] Create support for rotary embeddings (#154745 ) This PR registers the RotaryEmbedding op in the `torch.ops.onnx` name spaces and allows the exporter to recognize and export onnx operators. ## Design ONNX operators of their respective opset version is implemented in torch/onnx/ops/_impl.py, and are registered in the torch.ops.onnx namespace following the following rule: `OpType-version => torch.ops.onnx.OpType.opset{version}` For example, `RotaryEmbedding-23` becomes `torch.ops.onnx.RotaryEmbedding.opset23` This name is parsed by the exporter to create an onnx node in the graph without having to go through translation. When users use the ops in the model, we provide more convenient, unversioned functions under `torch.onnx.ops` that will dispatch to the implementations based on user input (type and provided attributes). For example, users can directly call `torch.onnx.ops.rotary_embedding()` to use the op natively in their pytorch models. I chose snake case naming to make the functions more pythonic and aligned with other torch apis. Pull Request resolved: https://github.com/pytorch/pytorch/pull/154745 Approved by: https://github.com/titaiwangms	2025-06-04 03:07:43 +00:00
Alanna Burke	250e9af4da	Removing per torch.compile audit. (#154572 ) Removing https://pytorch.org/docs/stable/torch.compiler_best_practices_for_backends.html per torch.compile audit Pull Request resolved: https://github.com/pytorch/pytorch/pull/154572 Approved by: https://github.com/williamwen42, https://github.com/svekars	2025-06-03 15:41:52 +00:00
bobrenjc93	33f2d0ff45	add reference to stances from dynamic shapes doc (#154823 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154823 Approved by: https://github.com/Skylion007, https://github.com/williamwen42 ghstack dependencies: #154802, #154826, #154822	2025-06-02 18:47:19 +00:00
bobrenjc93	d99e9568ec	Add docs for how to mark as unbacked (#154822 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154822 Approved by: https://github.com/Skylion007 ghstack dependencies: #154802, #154826	2025-06-02 18:30:57 +00:00
bobrenjc93	9fe1b40d17	[ez] add dynamic sources docs (#154826 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154826 Approved by: https://github.com/Skylion007 ghstack dependencies: #154802	2025-06-02 17:53:30 +00:00
Nikita Shulga	0350c7e72c	[BE] Introduce torch.AcceleratorError (#152023 ) Which inherits from `RuntimeError` and contains `error_code`, which in case of CUDA should contain error returned by `cudaGetLastError` `torch::detail::_new_accelerator_error_object(c10::AcceleratorError&)` follows the pattern of CPython's [`PyErr_SetString`](`cb8a72b301/Python/errors.c (L282)`), namely - Convert cstr into Python string with `PyUnicode_FromString` - Create new exception object using `PyObject_CallOneArg` just like it's done in [`_PyErr_CreateException`](`cb8a72b301/Python/errors.c (L32)`) - Set `error_code` property using `PyObject_SetAttrString` - decref all temporary references Test that it works and captures CPP backtrace (in addition to CI) by running ```python import os os.environ['TORCH_SHOW_CPP_STACKTRACES'] = '1' import torch x = torch.rand(10, device="cuda") y = torch.arange(20, device="cuda") try: x[y] = 2 print(x) except torch.AcceleratorError as e: print("Exception was raised", e.args[0]) print("Captured error code is ", e.error_code) ``` which produces following output ``` Exception was raised CUDA error: device-side assert triggered CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. Exception raised from c10_cuda_check_implementation at /home/ubuntu/pytorch/c10/cuda/CUDAException.cpp:41 (most recent call first): C++ CapturedTraceback: #4 std::_Function_handler<std::shared_ptr<c10::LazyValue<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > const> (), c10::SetStackTraceFetcher(std::function<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > ()>)::{lambda()#1}>::_M_invoke(std::_Any_data const&) from Logging.cpp:0 #5 c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) from ??:0 #6 c10::cuda::c10_cuda_check_implementation(int, char const, char const, int, bool) [clone .cold] from CUDAException.cpp:0 #7 void at::native::gpu_kernel_impl<at::native::AbsFunctor<float> >(at::TensorIteratorBase&, at::native::AbsFunctor<float> const&) [clone .isra.0] from tmpxft_000191fc_00000000-6_AbsKernel.cudafe1.cpp:0 #8 at::native::abs_kernel_cuda(at::TensorIteratorBase&) from ??:0 #9 at::Tensor& at::native::unary_op_impl_with_complex_to_float_out<at::native::abs_stub_DECLARE_DISPATCH_type>(at::Tensor&, at::Tensor const&, at::native::abs_stub_DECLARE_DISPATCH_type&, bool) [clone .constprop.0] from UnaryOps.cpp:0 #10 at::(anonymous namespace)::(anonymous namespace)::wrapper_CUDA_out_abs_out(at::Tensor const&, at::Tensor&) from RegisterCUDA_0.cpp:0 #11 at::_ops::abs_out::call(at::Tensor const&, at::Tensor&) from ??:0 #12 at::native::abs(at::Tensor const&) from ??:0 #13 c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CompositeExplicitAutograd__abs>, at::Tensor, c10::guts::typelist::typelist<at::Tensor const&> >, at::Tensor (at::Tensor const&)>::call(c10::OperatorKernel, c10::DispatchKeySet, at::Tensor const&) from RegisterCompositeExplicitAutograd_0.cpp:0 #14 at::_ops::abs::redispatch(c10::DispatchKeySet, at::Tensor const&) from ??:0 #15 torch::autograd::VariableType::(anonymous namespace)::abs(c10::DispatchKeySet, at::Tensor const&) from VariableType_1.cpp:0 #16 c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (c10::DispatchKeySet, at::Tensor const&), &torch::autograd::VariableType::(anonymous namespace)::abs>, at::Tensor, c10::guts::typelist::typelist<c10::DispatchKeySet, at::Tensor const&> >, at::Tensor (c10::DispatchKeySet, at::Tensor const&)>::call(c10::OperatorKernel, c10::DispatchKeySet, at::Tensor const&) from VariableType_1.cpp:0 #17 at::_ops::abs::call(at::Tensor const&) from ??:0 #18 at::native::isfinite(at::Tensor const&) from ??:0 #19 c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CompositeImplicitAutograd__isfinite>, at::Tensor, c10::guts::typelist::typelist<at::Tensor const&> >, at::Tensor (at::Tensor const&)>::call(c10::OperatorKernel, c10::DispatchKeySet, at::Tensor const&) from RegisterCompositeImplicitAutograd_0.cpp:0 #20 at::_ops::isfinite::call(at::Tensor const&) from ??:0 #21 torch::autograd::THPVariable_isfinite(_object, _object, _object) from python_torch_functions_2.cpp:0 #22 PyObject_CallFunctionObjArgs from ??:0 #23 _PyObject_MakeTpCall from ??:0 #24 _PyEval_EvalFrameDefault from ??:0 #25 _PyObject_FastCallDictTstate from ??:0 #26 _PyStack_AsDict from ??:0 #27 _PyObject_MakeTpCall from ??:0 #28 _PyEval_EvalFrameDefault from ??:0 #29 _PyFunction_Vectorcall from ??:0 #30 _PyEval_EvalFrameDefault from ??:0 #31 _PyFunction_Vectorcall from ??:0 #32 _PyEval_EvalFrameDefault from ??:0 #33 _PyFunction_Vectorcall from ??:0 #34 _PyEval_EvalFrameDefault from ??:0 #35 PyFrame_GetCode from ??:0 #36 PyNumber_Xor from ??:0 #37 PyObject_Str from ??:0 #38 PyFile_WriteObject from ??:0 #39 _PyWideStringList_AsList from ??:0 #40 _PyDict_NewPresized from ??:0 #41 _PyEval_EvalFrameDefault from ??:0 #42 PyEval_EvalCode from ??:0 #43 PyEval_EvalCode from ??:0 #44 PyUnicode_Tailmatch from ??:0 #45 PyInit__collections from ??:0 #46 PyUnicode_Tailmatch from ??:0 #47 _PyRun_SimpleFileObject from ??:0 #48 _PyRun_AnyFileObject from ??:0 #49 Py_RunMain from ??:0 #50 Py_BytesMain from ??:0 #51 __libc_init_first from ??:0 #52 __libc_start_main from ??:0 #53 _start from ??:0 Captured error code is 710 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/152023 Approved by: https://github.com/eqy, https://github.com/mradmila, https://github.com/ngimel ghstack dependencies: #154436	2025-06-01 21:02:43 +00:00
Nikita Shulga	f7c09f864a	[Docs] Reformat sparse example (#154785 ) Not sure why, but rst fails to colorize multiline inputs, but works fine for single line commands Test plan: \| [Before](https://docs.pytorch.org/docs/main/sparse.html#construction) \| [After](https://docs-preview.pytorch.org/pytorch/pytorch/154785/sparse.html#construction) \| \| ------------- \| ------------- \| \| <img width="466" alt="image" src="https://github.com/user-attachments/assets/96a5c52a-1804-4d05-a5cf-c10221aaddf6" /> \| <img width="477" alt="image" src="https://github.com/user-attachments/assets/99565288-5c0b-4e8e-bd60-f016ebc207b5" /> \| Fixes https://github.com/pytorch/pytorch/issues/154779 Pull Request resolved: https://github.com/pytorch/pytorch/pull/154785 Approved by: https://github.com/janeyx99, https://github.com/Skylion007	2025-06-01 20:56:14 +00:00
Natalia Gimelshein	f01e628e3b	Resubmit Remove MemPoolContext (#154042 ) (#154746 ) Summary: Per title Test Plan: Added tests + existing tests Differential Revision: D75695030 Pull Request resolved: https://github.com/pytorch/pytorch/pull/154746 Approved by: https://github.com/malfet	2025-05-31 01:21:54 +00:00
PyTorch MergeBot	d173ba5a75	Revert "Remove MemPoolContext (#154042 )" This reverts commit `3b38989b5f`. Reverted https://github.com/pytorch/pytorch/pull/154042 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/154042#issuecomment-2921401100))	2025-05-30 06:53:37 +00:00
bobrenjc93	9c06dff1ce	[multigraph] use specializations in compile_and_call_fx_graph (#153449 ) The goal of this multigraph work is to enable a compiled region that has a single dynamo trace but multiple backend specializations. This work was inspired by vLLM which does this in a somewhat hacky way where they use a custom backend to capture a dynamo graph and then manually invoke compile_fx multiple times to get specialized graphs. There's really two parts of this work: The frontend changes: 1) we introduce an optional kwarg `specialize_on` to mark_{dynamic,unbacked} that takes in a list of specializations. I debated other methods including specifying specializations via decorators, but ultimately decided this approach was more harmonious. The big issue with decorators is the difficulty of composing well with the rest of the torch.compile ecosystem including graph breaks, lazy initialization of variable trackers and symbolic variables, etc. The backend changes (this PR): 1) We capture the backend_specialization specified in the mark_{dynamic,unbacked} API into a SymbolicContext. See changes in `/_dynamo/variables/builder.py` 2) After we are done dynamo tracing, we will lazily (more on this later) invoke `call_user_compiler` up to N + 1 times for N specializations and 1 generic graph. Under the hood this will call compile_fx, which composes nicely with both Async Compile and AOTAutogradCache. We do this by using a context manager to patch in specialization specific axioms into the ShapeEnv before invoking the user compiler. 3) When we have specializations, we install a lazy specialized dispatch function that checks each specialization and dispatches to the first one that matches. Instead of doing all of the specialization compiles up front, we do the compiles lazily. The first time a specialization is invoked, we will do the compilation and save it in a cache so subsequent invocations are fast. If none of the specializations match, we dispatch to the generic graph. I decided to do this over returning N different GuardedCodes since 1) it doesn't pollute the dynamo cache (eg. if you have 8 specializations, you would hit the cache limit) 2) it naturally incorporates the hierarchical lattice structure of the guards since the specializations are always necessarily stricter than the generic region's guards. I benchmarked this PR stack with #152596 and found around a 50% reduction when dispatching to the specialized regions: ![495269647_576053105510082_9189856138964956774_n](https://github.com/user-attachments/assets/66030fed-d62e-4d87-940f-aa13c99b1a73) Pull Request resolved: https://github.com/pytorch/pytorch/pull/153449 Approved by: https://github.com/zou3519 ghstack dependencies: #153433	2025-05-30 03:19:49 +00:00
nirajkamalk	40abb2b403	Fix deprecated amp APIs in docs (#154553 ) Update usage of deprecated amp APIs. Fixes https://github.com/pytorch/tutorials/issues/3331 Pull Request resolved: https://github.com/pytorch/pytorch/pull/154553 Approved by: https://github.com/Skylion007	2025-05-29 00:05:59 +00:00
Natalia Gimelshein	3b38989b5f	Remove MemPoolContext (#154042 ) Removes MemPoolContext from custom user mempools. The ground truth for which pool should be used is in graph_pools active pool, and MemPoolContext just introduced an opportunity for the pool pointed to by MemPoolContext and active pool in graph_pools to go out of sync (see all the asserts in the code to make sure that happens, and yet it still could happen in a multithread scenario, see my recent PRs (#153990). Pull Request resolved: https://github.com/pytorch/pytorch/pull/154042 Approved by: https://github.com/albanD, https://github.com/syed-ahmed	2025-05-28 16:35:48 +00:00
Yuki Kobayashi	f55f2f42a7	Add missing docstring for `sym_ite` (#154201 ) `sym_ite` is listed in [the reference page](https://docs.pytorch.org/docs/stable/torch.html) and has no document. Pull Request resolved: https://github.com/pytorch/pytorch/pull/154201 Approved by: https://github.com/Skylion007	2025-05-26 15:59:21 +00:00
bobrenjc93	53ecb8159a	Introduce statically_known_false (#154291 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/154291 Approved by: https://github.com/mengluy0125	2025-05-24 14:23:55 +00:00
Svetlana Karslioglu	1ab2993345	Add a link to transformer_building_blocks tutorial (#154281 ) Cross-link to https://docs.pytorch.org/tutorials/intermediate/transformer_building_blocks.html Pull Request resolved: https://github.com/pytorch/pytorch/pull/154281 Approved by: https://github.com/mikaylagawarecki	2025-05-24 02:50:24 +00:00
Svetlana Karslioglu	ec368a1903	Add sitemap (#154158 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/154158 Approved by: https://github.com/albanD	2025-05-23 18:01:00 +00:00
Shangdi Yu	04a6fe7914	Update provenance tracking doc (#154062 ) Summary: Update the doc to reflect the changes in https://github.com/pytorch/pytorch/pull/153584/files#diff-e0cdb58c0f84f56f20c5433339b6d83c470dcde47847e2328effea6bedd4cd27 and https://github.com/pytorch/tlparse/pull/110 Test Plan: CI Differential Revision: D75155981 Pull Request resolved: https://github.com/pytorch/pytorch/pull/154062 Approved by: https://github.com/svekars, https://github.com/desertfire	2025-05-23 17:09:52 +00:00
Anita Katahoire	996c4d803d	Removing conda references from PyTorch Docs (#152702 ) Addresses #148339 Pull Request resolved: https://github.com/pytorch/pytorch/pull/152702 Approved by: https://github.com/svekars, https://github.com/albanD, https://github.com/atalman	2025-05-20 20:33:28 +00:00
Svetlana Karslioglu	7c9d94e9bb	Redirect mobile_optimizer.rst to executorch (#153664 ) Redirect mobile_optimizer.rst to executorch Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/153664 Approved by: https://github.com/byjlw, https://github.com/malfet	2025-05-20 18:13:45 +00:00
Mikayla Gawarecki	6383ddcfa4	Update serialization docs (#153631 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/153631 Approved by: https://github.com/albanD	2025-05-19 20:22:07 +00:00
Angela Yi	b4fb801b2d	[export] Move PT2 constants to torch::_export (#153206 ) Test Plan: `buck2 test //sigmoid/...` https://www.internalfb.com/intern/testinfra/testrun/1970325119807758 Differential Revision: D74417085 Pull Request resolved: https://github.com/pytorch/pytorch/pull/153206 Approved by: https://github.com/zhxchen17, https://github.com/dolpm	2025-05-17 08:21:59 +00:00
Anthony Shoumikhin	7d39e73c57	Fix more URLs (#153277 ) Or ignore them. Found by running the lint_urls.sh script locally with https://github.com/pytorch/pytorch/pull/153246 Pull Request resolved: https://github.com/pytorch/pytorch/pull/153277 Approved by: https://github.com/malfet	2025-05-14 16:23:50 +00:00
angelayi	d51bc27378	[export] Make draft_export public (#153219 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/153219 Approved by: https://github.com/pianpwk	2025-05-14 02:18:36 +00:00
Svetlana Karslioglu	f136046919	Clean up right nav (#153090 ) - Move community and language binding links to the horizontal bar - Add an intro to the community page. - Fix the link in the ogp_image - Fix the link in the version switcher - Clean up unneeded links Pull Request resolved: https://github.com/pytorch/pytorch/pull/153090 Approved by: https://github.com/albanD	2025-05-12 21:00:45 +00:00
PyTorch MergeBot	fdc387ec7c	Revert "refine fp32 precision api (#125888 )" This reverts commit `4c11b26158`. Reverted https://github.com/pytorch/pytorch/pull/125888 on behalf of https://github.com/huydhn due to Sorry for reverting your change but it seems to cause some failures on ROCm ([comment](https://github.com/pytorch/pytorch/pull/125888#issuecomment-2869274791))	2025-05-11 00:35:46 +00:00
haozhe.zhu	4c11b26158	refine fp32 precision api (#125888 ) Based on the [conversation](https://github.com/pytorch/pytorch/issues/121791), we plan to drop the "highest, high, medium" to represent fp32 internal computation data types . Instead, we will directly use the algorithm to represent it. ### Design Choice: Directly use algorithms name like "TF32", "BF16". #### Pros - The names are more informative. 'tf32' is more informative than a simple "high". - Easier to extend new algorithm like `tf32x3` #### Cons - "HIGHEST, HIGH, MEDIUM" indicated the relative precision between different algorithms. However, we can have more documents to discuss them. ### We provide a layered structure for backends/operators. ('f32' is short for 'fp32_precision') ![image](https://github.com/user-attachments/assets/f89143e5-d6a1-4865-9351-9a50439f5067) ### We provide 3 fp32 compute precision can be set: - "ieee": Not allowed to use any other internal computation data types . - "tf32": Allowed to use tf32 as internal computation data types. - "bf16": Allowed to use bf16 as internal computation data types. - "none": Precision's are not set. Can be override by its father node. ### Overriding Precision Settings Child node can be override by its father node if it is set to default. For current default settings: ``` backend = generic, op = all, precision setting = none backend = cuda, op = all, precision setting = none backend = cuda, op = conv, precision setting = tf32 backend = cuda, op = rnn, precision setting = tf32 backend = cuda, op = matmul, precision setting = none backend = matmul, op = all, precision setting = none backend = matmul, op = conv, precision setting = none backend = matmul, op = rnn, precision setting = none backend = matmul, op = matmul, precision setting = none ``` - If the user set `torch.backends.mkldnn.fp32_precision="bf16"`, his child nodes `torch.backends.mkldnn.matmul.fp32_precision` / `torch.backends.mkldnn.conv.fp32_precision` / `torch.backends.mkldnn.rnn.fp32_precision` will also be override to "bf16". - If the user set `torch.backends.fp32_precision="bf16"`, `torch.backends.mkldnn.fp32_precision` and his child nodes will also we override to "bf16". ### Backward Compatible Since new API allow user to have more fine-grained control. There will be some conflict. For example, previous `torch.backends.cudnn.allow_tf32` are not enough to represent the status for `torch.backends.cudnn.rnn.fp32_precision="ieee"` and `torch.backends.cudnn.conv.fp32_precision="tf32"`. Therefore, our goal for backward compatible is - If the user only uses previous APIs, it will work as previous expectations. - If the user use new API to change the status to an un-representable status for old API, and try to access the status by old API. We will raise Runtime Error and point the document for user. ### Test Plan ``` python test/test_cuda.py -k test_fp32_precision_with_tf32 python test/test_cuda.py -k test_fp32_precision_with_float32_matmul_precision python test/test_cuda.py -k test_invalid_status_for_legacy_api python test/test_mkldnn.py -k test_mlkdnn_get_set python test/test_mkldnn.py -k test_generic_precision python test/test_mkldnn.py -k test_invalid python test/test_mkldnn.py -k test_default_use_parent ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/125888 Approved by: https://github.com/jgong5, https://github.com/albanD Co-authored-by: Jiang, Yanbing <yanbing.jiang@intel.com>	2025-05-10 11:13:04 +00:00
soulitzer	9d00f2b375	[autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/153005 Approved by: https://github.com/albanD	2025-05-09 16:36:57 +00:00
Shangdi Yu	faff387bfd	Mini tutorial for provenance tracking (#152211 ) as title Pull Request resolved: https://github.com/pytorch/pytorch/pull/152211 Approved by: https://github.com/svekars, https://github.com/eellison, https://github.com/desertfire	2025-05-09 01:41:04 +00:00
Wei Feng	5a8c9c3ab0	[FSDP2][Doc] add pointer to torchtitan (#153079 ) <img width="838" alt="Screenshot 2025-05-08 at 10 51 05 AM" src="https://github.com/user-attachments/assets/4cf43a16-3801-424b-a74f-ede1d41ff052" /> Pull Request resolved: https://github.com/pytorch/pytorch/pull/153079 Approved by: https://github.com/mori360	2025-05-08 22:22:07 +00:00
Yuxin Wu	2cf7fd0d2b	Update docs of saved_tensors_hooks to avoid ref cycle (#153049 ) Fixes #115255 Pull Request resolved: https://github.com/pytorch/pytorch/pull/153049 Approved by: https://github.com/Skylion007, https://github.com/soulitzer	2025-05-07 18:54:56 +00:00

1 2 3 4 5 ...

3202 Commits