pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Saketh Are	86399d8e0c	Add histogramdd to torch.rst (#68273 ) Summary: The `torch.histogramdd` operator is documented in `torch/functional.py` but does not appear in the generated docs because it is missing from `docs/source/torch.rst`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/68273 Reviewed By: cpuhrsch Differential Revision: D32470522 Pulled By: saketh-are fbshipit-source-id: a23e73ba336415457a30bae568bda80afa4ae3ed	2021-11-16 11:55:40 -08:00
Thomas Metcalfe	ba16b1eca7	[numpy] Alias `arctan2` to `atan2` (#67010 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/65906 Adds an alias `arctan2` to improve numpy compatibility cc mruberry rgommers Pull Request resolved: https://github.com/pytorch/pytorch/pull/67010 Reviewed By: anjali411 Differential Revision: D32378998 Pulled By: mruberry fbshipit-source-id: 424c5c10c12b49c20ee83ccd109325c480b5b6cf	2021-11-16 09:41:09 -08:00
Anirudh Dagar	b07a11929d	Array API: Add torch.linalg.cross (#63285 ) Summary: ### Create `linalg.cross` Fixes https://github.com/pytorch/pytorch/issues/62810 As discussed in the corresponding issue, this PR adds `cross` to the `linalg` namespace (Note: There is no method variant) which is slightly different in behaviour compared to `torch.cross`. Note: this is NOT an alias as suggested in mruberry's [https://github.com/pytorch/pytorch/issues/62810 comment](https://github.com/pytorch/pytorch/issues/62810#issuecomment-897504372) below > linalg.cross being consistent with the Python Array API (over NumPy) makes sense because NumPy has no linalg.cross. I also think we can implement linalg.cross without immediately deprecating torch.cross, although we should definitely refer users to linalg.cross. Deprecating torch.cross will require additional review. While it's not used often it is used, and it's unclear if users are relying on its unique behavior or not. The current default implementation of `torch.cross` is extremely weird and confusing. This has also been reported multiple times previously. (See https://github.com/pytorch/pytorch/issues/17229, https://github.com/pytorch/pytorch/issues/39310, https://github.com/pytorch/pytorch/issues/41850, https://github.com/pytorch/pytorch/issues/50273) - [x] Add `torch.linalg.cross` with default `dim=-1` - [x] Add OpInfo and other tests for `torch.linalg.cross` - [x] Add broadcasting support to `torch.cross` and `torch.linalg.cross` - [x] Remove out skip from `torch.cross` OpInfo - [x] Add docs for `torch.linalg.cross`. Improve docs for `torch.cross` mentioning `linalg.cross` and the difference between the two. Also adds a warning to `torch.cross`, that it may change in the future (we might want to deprecate it later) --- ### Additional Fixes to `torch.cross` - [x] Fix Doc for Tensor.cross - [x] Fix torch.cross in `torch/overrides.py` While working on `linalg.cross` I noticed these small issues with `torch.cross` itself. [Tensor.cross docs](https://pytorch.org/docs/stable/generated/torch.Tensor.cross.html) still mention a `dim=-1` default, which is actually wrong. It should be `dim=None` after the behaviour was updated in PR https://github.com/pytorch/pytorch/issues/17582 but the documentation for the `method` or `function` variant wasn’t updated. Later PR https://github.com/pytorch/pytorch/issues/41850 updated the documentation for the `function` variant i.e. `torch.cross` and also added the following warning about the weird behaviour. > If `dim` is not given, it defaults to the first dimension found with the size 3. Note that this might be unexpected. But still, the `Tensor.cross` docs were missed and remained outdated. I’m finally fixing that here. Also fixing `torch/overrides.py` for `torch.cross` as well now, with `dim=None`. To verify that the documented default of `dim=-1` (which should raise here) is not what actually happens, you can try the following. ```python a = torch.randn(3, 4) b = torch.randn(3, 4) b.cross(a) # this works because the implementation finds 3 in the first dimension and the default behaviour as shown in documentation is actually not true. >>> tensor([[ 0.7171, -1.1059, 0.4162, 1.3026], [ 0.4320, -2.1591, -1.1423, 1.2314], [-0.6034, -1.6592, -0.8016, 1.6467]]) b.cross(a, dim=-1) # this raises as expected since the last dimension doesn't have a 3 >>> RuntimeError: dimension -1 does not have size 3 ``` Please take a closer look (particularly the autograd part, this is the first time I'm dealing with `derivatives.yaml`). If there is something missing, wrong or needs more explanation, please let me know. Looking forward to the feedback. cc mruberry Lezcano IvanYashchuk rgommers Pull Request resolved: https://github.com/pytorch/pytorch/pull/63285 Reviewed By: gchanan Differential Revision: D32313346 Pulled By: mruberry fbshipit-source-id: e68c2687c57367274e8ddb7ef28ee92dcd4c9f2c	2021-11-11 12:49:41 -08:00
Kurt Mohler	db014b8529	Add `set_deterministic_debug_mode` and `get_deterministic_debug_mode` (#67778 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/67386 Pull Request resolved: https://github.com/pytorch/pytorch/pull/67778 Reviewed By: ngimel Differential Revision: D32310661 Pulled By: mruberry fbshipit-source-id: 300129e96ca51c22fa711182ce6a9f4d4d2ce57f	2021-11-11 12:48:29 -08:00
eqy	790763b0fe	Add an option to disable reduced precision reductions for FP16 GEMM (#67946 ) Summary: https://github.com/pytorch/pytorch/issues/67578 disabled reduced precision reductions for FP16 GEMMs. After benchmarking, we've found that this has substantial performance impacts for common GEMM shapes (e.g., those found in popular instantiations of multiheaded-attention) on architectures such as Volta. As these performance regressions may come as a surprise to current users, this PR adds a toggle to disable reduced precision reductions `torch.backends.cuda.matmul.allow_fp16_reduced_precision_reduction = ` rather than making it the default behavior. CC ngimel ptrblck stas00 Note that the behavior after the previous PR can be replicated with `torch.backends.cuda.matmul.allow_fp16_reduced_precision_reduction = False` Pull Request resolved: https://github.com/pytorch/pytorch/pull/67946 Reviewed By: zou3519 Differential Revision: D32289896 Pulled By: ngimel fbshipit-source-id: a1ea2918b77e27a7d9b391e030417802a0174abe	2021-11-09 17:27:20 -08:00
James Reed	eaf0457eef	[distributed][docs] Delete distributed optimizer section from RPC and add reference to namespace docs page (#68068 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68068 cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse SciPioneer H-Huang Test Plan: Imported from OSS Reviewed By: pritamdamania87 Differential Revision: D32286554 Pulled By: jamesr66a fbshipit-source-id: a43fe1f0cfa74721f467b128f2e878bd02f32546	2021-11-09 15:01:54 -08:00
Xiaoyu Zhang	273f7ae9b3	fx: Update fx.rst (#68043 ) Summary: When I ran this code from the documentation with PyTorch version 1.10.0, the output differed from what the documentation shows: ```python import torch import torch.fx as fx class M(torch.nn.Module): def forward(self, x, y): return x + y # Create an instance of `M` m = M() traced = fx.symbolic_trace(m) print(traced) print(traced.graph) traced.graph.print_tabular() ``` I get the result: ```shell def forward(self, x, y): add = x + y; x = y = None return add graph(): %x : [#users=1] = placeholder[target=x] %y : [#users=1] = placeholder[target=y] %add : [#users=1] = call_function[target=operator.add](args = (%x, %y), kwargs = {}) return add opcode name target args kwargs ------------- ------ ----------------------- ------ -------- placeholder x x () {} placeholder y y () {} call_function add <built-in function add> (x, y) {} output output output (add,) {} ``` This PR updates the documentation to match. Pull Request resolved: https://github.com/pytorch/pytorch/pull/68043 Reviewed By: driazati Differential Revision: D32287178 Pulled By: jamesr66a fbshipit-source-id: 48ebd0e6c09940be9950cd57ba0c03274a849be5	2021-11-09 14:00:45 -08:00
James Reed	3f048c637f	[distributed] Render `torch.distributed.optim` members (#67885 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67885 cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse SciPioneer H-Huang Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D32191952 Pulled By: jamesr66a fbshipit-source-id: a9ed52da8e89b3491eab2e691f5571338f83e8e3	2021-11-08 16:20:55 -08:00
jcwchen	5b036d5f2b	[Doc] [ONNX]Fix a broken url for ONNXRuntime custom op (#67944 ) Summary: Description Update the broken url by a valid link https://onnxruntime.ai/docs/reference/operators/add-custom-op.html. Motivation Closes https://github.com/pytorch/pytorch/issues/67849. The url is broken. Pull Request resolved: https://github.com/pytorch/pytorch/pull/67944 Reviewed By: NivekT Differential Revision: D32252880 Pulled By: H-Huang fbshipit-source-id: 400b0efa3d6f63e60b016c482fbbed1293c29806	2021-11-08 15:51:02 -08:00
andrewor	4a8f27445d	[Quant] Add dynamic QAT Linear module (#67325 ) Summary: This commit adds the `torch.nn.qat.dynamic.modules.Linear` module, the dynamic counterpart to `torch.nn.qat.modules.Linear`. Functionally these are very similar, except the dynamic version expects a memoryless observer and is converted into a dynamically quantized module before inference. Pull Request resolved: https://github.com/pytorch/pytorch/pull/67325 Test Plan: `python3 test/test_quantization.py TestQuantizationAwareTraining.test_dynamic_qat_linear` Reviewers: Charles David Hernandez, Jerry Zhang Subscribers: Charles David Hernandez, Supriya Rao, Yining Lu Tasks: 99696812 Tags: pytorch Reviewed By: malfet, jerryzh168 Differential Revision: D32178739 Pulled By: andrewor14 fbshipit-source-id: 5051bdd7e06071a011e4e7d9cc7769db8d38fd73	2021-11-08 10:24:25 -08:00
Alban Desmaison	9cdd1d7e48	Docs module check (#67440 ) Summary: Add check to make sure we do not add new submodules without documenting them in an rst file. This is especially important because our doc coverage only runs for modules that are properly listed. temporarily removed "torch" from the list to make sure the failure in CI looks as expected. EDIT: fixed now This is what a CI failure looks like for the top level torch module as an example: ![image](https://user-images.githubusercontent.com/6359743/139264690-01af48b3-cb2f-4cfc-a50f-975fca0a8140.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/67440 Reviewed By: jbschlosser Differential Revision: D32005310 Pulled By: albanD fbshipit-source-id: 05cb2abc2472ea4f71f7dc5c55d021db32146928	2021-11-01 06:24:27 -07:00
kshitij12345	510e3026a9	[numpy] add torch.argwhere (#64257 ) Summary: Adds `torch.argwhere` as an alias to `torch.nonzero`. Currently, `torch.nonzero` actually provides functionality equivalent to `np.argwhere`. From NumPy docs, > np.argwhere(a) is almost the same as np.transpose(np.nonzero(a)), but produces a result of the correct shape for a 0D array. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64257 Reviewed By: qihqi Differential Revision: D32049884 Pulled By: saketh-are fbshipit-source-id: 016e49884698daa53b83e384435c3f8f6b5bf6bb	2021-10-30 15:26:11 -07:00
Vasiliy Kuznetsov	99282126dc	pytorch quantization: document the custom module APIs (#67449 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67449 Adds a description of what the current custom module API does and API examples for Eager mode and FX graph mode to the main PyTorch quantization documentation page. Test Plan: ``` cd docs make html python -m http.server // check the docs page, it renders correctly ``` Reviewed By: jbschlosser Differential Revision: D31994641 Pulled By: vkuzo fbshipit-source-id: d35a62947dd06e71276eb6a0e37950d3cc5abfc1	2021-10-29 05:22:17 -07:00
Kenichi Maehashi	6ed68f3f84	Document `torch.jit.is_tracing()` (#67326 ) Summary: This PR adds `torch.jit.is_tracing()` to the JIT API reference. This function is widely used but left undocumented: https://github.com/search?q=torch.jit.is_tracing&type=code Pull Request resolved: https://github.com/pytorch/pytorch/pull/67326 Reviewed By: tugsbayasgalan Differential Revision: D31985251 Pulled By: Krovatkin fbshipit-source-id: 852b432b08d63df8bd7a7a02c9555e61f5f37978	2021-10-28 09:56:09 -07:00
albanD	6293e0ad61	update coverage ignore to not skip whole modules (#67395 ) Summary: This reduces the chance of newly added functions being ignored by mistake. The only test that this impacts is the coverage test that runs as part of the python doc build. So if that one works, it means that the update to the list here is correct. Pull Request resolved: https://github.com/pytorch/pytorch/pull/67395 Reviewed By: jbschlosser Differential Revision: D31991936 Pulled By: albanD fbshipit-source-id: 5b4ce7764336720827501641311cc36f52d2e516	2021-10-28 08:07:24 -07:00
Alban Desmaison	708f7b1209	Update extending doc to cover forward mode AD (#66962 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66962 Reviewed By: VitalyFedyunin Differential Revision: D31897782 Pulled By: albanD fbshipit-source-id: 64164783a14a7ed4cedc17da28f1181d9807a499	2021-10-27 14:18:38 -07:00
Nikita Shulga	b18c298f24	ONNX: Delete or document skipped ORT tests (#64470 ) (#66143 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66143 Delete test_list_remove. There's no point in testing conversion of this model since TorchScript doesn't support it. Add a link to an issue tracking test_embedding_bag_dynamic_input. [ONNX] fix docs (#65379) Mainly fix the sphinx build by inserting empty before bulleted lists. Also some minor improvements: Remove superfluous descriptions of deprecated and ignored args. The user doesn't need to know anything other than that they are deprecated and ignored. Fix custom_opsets description. Make indentation of Raises section consistent with Args section. [ONNX] publicize func for discovering unconvertible ops (#65285) * [ONNX] Provide public function to discover all unconvertible ATen ops This can be more productive than finding and fixing a single issue at a time. * [ONNX] Reorganize test_utility_funs Move common functionality into a base class that doesn't define any tests. Add a new test for opset-independent tests. This lets us avoid running the tests repeatedly for each opset. Use simple inheritance rather than the `type()` built-in. It's more readable. * [ONNX] Use TestCase assertions rather than `assert` This provides better error messages. * [ONNX] Use double quotes consistently. [ONNX] Fix code block formatting in doc (#65421) Test Plan: Imported from OSS Reviewed By: jansel Differential Revision: D31424093 fbshipit-source-id: 4ced841cc546db8548dede60b54b07df9bb4e36e	2021-10-22 13:46:16 -07:00
Nikita Shulga	7a78f715a6	[ONNX] Add warning for inplace updates on tensor.shape in tracing mode (#63170 ) (#66142 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66142 * Add warning * Lint and clang fixes * Remove duplicate comments * Added pitfalls section * Modify sections * Minor modifications * Add underline to avoid doc build failures Test Plan: Imported from OSS Reviewed By: jansel Differential Revision: D31424092 fbshipit-source-id: c83195f3c66885ad1aecde13b3029c45dd171dbd	2021-10-22 13:46:14 -07:00
Natalia Gimelshein	f29e5220a6	Revert D31474901: [pytorch][PR] [numpy] add torch.argwhere Test Plan: revert-hammer Differential Revision: D31474901 Original commit changeset: 335327a4986f fbshipit-source-id: 534093e459762ff7a888c58d76e49e362015f2ba	2021-10-21 15:50:54 -07:00
kshitij12345	462f333c01	[numpy] add torch.argwhere (#64257 ) Summary: Adds `torch.argwhere` as an alias to `torch.nonzero`. Currently, `torch.nonzero` actually provides functionality equivalent to `np.argwhere`. From NumPy docs, > np.argwhere(a) is almost the same as np.transpose(np.nonzero(a)), but produces a result of the correct shape for a 0D array. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64257 Reviewed By: dagitses Differential Revision: D31474901 Pulled By: saketh-are fbshipit-source-id: 335327a4986fa327da74e1fb8624cc1e56959c70	2021-10-21 14:02:11 -07:00
lezcano	a2e94b80fa	Create linalg.matrix_exp (#62715 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62715 Fixes https://github.com/pytorch/pytorch/issues/61648 Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D31641698 Pulled By: mruberry fbshipit-source-id: 2e2965d14807b6b4fada4b809d539066dd0ba277	2021-10-19 09:07:15 -07:00
Yukio Siraichi	8854817f44	Implement Python Array API `asarray` function. (#60627 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60627 In this PR, the core of `frombuffer` and `fromDLPack` is moved into _tensor_new.cpp_. `asarray` uses these refactored functions for interpreting the object as a tensor. We follow the Python Array API standard found at: https://data-apis.org/array-api/latest/API_specification/creation_functions.html?highlight=asarray Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D31640510 Pulled By: mruberry fbshipit-source-id: d0869e0d73cb50023d5866b001dac5d34ca30dfd	2021-10-16 21:11:31 -07:00
Vasiliy Kuznetsov	76f3b07caf	quantization docs: remove erroneous rebase artifact (#66577 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66577 There was a rebase artifact erroneously landed to quantization docs, this PR removes it. Test Plan: CI Imported from OSS Reviewed By: soulitzer Differential Revision: D31651350 fbshipit-source-id: bc254cbb20724e49e1a0ec6eb6d89b28491f9f78	2021-10-14 11:30:47 -07:00
Natalia Gimelshein	fdd9f49cf5	add a note on numerical accuracy (#65947 ) Summary: Per title Fixes https://github.com/pytorch/pytorch/issues/54437 Pull Request resolved: https://github.com/pytorch/pytorch/pull/65947 Reviewed By: albanD Differential Revision: D31612445 Pulled By: ngimel fbshipit-source-id: 5c155891a088aef3b9813f253d0dc1ee4d51ae1c	2021-10-13 12:43:55 -07:00
lezcano	82a216c45b	Add tensor.{adjoint(),H,mT,mH} methods and properties (#64179 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64179 This PR follows the discussion in https://github.com/pytorch/pytorch/issues/45063#issuecomment-904431478 Fixes https://github.com/pytorch/pytorch/issues/45063 cc ezyang anjali411 dylanbespalko mruberry Lezcano nikitaved rgommers pmeier asmeurer leofang AnirudhDagar asi1024 emcastillo kmaehashi heitorschueroff Test Plan: Imported from OSS Reviewed By: bertmaher Differential Revision: D30730483 Pulled By: anjali411 fbshipit-source-id: 821d25083f5f682450f6812bf852dc96a1cdf9f2	2021-10-13 07:44:43 -07:00
Vasiliy Kuznetsov	565cf47abf	Quantization docs: add pages for Numeric Suite (Eager and FX) (#66380 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66380 Description: 1. creates doc pages for Eager and FX numeric suites 2. adds a link from main quantization doc to (1) 3. formats docblocks in Eager NS to render well 4. adds example code and docblocks to FX numeric suite Test Plan: ``` cd docs make html python -m http.server // renders well ``` Reviewed By: jerryzh168 Differential Revision: D31543173 Pulled By: vkuzo fbshipit-source-id: feb291bcbe92747495f45165f738631fa5cbffbd	2021-10-11 18:47:58 -07:00
Vasiliy Kuznetsov	8b1258698e	Improve quantization API docs (#66379 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66379 Description: Creates a quantization API reference and fixes all the docblock errors. This is #66122 to #66210 squashed together Test Plan: ``` cd docs make html python -m http.server // open webpage, inspect it, looks good ``` Reviewed By: ejguan Differential Revision: D31543172 Pulled By: vkuzo fbshipit-source-id: 9131363d6528337e9f100759654d3f34f02142a9	2021-10-11 18:46:11 -07:00
Hong Xu	0348148725	Update link to qnnpack in quantization doc. (#66226 ) Summary: The old repo has been archived. Pull Request resolved: https://github.com/pytorch/pytorch/pull/66226 Reviewed By: vkuzo Differential Revision: D31534712 Pulled By: ezyang fbshipit-source-id: 4d7f070c8547aeb25464c72b25ed21f209821bc2	2021-10-11 08:19:19 -07:00
Mike Ruberry	9971113340	Revert D31447612: Create a documentation page for FX graph mode quantization APIs Test Plan: revert-hammer Differential Revision: D31447612 (`a89ac3138e`) Original commit changeset: 07d0a6137f15 fbshipit-source-id: f2cba7d835011500580b4ab9cff72171280ee18b	2021-10-10 01:51:13 -07:00
Mike Ruberry	b85fd4c54f	Revert D31447613: Create separate documentation pages for quantization observers and fake_quants Test Plan: revert-hammer Differential Revision: D31447613 (`f0fa3d1110`) Original commit changeset: 63b4cf518bad fbshipit-source-id: 67de592d1e12a5149cdb22b0725caad063f94476	2021-10-10 01:51:11 -07:00
Mike Ruberry	10633460ce	Revert D31447614: Create a documentation page for `torch.ao.quantization.QConfig` Test Plan: revert-hammer Differential Revision: D31447614 (`7332ed13ed`) Original commit changeset: 5d9dd2a4e864 fbshipit-source-id: 6ac15a956222ca61f7fbb75ed36bcc58b23f0f36	2021-10-10 01:51:09 -07:00
Mike Ruberry	037ac2330e	Revert D31447616: Quantization docs: consolidate all API references on a single page Test Plan: revert-hammer Differential Revision: D31447616 (`fe86f0e068`) Original commit changeset: 2f9c4dac2b2f fbshipit-source-id: 673368e87399f0a25441688bb9356de5a2f3e66e	2021-10-10 01:51:07 -07:00
Mike Ruberry	09c3e6002b	Revert D31447615: Quantization docs: rewrite API reference to be more automated Test Plan: revert-hammer Differential Revision: D31447615 (`7d2526ab20`) Original commit changeset: 09874ad9629f fbshipit-source-id: 0963c9f5118e243cd299f8cded2bf7b0848a7105	2021-10-10 01:51:05 -07:00
Mike Ruberry	df1858bea5	Revert D31447611: Quantization documentation: move backend section down Test Plan: revert-hammer Differential Revision: D31447611 (`309a8cf46c`) Original commit changeset: 537b146559bc fbshipit-source-id: c400aef9a2ea5d18f8076879fe6354be7a6732f1	2021-10-10 01:51:03 -07:00
Mike Ruberry	ad0accdecd	Revert D31447610: Quantization docs: add pages for Numeric Suite (Eager and FX) Test Plan: revert-hammer Differential Revision: D31447610 (`9539e6216b`) Original commit changeset: 441170c4a6c3 fbshipit-source-id: b49bff54405cdb8465397077e38506a36b277921	2021-10-10 01:49:19 -07:00
Vasiliy Kuznetsov	9539e6216b	Quantization docs: add pages for Numeric Suite (Eager and FX) (#66222 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66222 Description: 1. creates doc pages for Eager and FX numeric suites 2. adds a link from main quantization doc to (1) 3. formats docblocks in Eager NS to render well 4. adds example code and docblocks to FX numeric suite Test Plan: ``` cd docs make html python -m http.server // renders well ``` Reviewed By: jerryzh168 Differential Revision: D31447610 Pulled By: vkuzo fbshipit-source-id: 441170c4a6c3ddea1e7c7c5cc2f1e1cd5aa65f2f	2021-10-09 06:46:06 -07:00
Vasiliy Kuznetsov	309a8cf46c	Quantization documentation: move backend section down (#66210 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66210 Description: Moves the backend section of the quantization page further down, to ensure that the API description and reference sections are closer to the top. Test Plan: ``` cd docs make html python -m http.server // renders well ``` Reviewed By: jerryzh168 Differential Revision: D31447611 Pulled By: vkuzo fbshipit-source-id: 537b146559bce484588b3c78e6b0cdb4c274e8dd	2021-10-09 06:46:04 -07:00
Vasiliy Kuznetsov	7d2526ab20	Quantization docs: rewrite API reference to be more automated (#66201 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66201 Description: This PR switches the quantization API reference to use `autosummary` for each section. We define the sections and manually write a list of modules/functions/methods to include, and sphinx does the rest. A result is a single page where we have every quantization function and module with a quick autogenerated blurb, and user can click through to each of them for a full documentation page. This mimics how the `torch.nn` and `torch.nn.functional` doc pages are set up. In detail, for each section before this PR: * creates a new section using `autosummary` * adds all modules/functions/methods which were previously in the manual section * adds any additional modules/functions/methods which are public facing but not previously documented * deletes the old manual summary and all links to it Test Plan: ``` cd docs make html python -m http.server // renders well, links work ``` Reviewed By: jerryzh168 Differential Revision: D31447615 Pulled By: vkuzo fbshipit-source-id: 09874ad9629f9c00eeab79c406579c6abd974901	2021-10-09 06:46:02 -07:00
Vasiliy Kuznetsov	fe86f0e068	Quantization docs: consolidate all API references on a single page (#66198 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66198 Consolidates all API reference material for quantization on a single page, to reduce duplication of information. Future PRs will improve the API reference page itself. Test Plan: ``` cd docs make html python -m http.server // renders well ``` Reviewed By: jerryzh168 Differential Revision: D31447616 Pulled By: vkuzo fbshipit-source-id: 2f9c4dac2b2fb377568332aef79531d1f784444a	2021-10-09 06:46:00 -07:00
Vasiliy Kuznetsov	7332ed13ed	Create a documentation page for `torch.ao.quantization.QConfig` (#66129 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66129 Adds a documentation page for `torch.ao.quantization.QConfig`. It is useful for this to have a separate page since it shared between Eager and FX graph mode quantization. Also, ensures that all important functions and module attributes in this module have docstrings, so users can discover these without reading the source code. Test Plan: ``` cd docs make html python -m http.server // open webpage, inspect it, renders correctly ``` Reviewed By: jerryzh168 Differential Revision: D31447614 Pulled By: vkuzo fbshipit-source-id: 5d9dd2a4e8647fa17b96cefbaae5299adede619c	2021-10-09 06:45:58 -07:00
Vasiliy Kuznetsov	f0fa3d1110	Create separate documentation pages for quantization observers and fake_quants (#66125 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66125 Before this PR, the documentation for observers and fake_quants was inlined in the Eager mode quantization page. This was hard to discover, especially since that page is really long, and we now have FX graph mode quantization reusing all of this code. This PR moves observers and fake_quants into their own documentation pages. It also adds docstrings to all user facing module attributes such as the default observers and fake_quants, so people can discover them from documentation without having to inspect the source code. For now, enables autoformatting (which means all public classes, functions, members with docstrings will get docs). If we need to exclude something in these files from docs in the future, we can go back to manual docs. Test Plan: ``` cd docs make html python -m http.server // inspect docs on localhost, renders correctly ``` Reviewed By: dagitses Differential Revision: D31447613 Pulled By: vkuzo fbshipit-source-id: 63b4cf518badfb29ede583a5c2ca823f572c8599	2021-10-09 06:45:56 -07:00
Vasiliy Kuznetsov	a89ac3138e	Create a documentation page for FX graph mode quantization APIs (#66122 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66122 Description: Adds a documentation page for FX graph mode quantization APIs which reads from the docstrings in `quantize_fx`, and links it from the main quantization documentation page. Also, updates the docstrings in `quantize_fx` to render well with reStructuredText. Test Plan: ``` cd docs make html python -m http.server // open webpage, inspect it, looks good ``` Reviewed By: dagitses Differential Revision: D31447612 Pulled By: vkuzo fbshipit-source-id: 07d0a6137f1537af82dce0a729f9617efaa714a0	2021-10-09 06:44:38 -07:00
Edward Yang	11bc435622	Allow registration of custom symbolics for prim namespace (#64460 ) (#66139 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66139 [ONNX] Add prim::PythonOp check back in export.cpp (#64944) Add prim::PythonOp check back in export.cpp Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D31424102 fbshipit-source-id: 6d2eef767fab846ed79ea509e97b714072bac9f4 Co-authored-by: jiafatom <jiafa@microsoft.com>	2021-10-08 07:41:06 -07:00
Peter Bell	2213c463ba	C++ API and docs for hfftn (#66127 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66127 cc mruberry peterbell10 Test Plan: Imported from OSS Reviewed By: dagitses Differential Revision: D31450216 Pulled By: mruberry fbshipit-source-id: 2878aee294aa7d74482b66d536258bac0541408d	2021-10-07 12:48:36 -07:00
Thiago Crepaldi	8d435877d5	Fix typos at ONNX docs (#66090 ) Summary: This PR fixes small typos at ONNX docs Pull Request resolved: https://github.com/pytorch/pytorch/pull/66090 Reviewed By: albanD Differential Revision: D31385765 Pulled By: ezyang fbshipit-source-id: f4879069a2acf9c8adaa81c26a6a5014634761f5	2021-10-05 21:11:47 -07:00
Michael Suo	ad889d0b5e	Revert D30634700: [pytorch][PR] Fix typo in tensor docs Test Plan: revert-hammer Differential Revision: D30634700 (`d937473709`) Original commit changeset: e8952be20966 fbshipit-source-id: b18694e332023abcdf17ec1900b81b00d21f1014	2021-10-01 15:23:38 -07:00
Akshit Khurana	d937473709	Fix typo in tensor docs (#64160 ) Summary: Remove extra character from `torch.qfint32` Pull Request resolved: https://github.com/pytorch/pytorch/pull/64160 Test Plan: Docs Reviewed By: jerryzh168 Differential Revision: D30634700 Pulled By: axitkhurana fbshipit-source-id: e8952be20966b9a3f9d62d9957ae255d5d4889bb	2021-10-01 14:57:55 -07:00
kshitij12345	c1447f06a8	[special] special alias for softmax (#62251 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/62251 Reviewed By: H-Huang Differential Revision: D31141834 Pulled By: mruberry fbshipit-source-id: aecaf62af248e9034ef589159ce0fb325c729493	2021-10-01 03:55:32 -07:00
BowenBao	89cbe6229d	[ONNX] Update doc and error message for indexing export (#64290 ) (#64579 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64579 Added suggested workarounds into indexing section of onnx export documentation. Update indexing export warning message with link to documentation. Test Plan: Imported from OSS Reviewed By: jansel Differential Revision: D30919603 Pulled By: malfet fbshipit-source-id: 7fe65cb5aa7de4f7d93ff05011ba22f5adb27811 Co-authored-by: BowenBao <bowbao@microsoft.com>	2021-09-30 21:08:56 -07:00
Kiuk Chung	3900509b7d	(torchelastic) make --max_restarts explicit in the quickstart and runner docs (#65838 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65838 closes https://github.com/pytorch/pytorch/pull/65675 The default `--max_restarts` for `torch.distributed.run` was changed to `0` from `3` to make things backwards compatible with `torch.distributed.launch`. Since the default `--max_restarts` used to be greater than `0` we never documented passing `--max_restarts` explicitly in any of our example code. Test Plan: N/A doc change only Reviewed By: d4l3k Differential Revision: D31279544 fbshipit-source-id: 98b31e6a158371bc56907552c5c13958446716f9	2021-09-29 19:29:01 -07:00
Michael Suo	cd2656a2e5	[package] add some docs describing how to debug dependencies (#65704 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65704 As title. Test Plan: Imported from OSS Reviewed By: tugsbayasgalan Differential Revision: D31209866 Pulled By: suo fbshipit-source-id: 4c8ec1d5418ea75b71c4b9a498b86f0ef5383544	2021-09-27 12:14:23 -07:00
Yi Wang	7f25c3e666	Update distributed.rst to show that CUDA send/recv on GPU is supported (#65601 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65601 I believe this feature was supported one year ago: https://github.com/pytorch/pytorch/pull/44921 #Closes: https://github.com/pytorch/pytorch/issues/65525 ghstack-source-id: 138918961 Test Plan: N/A Reviewed By: pritamdamania87, mingzhe09088 Differential Revision: D31163535 fbshipit-source-id: 9321a0a5137a3e265e2b54bd78730ac28c7acd55	2021-09-24 12:30:10 -07:00
BowenBao	9323ea2195	[ONNX] minor doc improvements and cleanup (#62514 ) (#64373 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64373 * Fix some bad formatting and clarify things in onnx.rst. * In `export_to_pretty_string`: * Add documentation for previously undocumented args. * Document that `f` arg is ignored and mark it deprecated. * Update tests to stop setting `f`. * Warn if `_retain_param_name` is set. * Use double quotes for string literals in test_operators.py. Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D30905271 Pulled By: malfet fbshipit-source-id: 3627eeabf40b9516c4a83cfab424ce537b36e4b3	2021-09-23 22:20:44 -07:00
Tingting Markstrum	2a0208f4dc	fixed comments referring fairscale master branch (#65531 ) Summary: replace comments referring fairscale master branch with main branch Pull Request resolved: https://github.com/pytorch/pytorch/pull/65531 Test Plan: buck build cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse SciPioneer H-Huang gcramer23 Reviewed By: H-Huang, anj-s Differential Revision: D31132552 Pulled By: tmarkstrum fbshipit-source-id: d3ee8920ab5cccad99f640934c21e8eee022e9b9	2021-09-23 14:37:58 -07:00
Rodrigo Berriel	7e772e7685	Update link to tutorial on defining NN modules (#65534 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/65527. Please, see my comment in the issue: https://github.com/pytorch/pytorch/issues/65527#issuecomment-925863193. The file was renamed in `ce58d5904c (diff-e5ef486bd89eb38de15752211d9437953681b8caa8f44d7c86bb820d13151df2)`, but the link in this repository was not updated. It doesn't change the fact that the old link is still working, but I guess this has to be fixed in [pytorch/tutorials](https://github.com/pytorch/tutorials) instead of here. Pull Request resolved: https://github.com/pytorch/pytorch/pull/65534 Reviewed By: soulitzer Differential Revision: D31144269 Pulled By: H-Huang fbshipit-source-id: f70744a21113b7dc84510e2992d87f0fed793985	2021-09-23 11:26:50 -07:00
Rodrigo Berriel	11ca641491	[docs] Add images to some activation functions (#65415 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/65368. See discussion in the issue. cc mruberry SsnL jbschlosser soulitzer Pull Request resolved: https://github.com/pytorch/pytorch/pull/65415 Reviewed By: soulitzer Differential Revision: D31093303 Pulled By: albanD fbshipit-source-id: 621c74c7a2aceee95e3d3b708c7f1a1d59e59b93	2021-09-22 11:05:29 -07:00
Rodrigo Berriel	00b732e98b	Remove orphan from cuDNN persistent note (#65160 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/60009. As the document is properly [included](https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/rnn.py#L799), and [`:orphan:` doesn't need to be used in included documents](https://github.com/sphinx-doc/sphinx/issues/6787#issuecomment-549256840), and no warning is emitted in my local build when removing it, I think it can be removed. The artifact reported in https://github.com/pytorch/pytorch/issues/60009 can be seen in 3 pages: [torch.nn.RNN](https://pytorch.org/docs/stable/generated/torch.nn.RNN.html#torch.nn.RNN), [torch.nn.LSTM](https://pytorch.org/docs/stable/generated/torch.nn.LSTM.html#torch.nn.LSTM), and [torch.nn.GRU](https://pytorch.org/docs/stable/generated/torch.nn.GRU.html#torch.nn.GRU). cc ezyang suo Pull Request resolved: https://github.com/pytorch/pytorch/pull/65160 Reviewed By: bdhirsh Differential Revision: D31020280 Pulled By: ezyang fbshipit-source-id: 6c3541e5a856a91cf1ce1d2db4d04f5d13118ee4	2021-09-21 11:09:47 -07:00
Rodrigo Berriel	f0ada4bd54	[docs] Remove .data from some docs (#65358 ) Summary: Related to https://github.com/pytorch/pytorch/issues/30987. Fix the following task: - [ ] Remove the use of `.data` in all our internal code: - [ ] ... - [x] `docs/source/scripts/build_activation_images.py` and `docs/source/notes/extending.rst` In `docs/source/scripts/build_activation_images.py`, I used `nn.init` because the snippet already assumes `nn` is available (the class inherits from `nn.Module`). cc albanD Pull Request resolved: https://github.com/pytorch/pytorch/pull/65358 Reviewed By: malfet Differential Revision: D31061790 Pulled By: albanD fbshipit-source-id: be936c2035f0bdd49986351026fe3e932a5b4032	2021-09-21 06:32:31 -07:00
Michael Carilli	e3210ca184	[CUDA graphs] Beta, not prototype (#65247 ) Summary: Powers have decided this API should be listed as beta. Pull Request resolved: https://github.com/pytorch/pytorch/pull/65247 Reviewed By: malfet Differential Revision: D31057940 Pulled By: ngimel fbshipit-source-id: 137b63cbd2c7409fecdc161a22135619bfc96bfa	2021-09-20 13:32:36 -07:00
albanD	473e55d5b2	Use classmethods for overrides (#64841 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64841 Test Plan: Imported from OSS Reviewed By: heitorschueroff Differential Revision: D30991424 Pulled By: albanD fbshipit-source-id: 551e2119768f3a4292713f3bfa83930f5506adbd	2021-09-17 08:32:49 -07:00
Jane Xu	4c4c03124b	Remove old references to 9.2 in documentation (#65059 ) Summary: Removes references in .rst and README.md and comments in the Dockerfile Pull Request resolved: https://github.com/pytorch/pytorch/pull/65059 Reviewed By: malfet Differential Revision: D30961110 Pulled By: janeyx99 fbshipit-source-id: 702a9a81bf08125ec4ac38bc656fc2c128c30018	2021-09-16 13:24:05 -07:00
BowenBao	6512838fab	[ONNX] Enhance shape (two changes merged) (#64585 ) Summary: Enhanced shape inference by introducing typeReliableMap. [ONNX] exporter changes for torch hub models (https://github.com/pytorch/pytorch/issues/62856) Pull Request resolved: https://github.com/pytorch/pytorch/pull/64585 Reviewed By: ezyang Differential Revision: D30870418 Pulled By: msaroufim fbshipit-source-id: 87a294799cb87d649d1d13b6114a5cfbac9be15c Co-authored-by: jiafatom <jiafa@microsoft.com>	2021-09-15 13:02:19 -07:00
Michael Carilli	36cac2be4d	[CUDA graphs] moves memory sharing intro paragraph (#64996 ) Summary: Puts memory sharing intro under Sharing memory... header, where it should have been all along. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64996 Reviewed By: mruberry Differential Revision: D30948619 Pulled By: ngimel fbshipit-source-id: 5d9dd267b34e9d3fc499d4738377b58a22da1dc2	2021-09-14 17:53:43 -07:00
Xiaoyu Zhang	d932ddd24b	fix quantization.rst doc (#64802 ) Summary: As title. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64802 Reviewed By: jbschlosser Differential Revision: D30887210 Pulled By: vkuzo fbshipit-source-id: 0267883d3065d724ea654a28db78f5fe5702ef06	2021-09-13 07:19:54 -07:00
Heitor Schueroff	b37503e452	Initial implementation of nanmean (#62671 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62671 Very crude first implementation of `torch.nanmean`. The current reduction kernels do not have good support for implementing nan* variants. Rather than implementing new kernels for each nan* operator, I will work on new reduction kernels with support for a `nan_policy` flag and then I will port `nanmean` to use that. TODO - [x] Fix autograd issue Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D30515181 Pulled By: heitorschueroff fbshipit-source-id: 303004ebd7ac9cf963dc4f8e2553eaded5f013f0	2021-09-13 05:53:58 -07:00
Ilqar Ramazanli	2b41bf40c5	To add SequentialLR to PyTorch Core Schedulers (#64037 ) Summary: Partially resolves https://github.com/pytorch/vision/issues/4281 In this PR we propose a new scheduler, SequentialLR, which allows a list of different schedulers to be called in different periods of the training process. The main motivation for this scheduler is the recent popularity of a warm-up phase at the start of training. It has been shown that taking small steps in the initial stages of training can help the convergence procedure get faster. With SequentialLR we can use a small constant (or linearly increasing) learning rate followed by the actual target learning rate scheduler.
```python
from torch.optim.lr_scheduler import ConstantLR, ExponentialLR, SequentialLR

# `optimizer`, `train` and `validate` are placeholders defined elsewhere.
scheduler1 = ConstantLR(optimizer, factor=0.1, total_iters=2)
scheduler2 = ExponentialLR(optimizer, gamma=0.9)
scheduler = SequentialLR(optimizer, schedulers=[scheduler1, scheduler2], milestones=[5])
for epoch in range(100):
    train(...)
    validate(...)
    scheduler.step()
```
This code snippet calls `ConstantLR` in the first 5 epochs and follows up with `ExponentialLR` in the following epochs. This scheduler can be used to call any group of schedulers one after another. The main consideration is that every time we switch to a new scheduler, we assume that the new scheduler starts from the beginning, i.e. its zeroth epoch. We also add Chained Scheduler to `optim.rst` and `lr_scheduler.pyi` files here. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64037 Reviewed By: albanD Differential Revision: D30841099 Pulled By: iramazanli fbshipit-source-id: 94f7d352066ee108eef8cda5f0dcb07f4d371751	2021-09-09 09:36:32 -07:00
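The switching behaviour described in the SequentialLR commit above can be sketched in plain Python, without PyTorch. This is a hypothetical model and not the code added by the PR: the helper names (`constant_factor`, `exponential_factor`, `sequential_lr`) and the base learning rate are made up for illustration, and the sketch assumes, as the commit message states, that each scheduler restarts from its own zeroth epoch when it takes over.

```python
from bisect import bisect_right

BASE_LR = 0.1  # illustrative base learning rate, not taken from the PR


def constant_factor(epoch, factor=0.1, total_iters=2):
    """Constant warm-up multiplier: `factor` for the first `total_iters` epochs, then 1."""
    return factor if epoch < total_iters else 1.0


def exponential_factor(epoch, gamma=0.9):
    """Exponential decay multiplier: gamma ** epoch."""
    return gamma ** epoch


def sequential_lr(epoch, schedules, milestones, base_lr=BASE_LR):
    """Pick the schedule active at `epoch` and evaluate it on its own local epoch."""
    idx = bisect_right(milestones, epoch)          # index of the active schedule
    start = milestones[idx - 1] if idx > 0 else 0  # global epoch at which it took over
    return base_lr * schedules[idx](epoch - start)


schedules = [constant_factor, exponential_factor]
milestones = [5]
lrs = [sequential_lr(e, schedules, milestones) for e in range(8)]
for epoch, lr in enumerate(lrs):
    print(epoch, round(lr, 4))
```

With these assumed settings the first schedule is used for epochs 0 to 4 and the second from epoch 5 on, evaluated at local epochs 0, 1, 2 and so on.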
kshitij12345	2c351c76e0	[special] Alias igamma, igammac to special.gammainc, special.gammaincc (#61902 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Also added relevant OpInfo TODO: * [x] Check rendered docs gammainc : https://docs-preview.pytorch.org/61902/special.html#torch.special.gammainc * [x] Check rendered docs gammaincc: https://docs-preview.pytorch.org/61902/special.html#torch.special.gammaincc Pull Request resolved: https://github.com/pytorch/pytorch/pull/61902 Reviewed By: ngimel Differential Revision: D30761428 Pulled By: mruberry fbshipit-source-id: 06a16432873357958d53364f12a4e91c29779d26	2021-09-07 15:31:26 -07:00
Anirudh Dagar	337c71be05	Array API: Add `torch.linalg.matmul` alias to `torch.matmul` (#63227 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/62811 Add `torch.linalg.matmul` alias to `torch.matmul`. Note that the `linalg.matmul` doesn't have a `method` variant. Also cleaning up `torch/_torch_docs.py` when formatting is not needed. cc IvanYashchuk Lezcano mruberry rgommers Pull Request resolved: https://github.com/pytorch/pytorch/pull/63227 Reviewed By: mrshenli Differential Revision: D30770235 Pulled By: mruberry fbshipit-source-id: bfba77dfcbb61fcd44f22ba41bd8d84c21132403	2021-09-07 12:35:32 -07:00
Ilqar Ramazanli	f767cf6683	To change WarmUp Scheduler with ConstantLR and LinearLR (#64395 ) Summary: Partially unblocks https://github.com/pytorch/vision/issues/4281 Previously we added WarmUp Schedulers to PyTorch Core in the PR https://github.com/pytorch/pytorch/pull/60836, which had two modes of execution, linear and constant, depending on the warm-up function. In this PR we change this interface to a more direct form by separating the linear and constant modes into separate schedulers. In particular
```python
scheduler1 = WarmUpLR(optimizer, warmup_factor=0.1, warmup_iters=5, warmup_method="constant")
scheduler2 = WarmUpLR(optimizer, warmup_factor=0.1, warmup_iters=5, warmup_method="linear")
```
will look like
```python
scheduler1 = ConstantLR(optimizer, warmup_factor=0.1, warmup_iters=5)
scheduler2 = LinearLR(optimizer, warmup_factor=0.1, warmup_iters=5)
```
correspondingly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64395 Reviewed By: datumbox Differential Revision: D30753688 Pulled By: iramazanli fbshipit-source-id: e47f86d12033f80982ddf1faf5b46873adb4f324	2021-09-07 08:42:31 -07:00
Anirudh Dagar	1a1fb31cfa	Support `torch.concat` alias, add `cat` OpInfo & remove OpInfo test_out skips {cat, stack, hstack, vstack, dstack} (#62560 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/61767 ## Changes - [x] Add `torch.concat` alias to `torch.cat` - [x] Add OpInfo for `cat`/`concat` - [x] Fix `test_out` skips (Use `at::native::resize_output` or `at::native::resize_output_check`) - [x] `cat`/`concat` - [x] `stack` - [x] `hstack` - [x] `dstack` - [x] `vstack`/`row_stack` - [x] Remove redundant tests for `cat`/`stack` ~I've not added `cat`/`concat` to OpInfo `op_db` yet, since cat is a little more tricky than other OpInfos (should have a lot of tests) and currently there are no OpInfos for that. I can try to add that in a subsequent PR or maybe here itself, whatever is suggested.~ Edit: cat/concat OpInfo has been added. Note: I've added the named tensor support for `concat` alias as well, maybe that's out of spec in `array-api` but it is still useful for consistency in PyTorch. Thanks to krshrimali for guidance on my first PR :)) cc mruberry rgommers pmeier asmeurer leofang AnirudhDagar asi1024 emcastillo kmaehashi heitorschueroff krshrimali Pull Request resolved: https://github.com/pytorch/pytorch/pull/62560 Reviewed By: saketh-are Differential Revision: D30762069 Pulled By: mruberry fbshipit-source-id: 6985159d1d9756238890488a0ab3ae7699d94337	2021-09-06 23:57:18 -07:00
Chris Cai	008bf6689b	Back out "D30740897 Add fusion enabled apis" (#64500 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64500 D30740897 (`39aeb3bf63`) broke caffe2/torch/fb/module_factory/optimizers/tests:test_full_sync_optimizer_needed_coverage (https://fburl.com/test/mb46jxon) and blocked training_platform_unit_tests {F660271297} multisect results confirm this: ``` multisect --config FBCODE_TEST bisect 844424966128796 --workers 16 revisions --begin 09629edc --end fc86b434 D30740897 (`39aeb3bf63`) ``` {F660271232} Test Plan: ``` buck test mode/opt //caffe2/torch/fb/module_factory/optimizers/tests:test_full_sync_optimizer_needed_coverage Started reporting to test run: https://www.internalfb.com/intern/testinfra/testrun/4785074671474181 ✓ Pass: caffe2/torch/fb/module_factory/optimizers/tests:test_full_sync_optimizer_needed_coverage - main (3.729) Summary Pass: 1 ``` Differential Revision: D30753916 fbshipit-source-id: 302fd4113ef1f3069846be03edc2300d82b66719	2021-09-04 20:55:58 -07:00
Ansley Ussery	6831d8e379	Support Union in TorchScript (#64234 ) Summary: This PR is created to replace https://github.com/pytorch/pytorch/pull/53180 PR stack, which has all the review discussions. Reason for needing a replacement is due to a messy Sandcastle issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64234 Reviewed By: gmagogsfm Differential Revision: D30656444 Pulled By: ansley fbshipit-source-id: 77536c8bcc88162e2c72636026ca3c16891d669a	2021-09-03 06:12:24 -07:00
Elias Ellison	39aeb3bf63	Add fusion enabled apis (#64429 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64429 Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D30740897 Pulled By: eellison fbshipit-source-id: 446aa63b5d763f1cfffea62547db7294368e3438	2021-09-02 22:19:09 -07:00
Elias Ellison	7031fbdc63	update optimize_for_inference docs (#64428 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64428 Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D30740898 Pulled By: eellison fbshipit-source-id: b94d2c3deb661a6ba048f19e8c1d5e1799667eeb	2021-09-02 22:17:58 -07:00
Edward Yang	71e149834b	Add a warning about DataLoader num_workers > 0 "memory leak" (#64337 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64337 See https://github.com/pytorch/pytorch/issues/13246 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D30690320 Pulled By: ezyang fbshipit-source-id: 2751aca05a94e63d25162599f458855988516fad	2021-09-01 21:49:41 -07:00
Yi Wang	778af56504	[DDP Comm Hook] Add debugging communication hooks to ddp_comm_hooks.rst (#64352 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64352 as title ghstack-source-id: 137246253 Test Plan: N/A Reviewed By: rohan-varma Differential Revision: D30694089 fbshipit-source-id: a78110b11d59bb0718f43c99ede23f2fd8ab21d0	2021-09-01 17:37:19 -07:00
Yi Wang	a8f9aab840	[DDP Comm Hook] Add bf16 gradient compression to ddp_comm_hooks.rst (#64346 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64346 as title ghstack-source-id: 137170288 Test Plan: N/A Reviewed By: rohan-varma Differential Revision: D30693513 fbshipit-source-id: 8c64b8404ff3b0322e1bbbd93f6ef051ea91307d	2021-09-01 16:34:00 -07:00
Michael Carilli	8d08b103be	[CUDA graphs] Prototype API and documentation (#63269 ) Summary: RFC: https://github.com/pytorch/pytorch/issues/61880 Pull Request resolved: https://github.com/pytorch/pytorch/pull/63269 Reviewed By: mruberry Differential Revision: D30596643 Pulled By: ngimel fbshipit-source-id: b1f8061406364b667e2c2d4d30fbce1f0d8456be	2021-08-31 13:34:23 -07:00
Raghuraman Krishnamoorthi	347ef69529	[ao][docs] Clarify operator support for quantization (#63270 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63270 Add table to quantization main page showing supported modules for static and dynamic quantization. ghstack-source-id: 137087204 Test Plan: Imported from OSS Reviewed By: HDCharles Differential Revision: D30658654 fbshipit-source-id: a82c998e1db6370596d5b0ca4c7cc96c1c90f30e	2021-08-31 12:32:47 -07:00
Raghuraman Krishnamoorthi	b9275a4003	[ao][docs] Add description of qconfig and qengine to quantization page (#63582 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63582 Current quantization docs do not define qconfig and qengine. Added text to define these concepts before they are used. ghstack-source-id: 137051719 Test Plan: Imported from OSS Reviewed By: HDCharles Differential Revision: D30658656 fbshipit-source-id: a45a0fcdf685ca1c3f5c3506337246a430f8f506	2021-08-31 10:33:07 -07:00
oleshp	93f1090267	Update contribution_guide.rst (#64142 ) Summary: Grammatical update. Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/64142 Reviewed By: mruberry Differential Revision: D30639394 Pulled By: ezyang fbshipit-source-id: cf1a4dfbd8e34b0772f1b09f5d820278e8ef8574	2021-08-30 19:26:59 -07:00
lezcano	f3e329cbec	Implements the orthogonal parametrization (#62089 ) Summary: Implements an orthogonal / unitary parametrisation. It passes the tests and I have trained a couple of models with this implementation, so I believe it should be somewhat correct. Now, the implementation is very subtle. I'm tagging nikitaved and IvanYashchuk as reviewers in case they have comments or see some room for optimisation of the code, in particular of the `forward` function. Fixes https://github.com/pytorch/pytorch/issues/42243 Pull Request resolved: https://github.com/pytorch/pytorch/pull/62089 Reviewed By: ezyang Differential Revision: D30639063 Pulled By: albanD fbshipit-source-id: 988664f333ac7a75ce71ba44c8d77b986dff2fe6	2021-08-30 13:12:07 -07:00
Kushashwa Ravi Shrimali	d37636901e	[Doc] `make_tensor` to `torch.testing` module (#63925 ) Summary: This PR aims to add `make_tensor` to the `torch.testing` module in PyTorch docs. TODOs: * [x] Add examples cc: pmeier mruberry brianjo Pull Request resolved: https://github.com/pytorch/pytorch/pull/63925 Reviewed By: ngimel Differential Revision: D30633487 Pulled By: mruberry fbshipit-source-id: 8e5a1f880c6ece5925b4039fee8122bd739538af	2021-08-30 12:25:40 -07:00
Mike Ruberry	29ad84f252	Removes beta warning from the special module documentation (#64148 ) Summary: Updates documentation per feature review. torch.special is now stable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64148 Reviewed By: ngimel Differential Revision: D30632049 Pulled By: mruberry fbshipit-source-id: 8f6148ec7737e7b3a90644eeca23eb217eda513d	2021-08-29 19:38:46 -07:00
Joel Schlosser	196fd3ee7a	Modules note v2 (#63963 ) Summary: This PR expands the [note on modules](https://pytorch.org/docs/stable/notes/modules.html) with additional info for 1.10. It adds the following: * Examples of using hooks * Examples of using apply() * Examples for ParameterList / ParameterDict * register_parameter() / register_buffer() usage * Discussion of train() / eval() modes * Distributed training overview / links * TorchScript overview / links * Quantization overview / links * FX overview / links * Parametrization overview / link to tutorial Pull Request resolved: https://github.com/pytorch/pytorch/pull/63963 Reviewed By: albanD Differential Revision: D30606604 Pulled By: jbschlosser fbshipit-source-id: c1030b19162bcb5fe7364bcdc981a2eb6d6e89b4	2021-08-27 11:30:18 -07:00
Can Balioglu	65e6194aeb	Introduce the torchrun entrypoint (#64049 ) Summary: This PR introduces a new `torchrun` entrypoint that simply "points" to `python -m torch.distributed.run`. It is shorter and less error-prone to type and gives a nicer syntax than a rather cryptic `python -m ...` command line. Along with the new entrypoint the documentation is also updated and places where `torch.distributed.run` are mentioned are replaced with `torchrun`. cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse agolynski SciPioneer H-Huang mrzzd cbalioglu gcramer23 Pull Request resolved: https://github.com/pytorch/pytorch/pull/64049 Reviewed By: cbalioglu Differential Revision: D30584041 Pulled By: kiukchung fbshipit-source-id: d99db3b5d12e7bf9676bab70e680d4b88031ae2d	2021-08-26 20:17:48 -07:00
Kiuk Chung	9d95d48567	(torch.distributed) Add torch.distributed.is_torchelastic_launched() util method + make init_method=tcp:// compatible with torchelastic (#63910 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63910 Addresses the current issue that `init_method=tcp://` is not compatible with `torch.distributed.run` and `torch.distributed.launch`. When running with a training script that initializes the process group with `init_method=tcp://localhost:$port` as such:
```
$ python -u -m torch.distributed.run --max_restarts 0 --nproc_per_node 1 --nnodes 1 --master_addr $(hostname) --master_port 6000 ~/tmp/test.py
```
An `Address in use` error is raised since the training script tries to create a TCPStore on port 6000, which is already taken since the elastic agent is already running a TCPStore on that port. For details see: https://github.com/pytorch/pytorch/issues/63874. This change does a couple of things:
1. Adds `is_torchelastic_launched()` check function that users can use in the training scripts to see whether the script is launched via torchelastic.
2. Updates the `torch.distributed` docs page to include the new `is_torchelastic_launched()` function.
3. Makes `init_method=tcp://` torchelastic compatible by modifying `_tcp_rendezvous_handler` in `torch.distributed.rendezvous` (this is NOT the elastic rendezvous, it is the old rendezvous module which is slated for deprecation in future releases) to check `is_torchelastic_launched()` AND `torchelastic_use_agent_store()` and if so, only create TCPStore clients (no daemons, not even for rank 0).
4. Adds a bunch of unittests to cover the different code paths.
NOTE: the issue mentions that we should fail-fast with an assertion on `init_method!=env://` when `is_torchelastic_launched()` is `True`. There are three registered init_methods in pytorch: env://, tcp://, file://. Since this diff makes tcp:// compatible with torchelastic and I've validated that file:// is compatible with torchelastic, there is no need to add assertions. I did update the docs to point out that env:// is the RECOMMENDED init_method. We should probably deprecate the other init_methods in the future but this is out of scope for this issue. Test Plan: Unittests. Reviewed By: cbalioglu Differential Revision: D30529984 fbshipit-source-id: 267aea6d4dad73eb14a2680ac921f210ff547cc5	2021-08-25 22:57:43 -07:00
Joseph Spisak	b629ea4620	Update persons_of_interest.rst (#63907 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/63907 Reviewed By: jspisak Differential Revision: D30534972 Pulled By: dzhulgakov fbshipit-source-id: ba726fc53e292a362c387cc8b5f7776ca2a2544c	2021-08-25 22:50:54 -07:00
Jithun Nair	730ce29baf	Add note on ifdefing based on CUDA_VERSION for ROCm path (#62850 ) Summary: CUDA_VERSION and HIP_VERSION follow very unrelated versioning schemes, so it does not make sense to use CUDA_VERSION to determine the ROCm path. This note explicitly addresses it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/62850 Reviewed By: mruberry Differential Revision: D30547562 Pulled By: malfet fbshipit-source-id: 02990fa66a88466c2330ab85f446b25b78545150	2021-08-25 15:02:03 -07:00
Jithun Nair	726fd26b3e	Update ROCm PyTorch persons of interest (#55206 ) Summary: cc jeffdaily sunway513 Pull Request resolved: https://github.com/pytorch/pytorch/pull/55206 Reviewed By: VitalyFedyunin Differential Revision: D30296584 Pulled By: dzhulgakov fbshipit-source-id: 6e5c610cc6b7c7fd58b80fa3f9de31f269341a88	2021-08-22 22:31:09 -07:00
Victor Quach	b95ce1591d	Add docs describing saved tensor hooks (#62362 ) Summary: Add section to the Autograd mechanics docs to describe the recently exposed saved tensors (https://github.com/pytorch/pytorch/issues/52451), how to register packing / unpacking hooks (https://github.com/pytorch/pytorch/issues/60975) and how to use default hooks (https://github.com/pytorch/pytorch/issues/61834) Sister PR: https://github.com/pytorch/pytorch/issues/62361 (will add a link from autograd.rst to notes/autograd in whatever PR does not land first) Pull Request resolved: https://github.com/pytorch/pytorch/pull/62362 Reviewed By: soulitzer Differential Revision: D30453177 Pulled By: Varal7 fbshipit-source-id: f5759977b069ff0ef36a47b08856d297691a6caa	2021-08-20 11:10:51 -07:00
Philip Meier	99203580a9	Updates internal `assert_allclose` callsites in favor of `assert_close` (#61841 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61841 Redo of #60863. Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D30408145 Pulled By: mruberry fbshipit-source-id: 0b34ebc7f23ba38ecd89640b61d8aca59b7eab58	2021-08-19 12:50:41 -07:00
Michael Dagitses	feba6806c9	clarify that `torch.finfo.tiny` is the smallest normal number (#63241 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63241 This is a common source of confusion, but it matches the NumPy behavior. Fixes #44010 Fixes #59526 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D30307646 Pulled By: dagitses fbshipit-source-id: d848140ba267560387d83f3e7acba8c3cdc53d82	2021-08-18 13:44:52 -07:00
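The distinction drawn in the `torch.finfo.tiny` commit above, between the smallest normal number and the smallest positive number, can be illustrated in plain Python. This is only a sketch by analogy: Python's built-in `float` is an IEEE 754 double, and `sys.float_info.min` plays the role that `tiny` plays for a 64-bit float. The sketch assumes, without checking it against PyTorch, that the two values coincide.

```python
import sys

# Smallest positive *normal* float64 (2**-1022); the analogue of `tiny`.
tiny = sys.float_info.min

# Smallest positive *subnormal* float64 (2**-1074), obtained by scaling
# `tiny` down by the machine epsilon (2**-52).
subnormal = tiny * sys.float_info.epsilon

print(tiny)
print(subnormal)
print(0.0 < subnormal < tiny)  # positive values below `tiny` do exist
```

Positive values below `tiny` are representable, but only as subnormal numbers with reduced precision, which is why `tiny` is not the smallest positive value.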
soulitzer	2f615f6313	Improve custom function docs (#60312 ) Summary: - Adds some code examples for `ctx` methods and make requirements of arguments more clear - Type annotations for `save_for_backward`, `mark_dirty`, `mark_non_differentiable`, and `set_materialize_grads` (BC-breaking?) - Refactor `torch.autograd.Function` doc Pull Request resolved: https://github.com/pytorch/pytorch/pull/60312 Reviewed By: VitalyFedyunin Differential Revision: D30314961 Pulled By: soulitzer fbshipit-source-id: a284314b65662e26390417bd2b6b12cd85e68dc8	2021-08-18 11:31:31 -07:00
Michael Dagitses	0f2f6a79cb	clarify the documentation of `torch.meshgrid` (#62977 ) Summary: Also warn about the behavior differences from `numpy.meshgrid`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/62977 Reviewed By: mruberry, ngimel Differential Revision: D30220930 Pulled By: dagitses fbshipit-source-id: ae6587b41792721cae2135376c58121b4634e296	2021-08-18 04:01:22 -07:00
kyshel	e75ed4a4b5	add comma to prevent syntax errors (#62492 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/62492 Reviewed By: VitalyFedyunin Differential Revision: D30304684 Pulled By: ezyang fbshipit-source-id: db08ca39bcecbfd79ea50df18536bf4e87f51e15	2021-08-16 12:27:31 -07:00
Supriya Rao	0831b59cf5	[docs][ao] Add missing docstrings for quantized_max_pool1d and quantized_max_pool2d (#63242 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63242 These functions are part of the native functions namespace as well as the quantized namespace Test Plan: CI Imported from OSS Reviewed By: jerryzh168 Differential Revision: D30316430 fbshipit-source-id: cd9c839e5c1a961e3c6944e514c16fbc256a2f0c	2021-08-15 22:47:03 -07:00
Supriya Rao	a090073fe4	[docs][ao] Add missing documentation for torch.quantized_batch_norm (#63240 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63240 Op is exposed via torch.quantized_batch_norm to the end user without any existing documentation Test Plan: CI Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D30316431 fbshipit-source-id: bf2dc8b7b6f497cf73528eaa2bedef9f65029d84	2021-08-15 22:45:56 -07:00
Ilqar Ramazanli	cec08e7032	To add warm-up scheduler to optim (#60836 ) Summary: Warm-up of learning rate scheduling was initially discussed by Priya et al. in the paper: https://arxiv.org/pdf/1706.02677.pdf . In section 2.2 of the paper they discussed and proposed the idea of warming up learning rate schedulers in order to prevent big variance / noise in the learning rate. The idea has been further discussed in the following papers:
* Akilesh Gotmare et al. https://arxiv.org/abs/1810.13243
* Bernstein et al. http://proceedings.mlr.press/v80/bernstein18a/bernstein18a.pdf
* Liyuan Liu et al. https://arxiv.org/pdf/1908.03265.pdf
There are two types of popularly used learning rate warm-up ideas:
* Constant warmup (start with a very small constant learning rate)
* Linear warmup (start with a small learning rate and gradually increase)
In this PR we are adding warm-up as a learning rate scheduler. Note that learning rate schedulers are chainable, which means that we can merge the warmup scheduler with any other learning rate scheduler to make a more sophisticated learning rate scheduler.
## Linear Warmup
Linear warmup multiplies the learning rate by a pre-defined constant, warmup_factor, in the first epoch (epoch 0). It then increases this multiplication constant to one over warmup_iters epochs. Hence the multiplication constant at the i-th step is equal to:
warmup_factor + (1 - warmup_factor) * i / warmup_iters
Moreover, the ratio of this quantity at point i to its value at point i-1 gives us
1 + (1.0 - warmup_factor) / [warmup_iters * warmup_factor + (i - 1) * (1 - warmup_factor)]
which is used in the get_lr() method in our implementation. Below we provide an example of how to use the linear warmup scheduler and show how it works.
```python
import torch
from torch.nn import Parameter
from torch.optim import SGD
from torch.optim.lr_scheduler import WarmUpLR

model = [Parameter(torch.randn(2, 2, requires_grad=True))]
optimizer = SGD(model, 0.1)
scheduler = WarmUpLR(optimizer, warmup_factor=0.1, warmup_iters=10, warmup_method="linear")

for epoch in range(15):
    print(epoch, scheduler.get_last_lr()[0])
    optimizer.step()
    scheduler.step()
```
```
0 0.010000000000000002
1 0.019000000000000003
2 0.028000000000000008
3 0.03700000000000001
4 0.04600000000000001
5 0.055000000000000014
6 0.06400000000000002
7 0.07300000000000002
8 0.08200000000000003
9 0.09100000000000004
10 0.10000000000000005
11 0.10000000000000005
12 0.10000000000000005
13 0.10000000000000005
14 0.10000000000000005
```
## Constant Warmup
Constant warmup is a straightforward idea: multiply the learning rate by warmup_factor until we reach epoch warmup_iters, then do nothing for the following epochs.
```python
import torch
from torch.nn import Parameter
from torch.optim import SGD
from torch.optim.lr_scheduler import WarmUpLR

model = [Parameter(torch.randn(2, 2, requires_grad=True))]
optimizer = SGD(model, 0.1)
scheduler = WarmUpLR(optimizer, warmup_factor=0.1, warmup_iters=5, warmup_method="constant")

for epoch in range(10):
    print(epoch, scheduler.get_last_lr()[0])
    optimizer.step()
    scheduler.step()
```
```
0 0.010000000000000002
1 0.010000000000000002
2 0.010000000000000002
3 0.010000000000000002
4 0.010000000000000002
5 0.10000000000000002
6 0.10000000000000002
7 0.10000000000000002
8 0.10000000000000002
9 0.10000000000000002
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60836 Reviewed By: saketh-are Differential Revision: D29537615 Pulled By: iramazanli fbshipit-source-id: d910946027acc52663b301f9c56ade686e62cb69	2021-08-15 12:31:45 -07:00
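The linear warm-up derivation in the commit above can be checked numerically in plain Python. The sketch below is not the scheduler's implementation: the helper names `linear_factor` and `step_ratio` are hypothetical, and the settings (base learning rate 0.1, `warmup_factor` 0.1, `warmup_iters` 10) are taken from the linear example in the commit message. It compares the closed-form multiplier with the value obtained by repeatedly applying the step-to-step ratio.

```python
def linear_factor(i, warmup_factor, warmup_iters):
    """Closed-form multiplier applied to the base learning rate at step i."""
    if i >= warmup_iters:
        return 1.0
    return warmup_factor + (1.0 - warmup_factor) * i / warmup_iters


def step_ratio(i, warmup_factor, warmup_iters):
    """Ratio factor(i) / factor(i - 1), valid for 1 <= i <= warmup_iters."""
    return 1.0 + (1.0 - warmup_factor) / (
        warmup_iters * warmup_factor + (i - 1) * (1.0 - warmup_factor)
    )


base_lr, warmup_factor, warmup_iters = 0.1, 0.1, 10

# Recursive form: start at base_lr * warmup_factor and multiply by the ratio
# at every step until the warm-up is over.
lr = base_lr * warmup_factor
recursive = [lr]
for i in range(1, 15):
    if i <= warmup_iters:
        lr *= step_ratio(i, warmup_factor, warmup_iters)
    recursive.append(lr)

closed_form = [base_lr * linear_factor(i, warmup_factor, warmup_iters) for i in range(15)]
for i, (a, b) in enumerate(zip(recursive, closed_form)):
    print(i, round(a, 6), round(b, 6))
```

Both columns agree up to floating-point rounding. The small rounding drift of the recursive form is consistent with trailing digits such as `0.10000000000000005` in the output quoted in the commit message.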
anjali411	045c4cb82f	Add copy button to code snippets in docs (#63149 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63149 Test Plan: Imported from OSS Reviewed By: navahgar, albanD Differential Revision: D30308891 Pulled By: anjali411 fbshipit-source-id: ad51180ab2f27c4525682b2603bbf753bb8f1ce9	2021-08-15 06:25:32 -07:00
Meghan Lele	7107c367b5	[docs] Mention `vsplit`, `hsplit` and `tensor_split` in Tensor views doc (#63191 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63191 Summary This commit adds `vsplit`, `hsplit` and `tensor_split` to the list of view ops on the Tensor Views documentation page. Test Plan Continuous integration. Before <img width="195" alt="Captura de Pantalla 2021-08-12 a la(s) 2 55 07 p m" src="https://user-images.githubusercontent.com/4392003/129275921-c1cfdf6c-9f1f-45f3-98b6-1de7a0f0cc84.png"> After <img width="197" alt="Captura de Pantalla 2021-08-12 a la(s) 2 55 15 p m" src="https://user-images.githubusercontent.com/4392003/129275936-de4afde7-0143-4e1d-b38f-c86256f4896c.png"> Fixes This commit fixes #62727. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D30293181 Pulled By: SplitInfinity fbshipit-source-id: 283783a4ccc3ebc50cb0a427e55c7a6cb618ffd7	2021-08-13 11:44:38 -07:00
Shen Li	1022443168	Revert D30279364: [codemod][lint][fbcode/c*] Enable BLACK by default Test Plan: revert-hammer Differential Revision: D30279364 (`b004307252`) Original commit changeset: c1ed77dfe43a fbshipit-source-id: eab50857675c51e0088391af06ec0ecb14e2347e	2021-08-12 11:45:01 -07:00
Zsolt Dollenstein	b004307252	[codemod][lint][fbcode/c*] Enable BLACK by default Test Plan: manual inspection & sandcastle Reviewed By: zertosh Differential Revision: D30279364 fbshipit-source-id: c1ed77dfe43a3bde358f92737cd5535ae5d13c9a	2021-08-12 10:58:35 -07:00
Rishi Puri	324673a537	rebase for autocast updates to include device_type and dtype flags (#61002 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/55374 Pull Request resolved: https://github.com/pytorch/pytorch/pull/61002 Reviewed By: malfet, mruberry Differential Revision: D30016812 Pulled By: ngimel fbshipit-source-id: 6e09a29f539d28e9aea5cd9489b1e633cc588033	2021-08-10 20:03:12 -07:00
Victor Quach	557047eb4c	Add docstring for saved tensors default hooks (#62361 ) Summary: Add documentation for the saved tensors default hooks introduced in https://github.com/pytorch/pytorch/issues/61834 / https://github.com/pytorch/pytorch/issues/62563 Sister PR: https://github.com/pytorch/pytorch/issues/62362 (will add a link from autograd.rst to notes/autograd in whatever PR does not land first) Pull Request resolved: https://github.com/pytorch/pytorch/pull/62361 Reviewed By: zou3519 Differential Revision: D30081997 Pulled By: Varal7 fbshipit-source-id: cb923e943e1d96db9669c1d863d693af30910c62	2021-08-10 14:59:38 -07:00
Yi Wang	7a3f1386ae	Add GradBucket::parameters() to ddp_comm_hooks.rst (#62877 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62877 as title ghstack-source-id: 135214612 Test Plan: N/A Reviewed By: rohan-varma Differential Revision: D30153490 fbshipit-source-id: d4cec434a53ef6e65b60c065804884d1a114aa0d	2021-08-06 14:50:47 -07:00
Andrew Gu	8aa12cbf86	Add tutorial link (#62785 ) Summary: Addresses: https://github.com/pytorch/pytorch/pull/62605#discussion_r681380364 Pull Request resolved: https://github.com/pytorch/pytorch/pull/62785 Test Plan: I checked the render, and the link redirects as desired. Reviewed By: mrshenli Differential Revision: D30133229 Pulled By: andwgu fbshipit-source-id: baefe0d1f1b78ece44bb42e67629bc130dbf8e9a	2021-08-05 17:28:02 -07:00
cpatru	6d896cb545	Update faq.rst so OOM section mentions checkpoint (#62709 ) Summary: The FAQ has a section on CUDA OOMs that consists mostly of don'ts, which limits the modeling options it suggests. Deep nets can blow up memory because outputs are cached during training. This is a known problem with a known solution: trading compute for memory via checkpointing. The FAQ should mention it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/62709 Reviewed By: nairbv Differential Revision: D30103326 Pulled By: ezyang fbshipit-source-id: 3a8b465a7fbe19aae88f83cc50fe82ebafcb56c9	2021-08-05 07:40:08 -07:00
Sean Lawlor	34c9f5a8da	[DDP Communication Hook] Update get_tensor and set_tensor to be cleaner naming conventions (buffer() and set_buffer()) (#62662 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62662 Replaced the methods set_tensor(.) and get_tensor() in the python exposed API from the C++ logic with buffer() and set_buffer(.) to be a cleaner interface. Reviewed By: SciPioneer Differential Revision: D30012869 fbshipit-source-id: bd8efab583dd89c96f9aeb3dd48a12073f0b1482	2021-08-04 09:27:31 -07:00
Victor Quach	5830f122f1	Add docstrings for save_on_cpu hooks (#62410 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62410 This PR adds docstrings for CPU hooks introduced in #61928. Also uncomments the warning about pinned memory in CUDA semantics docs. Depends on: #62361. For now docstrings are an orphan page at https://docs-preview.pytorch.org/62410/generated/torch.autograd.graph.set_save_on_cpu_hooks.html#torch-autograd-graph-set-save-on-cpu-hooks Test Plan: Imported from OSS Reviewed By: soulitzer Differential Revision: D29990129 Pulled By: Varal7 fbshipit-source-id: 7a98eeee6a0abb11e2c2d9169cd1aa35ad7ba3f4	2021-08-03 17:53:45 -07:00
Heitor Schueroff	d7d399f3df	Exposes _aminmax as aminmax and makes it structured (#62401 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62401 This PR exposes the `torch._aminmax` operator as `torch.aminmax`. TODO - [x] add examples to documentation - [x] add minmax to rst docs fixes https://github.com/pytorch/pytorch/issues/62164 Test Plan: Imported from OSS Reviewed By: soulitzer Differential Revision: D30072246 Pulled By: heitorschueroff fbshipit-source-id: 557d30af7c28ca6c238c59122367104036429ecd	2021-08-03 16:10:43 -07:00
Andrew Gu	62a90c227f	Make _Join, _Joinable, _JoinHook public (#62605 ) Summary: Overview: This removes the preceding `_` from `_Join`, `_Joinable`, and `_JoinHook` in preparation for adding the generic join context manager tutorial (see [here](https://github.com/pytorch/tutorials/pull/1610)). This also adds a docs page, which can be linked from the tutorial. [Here](https://github.com/pytorch/pytorch/files/6919475/render.pdf) is a render of the docs page. Pull Request resolved: https://github.com/pytorch/pytorch/pull/62605 Test Plan: `DistributedDataParallel.join()`: ``` touch /tmp/barrier && TEMP_DIR="/tmp" BACKEND="nccl" WORLD_SIZE="2" gpurun python test/distributed/test_distributed_fork.py -- TestDistBackendWithFork.test_ddp_uneven_inputs TestDistBackendWithFork.test_ddp_uneven_inputs_stop_iteration_sync_bn TestDistBackendWithFork.test_ddp_grad_div_uneven_inputs TestDistBackendWithFork.test_ddp_uneven_input_join_disable TestDistBackendWithFork.test_ddp_uneven_input_exception ``` `ZeroRedundancyOptimizer`: ``` gpurun4 python test/distributed/optim/test_zero_redundancy_optimizer.py ``` NOTE: DDP overlap tests are failing due to a landing race. See https://github.com/pytorch/pytorch/pull/62592. Once the fix is landed, I will rebase, and tests should be passing. `Join`: ``` gpurun4 python test/distributed/algorithms/test_join.py ``` Reviewed By: mrshenli Differential Revision: D30055544 Pulled By: andwgu fbshipit-source-id: a5ce1f1d9f1904de3bdd4edd0b31b0a612d87026	2021-08-03 12:20:11 -07:00
Kevin Tse	87465a6e68	adding operator cumulative_trapezoid (#61615 ) Summary: Stack from [ghstack](https://github.com/ezyang/ghstack): * https://github.com/pytorch/pytorch/issues/61616 * https://github.com/pytorch/pytorch/issues/61615 * https://github.com/pytorch/pytorch/issues/61475 Pull Request resolved: https://github.com/pytorch/pytorch/pull/61615 Reviewed By: malfet, mruberry Differential Revision: D29975064 Pulled By: NivekT fbshipit-source-id: 4d4e98f3efb720fdc44eb238ecbf0fa157ac13d7	2021-08-03 08:04:00 -07:00
Yi Wang	db071ef005	[Reland][DDP Communication Hook] Rename 4 Methods of GradBucket Class (#62592 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62592 Reland #62510 `GradBucket` is an important class defined in both C++ and Python, used for PyTorch Distributed Training. We need to rename the following methods for simplicity: 1) get_index -> index 2) is_the_last_bucket_to_allreduce -> is_last, 3) get_per_parameter_tensors -> gradients, 4) get_model_params_for_bucket -> parameters. ghstack-source-id: 134848352 Test Plan: unit test Reviewed By: andwgu Differential Revision: D30049431 fbshipit-source-id: 1bcac331aa30e529b7230e3891bc811c531b0ea9	2021-08-02 16:38:09 -07:00
Howard Huang	dc1bd6acee	Remove PROCESS GROUP rpc backend (#62411 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62411 Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D29990408 Pulled By: H-Huang fbshipit-source-id: 183d3b316767b12993cebbe32b73c2850fd1cc42	2021-08-02 12:26:22 -07:00
Eli Uriegas	6f95850127	Revert D30024161: [DDP Communication Hook] Rename 4 Methods of GradBucket Class Test Plan: revert-hammer Differential Revision: D30024161 (`29c8b1db57`) Original commit changeset: 07e6072a2f7b fbshipit-source-id: d571c2caadaf7b71fe2aba3c0597bd8074d153de	2021-08-02 10:26:54 -07:00
Qing Hu	29c8b1db57	[DDP Communication Hook] Rename 4 Methods of GradBucket Class (#62510 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62510 `GradBucket` is an important class defined in both C++ and Python, used for PyTorch Distributed Training. We need to rename the following methods for simplicity: 1) get_index -> index 2) is_the_last_bucket_to_allreduce -> is_last, 3) get_per_parameter_tensors -> gradients, 4) get_model_params_for_bucket -> parameters. Test Plan: Local run comprehensive test with following results: https://pxl.cl/1Ml8b For two timeout failure test cases, most likely environment related and fail in my devserver. Reviewed By: SciPioneer Differential Revision: D30024161 fbshipit-source-id: 07e6072a2f7b81f731425d9b71f8c8b60d383b0f	2021-08-02 09:33:32 -07:00
Ce Gao	73ba166e2a	fix(elastic-docs): Fix elastic launch doc (#62378 ) Summary: The documentation link should be https://pytorch.org/docs/stable/elastic/run.html Pull Request resolved: https://github.com/pytorch/pytorch/pull/62378 Reviewed By: aivanou Differential Revision: D30002830 Pulled By: kiukchung fbshipit-source-id: 34b434acaa10222561df43f6397a2420eef02015	2021-07-30 10:58:13 -07:00
Natalia Gimelshein	d783617216	enable warnings on cuda synchronization (#62092 ) Summary: This creates a `torch.cuda.set_warn_on_synchronization()` function that warns or errors when a synchronizing operation is performed. We could wrap it in a context manager for ease of use, but it would be a lie, because it sets global, not thread-local, state. Since it's intended for debugging, maybe that's ok though. Like all `torch.cuda.*` functions, it goes through CPython, not pybind, so the argument is converted to long before being passed to the c10 function. I'll make the python argument a python enum class, but without pybind it'll still have to go through long conversion. For a test script
```
import torch
torch.cuda.set_warn_on_synchronization(1)
x=torch.randn(10, device="cuda")
x.nonzero()
y=torch.randn((), device="cuda")

if y:
    print("something")
torch.multinomial(x.abs(), 10, replacement=False)
torch.randperm(20000, device="cuda")
ind = torch.randint(10, (3,), device="cuda")
mask = torch.randint(2, (10,), device="cuda", dtype=torch.bool)
val = torch.randn((), device="cuda")
x[mask]=1.
x[mask] = val
torch.cuda.synchronize()
```
the output is
```
/../playground/sync_warn_test.py:4: UserWarning: called a synchronizing operation (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:145.)
  x.nonzero()
/../playground/sync_warn_test.py:7: UserWarning: called a synchronizing operation (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:145.)
  if y:
something
/../playground/sync_warn_test.py:9: UserWarning: called a synchronizing operation (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:145.)
  torch.multinomial(x.abs(), 10, replacement=False)
/../playground/sync_warn_test.py:15: UserWarning: called a synchronizing operation (Triggered internally at ../c10/cuda/CUDAFunctions.cpp:145.)
  x[mask] = val
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62092 Reviewed By: mruberry Differential Revision: D29968792 Pulled By: ngimel fbshipit-source-id: cc6f817212c164727ed99ecf6ab050dc29631b9e	2021-07-30 09:13:01 -07:00
Gary Miguel	9fdf7ec6a2	[docs] Update sphinx to 3.5.4 (#61601 ) Summary: Sphinx 4.x is out, but it seems that requires many more changes to adopt. So instead use the latest version of 3.x, which includes several nice features. * Add some noindex directives to deal with warnings that would otherwise be triggered by this change due to conflicts between the docstrings declaring a function and the autodoc extension declaring the same function. * Update distributions.utils.lazy_property to make it look like a regular property when sphinx autodoc inspects classes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/61601 Reviewed By: ejguan Differential Revision: D29801876 Pulled By: albanD fbshipit-source-id: 544d2434a15ceb77bff236e934dbd8e4dbd9d160	2021-07-30 06:23:10 -07:00
huqinghao	7fc96db45d	fix typo errors in quantization-support.rst Line320 (#44447 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/44379 change "`torch.per_channel_symmetric` — per tensor, symmetric" to "`torch.per_channel_symmetric` — per channel, symmetric" Pull Request resolved: https://github.com/pytorch/pytorch/pull/44447 Reviewed By: mruberry Differential Revision: D29909645 Pulled By: ezyang fbshipit-source-id: e1505d070ec2b335dd6503b528e6a2f3bda2f1e3	2021-07-27 10:42:29 -07:00
mattip	a13f714b6d	DOC: remove git stamp from release documentation version (#58486 ) Summary: CI built the documentation for the recent 1.9.0rc1 tag, but left the git version in the `version`, so (as of now) going to https://pytorch.org/docs/1.9.0/index.html and looking at the version in the upper-left corner shows "1.9.0a0+git5f0bbb3" not "1.9.0". This PR should change that to cut off everything after and including the "a". It should be cherry-picked to the release/1.9 branch so that the next rc will override the current documentation with a "cleaner" version. brianjo Pull Request resolved: https://github.com/pytorch/pytorch/pull/58486 Reviewed By: zou3519 Differential Revision: D28640476 Pulled By: malfet fbshipit-source-id: 9fd1063f4a2bc90fa8c1d12666e8c0de3d324b5c	2021-07-26 16:28:59 -07:00
Yukio Siraichi	5224490ae9	Implement NumPy-like `frombuffer` tensor constructor. (#59077 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59077 Fixes #58549 `frombuffer` constructs a tensor object from an already allocated buffer through CPython's buffer protocol. Besides the standard `dtype`, `count`, and `offset` parameters, this function also accepts: - `device`: where the buffer lives - `requires_grad`: whether autograd should record operations on the new tensor A new test file _test_buffer_protocol.py_ was created. Currently, only CPU tests are implemented, because neither PyTorch nor Numba implements CPython's buffer protocol, so there is no way to create a CUDA buffer with the existing dependencies (PyCUDA could be used for that, though). At the moment, if `device` differs from the device the buffer actually lives on, one of two things may happen: - `RuntimeError`, if `device='cuda'` - Segmentation fault (not tested -- see above), if `device='cpu'` Test Plan: Imported from OSS Reviewed By: jbschlosser Differential Revision: D29870914 Pulled By: mruberry fbshipit-source-id: 9fa8611aeffedfe39c9af74558178157a11326bb	2021-07-23 13:17:48 -07:00
kshitij12345	943ca5f6f7	[special] alias for mvlgamma (#61633 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Have added `out` variant for consistency. TODO: * [x] Check docs https://docs-preview.pytorch.org/61633/special.html#torch.special.multigammaln Pull Request resolved: https://github.com/pytorch/pytorch/pull/61633 Reviewed By: albanD Differential Revision: D29815514 Pulled By: mruberry fbshipit-source-id: 003c7b6a5938ecc7a96727310e8a39da0b3d7aca	2021-07-23 11:24:27 -07:00
Calvin McCarter	bdf439a958	Adds _LazyInstanceNorm and LazyInstanceNormXd (#60982 ) Summary: Signed-off-by: Calvin McCarter <calvin@lightmatter.co> Fixes https://github.com/pytorch/pytorch/issues/60981 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60982 Reviewed By: albanD Differential Revision: D29810547 Pulled By: jbschlosser fbshipit-source-id: d933d4c7fe5cf7be9b09a5ab93f740b94cf08cc1	2021-07-21 06:45:45 -07:00
Nikita Shulga	604f503d30	Revert D29794958 + compilation fix (#61937 ) Summary: This PR un-reverts https://github.com/pytorch/pytorch/issues/61475 and fixes compilation with MSVC, which does not recognize alternative operator spellings (i.e. using `or` instead of `||`) Pull Request resolved: https://github.com/pytorch/pytorch/pull/61937 Reviewed By: albanD Differential Revision: D29805941 Pulled By: malfet fbshipit-source-id: 01e5963c6717c1b44b260300d87ba0bf57f26ce9	2021-07-20 18:14:45 -07:00
Nikita Shulga	22fff61f06	Revert D29794958: [pytorch][PR] changing trapz to trapezoid Test Plan: revert-hammer Differential Revision: D29794958 (`95cec8f4fa`) Original commit changeset: 60b9c07efd47 fbshipit-source-id: 2dcda2d62e01c2521a86ae5ed8246cfb686d3f64	2021-07-20 16:00:46 -07:00
Kevin Tse	95cec8f4fa	changing trapz to trapezoid (#61475 ) Summary: This PR resolves issue https://github.com/pytorch/pytorch/issues/52606 while also adding support for complex number Stack from [ghstack](https://github.com/ezyang/ghstack): * https://github.com/pytorch/pytorch/issues/61616 * https://github.com/pytorch/pytorch/issues/61615 * https://github.com/pytorch/pytorch/issues/61475 Pull Request resolved: https://github.com/pytorch/pytorch/pull/61475 Reviewed By: mruberry Differential Revision: D29794958 Pulled By: NivekT fbshipit-source-id: 60b9c07efd47fd85b9c8178768fc7828d7b57d29	2021-07-20 15:25:55 -07:00
ndkshr	0a6d88244b	Fix grammatical errors on the PyTorch Contribution Guide (#61818 ) Summary: ## What does the PR do? - Fix grammatical errors on the PyTorch Contribution Guide page. ## Changes [Screenshots] > Note: > 1. The changes are highlighted in each screenshot. > 2. Could not load CSS while testing locally, hope that is not an issue as all the changes are made on the content. 1. ![Change1](https://user-images.githubusercontent.com/20442648/126077764-39fd8b78-524f-407d-bc39-c93167bd10a7.PNG) 2. ![Change2](https://user-images.githubusercontent.com/20442648/126077766-9dd7dc61-ef06-41d0-a7e5-cfd179ece0cd.PNG) 3. ![Change3](https://user-images.githubusercontent.com/20442648/126077767-2c2e05e4-09fc-403a-a18e-9b108651a5f8.PNG) 4. ![Change4](https://user-images.githubusercontent.com/20442648/126077769-ad755db6-3afa-457b-b95c-9f6c6281f828.PNG) 5. ![Change5](https://user-images.githubusercontent.com/20442648/126077770-a7759dee-7f90-4b9e-a07c-4dec4ca934d0.PNG) 6. ![Change6](https://user-images.githubusercontent.com/20442648/126077772-0474e58d-c0c8-4156-b56f-808d225c38e7.PNG) 7. ![Change7](https://user-images.githubusercontent.com/20442648/126077774-d48382a7-5379-49a4-a8d2-b478fabf0bf0.PNG) 8. ![Change8](https://user-images.githubusercontent.com/20442648/126077777-fd743825-8dd7-4cb9-a22c-233e5fa085a6.PNG) Pull Request resolved: https://github.com/pytorch/pytorch/pull/61818 Reviewed By: dzhulgakov Differential Revision: D29775606 Pulled By: mrshenli fbshipit-source-id: 3f3bfdeede341f784b72dfe55da9ba8bdce1192a	2021-07-19 15:06:22 -07:00
Kushashwa Ravi Shrimali	7e1f01d4c0	Alias for `polygamma` (#59691 ) Summary: See https://github.com/pytorch/pytorch/issues/50345 cc: mruberry kshitij12345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59691 Reviewed By: gchanan Differential Revision: D29707514 Pulled By: mruberry fbshipit-source-id: 40c15e1fda3d9f7013977b0f36a77b228dda6aa5	2021-07-16 00:06:27 -07:00
kshitij12345	968a01a94a	[special] migrate xlogy (#60641 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60641 Reviewed By: gchanan Differential Revision: D29709306 Pulled By: mruberry fbshipit-source-id: e8a5f64009a895a25618637de40b55cf36b8f794	2021-07-15 15:32:09 -07:00
Sam Estep	3a0801f960	[skip ci] Fix "arugment" typos (#61459 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/61455. Pull Request resolved: https://github.com/pytorch/pytorch/pull/61459 Reviewed By: soulitzer Differential Revision: D29636559 Pulled By: samestep fbshipit-source-id: 9ad65265c0491d9e81bb303abe3a07c6843bfa4a	2021-07-15 15:20:18 -07:00
Eli Uriegas	e5fcc903d6	torch: Make __version__ better with comparisons (#61556 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61556 Prior to 1.10.0, `torch.__version__` was stored as a str, so many users compared against `torch.__version__` as if it were a str. In order not to break them, we have TorchVersion, which masquerades as a str while also being able to compare against both packaging.version.Version and tuples of values, e.g. (1, 2, 1).

Examples:

Comparing a TorchVersion object to a Version object
```
TorchVersion('1.10.0a') > Version('1.10.0a')
```
Comparing a TorchVersion object to a Tuple object
```
TorchVersion('1.10.0a') > (1, 2)    # 1.2
TorchVersion('1.10.0a') > (1, 2, 1) # 1.2.1
```
Comparing a TorchVersion object against a string
```
TorchVersion('1.10.0a') > '1.2'
TorchVersion('1.10.0a') > '1.2.1'
```
Resolves https://github.com/pytorch/pytorch/issues/61540 Signed-off-by: Eli Uriegas <eliuriegas@fb.com> Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D29671234 Pulled By: seemethere fbshipit-source-id: 6044805918723b4aca60bbec4b5aafc1189eaad7	2021-07-15 15:12:09 -07:00
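The idea behind #61556, a version object that still behaves as a str but orders itself by version numbers, can be illustrated with a small stand-alone sketch. `SketchVersion` below is a hypothetical class written for this illustration; it is not PyTorch's `TorchVersion`, it does not use `packaging.version`, and as a simplifying assumption it ignores pre-release suffixes such as `a` when ordering.

```python
import re


class SketchVersion(str):
    """Hypothetical str subclass that orders by numeric release parts."""

    @staticmethod
    def _release(value):
        # Accept tuples such as (1, 2, 1) or strings such as "1.10.0a".
        if isinstance(value, tuple):
            return tuple(int(part) for part in value)
        match = re.match(r"\d+(?:\.\d+)*", str(value))
        if match is None:
            raise ValueError(f"not a version: {value!r}")
        return tuple(int(part) for part in match.group(0).split("."))

    def _pair(self, other):
        # Pad the shorter release with zeros so (1, 2) compares like (1, 2, 0).
        mine, theirs = self._release(self), self._release(other)
        width = max(len(mine), len(theirs))
        pad = lambda parts: parts + (0,) * (width - len(parts))
        return pad(mine), pad(theirs)

    def __gt__(self, other):
        mine, theirs = self._pair(other)
        return mine > theirs

    def __lt__(self, other):
        mine, theirs = self._pair(other)
        return mine < theirs

    def __ge__(self, other):
        return not self.__lt__(other)

    def __le__(self, other):
        return not self.__gt__(other)


if __name__ == "__main__":
    version = SketchVersion("1.10.0a")
    print(version > (1, 2))        # numeric comparison against a tuple
    print(version > "1.2.1")       # numeric comparison against a string
    print("1.10.0a" > "1.2.1")     # plain str comparison, character by character
    print(version.startswith("1")) # str methods keep working
```

Equality and hashing are left as inherited from `str` in this sketch, so existing code that treats the value as a plain string keeps its behavior; only the ordering operators change.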
Jinay Dagli	a9c3580080	Grammatical update of tech docs (#61547 ) Summary: Added some minor grammatical updates to the 'Complex Numbers' docs. ![Screenshot (180)](https://user-images.githubusercontent.com/75036632/125342884-0b952500-e373-11eb-9e63-410ff31e6c21.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/61547 Reviewed By: zou3519 Differential Revision: D29677361 Pulled By: H-Huang fbshipit-source-id: 78222310a755911192905a8f52aa0ae325900006	2021-07-14 14:01:59 -07:00
James Reed	ac64a41e8a	[FX][docs] Add note about python set pitfall (#61597 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61597 Test Plan: Imported from OSS Reviewed By: Chillee Differential Revision: D29685735 Pulled By: jamesr66a fbshipit-source-id: b5c5b53ff94fac1022f69b7c0ad4e4055b116029	2021-07-13 20:09:13 -07:00
Anjali Chourdia	30e48bbeae	Add neg bit (#56058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56058 User facing changes: 1. Adds a negative bit and corresponding new API (`is_neg()`,`resolve_neg()`) 2. `tensor.conj().imag` now returns a floating point tensor with neg bit set to 1 instead of a tensor with no notion of negative bit. Note that imag is still a view and all the view properties still hold for imag. Non user facing changes: 1. Added a new Negative dispatch key and a backend fallback to handle it 2. Updated copy kernel to handle negative bit 3. Merged conjugate and negative bit fallback kernel 4. fixed https://github.com/pytorch/pytorch/issues/60478 (caused due to https://github.com/pytorch/pytorch/pull/54987) Testing: 1. Added a new OpInfo based test `test_neg_view` (verifies that out-of-place and in-place operations work correctly for all operations when the input is a neg view tensor by checking the result against an actually negated tensor, verifies that autograd returns the same output for both neg view and actually negated tensors as well as it works fine when grad_out is a neg view). 2. Added a new test class containing `test_conj_view`, `test_neg_view`. Test Plan: Imported from OSS Reviewed By: soulitzer Differential Revision: D29636403 fbshipit-source-id: 12214c9dc4806c51850f4a72a109db9527c0ca63	2021-07-13 13:50:42 -07:00
Michael Dagitses	58df01c3b8	clarify default value of requires_grad for tensors (#61038 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61038 Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D29491984 Pulled By: dagitses fbshipit-source-id: 7e6b7f8e81d77f38c881b86a68c17d3cf5483dad	2021-07-12 12:57:37 -07:00
kshitij12345	3faf6a715d	[special] migrate log_softmax (#60512 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Rendered Docs: https://14335157-65600975-gh.circle-artifacts.com/0/docs/special.html#torch.special.log_softmax Pull Request resolved: https://github.com/pytorch/pytorch/pull/60512 Reviewed By: iramazanli Differential Revision: D29626262 Pulled By: mruberry fbshipit-source-id: c42d4105531ffb004f11f1ba6ae50be19bc02c91	2021-07-12 11:01:25 -07:00
Jithun Nair	336970c03e	Add note on torch.distributed backends on ROCm (#58975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58975 Reviewed By: soulitzer Differential Revision: D29595510 Pulled By: rohan-varma fbshipit-source-id: 384bb67fcd003d65b76e957a474406b2a38099b9	2021-07-10 03:51:19 -07:00
Lily Johnson	5fbc853c5f	[package] PackageExporter remove verbose mode (#61145 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61145 Remove 'verbose' mode from PackageExporter as people have complained that it is not useful. Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D29559681 Pulled By: Lilyjjo fbshipit-source-id: eadb1a3a25fadc64119334a09bf1fa4b355b1edd	2021-07-08 18:26:43 -07:00
BowenBao	8726f08e15	[ONNX] Update documentation (#58712 ) (#60249 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60249
* Add introductory paragraph explaining what ONNX is and what the torch.onnx module does.
* In "Tracing vs Scripting" and doc-string for torch.onnx.export(), clarify that exporting always happens on ScriptModules and that tracing and scripting are the two ways to produce a ScriptModule.
* Remove examples of using Caffe2 to run exported models. Caffe2's website says it's deprecated, so it's probably best not to encourage people to use it by including it in examples.
* Remove a lot of content that's redundant:
  * The example of how to mix tracing and scripting, and instead link to Introduction to TorchScript, which includes very similar content.
  * "Type annotations" section. Link to TorchScript docs which explain that in more detail.
  * "Using dictionaries to handle Named Arguments as model inputs" section. It's redundant with the description of the `args` argument to `export()`, which appears on the same page once the HTML is generated.
  * Remove the list of supported Tensor indexing patterns. If it's not in the list of unsupported patterns, users can assume it's supported, so having both is redundant.
  * Remove the list of supported operators and models. I think the list of supported operators is not very useful. A list of supported model architectures may be useful, but in reality it's already very out of date. We should add it back if / when we have a system for keeping it up to date.
  * "Operator Export Type" section. It's redundant with the description of the `operator_export_type` arg to `export()`, which appears on the same page once the HTML is generated.
  * "Use external data format" section. It's redundant with the description of the `use_external_data_format` arg to `export()`.
  * "Training" section. It's redundant with the description of the `training` arg to `export()`.
* Move the content about different operator implementations producing different results from the "Limitations" section into the doc for the `operator_export_type` arg.
* Document "quantized" -> "caffe2" behavior of OperatorExportTypes.ONNX_ATEN_FALLBACK.
* Combine the text about using torch.Tensor.item() and the text about using NumPy types into a section titled "Avoid NumPy and built-in Python types", since they're both fundamentally about the same issue.
* Rename "Write PyTorch model in Torch way" to "Avoiding Pitfalls".
* Lots of minor fixes: spelling, grammar, brevity, fixing links, adding links.
* Clarify limitation on input and output types. Phrasing it in terms of PyTorch types is much more accessible than in terms of TorchScript types. Also clarify what actually happens when dict and str are used as inputs and outputs.
* In Supported operators, use torch function and class names and link to them. This is more user friendly than using the internal aten op names.
* Remove references to VariableType.h, which doesn't appear to contain the information that it once did. Instead refer to the generated .pyi files.
* Remove the text in the FAQ about appending to lists within loops. I think this limitation is no longer present (perhaps since https://github.com/pytorch/pytorch/pull/51577).
* Minor fixes to some code I read along the way.
* Explain the current rationale for the weird ::prim_PythonOp op name.

Test Plan: Imported from OSS Reviewed By: zou3519, ZolotukhinM Differential Revision: D29494912 Pulled By: SplitInfinity fbshipit-source-id: 7756c010b2320de0692369289604403d28877719 Co-authored-by: Gary Miguel <garymiguel@microsoft.com>	2021-07-08 16:29:32 -07:00
Aliaksandr Ivanou	13658b10bb	[torch] Various improvements to `torch.distributed.launch` and `torch.distributed.run` (#61294 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61294 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60925 * Make `torch.distributed.launch` restarts to 0 * Remove unnecessary `-use_env` warning, move `-use_env` warnings * Move `-use_env` warnings to `torch.distributed.launch` * Make default log level WARNING * Add new doc section around transitioning to `torch.distributed.run` * Make `torch.distributed.launch` not use error-propagation * Set default events handler to `null` that does not print events to console * Add reference from `torch.distributed.launch` to `torch.distributed.run` * Set correct preexec function that sends SIGTERM to child processes when parent dies Issues resolved: https://github.com/pytorch/pytorch/issues/60716 https://github.com/pytorch/pytorch/issues/60754 Test Plan: sandcastle python -m torch.distributed.launch --nproc_per_node 2 main.py -> uses 0 restarts python -m torch.distributed.run --nproc_per_node 2 main.py -> uses default for torchelastic, 0 restarts python -m torch.distributed.launch --nproc_per_node=4 --use_env --no_python main.py -> produces error python -m torch.distributed.launch --nproc_per_node=4 --use_env main.py -> no warning python -m torch.distributed.launch --nproc_per_node=4 --no_python main.py ->warning Output of running torch.distributed.launch without --use_env: $path/torch/distributed/launch.py:173: FutureWarning: The module torch.distributed.launch is deprecated and will be removed in future. Use torch.distributed.run. Note that --use_env is set by default in torch.distributed.run. If your script expects `--local_rank` argument to be set, please change it to read from `os.environ('LOCAL_RANK')` instead. 
New section: {F628923078} {F628974089} Reviewed By: cbalioglu Differential Revision: D29559553 fbshipit-source-id: 03ed9ba638bf154354e1530ffc964688431edf6b	2021-07-08 16:28:06 -07:00
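The deprecation warning quoted in #61294 asks scripts to read the local rank from the environment instead of a `--local_rank` argument. As written in the warning, `os.environ('LOCAL_RANK')` would not run, since `os.environ` is a mapping and is indexed with square brackets. The following is a minimal sketch of one way a script might support both launchers during a transition; the helper name `resolve_local_rank`, the precedence order, and the fallback to rank 0 are assumptions made for illustration, not behavior defined by the commit.

```python
import argparse
import os


def resolve_local_rank(argv=None, environ=None):
    """Hypothetical helper: prefer LOCAL_RANK from the environment, else --local_rank."""
    environ = os.environ if environ is None else environ

    # Keep accepting --local_rank so older launch commands continue to work.
    parser = argparse.ArgumentParser()
    parser.add_argument("--local_rank", type=int, default=None)
    args, _unknown = parser.parse_known_args(argv)

    if "LOCAL_RANK" in environ:
        # os.environ is a mapping: use [] (or .get), not a call.
        return int(environ["LOCAL_RANK"])
    if args.local_rank is not None:
        return args.local_rank
    return 0  # assumed default for a single-process run


if __name__ == "__main__":
    print("local rank:", resolve_local_rank())
```

Passing `argv` and `environ` explicitly keeps the helper easy to exercise in isolation; called with no arguments it reads the process's own command line and environment.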
Howard Huang	cdc027679b	Add compare_set in distributed docs (#61351 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61351 Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D29588206 Pulled By: H-Huang fbshipit-source-id: 9db48e7b6de29503275f10616470ad2d66b075f9	2021-07-08 12:30:32 -07:00
Kushashwa Ravi Shrimali	423523d8bb	Alias for logsumexp to special namespace (#58838 ) Summary: See https://github.com/pytorch/pytorch/issues/50345 cc: kshitij12345 Lezcano mruberry Pull Request resolved: https://github.com/pytorch/pytorch/pull/58838 Reviewed By: malfet Differential Revision: D29565033 Pulled By: mruberry fbshipit-source-id: 9b715ea00c78f47b6f183357ee3c7d4c3abe4d01	2021-07-07 13:32:15 -07:00
Philip Meier	1262b2c4c6	fix `torch.futures` docstring examples (#61029 ) Summary: Trying to run the doctests for the complete documentation hangs if it reaches the examples of `torch.futures`. It turns out to be only syntax errors, which are normally just reported. My guess is that `doctest` probably doesn't work well for failures within async stuff. Anyway, while debugging this, I fixed the syntax. Pull Request resolved: https://github.com/pytorch/pytorch/pull/61029 Reviewed By: mruberry Differential Revision: D29571923 Pulled By: mrshenli fbshipit-source-id: bb8112be5302c6ec43151590b438b195a8f30a06	2021-07-07 11:47:55 -07:00
Vitaly Fedyunin	ccfdb30644	Revert D29413019: [torch] Various improvements to `torch.distributed.launch` and `torch.distributed.run` Test Plan: revert-hammer Differential Revision: D29413019 (`4e181dfc35`) Original commit changeset: 323bfbad9d0e fbshipit-source-id: 1f8ae4b3d0a23f3eaff28c37e9148efff25fafe2	2021-07-01 08:44:51 -07:00
Aliaksandr Ivanou	4e181dfc35	[torch] Various improvements to `torch.distributed.launch` and `torch.distributed.run` (#60925 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60925 * Make `torch.distributed.launch` restarts to 0 * Remove unnecessary `-use_env` warning, move `-use_env` warnings * Move `-use_env` warnings to `torch.distributed.launch` * Make default log level WARNING * Add new doc section around transitioning to `torch.distributed.run` * Make `torch.distributed.launch` not use error-propagation * Set default events handler to `null` that does not print events to console * Add reference from `torch.distributed.launch` to `torch.distributed.run` * Set correct preexec function that sends SIGTERM to child processes when parent dies Issues resolved: https://github.com/pytorch/pytorch/issues/60716 https://github.com/pytorch/pytorch/issues/60754 Test Plan: sandcastle python -m torch.distributed.launch --nproc_per_node 2 main.py -> uses 0 restarts python -m torch.distributed.run --nproc_per_node 2 main.py -> uses default for torchelastic, 0 restarts python -m torch.distributed.launch --nproc_per_node=4 --use_env --no_python main.py -> produces error python -m torch.distributed.launch --nproc_per_node=4 --use_env main.py -> no warning python -m torch.distributed.launch --nproc_per_node=4 --no_python main.py ->warning Output of running torch.distributed.launch without --use_env: $path/torch/distributed/launch.py:173: FutureWarning: The module torch.distributed.launch is deprecated and will be removed in future. Use torch.distributed.run. Note that --use_env is set by default in torch.distributed.run. If your script expects `--local_rank` argument to be set, please change it to read from `os.environ('LOCAL_RANK')` instead. New section: {F628923078} {F628974089} Reviewed By: kiukchung, cbalioglu Differential Revision: D29413019 fbshipit-source-id: 323bfbad9d0e4aba3b10ddd7a243ca6e48169630	2021-06-30 23:31:02 -07:00
Heitor Schueroff	f32f85e6da	Implemented torch.corrcoef (#60420 ) Summary: Implements `torch.corrcoef` similar to [`np.corrcoef`](https://numpy.org/doc/stable/reference/generated/numpy.corrcoef.html) using `torch.cov` implemented in https://github.com/pytorch/pytorch/pull/58311. closes https://github.com/pytorch/pytorch/issues/1254 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60420 Reviewed By: mruberry Differential Revision: D29474687 Pulled By: heitorschueroff fbshipit-source-id: f3c7c5610363aebd88274a51fc77e3cf879cb611	2021-06-30 12:36:02 -07:00
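The relationship between the two operators (correlation coefficients derived from a covariance matrix) can be sketched in plain Python. This is an illustration of the math only, not the PyTorch implementation; the functions below are stand-ins that assume each row is a variable and each column an observation, with the unbiased (n - 1) normalization.

```python
import math


def cov(rows):
    """Unbiased covariance matrix; each row is a variable, each column an observation."""
    n = len(rows[0])
    means = [sum(r) / n for r in rows]
    centered = [[x - m for x in r] for r, m in zip(rows, means)]
    return [
        [sum(a * b for a, b in zip(ri, rj)) / (n - 1) for rj in centered]
        for ri in centered
    ]


def corrcoef(rows):
    """Pearson correlation matrix: cov[i][j] / (std[i] * std[j])."""
    c = cov(rows)
    std = [math.sqrt(c[i][i]) for i in range(len(c))]
    return [
        [c[i][j] / (std[i] * std[j]) for j in range(len(c))]
        for i in range(len(c))
    ]
```

For example, `corrcoef([[1, 2, 3, 4], [2, 4, 6, 8]])` yields a matrix whose entries are all close to 1, because the second variable is an exact multiple of the first.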
Heitor Schueroff	ec9c03c234	Implemented torch.cov (#58311 ) Summary: Based from https://github.com/pytorch/pytorch/pull/50466 Adds the initial implementation of `torch.cov` similar to `numpy.cov`. For simplicity, we removed support for many parameters in `numpy.cov` that are either redundant such as `bias`, or have simple workarounds such as `y` and `rowvar`. cc PandaBoi closes https://github.com/pytorch/pytorch/issues/19037 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58311 Reviewed By: jbschlosser Differential Revision: D29431651 Pulled By: heitorschueroff fbshipit-source-id: 167dea880f534934b145ba94291a9d634c25b01b	2021-06-29 14:02:39 -07:00
Jeff Yang	a8057e7ef1	docs: add `permute` in torch docs (#60821 ) Summary: fix https://github.com/pytorch/pytorch/issues/60181 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60821 Reviewed By: VitalyFedyunin Differential Revision: D29431949 Pulled By: jbschlosser fbshipit-source-id: 2353afceaa188315cde1f0c955897c4750809c8e	2021-06-28 11:20:35 -07:00
Michael Carilli	2fa6c7627e	[CUDA graphs][BC-breaking] Removes post-backward syncs on default stream (#60421 ) Summary: Before https://github.com/pytorch/pytorch/pull/57833, calls to backward() or grad() synced only the calling thread's default stream with autograd leaf streams at the end of backward. This made the following weird pattern safe: ```python with torch.cuda.stream(s): # imagine forward used many streams, so backward leaf nodes may run on many streams loss.backward() # no sync use grads ``` but a more benign-looking pattern was unsafe: ```python with torch.cuda.stream(s): # imagine forward used a lot of streams, so backward leaf nodes may run on many streams loss.backward() # backward() syncs the default stream with all the leaf streams, but does not sync s with anything, # so counterintuitively (even though we're in the same stream context as backward()!) # it is NOT SAFE to use grads here, and there's no easy way to make it safe, # unless you manually sync on all the streams you used in forward, # or move "use grads" back to default stream outside the context. use grads ``` mruberry ngimel and I decided backward() should have the [same user-facing stream semantics as any cuda op](https://pytorch.org/docs/master/notes/cuda.html#stream-semantics-of-backward-passes). In other words, the weird pattern should be unsafe, and the benign-looking pattern should be safe. Implementationwise, this meant backward() should sync its calling thread's current stream, not default stream, with the leaf streams. After https://github.com/pytorch/pytorch/pull/57833, backward syncs the calling thread's current stream AND default stream with all leaf streams at the end of backward. The default stream syncs were retained for temporary backward compatibility. This PR finishes https://github.com/pytorch/pytorch/pull/57833's work by deleting syncs on the default stream. 
With this PR, graph-capturing an entire backward() call should be possible (see the [test_graph_grad_scaling diffs](https://github.com/pytorch/pytorch/compare/master...mcarilli:streaming_backwards_remove_default_syncs?expand=1#diff-893b1eea27352f336f4cd832919e48d721e4e90186e63400b8596db6b82e7450R3641-R3642)). first paragraph has a formatting error which this PR should also fix. Pull Request resolved: https://github.com/pytorch/pytorch/pull/60421 Reviewed By: albanD Differential Revision: D29370344 Pulled By: ngimel fbshipit-source-id: 3248bc5fb92fc517db0c15c897e5d7250f67d7fe	2021-06-24 17:34:02 -07:00
sawradip	eddc5f40f9	Added GLU and FeatureAlphaDropout to nn docs (#60590 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/60563 and https://github.com/pytorch/pytorch/issues/60570 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60590 Reviewed By: albanD Differential Revision: D29352372 Pulled By: jbschlosser fbshipit-source-id: f81dd65deab1848a68dc202df252c416ce5214d0	2021-06-24 08:00:18 -07:00
Luca Wehrstedt	bb9e1150ea	Revert D29342234: [pytorch][PR] [CUDA graphs][BC-breaking] Removes post-backward syncs on default stream Test Plan: revert-hammer Differential Revision: D29342234 (`675cea1adb`) Original commit changeset: 98e6be7fdd85 fbshipit-source-id: 84022973248b2254210eee57402df2c4f4bc43c6	2021-06-24 04:49:28 -07:00
kshitij12345	dfd2edc025	[special] add zeta (#59623 ) Summary: Reference https://github.com/pytorch/pytorch/issues/50345 `zeta` was already present in the codebase to support computation of `polygamma`. However, before this PR `zeta` only had a `double(double, double)` signature on CPU, which meant that the zeta part of a `polygamma` computation was always upcast to `double`. With this PR, float computations take place in float and double computations in double. Have also refactored the code and moved the duplicate code from `Math.cuh` to `Math.h`. Note: For scipy, `q` is optional, and if it is `None`, it defaults to `1`, which corresponds to the Riemann zeta function. However, for `torch.special.zeta`, I made it mandatory because it feels odd that without `q` this is the Riemann zeta function and with `q` it is the general Hurwitz zeta function. I think sticking to just the general form makes more sense, as passing `1` for `q` is trivial. Verify: * [x] Docs https://14234587-65600975-gh.circle-artifacts.com/0/docs/special.html#torch.special.zeta Pull Request resolved: https://github.com/pytorch/pytorch/pull/59623 Reviewed By: ngimel Differential Revision: D29348269 Pulled By: mruberry fbshipit-source-id: a3f9ebe1f7724dbe66de2b391afb9da1cfc3e4bb	2021-06-24 00:00:12 -07:00
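The note above about `q` can be illustrated numerically: the Hurwitz zeta function is the sum over k >= 0 of (k + q)^(-x), and setting q = 1 recovers the Riemann zeta function. The sketch below is a plain-Python approximation (a truncated sum plus a two-term Euler-Maclaurin tail estimate), not the kernel added by the PR; the function name and the `terms` parameter are assumptions.

```python
import math


def hurwitz_zeta(x, q, terms=1000):
    """Approximate sum_{k>=0} (k + q)**(-x) for x > 1 and q > 0."""
    if x <= 1 or q <= 0:
        raise ValueError("this sketch only handles x > 1 and q > 0")
    # Sum the first `terms` terms directly.
    head = sum((k + q) ** -x for k in range(terms))
    # Estimate the remaining tail: integral of the summand plus half the first omitted term.
    a = terms + q
    tail = a ** (1 - x) / (x - 1) + 0.5 * a ** -x
    return head + tail
```

With q = 1 and x = 2 the result should be close to pi^2 / 6, the value of the Riemann zeta function at 2.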
Akifumi Imanishi	26cdec6ce4	Support `torch.bitwise_{left/right}_shift` and `__rlshift__`, `__rrshift__` (#59544 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/58121 This PR implements `torch.bitwise_left_shift` and `torch.bitwise_right_shift` and `torch.Tensor.{__rlshift__/__rrshift__}`for compatibility with Python array API standard. (cc: mruberry, rgommers, emcastillo, kmaehashi) Pull Request resolved: https://github.com/pytorch/pytorch/pull/59544 Reviewed By: ngimel Differential Revision: D29348869 Pulled By: mruberry fbshipit-source-id: 329aee296cf890735e8a9f858bccfe87c03d06ca	2021-06-23 23:57:16 -07:00
Michael Carilli	675cea1adb	[CUDA graphs][BC-breaking] Removes post-backward syncs on default stream (#60421 ) Summary: Before https://github.com/pytorch/pytorch/pull/57833, calls to backward() or grad() synced only the calling thread's default stream with autograd leaf streams at the end of backward. This made the following weird pattern safe: ```python with torch.cuda.stream(s): # imagine forward used many streams, so backward leaf nodes may run on many streams loss.backward() # no sync use grads ``` but a more benign-looking pattern was unsafe: ```python with torch.cuda.stream(s): # imagine forward used a lot of streams, so backward leaf nodes may run on many streams loss.backward() # backward() syncs the default stream with all the leaf streams, but does not sync s with anything, # so counterintuitively (even though we're in the same stream context as backward()!) # it is NOT SAFE to use grads here, and there's no easy way to make it safe, # unless you manually sync on all the streams you used in forward, # or move "use grads" back to default stream outside the context. use grads ``` mruberry ngimel and I decided backward() should have the [same user-facing stream semantics as any cuda op](https://pytorch.org/docs/master/notes/cuda.html#stream-semantics-of-backward-passes). In other words, the weird pattern should be unsafe, and the benign-looking pattern should be safe. Implementationwise, this meant backward() should sync its calling thread's current stream, not default stream, with the leaf streams. After https://github.com/pytorch/pytorch/pull/57833, backward syncs the calling thread's current stream AND default stream with all leaf streams at the end of backward. The default stream syncs were retained for temporary backward compatibility. This PR finishes https://github.com/pytorch/pytorch/pull/57833's work by deleting syncs on the default stream. 
With this PR, graph-capturing an entire backward() call should be possible (see the [test_graph_grad_scaling diffs](https://github.com/pytorch/pytorch/compare/master...mcarilli:streaming_backwards_remove_default_syncs?expand=1#diff-893b1eea27352f336f4cd832919e48d721e4e90186e63400b8596db6b82e7450R3641-R3642)). first paragraph has a formatting error which this PR should also fix. Pull Request resolved: https://github.com/pytorch/pytorch/pull/60421 Reviewed By: VitalyFedyunin, albanD Differential Revision: D29342234 Pulled By: ngimel fbshipit-source-id: 98e6be7fdd8550872f0a78f9a66cb8dfe75abf63	2021-06-23 23:35:24 -07:00
Ilqar Ramazanli	63219f1f9f	To add Rectified Adam Algorithm to Optimizers (#58968 ) Summary: Fixes : https://github.com/pytorch/pytorch/issues/24892 In the paper https://arxiv.org/pdf/1908.03265.pdf, Liyuan Liu et al. proposed a new optimization algorithm similar in essence to Adam. The paper argues that, without a warmup heuristic, the adaptive learning rate can have undesirably large variance in the early stage of training, which can slow overall convergence. The authors propose rectifying the variance of the adaptive learning rate when it is expected to be high. Differing from the paper, we selected a variance tractability cut-off of 5 instead of 4. This adjustment is common practice and can be found in the code repository as well as in the TensorFlow Swift optim library: `2f03dd1970/radam/radam.py (L156)` `f51ee4618d/Sources/TensorFlow/Optimizers/MomentumBased.swift (L638)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/58968 Reviewed By: vincentqb Differential Revision: D29310601 Pulled By: iramazanli fbshipit-source-id: b7bd487f72f1074f266687fd9c0c6be264a748a9	2021-06-23 18:27:57 -07:00
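The role of the cut-off can be sketched as follows. This is a plain-Python reading of the rectification term from the cited paper, not the optimizer code added by the PR; the function name and the convention of returning `None` when rectification is skipped are assumptions.

```python
import math


def radam_rectification(step, beta2=0.999, cutoff=5.0):
    """Return the variance rectification factor, or None while the variance is intractable.

    `cutoff` is the variance tractability threshold: 5 in this PR, 4 in the paper.
    """
    rho_inf = 2.0 / (1.0 - beta2) - 1.0
    beta2_t = beta2 ** step
    # Length of the approximated simple moving average at this step.
    rho_t = rho_inf - 2.0 * step * beta2_t / (1.0 - beta2_t)
    if rho_t <= cutoff:
        return None  # early steps: fall back to an un-adapted (momentum-only) update
    return math.sqrt(
        (rho_t - 4.0) * (rho_t - 2.0) * rho_inf
        / ((rho_inf - 4.0) * (rho_inf - 2.0) * rho_t)
    )
```

In the first few steps the function returns `None`; afterwards the factor grows from a small positive value toward 1 as the step count increases.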
Ilqar Ramazanli	e8690dacb2	To add Nesterov Adam Algorithm to Optimizers (#59009 ) Summary: Fixes : https://github.com/pytorch/pytorch/issues/5804 In the paper https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ, Timothy Dozat proposed a new optimization algorithm that in essence combines the NAG and Adam algorithms. Momentum is known to benefit from Nesterov acceleration in optimization algorithms, and Dozat investigates applying this idea to the momentum component of Adam. The author provides experimental evidence in support of the idea. This PR implements the NAdam algorithm proposed in that paper. In preliminary work (http://cs229.stanford.edu/proj2015/054_report.pdf), the author shows that the decay base constant should be taken as 0.96, which this implementation also follows, as Keras does. The implementation also follows Keras coding practice in some other places: `f9d3868495/tensorflow/python/keras/optimizer_v2/nadam.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/59009 Reviewed By: gchanan, vincentqb Differential Revision: D29220375 Pulled By: iramazanli fbshipit-source-id: 4b4bb4b15f7e16f7527f368bbf4207ed345751aa	2021-06-23 08:21:43 -07:00
Weiqiang Wu	6a87e8d087	Implement erfcx() (#58194 ) Summary: Implement erfcx() https://github.com/pytorch/pytorch/issues/31945 Reference: https://github.com/pytorch/pytorch/issues/50345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58194 Reviewed By: ngimel Differential Revision: D29285979 Pulled By: mruberry fbshipit-source-id: 5bcfe77fddfabbeb8c8068658ba6d9fec6430399	2021-06-22 12:38:38 -07:00
Sam Estep	1abf45e37f	Revert D29241736: [pytorch][PR] To add Rectified Adam Algorithm to Optimizers Test Plan: revert-hammer Differential Revision: D29241736 (`0d2a936176`) Original commit changeset: 288b9b1f3125 fbshipit-source-id: 56c4ec98647c6f1822b130726741a1c9ca193670	2021-06-22 12:08:31 -07:00
Ilqar Ramazanli	0d2a936176	To add Rectified Adam Algorithm to Optimizers (#58968 ) Summary: Fixes : https://github.com/pytorch/pytorch/issues/24892 In the paper https://arxiv.org/pdf/1908.03265.pdf, Liyuan Liu et al. proposed a new optimization algorithm similar in essence to Adam. The paper argues that, without a warmup heuristic, the adaptive learning rate can have undesirably large variance in the early stage of training, which can slow overall convergence. The authors propose rectifying the variance of the adaptive learning rate when it is expected to be high. Differing from the paper, we selected a variance tractability cut-off of 5 instead of 4. This adjustment is common practice and can be found in the code repository as well as in the TensorFlow Swift optim library: `2f03dd1970/radam/radam.py (L156)` `f51ee4618d/Sources/TensorFlow/Optimizers/MomentumBased.swift (L638)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/58968 Reviewed By: gchanan Differential Revision: D29241736 Pulled By: iramazanli fbshipit-source-id: 288b9b1f3125fdc6c7a7bb23fde1ea5c201c0448	2021-06-22 10:38:41 -07:00
Saketh Are	729f7cd52f	Implement histogram operator on CPU (#58780 ) Summary: The existing [torch.histc](https://pytorch.org/docs/stable/generated/torch.histc.html) operator is limited in comparison to [numpy.histogram](https://numpy.org/doc/stable/reference/generated/numpy.histogram.html). This PR adds torch.histogram on CPU. The new operator replicates numpy.histogram's behavior, including support for caller-specified bin edges and weights. It was motivated by previous community requests for histogram. The implementation was [benchmarked](https://docs.google.com/spreadsheets/d/1xCR0jODchVvwdVSAjiLsNCkmyictA6j1LNfDpWOafjw/edit?usp=sharing) against numpy.histogram as well as torch.histc. This implementation is weakly faster than numpy.histogram across all types of inputs tested, and performs in line with torch.histc for the limited inputs histc supports. mruberry Pull Request resolved: https://github.com/pytorch/pytorch/pull/58780 Test Plan: Added unit tests, OpInfo for the new torch.histogram operator. Tested execution time on a variety of input sizes and compared to numpy.histogram performance: https://docs.google.com/spreadsheets/d/1xCR0jODchVvwdVSAjiLsNCkmyictA6j1LNfDpWOafjw/edit?usp=sharing Reviewed By: ezyang Differential Revision: D29134626 Pulled By: saketh-are fbshipit-source-id: f2773085de1697f6bc6ffdeffe9a81267f51bdfc	2021-06-22 10:06:04 -07:00
kshitij12345	01e0296eb7	[special] migrate log1p, sinc, round to special namespace (#55878 ) Summary: Reference : https://github.com/pytorch/pytorch/issues/50345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/55878 Reviewed By: zou3519, janeyx99 Differential Revision: D29160593 Pulled By: mruberry fbshipit-source-id: f3ca9c541382bab33fb85d7817ce8ddc117c6826	2021-06-21 12:34:29 -07:00
Michael Wootton	2f3be2735f	Don't split oversize cached blocks (#44742 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/35901 This change is designed to prevent fragmentation in the Caching Allocator. Permissive block splitting in the allocator allows very large blocks to be split into many pieces. Once split too finely it is unlikely all pieces will be 'free' at that same time so the original allocation can never be returned. Anecdotally, we've seen a model run out of memory failing to alloc a 50 MB block on a 32 GB card while the caching allocator is holding 13 GB of 'split free blocks' Approach: - Large blocks above a certain size are designated "oversize". This limit is currently set 1 decade above large, 200 MB - Oversize blocks can not be split - Oversize blocks must closely match the requested size (e.g. a 200 MB request will match an existing 205 MB block, but not a 300 MB block) - In lieu of splitting oversize blocks there is a mechanism to quickly free a single oversize block (to the system allocator) to allow an appropriate size block to be allocated. This will be activated under memory pressure and will prevent _release_cached_blocks()_ from triggering Initial performance tests show this is similar or quicker than the original strategy. Additional tests are ongoing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/44742 Reviewed By: zou3519 Differential Revision: D29186394 Pulled By: ezyang fbshipit-source-id: c88918836db3f51df59de6d1b3e03602ebe306a9	2021-06-21 11:46:08 -07:00
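The matching rule described in the approach above can be sketched as a toy block selector. This is an illustration only, not the allocator's code: the 200 MB threshold comes from the commit message, while the 20 MB slack (`MATCH_SLACK`) and the function name `pick_block` are assumptions chosen so that the commit's own example holds (205 MB matches a 200 MB request, 300 MB does not).

```python
MB = 1024 * 1024
OVERSIZE_THRESHOLD = 200 * MB  # from the commit message
MATCH_SLACK = 20 * MB          # hypothetical tolerance for "closely match"


def pick_block(free_block_sizes, request_size):
    """Return the smallest reusable cached block size, or None if nothing fits."""
    candidates = []
    for size in free_block_sizes:
        if size < request_size:
            continue  # too small
        if size >= OVERSIZE_THRESHOLD and size - request_size > MATCH_SLACK:
            # Oversize blocks are never split, so a loose fit is rejected.
            continue
        candidates.append(size)
    return min(candidates) if candidates else None
```

Under these assumptions, a small request never carves a piece out of an oversize block, which is the fragmentation source the change is meant to remove.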
Thomas J. Fan	c16f87949f	ENH Adds nn.ReflectionPad3d (#59791 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/27655 This PR adds a C++ and Python version of ReflectionPad3d with structured kernels. The implementation uses lambdas extensively to better share code from the backward and forward pass. Pull Request resolved: https://github.com/pytorch/pytorch/pull/59791 Reviewed By: gchanan Differential Revision: D29242015 Pulled By: jbschlosser fbshipit-source-id: 18e692d3b49b74082be09f373fc95fb7891e1b56	2021-06-21 10:53:14 -07:00
Michael Carilli	f89ae9cb8d	Moves grid_sampler to autocast promote list (#58618 ) Summary: Should close https://github.com/pytorch/pytorch/issues/42218 Numerically, `grid_sampler` is fine in fp16 or fp32, but takes several inputs and expects their dtypes to match, so it belongs on the autocast promote list. `grid_sampler` currently uses `gpuAtomicAdd`, notoriously slow in fp16 because it calls cuda's atomicAdd __half overload which uses a software compare-and-swap loop internally. To allow good performance if both inputs happen to be FP16, the PR also modifies `grid_sampler_[2,3]d_backward_kernel`s to use `fastAtomicAdd` instead. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58618 Reviewed By: mruberry Differential Revision: D29257199 Pulled By: ngimel fbshipit-source-id: 3cc7505945b480427f2fc1beb36bee80bf3853b3	2021-06-21 10:22:36 -07:00
kshitij12345	5ec4ad7f54	[special] Add special.ndtri (#58650 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 TODO * [x] Add docs https://13865352-65600975-gh.circle-artifacts.com/0/docs/special.html#torch.special.ndtri * [x] Add comments on implementation * [x] Clean-up Pull Request resolved: https://github.com/pytorch/pytorch/pull/58650 Reviewed By: H-Huang Differential Revision: D29160170 Pulled By: mruberry fbshipit-source-id: 50e4ea663920e97b8437d03d5b52bcd9dedc1a8d	2021-06-19 18:36:54 -07:00
Patrick Wang	8b55e9feaf	removed cat, equal, and stack from autocast promote list (#59497 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/59497 Reviewed By: zou3519 Differential Revision: D29185909 Pulled By: ngimel fbshipit-source-id: db96239106d9e46a2704b8f457fd0463dacc1f5c	2021-06-17 21:13:22 -07:00
Patrick	5948e6f653	removed gelu from autocast fp32 list (#59639 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/59639 Reviewed By: H-Huang Differential Revision: D29155914 Pulled By: ezyang fbshipit-source-id: feb117181894c2355768d5b1189b3d5f1649fc0b	2021-06-16 16:29:57 -07:00
Michael Suo	15f236f3e3	[package] fix tutorial link (#60113 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60113 The tutorial link in the docs was to an fb-only colab. Test Plan: Imported from OSS Reviewed By: SplitInfinity Differential Revision: D29169818 Pulled By: suo fbshipit-source-id: 374807c234a185bd515b8ffe1300e6cf8d821636	2021-06-16 11:27:25 -07:00
BowenBao	55530e2276	Update Autograd Export Docs (#56594 ) (#59534 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59534 Update autograd export docs Test Plan: Imported from OSS Reviewed By: nikithamalgifb, ansley Differential Revision: D29046606 Pulled By: SplitInfinity fbshipit-source-id: 36057f6bdfd3e5c071dbca05d327de7952904120 Co-authored-by: neginraoof <neginmr@utexas.edu>	2021-06-15 12:23:00 -07:00
Joel Schlosser	c645d39a77	Implementation of torch.isin() (#53125 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/3025 ## Background This PR implements a function similar to numpy's [`isin()`](https://numpy.org/doc/stable/reference/generated/numpy.isin.html#numpy.isin). The op supports integral and floating point types on CPU and CUDA (+ half & bfloat16 for CUDA). Inputs can be one of: * (Tensor, Tensor) * (Tensor, Scalar) * (Scalar, Tensor) Internally, one of two algorithms is selected based on the number of elements vs. test elements. The heuristic for deciding which algorithm to use is taken from [numpy's implementation](`fb215c7696/numpy/lib/arraysetops.py (L575)`): if `len(test_elements) < 10 * len(elements) ** 0.145`, then a naive brute-force checking algorithm is used. Otherwise, a stablesort-based algorithm is used. I've done some preliminary benchmarking to verify this heuristic on a devgpu, and determined for a limited set of tests that a power value of `0.407` instead of `0.145` is a better inflection point. For now, the heuristic has been left to match numpy's, but input is welcome for the best way to select it or whether it should be left the same as numpy's. Tests are adapted from numpy's [isin and in1d tests](`7dcd29aaaf/numpy/lib/tests/test_arraysetops.py`). Note: my locally generated docs look terrible for some reason, so I'm not including the screenshot for them until I figure out why. Pull Request resolved: https://github.com/pytorch/pytorch/pull/53125 Test Plan: ``` python test/test_ops.py # Ex: python test/test_ops.py TestOpInfoCPU.test_supported_dtypes_isin_cpu_int32 python test/test_sort_and_select.py # Ex: python test/test_sort_and_select.py TestSortAndSelectCPU.test_isin_cpu_int32 ``` Reviewed By: soulitzer Differential Revision: D29101165 Pulled By: jbschlosser fbshipit-source-id: 2dcc38d497b1e843f73f332d837081e819454b4e	2021-06-14 13:50:53 -07:00
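The algorithm selection described above can be sketched in plain Python. The threshold expression is the one quoted in the commit message; everything else is a stand-in for illustration: the sort path here uses a sorted copy plus binary search rather than the op's actual sort-based kernel, and the function names are assumptions.

```python
from bisect import bisect_left


def uses_brute_force(num_elements, num_test_elements):
    """Heuristic quoted in the commit message (taken from numpy)."""
    return num_test_elements < 10 * num_elements ** 0.145


def isin(elements, test_elements):
    """Return a list of booleans: whether each element occurs in test_elements."""
    if uses_brute_force(len(elements), len(test_elements)):
        # Few test elements: compare every element against every test element.
        return [any(e == t for t in test_elements) for e in elements]
    # Many test elements: sort once, then binary-search each element.
    ordered = sorted(test_elements)
    result = []
    for e in elements:
        i = bisect_left(ordered, e)
        result.append(i < len(ordered) and ordered[i] == e)
    return result
```

Both paths return the same answers; the heuristic only decides which one is likely to be cheaper for the given input sizes.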
Meghan Lele	8e92a3a8b0	[docs] Add pickle security warning to package docs (#59959 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59959 Summary This commit replaces the warning on the `torch.package` documentation page about the module not being publicly released (which will no longer be true as of 1.9) with one that warns about security issues caused by the use of the `pickle` module. Test Plan 1) Built the docs locally. 2) Continuous integration. <img width="877" alt="Captura de Pantalla 2021-06-14 a la(s) 11 22 05 a m" src="https://user-images.githubusercontent.com/4392003/121940300-c98cab00-cd02-11eb-99dc-08e29632079a.png"> Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D29108429 Pulled By: SplitInfinity fbshipit-source-id: 3a0aeac0dc804a31203bc5071efb1c5bd6ef9725	2021-06-14 13:03:05 -07:00
Kushashwa Ravi Shrimali	cf38b20c61	Alias for `digamma` as `psi` to `special` namespace (#59143 ) Summary: See https://github.com/pytorch/pytorch/issues/50345 cc: mruberry kshitij12345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59143 Reviewed By: jbschlosser Differential Revision: D28986909 Pulled By: mruberry fbshipit-source-id: bc8ff0375de968f3662b224689fa0a6b117f9c4e	2021-06-14 03:05:14 -07:00
Michael Carilli	be038d8989	[CUDA graphs] Make stream semantics of backward calls consistent with other cuda ops (ci-all edition) (#57833 ) Summary: ci-all resubmit of https://github.com/pytorch/pytorch/pull/54227. Tests look good except for a few distributed autograd failures (pytorch_linux_xenial_cuda10_2_cudnn7_py3_multigpu_test) and rocm failures (pr/pytorch-linux-bionic-rocm4.1-py3.6). The common denominator in rocm failures appears to be multi-gpu activity: some [multiprocess DDP failures](https://ci.pytorch.org/jenkins/job/pytorch-builds/job/pytorch-linux-bionic-rocm4.1-py3.6-test1/8115/console), some [single-process failures](https://ci.pytorch.org/jenkins/job/pytorch-builds/job/pytorch-linux-bionic-rocm4.1-py3.6-test2/8115/console) where the single process has autograd ops that span devices. jeffdaily jithunnair-amd sunway513, could one of you take a look? The streaming backward change is also beneficial to rocm, I expect. For debugging rocm failures, I think we should ignore the multiprocess/DDP tests and focus on the single process cases. The root cause is probably the same and the single process cases are simpler. ---------------------------------- Update: Rocm failures are due to https://github.com/pytorch/pytorch/issues/59750. `2718a54032` is a workaround, to be updated once https://github.com/pytorch/pytorch/issues/59750 is fixed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57833 Reviewed By: mruberry Differential Revision: D28942391 Pulled By: ngimel fbshipit-source-id: d6047e971c5f1c6386334bf3641402a92f12e2f8	2021-06-13 12:09:56 -07:00
Mike Ruberry	92513038e8	Revert D28994140: [pytorch][PR] Implemented torch.cov Test Plan: revert-hammer Differential Revision: D28994140 (`23c232554b`) Original commit changeset: 1890166c0a9c fbshipit-source-id: 73dfe1b00464e38f004f99960cdeeb604ed4b20a	2021-06-13 02:33:37 -07:00
Heitor Schueroff	23c232554b	Implemented torch.cov (#58311 ) Summary: Based from https://github.com/pytorch/pytorch/pull/50466 Adds the initial implementation of `torch.cov` similar to `numpy.cov`. For simplicity, we removed support for many parameters in `numpy.cov` that are either redundant such as `bias`, or have simple workarounds such as `y` and `rowvar`. cc PandaBoi TODO - [x] Improve documentation Pull Request resolved: https://github.com/pytorch/pytorch/pull/58311 Reviewed By: mruberry Differential Revision: D28994140 Pulled By: heitorschueroff fbshipit-source-id: 1890166c0a9c01e0a536acd91571cd704d632f44	2021-06-11 09:40:50 -07:00
Meghan Lele	4025f95a20	[docs] Add table of contents to torch.package docs (#59842 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59842 Test Plan: Continuous integration. <img width="544" alt="Captura de Pantalla 2021-06-10 a la(s) 5 13 07 p m" src="https://user-images.githubusercontent.com/4392003/121612390-2ccec280-ca0f-11eb-87ad-fef632ba05ca.png"> Reviewed By: Lilyjjo Differential Revision: D29050627 Pulled By: SplitInfinity fbshipit-source-id: 76c25ed4002cbaf072036e2e14e7857c15077df7	2021-06-10 19:52:50 -07:00
Meghan Lele	0e222db087	[docs] Add explanation section to torch.package docs (#59833 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59833 Summary This commit adds an explanation section to the `torch.package` documentation. This section clarifies and illuminates various aspects of the internals of `torch.package` that might be of interest to users. Test Plan Continuous integration. Test Plan: Imported from OSS Reviewed By: Lilyjjo Differential Revision: D29050626 Pulled By: SplitInfinity fbshipit-source-id: 78e0cda00f69506ef2dfc52d6df63694b502269e	2021-06-10 19:52:48 -07:00
Meghan Lele	062dde7285	[docs] Add "how do I" section to torch.package docs (#59503 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59503 Summary This commit adds a "how do I..." section to the `torch.package` documentation. This section contains short guides about how to solve real-world problems that frequently recur while using `torch.package`. Test Plan Continuous integration. <img width="877" alt="Captura de Pantalla 2021-06-04 a la(s) 9 19 54 p m" src="https://user-images.githubusercontent.com/4392003/120879911-98321380-c57b-11eb-8664-c582c92b7837.png"> Test Plan: Imported from OSS Reviewed By: Lilyjjo Differential Revision: D29050629 Pulled By: SplitInfinity fbshipit-source-id: 2b7800732e0a3c1c947f110c05562aed5174a87f	2021-06-10 19:52:47 -07:00
Meghan Lele	6a18ca7a07	[docs] Add tutorials section to torch.package docs (#59499 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59499 Summary This commit adds a tutorials section to the torch.package docs. Test Plan Continuous integration. <img width="870" alt="Captura de Pantalla 2021-06-04 a la(s) 5 10 31 p m" src="https://user-images.githubusercontent.com/4392003/120874257-b9ced300-c55a-11eb-84dd-721cb7ac73ab.png"> Test Plan: Imported from OSS Reviewed By: Lilyjjo Differential Revision: D29050628 Pulled By: SplitInfinity fbshipit-source-id: c17ab0100a9d63e7af8da7a618143cedbd0a5872	2021-06-10 19:52:45 -07:00
Meghan Lele	a3db8e0a26	[docs] Add torch.package documentation preamble (#59491 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59491 Summary This commit adds a preamble to the `torch.package` documentation page that explains briefly what `torch.package` is. Test Plan Continuous integration. <img width="881" alt="Captura de Pantalla 2021-06-04 a la(s) 3 57 01 p m" src="https://user-images.githubusercontent.com/4392003/120872203-d535e000-c552-11eb-841d-b38df19bc992.png"> Test Plan: Imported from OSS Reviewed By: Lilyjjo Differential Revision: D29050630 Pulled By: SplitInfinity fbshipit-source-id: 70a3fd43f076751c6ea83be3ead291686c641158	2021-06-10 19:51:37 -07:00
Rohan Varma	2f395f3b54	[reland] Document debugability features in torch.distributed (#59726 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59726 Reland of https://github.com/pytorch/pytorch/pull/59604 with indentation fix ghstack-source-id: 130979356 Test Plan: ci Reviewed By: SciPioneer Differential Revision: D29001923 fbshipit-source-id: 225d9dc5054c223b453f3b39749e2b62f61b9a2c	2021-06-09 16:40:11 -07:00
Luca Wehrstedt	f1786b293d	Revert D28972444: [pytorch][PR] Document debugability features in torch.distributed Test Plan: revert-hammer Differential Revision: D28972444 (`a9d2810817`) Original commit changeset: da5e8ee84f0d fbshipit-source-id: 94d3b3b75ddec74ea5b2b76f6a7519dc921ee2a7	2021-06-09 03:04:36 -07:00
Rohan Varma	a9d2810817	Document debugability features in torch.distributed (#59604 ) Summary: Adds comprehensive documentation around debugability features added to `torch.distributed` recently, including the `monitored_barrier` and TORCH_DISTRIBUTED_DEBUG env variable. ![dist_one](https://user-images.githubusercontent.com/8039770/121102672-0f052180-c7b3-11eb-974c-81dbbe102cb6.png) ![dist_two](https://user-images.githubusercontent.com/8039770/121102734-39ef7580-c7b3-11eb-94f7-c75469351440.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/59604 Reviewed By: jbschlosser, SciPioneer Differential Revision: D28972444 Pulled By: rohan-varma fbshipit-source-id: da5e8ee84f0d6f252c703c4d70ff2a0d5817cc4e	2021-06-08 23:52:19 -07:00
Jeffrey Wan	f52e202840	Add warning when accessing Tensor::grad() in the C++ API (#59362 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/35379 - Adds `retains_grad` attribute backed by cpp as a native function. The python bindings for the function are skipped to be consistent with `is_leaf`. - Tried writing it without native function, but the jit test `test_tensor_properties` seems to require that it be a native function (or alternatively maybe it could also work if we manually add a prim implementation?). - Python API now uses `retain_grad` implementation from cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/59362 Reviewed By: jbschlosser Differential Revision: D28969298 Pulled By: soulitzer fbshipit-source-id: 335f2be50b9fb870cd35dc72f7dadd6c8666cc02	2021-06-08 19:43:21 -07:00
James Reed	02d380450d	[FX][docs][EZ] Fix link to fuser example (#59670 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59670 Test Plan: Imported from OSS Reviewed By: jansel Differential Revision: D28975704 Pulled By: jamesr66a fbshipit-source-id: 2fb759224b5b1ecc62c0ab26563d2a35ed422794	2021-06-08 17:32:55 -07:00
Vasiliy Kuznetsov	dafa4b3517	quantization: improve documentation on natively supported backends (#58925 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58925 Cleans up documentation on natively supported backends. In particular: * adds a section title * deduplicates information about fbgemm/qnnpack * clarifies what `torch.backends.quantized.engine` does * adds code samples with default settings for `fbgemm` and `qnnpack` Test Plan: Imported from OSS Reviewed By: jerryzh168 Differential Revision: D28681840 Pulled By: vkuzo fbshipit-source-id: 51a6ab66934f657553351f6c84a638fd5f7b4e12	2021-06-07 17:29:03 -07:00
Thomas J. Fan	6ff001c125	DOC Improve documentation for LayerNorm (#59178 ) Summary: Closes https://github.com/pytorch/pytorch/issues/51455 I think the current implementation is aggregating over the correct dimensions. The shape of `normalized_shape` is only used to determine the dimensions to aggregate over. The actual values of `normalized_shape` are used when `elementwise_affine=True` to initialize the weights and biases. This PR updates the docstring to clarify how `normalized_shape` is used. Here is a short script comparing the implementations for tensorflow and pytorch: ```python import numpy as np import torch import torch.nn as nn import tensorflow as tf from tensorflow.keras.layers import LayerNormalization rng = np.random.RandomState() x = rng.randn(10, 20, 64, 64).astype(np.float32) # slightly non-trivial x[:, :10, ...] = x[:, :10, ...] * 10 + 20 x[:, 10:, ...] = x[:, 10:, ...] * 30 - 100 # Tensorflow Layer norm x_tf = tf.convert_to_tensor(x) layer_norm_tf = LayerNormalization(axis=[-3, -2, -1], epsilon=1e-5) output_tf = layer_norm_tf(x_tf) output_tf_np = output_tf.numpy() # PyTorch Layer norm x_torch = torch.as_tensor(x) layer_norm_torch = nn.LayerNorm([20, 64, 64], elementwise_affine=False) output_torch = layer_norm_torch(x_torch) output_torch_np = output_torch.detach().numpy() # check tensorflow and pytorch torch.testing.assert_allclose(output_tf_np, output_torch_np) # manual computation manual_output = ((x_torch - x_torch.mean(dim=(-3, -2, -1), keepdims=True)) / (x_torch.var(dim=(-3, -2, -1), keepdims=True, unbiased=False) + 1e-5).sqrt()) torch.testing.assert_allclose(output_torch, manual_output) ``` To get to the layer normalization as shown here: <img width="157" alt="Screen Shot 2021-05-29 at 2 13 52 PM" src="https://user-images.githubusercontent.com/5402633/120080691-1e37f100-c088-11eb-9060-4f263e4cd093.png"> One needs to pass in `normalized_shape` with shape `x.dim() - 1` with the size of the channels and all spatial dimensions. 
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59178 Reviewed By: ejguan Differential Revision: D28931877 Pulled By: jbschlosser fbshipit-source-id: 193e05205b9085bb190c221428c96d2ca29f2a70	2021-06-07 14:34:10 -07:00
anjali411	3607478ecd	Conjugate View (#54987 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54987 Based off of ezyang (https://github.com/pytorch/pytorch/pull/44799) and bdhirsh (https://github.com/pytorch/pytorch/pull/43702) 's prototype: Here's a summary of the changes in this PR: This PR adds a new dispatch key called Conjugate. This enables us to make conjugate operation a view and leverage the specialized library functions that fast path with the hermitian operation (conj + transpose). 1. Conjugate operation will now return a view with conj bit (1) for complex tensors and returns self for non-complex tensors as before. This also means `torch.view_as_real` will no longer be a view on conjugated complex tensors and is hence disabled. To fill the gap, we have added `torch.view_as_real_physical` which would return the real tensor agnostic of the conjugate bit on the input complex tensor. The information about conjugation on the old tensor can be obtained by calling `.is_conj()` on the new tensor. 2. NEW API: a) `.conj()` -- now returning a view. b) `.conj_physical()` -- does the physical conjugate operation. If the conj bit for input was set, you'd get `self.clone()`, else you'll get a new tensor with conjugated value in its memory. c) `.conj_physical_()`, and `out=` variant d) `.resolve_conj()` -- materializes the conjugation. returns self if the conj bit is unset, else returns a new tensor with conjugated values and conj bit set to 0. e) `.resolve_conj_()` in-place version of (d) f) `view_as_real_physical` -- as described in (1), it's functionally same as `view_as_real`, just that it doesn't error out on conjugated tensors. g) `view_as_real` -- existing function, but now errors out on conjugated tensors. 3. Conjugate Fallback a) Vast majority of PyTorch functions would currently use this fallback when they are called on a conjugated tensor. 
b) This fallback is well equipped to handle the following cases: - functional operation e.g., `torch.sin(input)` - Mutable inputs and in-place operations e.g., `tensor.add_(2)` - out-of-place operation e.g., `torch.sin(input, out=out)` - Tensorlist input args - NOTE: Meta tensors don't work with conjugate fallback. 4. Autograd a) `resolve_conj()` is an identity function w.r.t. autograd b) Everything else works as expected. 5. Testing: a) All method_tests run with conjugate view tensors. b) OpInfo tests that run with conjugate views - test_variant_consistency_eager/jit - gradcheck, gradgradcheck - test_conj_views (that only run for `torch.cfloat` dtype) NOTE: functions like `empty_like`, `zeros_like`, `randn_like`, `clone` don't propagate the conjugate bit. Follow up work: 1. conjugate view RFC 2. Add neg bit to re-enable view operation on conjugated tensors 3. Update linalg functions to call into specialized functions that fast path with the hermitian operation. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D28227315 Pulled By: anjali411 fbshipit-source-id: acab9402b9d6a970c6d512809b627a290c8def5f	2021-06-04 14:12:41 -07:00
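The conjugate-view behaviour summarized in the commit above can be sketched as follows. This is a minimal illustration, assuming a PyTorch release that includes this change (1.10 or later is an assumption here); the tensor values are arbitrary.

```python
import torch

x = torch.tensor([1 + 2j, 3 - 4j])

y = x.conj()            # lazy: a view of x with the conj bit set
z = x.conj_physical()   # eager: new memory holding the conjugated values
r = y.resolve_conj()    # materializes the conjugation; conj bit is cleared

# Only the lazy view carries the conj bit.
print(y.is_conj(), z.is_conj(), r.is_conj())
# The materialized view and the physical conjugate hold the same values.
print(torch.equal(r, z))
```

Only `.conj()` defers the work; `.conj_physical()` and `.resolve_conj()` both yield tensors whose memory already holds the conjugated values.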
Jeffrey Wan	4ae5764d47	Add is_inference to native functions (#58729 ) Summary: Adds `is_inference` as a native function w/ manual cpp bindings. Also changes instances of `is_inference_tensor` to `is_inference` to be consistent with other properties such as `is_complex`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58729 Reviewed By: mruberry Differential Revision: D28874507 Pulled By: soulitzer fbshipit-source-id: 0fa6bcdc72a4ae444705e2e0f3c416c1b28dadc7	2021-06-04 08:59:11 -07:00
Kushashwa Ravi Shrimali	44c20ce676	Alias for `i0` to `special` namespace (#59141 ) Summary: See https://github.com/pytorch/pytorch/issues/50345 cc: mruberry kshitij12345 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59141 Reviewed By: ngimel Differential Revision: D28784097 Pulled By: mruberry fbshipit-source-id: 9b61a21906ef337292686fd40e328502a79e6f09	2021-06-01 23:04:09 -07:00
Thomas J. Fan	8af6281201	DOC Adds register_module_full_backward_hook into docs (#58954 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54443 Adds `register_module_full_backward_hook` into the index so it is rendered in the html docs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58954 Reviewed By: ngimel Differential Revision: D28801816 Pulled By: jbschlosser fbshipit-source-id: a2e737fe983e5d7e4e26d7639183bca34b571cb8	2021-06-01 15:47:10 -07:00
kshitij12345	fea7a79e0b	[special] Add ndtr (#58126 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 Plot: ![image](https://user-images.githubusercontent.com/19503980/117942099-54efd680-b328-11eb-8948-c3080779ce19.png) https://colab.research.google.com/drive/1Of67A042rOImj8wrLF_fUTgoy_wVEOZS?usp=sharing TODO: * [x] Add docs (https://13385714-65600975-gh.circle-artifacts.com/0/docs/special.html#torch.special.ndtr) Pull Request resolved: https://github.com/pytorch/pytorch/pull/58126 Reviewed By: anjali411 Differential Revision: D28700957 Pulled By: mruberry fbshipit-source-id: 5b9991e97ec1e8fd01518cc9d9849108d35fe406	2021-05-30 21:12:04 -07:00
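For reference, `ndtr` is the standard normal cumulative distribution function. The sketch below writes out that definition in pure Python with `math.erf`; it is not the PyTorch kernel, and `ndtr_scalar` is a hypothetical helper name used only for illustration.

```python
import math


def ndtr_scalar(x: float) -> float:
    """Area under the standard normal density from -inf to x."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


# Half of the probability mass lies below the mean.
print(ndtr_scalar(0.0))
# Symmetry: ndtr(-x) + ndtr(x) sums to (approximately) 1.
print(ndtr_scalar(-1.0) + ndtr_scalar(1.0))
```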
kshitij12345	5c18994674	[special] Add `i1` and `i1e` (#56352 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/50345 * [x] Check Docs https://12721710-65600975-gh.circle-artifacts.com/0/docs/special.html * [x] Investigate fp32 failure on CI?! (Fails on clang. Reproduced locally with clang-11) * [ ] Kernel vs Composite? * [x] Autograd for `i0e` for zero? Pull Request resolved: https://github.com/pytorch/pytorch/pull/56352 Reviewed By: anjali411 Differential Revision: D28700888 Pulled By: mruberry fbshipit-source-id: 91a3cbb94f5b8a3b063589ec38179848c11def83	2021-05-29 20:55:23 -07:00
Jeffrey Wan	9e60c7dee3	Add docstring for is_inference_mode_enabled (#59047 ) Summary: Testing: ``` >>> import torch >>> torch.is_inference_mode_enabled.__doc__ '\nis_inference_mode_enabled(input) -> (bool)\n\nReturns True if inference mode is currently enabled.\n\nArgs:\n input (Tensor): the input tensor.\n' ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/59047 Reviewed By: ailzhang Differential Revision: D28726991 Pulled By: soulitzer fbshipit-source-id: c117c7d73e551a1b5f0e215f2aed528bf558ef7c	2021-05-26 19:27:33 -07:00
Joel Schlosser	a749e8edf5	Add UninitializedBuffer to nn docs (#59021 ) Summary: The `UninitializedBuffer` class was previously left out of `nn.rst`, so it was not included in the generated documentation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/59021 Reviewed By: anjali411 Differential Revision: D28723044 Pulled By: jbschlosser fbshipit-source-id: 71e15b0c7fabaf57e8fbdf7fbd09ef2adbdb36ad	2021-05-26 14:36:05 -07:00
Jeffrey Wan	a7a5992d7d	Add no-grad inference mode note (#58513 ) Summary: Adds a note explaining the difference between several often conflated mechanisms in the autograd note Also adds a link to this note from the docs in `grad_mode` and `nn.module`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58513 Reviewed By: gchanan Differential Revision: D28651129 Pulled By: soulitzer fbshipit-source-id: af9eb1749b641fc1b632815634eea36bf7979156	2021-05-25 13:06:54 -07:00
Adnios	09a8f22bf9	Add mish activation function (#58648 ) Summary: See issue: https://github.com/pytorch/pytorch/issues/58375 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58648 Reviewed By: gchanan Differential Revision: D28625390 Pulled By: jbschlosser fbshipit-source-id: 23ea2eb7d5b3dc89c6809ff6581b90ee742149f4	2021-05-25 10:36:21 -07:00
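The commit message does not spell out the function itself. Mish is commonly defined as `x * tanh(softplus(x))`, and the sketch below is a scalar, pure-Python rendering of that formula, not the PyTorch kernel; the helper names `softplus` and `mish` are chosen here for illustration.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x: float) -> float:
    """Mish activation: x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))


for value in (-2.0, 0.0, 1.0, 20.0):
    print(value, mish(value))
```

For large positive inputs the function approaches the identity, and it is exactly zero at zero.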
Joel Schlosser	c58709b7bb	Helper function for skipping module parameter / buffer initialization (#57555 ) Summary: This PR introduces a helper function named `torch.nn.utils.skip_init()` that accepts a module class object + `args` / `kwargs` and instantiates the module while skipping initialization of parameter / buffer values. See discussion at https://github.com/pytorch/pytorch/issues/29523 for more context. Example usage: ```python import torch m = torch.nn.utils.skip_init(torch.nn.Linear, 5, 1) print(m.weight) m2 = torch.nn.utils.skip_init(torch.nn.Linear, 5, 1, device='cuda') print(m2.weight) m3 = torch.nn.utils.skip_init(torch.nn.Linear, in_features=5, out_features=1) print(m3.weight) ``` ``` Parameter containing: tensor([[-3.3011e+28, 4.5915e-41, -3.3009e+28, 4.5915e-41, 0.0000e+00]], requires_grad=True) Parameter containing: tensor([[-2.5339e+27, 4.5915e-41, -2.5367e+27, 4.5915e-41, 0.0000e+00]], device='cuda:0', requires_grad=True) Parameter containing: tensor([[1.4013e-45, 0.0000e+00, 0.0000e+00, 0.0000e+00, 0.0000e+00]], requires_grad=True) ``` Bikeshedding on the name / namespace is welcome, as well as comments on the design itself - just wanted to get something out there for discussion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57555 Reviewed By: zou3519 Differential Revision: D28640613 Pulled By: jbschlosser fbshipit-source-id: 5654f2e5af5530425ab7a9e357b6ba0d807e967f	2021-05-24 11:28:32 -07:00
Rohan Varma	071d49a970	Document monitored barrier (#58322 ) Summary: Will not land before the release, but would be good to have this function documented in master for its use in distributed debugability. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58322 Reviewed By: SciPioneer Differential Revision: D28595405 Pulled By: rohan-varma fbshipit-source-id: fb00fa22fbe97a38c396eae98a904d1c4fb636fa	2021-05-21 19:04:57 -07:00
Michael Carilli	e8c6a65074	Adds grid_sampler to autocast fp32 list for 1.9 (#58679 ) Summary: Temporary fix for https://github.com/pytorch/pytorch/issues/42218. Numerically, grid_sampler should be fine in fp32 or fp16. So grid_sampler really belongs on the promote list. But performancewise, native grid_sampler backward kernels use gpuAtomicAdd, which is notoriously slow in fp16. So the simplest functionality fix is to put grid_sampler on the fp32 list. In https://github.com/pytorch/pytorch/pull/58618 I implement the right long-term fix (refactoring kernels to use fp16-friendly fastAtomicAdd and moving grid_sampler to the promote list). But that's more invasive, and for 1.9 ngimel says this simple temporary fix is preferred. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58679 Reviewed By: soulitzer Differential Revision: D28576559 Pulled By: ngimel fbshipit-source-id: d653003f37eaedcbb3eaac8d7fec26c343acbc07	2021-05-20 14:05:09 -07:00
abladawood	1fc3e1e1fb	Abladawood patch 1 (#58496 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58496 Reviewed By: soulitzer Differential Revision: D28562333 Pulled By: ailzhang fbshipit-source-id: aa9fcc03ba7ffe03db6cc5da353d37d679a0a160	2021-05-20 10:32:18 -07:00
Gary Miguel	703cfdc9ed	[JIT] improve documentation (#57991 ) Summary: * Fix lots of links. * Minor improvements for consistency, clarity or grammar. * Update jit_python_reference to note the limitations on __exit__. (Related to https://github.com/pytorch/pytorch/issues/41420). * Fix a comment in exit_transforms.cpp: removed the word "not" which made the comment say the opposite of the truth. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57991 Reviewed By: malfet Differential Revision: D28522247 Pulled By: SplitInfinity fbshipit-source-id: fc63a59d19ea6c89f957c9f7d451be17d1c5fc91	2021-05-19 11:47:32 -07:00
Horace He	79a258f448	s/foward/forward/g (#58497 ) Summary: Annoying typo. Prompted by these profiling results: https://github.com/pytorch/pytorch/issues/56419#issuecomment-825787828 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58497 Reviewed By: malfet Differential Revision: D28521081 Pulled By: Chillee fbshipit-source-id: ab91a2e167dd7d3387fd56106a6cff81f7a32f10	2021-05-19 11:42:42 -07:00
Richard Zou	e059fd40a8	Remove master documentation from being indexable by search engines (#58056 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58056 This PR addresses an action item in #3428: disabling search engine indexing of master documentation. This is desirable because we want to direct users to our stable documentation (instead of master documentation) because they are more likely to have a stable version of PyTorch installed. Test Plan: 1. run `make html`, check that the noindex tags are there 2. run `make html-stable`, check that the noindex tags aren't there Reviewed By: bdhirsh Differential Revision: D28490504 Pulled By: zou3519 fbshipit-source-id: 695c944c4962b2bd484dd7a5e298914a37abe787	2021-05-18 06:20:09 -07:00
Rohan Varma	52bb8120b8	Mention distributed profiling in documentation (#58286 ) Summary: Added a simple section indicating distributed profiling is expected to work similar to other torch operators, and is supported for all communication backends out of the box. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58286 Reviewed By: bdhirsh Differential Revision: D28436489 Pulled By: rohan-varma fbshipit-source-id: ce1905a987c0ede8011e8086a2c30edc777b4a38	2021-05-14 09:43:00 -07:00
Jeffrey Wan	e1bb9d2d99	Reimplement spectral_norm using new parametrization functionality (#57784 ) Summary: Adds a new file, `torch/nn/utils/parametrizations.py`, which should contain all the parametrization implementations. For spectral_norm we add the `SpectralNorm` module, which can be registered using `torch.nn.utils.parametrize.register_parametrization` or using a wrapper: `spectral_norm`, the same API the old implementation provided. Most of the logic is borrowed from the old implementation: - Just like the old implementation, there should be cases when retrieving the weight should perform another power iteration (thus updating the weight) and cases where it shouldn't. For example, in eval mode (`self.training=False`), we do not perform power iteration. There are also some differences/difficulties with the new implementation: - Using the new parametrization functionality as-is, there doesn't seem to be a good way to tell whether a 'forward' call was the result of parametrizations being unregistered (with leave_parametrizations=True) or of the injected property's getter being invoked. The issue is that we want to perform power iteration in the latter case but not the former, but we don't have this control as-is. So, in this PR I modified the parametrization functionality to change the module to eval mode before triggering their forward call - Updates the vectors based on weight on initialization to fix https://github.com/pytorch/pytorch/issues/51800 (this avoids silently updating weights in eval mode). This also means that we perform twice as many power iterations by the first forward. 
- right_inverse is just the identity for now, but maybe it should assert that the passed value already satisfies the constraints - So far, all the old spectral_norm tests have been cloned, but maybe we don't need so much testing now that the core functionality is already well tested Pull Request resolved: https://github.com/pytorch/pytorch/pull/57784 Reviewed By: ejguan Differential Revision: D28413201 Pulled By: soulitzer fbshipit-source-id: e8f1140f7924ca43ae4244c98b152c3c554668f2	2021-05-13 14:16:13 -07:00
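The power iteration mentioned in the commit above can be sketched in pure Python. This is the generic textbook procedure for estimating the largest singular value of a matrix, not the `SpectralNorm` module itself; the function names and the example matrix are assumptions made for illustration.

```python
import math


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def transpose(matrix):
    return [list(col) for col in zip(*matrix)]


def normalize(vector, eps=1e-12):
    norm = math.sqrt(sum(a * a for a in vector))
    return [a / max(norm, eps) for a in vector]


def spectral_norm_estimate(weight, n_iters=50):
    """Estimate the largest singular value of `weight` by power iteration."""
    weight_t = transpose(weight)
    u = normalize([1.0] * len(weight))
    v = normalize(matvec(weight_t, u))
    for _ in range(n_iters):
        v = normalize(matvec(weight_t, u))   # right singular vector estimate
        u = normalize(matvec(weight, v))     # left singular vector estimate
    # sigma is approximately u^T W v once u and v have converged
    return sum(a * b for a, b in zip(u, matvec(weight, v)))


w = [[3.0, 0.0], [0.0, 1.0]]
sigma = spectral_norm_estimate(w)
print(sigma)
```

Each call refines `u` and `v`, which is why the commit distinguishes between retrievals of the weight that should run another iteration (training) and those that should not (eval).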
Ivan Yashchuk	c1430c3425	Add torch.linalg.inv_ex without checking for errors by default (#58039 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58039 The new function has the following signature `inv_ex(Tensor input, *, bool check_errors=False) -> (Tensor inverse, Tensor info)`. When `check_errors=True`, an error is thrown if the matrix is not invertible; `check_errors=False` - responsibility for checking the result is on the user. `linalg_inv` is implemented using calls to `linalg_inv_ex` now. Resolves https://github.com/pytorch/pytorch/issues/25095 Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D28405148 Pulled By: mruberry fbshipit-source-id: b8563a6c59048cb81e206932eb2f6cf489fd8531	2021-05-13 09:42:15 -07:00
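A short usage sketch of the `check_errors` behaviour described above, assuming a PyTorch build that ships `torch.linalg.inv_ex`; the matrices are illustrative.

```python
import torch

good = torch.tensor([[2.0, 0.0], [0.0, 4.0]])
singular = torch.zeros(2, 2)

# Default check_errors=False: no exception, the caller inspects `info`.
inverse, info = torch.linalg.inv_ex(good)
_, bad_info = torch.linalg.inv_ex(singular)
print(info.item(), bad_info.item())  # zero means success, non-zero means failure

# With check_errors=True the failure surfaces as an exception instead.
raised = False
try:
    torch.linalg.inv_ex(singular, check_errors=True)
except RuntimeError:
    raised = True
print(raised)
```

Skipping the check by default lets the call return without first inspecting `info`; the trade-off is that the caller must look at `info` before trusting `inverse`.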
Jeffrey Wan	e71b526e7e	Add inference mode python bindings and tests (#58045 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/56608 - Adds binding to the `c10::InferenceMode` RAII class in `torch._C._autograd.InferenceMode` through pybind. Also binds the `torch.is_inference_mode` function. - Adds context manager `torch.inference_mode` to manage an instance of `c10::InferenceMode` (global). Implemented in `torch.autograd.grad_mode.py` to reuse the `_DecoratorContextManager` class. - Adds some tests based on those linked in the issue + several more for just the context manager Issues/todos (not necessarily for this PR): - Improve short inference mode description - Small example - Improved testing since there is no direct way of checking TLS/dispatch keys - Pull Request resolved: https://github.com/pytorch/pytorch/pull/58045 Reviewed By: agolynski Differential Revision: D28390595 Pulled By: soulitzer fbshipit-source-id: ae98fa036c6a2cf7f56e0fd4c352ff804904752c	2021-05-13 08:55:35 -07:00
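A small example of the `torch.inference_mode` context manager added here. It assumes the later public names `torch.is_inference_mode_enabled` and `Tensor.is_inference()`, which appear in other commits in this log; the tensor is illustrative.

```python
import torch

x = torch.ones(3, requires_grad=True)

with torch.inference_mode():
    y = x * 2
    inside = torch.is_inference_mode_enabled()

outside = torch.is_inference_mode_enabled()

# The mode is active only inside the `with` block.
print(inside, outside)
# Tensors created inside are inference tensors and do not track gradients.
print(y.requires_grad, y.is_inference())
```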
Alexander Golynski	bc30c3165c	Update docs for get_future support (#58107 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58107 Test Plan: Imported from OSS Reviewed By: SciPioneer Differential Revision: D28387374 Pulled By: agolynski fbshipit-source-id: 70052afbb0b07ba341ea55f7ec30f7d9759b7bd4	2021-05-12 18:29:28 -07:00
Can Balioglu	028f2f62ac	[torch/elastic] Update the rendezvous docs (#58160 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58160 This PR updates the Torch Distributed Elastic documentation with references to the new `c10d` backend. ghstack-source-id: 128783809 Test Plan: Visually verified the correct Reviewed By: tierex Differential Revision: D28384996 fbshipit-source-id: a40b0c37989ce67963322565368403e2be5d2592	2021-05-12 16:54:28 -07:00
Michael Suo	01d0eb9dac	[package] Add an intern keyword (#57341 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57341 Require that users be explicit about what they are going to be interning. There are a lot of changes that are enabled by this. The new overall scheme is: PackageExporter maintains a dependency graph. Users can add to it, either explicitly (by issuing a `save_` call) or implicitly (through dependency resolution). Users can also specify what action to take when PackageExporter encounters a module (deny, intern, mock, extern). Nothing (except pickles, tho that can be changed with a small amount of work) is written to the zip archive until we are finalizing the package. At that point, we consult the dependency graph and write out the package exactly as it tells us to. This accomplishes two things: 1. We can gather up all* packaging errors instead of showing them one at a time. 2. We require that users be explicit about what's going in packages, which is a common request. Differential Revision: D28114185 Test Plan: Imported from OSS Reviewed By: SplitInfinity Pulled By: suo fbshipit-source-id: fa1abf1c26be42b14c7e7cf3403ecf336ad4fc12	2021-05-12 16:22:43 -07:00
Yi Wang	581bf01074	[Gradient Compression] Remove unnecessary warning on the rst file and the check on C++ version (#58170 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58170 Now comm hook can be supported on MPI and GLOO backends besides NCCL. No longer need these warnings and check. ghstack-source-id: 128799123 Test Plan: N/A Reviewed By: agolynski Differential Revision: D28388861 fbshipit-source-id: f56a7b9f42bfae1e904f58cdeccf7ceefcbb0850	2021-05-12 14:15:10 -07:00
albanD	cbd1227809	Add a note in the parametrize doc about the naming choice (#58142 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58142 Reviewed By: agolynski Differential Revision: D28386655 Pulled By: albanD fbshipit-source-id: c2793ac377ef7082c1840e1a50604da3ff9c61ac	2021-05-12 13:15:56 -07:00
Jithun Nair	ab6b5fa036	Add HIP (ROCm) semantics doc (#57871 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57871 Reviewed By: agolynski Differential Revision: D28385510 Pulled By: malfet fbshipit-source-id: 9cf69e52d026a1cf74cc12d8727ca17ae026235e	2021-05-12 12:34:07 -07:00
$PCTURBOX\anton$ PCTURBOX\anton	5ea87f9c24	Grammatically updated the tech docs (complex_numbers.rst) (#57540 ) Summary: Small grammatical change in complex_numbers.rst . -You can see the changes in the screenshot below - ![Capture](https://user-images.githubusercontent.com/38073192/117013956-01aed000-acf9-11eb-9d17-1e369de68585.PNG) Pull Request resolved: https://github.com/pytorch/pytorch/pull/57540 Reviewed By: albanD Differential Revision: D28233650 Pulled By: mrshenli fbshipit-source-id: 0cec7bb1f4bd61e929e2a8fc5292bc20b77aee35	2021-05-12 09:05:18 -07:00
Luca Wehrstedt	d623fb7e04	Add a disclaimer about limited CUDA support in RPC (#58023 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58023 Clearly state that some features of RPC aren't yet compatible with CUDA. ghstack-source-id: 128688856 Test Plan: None Reviewed By: agolynski Differential Revision: D28347605 fbshipit-source-id: e8df9a4696c61a1a05f7d2147be84d41aeeb3b48	2021-05-12 00:11:22 -07:00
Ilqar Ramazanli	8b816e9010	To implement gradient for Pytorch (#54617 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/56129 Pull Request resolved: https://github.com/pytorch/pytorch/pull/54617 Reviewed By: anjali411 Differential Revision: D28057452 Pulled By: iramazanli fbshipit-source-id: 9bd86679282d34f5e5393e6447121586517eb4f0	2021-05-11 18:52:20 -07:00
Kimish Patel	b7d674eb21	Revert D28331386: [pytorch][PR] [torch/elastic] Update the rendezvous docs Test Plan: revert-hammer Differential Revision: D28331386 (`e4418b67c7`) Original commit changeset: 95dd32146222 fbshipit-source-id: 5522d4a09bc06ac42943eec9aa8bf5292cc778b2	2021-05-11 18:10:46 -07:00
Ivan Yashchuk	a90c229900	Remove the BETA status for torch.linalg (#58043 ) Summary: We are ready to move to the new stage for our `torch.linalg` module, which is stable (or STABLE?). Ref. https://github.com/pytorch/pytorch/issues/42666 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58043 Reviewed By: ngimel Differential Revision: D28356172 Pulled By: mruberry fbshipit-source-id: e2c1effa79b9635b2ef0a820a03a0685105042bd	2021-05-11 16:11:48 -07:00
Gary Miguel	f9c8b7f1a8	[FX][docs] minor fixes (#58085 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58085 Reviewed By: mruberry Differential Revision: D28364553 Pulled By: jamesr66a fbshipit-source-id: 0d953672de9a86ecf5b1900b22e6ddef850dbe8f	2021-05-11 15:35:49 -07:00
Can Balioglu	e4418b67c7	[torch/elastic] Update the rendezvous docs (#57973 ) Summary: This PR updates the rendezvous documentation for the Torch Distributed Elastic section of PyTorch docs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57973 Reviewed By: kiukchung Differential Revision: D28331386 Pulled By: cbalioglu fbshipit-source-id: 95dd32146222aaeff246bd3c3d2caf0036a9011b	2021-05-11 15:32:50 -07:00
Luca Wehrstedt	3e46d6c9e4	Update docs to mention CUDA support for Future (#50048 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50048 To reflect the many changes introduced recently. In my mind, CUDAFuture should be considered a "private" subclass, which in practice should always be returned as a downcast pointer to an ivalue::Future. Hence, we should document the CUDA behavior in the superclass, even if it's CUDA-agnostic, since that's the interface the users will see also for CUDA-enabled futures. ghstack-source-id: 128640983 Test Plan: Built locally and looked at them. Reviewed By: mrshenli Differential Revision: D25757474 fbshipit-source-id: c6f66ba88fa6c4fc33601f31136422d6cf147203	2021-05-11 08:26:33 -07:00
Yi Wang	38500d5d7b	[RPC Framework] Move the annotation w/ bold effect out of the quotes (#57965 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57965 The bold effect does not work under quotes, so move it out. ghstack-source-id: 128570357 Test Plan: locally view {F614715259} Reviewed By: rohan-varma Differential Revision: D28329694 fbshipit-source-id: 299b427f4c0701ba70c84148f65203a6e2d6ac61	2021-05-10 16:51:23 -07:00
nikithamalgi	bf053a1296	Fix hasattr support type (#57950 ) Summary: `hasattr` is partially supported. This PR fixes that in the builtin table. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57950 Reviewed By: pbelevich Differential Revision: D28329005 Pulled By: nikithamalgifb fbshipit-source-id: c4cfba9badcc8f7cbc8250a5c21dfb62b35a83fc	2021-05-10 12:21:56 -07:00
Heitor Schueroff	4cf2c646c2	Added torch.linalg.matrix_norm (#57127 ) Summary: This PR is focused on the API for `linalg.matrix_norm` and delegates computations to `linalg.norm` for the moment. The main difference between the norms is when `dim=None`. In this case - `linalg.norm` will compute a vector norm on the flattened input if `ord=None`, otherwise it requires the input to be either 1D or 2D in order to disambiguate between vector and matrix norm - `linalg.vector_norm` will flatten the input - `linalg.matrix_norm` will compute the norm over the last two dimensions, treating the input as batch of matrices In future PRs, the computations will be moved to `torch.linalg.matrix_norm` and `torch.norm` and `torch.linalg.norm` will delegate computations to either `linalg.vector_norm` or `linalg.matrix_norm` based on the arguments provided. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57127 Reviewed By: mrshenli Differential Revision: D28186736 Pulled By: mruberry fbshipit-source-id: 99ce2da9d1c4df3d9dd82c0a312c9570da5caf25	2021-05-09 04:50:33 -07:00
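The `dim=None` difference between the three norm functions described above can be checked on a small batch. A minimal sketch, assuming a PyTorch build that includes `linalg.vector_norm` and `linalg.matrix_norm`; the all-ones input is chosen so the expected values are easy to verify by hand.

```python
import math

import torch

x = torch.ones(2, 3, 4)

v = torch.linalg.vector_norm(x)   # flattens the input: sqrt(24)
m = torch.linalg.matrix_norm(x)   # norm of each 3x4 matrix: shape (2,), each sqrt(12)
n = torch.linalg.norm(x)          # ord=None: vector norm on the flattened input

print(tuple(v.shape), tuple(m.shape), tuple(n.shape))
print(v.item(), m.tolist(), n.item())
```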
Yi Wang	94080f45ab	[RPC Framework] Update rpc.rst (#57876 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57876 ghstack-source-id: 128484049 Test Plan: N/A Reviewed By: pritamdamania87 Differential Revision: D28305719 fbshipit-source-id: cc0d79fb46077a0d1cf6026c373893e7d3b7761e	2021-05-07 19:42:29 -07:00
Holly Sweeney	626ae7f036	Copy edit of TorchScript Language Reference (#57694 ) Summary: Initial copy edit of the file. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57694 Reviewed By: malfet, ngimel Differential Revision: D28289209 Pulled By: holly1238 fbshipit-source-id: 7035d6790767a2f758e6019ae63df16537ef2725	2021-05-07 12:17:32 -07:00
Philip Meier	0dd0151c64	add `torch.testing` to docs (#57247 ) Summary: Redo of https://github.com/pytorch/pytorch/issues/56373 out of stack. --- To reviewers: please be nitpicky. I've read this so often that I probably missed some typos and inconsistencies. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57247 Reviewed By: albanD Differential Revision: D28247402 Pulled By: mruberry fbshipit-source-id: 71142678ee5c82cc8c0ecc1dad6a0b2b9236d3e6	2021-05-07 09:16:39 -07:00
Nicolas Hug	1fc89d9ffc	Use proper Google Analytics id (#56578 ) Summary: This PR fixes the GA id and relies on `pytorch-sphinx-theme` to set the GA script instead of hard-coding it (this is supported since https://github.com/pytorch/pytorch_sphinx_theme/pull/110 was merged). Similar PRs were opened and merged in torchvision/audio/text, e.g.: https://github.com/pytorch/vision/pull/3700 CC brianjo Pull Request resolved: https://github.com/pytorch/pytorch/pull/56578 Reviewed By: mrshenli Differential Revision: D28199244 Pulled By: ranman fbshipit-source-id: a20b7fd1b1da3ebff491286c3eeb1410f3c80670	2021-05-04 13:23:16 -07:00
Kiuk Chung	a80b215a9a	[1/n][torch/elastic] Move torchelastic docs .rst (#148 ) Summary: Pull Request resolved: https://github.com/pytorch/elastic/pull/148 Pull Request resolved: https://github.com/pytorch/pytorch/pull/56811 Moves docs sphinx `.rst` files from the torchelastic repository to torch. Note: only moves the rst files the next step is to link it to the main pytorch `index.rst` and write new `examples.rst` Reviewed By: H-Huang Differential Revision: D27974751 fbshipit-source-id: 8ff9f242aa32e0326c37da3916ea0633aa068fc5	2021-05-04 00:57:56 -07:00
Ilqar Ramazanli	15975cf6a6	To add priority of int/int? over int[] on signature matching and adding {h,v,d}split methods (#57346 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54555 It has been discussed in the issue https://github.com/pytorch/pytorch/issues/54555 that {h,v,d}split methods unexpectedly matches argument of single int[] when it is expected to match single argument of int. The same unexpected behavior can happen in other functions/methods which can take both int[] and int? as single argument signatures. In this PR we solve this problem by giving higher priority to int/int? arguments over int[] while sorting signatures. We also add methods of {h,v,d}split methods here, which helped us to discover this unexpected behavior. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57346 Reviewed By: ezyang Differential Revision: D28121234 Pulled By: iramazanli fbshipit-source-id: 851cf40b370707be89298177b51ceb4527f4b2d6	2021-05-03 18:52:41 -07:00
Ivan Yashchuk	75a2a92b02	Add torch.linalg.cholesky_ex without checking for errors by default (#56724 ) Summary: The new function has the following signature `cholesky_ex(Tensor input, *, bool check_errors=False) -> (Tensor L, Tensor infos)`. When `check_errors=True`, an error is thrown if the decomposition fails; `check_errors=False` - responsibility for checking the decomposition is on the user. When `check_errors=False`, we don't have host-device memory transfers for checking the values of the `info` tensor. Rewrote the internal code for `torch.linalg.cholesky`. Added `cholesky_stub` dispatch. `linalg_cholesky` is implemented using calls to `linalg_cholesky_ex` now. Resolves https://github.com/pytorch/pytorch/issues/57032. Ref. https://github.com/pytorch/pytorch/issues/34272, https://github.com/pytorch/pytorch/issues/47608, https://github.com/pytorch/pytorch/issues/47953 Pull Request resolved: https://github.com/pytorch/pytorch/pull/56724 Reviewed By: ngimel Differential Revision: D27960176 Pulled By: mruberry fbshipit-source-id: f05f3d5d9b4aa444e41c4eec48ad9a9b6fd5dfa5	2021-05-01 18:48:27 -07:00
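The `check_errors` contract described in the commit above (return an `info` value instead of raising, unless the caller opts in) can be sketched in plain Python. This is an illustration of the pattern only, not the PyTorch kernel: `cholesky_ex_sketch` is a hypothetical name, the input is a single symmetric matrix given as a list of rows, and the meaning of a non-zero `info` (the order of the first leading minor that is not positive-definite) is assumed from the LAPACK convention.

```python
import math


def cholesky_ex_sketch(a, check_errors=False):
    """Return (L, info) for a symmetric matrix `a` given as a list of rows.

    info == 0 means the factorization succeeded; info == k > 0 means the
    leading minor of order k is not positive-definite (assumed convention).
    """
    n = len(a)
    L = [[0.0] * n for _ in range(n)]
    info = 0
    for j in range(n):
        # Diagonal entry: remaining pivot after subtracting earlier columns.
        d = a[j][j] - sum(L[j][k] ** 2 for k in range(j))
        if d <= 0.0:
            info = j + 1  # record the failure instead of raising
            break
        L[j][j] = math.sqrt(d)
        for i in range(j + 1, n):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = (a[i][j] - s) / L[j][j]
    if check_errors and info != 0:
        raise ValueError(f"leading minor of order {info} is not positive-definite")
    return L, info


# Positive-definite input: info is 0 and L is the lower-triangular factor.
L_ok, info_ok = cholesky_ex_sketch([[4.0, 2.0], [2.0, 3.0]])

# Indefinite input: no exception by default, the caller inspects info.
L_bad, info_bad = cholesky_ex_sketch([[1.0, 2.0], [2.0, 1.0]])
```

With `check_errors=False` the caller is responsible for inspecting `info`; per the commit message, the benefit in the real operator is that no host-device transfer is needed just to check the `info` tensor.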
kshitij12345	d4ddb47719	[special] Add `xlog1py` (#55138 ) Summary: Reference : https://github.com/pytorch/pytorch/issues/50345 * [x] Check Rendered Document (https://12494173-65600975-gh.circle-artifacts.com/0/docs/special.html#torch.special.xlog1py) * [x] Tests in Binary Ufunc * [x] OpInfo * [x] Structured Kernel Pull Request resolved: https://github.com/pytorch/pytorch/pull/55138 Reviewed By: ngimel Differential Revision: D27961461 Pulled By: mruberry fbshipit-source-id: 30a8f41970a829bf50254aadf5615e8ce4148c7e	2021-04-30 05:51:13 -07:00
Yanan Cao	2aadeac0ff	Remove duplicate entry for filter in language ref v2 (#57154 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57154 Reviewed By: zou3519 Differential Revision: D28061690 Pulled By: gmagogsfm fbshipit-source-id: b895238c0425cc6b60f5e19c67fc5bc6e0115d7f	2021-04-29 04:52:50 -07:00
Lillian Johnson	31e59c3869	torch.package change `Folder` to `Directory` and add doc strings (#56925 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56925 Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D28002145 Pulled By: Lilyjjo fbshipit-source-id: 6265970202d1530c4fb7ea10011b0e09094037d5	2021-04-28 13:03:12 -07:00
Nikitha Malgi	ce79bd255d	Fix doc issues (#57153 ) Summary: Fixes inconsistencies in the TorchScript Language reference. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57153 Reviewed By: zou3519, gmagogsfm Differential Revision: D28061449 Pulled By: nikithamalgifb fbshipit-source-id: a055c7b1417391afe00ec0b35e1042acb049feed	2021-04-28 11:47:10 -07:00
albanD	d16ed1ee8a	Add first draft of gradcheck note (#55966 ) Summary: You can find the latest rendered version in the `python_doc_build` CI job below, in the artifact tab of that build on circle CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/55966 Reviewed By: H-Huang Differential Revision: D28032446 Pulled By: albanD fbshipit-source-id: 227ad37b03d39894d736c19cae3195b4d56fc62f	2021-04-27 14:33:42 -07:00
Akifumi Imanishi	9da0f2e95e	Support `__pos__` and `positive` (#55891 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/55604. This PR implements `torch.Tensor.__pos__` and `torch.positive` for the compatibility with NumPy’s interface. (cc: mruberry, rgommers, emcastillo and kmaehashi) Pull Request resolved: https://github.com/pytorch/pytorch/pull/55891 Reviewed By: H-Huang Differential Revision: D28025928 Pulled By: mruberry fbshipit-source-id: e43e329a802f31bf8805f6efab5c2c7ef34c88b9	2021-04-27 13:23:59 -07:00
lezcano	d578e8cfa2	Improved docs for `torch.linalg` (#56265 ) Summary: This PR tries to make the docs of `torch.linalg` have/be: - More uniform notation and structure for every function. - More uniform use of back-quotes and the `:attr:` directive - More readable for a non-specialised audience through explanations of the form that factorisations take and when would it be beneficial to use what arguments in some solvers. - More connected among the different functions through the use of the `.. seealso::` directive. - More information on when do gradients explode / when is a function silently returning a wrong result / when things do not work in general I tried to follow the structure of "one short description and then the rest" to be able to format the docs like those of `torch.` or `torch.nn`. I did not do that yet, as I am waiting for the green light on this idea: https://github.com/pytorch/pytorch/issues/54878#issuecomment-816636171 What this PR does not do: - Clean the documentation of other functions that are not in the `linalg` module (although I started doing this for `torch.svd`, but then I realised that this PR would touch way too many functions). Fixes https://github.com/pytorch/pytorch/issues/54878 cc mruberry IvanYashchuk Pull Request resolved: https://github.com/pytorch/pytorch/pull/56265 Reviewed By: H-Huang Differential Revision: D27993986 Pulled By: mruberry fbshipit-source-id: adde7b7383387e1213cc0a6644331f0632b7392d	2021-04-27 11:16:09 -07:00
Yukio Siraichi	9d54475032	Hide module paths leaking in the documentation. (#54585 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54354 Pull Request resolved: https://github.com/pytorch/pytorch/pull/54585 Reviewed By: H-Huang Differential Revision: D28027037 Pulled By: mruberry fbshipit-source-id: 219874e143221f5e8349d007f88464e0be1a6243	2021-04-27 10:58:01 -07:00
iramazanli	3e006fc57e	Adding hsplit,vsplit and dsplit methods (#53536 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/53536 Reviewed By: albanD Differential Revision: D27938880 Pulled By: iramazanli fbshipit-source-id: f741119517783ec2bafa296622ee518b587dd127	2021-04-26 09:39:09 -07:00
IceTDrinker	689d3a70aa	Fix broken link to fx graph quant guide in quantization.rst (#56776 ) Summary: No outstanding issue, can create it if needed. Was looking for that resource and it was moved without fixing the documentation. Cheers Pull Request resolved: https://github.com/pytorch/pytorch/pull/56776 Reviewed By: heitorschueroff Differential Revision: D27967020 Pulled By: ezyang fbshipit-source-id: a5cd7d554da43a9c9e44966ccd0b0ad9eef2948c	2021-04-26 08:22:28 -07:00
Ilqar Ramazanli	70d9be0f42	Replace duplicative s with alpha (#56804 ) Summary: It is always easier to read a document when different objects / concepts denoted with different variables / representations. In this PR we make sure the [complex autograd](https://pytorch.org/docs/master/notes/autograd.html#autograd-for-complex-numbers) documentation, the variable of output and step size diverge. Fixes https://github.com/pytorch/pytorch/issues/53633 Pull Request resolved: https://github.com/pytorch/pytorch/pull/56804 Reviewed By: anjali411 Differential Revision: D27989959 Pulled By: iramazanli fbshipit-source-id: c271590ee744c8aeeff62bfaa2295429765ef64e	2021-04-25 16:27:09 -07:00
Ilqar Ramazanli	d1fe68e70b	To add single and chained learning schedulers to docs (#56705 ) Summary: In the optimizer documentation, many of the learning rate schedulers [examples](https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate) are provided according to a generic template. In this PR we provide a precise simple use case example to show how to use learning rate schedulers. Moreover, in a followup example we show an example how to chain two schedulers next to each other. Pull Request resolved: https://github.com/pytorch/pytorch/pull/56705 Reviewed By: ezyang Differential Revision: D27966704 Pulled By: iramazanli fbshipit-source-id: f32b2d70d5cad7132335a9b13a2afa3ac3315a13	2021-04-23 09:36:00 -07:00
Stas Bekman	1dbbbbe904	[doc] FX Graph Mode Quantization - fix preamble (#52192 ) Summary: The pre-amble here is misformatted at least and is hard to make sense of: https://pytorch.org/docs/master/quantization.html#prototype-fx-graph-mode-quantization This PR is trying to make things easier to understand. As I'm new to this please verify that my modifications remain in line with what may have been meant originally. Thanks. Pull Request resolved: https://github.com/pytorch/pytorch/pull/52192 Reviewed By: ailzhang Differential Revision: D27941730 Pulled By: vkuzo fbshipit-source-id: 6c4bbf7c87d8fb87ab5d588b690a72045752e47a	2021-04-22 10:20:31 -07:00
Erjia Guan	8cf85a1152	[DataLoader][doc] Randomness for base_seed generator and NumPy seed (#56528 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56528 Tried to search across internal and external usage of DataLoader. People haven't started to use `generator` for `DataLoader`. Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D27908487 Pulled By: ejguan fbshipit-source-id: 14c83ed40d4ba4dc988b121968a78c2732d8eb93	2021-04-22 09:40:45 -07:00
M.L. Croci	1f0223d6bb	Fix bug in gaussian_nll_loss (#56469 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/53964. cc albanD almson ## Major changes: - Overhauled the actual loss calculation so that the shapes are now correct (in functional.py) - added the missing doc in nn.functional.rst ## Minor changes (in functional.py): - I removed the previous check on whether input and target were the same shape. This is to allow for broadcasting, say when you have 10 predictions that all have the same target. - I added some comments to explain each shape check in detail. Let me know if these should be shortened/cut. Screenshots of updated docs attached. Let me know what you think, thanks! ## Edit: Description of change of behaviour (affecting BC): The backwards-compatibility is only affected for the `reduction='none'` mode. This was the source of the bug. For tensors with size (N, D), the old returned loss had size (N), as incorrect summation was happening. It will now have size (N, D) as expected. ### Example Define input tensors, all with size (2, 3). `input = torch.tensor([[0., 1., 3.], [2., 4., 0.]], requires_grad=True)` `target = torch.tensor([[1., 4., 2.], [-1., 2., 3.]])` `var = 2*torch.ones(size=(2, 3), requires_grad=True)` Initialise loss with reduction mode 'none'. We expect the returned loss to have the same size as the input tensors, (2, 3). `loss = torch.nn.GaussianNLLLoss(reduction='none')` Old behaviour: `print(loss(input, target, var)) ` `# Gives tensor([3.7897, 6.5397], grad_fn=<MulBackward0>. This has size (2).` New behaviour: `print(loss(input, target, var)) ` `# Gives tensor([[0.5966, 2.5966, 0.5966], [2.5966, 1.3466, 2.5966]], grad_fn=<MulBackward0>)` `# This has the expected size, (2, 3).` To recover the old behaviour, sum along all dimensions except for the 0th: `print(loss(input, target, var).sum(dim=1))` `# Gives tensor([3.7897, 6.5397], grad_fn=<SumBackward1>.` ![doc1](https://user-images.githubusercontent.com/26558092/115391089-f7f47b00-a1d6-11eb-8726-e4da9057aee0.png) ![doc2](https://user-images.githubusercontent.com/26558092/115391094-f925a800-a1d6-11eb-954b-afd187f42bc7.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/56469 Reviewed By: jbschlosser, agolynski Differential Revision: D27894170 Pulled By: albanD fbshipit-source-id: 197890189c97c22109491c47f469336b5b03a23f	2021-04-22 07:43:48 -07:00
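The numbers quoted in the `gaussian_nll_loss` commit above can be checked by hand. The sketch below is a dependency-free illustration, not the code in `functional.py`: the helper name `gaussian_nll_elementwise` is hypothetical, and it assumes the per-element formula `0.5 * (log(var) + (input - target)**2 / var)` with the variance clamped from below by a small `eps` and no constant term. Under that assumption it reproduces both the new `(2, 3)` output and the old per-row sums.

```python
import math


def gaussian_nll_elementwise(inp, target, var, eps=1e-6):
    """Per-element Gaussian negative log-likelihood (no reduction, no constant term)."""
    out = []
    for i_row, t_row, v_row in zip(inp, target, var):
        row = []
        for x, t, v in zip(i_row, t_row, v_row):
            v = max(v, eps)  # clamp the variance for numerical stability
            row.append(0.5 * (math.log(v) + (x - t) ** 2 / v))
        out.append(row)
    return out


# Same values as in the commit message, as nested lists of size (2, 3).
inp = [[0.0, 1.0, 3.0], [2.0, 4.0, 0.0]]
target = [[1.0, 4.0, 2.0], [-1.0, 2.0, 3.0]]
var = [[2.0] * 3, [2.0] * 3]

loss = gaussian_nll_elementwise(inp, target, var)  # size (2, 3): the new behaviour
row_sums = [sum(row) for row in loss]              # size (2): the old behaviour
```

Each entry of `loss` matches the new-behaviour tensor to four decimal places, and `row_sums` matches the old-behaviour values, which is consistent with the commit's statement that the old result was an unintended sum over the last dimension.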
Meghan Lele	eac082891f	[package] Massage exporter docstrings (#56547 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56547 Summary This commit tweaks the docstrings of `PackageExporter` so that they look nicer on the docs website. Test Plan Continuous integration. Test Plan: Imported from OSS Reviewed By: ailzhang Differential Revision: D27912965 Pulled By: SplitInfinity fbshipit-source-id: 38c0a715365b8cfb9eecdd1b38ba525fa226a453	2021-04-21 14:06:54 -07:00


1739 Commits