pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jason Ansel	d8e0c26e64	[dynamo] Support warnings.catch_warnings (#123511 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123511 Approved by: https://github.com/anijain2305	2024-04-08 22:27:46 +00:00
Jason Ansel	212e460dce	[dynamo] Support custom __setattr__ on UserDefinedObjectVariable (#123318 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123318 Approved by: https://github.com/anijain2305	2024-04-07 21:06:52 +00:00
Animesh Jain	5d0ac887b9	[dynamo][higher order ops] Make the subgraph sourceless (#123071 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123071 Approved by: https://github.com/jansel, https://github.com/zou3519 ghstack dependencies: #123046, #123058, #123059	2024-04-01 21:09:41 +00:00
Jason Ansel	781e8d2201	[dynamo] Support __next__ on UserDefinedObjectVariable (#122565 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122565 Approved by: https://github.com/yanboliang	2024-03-31 19:00:03 +00:00
Edward Z. Yang	deeeaded1f	Add metas for randint/rand factory functions out overload (#122375 ) Fixes https://github.com/pytorch/pytorch/issues/121897 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/122375 Approved by: https://github.com/lezcano	2024-03-25 04:01:38 +00:00
Jason Ansel	3e4a4bea12	[dynamo] Graph break on SymNode control flow (#122546 ) Fixes #111918 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122546 Approved by: https://github.com/ezyang	2024-03-24 07:22:02 +00:00
Jason Ansel	5f7e71c411	[dynamo] Add HASATTR guard for UserDefinedObject attrs (#122555 ) Fixes #111522 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122555 Approved by: https://github.com/Skylion007	2024-03-24 03:41:58 +00:00
Edward Z. Yang	c2651a7f0e	Make check_is_size clamp to sys.maxsize - 1, so sys.maxsize comparison returns False (#122372 ) Partially fixes https://github.com/pytorch/pytorch/issues/113002 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/122372 Approved by: https://github.com/lezcano ghstack dependencies: #122370	2024-03-21 17:14:42 +00:00
albanD	53d5276d69	Improve Dynamo support for torch function and class methods in general (#121365 ) I was originally trying to solve https://github.com/pytorch/pytorch/issues/120799 but got sidetracked along the way. This PR contains a couple fixes. Let me know if you want me to split them up! - Properly handle invalid user code when "super()" is called from non-method/classmethod. It will now properly raise the same error as CPython - Fix base VariableTracker `__str__` method shadowing all `__repr__` methods defined in subclasses - Fix accessing a classmethod on a user object to bind "cls" and not "self" - Fix custom class handling of super() call to properly handle mixed regular/class/static methods Locally , test_repros.py -k test_batch_norm_act still fails where the generated graph module is: ``` Call using an FX-traced Module, line 8 of the traced Module's generated forward function: x = self.forward(l_x_); self = l_x_ = None x_1 = self.L__self___act(x); x = None ``` note that "self" is being unset on the first line even though it is used on the second one. For reference, this is the test `c268ce4a6d/test/dynamo/test_repros.py (L1368-L1369)` I cannot figure out where the generated forward function comes from though, any hint would be welcome! Pull Request resolved: https://github.com/pytorch/pytorch/pull/121365 Approved by: https://github.com/jansel	2024-03-08 20:03:49 +00:00
Tugsbayasgalan Manlaibaatar	f01a23d01b	Don't aggressively rewrite asserts for symbolic expressions (#120564 ) Fixes: https://github.com/pytorch/pytorch/issues/118417 Pull Request resolved: https://github.com/pytorch/pytorch/pull/120564 Approved by: https://github.com/ezyang	2024-03-01 17:46:36 +00:00
Edward Z. Yang	2a08a51738	Add _assert_scalar and teach Inductor to codegen it (#114148 ) Inductor codegen for `_assert_async` is currently disabled because we don't really understand how to codegen `scalar_to_tensor` on a Sympy expression. I initially tried to see if I could get this to work, but I got into some weird problem involving stride sorting, so I decided to fix it properly by not going through a tensor. So we introduce an `_assert_scalar` which takes a scalar as an argument, avoiding needing to turn a SymBool into a tensor before asserting on it. I also add `_functional_assert_scalar` for good luck, although this doesn't do anything right now because https://github.com/pytorch/pytorch/pull/104203 still hasn't been landed. I need to customize the codegen for this operator, so I decide to directly implement it in Inductor, rather than trying to treat it as a generic ExternKernel. This leads to the new AssertScalar IR node. This is written carefully so that it doesn't get DCE'd by Inductor. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/114148 Approved by: https://github.com/jansel ghstack dependencies: #120800	2024-03-01 05:06:36 +00:00
Animesh Jain	e7039e3a0b	[dynamo][easy] Dynamo test changes (#120927 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120927 Approved by: https://github.com/yanboliang ghstack dependencies: #120864, #120730	2024-02-29 22:05:41 +00:00
Brian Hirsh	cccacf6c8e	add a test that non_overlapping checks dont generate too many guards (#120106 ) Pre-emptive test in OSS to ensure that models relying on the "non-overlapping guards" checks do not suffer drastically w.r.t. guard slowness. Current plan is to follow up on this with a "real" fix, to generate a linear number of these guards. Pull Request resolved: https://github.com/pytorch/pytorch/pull/120106 Approved by: https://github.com/mlazos	2024-02-20 18:38:59 +00:00
Brian Hirsh	6819452a08	fix multiple-fake-modes bug with compile + subclasses (#118191 ) This should fix the "multiple fake modes" errors we've been seeing with both float8 tensor and DTensor. Haven't added a test yet - will add one before landing. I also have a separate PR that would have made the error significantly nicer (the bad error resulted from us returning a FakeTensor at runtime): https://github.com/pytorch/pytorch/pull/118644 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118191 Approved by: https://github.com/drisspg ghstack dependencies: #117667, #117666, #118209	2024-02-20 15:23:41 +00:00
Animesh Jain	80379ef0aa	[dynamo-must-fix] Use ID_MATCH for UserDefinedClass (#119853 ) Fixes https://github.com/pytorch/pytorch/issues/119715 Pull Request resolved: https://github.com/pytorch/pytorch/pull/119853 Approved by: https://github.com/jansel	2024-02-14 03:14:42 +00:00
Sergii Dymchenko	bd9db6a9c7	Update to TorchFix 0.4.0 (#119424 ) `torch.library.Library` updated to `torch.library._scoped_library` in files with many tests where it seems obvious to do, otherwise `noqa: TOR901` added - see https://github.com/pytorch/pytorch/pull/118318 for more context. Pull Request resolved: https://github.com/pytorch/pytorch/pull/119424 Approved by: https://github.com/zou3519	2024-02-12 23:30:12 +00:00
Brian Hirsh	02b60e76c9	make flash_attn_bw impl correct w.r.t. meta when k and v have different strides (#119500 ) `dv = at::empty_like(k)` and `dv = at::empty_like(v)` can be materially different, because `empty_like` tries to preserve the strides of the input when possible. So if `k` is contiguous, but `v`, is transposed, then before this PR, `dv` would be computed to be contiguous. Alternatively, we could change the meta implementation of `aten._scaled_dot_product_flash_attention` to this: ``` grad_q = torch.empty_like(query.transpose(1, 2)).transpose(1, 2) grad_k = torch.empty_like(key.transpose(1, 2)).transpose(1, 2) grad_v = torch.empty_like(key.transpose(1, 2)).transpose(1, 2) return grad_q, grad_k, grad_v ``` But (I think?) the logic in the sdpa backward impl was a typo. I noticed this because changing the meta formula as above was enough to fix the issue with the `aot_eager` backend in this [link](https://github.com/pytorch/pytorch/issues/116935#issuecomment-1914310523). A minimal repro that I made looks like this: ``` import torch # in this repro, "grad_out" and "value" are transposed tensors, # but "key" and "value" are contiguous a = torch.randn(2, 513, 16, 64, dtype=torch.float16, device='cuda').transpose(1, 2) b = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') c = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') d = torch.randn(2, 513, 16, 64, dtype=torch.float16, device='cuda').transpose(1, 2) e = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') f = torch.randn(2, 16, 513, device='cuda') g = None h = None i = 513 j = 513 k = 0.0 l = False m = torch.tensor(1, dtype=torch.int64) n = torch.tensor(1, dtype=torch.int64) out1_ref, out2_ref, out3_ref = torch.ops.aten._scaled_dot_product_flash_attention_backward(a, b, c, d, e, f, g, h, i, j, k, l, m, n, scale=0.125) from torch._meta_registrations import meta__scaled_dot_product_flash_backward out1_test, out2_test, out3_test = meta__scaled_dot_product_flash_backward(a, b, c, d, e, f, g, h, i, j, k, l, m, n, scale=0.125) # prints True True print(out1_ref.is_contiguous()) print(out1_test.is_contiguous()) # prints True True print(out2_ref.is_contiguous()) print(out2_test.is_contiguous()) # prints True False print(out3_ref.is_contiguous()) print(out3_test.is_contiguous()) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/119500 Approved by: https://github.com/drisspg, https://github.com/ezyang, https://github.com/Skylion007	2024-02-12 22:12:29 +00:00
PyTorch MergeBot	34db6f1b13	Revert "make flash_attn_bw impl correct w.r.t. meta when k and v have different strides (#119500 )" This reverts commit `095f471307`. Reverted https://github.com/pytorch/pytorch/pull/119500 on behalf of https://github.com/DanilBaibak due to Broken trunk ([comment](https://github.com/pytorch/pytorch/pull/119500#issuecomment-1937003082))	2024-02-10 13:06:30 +00:00
Brian Hirsh	095f471307	make flash_attn_bw impl correct w.r.t. meta when k and v have different strides (#119500 ) `dv = at::empty_like(k)` and `dv = at::empty_like(v)` can be materially different, because `empty_like` tries to preserve the strides of the input when possible. So if `k` is contiguous, but `v`, is transposed, then before this PR, `dv` would be computed to be contiguous. Alternatively, we could change the meta implementation of `aten._scaled_dot_product_flash_attention` to this: ``` grad_q = torch.empty_like(query.transpose(1, 2)).transpose(1, 2) grad_k = torch.empty_like(key.transpose(1, 2)).transpose(1, 2) grad_v = torch.empty_like(key.transpose(1, 2)).transpose(1, 2) return grad_q, grad_k, grad_v ``` But (I think?) the logic in the sdpa backward impl was a typo. I noticed this because changing the meta formula as above was enough to fix the issue with the `aot_eager` backend in this [link](https://github.com/pytorch/pytorch/issues/116935#issuecomment-1914310523). A minimal repro that I made looks like this: ``` import torch # in this repro, "grad_out" and "value" are transposed tensors, # but "key" and "value" are contiguous a = torch.randn(2, 513, 16, 64, dtype=torch.float16, device='cuda').transpose(1, 2) b = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') c = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') d = torch.randn(2, 513, 16, 64, dtype=torch.float16, device='cuda').transpose(1, 2) e = torch.randn(2, 16, 513, 64, dtype=torch.float16, device='cuda') f = torch.randn(2, 16, 513, device='cuda') g = None h = None i = 513 j = 513 k = 0.0 l = False m = torch.tensor(1, dtype=torch.int64) n = torch.tensor(1, dtype=torch.int64) out1_ref, out2_ref, out3_ref = torch.ops.aten._scaled_dot_product_flash_attention_backward(a, b, c, d, e, f, g, h, i, j, k, l, m, n, scale=0.125) from torch._meta_registrations import meta__scaled_dot_product_flash_backward out1_test, out2_test, out3_test = meta__scaled_dot_product_flash_backward(a, b, c, d, e, f, g, h, i, j, k, l, m, n, scale=0.125) # prints True True print(out1_ref.is_contiguous()) print(out1_test.is_contiguous()) # prints True True print(out2_ref.is_contiguous()) print(out2_test.is_contiguous()) # prints True False print(out3_ref.is_contiguous()) print(out3_test.is_contiguous()) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/119500 Approved by: https://github.com/drisspg, https://github.com/ezyang, https://github.com/Skylion007	2024-02-10 02:04:56 +00:00
Aaron Orenstein	4dc53f777b	Fix dynamo failure w/ astype (#117952 ) The torch "fake" ndarray had some mismatches vs numpy.ndarray which caused test_sparse_to_sparse_compressed to fail under dynamo. This also fixes (because the test now hits it) a problem where unpacking a sequence with the incorrect number of args would assert in dynamo instead of graph breaking (because it would throw an exception). Added a unit test for this condition. Fixed: - torch._numpy._ndarray.astype() (actually used by the test) - torch._numpy._ndarray.put() (drive-by discovery) - torch._numpy._ndarray.view() (drive-by discovery) (burndown item 7) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117952 Approved by: https://github.com/yanboliang ghstack dependencies: #117951	2024-02-03 08:10:15 +00:00
Alexander Grund	865945cc1f	Convert `requires_cuda` to full decorator (#118281 ) Don't require using it as `@requires_cuda()` -> `@requires_cuda` instead No need for the partial function invoked many times Split out this change from the initial large refactoring in #117741 to hopefully get merged before conflicts arise Pull Request resolved: https://github.com/pytorch/pytorch/pull/118281 Approved by: https://github.com/ezyang	2024-01-25 15:50:21 +00:00
voznesenskym	fed45aee54	Replace invoking self.value if there is a user defined init, avoiding arbitrary code execution (#117818 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117818 Approved by: https://github.com/ezyang	2024-01-23 03:14:58 +00:00
PyTorch MergeBot	1174e82bde	Revert "Add _assert_scalar and teach Inductor to codegen it (#114148 )" This reverts commit `b6028acfa4`. Reverted https://github.com/pytorch/pytorch/pull/114148 on behalf of https://github.com/osalpekar due to Going to revert this given the broken torchrec PT2 tests internally: [D52648865](https://www.internalfb.com/diff/D52648865). Logs aren't too clear but @dstaay-fb can help debug as well ([comment](https://github.com/pytorch/pytorch/pull/114148#issuecomment-1886100368))	2024-01-11 02:30:22 +00:00
Edward Z. Yang	b6028acfa4	Add _assert_scalar and teach Inductor to codegen it (#114148 ) Inductor codegen for `_assert_async` is currently disabled because we don't really understand how to codegen `scalar_to_tensor` on a Sympy expression. I initially tried to see if I could get this to work, but I got into some weird problem involving stride sorting, so I decided to fix it properly by not going through a tensor. So we introduce an `_assert_scalar` which takes a scalar as an argument, avoiding needing to turn a SymBool into a tensor before asserting on it. I also add `_functional_assert_scalar` for good luck, although this doesn't do anything right now because https://github.com/pytorch/pytorch/pull/104203 still hasn't been landed. I need to customize the codegen for this operator, so I decide to directly implement it in Inductor, rather than trying to treat it as a generic ExternKernel. This leads to the new AssertScalar IR node. This is written carefully so that it doesn't get DCE'd by Inductor. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/114148 Approved by: https://github.com/jansel	2024-01-09 23:21:26 +00:00
voznesenskym	83e8a0721d	Reland #111196 (take 4) "Support tensors as Dict keys" (#116934 ) Fixes #ISSUE_NUMBER See that PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/116934 Approved by: https://github.com/ezyang, https://github.com/huydhn	2024-01-07 01:37:26 +00:00
PyTorch MergeBot	2dca3e99eb	Revert "Support tensors as Dict keys Re-PR of #111196 (#116785 )" This reverts commit `1badad9ce9`. Reverted https://github.com/pytorch/pytorch/pull/116785 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/116785#issuecomment-1879592261))	2024-01-06 08:22:33 +00:00
voznesenskym	1badad9ce9	Support tensors as Dict keys Re-PR of #111196 (#116785 ) This prepares the PR where we implement sets in terms of dicts. To do so, rather than storing internally a dictionary that maps literals to VariableTrackers, it stores (pretty much) a dictionary from VTs to VTs. To do so, keys are wrapped in an opaque internal class _Hashable. The Hashable class is opaque on purpose so that it fails hard if if it inadvertently leaks back into user code. We also found and fixed a number of latent bugs and inconsistencies in the way dynamo checked what can be a dict key. More generally, we make much clearer what are the things that need to be modified to add a new supported key type to Dicts. Fixes [#107595](https://www.internalfb.com/tasks?t=107595) Fixes [#111603](https://www.internalfb.com/tasks?t=111603) Re-PR of https://github.com/pytorch/pytorch/pull/111196 sadly due to reverts, we could not reuse @lezcano's original PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116785 Approved by: https://github.com/mlazos	2024-01-06 03:35:35 +00:00
Edward Z. Yang	53f8d17d1e	Specialize SymNodeVariable when used as module index (#114377 ) Fixes #114171 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/114377 Approved by: https://github.com/Skylion007	2024-01-05 13:51:52 +00:00
Guilherme Leobas	5c9464fb51	add CALL_FINALLY opcode (#116159 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/116159 Approved by: https://github.com/yanboliang	2023-12-27 19:01:08 +00:00
David Berard	7b7f11f230	[dynamo] test number of guards when inputs are views (#115793 ) After # 113734 landed (adding dynamic storage offsets), we found that compilation times increased significantly. The reason: tensors_definitely_do_not_overlap was doing comparisons on storage offsets which were adding guards `626b7dc847/torch/_functorch/_aot_autograd/input_output_analysis.py (L268-L276)` This guard is added on all pairs of tensors which are views of the same source tensor - i.e. it the number of guards can be quadratic in the number of input tensors. This PR adds a test to prevent similar regressions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115793 Approved by: https://github.com/yanboliang	2023-12-19 16:09:29 +00:00
Peter Bell	0e0dd8f985	[dynamo][BE] Move torchvision import inside of test_multi_import (#115677 ) Currently this skip imports torchvision, so if your torchvision install is broken then the entire file fails at collection time. This instead means only the test itself will fail. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115677 Approved by: https://github.com/lezcano	2023-12-13 14:16:31 +00:00
voznesenskym	76ced0df03	Consider storage_changed for assigning alias_of_input in aot_autograd when computing differentiable outputs that alias each other (#115315 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115315 Approved by: https://github.com/bdhirsh	2023-12-12 23:21:58 +00:00
Aaron Gokaslan	794545c11f	[BE]: Enable RUF015 codebase wide (#115507 ) Constant time access of first value in collection. This is a constant time operation instead of converting the item to a list to get the first item which is linear. The rule is turned on which automatically autofixes and enforces this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115507 Approved by: https://github.com/malfet	2023-12-11 15:51:01 +00:00
Jason Ansel	88642d44d9	[dynamo] Add RestrictedListSubclassVariable (#115057 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115057 Approved by: https://github.com/yanboliang ghstack dependencies: #115095, #115046	2023-12-05 19:01:23 +00:00
Jason Ansel	aa70e31610	[dynamo] Fix MutableSideEffects returning alias (#115095 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115095 Approved by: https://github.com/yanboliang	2023-12-05 19:01:03 +00:00
Jason Ansel	3d0bbb24a1	[dynamo] Improve support for list subclasses (#115052 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115052 Approved by: https://github.com/oulgen, https://github.com/eellison ghstack dependencies: #114830, #115047, #115048	2023-12-05 01:31:33 +00:00
Jason Ansel	a70c85ce90	[dynamo] Improve support for inspect.signature().parameters (#115047 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115047 Approved by: https://github.com/oulgen ghstack dependencies: #114830	2023-12-04 19:08:36 +00:00
voznesenskym	4cfe997490	[dynamo] handle setting .data on a tensor (#113080 ) Dynamo We don't want setattr in the graph. Setting data has interesting implications on both aliasing and on the autograd engine. The safe recipe is: 1) Disable grad 2) Call set_() 3) Manually lower the version counter on the object to hide it from the autograd engine This is effectively the same exact thing as setting .data, and it composes properly with aot_autograd and inductor. aot_autograd For aot_autograd, there's another snag. Specifically, when we invoke aot_autograd, we call `fake_mode.from_tensor()`, relying on memo to get the right tensor out. For .data mutations, this doesn't work, because the memoized fake_tensor is in the state it will be in at the end of the trace, not at the beginning. This means that the .data call is already applied, and the tensor shape (as in the case of these tests) mismatches. aot_autograd produces an invalid graph, with illegal calls like `torch.ops.aten.view.default(primals_2, [0])` where primals is actually sized `([6])` on input. The new plan here is to: 1) Record tensor fakification policy in dynamo 2) provide a fresh fake mode to all backends 3) Invoke from_tensor with the stored policy to get fresh new fake tensors in aot_autograd Pull Request resolved: https://github.com/pytorch/pytorch/pull/113080 Approved by: https://github.com/bdhirsh	2023-12-02 00:35:44 +00:00
David Berard	3fc58a6bbe	Revert "Make offsets dynamic by default (#113734 )" (#114889 ) This reverts commit `7c38b76efe`. if a graph has a lot of inputs which are views (with nonzero storage offset), then the check for overlapping tensor views will add a lot of guards (n^2?) `b35ca2cb94/torch/_functorch/_aot_autograd/input_output_analysis.py (L256-L260)` this was causing very slow compilations on an internal model. Differential Revision: [D51733774](https://our.internmc.facebook.com/intern/diff/D51733774) Pull Request resolved: https://github.com/pytorch/pytorch/pull/114889 Approved by: https://github.com/ckluk2, https://github.com/YuqingJ, https://github.com/aaronenyeshi	2023-12-01 16:49:42 +00:00
Jon Chuang	f66add9b85	[dynamo] graph break on `np.ndarray.tobytes` (#114208 ) We can't model this accurately across np and tnp https://github.com/pytorch/pytorch/issues/114204#issuecomment-1820269949 So let's not even try. Just graph break. Fixes: https://github.com/pytorch/pytorch/issues/114204 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114208 Approved by: https://github.com/lezcano	2023-11-21 18:19:37 +00:00
Edward Z. Yang	59ad51e10a	Insert deferred runtime asserts into Dynamo FX graph (#113958 ) During the course of fake tensor propagation (and, potentially, also Dynamo execution, although I do not believe it is possible to exercise this right now), we may generate deferred runtime asserts, which represent "guards" on unbacked symbols which cannot be immediately checked on entry to a code block; instead, they have to be checked at runtime. However, we currently accumulate these deferred runtime asserts into the ShapeEnv, and don't do anything with them. This PR modifies Dynamo to automatically insert these runtime asserts into the FX graph, before passing it on to the backend compiler. The assert format coincides with the export assert format as practiced in `torch/_export/passes/add_runtime_assertions_for_constraints_pass.py`, but actually these passes are completely disjoint right now as I only handle deferred runtime asserts, while export only handles ranges (which I should probably also handle, but don't in this PR.) The assertions must be inserted by Dynamo, because you could potentially then pass the asserts onto another backend like "eager" which no longer looks at the ShapeEnv before. Thanks to previous work in export, these asserts are preserved in AOTAutograd, but they are dropped by Inductor, which needs to be fixed in future work. This piece will be a bit awkward, as Inductor would have preferred to work with the Sympy expressions directly, ah well. Here is what the Dynamo traced FX graph looks like for the test in question: ``` <eval_with_key>.0 class GraphModule(torch.nn.Module): def forward(self, L_x_ : torch.Tensor): l_x_ = L_x_ # File: /data/users/ezyang/c/pytorch/wu.py:8, code: y = x.item() item = l_x_.item() # No stacktrace found for following nodes ge_1 = item >= 0 scalar_tensor_default = torch.ops.aten.scalar_tensor.default(ge_1); ge_1 = None _assert_async_msg = torch.ops.aten._assert_async.msg(scalar_tensor_default, "Deferred runtime assert failed: i0 >= 0, where i0 was defined by 'item' (for more information, run with TORCH_LOGS=+dynamo,dynamic)"); scalar_tensor_default = None # File: /data/users/ezyang/c/pytorch/wu.py:9, code: torch._check_is_size _check_is_size = torch._check_is_size(item) # File: /data/users/ezyang/c/pytorch/wu.py:10, code: if y >= 0: ge = item >= 0; item = None # File: /data/users/ezyang/c/pytorch/wu.py:11, code: return x * 2 mul = l_x_ * 2; l_x_ = None return (mul,) ``` Note that we actually keep the `_check_is_size` in the graph redundantly. However, assert_async is retained in the graph, whereas _check_is_size ends up getting DCE'ed. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/113958 Approved by: https://github.com/aakhundov, https://github.com/tugsbayasgalan ghstack dependencies: #113978	2023-11-20 21:25:11 +00:00
Jon Chuang	100b9952b1	[dynamo] Fix user defined object sourceless callable (#114066 ) Fixes https://github.com/pytorch/pytorch/issues/114019 We do not need to guard on callable user object defined instantiated in graph Pull Request resolved: https://github.com/pytorch/pytorch/pull/114066 Approved by: https://github.com/ezyang	2023-11-20 18:38:03 +00:00
Edward Z. Yang	caffa44b1c	Correctly use real boolean operators, not bitwise in shape guard prints (#113927 ) Fixes https://github.com/pytorch/pytorch/issues/113875 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/113927 Approved by: https://github.com/voznesenskym	2023-11-18 04:24:45 +00:00
Peter Bell	9f47580ad7	[BE] Don't mutate torch.compile global config in tests (#113882 ) We should uniformly use `config.patch` so the configuration changes don't effect different tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113882 Approved by: https://github.com/lezcano	2023-11-17 16:49:48 +00:00
David Berard	7c38b76efe	Make offsets dynamic by default (#113734 ) Copied from @ezyang 's #113693. The motivation for this change is that we'd like to guard on storage offset in inductor, to make assumptions about data alignment. create_symbolic_sizes_strides_storage_offset() creates the sizes/strides/offset for fake tensors - they can either be integers or symints. This PR changes storage_offset to always be dynamic. In variables/builder.py, we remove a conditional so that all tensors get added to tracked_fakes. This is because the storage offset will be dynamic even if the other logic in builder.py suggests that it will be static; otherwise, we run into this issue: `1e260c851b/torch/fx/experimental/symbolic_shapes.py (L892-L895)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/113734 Approved by: https://github.com/ezyang	2023-11-17 07:57:21 +00:00
Jon Chuang	277229d0c6	[dynamo] Fix incorrectly casting `SymNode` to `int` when input is `bool` (#113871 ) Fixes https://github.com/pytorch/pytorch/issues/113393, https://github.com/pytorch/pytorch/pull/113848#issuecomment-1814624510 Incorrectly casting symnode type will cause it to take the wrong path in symbolic_shapes Pull Request resolved: https://github.com/pytorch/pytorch/pull/113871 Approved by: https://github.com/jansel	2023-11-16 23:24:57 +00:00
PyTorch MergeBot	98df3088c3	Revert "Make offsets dynamic by default (#113734 )" This reverts commit `9efbb4ea73`. Reverted https://github.com/pytorch/pytorch/pull/113734 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it is causing a memory leak in one of the test `9efbb4ea73` ([comment](https://github.com/pytorch/pytorch/pull/113734#issuecomment-1815297222))	2023-11-16 20:56:27 +00:00
David Berard	9efbb4ea73	Make offsets dynamic by default (#113734 ) Copied from @ezyang 's #113693. The motivation for this change is that we'd like to guard on storage offset in inductor, to make assumptions about data alignment. create_symbolic_sizes_strides_storage_offset() creates the sizes/strides/offset for fake tensors - they can either be integers or symints. This PR changes storage_offset to always be dynamic. In variables/builder.py, we remove a conditional so that all tensors get added to tracked_fakes. This is because the storage offset will be dynamic even if the other logic in builder.py suggests that it will be static; otherwise, we run into this issue: `1e260c851b/torch/fx/experimental/symbolic_shapes.py (L892-L895)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/113734 Approved by: https://github.com/ezyang	2023-11-16 06:49:09 +00:00
Brian Hirsh	cebad9867b	graph break on intermediate leaves that require grad (#113277 ) fixes https://github.com/pytorch/pytorch/issues/90552. This is a simpler fix that just detects the situation where AOTAutograd can't create a proper backward graph for the situation and graph breaks. This was technically a silent correctness issue before. This PR tries to always graph break when we see a factory function that returns a tensor requiring grad. I check this by seeing if the op returned a `TensorVariable` in dynamo, and if one of the input arguments was a `requires_grad=True` kwarg. I think this is high-fidelity enough, and I'm also hoping that this is uncommon enough that a graph break is reasonable here. The fix to avoid the graph break in user land is also pretty easy - just instantiate your tensor outside of the compiled region and plumb it in. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113277 Approved by: https://github.com/eellison ghstack dependencies: #113267, #113416, #113584	2023-11-16 02:47:45 +00:00
PyTorch MergeBot	5d170fce29	Revert "Support tensors as Dict keys (#111196 )" This reverts commit `b0805fa5d0`. Reverted https://github.com/pytorch/pytorch/pull/111196 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it is failing internally. I will provide the details there ([comment](https://github.com/pytorch/pytorch/pull/111196#issuecomment-1813410149))	2023-11-15 23:08:00 +00:00
Brian Hirsh	032e5a4528	handle cross-dtype views during AOTAutograd view-replay (#113416 ) Fixes https://github.com/pytorch/pytorch/issues/109053 I think "partitioning views out of the graph" will be a more robust fix for the class of errors that we've seen around incorrectly regenerating views at runtime. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113416 Approved by: https://github.com/ezyang ghstack dependencies: #113267	2023-11-15 19:55:47 +00:00
Brian Hirsh	720e866d18	graph break on out= ops with noncontiguous out args (#113267 ) Fixes https://github.com/pytorch/pytorch/issues/113010 In eager mode, when you call an out= op like `add(..., out=out_arg)` with an out argument that is noncontiguous, the noncontiguous out arg will be returned directly. When we functionalize though, functionalization replaces it with a call to `add(...)` which ignores the contiguity of the original out arg. Instead of trying to support this, this PR detects that situation and graph breaks Pull Request resolved: https://github.com/pytorch/pytorch/pull/113267 Approved by: https://github.com/albanD	2023-11-15 19:55:47 +00:00
lezcano	b0805fa5d0	Support tensors as Dict keys (#111196 ) This prepares the PR where we implement sets in terms of dicts. To do so, rather than storing internally a dictionary that maps literals to VariableTrackers, it stores (pretty much) a dictionary from VTs to VTs. To do so, keys are wrapped in an opaque internal class `_Hashable`. The Hashable class is opaque on purpose so that it fails hard if if it inadvertently leaks back into user code. We also found and fixed a number of latent bugs and inconsistencies in the way dynamo checked what can be a dict key. More generally, we make much clearer what are the things that need to be modified to add a new supported key type to Dicts. Fixes https://github.com/pytorch/pytorch/issues/107595 Fixes https://github.com/pytorch/pytorch/issues/111603 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111196 Approved by: https://github.com/jansel	2023-11-14 19:14:03 +00:00
Jon Chuang	e6eab49e11	[dynamo] graph break on setattr `requires_grad` (#113163 ) Main: `RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn` This PR: graph breaks and eager applies the mutation, new tensors are tracked Fixes https://github.com/pytorch/pytorch/issues/109505 (the original bug does not occur, but a new bug where the mutation isn't applied - because AOTAutograd is not `requires_grad` mutation aware - is mitigated) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113163 Approved by: https://github.com/bdhirsh	2023-11-09 13:13:29 +00:00
PaliC	2c4be77f02	Revert "[dynamo] Graph break on `setattr(Tensor, "data", Tensor)` (#113043 )" (#113297 ) This reverts commit `ddfe572534`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113297 Approved by: https://github.com/PaliC	2023-11-09 00:26:21 +00:00
PyTorch MergeBot	94d95a91a2	Revert "[dynamo] graph break on setattr `requires_grad` (#113163 )" This reverts commit `d261687d5f`. Reverted https://github.com/pytorch/pytorch/pull/113163 on behalf of https://github.com/PaliC due to relevant tests are not running for this pr, however, this is fixed after landing https://github.com/pytorch/pytorch/pull/113297/ ([comment](https://github.com/pytorch/pytorch/pull/113163#issuecomment-1802967236))	2023-11-09 00:23:04 +00:00
Jon Chuang	d261687d5f	[dynamo] graph break on setattr `requires_grad` (#113163 ) Main: `RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn` This PR: graph breaks and eager applies the mutation, new tensors are tracked Fixes https://github.com/pytorch/pytorch/issues/109505 (the original bug does not occur, but a new bug where the mutation isn't applied - because AOTAutograd is not `requires_grad` mutation aware - is mitigated) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113163 Approved by: https://github.com/bdhirsh	2023-11-08 19:51:23 +00:00
Edward Z. Yang	1f3fa13f0a	Handle unbacked SymInt sized outputs in AOTAutograd (#113159 ) Thanks aakhundov for constructing the test case. This PR was constructed by running the failing test case, and then fixing problems until we got all the way to the end. There are a few distinct fixes: * AOTAutograd performs equality tests on tensor metadata to determine if a metadata mutation had occurred. If we test i0 vs i1, we should report these are NOT equal, since obviously we have somehow resized the tensor from i0 to i1 (even if, on a particular run, it is possible i0 == i1). * There's a sketchy fix for `test_aot_autograd_exhaustive_matmul_cpu_float32` where we check if the output shape equals the tangent shape. Unfortunately, the same `definitely_true` treatment does not work here, it still fails on the example. I piled an extra sketchy fix on top of it, where I just try my best to avoid doing the view. Maybe we should have some sort of logging here. * Partitioner needs to get out a size for unbacked SymInt when partitioning. I just feed it a random heuristic value in this case, similar to how we've been dealing with this in Inductor. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/113159 Approved by: https://github.com/aakhundov, https://github.com/bdhirsh	2023-11-08 04:28:38 +00:00
Jason Ansel	9664190952	[dynamo] Eagerly install guards (#111415 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111415 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306	2023-11-07 19:55:19 +00:00
Jon Chuang	ddfe572534	[dynamo] Graph break on `setattr(Tensor, "data", Tensor)` (#113043 ) Fixes https://github.com/pytorch/pytorch/issues/113030 Alias information needs to be applied in eager before we can continue to trace the graph. ---- Perhaps this is too strict - couldn't we fx trace through the in-graph (pointer) aliasing, and track mutations through fake tensors instead, and still apply the aliasing mutation epilogue for further mutations outside of graph? 🤔 Regardless, it didn't seem to work too well when I tried this. Seems that `Tensor.__setattr__` doesn't work well in fx graph. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113043 Approved by: https://github.com/ezyang, https://github.com/voznesenskym	2023-11-07 03:56:21 +00:00
Ken Jin	674c104d12	Fix RecursionError in Inductor for large for loops (#112320 ) Fixes https://github.com/pytorch/pytorch/issues/111686 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112320 Approved by: https://github.com/peterbell10	2023-11-05 13:12:54 +00:00
Kazuaki Ishizaki	9089242048	Fix typo under test directory (#112346 ) This PR fixes typo in comments and messages under `test` directory. This PR also fixes related typo in messages under `torch` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112346 Approved by: https://github.com/kit1980, https://github.com/ezyang	2023-11-03 07:53:33 +00:00
Jon Chuang	2e40e09d57	[dynamo] `{*}Tensor.__init__` from list of Tensor/ndarray as `torch.stack(List[FakeTensor])` (#111741 ) Follow up to https://github.com/pytorch/pytorch/pull/111665 Fixes: https://github.com/pytorch/pytorch/issues/106207 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111741 Approved by: https://github.com/lezcano	2023-10-31 18:44:04 +00:00
Jon Chuang	e2e1189f41	[dynamo] Fix guard for ndarray calling `torch.as_tensor(None)` (#111665 ) Fixes https://github.com/pytorch/pytorch/issues/111662 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111665 Approved by: https://github.com/lezcano	2023-10-22 15:16:21 +00:00
Edward Z. Yang	126d422cf0	Error if you try to run Dynamo compiled function under torch.jit.trace (#111321 ) Fixes https://github.com/pytorch/pytorch/issues/111319 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/111321 Approved by: https://github.com/Chillee	2023-10-16 13:52:29 +00:00
Jon Chuang	ac768333be	[dynamo] fix prim lowering validation logic for dynamic shape args (#111208 ) Fixes https://github.com/pytorch/pytorch/issues/111199 Fixes https://github.com/pytorch/pytorch/issues/111203 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111208 Approved by: https://github.com/ezyang	2023-10-13 18:36:13 +00:00
Jon Chuang	5aa96fd336	[dynamo] list index: add more list types to testing, support namedtuple, improve error handling (#110919 ) Follow up: #110817 Minor improvements as discussed in prev PR Pull Request resolved: https://github.com/pytorch/pytorch/pull/110919 Approved by: https://github.com/ezyang	2023-10-11 00:16:39 +00:00
Jon Chuang	db760527e0	fix(dynamo): list index via polyfill (#110817 ) Fixes https://github.com/pytorch/pytorch/issues/109031 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110817 Approved by: https://github.com/ezyang	2023-10-09 19:48:39 +00:00
Ken Jin	31d635803b	[Dynamo] Fx proxy for builtin all with list iterators (#109972 ) Fixes https://github.com/pytorch/pytorch/issues/109057. Fixes https://github.com/pytorch/pytorch/issues/103620. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109972 Approved by: https://github.com/ezyang	2023-10-04 07:59:26 +00:00
Animesh Jain	8ed08e5a7c	[dynamo] Graph break on rng get/set state - remove GeneratorStateSource (#109410 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109410 Approved by: https://github.com/ezyang ghstack dependencies: #109411	2023-09-22 22:31:55 +00:00
Animesh Jain	5349615240	[dynamo] Unblock a model with jit.isinstance (#109178 ) prevents this error ``` File "/tmp/jetter.azp5q59y/torch/fx/proxy.py", line 291, in create_arg python/0 raise NotImplementedError(f"argument of type: {type(a)}") python/0 torch._dynamo.exc.InternalTorchDynamoError: argument of type: <class 'typing._GenericAlias'> ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/109178 Approved by: https://github.com/yanboliang	2023-09-15 01:19:46 +00:00
Michael Voznesensky	9b2d43df93	Handle empty lists properly (#107803 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107803 Approved by: https://github.com/ezyang	2023-08-24 01:42:29 +00:00
Tugsbayasgalan Manlaibaatar	ee72071fc7	Avoid executing side-effectful graph_module as validation step (#107271 ) Dynamo currently runs the real graph module with real inputs as a way to match the return result of graph module with the eager return type. This is unsafe when graph module is side effectful. In the long term, we will get rid of this step. But in the short term, we just fakify the graph module again and run it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107271 Approved by: https://github.com/ezyang	2023-08-22 04:22:31 +00:00
Jason Lu	bc88028e8e	Back out "Reland "Make adding buffers more like adding parameters (#104069 )" (#106224 )" (#106743 ) Summary: Original commit changeset: 81319beb97f3 Original Phabricator Diff: D47961182 Test Plan: revert to maintain backward compat with legacy ads_dper3 production package. Read details in: S357822 Reviewed By: atuljangra Differential Revision: D48131623 @diff-train-skip-merge (D48131623 landed internally) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106743 Approved by: https://github.com/malfet	2023-08-08 15:27:34 +00:00
Mikayla Gawarecki	d8e5f2aa6d	Reland "Make adding buffers more like adding parameters (#104069 )" (#106224 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106224 Approved by: https://github.com/atalman, https://github.com/albanD	2023-07-31 17:18:56 +00:00
Edward Z. Yang	7b9d250f06	Change _dynamo.export to be export(f)(args, *kwargs) (#106109 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106109 Approved by: https://github.com/voznesenskym	2023-07-27 21:41:13 +00:00
Animesh Jain	afd955f3de	[dynamo][constant] Kwargs already supported for str methods (#105785 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105785 Approved by: https://github.com/yanboliang	2023-07-22 09:33:23 +00:00
Andrey Talman	c6653b65d8	Back out "Make adding buffers more like adding parameters (#104069 )" (#105581 ) Summary: D47537831 is breaking pyper tests: https://fb.workplace.com/groups/802176577445480/posts/1018902842439518/ with `TypeError: register_buffer() takes 3 positional arguments but 4 were given` Original commit changeset: d4b4069fbd38 Original Phabricator Diff: D47537831 Test Plan: ``` buck2 run //caffe2/torch/fb/training_toolkit/integration_tests/training_lifecycle/cogwheel_tests/pyper_release_v2:cogwheel_smallworld_inline_cvr_infer_pyper_pyper__canary_offline_training-launcher -- --run-harness-in-tupperware --build-fbpkg ads_dper3 --build-fbpkg training_platform ``` Reviewed By: atalman Differential Revision: D47600140 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105581 Approved by: https://github.com/mikaylagawarecki	2023-07-20 03:39:53 +00:00
Animesh Jain	af9a4e08fa	[dynamo][rewrite_asserts] Insert assertion msg in bytecode only when needed (#105549 ) Fixes https://github.com/pytorch/pytorch/issues/105513 The main issue is that we could call `self.LOAD_CONST` and change Dynamo stack, and then decide that we can't rewrite it later. This PR ensures that we change the dynamo stack only when we decide to rewrite asserts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105549 Approved by: https://github.com/tugsbayasgalan	2023-07-19 23:14:01 +00:00
Justin Chu	8a688277a2	[BE] Enable ruff's UP rules and autoformat dynamo / functorch and refs (#105432 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105432 Approved by: https://github.com/ezyang	2023-07-19 13:48:44 +00:00
ekamiti	32d422f335	Make adding buffers more like adding parameters (#104069 ) Add similar semantics for creating a buffer object similar to creating a parameter. This is done by introducing a new `Buffer` class that can be used for type disambiguation. The underlying functionality of registering a buffer remains the same as the `register_buffer` method has not been changed. The `persistent` parameter in the `Buffer` type is to indicate whether a buffer object should be persistent or not. Other non-test changes have to do with getting the new `Buffer` type recognized by inductor and dynamo. Remaining changes are test changes to make sure that the `Buffer` type can be used as a drop in replacement for `register_buffer` as it just leads to `register_buffer` being called. The addition of this new functionality still allows for normal tensors to be used as buffers so these changes are intended to be backwards compatible. Fixes #35735 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104069 Approved by: https://github.com/mikaylagawarecki	2023-07-17 17:59:05 +00:00
Brian Hirsh	c6b9c31a2c	[inductor] fix incorrect strides in copy() decomp, fix hf_LongFormer + hf_BigBird errors (#100115 ) Fixes https://github.com/pytorch/pytorch/issues/100067, https://github.com/pytorch/pytorch/issues/98268 and https://github.com/pytorch/pytorch/issues/93428. See the comment [here](https://github.com/pytorch/pytorch/issues/100067#issuecomment-1523856970) for details. The bug was that the decomposition that inductor uses for `aten.copy` doesn't respect the strides of the input in all cases. The fixes that I added should work, but will be pretty slow - we allocate a tensor (potentially larger than `self` if `self` is a slice), and perform an `as_strided_scatter` + `as_strided`. Longer term, stride-agnostic IR should let us remove this decomp? cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @anijain2305 @soumith @desertfire Pull Request resolved: https://github.com/pytorch/pytorch/pull/100115 Approved by: https://github.com/albanD, https://github.com/ngimel	2023-07-13 14:40:57 +00:00
Animesh Jain	9647a251cb	[dynamo] Dataclass variables with default field (#104840 ) The main complexity comes from the __init__ function of Dataclass variables which look something like this ``` [2023-07-10 05:01:29,548] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object __init__ at 0x7f7015154450, file "<string>", line 2> 3 0 LOAD_FAST 1 (b) 2 LOAD_FAST 0 (self) 4 STORE_ATTR 0 (b) 4 6 LOAD_FAST 2 (named_tensors) 8 LOAD_DEREF 0 (_HAS_DEFAULT_FACTORY) 10 IS_OP 0 12 POP_JUMP_IF_FALSE 20 14 LOAD_DEREF 1 (_dflt_named_tensors) 16 CALL_FUNCTION 0 18 JUMP_FORWARD 2 (to 22) >> 20 LOAD_FAST 2 (named_tensors) >> 22 LOAD_FAST 0 (self) 24 STORE_ATTR 1 (named_tensors) 26 LOAD_CONST 0 (None) 28 RETURN_VALUE ``` There are multiple issues * VariableBuilder call in functions.py was wrong. We were calling options as args. We were not setting source while tracking the new object. This led to no source for Dataclass variable, which has some new variables in its closures as seen in the above bytecode. * There is IS_OP in above bytecode, which brings more cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104840 Approved by: https://github.com/jansel	2023-07-13 01:25:57 +00:00
Michael Lazos	4b8378967a	Fix pytest test discovery for vscode (#104864 ) With the latest update, this test class name started breaking pytest test discovery in vscode Pull Request resolved: https://github.com/pytorch/pytorch/pull/104864 Approved by: https://github.com/Chillee, https://github.com/albanD, https://github.com/malfet	2023-07-10 14:56:41 +00:00
Yukio Siraichi	40b8d10d5e	Re-land: Turn translation validation on for tests and accuracy runs by default. (#104467 ) Re-landing: #103611 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104467 Approved by: https://github.com/malfet	2023-07-05 19:01:50 +00:00
PyTorch MergeBot	a2a8b4d415	Revert "Turn translation validation on for tests and accuracy runs by default. (#103611 )" This reverts commit `e311bed2a8`. Reverted https://github.com/pytorch/pytorch/pull/103611 on behalf of https://github.com/malfet due to Broke inductor tests ([comment](https://github.com/pytorch/pytorch/pull/103611#issuecomment-1614850276))	2023-06-30 15:54:18 +00:00
Yukio Siraichi	e311bed2a8	Turn translation validation on for tests and accuracy runs by default. (#103611 ) This PR turns translation validation on by default for tests and accuracy benchmark runs. It also installs Z3 on CI. The main changes are: - Add `--no-translation-validation` as an option in _test/run_tests.py_ - Set `PYTORCH_TEST_WITH_TV` environment variable - Add `TEST_WITH_TV` variable in _torch/testing/_internal/common_utils.py_ - Turn translation validation on for accuracy benchmarks in _benchmarks/dynamo/common.py_ - Add Z3 installation on CI scripts Pull Request resolved: https://github.com/pytorch/pytorch/pull/103611 Approved by: https://github.com/ezyang	2023-06-30 01:32:21 +00:00
Andy Rock	12f19b5dd9	consider `CALL_FINALLY` non-jumping in `stacksize_analysis` (#103621 ) Fixes #97811. This PR fixes a bug in `stacksize_analysis`. The pre-`python3.9` opcode `END_FINALLY` should be considered terminal. (edit: this is no longer what this PR does) With this change, [this](https://github.com/pytorch/pytorch/issues/97811#issuecomment-1591888590) previously failing example now passes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103621 Approved by: https://github.com/williamwen42	2023-06-29 20:23:20 +00:00
Richard Zou	408cb45e14	[Dynamo] Support threading.local getattr (#104292 ) Fixes #104066 threading.local has a custom `__getattribute__` so `_getattr_static` doesn't work with it. Since we know that threading.local's `__getattribute__` is well behaved (e.g. https://github.com/python/cpython/blob/3.11/Lib/_threading_local.py), we can just special case it. Test Plan: - new tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/104292 Approved by: https://github.com/williamwen42, https://github.com/jansel	2023-06-29 14:32:37 +00:00
David Berard	a8b63d4d1b	[dynamo] If UserDefinedObjectVariable.var_getattr() is a callable, try handling as a TorchVariable (#104231 ) In some cases, a UserFunctionVariable can be constructed when the underlying function is actually a TorchVariable. One example is when an attribute on a UnspecializedNNModuleVariable is a torch function. In those cases, we should treat the UserFunctionVariable as a TorchVariable. This adds a check in UserDefinedObjectVariable.var_getattr() to try to create a TorchVariable instead of a UserFunctionVariable. Fixes #104172 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104231 Approved by: https://github.com/williamwen42, https://github.com/jansel	2023-06-28 02:39:03 +00:00
Brian Hirsh	3cfd677b1f	fix inference mode / PyDispatcher / Functionalize interaction (#103275 ) Fixes https://github.com/pytorch/pytorch/issues/103132 This is kind of annoying: Functionalization (and also vmap, I think?) manually figures out which ops have C++ CompositeImplicit decomps, and directly registers them to the Functionalize key. This is a problem for the PyDispatcher: We normally want the PyDispatcher to take precedence over the regular dispatcher. But in this case, we have a python decomp registered to `CompositeImplicitAutograd`, and a C++ decomp registered directly to the `Functionalize` key, so the C++ decomp gets precedence over the python decomp. The way this showed up was that a model was running `matmul()` under inference mode, so we never hit the autograd dispatch key, and go straight to the functionalize dispatch key. Matmul has both a python decomp and a c++ decomp, but we were running the C++ decomp. That C++ decomp isn't meant to be used with dynamic shapes, so we were failing with the "tried to call `.sizes()` on a tensor with dynamic shapes" error. For now, I had the PyDispatcher mimic the behavior of functionalization codegen: when you register a python decomp to the `CompositeImplicitAutograd` key, this PR just automatically registers that decomp to the `Functionalize` key at the same time. I'm trying to remember now why we didn't just add `Functionalize` (and all of the other functorch transform keys) directly to the `CompositeImplicitAutograd` alias keyset, but I couldn't remember (@zou3519 any chance you remember?). Pull Request resolved: https://github.com/pytorch/pytorch/pull/103275 Approved by: https://github.com/ezyang, https://github.com/zou3519	2023-06-21 15:19:55 +00:00
Michael Voznesensky	e5e9d563c2	Lift user defined attributes into inputs for certain cases (user defined types and tensors) (#103386 ) (1) Lazy (converts to dynamo variable on access only) (2) Uses existing side effect/reconstruct tech (3) not tensor opinionated Pull Request resolved: https://github.com/pytorch/pytorch/pull/103386 Approved by: https://github.com/jansel	2023-06-20 23:45:19 +00:00
Edward Z. Yang	ed3a61afcc	Add automatic_dynamic_shapes test configuration (#103598 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103598 Approved by: https://github.com/Skylion007	2023-06-15 19:55:57 +00:00
Edward Z. Yang	bc6ec97e02	Switch dynamic_shapes to True by default (#103597 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103597 Approved by: https://github.com/voznesenskym	2023-06-15 15:16:20 +00:00
Edward Z. Yang	ddf4cd69ec	Delete ifdyn and ifunspec combinators (#103596 ) Replaced with expect tests for ease of updating. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103596 Approved by: https://github.com/voznesenskym	2023-06-15 00:14:17 +00:00
Edward Z. Yang	9946499228	Continue simplifying dynamic shapes tests (#103592 ) Remove the static by default / no automatic dynamic configuration as this is about to become the default. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103592 Approved by: https://github.com/voznesenskym, https://github.com/Skylion007	2023-06-14 19:35:51 +00:00
Edward Z. Yang	49754f44ee	Rewrite size/stride/numel TensorVariable handling (#103438 ) The main concept behind this refactor is this: if we know that a size/stride/etc is constant, do NOT trace it into the graph, EXCEPT for any preexisting special cases that applied for static shapes. The refactor unfolds like this: 1. Delete the `dynamic_shapes` branches in torch/_dynamo/variables/builder.py which accept int/float/bool outputs. This is over-aggressive and we don't want to allow this (because if the operator returns a constant, we shouldn't have called wrap_fx_proxy in the first place.) This causes a bunch of failures because we are blindly feeding the result of size() call to wrap_fx_proxy when dynamic shapes is enabled. 2. Modify TensorVariable.call_method in torch/_dynamo/variables/tensor.py to avoid sending constant ints to wrap_fx_proxy. After normal specialization (which should be deleted, see https://github.com/pytorch/pytorch/pull/103434) we consult the fake tensor to see if the values in question have free variables or not. If they don't we short circuit tracing into graph. We only trace into graph if the operation in question is truly symbolic. Note that there is a near miss here: it's OK to trace x.size() call entirely into the graph, even if it doesn't have all dynamic shapes, because operator.getitem with int output is special cased in builder.py. This is a preexisting special case and I don't try to get rid of it. 3. It turns out that the change here also breaks torch_np compatibility layer. So I completely rewrite getattr handling in torch/_dynamo/variables/tensor.py to follow the same pattern (only trace into graph if truly dynamic). There's some minor housekeeping in torch/fx/experimental/symbolic_shapes.py and some test files. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103438 Approved by: https://github.com/larryliu0820	2023-06-12 19:36:24 +00:00
Edward Z. Yang	414ec6ce97	Turn off automatic_dynamic_shapes in prep for dynamic-by-default (#103320 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103320 Approved by: https://github.com/Skylion007	2023-06-10 02:49:59 +00:00
Edward Z. Yang	605a85249c	Fix graph break on boolean mask better (#103052 ) Previously I accidentally thought setitem takes each argument as a list. But if you write x[:, b] that actually is passed in as a tuple. Try harder. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103052 Approved by: https://github.com/desertfire	2023-06-07 14:40:56 +00:00
Richard Zou	5b700fc914	Disable fallback for custom kernels (#101131 ) Previous failed attempt was here: https://github.com/pytorch/pytorch/pull/97715. Basically we tried to disable fallback for all ops (aten + custom) but hit many CI failures due to missing fake tensor coverage. Let's just disable it for custom kernels for now. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101131 Approved by: https://github.com/zou3519	2023-06-06 23:25:29 +00:00
Yanbo Liang	d92bb036a4	[Dynamo] Fix if condition on UnspecializedNNModuleVariable (#102583 ) Fixes #102315 The root cause is for ```UnspecializedNNModuleVariable``` which extends from ```UserDefinedObjectVariable```, if ```__bool__``` is missing, we should use ```__len__``` to infer a truth value. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102583 Approved by: https://github.com/jansel	2023-06-03 03:42:15 +00:00
David Berard	1035e33b38	[dynamo] test attaching attributes to an OptimizedModule (#102781 ) Test that the following passes: ```python mod = torch.compile(mod) mod.is_compiled = True assert "is_compiled" in dir(mod) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/102781 Approved by: https://github.com/yuguo68	2023-06-03 02:20:21 +00:00
Edward Z. Yang	5d57a348cd	Graph break on differentiable boolean mask setitem (#102843 ) Fixes https://github.com/pytorch/pytorch/issues/102841 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/102843 Approved by: https://github.com/voznesenskym	2023-06-02 22:34:52 +00:00
Animesh Jain	040d2cc969	[dynamo] Some torchrec_dlrm related fixes (#101953 ) Issue 1 of https://github.com/pytorch/pytorch/issues/101918 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101953 Approved by: https://github.com/jansel	2023-05-28 17:56:08 +00:00
Brian Hirsh	98ab11a2c3	separate out dynamo .requires_grad and .is_grad_enabled guards (#100570 ) Fixes https://github.com/pytorch/pytorch/issues/100977 This will hopefully fix this error (from [issue](https://github.com/pytorch/pytorch/issues/99616)) This PR fixes an internal model: we were running an inductor inference graph, but `torch.is_grad_enabled()` was True, causing us to error inside of the inference graph when we encountered an out= operator. I haven't been able to create a smaller repro - before landing this, I want to create a smaller repro to convince myself of why we need to separate out these guards. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100570 Approved by: https://github.com/ezyang	2023-05-24 14:58:40 +00:00
Michael Voznesensky	ea5eaa8692	Remove config check in specialize (#102098 ) Fixes Pull Request resolved: https://github.com/pytorch/pytorch/pull/102098 Approved by: https://github.com/ezyang	2023-05-24 01:26:22 +00:00
PyTorch MergeBot	d0bb8fdc64	Revert "[dynamo] Minor refactor to use is_allowed to decide inlining of NNModule methods (#101910 )" This reverts commit `8b2a9f81cc`. Reverted https://github.com/pytorch/pytorch/pull/101910 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/101910#issuecomment-1556782524))	2023-05-22 08:37:12 +00:00
Animesh Jain	8b2a9f81cc	[dynamo] Minor refactor to use is_allowed to decide inlining of NNModule methods (#101910 ) Fixes #101609 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101910 Approved by: https://github.com/yanboliang	2023-05-20 03:34:20 +00:00
PyTorch MergeBot	7f3fed125e	Revert "separate out dynamo .requires_grad and .is_grad_enabled guards (#100570 )" This reverts commit `1fabee399d`. Reverted https://github.com/pytorch/pytorch/pull/100570 on behalf of https://github.com/PaliC due to breaking inductor tests along with #101219 ([comment](https://github.com/pytorch/pytorch/pull/100570#issuecomment-1555271267))	2023-05-19 21:29:09 +00:00
Brian Hirsh	1fabee399d	separate out dynamo .requires_grad and .is_grad_enabled guards (#100570 ) Fixes https://github.com/pytorch/pytorch/issues/100977 This will hopefully fix this error (from [issue](https://github.com/pytorch/pytorch/issues/99616)) This PR fixes an internal model: we were running an inductor inference graph, but `torch.is_grad_enabled()` was True, causing us to error inside of the inference graph when we encountered an out= operator. I haven't been able to create a smaller repro - before landing this, I want to create a smaller repro to convince myself of why we need to separate out these guards. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100570 Approved by: https://github.com/ezyang	2023-05-19 16:14:56 +00:00
Michael Voznesensky	4c1bc91f42	Support autograd.Function w/ grad (#99483 ) This PR adds support for tracing autograd.Function with grad. A few important bullet points outlining our approach: 1) Our goal is to verify soundness in order to add a call_function to the autograd.Function's `apply` to the graph. 2) We achieve (1) by either verifying soundness or rejecting soundness, by ensuring that both forward and backward of the autograd.Function are sound. 3) For the forward, if we verify soundness, we install its guards into the graph. 4) For the backward, if we verify soundness, we throw it out. However, backwards soundness verification is more onerous, and has a config driven set of banned attrs and methods for tensors. 1-4 above are achieved by turning the forward and backward into UserDefinedFunctionVariables, and inlining through them, relying on dynamo's soundness detection. If we graph break in these, we raise and treat them as unsound. As noted above, backwards is stricter yet. For the tracing, the safety comes from dynamo's HigherOrderOperator system. That system ensures that not only do we trace soundly, but that no new variables are lifted into inputs during the tracing, and that the forward and backwards are entirely self contained. Whenever we reject a function as unsound, we restore back, as usual. Due to some limitations in the lifting logic, we have an escape hatch we implemented for tensors that are known in forward, but cross into backwards through save_tensors (save) /saved_tensors (load). We escape hatch here to avoid having the known saved tensors coming from forward end up being accidentally treated as lifted variables (and rejected). This is sound, but a little hacky feeling. Additionally, due to some limitations in fx node removal, combined with how we produce subgraphs for the traces installed from HigherOrderOperators, we had to improve our node removal logic. In the event of a restore, we remove the old nodes from the graph, as usual in dynamo. However, because the references to these nodes may exist in subgraphs, we traverse any nodes users and remove them first if and only if they are in another graph. This is always sound, because removal should only be downstream of restoration at this point. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99483 Approved by: https://github.com/zou3519	2023-05-19 01:26:21 +00:00
Yanbo Liang	7052fb37bd	[Dynamo] Improve handling UnspecializedNNModuleVariable side effect (#101141 ) Fixes #101102 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101141 Approved by: https://github.com/jansel	2023-05-16 03:57:13 +00:00
PyTorch MergeBot	87f9160b67	Revert "[inductor] fix incorrect strides in copy() decomp, fix hf_LongFormer + hf_BigBird errors (#100115 )" This reverts commit `4c8ee583c3`. Reverted https://github.com/pytorch/pytorch/pull/100115 on behalf of https://github.com/jeanschmidt due to breaking internal tests ([comment](https://github.com/pytorch/pytorch/pull/100115#issuecomment-1547417287))	2023-05-15 08:31:58 +00:00
Brian Hirsh	4c8ee583c3	[inductor] fix incorrect strides in copy() decomp, fix hf_LongFormer + hf_BigBird errors (#100115 ) Fixes https://github.com/pytorch/pytorch/issues/100067 and https://github.com/pytorch/pytorch/issues/93428. See the comment [here](https://github.com/pytorch/pytorch/issues/100067#issuecomment-1523856970) for details. The bug was that the decomposition that inductor uses for `aten.copy` doesn't respect the strides of the input in all cases. The fixes that I added should work, but will be pretty slow - we allocate a tensor (potentially larger than `self` if `self` is a slice), and perform an `as_strided_scatter` + `as_strided`. Longer term, stride-agnostic IR should let us remove this decomp? cc @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @desertfire @ngimel Pull Request resolved: https://github.com/pytorch/pytorch/pull/100115 Approved by: https://github.com/albanD	2023-05-12 00:50:35 +00:00
Yanbo Liang	369a256381	[Dynamo] Remove cross import in dynamo unit tests (#100851 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/100851 Approved by: https://github.com/jansel	2023-05-11 17:07:25 +00:00
Edward Z. Yang	1e89a56a5b	Apply static policy correctly to unspec (#98983 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98983 Approved by: https://github.com/ezyang	2023-05-10 05:59:12 +00:00
Bartosz Szmelczynski	44e73da444	Extend assert statement to include ListVariable (#100841 ) Fixes #100697 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100841 Approved by: https://github.com/lezcano, https://github.com/anijain2305	2023-05-10 01:57:10 +00:00
PyTorch MergeBot	d98d95fb9f	Revert "[Dynamo] Remove cross import in dynamo unit tests (#100851 )" This reverts commit `c4bbeb5b8a`. Reverted https://github.com/pytorch/pytorch/pull/100851 on behalf of https://github.com/jeanschmidt due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/100851#issuecomment-1540646623))	2023-05-09 18:30:01 +00:00
Yanbo Liang	c4bbeb5b8a	[Dynamo] Remove cross import in dynamo unit tests (#100851 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/100851 Approved by: https://github.com/jansel	2023-05-08 20:16:57 +00:00
Animesh Jain	52a36a98d9	[dynamo] Graph break on a list referencing self (#100296 ) Fixes https://github.com/pytorch/pytorch/issues/100150 I did not try hard to support this w/o a graph break. As this pattern is not common, current PR graph breaks and avoids an infinite recursion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100296 Approved by: https://github.com/jansel	2023-05-02 06:38:28 +00:00
Michael Lazos	dc27b842ba	Ensure optimizer state references are cleared (#100282 ) Fixes https://github.com/pytorch/pytorch/issues/100264 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100282 Approved by: https://github.com/janeyx99, https://github.com/yanboliang	2023-05-01 22:25:07 +00:00
Michael Voznesensky	aafc6ce8cc	Produce constant variables in cases where a SymNode is created with a constant (#100144 ) ` AOT_DYNAMIC_SHAPES=1 TORCHDYNAMO_DYNAMIC_SHAPES=1 benchmarks/dynamo/huggingface.py --performance --training --amp --backend eager --disable-cudagraphs --device cuda --only AllenaiLongformerBase --explain` Looks promising! Goes from: Dynamo produced 173 graphs covering 2760 ops with 160 graph breaks (14 unique) To: Dynamo produced 6 graphs covering 2298 ops with 15 graph breaks (7 unique) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100144 Approved by: https://github.com/ezyang	2023-05-01 21:32:11 +00:00
PyTorch MergeBot	89c43f4108	Revert "Produce constant variables in cases where a SymNode is created with a constant (#100144 )" This reverts commit `d7bdfd3454`. Reverted https://github.com/pytorch/pytorch/pull/100144 on behalf of https://github.com/ezyang due to ci failure is real ([comment](https://github.com/pytorch/pytorch/pull/100144#issuecomment-1529587039))	2023-05-01 11:10:48 +00:00
Michael Voznesensky	d7bdfd3454	Produce constant variables in cases where a SymNode is created with a constant (#100144 ) ` AOT_DYNAMIC_SHAPES=1 TORCHDYNAMO_DYNAMIC_SHAPES=1 benchmarks/dynamo/huggingface.py --performance --training --amp --backend eager --disable-cudagraphs --device cuda --only AllenaiLongformerBase --explain` Looks promising! Goes from: Dynamo produced 173 graphs covering 2760 ops with 160 graph breaks (14 unique) To: Dynamo produced 6 graphs covering 2298 ops with 15 graph breaks (7 unique) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100144 Approved by: https://github.com/ezyang	2023-04-30 17:13:57 +00:00
Animesh Jain	03806eddbf	[dynamo] Compile torchvision augmentations (#100292 ) Resolves https://github.com/pytorch/pytorch/issues/100112 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100292 Approved by: https://github.com/jansel	2023-04-29 02:59:41 +00:00
Tugsbayasgalan Manlaibaatar	d4bf76c2a4	Persist torch.assert in aten graph (#100101 ) This PR introduces a new operator called aten._assert_async.msg, which allows passing a tensor value and assertion message as inputs. As part of TorchDynamo, we're replacing the use of torch._assert with this new operator so that make_fx also knows how to handle assertions. This is subset of https://github.com/pytorch/pytorch/pull/98878, refer there for historic reviews. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100101 Approved by: https://github.com/jansel	2023-04-28 07:31:43 +00:00
Animesh Jain	006785cd46	[dynamo][hf_bigbird] Actually graph break on tensor.unsqueeze_/resize_ (#99986 ) Currently, we return `unimplemented` w/o a graph break on seeing a x.unsqueeze_ when x is input. This essentially means we fall back to the original frame. This PR actually graph breaks so that we can generate the continuation frame for the rest of the function. Instead of graph breaking at LOAD_ATTR, we delay the graph break to the actual CALL_FUNCTION, where its cleaner to graph break. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99986 Approved by: https://github.com/jansel	2023-04-26 18:50:06 +00:00
Aaron Gokaslan	e2a3817dfd	[BE] Enable C419 rule for any all shortcircuiting (#99890 ) Apparently https://github.com/pytorch/pytorch/pull/78142 made torch.JIT allow for simple generator expressions which allows us to enable rules that replace unnecessary list comprehensions with generators in any/all. This was originally part of #99280 but I split it off into this PR so that it can be easily reverted should anything break. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99890 Approved by: https://github.com/justinchuby, https://github.com/kit1980, https://github.com/malfet	2023-04-25 15:02:13 +00:00
Edward Z. Yang	0eb59ad093	Change export tracing_mode default to symbolic (#99877 ) Differential Revision: [D45231039](https://our.internmc.facebook.com/intern/diff/D45231039/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99877 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2023-04-25 00:12:12 +00:00
PyTorch MergeBot	c83e1f517d	Revert "Delete tracing_mode argument to export (#99555 )" This reverts commit `e9786149ab`. Reverted https://github.com/pytorch/pytorch/pull/99555 on behalf of https://github.com/DanilBaibak due to Break internal build	2023-04-24 08:21:41 +00:00
Jason Ansel	220712f4de	Fix torch.compile() on a skipped module (#98894 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98894 Approved by: https://github.com/xw285cornell	2023-04-22 16:10:55 +00:00
Edward Z. Yang	e9786149ab	Delete tracing_mode argument to export (#99555 ) You can have any color you want, as long as it's tracing_mode="symbolic" Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99555 Approved by: https://github.com/voznesenskym	2023-04-21 16:20:51 +00:00
Jason Ansel	d168161cd3	[dynamo] Fix example_inputs with unsqueeze_ (#98696 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98696 Approved by: https://github.com/yanboliang	2023-04-21 02:54:14 +00:00
Jason Ansel	e68e84ef8a	[dynamo] Support BUILD_MAP_UNPACK (#98664 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98664 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym	2023-04-20 18:41:50 +00:00
Edward Z. Yang	e47e8c9d98	Guard on default device (#99551 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99551 Approved by: https://github.com/voznesenskym, https://github.com/mlazos	2023-04-20 17:02:59 +00:00
William Wen	88c8c2b71b	[dynamo 3.11] implement 3.11 exceptiontable (#96511 ) Summary of changes: - Add CPython exceptiontable parsing/assembling functions in torch/_dynamo/bytecode_transformation.py, based on https://github.com/python/cpython/blob/3.11/Objects/exception_handling_notes.txt. - Add optional `exn_tab_entry` field to dynamo `Instruction`s in torch/_dynamo/bytecode_transformation.py in order to virtualize exception table entries (start, end, target instructions). - Add checks guarding against duplicate instructions in dynamo, so that jump/exceptiontable targets are unambiguous. See `get_indexof` in torch/_dynamo/bytecode_analysis.py. Ensure that bytecode generation throughout dynamo does not generate duplicate instructions. - Allow dynamo bytecode generation logic to generate nested exception table entries for developer convenience. CPython expects entries to not overlap, so we flatten nested entries during assembly in torch/_dynamo/bytecode_transformation.py:compute_exception_table. - Simulate the block stack in torch/_dynamo/symbolic_convert.py. CPython removed the block stack in 3.11, but dynamo needs it in order to keep track of active contexts. So we simulate the block stack as before by looking at exceptiontable entries in order to determine the current blocks. - Update context codegen in torch/_dynamo/resume_execution.py. The `SETUP_FINALLY` bytecode, which conveniently had a jump target to the finally block, was removed in 3.11, so we need to keep track of the jump target of the finally block using exceptiontables. Generating resume functions is more difficult since the original exceptiontable entries pointing to old cleanup code need to be modified to point to new cleanup code. - Fix a push_null bug in torch/_dynamo/variables/functions.py introduced by https://github.com/pytorch/pytorch/pull/98699 Pull Request resolved: https://github.com/pytorch/pytorch/pull/96511 Approved by: https://github.com/jansel, https://github.com/yanboliang, https://github.com/albanD	2023-04-18 07:53:24 +00:00
Jason Ansel	47c685def3	[dynamo] Support DELETE_ATTR (#98698 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98698 Approved by: https://github.com/yanboliang	2023-04-15 20:31:40 +00:00
Jason Ansel	e9be0b0fb9	[dynamo] Support functools.wraps (#98699 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98699 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym	2023-04-15 03:24:06 +00:00
Jason Ansel	baa06790f8	Unbreak torch.compile on macos (#99119 ) It seems like #96980 made torch.compile() completely ignore the `backend=` arg on macos rendering the entire API useless even if the user wasn't using mps tensors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99119 Approved by: https://github.com/msaroufim	2023-04-14 15:30:27 +00:00
Jason Ansel	f84078b40b	[dynamo] Remove pointless graphs from with no_grad() (#98956 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98956 Approved by: https://github.com/voznesenskym	2023-04-14 00:25:40 +00:00
Brian Hirsh	670c5cf962	AOTAutograd: fix 'Trying to backward through the graph a second time' error (#98960 ) Fixes https://github.com/pytorch/pytorch/issues/97745. See discussion and comment in the PR for more details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98960 Approved by: https://github.com/bertmaher, https://github.com/albanD	2023-04-13 10:25:07 +00:00
Animesh Jain	8654699c54	[dynamo] Remove _dynamo.skip and fold it in _dynamo.disable (#98899 ) Summary There is confusion between`_dynamo.skip` and `_dynamo.disable`. This removes the `_dynamo.skip` API. The functionality is still available via `_dynamo.disable(recursive=False)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/98899 Approved by: https://github.com/jansel	2023-04-12 17:33:26 +00:00
Andrew Gu	c9adc4c376	[Dynamo] De-dup graph inputs (#98775 ) ### Overview This PR de-duplicates graph inputs in TorchDynamo, using the `Source` as the unique identifier for each input. This closes https://github.com/pytorch/pytorch/issues/98743 and https://github.com/pytorch/pytorch/issues/98625. ### Details `VariableBuilder.wrap_tensor()` should return a `VariableTracker` for the passed-in `value: Tensor`. If `value` is duplicated, we should avoid calling `OutputGraph.create_graph_input()` and `OutputGraph.add_grapharg()`. - Note that `create_graph_input()` and `add_grapharg()` are not 1:1. For a constant source and either `wrap_sym()` or `wrap_unspecialized_primitive()`, TorchDynamo still calls `create_graph_input()` but not `add_grapharg()`. - Note that `create_graph_input()` should be called before constructing the corresponding `VariableTracker`. TorchDynamo needs the `fx.Proxy` object to pass to `wrap_fx_proxy()`. In this PR, the `OutputGraph` saves an additional mapping `input_source_to_var` from each graph input's `Source` to its `VariableTracker`, which works because `Source` is now hashable. This mapping should be updated each time `create_graph_input()` is called. However, since we must construct the `VariableTracker` after `create_graph_input()` returns, we must have a separate call to the `OutputGraph` to update the mapping. If anyone has any suggestion on how to coalesce this logic and avoid having to remember to update `input_source_to_var` for each `create_graph_input()`, I would love to hear it. <details> <summary> Alternate Approach</summary> Initially, I tried having TorchDynamo construct a new but equivalent `VariableTracker` for the duplicated tensor. However, I abandoned this approach after hitting an assertion in `def wrap_fx_proxy_cls()` due to `"example_value"` already being in the proxy node's metadata because we were reusing the primary tensor's `Proxy` object. Reusing the exact `VariableTracker` also seems less error-prone instead of requiring constructing a new but identical `VariableTracker`. </details> ### Testing #### Global Variable Test ``` import torch @torch.compile() def f(): return x + x x = torch.randn(3) f() ``` Before: ``` ====== Forward graph 0 ====== <eval_with_key>.6 class <lambda>(torch.nn.Module): def forward(self, arg0_1: f32[3], arg1_1: f32[3]): # File: /data/users/ezyang/b/pytorch/ff.py:5, code: return x + x add: f32[3] = torch.ops.aten.add.Tensor(arg0_1, arg1_1); arg0_1 = arg1_1 = None return (add,) ``` After (only `arg0_1` and no more `arg1_1`): ``` ====== Forward graph 0 ====== <eval_with_key>.4 class <lambda>(torch.nn.Module): def forward(self, arg0_1: f32[3]): # File: dynamo/test_dup_global.py:8, code: return x + x add: f32[3] = torch.ops.aten.add.Tensor(arg0_1, arg0_1); arg0_1 = None return (add,) ``` #### FSDP Test Before we error on ``` File "/.../pytorch/torch/_guards.py", line 244, in __post_init__ assert self.input_source_a != self.input_source_b ``` and now there is no error. --- The rename from `name_to_input` to `input_name_to_proxy` is not part of the core logic change and is a remnant from initial attempts. I can undo it later if desired, but I also feel that the new name is more informative. It also fixes the type annotation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98775 Approved by: https://github.com/ezyang, https://github.com/voznesenskym	2023-04-11 18:07:20 +00:00
William Wen	117da58b65	[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 ) Enable most dynamo unittests for 3.11. There are a few tests that are skipped due to failures that will be addressed in upcoming PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98104 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym, https://github.com/albanD, https://github.com/jansel, https://github.com/jerryzh168, https://github.com/malfet	2023-04-10 20:04:10 +00:00
Jason Ansel	f4858fa8ef	Improve dynamo support for autograd.Function (#98158 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98158 Approved by: https://github.com/yanboliang, https://github.com/anijain2305	2023-04-10 00:33:51 +00:00
YJ Shi	5ceae85f1c	[Dynamo] Include UserDict in clone_inputs (#97725 ) Fixes #97724 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97725 Approved by: https://github.com/yanboliang	2023-04-08 00:19:35 +00:00
PyTorch MergeBot	22411b6f02	Revert "[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 )" This reverts commit `0066f3405f`. Reverted https://github.com/pytorch/pytorch/pull/98104 on behalf of https://github.com/huydhn due to Sorry for reverting your PR, but it is failing on CPU 3.11 test in trunk `0066f3405f`. This is probably a landrace	2023-04-07 00:05:30 +00:00
William Wen	0066f3405f	[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 ) Enable most dynamo unittests for 3.11. There are a few tests that are skipped due to failures that will be addressed in upcoming PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98104 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym, https://github.com/albanD, https://github.com/jansel, https://github.com/jerryzh168, https://github.com/malfet	2023-04-06 23:15:48 +00:00
PyTorch MergeBot	e394f6db5a	Revert "Improve dynamo support for autograd.Function (#98158 )" This reverts commit `4716fa2411`. Reverted https://github.com/pytorch/pytorch/pull/98158 on behalf of https://github.com/huydhn due to Sorry for reverting your PR, but it seems to breaks MacOS trunk job `4716fa2411`. The signal was missing from the PR because we disabled MacOS job yesterday due to https://github.com/pytorch/pytorch/issues/98362	2023-04-06 18:15:02 +00:00
Jason Ansel	4716fa2411	Improve dynamo support for autograd.Function (#98158 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98158 Approved by: https://github.com/yanboliang, https://github.com/anijain2305	2023-04-06 16:44:37 +00:00
knwng	e943b120a3	Fix incorrectly getting the name of OrderedDict's index in dynamo (#96940 ) Fixes #96737 Pull Request resolved: https://github.com/pytorch/pytorch/pull/96940 Approved by: https://github.com/ezyang, https://github.com/voznesenskym	2023-04-05 03:53:45 +00:00
Tugsbayasgalan Manlaibaatar	75ac6fdcdd	Propogate dynamo shape_env to make_fx (#96437 ) Currently, when we use assume_static_by_default flag, dynamo won't produce any symbols for input tensors. But when we pass the dynamo generated graph onto make_fx via torchdynamo.export(aten_graph=True), there is no way to pass this flag. We enable this by directly passing the fake tensors dynamo used to make_fx and call make_fx with "real" mode with fake tensors from dynamo. Note that this is modified version of (https://github.com/pytorch/pytorch/pull/96143) Differential Revision: [D44561753](https://our.internmc.facebook.com/intern/diff/D44561753) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96437 Approved by: https://github.com/jansel, https://github.com/ezyang	2023-04-04 20:37:30 +00:00
Jason Ansel	b96fe9b61c	Fix issues related to ClassInstantier in HF models (#97997 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/97997 Approved by: https://github.com/anijain2305	2023-04-04 00:01:08 +00:00
Guang Yang	eeb18d1e54	Fix dynamo tests and re-enable internally (#97937 ) Summary: `:test_dynamo` has been broken for long time internally in Meta. This PR is to fix the broken test and re-enable it internally. - Using the root `pytest.ini` for pytest - Decouple tests so that one can be disabled with affecting others - Temporarily disable the test cases that require additional efforts to fix OSS CI doesn't provide test code coverage info. Meta internal test infra does. The value of re-enabling these tests internally is not only to collect test coverage info but help fbcode developers to build/test from fbcode. Test Plan: `buck test mode/dev-nosan //caffe2/test:test_dynamo` https://www.internalfb.com/intern/testinfra/testrun/7318349540623516 Differential Revision: D44325238 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97937 Approved by: https://github.com/ezyang	2023-04-03 20:47:13 +00:00
Jason Ansel	76074dc0a3	Improve support for dict subclasses (#98154 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98154 Approved by: https://github.com/anijain2305	2023-04-03 01:42:08 +00:00
Jason Ansel	35b3309539	Fix graph break from inline patched init (#98150 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98150 Approved by: https://github.com/anijain2305, https://github.com/yanboliang	2023-04-03 01:11:30 +00:00
Jason Ansel	bc9dd969e1	Support inlining no_grad() decorator (#98121 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98121 Approved by: https://github.com/anijain2305, https://github.com/voznesenskym	2023-04-03 00:24:56 +00:00
Jason Ansel	b9d3b3f595	Improve support for contextlib.nullcontext (#98111 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98111 Approved by: https://github.com/anijain2305	2023-04-02 02:33:14 +00:00
Tugsbayasgalan Manlaibaatar	7f9533e224	[Dynamo] Add UserError type (#97705 ) To get started the dynamo error message improvement effort, we discussed about adding new user error type which covers cases where the user used something that TorchDynamo doesn't support and there is clear actions they can take. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97705 Approved by: https://github.com/anijain2305, https://github.com/yanboliang	2023-04-01 01:18:00 +00:00
Animesh Jain	6b319d1525	[dynamo][graph break fix] inplace add for empty tuple (#97923 ) Fixes one of the frequent graph breaks in HF models. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97923 Approved by: https://github.com/yanboliang, https://github.com/jansel	2023-04-01 00:11:16 +00:00
Brian Hirsh	864ab93656	aot_autograd: avoid using intermediate_base logic unnecessarily (#97786 ) fixes https://github.com/pytorch/pytorch/issues/97691, see the issue for the proposed design. Now that we are employing AOTAutograd's "intermediate base" logic a lot less frequently, we might see some speedups in the benchmark suite. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97786 Approved by: https://github.com/jansel, https://github.com/soulitzer	2023-03-31 16:25:13 +00:00
Aaron Gokaslan	47dca20d80	[BE] Enable flake8-comprehension rule C417 (#97880 ) Enables flake8-comprehension rule C417. Ruff autogenerated these fixes to the codebase. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97880 Approved by: https://github.com/ezyang, https://github.com/kit1980, https://github.com/albanD	2023-03-30 14:34:24 +00:00
Edward Z. Yang	8372c5dc68	Refactor dynamic dims api, stateless internals, higher level export API (#96699 ) The purpose of this API is to execute a few large components of work: 1) Refactor all the internals of plumbing dynamic dimension information after dynamo to be stateless 2) Decouple allocation controls around dynamic dimensions from verification 3) For (2), for allocation, create an enum that dictates whether we are in DUCK (default today), STATIC (aka assume_static_default in the past), or DYNAMIC (aka user constrained, do not duck shape) 4) For (2), for verification, we separate out the list of dynamic ranges entirely from allocation. This means shape_env does not tracking for what we verify on, and instead, it is the callers job to invoke produce_guards() with the various things they want verified, specifically, with the valid ranges. We do use constrain ranges to refine value ranges when doing analysis. 5) We have decided, therefore, as an extension of (4) to double down on "late" checks versus "eager" checks, primarily because the mechanisms for gathering what actually matters happens during guards, and should be a purview of the caller seeking guards, not the shape env. However, for dynamo, these structures are essentially one and the same. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96699 Approved by: https://github.com/avikchaudhuri, https://github.com/ezyang	2023-03-29 16:55:49 +00:00
Edward Z. Yang	fb7f983357	Graph break on operators that fake tensor doesn't support (#97708 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/97708 Approved by: https://github.com/eellison	2023-03-28 19:49:54 +00:00
Edward Z. Yang	9e029f44b5	[EASY] Fix test that does nothing (#97722 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/97722 Approved by: https://github.com/jansel	2023-03-28 16:31:03 +00:00
Edward Z. Yang	32fdd44577	SymIntify maybe_multiply (#97675 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/97675 Approved by: https://github.com/albanD	2023-03-27 23:20:23 +00:00
Yanbo Liang	c1025af012	[Dynamo] throw better error message if assert with non-string message (#97297 ) Error message before this PR: ``` torch._dynamo.exc.Unsupported: missing: LOAD_ASSERTION_ERROR ``` After: ``` torch._dynamo.exc.Unsupported: assert with non-string message ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/97297 Approved by: https://github.com/tugsbayasgalan	2023-03-22 02:24:04 +00:00
Aaron Gokaslan	5471621497	[BE] Remove unnecessary dict comprehensions (#97116 ) Removes unnecessary dict comprehensions that optimize creation of dicts from iterables Pull Request resolved: https://github.com/pytorch/pytorch/pull/97116 Approved by: https://github.com/kit1980	2023-03-20 00:56:57 +00:00
Catherine Lee	8c2341c1b9	Remove pytest block list (#96698 ) Enables the last few files under pytest. xdist was causing problems with `profiler/test_profiler` `test_source_multithreaded` due to creating extra threads. Luckily we don't use it so we can disable it with `-p no:xdist`, but this is incompatible with pytest-rerunfailures==10.2, so upgrade to 10.3. I'd update the windows ami but idk how. `dynamo/test_optimizers` and `dynamo/test_repros` both had tests that used skip_if_pytest. https://github.com/pytorch/pytorch/pull/93251/files suggests that it is due to pytest assertion rewriting, so I added `PYTEST_DONT_REWRITE` to their module docstrings to prevent pytest from rewriting assertions. Disable test by issue in `dynamo/test_dynamic_shapes` seems sane. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96698 Approved by: https://github.com/huydhn, https://github.com/malfet	2023-03-16 04:22:42 +00:00
BowenBao	60a68477a6	Bump black version to 23.1.0 (#96578 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96578 Approved by: https://github.com/ezyang	2023-03-15 06:27:59 +00:00
Edward Z. Yang	99efe3ef5a	Generate type match guard for torch.Size input (#96421 ) I suppose hypothetically, if the user code ends up working polymorphically over the SizeVariable, in such a way that a tuple would work, this type match is not necessary. But we do not carefully test for this. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/96421 Approved by: https://github.com/jansel, https://github.com/voznesenskym	2023-03-12 23:04:55 +00:00
Edward Z. Yang	80ce1a934e	Fix flaky Dynamo export tests (#96488 ) Planning to do a full writeup later. The short story is, sometimes the following chain of events happens: 1. We turn on Dynamo's custom frame handler 2. GC triggers (and all of the finalizers run under Dynamo) 3. GC hits a GeneratorExit frame 4. You end up in the custom frame handler with throw_flag == TRUE and PyErr_Occurred() != NULL If this happens and we blindly call into other Python functions (like the Python callback), the executed Python code will immediately raise an exception (because there's already an ambient exception set.) This is very, very confusing. The fix is to defer to the regular handler when throw_flag is TRUE. I triggered this locally with ``` PYTHONUNBUFFERED=1 pytest test/dynamo/test_dynamic_shapes.py -k 'Unspec and export and not dupes and not reorder' -v -x -s ``` But I also have some tests which trigger the problem synthetically. Fixes https://github.com/pytorch/pytorch/issues/93781 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/96488 Approved by: https://github.com/albanD	2023-03-10 21:51:54 +00:00
Edward Z. Yang	bbe1b9bbd4	Fix https://github.com/pytorch/pytorch/issues/96278 (#96299 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/96299 Approved by: https://github.com/ngimel	2023-03-09 22:13:52 +00:00
jon-chuang	7a192cc51c	dynamo: wrap graph break inst in try except block - with context manager setup/teardown (#94758 ) Replacement to https://github.com/pytorch/pytorch/pull/94672. Follow up to https://github.com/pytorch/pytorch/pull/94137. We simply replace the set grad mode try except blocks with one for a more generic contextmanager (using `__enter__` and `__exit__`), storing the context manager into a `symbolic_local` for the duration of the try block. (see https://github.com/pytorch/torchdynamo/issues/207 for the original motivation) This allows us to handle calling inner functions with graph breaks for any arbitrarily deep nesting of live context managers subclassing `AbstractContextManager`. (see tests) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94758 Approved by: https://github.com/yanboliang	2023-03-06 14:04:17 +00:00
Edward Z. Yang	d303665d33	Make int unspecialization actually work (#95621 ) OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from https://github.com/pytorch/pytorch/pull/95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes https://github.com/pytorch/pytorch/issues/95469 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95621 Approved by: https://github.com/jansel, https://github.com/Chillee	2023-03-04 01:22:08 +00:00
Brian Hirsh	ddd6b53d80	fix embedding_backward_dense decomp with broadcasting (#95499 ) Fixes https://github.com/pytorch/pytorch/issues/95182 Pull Request resolved: https://github.com/pytorch/pytorch/pull/95499 Approved by: https://github.com/ezyang, https://github.com/ngimel	2023-02-28 00:24:40 +00:00
Brian Hirsh	84e2d957a1	fix primtorch handling for sub.scalar with alpha and float64 arg (#95421 ) This fixes the primtorch issue stemming from https://github.com/pytorch/pytorch/issues/95181 Pull Request resolved: https://github.com/pytorch/pytorch/pull/95421 Approved by: https://github.com/ngimel, https://github.com/SherlockNoMad	2023-02-28 00:24:38 +00:00
Edward Z. Yang	d78274b759	Automatically guard when SymInt is converted to int (#95479 ) During enablement, we disabled int() conversions because they were any easy way to footgun guards. We have enough of dynamic shapes working now that this is now causing spurious errors; e.g., if you feed a symbolic int to x.size(symint). We now allow for implicit conversions of SymInt to int here, posting a guard. We expect guard provenance to help people debug overspecialization. Fixes https://github.com/pytorch/pytorch/issues/95328 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/95479 Approved by: https://github.com/wconstab, https://github.com/voznesenskym, https://github.com/ngimel	2023-02-25 19:41:51 +00:00
Xuehai Pan	046e88a291	[BE] [3/3] Rewrite `super()` calls in test (#94592 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94592 Approved by: https://github.com/ezyang, https://github.com/seemethere	2023-02-12 22:20:53 +00:00
Aaron Gokaslan	67d9790985	[BE] Apply almost all remaining flake8-comprehension checks (#94676 ) Applies the remaining flake8-comprehension fixes and checks. This changes replace all remaining unnecessary generator expressions with list/dict/set comprehensions which are more succinct, performant, and better supported by our torch.jit compiler. It also removes useless generators such as 'set(a for a in b)`, resolving it into just the set call. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676 Approved by: https://github.com/ezyang	2023-02-12 01:01:25 +00:00
Edward Z. Yang	c028fc4e25	Decouple PT2 dynamic shapes from the functorch setting (#94469 ) The functorch setting still exists, but now it is no longer necessary: we infer use of Python dispatcher by checking if the ambient FakeTensorMode has a ShapeEnv or not. The setting still exists, but it is for controlling direct AOTAutograd use now; for PT2, it's sufficient to use torch._dynamo.config.dynamic_shapes. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/94469 Approved by: https://github.com/Chillee, https://github.com/voznesenskym, https://github.com/jansel	2023-02-09 06:41:41 +00:00
Aaron Gokaslan	3ce1ebb6fb	Apply some safe comprehension optimizations (#94323 ) Optimize unnecessary collection cast calls, unnecessary calls to list, tuple, and dict, and simplify calls to the sorted builtin. This should strictly improve speed and improve readability. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94323 Approved by: https://github.com/albanD	2023-02-07 23:53:46 +00:00
min-jean-cho	900e09c872	[Dynamo] Support torch.Tensor.fn as TorchVariable, not UserDefinedObjectVariable, preventing graph break (#93243 ) As found in #92709, thanks to @ngimel and @jansel, currently `torch.Tensor.fn` points to `UserDefinedObjectVariable` rather than `TorchVariable`. The root cause is due to https://github.com/pytorch/pytorch/pull/92709#pullrequestreview-1273357406. To prevent this, build `TorchVariable` of `torch.Tensor.fn` pointing to `torch.ops.aten.fn`. This issue propagates to `torch.Tensor.fn` causing graph break with `nopython=True`. ```python import torch import torch._dynamo as dynamo #op = torch.ops.aten.abs_ # no graph break op = torch.Tensor.abs_ # graph break args = torch.empty(10) def foo(args): return op(args) opt_foo = dynamo.optimize("inductor", nopython=True)(foo) y_ = opt_foo(args) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/93243 Approved by: https://github.com/jansel	2023-02-07 09:26:50 +00:00
Jason Ansel	74592a43d0	Update tests to use ConfigModule.patch (#93254 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93254 Approved by: https://github.com/voznesenskym	2023-02-02 00:56:55 +00:00
Jason Ansel	10910758f4	Make dynamo tests work under pytest (#93251 ) This now runs without error: ``` pytest test/dynamo ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/93251 Approved by: https://github.com/ezyang, https://github.com/voznesenskym, https://github.com/mlazos	2023-02-01 02:11:52 +00:00
Edward Z. Yang	902b4dba75	Change capture_scalar_outputs to use SymInt/SymFloat rather than Tensor to model scalars (#93150 ) Previously, Dynamo faked support for item() when `capture_scalar_outputs` was True by representing it internally as a Tensor. With dynamic shapes, this is no longer necessary; we can represent it directly as a SymInt/SymFloat. Do so. Doing this requires you to use dynamic shapes; in principle we could support scalar outputs WITHOUT dynamic shapes but I won't do this unless someone hollers for it. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Differential Revision: [D42885775](https://our.internmc.facebook.com/intern/diff/D42885775) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93150 Approved by: https://github.com/voznesenskym	2023-01-31 21:23:23 +00:00
Jason Ansel	53a669869c	Remove checks for refs/prims (#93250 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93250 Approved by: https://github.com/voznesenskym	2023-01-30 21:42:10 +00:00
Yanbo Liang	b3e422948d	[Dynamo] Support out variants of ops mutate the tensors out of the function frame (#93177 ) Fixes #93136 Pull Request resolved: https://github.com/pytorch/pytorch/pull/93177 Approved by: https://github.com/jansel	2023-01-29 22:22:58 +00:00
Yanbo Liang	a6b51448f5	[Dynamo] Supports if condition on user defined object (#90892 ) Fixes Meta internal user case, see the pattern in unit test. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90892 Approved by: https://github.com/jansel, https://github.com/mlazos	2023-01-26 04:19:32 +00:00
Edward Z. Yang	9c487a4b91	Fix #92814 : assertion error when explicitly provide out=None (#92873 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/92873 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2023-01-25 02:20:53 +00:00
Bin Bao	2037746e8d	[inductor] Rename aot_inductor_debug to aot_eager_decomp_partition (#92314 ) Summary: To make the naming more explicit, aot eager + decomposition + min_cut partition Pull Request resolved: https://github.com/pytorch/pytorch/pull/92314 Approved by: https://github.com/mlazos	2023-01-23 15:56:48 +00:00
Michael Lazos	bc9af74c99	Clear references to user tensors after compilation is finished (#92353 ) Fixes https://github.com/pytorch/torchdynamo/issues/2033 and https://github.com/pytorch/torchdynamo/issues/2005 Pull Request resolved: https://github.com/pytorch/pytorch/pull/92353 Approved by: https://github.com/eellison	2023-01-18 06:43:30 +00:00
BowenBao	a72bcb3388	Do not leak SkipFrame exception to parent frames (#91059 ) Discovered by https://github.com/pytorch/torchdynamo/issues/2000, we noticed the exception `SkipFrame` to avoid repeatedly compiling frame of loop with graph breaks could leak to parent frames while inlining, which then prevents compiling. This PR checks at inlining if such exception is raised and would instead raise an `Unsupported` to the outer frame. The original behavior and goal of #88857 is unaffected: the inner frame that has loop would still be skipped. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91059 Approved by: https://github.com/jansel, https://github.com/thiagocrepaldi	2023-01-13 17:11:22 +00:00
Edward Z. Yang	7078ad5b8c	Reland "AOT Autograd refactor + cleanup, handle intermediate views of bases, use view replay, fix non-tensor input handling" (#92076 ) Original PR: https://github.com/pytorch/pytorch/pull/89532 Pull Request resolved: https://github.com/pytorch/pytorch/pull/92076 Approved by: https://github.com/janeyx99, https://github.com/albanD	2023-01-12 21:32:05 +00:00
Tugsbayasgalan Manlaibaatar	b32b81a0c5	Make torch.split take symint as arg (#91724 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91724 Approved by: https://github.com/voznesenskym	2023-01-07 00:00:03 +00:00
Tugsbayasgalan Manlaibaatar	d4713b4c7d	[dynamo] Fix bug in tensor.item fake tensor propogation (#91668 ) When we run the node with fake value for tensor.item, it would previously error because the utility method doesn't know how to handle placeholder node. The tensor we are calling item can be input from user will be placeholder in the graph. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91668 Approved by: https://github.com/voznesenskym	2023-01-04 19:51:19 +00:00
Nuno Lopes	2b0abd4ce3	symbolic shapes: add parenthesis around FloorDiv expression (#91554 ) Before it would print the guard expression like: `23//2` and now: `2(3//2)` ```python print(23//2) # 3 print(2(3//2)) # 2 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/91554 Approved by: https://github.com/ezyang	2023-01-03 11:12:08 +00:00
Edward Z. Yang	bbea58d500	Stop using GraphArgs for shape env guard source tracking (#90911 ) GraphArgs worked fairly well, but it was still missing sources sometimes. Now, we maintain an auxiliary data structure which we MUST populate whenever we fakeify a tensor / allocate a bare SymInt. This should guarantee once and for all that every symbol is available. Should fix swin_base_patch4_window7_224. While I was at it, I moved fakeification utility back to builder as it was only used at once call site. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90911 Approved by: https://github.com/voznesenskym	2022-12-16 05:22:56 +00:00
Edward Z. Yang	8fd31ac4da	Preserve original GraphArgs for shape guard codegen (#90665 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/90665 Approved by: https://github.com/voznesenskym	2022-12-12 02:35:23 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	c8f5c194ca	Fix bug in dynamic shapes multiply (#90336 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/90336 Approved by: https://github.com/ezyang	2022-12-09 00:59:50 +00:00
Ram Rachum	351d73b97f	Fix exception causes all over the codebase (#90271 ) This is the continuation to #90134 and hopefully the final PR in this series. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90271 Approved by: https://github.com/kit1980	2022-12-07 04:29:00 +00:00
Richard Zou	4068c5467d	[Reland] Move functorch/_src to torch/_functorch (#88756 ) (#90091 ) This will be the last disruptive functorch internals change. Why are we moving these files? - As a part of rationalizing functorch we are moving the code in functorch/_src to torch/_functorch - This is so that we can offer the functorch APIs as native PyTorch APIs (coming soon) and resolve some internal build issues. Why are we moving all of these files at once? - It's better to break developers all at once rather than many times Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/90091 Approved by: https://github.com/anijain2305, https://github.com/ezyang	2022-12-03 14:17:15 +00:00
Edward Z. Yang	8d333761a9	When dealing with dupe arguments, prefer leafifying if possible (#89896 ) See code comment for details. I also had to do some extra fixes: * `run_functionalized_fw_and_collect_metadata` now is able to handle duplicated arguments * `aot_wrapper_dedupe` now always returns boxed compiled functions * `aot_wrapper_dedupe` is now applied to inference compiler along with autograd compiler (preexisting) Fixes https://github.com/pytorch/torchdynamo/issues/1939 Fixes DebertaV2ForQuestionAnswering DebertaForMaskedLM DebertaForQuestionAnswering DebertaV2ForMaskedLM Repro command: ``` python benchmarks/dynamo/huggingface.py --performance --float32 -dcuda --training --inductor --no-skip --dashboard --only DebertaForQuestionAnswering --cold_start_latency ``` Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89896 Approved by: https://github.com/bdhirsh	2022-12-01 13:42:29 +00:00
PyTorch MergeBot	218d9c6e09	Revert "Move functorch/_src to torch/_functorch (#88756 )" This reverts commit `52bc5c1cfe`. Reverted https://github.com/pytorch/pytorch/pull/88756 on behalf of https://github.com/clee2000 due to broke imports in tests `52bc5c1cfe` https://github.com/pytorch/pytorch/actions/runs/3574742513/jobs/6010814968 probably a landrace	2022-11-29 17:17:11 +00:00
Richard Zou	52bc5c1cfe	Move functorch/_src to torch/_functorch (#88756 ) This will be the last disruptive functorch internals change. Why are we moving these files? - As a part of rationalizing functorch we are moving the code in functorch/_src to torch/_functorch - This is so that we can offer the functorch APIs as native PyTorch APIs (coming soon) and resolve some internal build issues. Why are we moving all of these files at once? - It's better to break developers all at once rather than many times Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/88756 Approved by: https://github.com/ezyang	2022-11-29 13:55:42 +00:00
Brian Hirsh	e20ec44544	fixes for inductor <> batch norm (#89603 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89603 Approved by: https://github.com/albanD	2022-11-29 02:16:52 +00:00
PyTorch MergeBot	6ef702490d	Revert "Support set_rng_state with fake tensor (#89642 )" This reverts commit `2f8769d680`. Reverted https://github.com/pytorch/pytorch/pull/89642 on behalf of https://github.com/ezyang due to elias is right this is probably wrong	2022-11-28 19:13:33 +00:00
Edward Z. Yang	2f8769d680	Support set_rng_state with fake tensor (#89642 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89642 Approved by: https://github.com/anjali411	2022-11-28 14:49:30 +00:00
Edward Z. Yang	6904324781	Remove fake_tensor_propagation (#89646 ) You always have to run dynamo with fake tensors. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89646 Approved by: https://github.com/soumith	2022-11-25 03:27:32 +00:00
Edward Z. Yang	1aa1014b26	xfail maml test, instead of running it without fake tensor prop (#89645 ) A previous version of this patch graph breaks when torch.tensor fails, but that causes ``` PYTORCH_TEST_WITH_DYNAMO=1 python test/nn/test_embedding.py -k test_embedding_bag_1D_padding_idx_cpu_float32 ``` to start failing. Probably another latent bug that needs investigating. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89645 Approved by: https://github.com/albanD	2022-11-25 03:27:32 +00:00
Edward Z. Yang	9b13508ef3	Force test_rng_state to run with fake tensor prop (#89641 ) I'm not really sure what desertfire's intended follow up was on https://github.com/pytorch/pytorch/pull/87490 because when I remove the unsupported() call, dynamo tests pass. But the change here is conservative and I think strictly better than the current situation. The idea is to force fake tensor pop on for the test, and then just observe that we are doing a graph break. Clearly, export doesn't work, so I manually xfail it. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89641 Approved by: https://github.com/anjali411	2022-11-24 22:46:47 +00:00
Edward Z. Yang	c6be06d93a	Easy: These tests work with fake_tensor_propagation on (#89640 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89640 Approved by: https://github.com/anjali411, https://github.com/albanD	2022-11-24 22:46:45 +00:00
Edward Z. Yang	fd279fe85b	Make pytest work again on test/dynamo (#89631 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89631 Approved by: https://github.com/anjali411	2022-11-24 17:24:25 +00:00
Elias Ellison	a8d6b82167	Fix norm decomp when dtype is passed in (#89508 ) Fix for https://github.com/pytorch/torchdynamo/issues/1889. The wrapper was doing a downcast even when the dtype was explicitly passed in. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89508 Approved by: https://github.com/anijain2305	2022-11-23 20:49:09 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	04169c5b6e	Rewrite assert statement with torch._assert under config (#88246 ) This diff rewrites assert statement in python with torch._assert under config. The resulting graph looks something like: ``` SOURCE CODE: def f(x): assert x[0] == 3 return x.cos() CAPTURED GRAPH: graph(): %arg0 : [#users=2] = placeholder[target=arg0] %getitem : [#users=1] = call_function[target=operator.getitem](args = (%arg0, 0), kwargs = {}) %eq : [#users=1] = call_function[target=operator.eq](args = (%getitem, 3), kwargs = {}) %_assert : [#users=0] = call_function[target=torch._assert](args = (%eq, "assertion_error"), kwargs = {}) %cos : [#users=1] = call_method[target=cos](args = (%arg0,), kwargs = {}) return cos ``` Note that this introduces side-effect as it could error out while executing graph, but the assertion can eliminated via DCE if we choose to ignore it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88246 Approved by: https://github.com/jansel	2022-11-17 19:49:31 +00:00
PyTorch MergeBot	9d28775c1d	Revert "Rewrite assert statement with torch._assert under config (#88246 )" This reverts commit `62ba15e10e`. Reverted https://github.com/pytorch/pytorch/pull/88246 on behalf of https://github.com/DanilBaibak due to breaking internal builds	2022-11-16 09:45:49 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	62ba15e10e	Rewrite assert statement with torch._assert under config (#88246 ) This diff rewrites assert statement in python with torch._assert under config. The resulting graph looks something like: ``` SOURCE CODE: def f(x): assert x[0] == 3 return x.cos() CAPTURED GRAPH: graph(): %arg0 : [#users=2] = placeholder[target=arg0] %getitem : [#users=1] = call_function[target=operator.getitem](args = (%arg0, 0), kwargs = {}) %eq : [#users=1] = call_function[target=operator.eq](args = (%getitem, 3), kwargs = {}) %_assert : [#users=0] = call_function[target=torch._assert](args = (%eq, "assertion_error"), kwargs = {}) %cos : [#users=1] = call_method[target=cos](args = (%arg0,), kwargs = {}) return cos ``` Note that this introduces side-effect as it could error out while executing graph, but the assertion can eliminated via DCE if we choose to ignore it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88246 Approved by: https://github.com/jansel	2022-11-15 17:14:59 +00:00
Edward Z. Yang	cbdb683dc8	Add test that bias gradient is properly tested in same_two_models (#88995 ) See https://github.com/pytorch/pytorch/pull/88629#issuecomment-1313850324 for why this got broken. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/88995 Approved by: https://github.com/albanD	2022-11-15 02:55:43 +00:00
Animesh Jain	897d029a73	[reland][dynamo] fixes dict changed during runtime error (#88877 ) Reland https://github.com/pytorch/pytorch/pull/87526 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88877 Approved by: https://github.com/ezyang	2022-11-13 16:20:45 +00:00
Animesh Jain	2b12bfce88	[dynamo] Skip frame when graph break in a loop (#88857 ) This fixes excessing recompilation issue in tacotron2 but has few caveats - https://github.com/pytorch/torchdynamo/issues/330 For tacotron2, the repro is something like this ~~~ def inner(x): return torch.sin(x) def fn(x): for _ in range(100): inner(x) torch._dynamo.graph_break() return x ~~~ The problem here is that Dynamo has guards on the TUPLE_ITERATOR_LEN whenever a graph break happens. Therefore, we keep on recompiling. This PR checks if there is a backedge (helps with while loop) in presence of a graph break. If there is, Dynamo skips processing this frame. Therefore, Dynamo gets called when inner is called, and we compile only once. Note that, if there was no graph break, we will unroll the original loop, and see one graph with 100 sin operations (just as before, so no changes there). The caveat is - We are skipping the frame, so if we have something like this ~~~ def fn(x): for _ in range(100): # 1000s of lines of PyTorch code torch._dynamo.graph_break() return x ~~~ Dynamo will skip processing this frame, and might miss on the optimization. Completely open for suggestions. Happy to re-implement if there is a better way to handle this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88857 Approved by: https://github.com/jansel, https://github.com/yanboliang	2022-11-13 09:53:38 +00:00
Michael Voznesensky	06ce1338bc	[dynamo] Port all pytorch/dynamo and test/dynamo pieces over from symbolic-shapes branch (#88768 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88768 Approved by: https://github.com/jansel, https://github.com/ezyang	2022-11-13 04:50:21 +00:00
ydwu4	3765621356	torchdynamo support self.modules() for nn_module (#88695 ) This PR allows models to call self.modules() during dynamo tracing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88695 Approved by: https://github.com/voznesenskym	2022-11-12 20:00:51 +00:00
PyTorch MergeBot	dba887766b	Revert "torchdynamo support modules() for nn_module (#88023 )" This reverts commit `96104c7b7e`. Reverted https://github.com/pytorch/pytorch/pull/88023 on behalf of https://github.com/ydwu4 due to [Internal breakages] https://www.internalfb.com/intern/sandcastle/job/9007200067589062/	2022-11-08 18:37:48 +00:00
Yidi Wu	96104c7b7e	torchdynamo support modules() for nn_module (#88023 ) Differential Revision: D40820879 This diff allows models to call self.modules() during dynamo tracing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88023 Approved by: https://github.com/tugsbayasgalan, https://github.com/voznesenskym, https://github.com/jansel	2022-11-08 18:22:03 +00:00
Yanbo Liang	bd1ffc6501	[Dynamo] Fix bug: GradMode doesn't carry grad state correctly after graph break (#88537 ) Fixes https://github.com/pytorch/torchdynamo/issues/1446 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88537 Approved by: https://github.com/jansel	2022-11-07 18:03:31 +00:00
Animesh Jain	c1dd13fb2f	[dynamo] Support compare op for userfunctionvariable (#88372 ) Helps reduce graph breaks for one of the training models Pull Request resolved: https://github.com/pytorch/pytorch/pull/88372 Approved by: https://github.com/jansel	2022-11-03 17:05:50 +00:00
PyTorch MergeBot	bf7c996dcb	Revert "torchdynamo support modules() for nn_module (#88023 )" This reverts commit `eb91e8a534`. Reverted https://github.com/pytorch/pytorch/pull/88023 on behalf of https://github.com/mehtanirav due to [Internal breakages](https://www.internalfb.com/intern/sandcastle/job/13510799692855066/insights)	2022-11-02 22:35:14 +00:00
Yanbo Liang	ccf6b558a4	[Dynamo] UserFunctionVariable supports type & ABCMeta as arguments (#88257 ) Fixes https://github.com/pytorch/torchdynamo/issues/1785 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88257 Approved by: https://github.com/ezyang	2022-11-02 06:58:04 +00:00
Yidi Wu	eb91e8a534	torchdynamo support modules() for nn_module (#88023 ) Differential Revision: D40820879 This diff allows models to call self.modules() during dynamo tracing. cc @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx Pull Request resolved: https://github.com/pytorch/pytorch/pull/88023 Approved by: https://github.com/tugsbayasgalan, https://github.com/voznesenskym, https://github.com/jansel	2022-11-01 17:10:45 +00:00
Bin Bao	2c1efe7472	Enable some PyTorch core tests with inductor (#87490 ) Summary: 1) Graph break on torch.random.set_rng_state since it blocks running inductor core tests; 2) Add several inductor-specific skips; 3) Enable several core tests for inductor CI; cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87490 Approved by: https://github.com/eellison	2022-10-26 18:58:33 +00:00
Michael Voznesensky	bc19494814	[Dynamo] Symbolic shape guards (#87570 ) Introduces symbolic shape guards into dynamo. In this PR, we take the existing fake tensor infra and plumbing in dynamo and we start passing a shape_env around. This shape_env does not get plumbed down to middle layers / backend yet - it only collects expressions from frontend invocations at the moment. We then translate these expressions into guards at the point where we take other guards installed throughout dynamo - and add them to check_fn. Part 1 of https://docs.google.com/document/d/1QJ-M4zfMkD-fjHIqW089RptjLl9EgozZGCceUbvmgfY/edit# cc @jansel @lezcano @fdrocha @mlazos @soumith @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87570 Approved by: https://github.com/ezyang	2022-10-25 21:15:40 +00:00
Tugsbayasgalan Manlaibaatar	7b5978254f	Add named_buffers to torchdynamo nn_module (#87644 ) Fixes: https://github.com/pytorch/torchdynamo/issues/1738 cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87644 Approved by: https://github.com/jansel	2022-10-25 17:00:56 +00:00
Animesh Jain	e46a8971e6	[dynamo] Support class members in nn modules (#87531 ) Fixes https://github.com/pytorch/torchdynamo/issues/1740 @voznesenskym cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang @penguinwu Pull Request resolved: https://github.com/pytorch/pytorch/pull/87531 Approved by: https://github.com/jansel	2022-10-24 18:48:49 +00:00
Edward Z. Yang	96691865b9	[dynamo] Unify raise_on_* config to suppress_errors and raise by default (#87440 ) I noticed that a lot of bugs are being suppressed by torchdynamo's default error suppression, and worse yet, there's no way to unsuppress them. After discussion with voz and soumith, we decided that we will unify error suppression into a single option (suppress_errors) and default suppression to False. If your model used to work and no longer works, try TORCHDYNAMO_SUPPRESS_ERRORS=1 to bring back the old suppression behavior. Signed-off-by: Edward Z. Yang <ezyang@fb.com> cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang Pull Request resolved: https://github.com/pytorch/pytorch/pull/87440 Approved by: https://github.com/voznesenskym, https://github.com/albanD	2022-10-21 17:03:29 +00:00
Jason Ansel	8f71e8de7e	Sync changes from pytorch/torchdynamo, enable tests (#86950 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86950 Approved by: https://github.com/Chillee	2022-10-14 23:08:58 +00:00
Jason Ansel	c7c09722ad	Move TorchDynamo into PyTorch core (#86461 ) Context: https://github.com/pytorch/torchdynamo/issues/1588 This PR moves [TorchDynamo](https://github.com/pytorch/torchdynamo) and TorchInductor into PyTorch core. - `torchdynamo` becomes `torch._dynamo` - `torchinductor` becomes `torch._inductor` This PR was generated by running `copy_to_core.sh` in https://github.com/pytorch/torchdynamo/pull/1538 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86461 Approved by: https://github.com/voznesenskym	2022-10-13 23:18:06 +00:00

... 3 4 5 6 7 ...

436 Commits