pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Oleg Khabinov	c3bc65d9d8	[dynamo] Restore constant tensor original FQNs (#116086 ) Differential Revision: D52192693 Pull Request resolved: https://github.com/pytorch/pytorch/pull/116086 Approved by: https://github.com/angelayi, https://github.com/muchulee8	2023-12-20 02:10:02 +00:00
zhxchen17	f78f23d753	[export] Turn off output value from sources for export. (#115442 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/115442 Approved by: https://github.com/tugsbayasgalan	2023-12-12 22:41:23 +00:00
David Berard	5c0976fa04	Revert "[dynamo] guarded config (#111299 )" (#115386 ) This reverts commit `5927e9cbf2`. Differential Revision: [D51959266](https://our.internmc.facebook.com/intern/diff/D51959266) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115386 Approved by: https://github.com/yanboliang, https://github.com/malfet ghstack dependencies: #115384, #115401, #115385	2023-12-11 19:35:42 +00:00
David Berard	b36fc6790e	Revert "[dynamo] Guard on `HAS_GRAPH_BREAKS` if graph breaks are present (i.e. cache miss if compiled object requires nopython) (#114073 )" (#115384 ) This reverts commit `0bb29f9450`. Differential Revision: [D51959267](https://our.internmc.facebook.com/intern/diff/D51959267) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115384 Approved by: https://github.com/malfet	2023-12-10 18:16:02 +00:00
PyTorch MergeBot	3e47e3f441	Revert "[export] Fix graph output mismatch issue with constant outputs. (#115280 )" This reverts commit `622688fab9`. Reverted https://github.com/pytorch/pytorch/pull/115280 on behalf of https://github.com/atalman due to ghfirst issue when importing, will reland this PR ([comment](https://github.com/pytorch/pytorch/pull/115280#issuecomment-1847903624))	2023-12-08 22:10:03 +00:00
zhxchen17	622688fab9	[export] Fix graph output mismatch issue with constant outputs. (#115280 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/115280 Approved by: https://github.com/tugsbayasgalan	2023-12-07 06:11:08 +00:00
Jason Ansel	aa70e31610	[dynamo] Fix MutableSideEffects returning alias (#115095 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115095 Approved by: https://github.com/yanboliang	2023-12-05 19:01:03 +00:00
rzou	c56d91ba39	Log pt2_compliant custom ops used with torch.compile (#115083 ) Summary: We already log non-pt2_compliant ops. This PR extends the logging to include pt2_compliant custom ops. We do not log all pt2_compliant ops (i.e. including builtin ops) because it would probably take too much memory Test Plan: Tested locally Pull Request resolved: https://github.com/pytorch/pytorch/pull/115083 Approved by: https://github.com/yanboliang, https://github.com/williamwen42	2023-12-05 00:51:33 +00:00
Yanbo Liang	8ef44e6110	[autograd.Function] Fix torch.compile w/ once_differentiable leads to opaque graph break (#113625 ) Fixes #106893 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113625 Approved by: https://github.com/zou3519	2023-12-04 21:37:06 +00:00
Jez Ng	47e6cc4d22	Remove yet more type-ignores in dynamo/inductor (#114684 ) Probably the last big batch for a while Pull Request resolved: https://github.com/pytorch/pytorch/pull/114684 Approved by: https://github.com/Skylion007	2023-11-28 22:09:38 +00:00
voznesenskym	081c5b3adc	Add Stateful/Stateless symbolic contexts, use fresh fake mode for dynamo backends (#113926 ) (#114526 ) Summary: The primary problem we are setting out to solve here is fake tensor freshness. Before this PR, fake tensors after dynamo represented fake tensors at the end of trace, so subsequent retraces like aot_autograd would start off with fake tensors in the wrong (end result) state, rather than their expected fresh state. The solution here is to start a fresh fake mode, and re-fakify the tensors. The nuance comes from ensuring that symbols are uniformly created for the symbolic sizes and strides of the tensor. This PR is the result of a lot of back and forth with ezyang and eellison. Initially, the first pass at this was not super different from what we have in the PR - the broad strokes were the same: 1) We cache source->symbol in shape_env 2) We pass policy objects around, stored at dynamo fakificaiton time, and reused for later fakification 3) We create a new fake mode for backends (from https://github.com/pytorch/pytorch/pull/113605/files) This is ugly, and has some layering violations. We detoured our decision making through a few other alternatives. Immutable/mutable fake tensor mode was the most interesting alternative, https://github.com/pytorch/pytorch/pull/113653, and was struck down on concerns of complexity in fake mode combined with it not covering all edge cases. We also detoured on what to do about tensor memoization returning back potentially different tensors than requested, and if that was an anti pattern (it is) we want to hack in with the symbol cache (we don't). We went back to the drawing board here, but with a few concessions: 1) the cache for source->symbol must live outside of shape_env, for both lifecycle, and layering reasons 2) A good amount of work needs to be done to pipe policy around fake_mode and meta_utils correctly, to cover all the cases (ezyang did this) cc penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 aakhundov kadeng imported-using-ghimport Test Plan: Imported from OSS Reviewed By: huydhn, Chillee Differential Revision: D51566250 Pulled By: voznesenskym Pull Request resolved: https://github.com/pytorch/pytorch/pull/114526 Approved by: https://github.com/Chillee, https://github.com/huydhn	2023-11-26 23:40:32 +00:00
PyTorch MergeBot	2f3beb715c	Revert "Add Stateful/Stateless symbolic contexts, use fresh fake mode for dynamo backends (#113926 )" This reverts commit `2ca1119d53`. Reverted https://github.com/pytorch/pytorch/pull/113926 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/113926#issuecomment-1822713852))	2023-11-22 12:52:33 +00:00
voznesenskym	2ca1119d53	Add Stateful/Stateless symbolic contexts, use fresh fake mode for dynamo backends (#113926 ) The primary problem we are setting out to solve here is fake tensor freshness. Before this PR, fake tensors after dynamo represented fake tensors at the end of trace, so subsequent retraces like aot_autograd would start off with fake tensors in the wrong (end result) state, rather than their expected fresh state. The solution here is to start a fresh fake mode, and re-fakify the tensors. The nuance comes from ensuring that symbols are uniformly created for the symbolic sizes and strides of the tensor. This PR is the result of a lot of back and forth with @ezyang and @eellison. Initially, the first pass at this was not super different from what we have in the PR - the broad strokes were the same: 1) We cache source->symbol in shape_env 2) We pass policy objects around, stored at dynamo fakificaiton time, and reused for later fakification 3) We create a new fake mode for backends (from https://github.com/pytorch/pytorch/pull/113605/files) This is ugly, and has some layering violations. We detoured our decision making through a few other alternatives. Immutable/mutable fake tensor mode was the most interesting alternative, https://github.com/pytorch/pytorch/pull/113653, and was struck down on concerns of complexity in fake mode combined with it not covering all edge cases. We also detoured on what to do about tensor memoization returning back potentially different tensors than requested, and if that was an anti pattern (it is) we want to hack in with the symbol cache (we don't). We went back to the drawing board here, but with a few concessions: 1) the cache for source->symbol must live outside of shape_env, for both lifecycle, and layering reasons 2) A good amount of work needs to be done to pipe policy around fake_mode and meta_utils correctly, to cover all the cases (@ezyang did this) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113926 Approved by: https://github.com/ezyang, https://github.com/eellison	2023-11-20 23:06:37 +00:00
Edward Z. Yang	59ad51e10a	Insert deferred runtime asserts into Dynamo FX graph (#113958 ) During the course of fake tensor propagation (and, potentially, also Dynamo execution, although I do not believe it is possible to exercise this right now), we may generate deferred runtime asserts, which represent "guards" on unbacked symbols which cannot be immediately checked on entry to a code block; instead, they have to be checked at runtime. However, we currently accumulate these deferred runtime asserts into the ShapeEnv, and don't do anything with them. This PR modifies Dynamo to automatically insert these runtime asserts into the FX graph, before passing it on to the backend compiler. The assert format coincides with the export assert format as practiced in `torch/_export/passes/add_runtime_assertions_for_constraints_pass.py`, but actually these passes are completely disjoint right now as I only handle deferred runtime asserts, while export only handles ranges (which I should probably also handle, but don't in this PR.) The assertions must be inserted by Dynamo, because you could potentially then pass the asserts onto another backend like "eager" which no longer looks at the ShapeEnv before. Thanks to previous work in export, these asserts are preserved in AOTAutograd, but they are dropped by Inductor, which needs to be fixed in future work. This piece will be a bit awkward, as Inductor would have preferred to work with the Sympy expressions directly, ah well. Here is what the Dynamo traced FX graph looks like for the test in question: ``` <eval_with_key>.0 class GraphModule(torch.nn.Module): def forward(self, L_x_ : torch.Tensor): l_x_ = L_x_ # File: /data/users/ezyang/c/pytorch/wu.py:8, code: y = x.item() item = l_x_.item() # No stacktrace found for following nodes ge_1 = item >= 0 scalar_tensor_default = torch.ops.aten.scalar_tensor.default(ge_1); ge_1 = None _assert_async_msg = torch.ops.aten._assert_async.msg(scalar_tensor_default, "Deferred runtime assert failed: i0 >= 0, where i0 was defined by 'item' (for more information, run with TORCH_LOGS=+dynamo,dynamic)"); scalar_tensor_default = None # File: /data/users/ezyang/c/pytorch/wu.py:9, code: torch._check_is_size _check_is_size = torch._check_is_size(item) # File: /data/users/ezyang/c/pytorch/wu.py:10, code: if y >= 0: ge = item >= 0; item = None # File: /data/users/ezyang/c/pytorch/wu.py:11, code: return x * 2 mul = l_x_ * 2; l_x_ = None return (mul,) ``` Note that we actually keep the `_check_is_size` in the graph redundantly. However, assert_async is retained in the graph, whereas _check_is_size ends up getting DCE'ed. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/113958 Approved by: https://github.com/aakhundov, https://github.com/tugsbayasgalan ghstack dependencies: #113978	2023-11-20 21:25:11 +00:00
Jon Chuang	0bb29f9450	[dynamo] Guard on `HAS_GRAPH_BREAKS` if graph breaks are present (i.e. cache miss if compiled object requires nopython) (#114073 ) Fixes https://github.com/pytorch/pytorch/issues/114059 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114073 Approved by: https://github.com/ezyang	2023-11-20 19:32:03 +00:00
Jon Chuang	5927e9cbf2	[dynamo] guarded config (#111299 ) --- Fixes: https://github.com/pytorch/pytorch/issues/110682 Replaces: https://github.com/pytorch/pytorch/pull/111074 The guards are installed based on config that is valid at the call to `torch.compile`, rather than at any subsequent call / triggered compilation. Subsequent compilations will restore the config if there is a config mismatch of the existing global config with the saved config. TODO: - [X] add tests Follow up PRs: - [x] add revised cache size computation (follow up PR: #111300 , based on: https://github.com/pytorch/pytorch/pull/107496) - [ ] handle run-only mode? - [ ] config restoration itself is not thread-safe (tracked: https://github.com/pytorch/pytorch/issues/111150) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111299 Approved by: https://github.com/ezyang	2023-11-17 09:59:58 +00:00
jon-chuang	c233cef8fd	[dynamo] Enforce lifetime of output fx graph and its metadata (#113517 ) Fixes https://github.com/pytorch/pytorch/issues/113516 Also asserts that by the time we modify the output's graph nodes, we are in the irreversible state of `should_exit`. Remove `creation_timestamp` from graph as it is only consumed by dynamo for checkpoint restore. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113517 Approved by: https://github.com/ezyang	2023-11-17 07:34:43 +00:00
Edward Z. Yang	8a183bf1ab	[BE] Consistently query tracing context for fake mode in Dynamo (#113768 ) Split from https://github.com/pytorch/pytorch/pull/113666 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/113768 Approved by: https://github.com/bdhirsh	2023-11-16 19:31:10 +00:00
Tugsbayasgalan Manlaibaatar	a7b75f586a	[RELAND] Disallow skipping dynamo (#110222 ) Previous discussion: https://github.com/pytorch/pytorch/pull/109476 In this PR, I made following additions to the original PR: 1) Unlifted graph module now runs the runtime assertions in its' forward call. 2) When we retrace, we make sure we run the assertions to make sure user is tracing the module with correct inputs with respect to the assumptions we made during first tracing. The way I do is that I create new graph module type with modified call method. And the runtime assertions happen under torchdynamo.disable so that it is just run in eager directly. The reason is we don't this to be traced part of the graph. 3) Both ep.module and capture_pre_autograd now returns _UnliftedGraphModule. Differential Revision: [D51078056](https://our.internmc.facebook.com/intern/diff/D51078056) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110222 Approved by: https://github.com/zhxchen17	2023-11-14 16:02:01 +00:00
Jez Ng	68278cf7a8	[dynamo] Initialize tensor_weakref_to_sizes_strides with a weak dict (#113412 ) Spotted while working on getting output_graph.py to typecheck. The type hint indicates that it was intended to be initialized with a WeakIdKeyDictionary, but the actual runtime value was a regular dict. Not sure if there's some kind of test we should add for this fix. Looks like the code was originally added in https://github.com/pytorch/pytorch/pull/100128. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113412 Approved by: https://github.com/Skylion007, https://github.com/voznesenskym ghstack dependencies: #113413, #113518, #113519	2023-11-13 22:53:47 +00:00
Jez Ng	d00c983b63	[dynamo] Make {testing,debug_utils,utils}.py pass follow_imports typechecking (#113519 ) Notes: * `debug_insert_nops` in testing.py was passing `None` to the compiler_fn parameter of `OutputGraph`, hence the modifications there. * I added `disable-error-code="method-assign"` to debug_utils.py as it does several such assignments. I guess mypy doesn't like it because it makes code near-impossible to safely typecheck. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113519 Approved by: https://github.com/Skylion007 ghstack dependencies: #113413, #113518	2023-11-11 22:15:46 +00:00
Jez Ng	a8cf04fd2a	[inductor] Make {output_graph,pad_mm}.py pass follow_imports typechecking (#113413 ) I changed OutputGraph.nn_modules' type to `Dict[str, Any]` because it seems that `register_attr_or_module` can populate it with essentially any type. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113413 Approved by: https://github.com/Skylion007	2023-11-11 22:15:46 +00:00
Jason Ansel	3914566c73	[dynamo] Refactor OrderedDict to dict (#113234 ) In Python3 all dicts are ordered. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113234 Approved by: https://github.com/oulgen, https://github.com/lezcano	2023-11-08 09:27:08 +00:00
Jason Ansel	9664190952	[dynamo] Eagerly install guards (#111415 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111415 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306	2023-11-07 19:55:19 +00:00
Jason Ansel	2964682490	[dynamo] Add LazyVariableTracker (#111306 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111306 Approved by: https://github.com/voznesenskym	2023-11-07 19:55:19 +00:00
Jason Ansel	64f326097b	[dynamo] Refactor handling of state in context managers (#112939 ) The prior handling was rather buggy... Pull Request resolved: https://github.com/pytorch/pytorch/pull/112939 Approved by: https://github.com/voznesenskym, https://github.com/yanboliang ghstack dependencies: #112897, #112898, #112920, #112899	2023-11-05 03:10:30 +00:00
Jon Chuang	fd6e571207	[aot_autograd / dynamo] restore grad_mode and other globals to state prior to tracing; add grad_mode mutations to runtime wrapper (#112396 ) Fixes https://github.com/pytorch/pytorch/issues/112072 Grad mode mutations, which are the responsibility of aotautograd, need to be persisted outside of the graph as side-effects in the runtime wrapper. To facilitate this, and to maintain global state hygeine, we restore the grad mode to their value prior to tracing, for both dynamo (alongside other global states) and aot_autograd. This is in line with the assumption that aot_autograd should work as though it were called from eager, before the given GraphModule has been run. It is assumed that other global state (autocast mode, torch function) already maintain hygeine via their context manager APIs. --- ### Future Work? Should we also do this for: 1. autocast mode 2. torch_function_enabled Answer: no. (at least at present) It is assumed that other global state (autocast mode, torch function) already maintain hygeine via their context manager APIs. Furthermore, mutating this state directly is currently unsupported in dynamo, unlike `set_grad_enabled` Repro: ```python import torch def fn(x): x = x + 1 torch.set_autocast_enabled(True) return x + 1 print(torch.compile(fn, fullgraph=True)(torch.zeros(1))) # torch._dynamo.exc.Unsupported: call_method UserDefinedObjectVariable(set_autocast_enabled) __call__ [ConstantVariable(bool)] {} ``` ```python import torch def fn(x): x = x + 1 torch.overrides.BaseTorchFunctionMode.__enter__() return x + 1, torch._C._is_torch_function_enabled() print(torch.compile(fn, fullgraph=True)(torch.zeros(1))) # torch._dynamo.exc.Unsupported: 'call_function TorchFunctionMode.__enter__ in skip_files /home/jonch/Desktop/Programming/mlsys/pytorch/torch/overrides.py, skipped according skipfiles.SKIP_DIRS' ``` ~~I believe 1. is clearly yes - even if it is a corner case (autocast only has ctx manager public API, while dynamo will always emit ctx manager exits before compiling the graph, so one needs to use the internal _enter_autocast API to directly perform a mutation).~~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/112396 Approved by: https://github.com/bdhirsh	2023-11-03 16:14:09 +00:00
Yanbo Liang	6f681ab5d9	[torch.compile] autograd.Function with multiple return values (#112475 ) Fixes #106389 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112475 Approved by: https://github.com/zou3519	2023-11-02 04:43:49 +00:00
Richard Zou	4f5acf8329	Log non-pt2_compliant ops encountered by Dynamo (#112581 ) Summary: See internal diff for more changes. Whenever we encounter a non-compliant op, we add it to a set on the OutputGraph. When a compilation event happens, we log the contents of this set. I'm planning on flipping the `only_allow_pt2_compliant_ops` config from False to True after the logging determines that existing models do not use non-compliant ops. Test Plan: - Tested the logging internally locally Differential Revision: D50884828 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112581 Approved by: https://github.com/yanboliang	2023-11-01 22:53:16 +00:00
rzou	1483097679	Update how Dynamo decides to graph break on an OpOverloadPacket (#112200 ) Previously, under config.only_allow_pt2_compliant_ops, Dynamo graph breaks when it see an OpOverloadPacket where any overloads are not PT2 compliant. This is potentially brittle: if someone (unlikely) adds a new overload for a custom operator, then this would cause a previously non-graph-breaking call to the OpOverloadPacket to graph break. In this PR: - When Dynamo is about to write a call to an operator to the FX graph, we check if it is PT2 compliant. - For OpOverload, we check to see if the tag is on it - For OpOverloadPacket, we do overload resolution and check to see if the tag is on the OpOverload that it resolves to. Test Plan: - new tests, existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/112200 Approved by: https://github.com/bdhirsh	2023-10-31 19:10:37 +00:00
Peter Bell	66c32d099a	Use `pytree.arg_tree_leaves` everywhere (#112394 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112394 Approved by: https://github.com/lezcano ghstack dependencies: #112391, #112392, #112393	2023-10-31 15:57:06 +00:00
Jon Chuang	479f5eb029	[dynamo] Remove dead code - `real_value_tensor_positive_aliases` (#111911 ) (legality) It is currently impossible (and should remain impossible) - (due to dedup guards - all static tensors are unique) - to access the same static tensor value from a different source. As for `getattr(nn.Module, tensor)` source collisions, we will never instantiate a `nn.Module getattr` source for a static tensor, due to: - side-effect tracking (as long as we track all static tensors - see also https://github.com/pytorch/pytorch/pull/112025 for extra sanity check) - See: `c8a5bb451e/torch/_dynamo/variables/builder.py (L227)` (no worse) In any case, this field is currently unused. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111911 Approved by: https://github.com/voznesenskym	2023-10-30 23:10:52 +00:00
Peter Bell	bbd5b935e4	Use `pytree.tree_leaves` everywhere (#112324 ) This changes all the instances I could find of `tree_flatten(...)[0]` or `x, _ = tree_flatten` to use `tree_leaves`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112324 Approved by: https://github.com/lezcano ghstack dependencies: #112327, #112323	2023-10-30 03:39:04 +00:00
Michael Lazos	1d9a7f9e43	[Reland] TensorWithTFOverride inheritance from TensorVariable (#111766 ) Accidentally merged https://github.com/pytorch/pytorch/pull/111730 with ghstack, so relanding Pull Request resolved: https://github.com/pytorch/pytorch/pull/111766 Approved by: https://github.com/jansel	2023-10-23 04:33:16 +00:00
voznesenskym	303c54dbd9	[dynamo] share a subgraph tracer across fwd and bwd in autograd.Function (#111588 ) Fixes https://github.com/pytorch/pytorch/issues/111031 The current design of autograd.Function tracing in dynamo is that we: 1) speculate fwd, and if its fine, 2) speculate bwd, and if its fine 3) install the .apply in the graph alongside fwd guards The mechanism for doing so involves creating HOPs for fwd, bwd, and apply. The speculation for fwd and bwd create their own subtracer. This is fine, until a proxy created in fwd is used in bwd. For a simple example, consider: ``` class Foo(Function): @staticmethod def forward(ctx, x): ctx.x0 = x.size(0) return x * 2 @staticmethod def backward(ctx, grad_out): return grad_out * ctx.x0 ``` the value stored at `x0` is a proxy - but it is a proxy belonging to the fwd speculation subtracer. Rather than teaching it to the subtracer for bwd, we choose to create a subtracer that covers both fwd and bwd speculation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111588 Approved by: https://github.com/zou3519	2023-10-20 21:32:02 +00:00
Bert Maher	0013611c81	[inductor] Allow backend compiler to skip (#111153 ) Summary: Sometimes the backend compiler can encounter a transient failure (in our case, a remote build service infrequently hits a hiccup). We'd rather run eager than fail the training job. Test Plan: Inject an exception in the RE path and run: ``` buck2 run @//mode/{opt,inplace} //caffe2/test/inductor:smoke ``` Differential Revision: D50234516 Pull Request resolved: https://github.com/pytorch/pytorch/pull/111153 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-10-14 02:44:15 +00:00
PyTorch MergeBot	2b6f281e5c	Revert "Remove dead code (#111207 )" This reverts commit `c2ed714f54`. Reverted https://github.com/pytorch/pytorch/pull/111207 on behalf of https://github.com/huydhn due to Sorry for reverting this, but it breaks lint `c2ed714f54` ([comment](https://github.com/pytorch/pytorch/pull/111207#issuecomment-1762126366))	2023-10-13 19:56:11 +00:00
lezcano	c2ed714f54	Remove dead code (#111207 ) This dictionary is not used anywhere. The _make_dupe_guard function does not exist anymore Pull Request resolved: https://github.com/pytorch/pytorch/pull/111207 Approved by: https://github.com/Skylion007, https://github.com/voznesenskym	2023-10-13 18:46:27 +00:00
soulitzer	bc49b1e50b	[reland] Use is_symbolic instead of testing isinstance in some place (#110676 ) reland of https://github.com/pytorch/pytorch/pull/110372 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110676 Approved by: https://github.com/ezyang ghstack dependencies: #110673, #110674, #110675	2023-10-10 19:37:17 +00:00
PyTorch MergeBot	bcd44dac60	Revert "Use is_symbolic instead of testing isinstance in some place (#110372 )" This reverts commit `8672d64fed`. Reverted https://github.com/pytorch/pytorch/pull/110372 on behalf of https://github.com/PaliC due to bottom diff is causing a plethora of internal failures ([comment](https://github.com/pytorch/pytorch/pull/110372#issuecomment-1749795074))	2023-10-05 23:37:37 +00:00
soulitzer	8672d64fed	Use is_symbolic instead of testing isinstance in some place (#110372 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110372 Approved by: https://github.com/ezyang ghstack dependencies: #110044, #110369, #110370, #110371	2023-10-04 22:56:42 +00:00
Kazuaki Ishizaki	2c1b009e39	Fix typo under torch/_dynamo directory (#110459 ) This PR fixes typo of comments in files under `torch/_dynamo` directory Pull Request resolved: https://github.com/pytorch/pytorch/pull/110459 Approved by: https://github.com/colesbury	2023-10-04 16:05:05 +00:00
ydwu4	5f7eff0adb	Replace node.meta source_fn with source_fn_stack (#108595 ) A resubmit of https://github.com/pytorch/pytorch/pull/108447. Copy over the descriptions: This is a follow-up of the discussion in https://github.com/pytorch/pytorch/pull/108356, where we want to repalce source_fn with source_fn_stack Before this PR, for the following example: ```python backend = EagerAndRecordGraphs() @torch.compile(backend=backend, fullgraph=True) def cond_f(pred, pred2, x, y): def true_fn(pred2, x, y): return x + y def false_fn(pred2, x, y): def true_fn2(x, y): return x.sin() - y.cos() def false_fn2(x, y): return x.cos() - y.sin() return control_flow.cond(pred2, true_fn2, false_fn2, (x, y)) return control_flow.cond(pred, true_fn, false_fn, (pred2, x, y)) ``` The graph captured is shown below: ```python class GraphModule(torch.nn.Module): def forward(self, L_pred_ : torch.Tensor, L_pred2_ : torch.Tensor, L_x_ : torch.Tensor, L_y_ : torch.Tensor): l_pred_ = L_pred_ l_pred2_ = L_pred2_ l_x_ = L_x_ l_y_ = L_y_ cond_true_1 = self.cond_true_1 cond_false_1 = self.cond_false_1 cond = torch.ops.higher_order.cond(l_pred_, cond_true_1, cond_false_1, [l_pred2_, l_x_, l_y_]); l_pred_ = cond_true_1 = cond_false_1 = l_pred2_ = l_x_ = l_y_ = None return (cond,) class GraphModule(torch.nn.Module): def forward(self, l_pred2_, l_x_, l_y_): add = l_x_ + l_y_; l_x_ = l_y_ = None return add class GraphModule(torch.nn.Module): def forward(self, l_pred2_, l_x_, l_y_): cond_true_0 = self.cond_true_0 cond_false_0 = self.cond_false_0 cond = torch.ops.higher_order.cond(l_pred2_, cond_true_0, cond_false_0, [l_x_, l_y_]); l_pred2_ = cond_true_0 = cond_false_0 = l_x_ = l_y_ = None return cond class GraphModule(torch.nn.Module): def forward(self, l_x_, l_y_): sin = l_x_.sin(); l_x_ = None cos = l_y_.cos(); l_y_ = None sub = sin - cos; sin = cos = None return sub class GraphModule(torch.nn.Module): def forward(self, l_x_, l_y_): cos = l_x_.cos(); l_x_ = None sin = l_y_.sin(); l_y_ = None sub = cos - sin; cos = sin = None return sub ``` the source_fn for inner cond, sin, cos will be a (name, target) tuple: ``` ('cond', <torch._ops.HigherOrderOperator object at xxx>) ('sin', 'sin') ('cos', 'cos') ('sub'. <built-in function sub>) ``` After this pr, the source_fn_stack will be a list of (name, target) tuple. The bottom of stack is the end of the list. ``` [('cond', <torch._ops.HigherOrderOperator object at xxx>), ('cond', <torch._ops.HigherOrderOperator object at xxx>)], [('cond', <torch._ops.HigherOrderOperator object at xxx>), ('cond', <torch._ops.HigherOrderOperator object at xxx>), ('sin', 'sin')], [('cond', <torch._ops.HigherOrderOperator object at xxx>), ('cond', <torch._ops.HigherOrderOperator object at xxx>), ('cos', 'cos')] [('cond', <torch._ops.HigherOrderOperator object at xxx>), ('cond', <torch._ops.HigherOrderOperator object at xxx>), ('sub', <built-in function sub>)] ``` Test Plan: See added tests in test_higher_order_ops.py and modify existing test. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108595 Approved by: https://github.com/angelayi, https://github.com/zou3519	2023-09-28 18:18:36 +00:00
Michael Voznesensky	3beed41e12	[Easy] Remove hook warning where source is always guaranteed (#109898 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109898 Approved by: https://github.com/ezyang	2023-09-25 14:36:28 +00:00
William Wen	b904432e82	[dynamo] preserve some FX node metadata of GraphModules (#107067 ) Requested from @tugsbayasgalan: we want dynamo to preserve some FX node metadata when we trace `GraphModule`s (`nn_module_stack`, `source_fn`, `stack_trace`). This is helpful for the case when we export an aten-level `GraphModule`, add some (possibly non-torch or non-aten) ops, and we want to transform the graph back into an aten-level graph. Without preserving metadata, future passes that look at metadata (e.g. quantization passes) won't work. This feature also has the additional benefit of being able to preserve origin line of code when `print_readable`'ing a `GraphModule`. This is helpful when debugging graphs that have passed through dynamo several times. The added unit test demonstrates the added functionality of this PR. ~This PR is currently a proof-of-concept implementation that shows that preserving node metadata across dynamo is possible.~ This PR preserves node metadata across dynamo by doing the following: - ~inject a counter variable into the `GraphModule` source code, which is incremented every time a node is run~ - Construct a line number -> node index map in `GraphModule` as the source code is being generated. - pass a list of node metadata and the line number map to dynamo's bytecode analyzer - ~dynamo traces the counter as a `ConstantVariable`, so when we create a new proxy, we can determine which original node index this proxy corresponds by looking at the value of the traced counter~ - When we create a new proxy, get the current instruction's line number, and get the node index using the line number map - index into the original node metadata ~using the counter variable's tracked value.~ ~Some things that should be addressed off the top of my head:~ - ~Is this feature even desirable? (Do we really want Dynamo to have special behavior for `GraphModules`? Should we expect users to re-export `GraphModules`?)~ - ~Is there a better approach than to use a counter? We considered using node names, line numbers, and assuming that proxies are created in the same order as the nodes, but each of these 3 have shortcomings. For node names, we only have access to new node names, not the old ones. Using line number is fragile. The third is problematic since not all created nodes go through `create_proxy` (e.g. inputs). We currently generate a line number to node index map when the `GraphModule`'s code is generated.~ - ~What's the best way to send data across the "CPython gap"? That is, it is not obvious how to cleanly pass data from dynamo's `eval_frame.py:_TorchDynamoContext.__call__` to `symbolic_convert.py:InstructionTranslatorBase.__init__`. In this PR, we use a global.~ Differential Revision: [D49257108](https://our.internmc.facebook.com/intern/diff/D49257108) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107067 Approved by: https://github.com/jansel	2023-09-15 23:29:14 +00:00
Animesh Jain	f786fbdebd	Reland 3rd try [finishing colesbury's PR 100642] Guard on nn.Module dicts and type (#109323 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109323 Approved by: https://github.com/huydhn, https://github.com/voznesenskym	2023-09-15 08:44:14 +00:00
ydwu4	94a54b89aa	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we could error out when there is side-effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo() # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. Note: More lines are printed for debug log due to newly added context manager and guard adds . Test Plan: Removed original tests that raise on different backend and add a new test to test whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-14 15:49:30 +00:00
Michael Voznesensky	064ae9ff33	Support register_hook on input tensors (#108903 ) The strategy in this PR is pretty straightforward. There are 2 kinds of hooks: 1) Hooks on objects with sources (inputs, params) 2) Hooks on objects w/o sources (intermediaries, and outputs). Note: As outputs can be made simple by how dynamo handles residuals, they could actually be handled as if they were inputs, but, for the sake of this PR, we will refer to hooks as either hooks on inputs (sourced), or hooks on intermediaries (not sourced). The plan: For tensors w/ a source: We record registered hooks, store them as a global, and associate them with the tensor in residuals. This means that when dynamo goes to create the frame, where we produce bytecode to stitch together our PT2 modified bytecode with the original eager code, we call `register_hook`. This registration of hooks in residuals is sound because (a) it happens right after a Pt2 frame region ends and (b) we know that the tensor is alive in f_locals, f_globals, or a module in the users invoking frame. This means we can soundly know it will be around to invoke `register_hook` on. As long as we guard on the identity of the lifted function, this is sound to do. For tensors w/o a source: Graph break - we will support this in a subsequent PR Handles: An interesting new component here is the creation of a `STORE_FAST `->`LOAD_FAST` associated with the handle, the return result of `register_hook`. If the user code stored the result of `register_hook` in a handle, we need to honor that. We do so by interceding into `STORE_FAST`, and recording the name of the local variable as directed by user code. We then honor that same name in the reconstructed bytecode. If the user did not store a hook, we merely pop the produced value to preserve the stack. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108903 Approved by: https://github.com/ezyang ghstack dependencies: #108846, #109092	2023-09-14 01:52:21 +00:00
PyTorch MergeBot	c5e7588613	Revert "[dynamo] preserve some FX node metadata of GraphModules (#107067 )" This reverts commit `1d42148fee`. Reverted https://github.com/pytorch/pytorch/pull/107067 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/107067#issuecomment-1717321061))	2023-09-13 09:59:33 +00:00
Michael Voznesensky	de0b18fad9	Use user directed names for variables where possible (#109092 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109092 Approved by: https://github.com/ezyang ghstack dependencies: #108846	2023-09-13 07:44:04 +00:00

1 2 3 4 5

235 Commits