Previously, `node.meta["nn_module_stack"]` had type `Dict[str, Tuple[str, class]]` when exported, and later `Dict[str, Tuple[str, str]]` after de/serialization. This PR changes it to consistently be `Dict[str, Tuple[str, str]]` for round-trippability, i.e.
```
{..., 'L__self___conv': ('conv', 'torch.nn.modules.conv.Conv2d')}
```
`source_fn_stack` is left untouched in this PR.
Note: the `Union[type, str]` type annotations in ONNX exist because ONNX goes through both `export.export()` and `_dynamo.export()` (which still has the original `Dict[str, Tuple[str, class]]` format). `nn_module_stack` from `export.export()` should consistently have the new format, and we verify/test for that in `_trace.py`.
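For reference, a minimal sketch of how the new string-valued format can be inspected on an exported graph (the module and input shapes here are illustrative, not from this PR):
```python
import torch
from torch.export import export

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(3, 8, 3)

    def forward(self, x):
        return self.conv(x)

ep = export(M(), (torch.randn(1, 3, 8, 8),))
for node in ep.graph.nodes:
    # Values are now (path, fully qualified class name string), e.g.
    # ('conv', 'torch.nn.modules.conv.Conv2d'), both before and after serialization.
    print(node.name, node.meta.get("nn_module_stack"))
```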
Pull Request resolved: https://github.com/pytorch/pytorch/pull/123308
Approved by: https://github.com/zhxchen17, https://github.com/thiagocrepaldi
Fixes #118795
This is a graph breaking partial fix for #120914. We still need -actual- module parametrization tracing support, but at least it doesn't blow up hard now.
**Background**: Module parametrization injects a property as the module parameter attribute that calls a `nn.Module` whose forward takes in a module parameter and returns a reparametrized module parameter.
Example:
```
import torch
import torch.nn as nn
from torch.nn.utils.parametrize import register_parametrization

class MyParametrization(nn.Module):
    def forward(self, X):
        # This reparametrization just negates the original parameter value
        return -X

m = nn.Linear(...)
p = MyParametrization()
register_parametrization(m, "weight", p)

# Accessing the "weight" attribute will invoke p's forward() on m's original weight
# and return the output as the new weight.
# m.weight here is now an injected property that does the above instead of an actual Parameter.
# This property is defined in torch/nn/utils/parametrize.py.
m.weight

# NB: Parametrization changes the module type (e.g. torch.nn.utils.parametrize.ParametrizedLinear)
print(type(m))
```
**Problem 1**: Dynamo has special tracing rules for things in `torch.nn`. Parametrizing a module changes the type of the module and the parametrized attribute, so now these rules wrongly affect tracing here. To fix this:
* For parametrized modules, call `convert_to_unspecialized()` to restart analysis where Dynamo starts inlining the module.
**Problem 2**: The issue seen in #118795 is that Dynamo will see a dynamically constructed tensor when `m.weight` is called and introduce it into its `tensor_weakref_to_sizes_strides` cache during fake-ification. This tensor is also made a graph input, since it's a module parameter. When guards are created for this module parameter input, the logic calls `m.weight` again and tries to look the result up in the cache, but this is a different tensor now, giving the `KeyError` symptom (see the sketch after the fix bullets below). To fix this:
* Replace Dynamo's `tensor_weakref_to_sizes_strides` cache with a `input_source_to_sizes_strides` cache.
* This cache was originally introduced in #100128.
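To illustrate the caching problem described above, a minimal sketch (not Dynamo internals; the module and parametrization are made up) showing that every access to a parametrized attribute produces a fresh tensor, so any cache keyed on the tensor object itself will miss on the second lookup:
```python
import torch
import torch.nn as nn
from torch.nn.utils.parametrize import register_parametrization

class Negate(nn.Module):
    def forward(self, X):
        return -X

m = nn.Linear(4, 4)
register_parametrization(m, "weight", Negate())

w1 = m.weight  # runs Negate.forward on the original weight
w2 = m.weight  # runs it again, producing a different tensor object
print(w1 is w2)  # False -- looking up the second tensor under the first
                 # tensor's key is what produced the KeyError symptom
```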
Pull Request resolved: https://github.com/pytorch/pytorch/pull/121041
Approved by: https://github.com/anijain2305
`unimplemented` is a function that raises an error, so
`raise unimplemented(...)` never reaches the `raise`.
Another related issue is that `raise unimplemented(...) from e`
doesn't attach the exception cause correctly. I fix this by adding
a `from_exc` argument to `unimplemented`.
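A simplified sketch of the pattern (the exception type and helper below are stand-ins, not the actual Dynamo code):
```python
class Unsupported(RuntimeError):
    pass

def unimplemented(msg: str, *, from_exc=None):
    # Raises unconditionally, so wrapping the call in `raise ...` is dead code.
    if from_exc is not None:
        raise Unsupported(msg) from from_exc
    raise Unsupported(msg)

try:
    try:
        int("not a number")
    except ValueError as e:
        # Before: `raise unimplemented(msg) from e` never attached `e` as the
        # cause, because unimplemented() raised before the outer raise ran.
        unimplemented("int() conversion not supported here", from_exc=e)
except Unsupported as err:
    print(type(err.__cause__).__name__)  # ValueError -- the cause is attached
```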
Pull Request resolved: https://github.com/pytorch/pytorch/pull/122136
Approved by: https://github.com/lezcano
Summary:
It seems `kwargs` is already supported in `_infer_argument`, so we don't need the extra assertion `len(kwargs) == 0`.
This change ensures compatibility with `torch.compile()` for LazyModules called with kwargs inputs, preventing graph breaks.
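An illustrative sketch of the case this unblocks (exact graph-break behavior depends on the PyTorch version; shapes and backend are arbitrary):
```python
import torch
import torch.nn as nn

m = nn.LazyLinear(out_features=8)

@torch.compile(backend="eager")
def run(x):
    return m(input=x)  # keyword call reaches the lazy-initialization path

print(run(torch.randn(4, 16)).shape)  # torch.Size([4, 8])
```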
Test Plan: Unit tests and CI
Differential Revision: D53558778
Pull Request resolved: https://github.com/pytorch/pytorch/pull/119445
Approved by: https://github.com/yanboliang
The original motivation for MYPYINDUCTOR was a faster type checking configuration that only checked a subset of files. With the removal of `follow_imports = ignore`, we are now able to use dmypy to do fast incremental typechecking, eliminating the need for this.
Perhaps erroneously, when I teed up this PR I elected to delete the `follow_imports = skip` designations in mypy-inductor.ini. This led to a number of extra type error suppressions that I manually edited. You will need to review.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/118432
Approved by: https://github.com/Skylion007
ghstack dependencies: #118414, #118418
After this refactor:
* ```TorchVariable``` definition and all references are removed.
* All ```is_allowed``` references except one are removed.
- The only remaining one is in ```torch/_dynamo/decorators:_disallow_in_graph_helper```. It is called when users put the ```disallow_in_graph``` decorator on a function. Since we use the lists in ```trace_rules``` to decide a function's trace rule, the decorator would only be used on custom functions rather than torch functions. I'll defer this to a separate decorator refactor PR.
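For context, the decorator in question is typically used like this (a usage sketch based on the documented public API; not part of this refactor):
```python
import torch
import torch._dynamo

# Force Dynamo to exclude a function from the graph and graph-break on it.
torch._dynamo.disallow_in_graph(torch.sub)

@torch.compile(backend="eager")
def fn(a, b):
    return torch.sub(a, b)  # Dynamo now graph-breaks around this call

print(fn(torch.ones(3), torch.ones(3)))  # tensor([0., 0., 0.])
```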
Pull Request resolved: https://github.com/pytorch/pytorch/pull/116312
Approved by: https://github.com/jansel
1. Removes calls to `replace_all` and `clone` and makes VTs mutable.
2. Properly handles Tuple Iterator mutation. Previously TupleIterator variables would only be properly reconstructed if they were advanced at least once in a frame. On calls to `next`, the source information would be lost (due to constructing a new iterator without using builder), which would ensure that during codegen the variable would be reconstructed from scratch. Now that VTs are mutated, the source is never lost, so we need to properly track mutation and handle it by replaying calls to `next` at the end of the modified bytecode.
3. Added a test checking iadd side effects; this was missing from our unit test coverage (a small sketch of the case follows this list).
4. Fixed two incorrect sources: DelayGraphBreakVariable and UserMethodVariable both relied on setting the source to AttrSource(parent, name) at the call site of `var_getattr`.
5. Fixed a bug in in-place adding for lists: it would set the resulting VariableTracker's source to `None`, which would utilize a different reconstruct path in codegen. Now this is handled explicitly by reconstructing vars when allow_cache=`False`, so that during side effect replay, the mutated var is correctly updated.
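A small sketch of the iadd side-effect case mentioned in item 3 (backend and shapes are illustrative):
```python
import torch

def fn(x, lst):
    lst += [x + 1]  # in-place add on an input list
    return x * 2

compiled = torch.compile(fn, backend="eager")
data = []
y = compiled(torch.ones(2), data)
print(len(data))  # 1 -- the mutation inside the compiled frame is replayed
                  # onto the caller's list
```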
In subsequent PRs:
* Refactoring side effect tracking to be significantly simpler (I think we only need an `is_modified` flag)
* Refactor `next_variables` iterator to match the signature of `next`
* Remove all references to `options` in the code
* Refactor VTs representing mutable collections to implement their own mutation update handling
* Remove clone and/or make it specific to lists for creating slices
* Add mutation tracking/replay for sets
* Add mutation tracking/replay for iter.py
* Remove setting the source in builder (it's set at the top level after a var is returned)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113725
Approved by: https://github.com/jansel
Fixes https://github.com/pytorch/pytorch/issues/113041
In the case where we have an object represented as an UnspecializedNNModuleVariable, the source of an attribute on that object is `AttrSource(base=NotNNModuleSource(base=NNModuleSource(base=AttrSource(base=LocalSource(local_name='self', cell_or_freevar=False), member='seq'))), member='b')`. This causes dynamo to add an extra attribute as it doesn't go to this [`register_attr` step](eddce3c054/torch/_dynamo/variables/builder.py (L955-L962)).
However if we have an object represented as a UserDefinedObjectVariable, the source of an attribute on that object is `AttrSource(base=NNModuleSource(base=AttrSource(base=LocalSource(local_name='self', cell_or_freevar=False), member='seq')), member='b')`.
It seems that UnspecializedNNModuleVariables should behave in the same way as UserDefinedObjectVariables, but the source in these two cases is different. So, I removed the part that changes the source for UnspecializedNNModuleVariables, and it seems to work! And CI is green (+ reduced graph breaks).
```
    def test_unspecialized_nnmodule(self):
        class TestModule(torch.nn.Module):
            def __init__(self):
                super().__init__()
                self.a = torch.tensor(1.0)

            def forward(self, x: torch.Tensor) -> torch.Tensor:
                return x + self.a

        def forward_hook(
            module: torch.nn.Module, inputs, output
        ) -> torch.Tensor:
            return 2 * output

        seq = torch.nn.Sequential(TestModule()).eval()
        seq.b = torch.tensor(2)
        handle = seq.register_forward_hook(forward_hook)

        class M(torch.nn.Module):
            def __init__(self):
                super().__init__()
                self.seq = seq

            def forward(self, x):
                # self.seq.b has source: AttrSource(base=NotNNModuleSource(base=NNModuleSource(base=AttrSource(base=LocalSource(local_name='self', cell_or_freevar=False), member='seq'))), member='b')
                return self.seq(x) + self.seq.b

        inp = (torch.randn(2, 8),)
        ep = export(M(), inp)
```
```
    def test_user_defined_var(self):
        class TestModule(torch.nn.Module):
            def __init__(self):
                super().__init__()
                self.a = torch.tensor(1.0)

            def forward(self, x: torch.Tensor) -> torch.Tensor:
                return x + self.a

        class UserDefined:
            def __init__(self):
                self.test_module = TestModule()
                self.b = torch.tensor(2)

            def __call__(self, x):
                return self.test_module(x)

        class M(torch.nn.Module):
            def __init__(self):
                super().__init__()
                self.seq = UserDefined()

            def forward(self, x):
                # self.seq.b has source: AttrSource(base=NNModuleSource(base=AttrSource(base=LocalSource(local_name='self', cell_or_freevar=False), member='seq')), member='b')
                return self.seq(x) + self.seq.b

        inp = (torch.randn(2, 8),)
        ep = export(M(), inp)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113852
Approved by: https://github.com/yanboliang
In my work on making guards installed eagerly (look up the stack), I found that our checkpoint/restore mechanism is very broken. There is lots of state (especially in shape_env) which we don't checkpoint and restore properly. We also have lots of mutable state on variable trackers already which is not checkpointed/restored. (See other PRs in this stack for some spot fixes.)
Since we wanted to get rid of this anyway for making VariableTracker mutable, I figured I would just switch to restarting analysis.
For other usages of copy_graphstate/restore_graphstate:
1) Many usages were pointless and not needed; these are removed in PRs below this one.
2) Some other usage (similar to this one) is removed in PRs above this.
3) The tricky one I am not handling is higher_order_ops, which uses checkpoint/restore a lot. There might be some cases there where this speculate/restart trick won't work.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/112902
Approved by: https://github.com/voznesenskym
The strategy for supporting functools partials is relatively straightforward.
There are 2 cases we need to support:
**1) Functools partials as input**
In this case, we are first seeing the functools partial and it is guaranteed to have a source. As such, the args, keywords, and func of the functools partial are passed through VariableBuilder. As this is the first time we are seeing these objects (as it is an input), we re-enter VariableBuilder with a source referencing the args, keywords, and func as attributes of the input to produce:
- func: A callable VariableTracker (UDF, TorchVariable, etc) depending on the value of `func`
- args: List[VariableTracker] - note, not ListVariableTracker!
- keywords: Dict[str, VariableTracker]
A major benefit of this structure is that it very elegantly matches the args to `call_function`.
We then compose a FunctoolsPartialVariable from the VariableTrackers made above.
**2) Functools partials created within compile**
In this case, we already have all the args as known VTs, and thus just compose a FunctoolsPartialVariable as we do for case (1).
For both (1) and (2) - we propagate all guards from the func, args, and keyword VTs to the FunctoolsPartialVariable
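An illustrative example of case (1), a functools partial passed in as an input (the function name, values, and backend are made up for the sketch):
```python
import functools
import torch

def scale_and_add(x, scale, bias):
    return x * scale + bias

partial_fn = functools.partial(scale_and_add, scale=2.0, bias=1.0)

@torch.compile(backend="eager")
def fn(x, f):
    return f(x)

print(fn(torch.ones(3), partial_fn))  # tensor([3., 3., 3.])
```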
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108846
Approved by: https://github.com/ezyang, https://github.com/jansel
Summary:
Original commit changeset: 33650f7cb0fb
Original Phabricator Diff: D48833682
Test Plan: See T162942232 for how we figured out that this diff caused significant numeric difference.
Reviewed By: voznesenskym
Differential Revision: D49082219
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108823
Approved by: https://github.com/xw285cornell
Summary: Currently the node metadata "nn_module_stack" is only being used by export. For some exported models, we still want to retain nn_module_stack for unspecialized modules for various purposes. This diff adds a path to also record nn_module_stack when an unspecialized module has a source available.
Test Plan: test_export_nn_module_stack_patched_module
Differential Revision: D48841193
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108281
Approved by: https://github.com/yanboliang, https://github.com/tugsbayasgalan
Before the PR, when running `super(MyConv1d, self).forward` or `super(MyConvTranspose, self).forward`, dynamo would create a graph break when executing NNModuleVariable.call_method and raise an unimplemented error for name=_conv_forward / _output_padding. See the issue for full detail: https://github.com/pytorch/pytorch/issues/101155
After the PR, for torch.nn.conv modules with function name _conv_forward / _output_padding, we inline the function with tx.inline_user_function_return.
Code refactor: added NNModuleVariable._inline_user_function_return_helper to consolidate tx.inline_user_function_return into one place to keep the code DRY. After the refactor, there are 2 unconsolidated inline_user_function_return calls with different ```fn``` and ```source``` logic; the code is still DRY. For local testing, they are covered by test_modulelist, test_moduledict, test_conv_call_super_forward_directly and test_conv_transpose_call_super_forward_directly in test_modules.py.
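An illustrative repro of the pattern being fixed (shapes and backend are arbitrary; `fullgraph=True` assumes a build that includes this change):
```python
import torch
import torch.nn as nn

class MyConv1d(nn.Conv1d):
    def forward(self, x):
        # Previously graph-broke when Dynamo hit Conv1d.forward -> self._conv_forward.
        return super().forward(x)

compiled = torch.compile(MyConv1d(3, 8, 3), backend="eager", fullgraph=True)
print(compiled(torch.randn(2, 3, 16)).shape)  # torch.Size([2, 8, 14])
```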
Differential Revision: [D46494460](https://our.internmc.facebook.com/intern/diff/D46494460)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/102509
Approved by: https://github.com/yanboliang
Opening this so I can discuss with @albanD
I built a proof of concept of an in place API for an nn.Module that allows us to save and load a torch.compiled model with no issues https://github.com/msaroufim/mlsys-experiments/blob/main/save-compiled-model.py
So users can run `model.compile()` and then run `torch.save(model, "model.pt")` and `torch.load("model.pt")` with no issues, unlike the rather strange current suggestion we give to users, which is `opt_mod = torch.compile(mod); torch.save(mod, "model.pt")`
Right now I'm trying to extend this to work for nn.modules more generally
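A minimal sketch of the intended save/load workflow, assuming the in-place `nn.Module.compile()` API described above is available (`weights_only=False` is shown because newer `torch.load` defaults refuse to unpickle a full module):
```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(8, 8), nn.ReLU())
model.compile()                    # in place, unlike opt_mod = torch.compile(model)
torch.save(model, "model.pt")
loaded = torch.load("model.pt", weights_only=False)
print(loaded(torch.randn(2, 8)).shape)  # torch.Size([2, 8])
```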
TODO: Failing tests
* [x] torch.jit.load -> the issue was because of aliasing `__call__` to `_call_impl`; `_call_impl` used to be skipped but now it no longer is, so I expanded the skip check. I added an explicit `torch.jit.load()` test now, which @davidberard98 suggested
* [x] functorch seems to be a flake - ran locally and it worked `pytest functorch/test_eager_transforms.py`
* [x] a test infra flake - `test_testing.py::TestImports::test_no_mutate_global_logging_on_import_path_functorch`
* [x] It seems like I broke inlining in dynamo, though: `python -m pytest test/dynamo/test_dynamic_shapes.py -k test_issue175`. Chatted with Voz about it but was still not entirely sure how to fix it - found a workaround after chatting with @yanboliang
* [x] `pytest test/dynamo/test_modules.py`, `test/dynamo/test_dynamic_shapes` and `test/dynamo/test_misc.py` seem to be failing in CI, but trying them out locally they all pass with 0 failures
* [x] `pytest test/profiler/test_profiler_tree.py` - these tests have ProfilerTrees explicitly printed and will now break if `__call__` is not in the tree - ran with `EXPECT_ACCEPT=1`
* [x] `pytest test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning` a flake, ran this locally and it works fine
* [x] I reverted my changes to `_dynamo/nn_module.py` since it looks like @wconstab is now directly handling `_call_impl` there, but this was triggering infinite inlining, which crashed
* [x] Tried overriding `__call__` instead; Python doesn't like this, though: https://github.com/pytorch/pytorch/pull/97565#issuecomment-1524570439
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97565
Approved by: https://github.com/aaronenyeshi, https://github.com/albanD, https://github.com/voznesenskym
**TL;DR**: This PR fixes handling for lazy modules where `cls_to_become is None`. In those cases, we should leave the type of the lazy module as the old value.
**Details**:
Lazy modules are intended to be initialized at execution; some of them are also supposed to switch to a different type after they have been initialized. However, not all are supposed to switch; see this logic from `nn/modules/lazy.py`
```python
def _infer_parameters(self, ...):
    ...
    if module.cls_to_become is not None:
        module.__class__ = module.cls_to_become
```
i.e., we should leave the module type as the old value if `module.cls_to_become is None`. This PR updates dynamo's handling to match this behavior.
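A hypothetical lazy module of the kind this PR targets, keeping `cls_to_become = None` so its type is left unchanged after initialization (this is a sketch, not the added unit test):
```python
import torch
import torch.nn as nn
from torch.nn.modules.lazy import LazyModuleMixin
from torch.nn.parameter import UninitializedParameter

class LazyScale(LazyModuleMixin, nn.Module):
    cls_to_become = None  # stay as LazyScale after initialization

    def __init__(self):
        super().__init__()
        self.scale = UninitializedParameter()

    def initialize_parameters(self, x):
        if self.has_uninitialized_params():
            with torch.no_grad():
                self.scale.materialize((x.shape[-1],))
                self.scale.fill_(1.0)

    def forward(self, x):
        return x * self.scale

m = LazyScale()
out = torch.compile(m, backend="eager")(torch.randn(2, 4))
print(type(m).__name__)  # LazyScale -- the module keeps its original type
```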
Test `test_lazy_module_no_cls_to_become` added to `test/dynamo/test_module.py`.
Differential Revision: [D45253698](https://our.internmc.facebook.com/intern/diff/D45253698)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99943
Approved by: https://github.com/jansel
Before this PR, if users call ```Conv2d(x)```, dynamo handles it well (no graph break) and puts a ```call_module``` op in the FX graph. However, if users explicitly call ```Conv2d.forward(x)``` in another ```forward``` function, the inlining would fail (causing a graph break). This PR fixes that issue by translating the explicit ```Conv2d.forward(x)``` to ```Conv2d(x)```.
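An illustrative sketch of the now-supported pattern (module sizes and backend are arbitrary; `fullgraph=True` assumes a build with this fix):
```python
import torch
import torch.nn as nn

class M(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, 3)

    def forward(self, x):
        return self.conv.forward(x)  # explicit .forward() call instead of self.conv(x)

compiled = torch.compile(M(), backend="eager", fullgraph=True)
print(compiled(torch.randn(1, 3, 16, 16)).shape)  # torch.Size([1, 8, 14, 14])
```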
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99015
Approved by: https://github.com/jansel, https://github.com/wconstab
Allowed modules are stuck into dynamo's fx graph as call_module
nodes, without dynamo doing any tracing of the module. This means
during AOT trace time, hooks will fire during tracing when the
call_module is executed, but the hooks themselves will disappear
after that and not be present in the compiled program.
(worse, if they performed any tensor operations, those would get
traced so you could end up with part of the hook's functionality).
To circumvent this, there are two options for 'allowed modules' with hooks.
1) don't treat them as 'allowed' - trace into them
2) graph-break, so the module is no longer part of the dynamo trace at all
(1) will fail for users that opted into allowed modules because they know
their module has problems being traced by dynamo.
(2) causes graph breaks on common modules such as nn.Linear, just because they
are marked as 'allowed'.
It would help matters if we could differentiate between types of allowed modules
(A) allowed to avoid overheads - used for common ops like nn.Linear
(B) allowed to avoid dynamo graphbreaks caused by unsupported code
Ideally, we'd use method (1) for group (A) and (2) for (B).
For now, graph-break on all cases of allowed modules.
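An illustrative sketch of the scenario (backend is arbitrary): a forward hook on an 'allowed' module such as nn.Linear. With this PR, Dynamo graph-breaks at the module, so the hook still runs instead of silently disappearing from the compiled program:
```python
import torch
import torch.nn as nn

lin = nn.Linear(4, 4)
lin.register_forward_hook(lambda mod, inp, out: out * 2)

@torch.compile(backend="eager")
def fn(x):
    return lin(x)

print(fn(torch.randn(2, 4)).shape)  # torch.Size([2, 4]); the hook's doubling is applied
```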
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97184
Approved by: https://github.com/jansel
This fixes a regression added in the following PR to graph-break on allowed modules with hooks, but has its own problems.
- Following #97184, 'allowed modules' with hooks graph-break, and lazy modules
are allowed. (Should we just make lazy modules not allowed?)
- graph-breaks at lazy modules fail the lazy module unit tests which assert no graphbreaks
- this PR attempts to always 'initialize' lazy modules before tracing/calling into their __call__,
and initializing a lazy module should delete all its hooks after firing them once, making
the above issue go away
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98516
Approved by: https://github.com/yanboliang, https://github.com/jansel