The strategy for supporting functools partials is relatively straightforward.
There are 2 cases we need to support:
**1) Functools partials as input**
In this case, we are seeing the functools partial for the first time, and it is guaranteed to have a source. As such, the args, keywords, and func of the functools partial are passed through VariableBuilder. Since this is the first time we are seeing these objects (they arrive as an input), we re-enter VariableBuilder with a source referencing the args, keywords, and func as attributes of the input to produce:
- func: A callable VariableTracker (UDF, TorchVariable, etc) depending on the value of `func`
- args: List[VariableTracker] - note, not ListVariableTracker!
- keywords: Dict[str, VariableTracker]
A major benefit of this structure is that it very elegantly matches the args to `call_function`.
We then compose a FunctoolsPartialVariable from the VariableTrackers made above.
**2) Functools partials created within compile**
In this case, we already have all the args as known VTs, and thus just compose a FunctoolsPartialVariable as we do for case (1).
For both (1) and (2) - we propagate all guards from the func, args, and keyword VTs to the FunctoolsPartialVariable
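For reference, the three attributes listed above are exactly what `functools.partial` exposes in plain Python; a small runnable illustration (the `scale` function is only for the example):
```python
import functools

def scale(x, factor, *, bias=0):
    return x * factor + bias

p = functools.partial(scale, 3, bias=1)

# These are the attributes that get wrapped into VariableTrackers:
print(p.func)      # <function scale ...>
print(p.args)      # (3,)
print(p.keywords)  # {'bias': 1}

# Calling the partial merges the stored args/keywords with the call-site
# arguments before dispatching to p.func -- the same composition the
# FunctoolsPartialVariable performs when its call_function is hit.
assert p(2) == scale(3, 2, bias=1)
```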
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108846
Approved by: https://github.com/ezyang, https://github.com/jansel
Summary:
Original commit changeset: 33650f7cb0fb
Original Phabricator Diff: D48833682
Test Plan: See T162942232 for how we figured out that this diff caused significant numeric difference.
Reviewed By: voznesenskym
Differential Revision: D49082219
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108823
Approved by: https://github.com/xw285cornell
Summary: Currently node metadata "nn_module_stack" is only being used by export. For some exported models, we still want to retain nn_module_stack for unspecialized modules for various purposes. This diff adds a path to also record nn_module_stack when an unspecialized module has a source available.
Test Plan: test_export_nn_module_stack_patched_module
Differential Revision: D48841193
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108281
Approved by: https://github.com/yanboliang, https://github.com/tugsbayasgalan
Before the PR, running `super(MyConv1d, self).forward` or `super(MyConvTranspose, self).forward`, dynamo would create a graph break when executing NNModuleVariable.call_method and raise an unimplemented error for name=_conv_forward / _output_padding. See the issue for full detail: https://github.com/pytorch/pytorch/issues/101155
After the PR, for torch.nn conv modules with function name _conv_forward / _output_padding, we inline the function with tx.inline_user_function_return.
Code refactor: added NNModuleVariable._inline_user_function_return_helper to consolidate tx.inline_user_function_return into one place and keep the code DRY. After the refactor, there remain 2 unconsolidated inline_user_function_return call sites with different `fn` and `source` logic; the code is still DRY. For local testing, they are covered by test_modulelist, test_moduledict, test_conv_call_super_forward_directly and test_conv_transpose_call_super_forward_directly in test_modules.py
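A minimal sketch of the pattern this change targets (the subclass mirrors the one named in the issue; the compile call is only illustrative):
```python
import torch
import torch.nn as nn

class MyConv1d(nn.Conv1d):
    def forward(self, x):
        # Previously this super().forward call graph-broke when dynamo hit
        # _conv_forward; with this PR it is inlined instead.
        return super(MyConv1d, self).forward(x)

out = torch.compile(MyConv1d(3, 8, kernel_size=3))(torch.randn(1, 3, 16))
print(out.shape)  # torch.Size([1, 8, 14])
```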
Differential Revision: [D46494460](https://our.internmc.facebook.com/intern/diff/D46494460)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/102509
Approved by: https://github.com/yanboliang
Opening this so I can discuss with @albanD
I built a proof of concept of an in place API for an nn.Module that allows us to save and load a torch.compiled model with no issues https://github.com/msaroufim/mlsys-experiments/blob/main/save-compiled-model.py
So users can run `model.compile()` and then run `torch.save(model, "model.pt")` and `torch.load("model.pt")` with no issues, unlike the rather strange current suggestion we give to users which is `opt_mod = torch.compile(mod); torch.save(mod, "model.pt")`
Right now I'm trying to extend this to work for nn.modules more generally
TODO: Failing tests
* [x] torch.jit.load -> issue was because of aliasing `__call__` to `_call_impl`; `_call_impl` used to be skipped but now no longer is, so I expanded the skip check. I added an explicit `torch.jit.load()` test, which @davidberard98 suggested
* [x] functorch seems to be a flake - ran locally and it worked `pytest functorch/test_eager_transforms.py`
* [x] a test infra flake - `test_testing.py::TestImports::test_no_mutate_global_logging_on_import_path_functorch`
* [x] It seems like I broke inlining in dynamo though `python -m pytest test/dynamo/test_dynamic_shapes.py -k test_issue175` chatting with Voz about it but still not entirely sure how to fix - found a workaround after chatting with @yanboliang
* [x] `pytest test/dynamo/test_modules.py`, `test/dynamo/test_dynamic_shapes`, and `test/dynamo/test_misc.py` seemed to be failing in CI but trying them out locally they all pass with 0 failures
* [x] `pytest test/profiler/test_profiler_tree.py` - these tests have ProfilerTrees explicitly printed and will now break if `__call__` is not in the tree - ran with `EXPECT_ACCEPT=1`
* [x] `pytest test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning` a flake, ran this locally and it works fine
* [x] I reverted my changes to `_dynamo/nn_module.py` since it looks like @wconstab is now directly handling `_call_impl` there but this is triggering an infinite inlining which is crashing
* [x] Tried to instead override `__call__`; python doesn't like this though https://github.com/pytorch/pytorch/pull/97565#issuecomment-1524570439
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97565
Approved by: https://github.com/aaronenyeshi, https://github.com/albanD, https://github.com/voznesenskym
**TL;DR**: This PR fixes handling for lazy modules where `cls_to_become is None`. In those cases, we should leave the type of the lazy module as the old value.
**Details**:
Lazy modules are intended to be initialized at execution; some of them are also supposed to switch to a different type after they have been initialized. However, not all are supposed to switch; see this logic from `nn/modules/lazy.py`
```python
def _infer_parameters(self, ...):
    ...
    if module.cls_to_become is not None:
        module.__class__ = module.cls_to_become
```
i.e., we should leave the module type as the old value if `module.cls_to_become is None`. This PR updates dynamo's handling to match this behavior.
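For concreteness, here is the behavior being matched, shown with a built-in lazy module where `cls_to_become` is set (a minimal sketch, not code from this PR):
```python
import torch
import torch.nn as nn

lazy = nn.LazyLinear(4)
print(lazy.cls_to_become)   # <class 'torch.nn.modules.linear.Linear'>
lazy(torch.randn(2, 3))     # first call runs _infer_parameters
print(type(lazy))           # nn.Linear -- the class was swapped after init

# A lazy module whose cls_to_become is None skips the quoted branch and
# keeps its original type after initialization; dynamo now mirrors that.
```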
Test `test_lazy_module_no_cls_to_become` added to `test/dynamo/test_module.py`.
Differential Revision: [D45253698](https://our.internmc.facebook.com/intern/diff/D45253698)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99943
Approved by: https://github.com/jansel
Before this PR, if users call `Conv2d(x)`, dynamo handles it well (no graph break) and puts a `call_module` op in the FX graph. However, if users explicitly call `Conv2d.forward(x)` in another `forward` function, the inlining would fail (causing a graph break). This PR fixes the issue by translating the explicit `Conv2d.forward(x)` to `Conv2d(x)`.
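A minimal sketch of the pattern in question (the wrapper module name and compile call are illustrative):
```python
import torch
import torch.nn as nn

class Wrapper(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3)

    def forward(self, x):
        # Explicit .forward() call: previously graph-broke, now handled
        # as if it were self.conv(x).
        return self.conv.forward(x)

torch.compile(Wrapper())(torch.randn(1, 3, 16, 16))
```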
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99015
Approved by: https://github.com/jansel, https://github.com/wconstab
Allowed modules are stuck into dynamo's fx graph as call_module
nodes, without dynamo doing any tracing of the module. This means
during AOT trace time, hooks will fire during tracing when the
call_module is executed, but the hooks themselves will disappear
after that and not be present in the compiled program.
(worse, if they performed any tensor operations, those would get
traced so you could end up with part of the hook's functionality).
To circumvent this, there are two options for 'allowed modules' with hooks.
1) don't treat them as 'allowed' - trace into them
2) graph-break, so the module is no longer part of the dynamo trace at all
(1) will fail for users that opted into allowed modules because they know
their module has problems being traced by dynamo.
(2) causes graph breaks on common modules such as nn.Linear, just because they
are marked as 'allowed'.
It would help matters if we could differentiate between types of allowed modules
(A) allowed to avoid overheads - used for common ops like nn.Linear
(B) allowed to avoid dynamo graphbreaks caused by unsupported code
Ideally, we'd use method (1) for group (A) and (2) for (B).
For now, graph-break on all cases of allowed modules.
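A small example of the situation described, using the standard hook API (the doubling hook is only illustrative):
```python
import torch
import torch.nn as nn

lin = nn.Linear(4, 4)
lin.register_forward_hook(lambda mod, inp, out: out * 2)

# nn.Linear is an 'allowed module': dynamo would normally emit a call_module
# node without tracing inside it, so the hook would fire during AOT tracing
# but be absent from the compiled program. With this change, allowed modules
# with hooks graph-break instead, so the hook still runs at execution time.
opt = torch.compile(lin)
print(opt(torch.ones(1, 4)))
```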
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97184
Approved by: https://github.com/jansel
This fixes a regression added in the following PR to graph-break on allowed modules with hooks, but has its own problems.
- the preceding PR #97184 makes 'allowed modules' with hooks graph-break, and lazy modules
are allowed (should we just make lazy modules not allowed?)
- graph-breaks at lazy modules fail the lazy module unit tests which assert no graphbreaks
- this PR attempts to always 'initialize' lazy modules before tracing/calling into their __call__,
and initializing a lazy module should delete all its hooks after firing them once, making
the above issue go away
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98516
Approved by: https://github.com/yanboliang, https://github.com/jansel
Tweak dynamo behavior in 2 places when calling nn.Modules,
to route the call to __call__ instead of .forward(), since
__call__ is the codepath that eager users hit and will dispatch
to hooks correctly.
(1) inside NNModuleVariable.call_function, which covers the common case
of calling a module from code dynamo is already tracing
(2) at the OptimizedModule layer, which is the entrypoint
into a top-level nn.Module dynamo is about to compile
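The difference between the two codepaths can be seen in plain eager mode; the hook below only fires through `__call__` (a minimal sketch, not code from this PR):
```python
import torch
import torch.nn as nn

m = nn.ReLU()
m.register_forward_hook(lambda mod, inp, out: out + 1)

x = torch.tensor([-1.0, 2.0])
print(m(x))          # __call__ dispatches the hook -> tensor([1., 3.])
print(m.forward(x))  # .forward() skips the hook    -> tensor([0., 2.])
```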
This exposes a new bug: NNModuleVariable used to special-case calling
module.forward() (which is a method) as a UserFunctionVariable with an extra
'self' arg. After tracing into module.__call__, there is no longer a special
case for the eventual call into .forward, and it gets wrapped in a
UserDefinedObjectVariable following standard behavior of ._wrap(). UDOV can't be
called, so this broke some tests.
- Fix: add a new special case in _wrap() that treats methods as a UserDefinedMethod
instead of UserDefinedObjectVariable. Now, the forward method can be called.
Also, fix NNModuleVar.call_method routing forward back to __call__
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92125
Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/voznesenskym
After some thought, I find it difficult to come up with a robust naming convention that satisfies the following constraints at the same time: 1. the new name should be a valid nn.Module attribute (as required by the minifier, and a good thing to have in general); 2. it can cover various cases such as GetItemSource, GetAttrSource; 3. it is easy to recover the original path; 4. it is robust to users' naming schemes.
Thanks to @yanboliang for pointing out that the original access path is preserved in Source; now we just need to add an additional value source.name() to node.meta["nn_module_stack"] to get the access path in the original module.
We also address some TODO in quantization, which relies on the original naming convention in nn_module_stack.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94945
Approved by: https://github.com/jansel, https://github.com/yanboliang
Currently, when unrolling an nn.Sequential, we use an integer to represent its submodule's name. This produces some difficulty in tracking the origin of the parameters in the export path:
```python
model = nn.Sequential(OrderedDict([
    ('conv1', nn.Conv2d(1, 20, 5)),
    ('relu1', nn.ReLU()),
    ('conv2', nn.Conv2d(20, 64, 5)),
    ('relu2', nn.ReLU())
]))
```
Currently, the submodules will have names such as model.0, model.1 instead of model.conv1, model.relu1. This discrepancy makes it difficult to track the origin of parameters because they are represented as model.conv1.foo and model.relu1.foo in model.named_parameters().
We replace enumerate() with named_children() to keep the submodules' names.
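A short illustration of the mismatch, using a trimmed version of the model above:
```python
import torch.nn as nn
from collections import OrderedDict

model = nn.Sequential(OrderedDict([
    ('conv1', nn.Conv2d(1, 20, 5)),
    ('relu1', nn.ReLU()),
]))

print([i for i, _ in enumerate(model)])               # [0, 1]
print([name for name, _ in model.named_children()])   # ['conv1', 'relu1']
print([name for name, _ in model.named_parameters()])
# ['conv1.weight', 'conv1.bias'] -- matches named_children(), not the indices
```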
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94913
Approved by: https://github.com/jansel
**Summary**: torch.nn.Module implementations previously did not support custom implementations of `__getattr__`; if a torch.nn.Module subclass implemented `__getattr__` and we tried to access an attribute that was expected to be present in `__getattr__`, dynamo would not check `__getattr__` and would error out with an AttributeError. This PR copies the functionality from UserDefinedObjectVariable into torch.nn.Module so that it also supports `__getattr__`
Example of a module which previously would fail:
```python
class MyMod(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.custom_dict = {"queue": [torch.rand((2, 2)) for _ in range(3)]}
        self.other_attr = torch.rand((2, 2))

    def __getattr__(self, name):
        custom_dict = self.custom_dict
        if name in custom_dict:
            return custom_dict[name]
        return super().__getattr__(name)

    def forward(self, x):
        return x @ self.other_attr + self.queue[-1]
```
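A possible usage of the module above; before this change, the `self.queue` access inside the compiled forward raised AttributeError because dynamo never consulted the custom `__getattr__`:
```python
import torch

mod = MyMod()
out = torch.compile(mod)(torch.rand(2, 2))
print(out.shape)  # torch.Size([2, 2])
```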
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94658
Approved by: https://github.com/yanboliang, https://github.com/jansel
Summary:
This PR supports the following feature for QConfigMapping:
```
qconfig_mapping = QConfigMapping().set_object_type(torch.nn.Conv2d, qconfig)
backend_config = get_qnnpack_pt2e_backend_config()
m = prepare_pt2e(m, qconfig_mapping, example_inputs, backend_config)
```
which means users want to set the qconfig for all calls to `torch.nn.Conv2d` to use `qconfig`. Note this is only verified for the case when the module is broken down to a single aten op right now, e.g. torch.nn.Conv2d becomes a torch.ops.aten.convolution op when traced through. We will need to support more complicated modules that are broken down to multiple operators later (e.g. MaxPool).
Test Plan:
python test/test_quantization.py TestQuantizePT2E.test_qconfig_module_type
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92355
Approved by: https://github.com/jcaip
**Motivation**
When adding support for default args (#90575), a lot of VariableTrackers missing sources were encountered. Currently, in a lot of cases it seems OK to skip the source for VariableTrackers created (especially during inlining), but that assumption breaks down when inlining functions with default arguments.
**Summary** of changes
- propagate the self.source of the VariableBuilder to the new variables being built, which seems like it was an omission previously
- Add SuperSource to track usages of super(), so that SuperVariables can support function calls with default args
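An illustrative shape of the kind of code this enables (the class names and default value are hypothetical, not from the original PR):
```python
import torch
import torch.nn as nn

class Base(nn.Module):
    def forward(self, x, scale=2.0):
        return x * scale

class Child(Base):
    def forward(self, x):
        # Inlining this super() call relies on the default value of `scale`;
        # SuperSource gives the super() usage a source so that default-arg
        # VariableTrackers can be built for it.
        return super().forward(x)

torch.compile(Child())(torch.randn(3))
```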
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91729
Approved by: https://github.com/ezyang
### Summary
Making dynamo treat the nn.Modules inside FSDP wrappers as 'Unspecialized'
results in dynamo-produced graphs where nn.module parameters are inputs
to the graph rather than attributes of the outer graphmodule.
This helps in FSDP since it forces dynamo to pick the latest copy
of the parameters off the user's nn.Module (which FSDP mutates every pre_forward),
solving the ordering issue in backward.
### Details
Imagine this toy model
```
class MyModule(torch.nn.Module):
    def __init__(self, a, b):
        super(MyModule, self).__init__()
        self.net = nn.Sequential(
            nn.Linear(a, b),
            nn.ReLU(),
        )

    def forward(self, x):
        return self.net(x)

class ToyModel(nn.Module):
    def __init__(self):
        super(ToyModel, self).__init__()
        self.net = nn.Sequential(
            *[MyModule(10, 10000)]
            + [MyModule(10000, 1000)]
            + [MyModule(1000, 5)]
        )

    def forward(self, x):
        return self.net(x)
```
Where FSDP is recursively wrapped around each `MyModule`, then dynamo-compiled, with dynamo already configured to skip/break in FSDP code. You'd expect to get 3 compiled AOT functions, corresponding to the contents of `MyModule`, and then see FSDP's communication ops happen in between them (eagerly). This almost happens (everything works out fine in forward), but in backward there is an ordering issue.
FSDP creates a flat buffer for all the parameters that are bucketed together, and then creates views into this buffer to replace the original parameters. On each iteration of forward, it creates a new view after 'filling' the flatbuffer with data from an all-gather operation, to 'unshard' the parameters from remote devices. Dynamo traces the first such view and stores it in a compiled graphmodule.
During tracing, we see (1) view created for first MyModule, (2) compile first MyModule, (3) ... for the rest of layers
Then during runtime, we see (A) view created for first MyModule (and orphaned), (B) execute first compiled MyModule, using old view, ...
This is a problem, because we want backward hooks to run right after each compiled-backward, but autograd executes those hooks in an order mirroring their execution order during forward. Since we are forever using the views created during steps (1, 3, .. N), which all happen before the steps (A, B, ...), this means that all the hooks will happen after all the compiled backwards. An illustration of the problem - a torchviz graph showing the 2 possible orderings of autograd, and a profile showing the view-backwards ops happening after all the compiled backwards, and before all the backward hooks.
<img width="2069" alt="image" src="https://user-images.githubusercontent.com/4984825/202828002-32dbbd15-8fc3-4281-93e9-227ab5e32683.png">
<img width="2069" alt="image" src="https://user-images.githubusercontent.com/4984825/202828632-33e40729-9a7f-4e68-9ce1-571e3a8dd2dd.png">
A solution is to make dynamo not specialize on these nn modules. It is worth pointing out that this nn.module specialization is de-facto failing, as we are modifying .parameters and this bypasses dynamo's __setattr__ monkeypatch, which should have automatically kicked us out to Unspecialized and forced a recompile.
After unspecializing, the new views (created during steps A, C, ...) are actually _used_ at runtime by the module, making their creation order interleaved, making autograd execute their backwards interleaved.
The new torchviz graph (this time with names added for the view tensors):
<img width="2043" alt="image" src="https://user-images.githubusercontent.com/4984825/202828480-d30005ba-0d20-45d8-b647-30b7ff5e91d3.png">
And a new profile showing the interleaving of compiled backwards and hooks, allowing overlapping of reduce-scatter.
<img width="2293" alt="image" src="https://user-images.githubusercontent.com/4984825/202828533-bb20a041-19b8-499c-b3cf-02808933df47.png">
@jansel @davidberard98 @aazzolini @mrshenli @awgu @ezyang @soumith @voznesenskym @anijain2305
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89330
Approved by: https://github.com/davidberard98
I'm not sure why this never caused problems before. The error
manifests as `TypeError: 'MyModule' object is not subscriptable`
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89625
Approved by: https://github.com/albanD