pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jerry Zhang	1464db08b4	[quant][pt2e] Support setting qconfig by module_type (#92355 ) Summary: This PR supports the following feature for QConfigMapping: ``` qconfig_mapping = QConfigMapping().set_object_type(torch.nn.Conv2d, qconfig) backend_config = get_qnnpack_pt2e_backend_config() m = prepare_pt2e(m, qconfig_mapping, example_inputs, backend_config) ``` which means users want to set the qconfig for all calls to `torch.nn.Conv2d` to use `qconfig`, note this is only verified for the case when the module is broken down to a single aten op right now, e.g. torch.nn.Conv2d will be torch.ops.aten.convolution op when traced through. will need to support more complicated modules that is broken down to multiple operators later, e.g. (MaxPool) Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qconfig_module_type Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/92355 Approved by: https://github.com/jcaip	2023-01-20 03:18:21 +00:00
Will Constable	8e2e648f84	Propagate sources in VariableBuilder and add SuperSource (#91729 ) Motivation When adding support for default args (#90575), a lot of VariableTrackers missing sources were encountered. Currently, in a lot of cases it seems OK to skip the source for VariableTrackers created (especially during inlining), but that assumption breaks down when inlining functions with default arguments. Summary of changes - propagate the self.source of the VariableBuilder to the new variables being built, which seems like it was an omission previously - Add SuperSource to track usages of super(), so that SuperVariables can support function calls with default args Pull Request resolved: https://github.com/pytorch/pytorch/pull/91729 Approved by: https://github.com/ezyang	2023-01-12 05:04:18 +00:00
PyTorch MergeBot	6a3ddd0171	Revert "Don't graph break on patched module methods or aliased methods (#91018 )" This reverts commit `d6fc2d82ca`. Reverted https://github.com/pytorch/pytorch/pull/91018 on behalf of https://github.com/kit1980 due to After this PR, inductor / cuda11.6-py3.10-gcc7-sm86 / test fails every time with CUDA out of memory during OPTForCausalLM	2022-12-21 19:54:15 +00:00
William Wen	d6fc2d82ca	Don't graph break on patched module methods or aliased methods (#91018 ) See added tests for the cases that were fixed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91018 Approved by: https://github.com/Morgan77523, https://github.com/anijain2305	2022-12-21 16:29:15 +00:00
Yanbo Liang	2e0ce24890	[Dynamo] Support access nn.Module keys (#90502 ) Fixes https://github.com/pytorch/torchdynamo/issues/1973 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90502 Approved by: https://github.com/jansel	2022-12-12 09:15:42 +00:00
Jerry Zhang	797544f1c4	[dynamo][ez] Change module type to str for easier downstream parsing (#90429 ) Summary: att Test Plan: NA Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/90429 Approved by: https://github.com/SherlockNoMad	2022-12-09 02:00:18 +00:00
Ram Rachum	351d73b97f	Fix exception causes all over the codebase (#90271 ) This is the continuation to #90134 and hopefully the final PR in this series. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90271 Approved by: https://github.com/kit1980	2022-12-07 04:29:00 +00:00
William Wen	ebeecbf833	Dynamo FX graph stack traceback fix (#87136 ) Migration from https://github.com/pytorch/torchdynamo/pull/1655. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87136 Approved by: https://github.com/voznesenskym	2022-12-06 02:22:16 +00:00
Yanbo Liang	d88b555577	[Dynamo] Fix source/reconstruction bugs in NNModule named_* calls (#89729 ) Fixes https://github.com/pytorch/torchdynamo/issues/1931 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89729 Approved by: https://github.com/ezyang	2022-11-30 06:05:47 +00:00
Will Constable	77df2ca9b6	Special-case fsdp wrapped modules to be Unspecialized (#89330 ) ### Summary Making dynamo treat the nn.Modules inside FSDP wrappers as 'Unspecialized' results in dynamo-produced graphs where nn.module parameters are inputs to the graph rather than attributes of the outer graphmodule. This helps in FSDP since it forces dynamo to pick the latest copy of the parameters off the user's nn.Module (which FSDP mutates every pre_forward), solving the ordering issue in backward. ### Details Imagine this toy model ``` class MyModule(torch.nn.Module): def __init__(self, a, b): super(MyModule, self).__init__() self.net = nn.Sequential( nn.Linear(a, b), nn.ReLU(), ) def forward(self, x): return self.net(x) class ToyModel(nn.Module): def __init__(self): super(ToyModel, self).__init__() self.net = nn.Sequential( *[MyModule(10, 10000)] + [MyModule(10000, 1000)] + [MyModule(1000, 5)] ) def forward(self, x): return self.net(x) ``` Where FSDP is recursively wrapped around each `MyModule`, then dynamo-compiled, with dynamo already configured to skip/break in FSDP code. You'd expect to get 3 compiled AOT functions, corresponding to the contents of `MyModule`, and then see FSDP's communication ops happen inbetween them (eagerly). This almost happens (everything works out fine in forward), but in backward there is an ordering issue. FSDP creates a flat buffer for all the parameters that are bucketed together, and then creates views into this buffer to replace the original parameters. On each iteration of forward, it creates a new view after 'filling' the flatbuffer with data from an all-gather operation, to 'unshard' the parameters from remote devices. Dynamo traces the first such view and stores it in a compiled graphmodule. During tracing, we see (1) view created for first MyModule, (2) compile first MyModule, (3) ... for the rest of layers Then during runtime, we see (A) view created for first MyModule (and orphaned), (B) execute first compiled MyModule, using old view, ... This is a problem, because we want backward hooks to run right after each compiled-backward, but autograd executes those hooks in an order mirroring their execution order during forward. Since we are forever using the views created during steps (1, 3, .. N), which all happen before the steps (A, B, ...), this means that all the hooks will happen after all the compiled backwards. An illustration of the problem - a torchviz graph showing the 2 possible orderings of autograd, and a profile showing the view-backwards ops happening after all the compiled backwards, and before all the backward hooks. <img width="2069" alt="image" src="https://user-images.githubusercontent.com/4984825/202828002-32dbbd15-8fc3-4281-93e9-227ab5e32683.png"> <img width="2069" alt="image" src="https://user-images.githubusercontent.com/4984825/202828632-33e40729-9a7f-4e68-9ce1-571e3a8dd2dd.png"> A solution is to make dynamo not specialize on these nn modules. It is worth pointing out that this nn.module specialization is de-facto failing, as we are modifying .parameters and this bypasses dynamo's __setattr__ monkeypatch, which should have automatically kicked us out to Unspecialized and forced a recompile. After unspecializing, the new views (created during steps A, C, ...) are actually _used_ at runtime by the module, making their creation order interleaved, making autograd execute their backwards interleaved. The new torchviz graph (this time with names added for the view tensors): <img width="2043" alt="image" src="https://user-images.githubusercontent.com/4984825/202828480-d30005ba-0d20-45d8-b647-30b7ff5e91d3.png"> And a new profile showing the interleaving of compiled backwards and hooks, allowing overlapping of reduce-scatter. <img width="2293" alt="image" src="https://user-images.githubusercontent.com/4984825/202828533-bb20a041-19b8-499c-b3cf-02808933df47.png"> @jansel @davidberard98 @aazzolini @mrshenli @awgu @ezyang @soumith @voznesenskym @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89330 Approved by: https://github.com/davidberard98	2022-11-29 01:24:03 +00:00
Edward Z. Yang	1da633f98a	Access named parameters/buffers/etc via getattr rather than index (#89625 ) I'm not sure why this never caused problems before. The error manifests as `TypeError: 'MyModule' object is not subscriptable` Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/89625 Approved by: https://github.com/albanD	2022-11-28 00:19:48 +00:00
Michael Voznesensky	06ce1338bc	[dynamo] Port all pytorch/dynamo and test/dynamo pieces over from symbolic-shapes branch (#88768 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88768 Approved by: https://github.com/jansel, https://github.com/ezyang	2022-11-13 04:50:21 +00:00
ydwu4	3765621356	torchdynamo support self.modules() for nn_module (#88695 ) This PR allows models to call self.modules() during dynamo tracing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88695 Approved by: https://github.com/voznesenskym	2022-11-12 20:00:51 +00:00
PyTorch MergeBot	dba887766b	Revert "torchdynamo support modules() for nn_module (#88023 )" This reverts commit `96104c7b7e`. Reverted https://github.com/pytorch/pytorch/pull/88023 on behalf of https://github.com/ydwu4 due to [Internal breakages] https://www.internalfb.com/intern/sandcastle/job/9007200067589062/	2022-11-08 18:37:48 +00:00
Yidi Wu	96104c7b7e	torchdynamo support modules() for nn_module (#88023 ) Differential Revision: D40820879 This diff allows models to call self.modules() during dynamo tracing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88023 Approved by: https://github.com/tugsbayasgalan, https://github.com/voznesenskym, https://github.com/jansel	2022-11-08 18:22:03 +00:00
PyTorch MergeBot	bf7c996dcb	Revert "torchdynamo support modules() for nn_module (#88023 )" This reverts commit `eb91e8a534`. Reverted https://github.com/pytorch/pytorch/pull/88023 on behalf of https://github.com/mehtanirav due to [Internal breakages](https://www.internalfb.com/intern/sandcastle/job/13510799692855066/insights)	2022-11-02 22:35:14 +00:00
Yidi Wu	eb91e8a534	torchdynamo support modules() for nn_module (#88023 ) Differential Revision: D40820879 This diff allows models to call self.modules() during dynamo tracing. cc @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx Pull Request resolved: https://github.com/pytorch/pytorch/pull/88023 Approved by: https://github.com/tugsbayasgalan, https://github.com/voznesenskym, https://github.com/jansel	2022-11-01 17:10:45 +00:00
Tugsbayasgalan Manlaibaatar	7b5978254f	Add named_buffers to torchdynamo nn_module (#87644 ) Fixes: https://github.com/pytorch/torchdynamo/issues/1738 cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87644 Approved by: https://github.com/jansel	2022-10-25 17:00:56 +00:00
Animesh Jain	e46a8971e6	[dynamo] Support class members in nn modules (#87531 ) Fixes https://github.com/pytorch/torchdynamo/issues/1740 @voznesenskym cc @jansel @lezcano @fdrocha @mlazos @soumith @voznesenskym @yanboliang @penguinwu Pull Request resolved: https://github.com/pytorch/pytorch/pull/87531 Approved by: https://github.com/jansel	2022-10-24 18:48:49 +00:00
Michael Voznesensky	2fd008ed43	[dynamo] Add support for invoking nn sequential (#87156 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87156 Approved by: https://github.com/jansel	2022-10-20 18:14:40 +00:00
PyTorch MergeBot	f3cc588d09	Revert "Dynamo FX graph stack traceback fix (#87136 )" This reverts commit `89e6078bc3`. Reverted https://github.com/pytorch/pytorch/pull/87136 on behalf of https://github.com/clee2000 due to causing a lot of tests to fail on master even though pr is green	2022-10-19 18:57:24 +00:00
William Wen	89e6078bc3	Dynamo FX graph stack traceback fix (#87136 ) Migration from https://github.com/pytorch/torchdynamo/pull/1655. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87136 Approved by: https://github.com/voznesenskym	2022-10-19 17:15:43 +00:00
Sherlock Huang	88b76ae9ea	Store type(module) in the module stack (#87149 ) - As requested by quantization team, it prefer storing type(module) in the module stack. - Consequently, as module stack gets verbose, we skip printing module stack in the gm.print_readable() Pull Request resolved: https://github.com/pytorch/pytorch/pull/87149 Approved by: https://github.com/jerryzh168, https://github.com/jansel	2022-10-18 18:12:37 +00:00
Michael Voznesensky	2b03a941f7	[dynamo] graph capture for calls to arbitrary self. methods on nn module (#87040 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/87040 Approved by: https://github.com/jansel	2022-10-18 16:54:40 +00:00
Michael Suo	4814270708	[dynamo] Introduce `get_real_value` API to TensorVariable (#87091 ) Right now, example_value is doing two jobs: - We use it to propagate metadata (e.g. return type, shapes, etc.) throughout the graph - We use it to satisfy queries for the actual value (e.g. torch.cond, `assume_constant_result`) This is further complicated by the fact that we have two modes, one where `example_value` is a fake tensor, and one where it is a real tensor (this is the `fake_tensor_propagation` config flag). This leads to scenarios where we don't support every combination of job + mode, e.g. if `fake_tensor_propagation=False`, `assume_constant_result` is broken. This is made worse by the fact that "fake tensor mode" is the default and is required if you want dynamic shapes to work. So, this PR introduces a `get_real_value` API that just runs the graph up to `node` in order to get a concrete value. This API is orthogonal to `example_value`, so it doesn't care about `fake_tensor_propagation`. When `fake_tensor_propagation=True`: `example_value` is a fake tensor, you must use the `get_real_value` API to get a concrete value. This will be the only configuration in the future. When `fake_tensor_propagation=False`: `example_value` and `get_real_value` will produce the same value. This is redundant but we will be removing this config soon. To support this, I introduce a cache for computed real values, to memoize the work involved if we're asking for real values a lot. I attached this state to `OutputGraph` because it seems to be what historically managed `example_value` lifetimes, but idk. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87091 Approved by: https://github.com/wconstab	2022-10-17 20:14:43 +00:00
Jason Ansel	c7c09722ad	Move TorchDynamo into PyTorch core (#86461 ) Context: https://github.com/pytorch/torchdynamo/issues/1588 This PR moves [TorchDynamo](https://github.com/pytorch/torchdynamo) and TorchInductor into PyTorch core. - `torchdynamo` becomes `torch._dynamo` - `torchinductor` becomes `torch._inductor` This PR was generated by running `copy_to_core.sh` in https://github.com/pytorch/torchdynamo/pull/1538 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86461 Approved by: https://github.com/voznesenskym	2022-10-13 23:18:06 +00:00

26 Commits