pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Peter Bell	a3a2486be8	[dynamo] Avoid eager imports of classes with custom VariableTrackers (#112319 ) Currently custom VariableTrackers exist for classes that live outside of pytorch. For these cases dynamo currently eagerly imports the module to get the class object to compare against. This instead uses `sys.modules.get("module.path")` such that the module is never imported by dynamo itself, but if the user has imported the module then we will still access the module and grab the type we need to compare against. I noticed this issue because importing `KeyedJaggedTensor` fails half-way through if `fbgemm_gpu` has been built with an incompatible PyTorch version, in which case it retries the import again each time! Pull Request resolved: https://github.com/pytorch/pytorch/pull/112319 Approved by: https://github.com/lezcano, https://github.com/ezyang	2023-11-07 22:45:54 +00:00
Jason Ansel	356f3458c4	[dynamo] Remove incorrect sources (#112961 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112961 Approved by: https://github.com/voznesenskym, https://github.com/Skylion007 ghstack dependencies: #111306, #111415, #111725, #111726, #112962	2023-11-07 22:01:40 +00:00
Jason Ansel	5fe96eaaf4	[dynamo] Remove VariableTracker.propagate (#111726 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111726 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306, #111415, #111725	2023-11-07 19:55:19 +00:00
Jason Ansel	843a8ecd24	[dynamo] Remove VariableTracker.add_options (#111725 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111725 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306, #111415	2023-11-07 19:55:19 +00:00
Jason Ansel	9664190952	[dynamo] Eagerly install guards (#111415 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111415 Approved by: https://github.com/voznesenskym ghstack dependencies: #111306	2023-11-07 19:55:19 +00:00
Jason Ansel	f5088d2e45	[dynamo] fix None routing bug during var_getattr on UDO (#111614 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111614 Approved by: https://github.com/jansel	2023-10-29 01:57:43 +00:00
Jason Ansel	c7b78fb76c	[dynamo] Replace recursively_contains with parents_tracker (#112122 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112122 Approved by: https://github.com/voznesenskym	2023-10-28 06:46:48 +00:00
Yanbo Liang	061bf1a153	[5/N] Make torch context manager a TorchCtxManagerClassVariable (#111622 ) Major change in this PR is to make torch context manager class a separate ```TorchCtxManagerClassVariable```, since we have dynamo implementation for these ctx managers. I was thinking to wrap them as ```UserDefinedClassVariable``` and do dispatch at ```USCVariable.call_function```, but it seems almost the same amount of work and this way is more clear. This is on the way of moving ```TorchVariable``` to ```TorchFunctionVariable``` which will only handle the functions who would be allowed in graph (e.g, ```torch.sin```) and constant folded (e.g, ```torch.is_floating_point```). All other torch functions would be go through skip/inline rules, and would be wrapped as ```UserFunctionVariable``` (for inlined) and ```SkipFilesVariable``` (for skipped). The next steps: * Wrap torch modules, classes, objects as regular ```PythonModuleVariable```, ```UserDefinedClassVariable``` and ```UserDefinedObjectVariable```. * Generate the allow in graph torch functions list and wrap them as ```TorchFunctionVariable```. * Finally merge ```skipfiles.check``` and ```is_allowed``` into one function ```allow_skip.check(fn)``` which would return a Enum of allow, skip and inline. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111622 Approved by: https://github.com/jansel	2023-10-27 21:26:54 +00:00
PyTorch MergeBot	5344468712	Revert "[dynamo] Properly track user-defined types for `type()` (#110794 )" This reverts commit `ad4ccf9689`. Reverted https://github.com/pytorch/pytorch/pull/110794 on behalf of https://github.com/ezyang due to looks like this actually fails internal tests ([comment](https://github.com/pytorch/pytorch/pull/110794#issuecomment-1778002262))	2023-10-24 20:42:26 +00:00
Ken Jin	ad4ccf9689	[dynamo] Properly track user-defined types for `type()` (#110794 ) Closes https://github.com/pytorch/pytorch/issues/110315. Thanks to @ezyang for the easy repro! Pull Request resolved: https://github.com/pytorch/pytorch/pull/110794 Approved by: https://github.com/ezyang	2023-10-23 17:34:23 +00:00
Tugsbayasgalan Manlaibaatar	bf7307adf8	Support inference_mode decorator (#109274 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109274 Approved by: https://github.com/williamwen42	2023-09-27 22:21:42 +00:00
Yanbo Liang	a81cb0de16	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-26 00:03:41 +00:00
PyTorch MergeBot	829b5c0949	Revert "[Dynamo] Support python class member_descriptor (#109956 )" This reverts commit `12cd776d90`. Reverted https://github.com/pytorch/pytorch/pull/109956 on behalf of https://github.com/jeanschmidt due to multiple slow jobs broken ([comment](https://github.com/pytorch/pytorch/pull/109956#issuecomment-1733706269))	2023-09-25 13:25:45 +00:00
Yanbo Liang	12cd776d90	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-25 03:15:39 +00:00
Michael Voznesensky	a902150a1e	[Easy] ConstantVariable() -> .create (#109896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109896 Approved by: https://github.com/ezyang	2023-09-22 22:30:15 +00:00
Edward Yang	88600e7d2e	[RELAND] Force synced KJT to trace unbacked SymInt (#108960 ) (#109216 ) Summary: The basic concept behind this diff is to modify Dynamo's tracing behavior when it encounters a KeyedJaggedTensor that is synced (aka has `_length_per_key` and `_offset_per_key` populated). These fields are lists of integers; ordinarily, Dynamo will optimistically try to specialize on integers, however, for KJTs, we know that these integers will definitely vary from run-to-run. Furthermore, ordinarily, we would also specialize these integers if they are 0/1, but we will frequently expect features in KJTs to be 0/1. The fix is to detect KJTs and treat these integers as unbacked integers. This is NOT a universally sound optimization: when treating these integers as unbacked, we never report them as equal to zero or one. In return, we always generate graphs that generalize no matter the length of values on features. This is enough to trace through APS sparse arch, torchrec_dlrm and some small split-cat examples. The special integer behavior is triggered by a dynamically scoped `force_unspec_int_unbacked_size_like` variable on TracingContext, which we trigger when we wrap a KJT. There probably are other ways to do this, but this was simple and worked. Test Plan: ``` buck2 test mode/dev-nosan //pytorch/benchmark/fb/test_gpu:run_test_gpu ``` from aakhundov 1. first build feed_lower_benchmark: ``` buck2 build --show-output mode/opt -c python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true hpc/new/models/feed/benchmark:feed_lower_benchmark ``` 2. then run the lowering of the model with it: ``` TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 TORCH_LOGS="output_code,graph_code" TORCH_COMPILE_DEBUG=1 ../buck-out/v2/gen/fbcode/79c6b019ee0f9469/hpc/new/models/feed/benchmark/__feed_lower_benchmark__/feed_lower_benchmark.par --load=manifold://ig_inference_model/tree/user/facebook/fblearner/predictor/960999465/60/gpu_lowering/input.predictor --skip-trt --skip-ait --sync-mode=0 --enable-aot-inductor --lower-presets="ig_stories" --gpu-trace ``` cf https://docs.google.com/document/d/1yD30xYrdmM8r2HTdmXnZTg0-MHVexfVrAa0294m1AUE/edit?pli=1#heading=h.qiv3fp7e6zg0 From torchrec: https://www.internalfb.com/intern/wiki/Torchrec/Development/Testing_production_models/ From ge0405 baseline (without your diff): f477293168 your diff: f477292363 ``` buck2 test //caffe2/test/dynamo:test_dynamo_torchrec buck2 run 'fbcode//mode/opt' fbcode//pytorch/benchmark/fb/test_gpu:run_test_gpu -- 'pytorch.benchmark.fb.test_gpu.test_gpu.TestBenchmarkFbGpu.test_train_blue_reels_vdd_v3_inductor_speedup' ``` Differential Revision: D49236757 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109216 Approved by: https://github.com/voznesenskym	2023-09-18 14:39:44 +00:00
weifengpy	9021fb8dac	[dynamo] implement custom dict variable as a general solution for HF's ModelOutput class (#105044 ) before the PR, for HF's ModelOutput class, we use dicts.py/DataClassVariable with our own implementation on __getItem__, __setAttr__, __setItem__. There is a risk that ModelOutput logic may change since it is a user code after the PR, we inline __getItem__, __setAttr__, __setItem__ using dicts.py/CustomizedDictVariable so the logic always keep AA unit test * python test/dynamo/test_model_output.py -k test_HF_bert_model_output test on HF benchmark * python benchmarks/dynamo/huggingface.py -d cuda --inference --accuracy --progress --inductor --print-dataframe-summary 2>&1 * all metric are the same before/after the PR, including pass rate, unique_graphs, graph_breaks, unique_graph_breaks * before the PR: P790393916 * after the PR: P790368991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105044 Approved by: https://github.com/jansel	2023-09-14 17:15:50 +00:00
Michael Voznesensky	064ae9ff33	Support register_hook on input tensors (#108903 ) The strategy in this PR is pretty straightforward. There are 2 kinds of hooks: 1) Hooks on objects with sources (inputs, params) 2) Hooks on objects w/o sources (intermediaries, and outputs). Note: As outputs can be made simple by how dynamo handles residuals, they could actually be handled as if they were inputs, but, for the sake of this PR, we will refer to hooks as either hooks on inputs (sourced), or hooks on intermediaries (not sourced). The plan: For tensors w/ a source: We record registered hooks, store them as a global, and associate them with the tensor in residuals. This means that when dynamo goes to create the frame, where we produce bytecode to stitch together our PT2 modified bytecode with the original eager code, we call `register_hook`. This registration of hooks in residuals is sound because (a) it happens right after a Pt2 frame region ends and (b) we know that the tensor is alive in f_locals, f_globals, or a module in the users invoking frame. This means we can soundly know it will be around to invoke `register_hook` on. As long as we guard on the identity of the lifted function, this is sound to do. For tensors w/o a source: Graph break - we will support this in a subsequent PR Handles: An interesting new component here is the creation of a `STORE_FAST `->`LOAD_FAST` associated with the handle, the return result of `register_hook`. If the user code stored the result of `register_hook` in a handle, we need to honor that. We do so by interceding into `STORE_FAST`, and recording the name of the local variable as directed by user code. We then honor that same name in the reconstructed bytecode. If the user did not store a hook, we merely pop the produced value to preserve the stack. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108903 Approved by: https://github.com/ezyang ghstack dependencies: #108846, #109092	2023-09-14 01:52:21 +00:00
PyTorch MergeBot	1d32c9c7f2	Revert "Force synced KJT to trace unbacked SymInt (#108960 )" This reverts commit `f9a250c35b`. Reverted https://github.com/pytorch/pytorch/pull/108960 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/108960#issuecomment-1715850779))	2023-09-12 14:37:36 +00:00
Edward Yang	f9a250c35b	Force synced KJT to trace unbacked SymInt (#108960 ) Summary: The basic concept behind this diff is to modify Dynamo's tracing behavior when it encounters a KeyedJaggedTensor that is synced (aka has `_length_per_key` and `_offset_per_key` populated). These fields are lists of integers; ordinarily, Dynamo will optimistically try to specialize on integers, however, for KJTs, we know that these integers will definitely vary from run-to-run. Furthermore, ordinarily, we would also specialize these integers if they are 0/1, but we will frequently expect features in KJTs to be 0/1. The fix is to detect KJTs and treat these integers as unbacked integers. This is NOT a universally sound optimization: when treating these integers as unbacked, we never report them as equal to zero or one. In return, we always generate graphs that generalize no matter the length of values on features. This is enough to trace through APS sparse arch, torchrec_dlrm and some small split-cat examples. The special integer behavior is triggered by a dynamically scoped `force_unspec_int_unbacked_size_like` variable on TracingContext, which we trigger when we wrap a KJT. There probably are other ways to do this, but this was simple and worked. Test Plan: ``` buck2 test mode/dev-nosan //pytorch/benchmark/fb/test_gpu:run_test_gpu ``` from aakhundov 1. first build feed_lower_benchmark: ``` buck2 build --show-output mode/opt -c python.package_style=inplace -c fbcode.enable_gpu_sections=true -c fbcode.platform=platform010 -c fbcode.split-dwarf=true hpc/new/models/feed/benchmark:feed_lower_benchmark ``` 2. then run the lowering of the model with it: ``` TORCHINDUCTOR_MAX_AUTOTUNE=1 TORCHINDUCTOR_UNIQUE_KERNEL_NAMES=1 TORCH_LOGS="output_code,graph_code" TORCH_COMPILE_DEBUG=1 ../buck-out/v2/gen/fbcode/79c6b019ee0f9469/hpc/new/models/feed/benchmark/__feed_lower_benchmark__/feed_lower_benchmark.par --load=manifold://ig_inference_model/tree/user/facebook/fblearner/predictor/960999465/60/gpu_lowering/input.predictor --skip-trt --skip-ait --sync-mode=0 --enable-aot-inductor --lower-presets="ig_stories" --gpu-trace ``` cf https://docs.google.com/document/d/1yD30xYrdmM8r2HTdmXnZTg0-MHVexfVrAa0294m1AUE/edit?pli=1#heading=h.qiv3fp7e6zg0 From torchrec: https://www.internalfb.com/intern/wiki/Torchrec/Development/Testing_production_models/ From ge0405 baseline (without your diff): f477293168 your diff: f477292363 Differential Revision: D49019987 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108960 Approved by: https://github.com/voznesenskym	2023-09-12 03:44:24 +00:00
Huy Do	5a4fe05a15	Revert "Force synced KJT to trace unbacked SymInt (#107788 )" (#108684 ) This reverts commit `3b92ef814d`. So let's manually revert it instead. (Not sure why the bot doesn't work on https://github.com/pytorch/pytorch/pull/107788) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108684 Approved by: https://github.com/ezyang	2023-09-06 19:15:45 +00:00
Edward Z. Yang	3b92ef814d	Force synced KJT to trace unbacked SymInt (#107788 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/107788 Approved by: https://github.com/voznesenskym	2023-09-06 03:18:26 +00:00
voznesenskym	5d85d897e0	Torchrec Enablement Fixes - Re-PR 107910 (#108018 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108018 Approved by: https://github.com/wconstab	2023-08-28 19:47:53 +00:00
Jason Ansel	f877d0a4bf	[dynamo] Treat monkey patched .forward as dynamic (#107104 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107104 Approved by: https://github.com/anijain2305	2023-08-26 01:41:29 +00:00
PyTorch MergeBot	eefce56b66	Revert "[dynamo] Treat monkey patched .forward as dynamic (#107104 )" This reverts commit `79b3a9f945`. Reverted https://github.com/pytorch/pytorch/pull/107104 on behalf of https://github.com/ZainRizvi due to Breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/107104#issuecomment-1692072018))	2023-08-24 16:55:33 +00:00
Jason Ansel	79b3a9f945	[dynamo] Treat monkey patched .forward as dynamic (#107104 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107104 Approved by: https://github.com/anijain2305	2023-08-23 19:03:02 +00:00
Animesh Jain	12b0372a75	[dynamo] Continue on fbgemm import fail (#107622 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107622 Approved by: https://github.com/voznesenskym	2023-08-22 02:16:31 +00:00
zhxchen17	8d6a487d69	[dynamo] Make KeyedJaggedTensor a variable. (#107319 ) This is extracted from https://github.com/pytorch/pytorch/pull/107156/ to model KeyedKaggedTensor as a first class concept in dynamo. Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/107319 Approved by: https://github.com/ezyang	2023-08-18 17:15:46 +00:00
Tugsbayasgalan Manlaibaatar	df50f91571	Support fx_pytree in dynamo (#105574 ) This PR does two things: 1. Make dynamo trace through fx_pytree (on top of torch.utils._pytree) so that generated graph modules can be retraced. 2. Fix bug where unflatten not returning dynamo VariableTracker. Differential Revision: [D47734623](https://our.internmc.facebook.com/intern/diff/D47734623) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105574 Approved by: https://github.com/yanboliang, https://github.com/ydwu4	2023-07-29 05:08:15 +00:00
Wanchao Liang	c76c84bde4	[dynamo] make ProcessGroupVariable a DistributedVariable (#105593 ) This PR move the ProcessGroupVariable from UDO to DistributedVT so that Distributed VTs are consolidated together Pull Request resolved: https://github.com/pytorch/pytorch/pull/105593 Approved by: https://github.com/voznesenskym	2023-07-26 06:42:50 +00:00
Michael Voznesensky	54a673bdcf	Initial sourceless builder (#104734 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104734 Approved by: https://github.com/ezyang	2023-07-24 02:48:32 +00:00
Animesh Jain	88aa51fe85	[dynamo] Support defaults for namedtuples (#105341 ) Fixes https://github.com/pytorch/pytorch/issues/103008 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105341 Approved by: https://github.com/jansel	2023-07-17 23:52:57 +00:00
Richard Zou	408cb45e14	[Dynamo] Support threading.local getattr (#104292 ) Fixes #104066 threading.local has a custom `__getattribute__` so `_getattr_static` doesn't work with it. Since we know that threading.local's `__getattribute__` is well behaved (e.g. https://github.com/python/cpython/blob/3.11/Lib/_threading_local.py), we can just special case it. Test Plan: - new tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/104292 Approved by: https://github.com/williamwen42, https://github.com/jansel	2023-06-29 14:32:37 +00:00
Animesh Jain	2bb83cd45c	[dynamo][ac] Minor refactor for better code organization and a bugfix (#104276 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104276 Approved by: https://github.com/zou3519	2023-06-29 12:57:59 +00:00
David Berard	a8b63d4d1b	[dynamo] If UserDefinedObjectVariable.var_getattr() is a callable, try handling as a TorchVariable (#104231 ) In some cases, a UserFunctionVariable can be constructed when the underlying function is actually a TorchVariable. One example is when an attribute on a UnspecializedNNModuleVariable is a torch function. In those cases, we should treat the UserFunctionVariable as a TorchVariable. This adds a check in UserDefinedObjectVariable.var_getattr() to try to create a TorchVariable instead of a UserFunctionVariable. Fixes #104172 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104231 Approved by: https://github.com/williamwen42, https://github.com/jansel	2023-06-28 02:39:03 +00:00
Animesh Jain	75dab587ef	[dynamo] FSDP + AC + torch.compile (#103953 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103953 Approved by: https://github.com/wanchaol	2023-06-24 01:40:56 +00:00
Yukio Siraichi	7550ec16a4	Add support for dictionary with torch object keys. (#103158 ) Fixes: #101979 This PR adds support for dictionaries with torch object as keys in dynamo. The main problem was that, for example, the source built for `d[torch.float]` (`d` being a dictionary) was `ODictGetItemSource(GlobalSource('d'), index=torch.float)`. When `Source.name` method was called, we got `odict_getitem(G['d'], torch.float)`. Evaluating that string raised an error, since `torch` was only available in the global dictionary `G` as `G["torch"]`. Instead, this PR builds the source: `ODictGetItemSource(GlobalSource('d'), index=AttrSource(GlobalSource('torch'), 'float'))`. The to-be-evaluated string is correctly generated as: `odict_getitem(G['d'], G['torch'].float)`. Here's a minimal example that reproduces the error, before this PR: ```python import torch d = { torch.float16: torch.float32, } @torch.compile def f(): return torch.randn(3, dtype=d[torch.float16]) f() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/103158 Approved by: https://github.com/mlazos	2023-06-09 20:18:49 +00:00
ydwu4	3c896a5adb	[dynamo] fix torch.distributions lazy_attribute failure (#103208 ) Fixes #93340. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103208 Approved by: https://github.com/yanboliang	2023-06-08 17:26:54 +00:00
Yanbo Liang	d92bb036a4	[Dynamo] Fix if condition on UnspecializedNNModuleVariable (#102583 ) Fixes #102315 The root cause is for ```UnspecializedNNModuleVariable``` which extends from ```UserDefinedObjectVariable```, if ```__bool__``` is missing, we should use ```__len__``` to infer a truth value. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102583 Approved by: https://github.com/jansel	2023-06-03 03:42:15 +00:00
Yanbo Liang	9fa82c90f7	[Dynamo] Correct UserDefinedObjectVariable.var_getattr on function/method type (#102580 ) Fixes #102329 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102580 Approved by: https://github.com/jansel	2023-06-01 05:04:13 +00:00
Will Constable	e344ff4113	Support dynamo tracing collectives with processgroup arg (#102222 ) Previously, other types of rank descriptors worked but pg caused dynamo to break down when tracing the internal func that converts pg to rank list. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102222 Approved by: https://github.com/wanchaol, https://github.com/voznesenskym	2023-05-27 03:01:49 +00:00
Yanbo Liang	7052fb37bd	[Dynamo] Improve handling UnspecializedNNModuleVariable side effect (#101141 ) Fixes #101102 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101141 Approved by: https://github.com/jansel	2023-05-16 03:57:13 +00:00
Michael Voznesensky	9f3c6b1b63	Fix graph break in a common `func(self, args)` pattern (Faster stable diffusion) (#100444 ) Stable Diffusion has a pattern like this: ``` def forward(self, hidden_states, encoder_hidden_states=None, attention_mask=None, cross_attention_kwargs): # The `Attention` class can call different attention processors / attention functions # here we simply pass along all tensors to the selected processor class # For standard processors that are defined here, `cross_attention_kwargs` is empty return self.processor( self, hidden_states, encoder_hidden_states=encoder_hidden_states, attention_mask=attention_mask, *cross_attention_kwargs, ) ``` Wherein processor is something like `AttnProcessor2_0`, which is callable but not an NNModule. This allows for a significant speedup in stable diffusion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100444 Approved by: https://github.com/anijain2305	2023-05-04 03:38:52 +00:00
Andrew Gu	23de2e0620	[Dynamo] Fix staticmethods for FSDP (#100117 ) This PR fixes capturing static methods for FSDP-managed modules. Previously, if a static method was invoked using `self.<staticmethod>`, then Dynamo would pass `self` twice to the method, causing a graph break due to the method being "unsupported". This PR achieves this by checking for `staticmethod` and using `UserFunctionVariable` instead of `UserMethodVariable`, which handles the correct calling convention. This fixes FSDP + PT2 on HuggingFace's `T5ForConditionalGeneration`, which otherwise reports an error like the following based on the most recent trunk: ``` Output 0 of AsStridedBackward0 is a view of a view which was created in no_grad mode and is being modified inplace with grad mode enabled. ``` This is in reference to the `scores` tensor in `scores += position_bias_masked` ([code](`a0ae2310ec/src/transformers/models/t5/modeling_t5.py (L559)`)). I am not clear if this PR's fix is actually masking a different problem though. I wonder if there are edge cases with respect to Dynamo resuming execution and input mutations. Possibly, this PR only side steps the problem because there is no more recompilation at the static method `_relative_position_bucket()` ([code](`a0ae2310ec/src/transformers/models/t5/modeling_t5.py (L443)`)). In `UserDefinedObjectVariable.var_getattr()`, there is an existing branch: `e5291e633f/torch/_dynamo/variables/user_defined.py (L395-L398)` I am not clear on when this branch can be triggered since if `subobj` is a static method, it still takes the `FunctionTypes` branch: `e5291e633f/torch/_dynamo/variables/user_defined.py (L403-L404)` To preserve backward compatibility, the current version of this PR only modifies this `FunctionTypes` branch to differentiate between `staticmethod` and not `staticmethod`. The PR that added this `FunctionTypes` branch is https://github.com/pytorch/pytorch/pull/92050/, and I checked that the added test `test_torch_distributions_functions()` only exercises the non-`staticmethod` case (since `Independent.log_prob` is not a `staticmethod`). The last commit in `pytorch` that touched the `staticmethod` branch before https://github.com/pytorch/pytorch/pull/92050/ was the move from the `torchdynamo` repo into `pytorch`, so I cannot easily tell which test cases it corresponds to. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100117 Approved by: https://github.com/anijain2305	2023-04-28 14:31:20 +00:00
Yanbo Liang	789070986c	[Dynamo] Implementing generic context manager by inlining __enter__ and __exit__ (#98725 ) This is a draft version of generic context manager, I believe there are some scenarios that I didn't anticipate. I posted this draft for discussion and check if this is the right direction. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98725 Approved by: https://github.com/jansel	2023-04-20 01:16:15 +00:00
Jason Ansel	47c685def3	[dynamo] Support DELETE_ATTR (#98698 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98698 Approved by: https://github.com/yanboliang	2023-04-15 20:31:40 +00:00
Yanbo Liang	ad1d842234	[Dynamo] Make python random calls real random (#98812 ) Fixes #95425 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98812 Approved by: https://github.com/wconstab	2023-04-11 16:57:34 +00:00
Jason Ansel	55afaa46a4	Support functools.partial and itertools.product (#98120 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98120 Approved by: https://github.com/anijain2305	2023-04-03 18:23:25 +00:00
Jason Ansel	76074dc0a3	Improve support for dict subclasses (#98154 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98154 Approved by: https://github.com/anijain2305	2023-04-03 01:42:08 +00:00
Jason Ansel	35b3309539	Fix graph break from inline patched init (#98150 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98150 Approved by: https://github.com/anijain2305, https://github.com/yanboliang	2023-04-03 01:11:30 +00:00

1 2

66 Commits