pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Animesh Jain	cdb9df5efe	[dynamo][guards] De-dupe DUPLICATE_INPUT guard (#134354 ) Hard to write a test. Pull Request resolved: https://github.com/pytorch/pytorch/pull/134354 Approved by: https://github.com/jansel ghstack dependencies: #134272	2024-08-24 15:17:56 +00:00
Animesh Jain	b3821f1da1	[dynamo][guards][logs] Generate code_parts for debugging (#134181 ) Fixes https://github.com/pytorch/pytorch/issues/132692 Pull Request resolved: https://github.com/pytorch/pytorch/pull/134181 Approved by: https://github.com/youkaichao, https://github.com/jansel ghstack dependencies: #133742, #134016, #134039	2024-08-22 23:40:37 +00:00
Animesh Jain	fee677eeb6	[fbode-testing][dynamo][reland][inline-inbuilt-nn-modules] Mark attri… (#134136 ) Shuai wants to test this internally before https://github.com/pytorch/pytorch/pull/133713 can go in. Creating a separate PR for ghmport. Pull Request resolved: https://github.com/pytorch/pytorch/pull/134136 Approved by: https://github.com/yanboliang	2024-08-22 17:54:58 +00:00
Avik Chaudhuri	0d7ac1966a	kill sharing of constraints (#134045 ) Summary: Previously, reuse of the same `Dim` was encoded by "sharing" internal constraints among constraint targets. This kind of sharing, implemented using `shared` fields between `_Constraint`s, was originally motivated by `dynamic_dim`, specifically to support `==` between `dynamic_dim`s, but we no longer need to maintain this overcomplicated structure: we can simply use names of `Dims` to directly encode sharing information. Thus this PR vastly simplifies the structure of `_Constraint` by removing `shared` fields. As a result, both `_Constraint` and its moral subclass, `_DerivedConstraint`, are 1-1 with `Dim` and its moral subclass, `DerivedDim`. Note that this will break `==` over `dynamic_dim`, so an immediate follow-up will be to remove `dynamic_dim` entirely from our public API. (It's been more than 6 months since the deprecation warning anyway.) I just didn't want to deal with that process in the same PR. Test Plan: existing Differential Revision: D61559413 Pull Request resolved: https://github.com/pytorch/pytorch/pull/134045 Approved by: https://github.com/pianpwk	2024-08-22 04:40:47 +00:00
PyTorch MergeBot	68425e68fe	Revert "[dynamo][reland][inline-inbuilt-nn-modules] Mark attributes of nn mod… (#133714 )" This reverts commit `e8d3c4be36`. Reverted https://github.com/pytorch/pytorch/pull/133714 on behalf of https://github.com/anijain2305 due to fails internally ([comment](https://github.com/pytorch/pytorch/pull/133714#issuecomment-2302171472))	2024-08-21 14:21:06 +00:00
Animesh Jain	bd0db490bf	[dynamo][set] Fix EQUALS_MATCH guard for constant sets and lists (#134016 ) Fixes https://github.com/pytorch/pytorch/issues/133509 Pull Request resolved: https://github.com/pytorch/pytorch/pull/134016 Approved by: https://github.com/laithsakka, https://github.com/jansel ghstack dependencies: #133742	2024-08-21 12:41:52 +00:00
Michael Lazos	09e366cb57	[Dynamo] Add torch function mode stack guard to dynamo (#133130 ) This PR adds a guard on the torch function mode stack state at the beginning of tracing. The way this is implemented is via a new leaf guard which is passed the initial stack state at construction and compares it to the stack state at the time the guard is run. Details: The stack state is extracted via popping all modes, appending them to a list, and pushing all modes back. This list is stored on the output graph and read during guard construction to pass to the stack mode guard. There the length and types of the modes are recorded. Next time the guard is run it compares this recorded state to the current mode stack state. To implement this in python a helper function was added to utils.py and this is used if cpp guards are not enabled. Pull Request resolved: https://github.com/pytorch/pytorch/pull/133130 Approved by: https://github.com/anijain2305	2024-08-20 07:14:33 +00:00
Animesh Jain	e8d3c4be36	[dynamo][reland][inline-inbuilt-nn-modules] Mark attributes of nn mod… (#133714 ) Relands https://github.com/pytorch/pytorch/pull/132539 Relands https://github.com/pytorch/pytorch/pull/132736 Pull Request resolved: https://github.com/pytorch/pytorch/pull/133714 Approved by: https://github.com/jansel	2024-08-20 05:57:52 +00:00
PyTorch MergeBot	35f36363ec	Revert "[dtensor] move DTensor to public namespace (#133113 )" This reverts commit `2ee6b97464`. Reverted https://github.com/pytorch/pytorch/pull/133113 on behalf of https://github.com/wanchaol due to looks like it break some internal type imports ([comment](https://github.com/pytorch/pytorch/pull/133113#issuecomment-2295670911))	2024-08-19 05:00:19 +00:00
Wanchao Liang	2ee6b97464	[dtensor] move DTensor to public namespace (#133113 ) Moving DTensor to be in the public namespace, to formally add the documentation page that includes all the public APIs. This includes: * many path renames and path import fixes * a dedicated doc page without too much content yet (adding in the next PRs) * To preserve the BC for users still using the `torch.distributed._tensor`, I added a shim script to redirect old path calls to the new module The BC preserving is evidented by the fact that all DTensor tests are still working without changing the public imports. So it's safe to land the changes Pull Request resolved: https://github.com/pytorch/pytorch/pull/133113 Approved by: https://github.com/XilunWu ghstack dependencies: #133305, #133306	2024-08-17 05:09:52 +00:00
Edward Z. Yang	90d2593b3e	Revert #132806 , #132736 , #132539 , #132487 (#133570 ) This reverts commit `25df063f04`. This reverts commit `de00c79583`. This reverts commit `419b76c4ac`. This reverts commit `bc57d5b6ff`. Differential Revision: [D61335013](https://our.internmc.facebook.com/intern/diff/D61335013) Pull Request resolved: https://github.com/pytorch/pytorch/pull/133570 Approved by: https://github.com/albanD, https://github.com/jansel, https://github.com/anijain2305	2024-08-15 20:54:21 +00:00
Xuehai Pan	24dee99cb7	Populate submodules of `torch._C` to `sys.modules` recursively (#132216 ) See comment: `e9d1c26275/torch/__init__.py (L938-L950)` This PR recursively sets the submodules in the C extension to `sys.modules` (e.g., `_C._dynamo.eval_frame`). Pull Request resolved: https://github.com/pytorch/pytorch/pull/132216 Approved by: https://github.com/ezyang	2024-08-08 10:20:25 +00:00
PyTorch MergeBot	ff81ca8e0c	Revert "Populate submodules of `torch._C` to `sys.modules` recursively (#132216 )" This reverts commit `672ce4610e`. Reverted https://github.com/pytorch/pytorch/pull/132216 on behalf of https://github.com/PaliC due to was breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/132216#issuecomment-2274112397))	2024-08-07 18:45:00 +00:00
Brian Hirsh	e6eee04875	dynamo: use equality guards instead of id guards for Placement/DeviceMesh (#124401 ) After talking to @anijain2305, we probably can't land this since it won't work for C++ guards. But we should still be able to do better than ID_MATCH Pull Request resolved: https://github.com/pytorch/pytorch/pull/124401 Approved by: https://github.com/anijain2305	2024-08-06 17:14:44 +00:00
Animesh Jain	419b76c4ac	[dynamo] Reland 132308, 132314, 132318, 132334 - Make builtin nn modules attributes static (#132539 ) Relanding 4 PRs ending at https://github.com/pytorch/pytorch/pull/132334 Pull Request resolved: https://github.com/pytorch/pytorch/pull/132539 Approved by: https://github.com/Skylion007, https://github.com/yanboliang, https://github.com/mlazos	2024-08-03 02:08:22 +00:00
PyTorch MergeBot	b8f7019df0	Revert "[dynamo] Track params/buffers and mark them as static (#132334 )" This reverts commit `babb249a89`. Reverted https://github.com/pytorch/pytorch/pull/132334 on behalf of https://github.com/anijain2305 due to broke internal tests ([comment](https://github.com/pytorch/pytorch/pull/132334#issuecomment-2265942261))	2024-08-02 18:41:19 +00:00
Animesh Jain	babb249a89	[dynamo] Track params/buffers and mark them as static (#132334 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/132334 Approved by: https://github.com/ezyang, https://github.com/mlazos	2024-08-02 08:55:43 +00:00
Xuehai Pan	672ce4610e	Populate submodules of `torch._C` to `sys.modules` recursively (#132216 ) See comment: `e9d1c26275/torch/__init__.py (L938-L950)` This PR recursively sets the submodules in the C extension to `sys.modules` (e.g., `_C._dynamo.eval_frame`). Pull Request resolved: https://github.com/pytorch/pytorch/pull/132216 Approved by: https://github.com/ezyang	2024-08-01 12:04:59 +00:00
Animesh Jain	612ea35395	[dynamo] Introduce UnspecializedBuiltinNNModuleSource (#132312 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/132312 Approved by: https://github.com/yanboliang ghstack dependencies: #132302, #132304	2024-08-01 06:21:05 +00:00
Animesh Jain	e772547d70	[dynamo][rename/refactor] Rename guard_source NN_MODULE to SPECIALIZED_NN_MODULE (#132302 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/132302 Approved by: https://github.com/yanboliang	2024-08-01 04:35:43 +00:00
Xuehai Pan	e74ba1b34a	[BE][Easy][15/19] enforce style for empty lines in import segments in `torch/_d*/` (#129767 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129767 Approved by: https://github.com/anijain2305	2024-07-31 21:18:11 +00:00
Animesh Jain	e2b941a1b4	[dynamo] Rename TENSOR_ALIASING to OBJECT_ALIASING. Permit OBJECT_ALIASING for dict guards (#131480 ) Fixes https://github.com/pytorch/pytorch/issues/129667 Pull Request resolved: https://github.com/pytorch/pytorch/pull/131480 Approved by: https://github.com/williamwen42 ghstack dependencies: #131347, #131367, #131378, #131389, #131405	2024-07-24 00:06:53 +00:00
Animesh Jain	e49c0acc39	[dynamo] Revert https://github.com/pytorch/pytorch/pull/130416 (#131058 ) All the changes brought by the original PR have been addressed in alternative ways in the stack. Why the original PR has to be reverted requires more effort because there is some bad interaction with export. Pull Request resolved: https://github.com/pytorch/pytorch/pull/131058 Approved by: https://github.com/williamwen42	2024-07-19 17:26:24 +00:00
Animesh Jain	ac76dd606f	[dynamo] Alternative way to skip empty hooks guards on inbuilt nn modules (#131057 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/131057 Approved by: https://github.com/williamwen42, https://github.com/jansel ghstack dependencies: #131056	2024-07-19 04:42:38 +00:00
Michael Lazos	470f07c840	Add guard override capability for tensor subclass metadata (#130780 ) Fixes https://github.com/pytorch/pytorch/issues/114405 Pull Request resolved: https://github.com/pytorch/pytorch/pull/130780 Approved by: https://github.com/anijain2305, https://github.com/bdhirsh ghstack dependencies: #130779	2024-07-17 19:13:53 +00:00
Michael Lazos	bea6762c01	Add guards on subclass metadata (#130779 ) This PR adds guards in dynamo which verify the equality of tensor subclass metadata along with tests verifying the expected recompile behavior. The next PR adds the capability to override the guard behavior to possibly perform the check in a less expensive manner. Toward fixing https://github.com/pytorch/pytorch/issues/114405 Pull Request resolved: https://github.com/pytorch/pytorch/pull/130779 Approved by: https://github.com/anijain2305, https://github.com/bdhirsh	2024-07-17 19:13:52 +00:00
Animesh Jain	1a266def4f	[dynamo][unsoundness but very controlled] Skip guards on inbuilt nn module hooks (#130420 ) Reduces the guard overhead from 2.1k units to 1k units. Compared to no-inlining (0.4k units), this reduces the slowdown from 5x to 2.5x. This introduces unsoundness, but only for hooks for inbuilt nn modules (user defined nn module hooks are fine). Each builtin nn module adds 4 empty ordered dict checks in the check_fn. This blows up for models with large numbers of builtin nn modules. With this PR, we skip those guards. There is no other easy way I can think of right now to control the guard overhead. Pull Request resolved: https://github.com/pytorch/pytorch/pull/130420 Approved by: https://github.com/jansel ghstack dependencies: #130654	2024-07-15 06:19:53 +00:00
Yidi Wu	1cae60a87e	Caching attr_proxy for nn_module attribute to fix guard check failure (#130280 ) Fixes https://github.com/pytorch/pytorch/issues/129939 Differential Revision: [D59594605](https://our.internmc.facebook.com/intern/diff/D59594605) Pull Request resolved: https://github.com/pytorch/pytorch/pull/130280 Approved by: https://github.com/anijain2305	2024-07-11 18:21:35 +00:00
Xuehai Pan	973037be6a	[BE][Easy] apply autofix for ruff rules unnecessary-collection-call (C408): `list()` / `tuple()` / `dict()` (#130199 ) This PR changes the empty collection factory call to Python literals: - `list()` -> `[]` - `tuple()` -> `()` - `dict()` -> `{}` The Python literals are more performant and safer. For example, the bytecode for building an empty dictionary: ```bash $ python3 -m dis - <<EOS import collections d1 = {} d2 = dict() dict = collections.OrderedDict d3 = dict() EOS ``` ```text 0 0 RESUME 0 1 2 LOAD_CONST 0 (0) 4 LOAD_CONST 1 (None) 6 IMPORT_NAME 0 (collections) 8 STORE_NAME 0 (collections) 3 10 BUILD_MAP 0 12 STORE_NAME 1 (d1) 4 14 PUSH_NULL 16 LOAD_NAME 2 (dict) 18 CALL 0 26 STORE_NAME 3 (d2) 6 28 LOAD_NAME 0 (collections) 30 LOAD_ATTR 8 (OrderedDict) 50 STORE_NAME 2 (dict) 7 52 PUSH_NULL 54 LOAD_NAME 2 (dict) 56 CALL 0 64 STORE_NAME 5 (d3) 66 RETURN_CONST 1 (None) ``` The dict literal `{}` only has one bytecode `BUILD_MAP`, while the factory call `dict()` has three `PUSH_NULL + LOAD_NAME + CALL`. Also, the factory call is not safe if users override the `dict` name in `locals` or `globals` (see the example of replacing with `OrderedDict` above). Pull Request resolved: https://github.com/pytorch/pytorch/pull/130199 Approved by: https://github.com/malfet	2024-07-11 17:30:28 +00:00
Animesh Jain	f7d7b94017	[dynamo][unspecialized-nn-module] Distinguish between user-defined and builtin nn module (#130416 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/130416 Approved by: https://github.com/jansel ghstack dependencies: #130285, #130368	2024-07-11 14:13:24 +00:00
Animesh Jain	fed8b0055f	[dynamo][bufgix] Fix the value for key manager (#130368 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/130368 Approved by: https://github.com/jansel ghstack dependencies: #130285	2024-07-11 14:13:19 +00:00
Animesh Jain	9c612df504	[dynamo][cpp-guards][QOL] Print NO_TENSOR_ALIASING guard once (#130285 ) NO_TENSOR_ALIASING guard lists all tensors. Printing it on every occurence is ugly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/130285 Approved by: https://github.com/jansel	2024-07-11 14:13:14 +00:00
Edward Z. Yang	e836ee1955	Enhancements to recompiles logs (#130043 ) ---- - We now record on CacheEntry what the compile id that populated it was, so now we can say why a specific frame was rejected - Add structured log for recompiles under name artifact "recompile_reasons". As it stands, it's not terribly structured, but this was the easiest thing I could do to start - Slightly reformat multi-reason printing; since we only report one guard failure seems better to have it as a single line Example output: ``` V0703 10:34:13.273000 140345997743104 torch/_dynamo/guards.py:2590] [0/1] [__recompiles] Recompiling function f in /data/users/ezyang/a/pytorch/b.py:3 V0703 10:34:13.273000 140345997743104 torch/_dynamo/guards.py:2590] [0/1] [__recompiles] triggered by the following guard failure(s): V0703 10:34:13.273000 140345997743104 torch/_dynamo/guards.py:2590] [0/1] [__recompiles] - 0/0: tensor 'L['x']' size mismatch at index 0. expected 4, actual 5 ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/130043 Approved by: https://github.com/anijain2305	2024-07-09 03:40:56 +00:00
Animesh Jain	7ea8a3c9b8	[dynamo] Validate check_fn (#118448 ) Fixes - https://github.com/pytorch/pytorch/issues/128090 Tracker issue here - https://github.com/pytorch/pytorch/issues/129937 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118448 Approved by: https://github.com/jansel, https://github.com/ezyang	2024-07-05 18:04:12 +00:00
Joel Schlosser	6897631ceb	Guard on inner tensor names for traceable wrapper subclasses (#129618 ) Fixes #129601 Background: it's possible that a traceable wrapper subclass will have an optional inner tensor constituent (e.g. NJT's cached min / max sequence lengths). To specify this, the subclass's `__tensor_flatten__()` impl should leave out any unspecified optional inner tensors in the returned list of `attrs`. This PR guards on the list of inner tensor `attrs` returned in `subclass.__tensor_flatten__()[0]`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/129618 Approved by: https://github.com/anijain2305	2024-06-28 16:30:25 +00:00
Animesh Jain	17d1723aee	[dynamo][unspecialized-nn-modules] Remove dead (also incorrect) code (#129316 ) This code is unused because we just inline the `.parameters` call. The code was also wrong because side-effects only track the first level of mutations. An object might not marked mutated if one of the child objects (like a dict) is mutated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/129316 Approved by: https://github.com/jansel	2024-06-23 03:02:27 +00:00
Animesh Jain	c008488b9c	[dynamo][guards] Dont run TYPE_MATCH for DICT_LENGTH C++ guard (#129163 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/129163 Approved by: https://github.com/williamwen42, https://github.com/jansel	2024-06-21 06:27:19 +00:00
Animesh Jain	f2f4dde2d3	[dynamo] Remove ID_MATCH for FSDPModuleVariable (#129015 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/129015 Approved by: https://github.com/yf225 ghstack dependencies: #129098	2024-06-20 19:23:32 +00:00
Animesh Jain	ea47d542ca	[dynamo][guards] Remove BOOL_FALSE - not needed after C++ guards (#129098 ) PyDict_Size is very fast ... earlier with Python guards, Cpython will go through layers of fluff to finally call the PyDict_Size. With C++ guards, its not needed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/129098 Approved by: https://github.com/jansel	2024-06-20 14:40:27 +00:00
Will Feng	979edbbe12	[Traceable FSDP2] Dynamo support FSDP2 use_training_state context manager (#127854 ) Improve Dynamo to support the FSDP2 `use_training_state()` context manager. Test command: ` pytest -rA test/distributed/_composable/fsdp/test_fully_shard_compile.py::TestFullyShardCompile::test_dynamo_trace_use_training_state ` Pull Request resolved: https://github.com/pytorch/pytorch/pull/127854 Approved by: https://github.com/yanboliang	2024-06-16 08:48:52 +00:00
Animesh Jain	7e092a62e6	[dynamo] Support weakref objects (#128533 ) Fixes https://github.com/pytorch/pytorch/issues/125720 I was earlier worried that DELETE_* or STORE_* on referent values should result in a graph break, because they could invalidate the weak ref. But then @zou3519 pointed out that weakref invalidation will happen EVENTUALLY, CPython provides no guarantees when the weakref will be invalidated (even when the user calls del x and x is the last reference). So any code that relies on del x to invalidate the weakref of x right away is BAD code. CPython provide no guarantees. Therefore we can (ab)use this nuance, and can just ignore DELETE_* or STORE_* on the referent objects. The only corner case is when Dynamo is reconstructing the weakref object. Dynamo will have a hard time being correct here, so just SKIP_FRAME on such a case. This is rare. Cpython notes 1) https://docs.python.org/3/library/weakref.html 2) https://docs.python.org/3/reference/datamodel.html#index-2 Pull Request resolved: https://github.com/pytorch/pytorch/pull/128533 Approved by: https://github.com/jansel	2024-06-15 02:16:25 +00:00
Aaron Orenstein	dcfa7702c3	Flip default value for mypy disallow_untyped_defs [1/11] (#127838 ) See #127836 for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127838 Approved by: https://github.com/oulgen	2024-06-08 18:16:33 +00:00
Animesh Jain	bb6bfd9ad8	[dynamo][compile-time] Cache the child guard managers (#127377 ) Reduces compile time of MobileBertForMaskedLM model from 39 seconds to 26 seconds. This was a regression introduced by #125202. Before that PR, compile time was 24 seconds. The extra two seconds is just because we are going through enormous number of guards. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127377 Approved by: https://github.com/jansel	2024-05-31 04:23:56 +00:00
Animesh Jain	1876f0fec1	[dynamo][nn module guards] Use TENSOR_MATCH, and not ID_MATCH, for numpy tensors (#126246 ) Fixes speech_transformer regression here - https://hud.pytorch.org/benchmark/torchbench/inductor_no_cudagraphs?startTime=Tue%2C%2007%20May%202024%2019%3A22%3A54%20GMT&stopTime=Tue%2C%2014%20May%202024%2019%3A22%3A54%20GMT&granularity=hour&mode=training&dtype=amp&lBranch=main&lCommit=02093b6c6ae1046368e2500881d0bb5880873386&rBranch=main&rCommit=b24ad7eab55eaf660893dddae949fc714e434338 Thanks to @eellison and @bdhirsh for isolating the regression to nn module guards. Pull Request resolved: https://github.com/pytorch/pytorch/pull/126246 Approved by: https://github.com/jansel ghstack dependencies: #126203	2024-05-16 01:57:59 +00:00
Animesh Jain	90461d4986	[dynamo] Detect monkeypatching on nn module forward method (#126203 ) An alternative was https://github.com/pytorch/pytorch/pull/124975. Though it was safer because it was adding guards for every inlined function, it was causing guard overhead for a few models of > 20%. The overhead of this PR is minimal for the common unpatched case. Fixes an internal issue - [fb.workplace.com/groups/1075192433118967/permalink/1411067766198097](https://fb.workplace.com/groups/1075192433118967/permalink/1411067766198097/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/126203 Approved by: https://github.com/ezyang	2024-05-15 20:41:13 +00:00
Edward Z. Yang	2ba102f689	Implement native support for float inputs in Dynamo and ShapeEnv (#125325 ) The big idea is that floats are treated as Tensors on input/output to the FX graph, but on the inside, we immediately call item() on the synthetic Tensor and record regular float operations on it. Canonicalization to Tensor operations will happen in a standalone FX pass. This behavior is controlled by `specialize_float` config variable when set to False. The generated graph looks like this for the test `test_unspec_float_output`: ``` def forward(self, L_x_: "f32[3]", L_y_: "f32[]"): l_x_ = L_x_ l_y_ = L_y_ # File: /data/users/ezyang/a/pytorch/test/dynamo/test_unspec.py:511 in f, code: return x + 1, y * 2 add: "f32[3]" = l_x_ + 1; l_x_ = None item: "Sym(zf0)" = l_y_.item(); l_y_ = None mul: "Sym(2zf0)" = item 2; item = None scalar_tensor: "f32[]" = torch.scalar_tensor(mul); mul = None return (add, scalar_tensor) ``` The ingredients: * torch/_dynamo/variables/builder.py When `specialize_float` is False, we wrap float literals with `wrap_symfloat`. This is an unholy mashup of `wrap_symint` and `wrap_unspecialized_primitive`. The overall strategy is that we first generate a tensor argument (because that's what we want to show up into the FX graph), but then immediately call item() on the tensor argument to get a SymNodeVariable, which we will do the rest of the tracing with. Importantly, this SymNodeVariable is backed with the source of the original float: this means we can guard on the resulting value (something we could NOT do with UnspecializedPythonVariable). This has to be done manually, because if you literally call item() on the tensor, you will end up with an unbacked float. There is a bit of copy paste from wrap_symint and wrap_unspecialized_primitive which we can try to factor out, but this really is its own thing and you should review every line of code in the function. * torch/fx/experimental/symbolic_shapes.py We now can generate guards on float inputs, and these guards are handled inside of ShapeEnv. So we need to be able to allocate (backed!) float symbols, and produce guards for them. Fairly straightforward generalization. * torch/_dynamo/codegen.py I also need to maintain the invariant that there are no float outputs to the FX graph. I chose to do this at codegen time. When we detect a SymNodeVariable on the return stack for a float, we on the fly convert it (via `as_tensor`) to a TensorVariable, which is the true output. We then special case the output bytecode to call item() on it again. The tensor conversion is memoized on SymNodeVariable since we typically run the code generation process twice. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/125325 Approved by: https://github.com/lezcano, https://github.com/jansel	2024-05-14 04:10:01 +00:00
Animesh Jain	ae5e2ab92e	[dynamo][fsdp] Use Tensor match for FSDP modules (#125827 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125827 Approved by: https://github.com/yf225, https://github.com/jansel ghstack dependencies: #125828, #125805	2024-05-09 21:26:15 +00:00
ydwu4	461ffaaaf3	[dynamo] support torchbind object input (#124978 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/124978 Approved by: https://github.com/jansel	2024-05-07 03:02:00 +00:00
Aaron Gokaslan	1dd42e42c4	[BE]: Try TCH autofixes on torch/ (#125536 ) Tries TCH autofixes and see what breaks Pull Request resolved: https://github.com/pytorch/pytorch/pull/125536 Approved by: https://github.com/ezyang	2024-05-05 23:13:59 +00:00
Animesh Jain	5ba777f46e	[guards][cpp-guards] Optimize NN module getattr guards (#124522 ) Improves the guard overhead of MobileBert model with nn module guards from 92000 units to 20000 units. Pull Request resolved: https://github.com/pytorch/pytorch/pull/124522 Approved by: https://github.com/jansel ghstack dependencies: #125439, #125421	2024-05-04 22:08:56 +00:00
Animesh Jain	8706da2bad	[dynamo][cpp-guards] Improve recompilation reason logic for NO_TENSOR_ALIASING guard (#125439 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125439 Approved by: https://github.com/williamwen42	2024-05-03 04:49:41 +00:00
Animesh Jain	a13a0a2479	[dynamo][easy] Simple fixes to prepare for nn module guards (#125316 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125316 Approved by: https://github.com/williamwen42 ghstack dependencies: #125275	2024-05-02 12:08:11 +00:00
Edward Z. Yang	da5d2d9b3e	Hotfix: restore CPP guard string in structured trace (#125303 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/125303 Approved by: https://github.com/albanD	2024-05-02 03:57:19 +00:00
Animesh Jain	e68d65dae2	[dynamo][cpp-guards] Differentiate dict guards wrt to guarding on key order (#124779 ) We guard on key order 1) When a key is a non-constant object 2) When we actually need key order - like .values, .items etc For dicts/OrderedDicts that do not require key order guarding, we just rely on usual `GuardManger + DictGetItemGuardAccessor`. This is faster than going through the `list(d.keys())` based design for OrderedDicts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/124779 Approved by: https://github.com/jansel	2024-04-25 08:20:35 +00:00
Jason Ansel	11e6f84ad8	[dynamo] Graph break on uninitialized nn.Module (#123790 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123790 Approved by: https://github.com/anijain2305 ghstack dependencies: #123700, #123705, #123786	2024-04-12 19:03:13 +00:00
Animesh Jain	b9675e820e	[dynamo][cpp-guards] Improve the logs (#123780 ) For this program ~~~ @torch.compile(backend="eager") def fn(x, y, d): return x * y * d["foo"] * d["bar"] ~~~ Python logs are ~~~ V0410 15:48:57.778000 140318524949632 torch/_dynamo/guards.py:1785] [0/0] [__guards] GUARDS: V0410 15:48:57.778000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] ___check_type_id(L['d'], 8833952) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.778000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] len(L['d']) == 2 # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] list(L['d'].keys()) == ['foo', 'bar'] # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] hasattr(L['x'], '_dynamo_dynamic_indices') == False # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] hasattr(L['y'], '_dynamo_dynamic_indices') == False # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] ___check_type_id(L['d']['bar'], 8842592) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] L['d']['bar'] == 2 # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] ___check_type_id(L['d']['foo'], 8842592) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] L['d']['foo'] == 4 # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] utils_device.CURRENT_DEVICE == None # _dynamo/output_graph.py:450 in init_ambient_guards V0410 15:48:57.779000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] check_tensor(L['x'], Tensor, DispatchKeySet(CPU, BackendSelect, ADInplaceOrView, AutogradCPU), torch.float32, device=None, requires_grad=False, size=[4], stride=[1]) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:48:57.780000 140318524949632 torch/_dynamo/guards.py:1803] [0/0] [__guards] check_tensor(L['y'], Tensor, DispatchKeySet(CPU, BackendSelect, ADInplaceOrView, AutogradCPU), torch.float32, device=None, requires_grad=False, size=[4], stride=[1]) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn ~~~ CPP logs are ~~~ V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1792] [0/0] [__guards] GUARDS: V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] TREE_GUARD_MANAGER: V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] +- RootGuardManager V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| +- DEFAULT_DEVICE: utils_device.CURRENT_DEVICE == None # _dynamo/output_graph.py:450 in init_ambient_guards V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| +- GLOBAL_STATE: ___check_global_state() V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| +- DictSubclassGuardManager: source=L['d'], accessed_by=DictGetItemGuardAccessor(d) V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- KeyValueManager pair at index=0 V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| +- KeyManager: GuardManager: source=list(L['d'].keys())[0] V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| \| +- EQUALS_MATCH: list(L['d'].keys())[0] == 'foo' # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| +- ValueManager: GuardManager: source=L['d']['foo'] V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| \| +- EQUALS_MATCH: L['d']['foo'] == 4 # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- KeyValueManager pair at index=1 V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| +- KeyManager: GuardManager: source=list(L['d'].keys())[1] V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| \| +- EQUALS_MATCH: list(L['d'].keys())[1] == 'bar' # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| +- ValueManager: GuardManager: source=L['d']['bar'] V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| \| \| +- EQUALS_MATCH: L['d']['bar'] == 2 # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| +- GuardManager: source=L['x'], accessed_by=DictGetItemGuardAccessor(x) V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- TENSOR_MATCH: check_tensor(L['x'], Tensor, DispatchKeySet(CPU, BackendSelect, ADInplaceOrView, AutogradCPU), torch.float32, device=None, requires_grad=False, size=[4], stride=[1]) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- NO_HASATTR: hasattr(L['x'], '_dynamo_dynamic_indices') == False # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- NO_TENSOR_ALIASING: check_no_aliasing(L['x'], L['y']) V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| +- GuardManager: source=L['y'], accessed_by=DictGetItemGuardAccessor(y) V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- TENSOR_MATCH: check_tensor(L['y'], Tensor, DispatchKeySet(CPU, BackendSelect, ADInplaceOrView, AutogradCPU), torch.float32, device=None, requires_grad=False, size=[4], stride=[1]) # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- NO_HASATTR: hasattr(L['y'], '_dynamo_dynamic_indices') == False # return x * y * d["foo"] * d["bar"] # examples/ord_dicts.py:24 in fn V0410 15:49:41.607000 140481927914624 torch/_dynamo/guards.py:1769] [0/0] [__guards] \| \| +- NO_TENSOR_ALIASING: check_no_aliasing(L['x'], L['y']) ~~~~ This info is also present in this gist for better viewing - https://gist.github.com/anijain2305/b418706e4ad4ec2d601530bc24cf8a20 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123780 Approved by: https://github.com/ezyang, https://github.com/jansel ghstack dependencies: #123773, #123787	2024-04-11 22:23:28 +00:00
Animesh Jain	b0b7aa201c	[dynamo][cpp-guards] Introduce DictSubclassGuardManager (#123773 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123773 Approved by: https://github.com/jansel	2024-04-11 22:23:28 +00:00
Animesh Jain	1346ebf12e	[dynamo][guards] Delay DUPLICATE_INPUT guard because of incorrect ordering (#123605 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123605 Approved by: https://github.com/jansel ghstack dependencies: #123606	2024-04-10 07:30:02 +00:00
Animesh Jain	7283c37c98	[dynamo] Keep guards on global function (#123423 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123423 Approved by: https://github.com/jansel	2024-04-09 04:23:11 +00:00
Animesh Jain	07cecf4168	[dynamo][cpp-guards] Fix bug for slices (#123516 ) Automatic testing as soon as we turn on cpp guards by default. Pull Request resolved: https://github.com/pytorch/pytorch/pull/123516 Approved by: https://github.com/jansel ghstack dependencies: #123515	2024-04-07 21:09:05 +00:00
Animesh Jain	8c84fe3c86	[dynamo][guards] Forward fix for #123302 (#123485 ) For some reason, adding a `TYPE_CHECK` in DATA_PTR_MATCH guard in https://github.com/pytorch/pytorch/issues/123302 increases optimizer guard overhead for `MT5ForConditionalGeneration` by 10x. There is nothing special about MT5. As we are going to move towards the CPP guards soon, there is no reason to investigate this deeper. We can use `ID_MATCH` instead of `DATA_PTR` match. Today both cant be serialized, so there is no one preference over the other. Pull Request resolved: https://github.com/pytorch/pytorch/pull/123485 Approved by: https://github.com/mlazos	2024-04-06 02:34:06 +00:00
Animesh Jain	22b9987144	[dynamo][cpp-guards] ListGetItemGuardAccessor and TupleGetItemGuardAccessor (#123396 ) Speeds up the guard-overhead microbenchmark by around 10% normalized to main-branch CPP guards ~~~ import torch @torch.compile(backend="eager") def fn(x, lst): for l in lst: x = x + l return x n = 1000 lst = [i for i in range(n)] x = torch.randn(4) print(fn(x, lst)) print("Sucess") ~~~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/123396 Approved by: https://github.com/jansel ghstack dependencies: #123285, #123302, #123303	2024-04-05 22:10:04 +00:00
Animesh Jain	6694628170	[dynamo][guards] Remove workaround after #122858 (#123303 ) Not needed since https://github.com/pytorch/pytorch/pull/122858 has landed Pull Request resolved: https://github.com/pytorch/pytorch/pull/123303 Approved by: https://github.com/mlazos ghstack dependencies: #123285, #123302	2024-04-04 03:52:50 +00:00
Animesh Jain	5b45ec8892	[dynamo][guards] Use DATA_PTR instead of ID_MATCH for tensors (#123302 ) We should sparingly use ID_MATCH guards. When it comes to performance, ID_MATCH is much faster DATA_PTR for Python guards. However, the difference is very small in C++. So, its worth just using DATA_PTR_MATCH. Pull Request resolved: https://github.com/pytorch/pytorch/pull/123302 Approved by: https://github.com/mlazos ghstack dependencies: #123285	2024-04-04 03:52:50 +00:00
Animesh Jain	fb7664d5bf	[dynamo][optimizer][guard-overhead] NOT_NONE guard for param.grad instead of TENSOR_MATCH (#123285 ) For optimizers, we do an DATA_PTR match for parameters. For param.grad, we were doing TENSOR_MATCH, but what we really need to guard is if param.grad is None or not. Therefore, I add a new guard called NOT_NONE. Further improves the guard overhead ![image](https://github.com/pytorch/pytorch/assets/13822661/574598ac-ca71-4e5e-9e75-8774577cd58f) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123285 Approved by: https://github.com/mlazos, https://github.com/jansel	2024-04-04 03:52:47 +00:00
Animesh Jain	d91db70295	[dynamo][cpp-guards] Optimize tensor.grad accessor (#123226 ) For LayoutLM model, reduces C++ guard overhead by 1.48x. These are the numbers ![image](https://github.com/pytorch/pytorch/assets/13822661/25cfc35b-b67d-4903-8403-71fa931dacdd) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123226 Approved by: https://github.com/jansel	2024-04-03 05:32:13 +00:00
Animesh Jain	969bbf8e82	[dynamo][guards] Skip aliasing guards for optimizers (#123044 ) I am ok if people don't want this PR to be merged. For optimizers, we know that the state dict and param_group have same parameters. So, I think its ok to skip TENSOR_MUST_ALIAS guards. Similarly for state tensors, all of them are different. Therefore, we can skip the tensor aliasing guards. With this PR, these are the numbers for Megatron which has 394 parameters <img width="290" alt="image" src="https://github.com/pytorch/pytorch/assets/13822661/0ce75dc6-4299-46bb-bf3c-7989ebc7cfc4"> C++ numbers jump a lot because of 2 reasons 1) We are now not doing INCREF/DECREF for a large number of tensors. 2) For python guards, we can expect higher numbers but that requires some more plumbing because the Python tensor guards are all collapsed into one. Pull Request resolved: https://github.com/pytorch/pytorch/pull/123044 Approved by: https://github.com/jansel, https://github.com/mlazos	2024-04-02 08:51:00 +00:00
Animesh Jain	234287aa16	[dynamo][cpp-guards] DUAL_LEVEL guard (#123058 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123058 Approved by: https://github.com/jansel ghstack dependencies: #123046	2024-04-01 21:09:38 +00:00
Animesh Jain	99d939f51f	[dynamo] Bugfix for HASATTR guard (#122947 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122947 Approved by: https://github.com/jansel ghstack dependencies: #122828	2024-03-29 18:50:33 +00:00
Animesh Jain	8d676a6e8e	[dynamo][cpp-guards] Bugfix for size/strides for tensor match (#122828 ) This got missed because CPP guard manager is not ON by default. Pull Request resolved: https://github.com/pytorch/pytorch/pull/122828 Approved by: https://github.com/mlazos, https://github.com/jansel	2024-03-28 00:16:49 +00:00
Animesh Jain	ceff2205e9	[dynamo][cpp-guards] Bugfix to pass on correct example_value (#122769 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122769 Approved by: https://github.com/jansel ghstack dependencies: #122646, #122647, #122716	2024-03-27 19:40:46 +00:00
Animesh Jain	5b42c41b19	[dynamo][improve-guard-overhead] Skip TENSOR_MATCH guards on parameters for optimizers (#122647 ) 1.32x guard overhead reduction (1.092 vs vs 0.827 ms) for MegatronBertForCausalLM with 394 params. Pull Request resolved: https://github.com/pytorch/pytorch/pull/122647 Approved by: https://github.com/jansel, https://github.com/mlazos ghstack dependencies: #122646	2024-03-27 19:40:43 +00:00
Joel Schlosser	07b618e2d4	Graph break cleanly in Dynamo for module parametrization (#121041 ) Fixes #118795 This is a graph breaking partial fix for #120914. We still need -actual- module parametrization tracing support, but at least it doesn't blow up hard now. Background: Module parametrization injects a property as the module parameter attribute that calls a `nn.Module` whose forward takes in a module parameter and returns a reparametrized module parameter. Example: ``` class MyParametrization(nn.Module): def forward(X): # This reparametrization just negates the original parameter value return -X m = nn.Linear(...) p = MyParametrization() register_parametrization(m, "weight", p) # Accessing the "weight" attribute will invoke p's forward() on m's original weight and return the output as the new weight. # m.weight here is now an injected property that does the above instead of an actual Parameter. # This property is defined in torch/nn/utils/parametrize.py. m.weight # NB: Parametrization changes the module type (e.g. torch.nn.utils.parametrize.ParametrizedLinear) print(type(m)) ``` Problem 1: Dynamo has special tracing rules for things in `torch.nn`. Parametrizing a module changes the type of the module and the parametrized attribute, so now these rules wrongly affect tracing here. To fix this: * For parametrized modules, call `convert_to_unspecialized()` to restart analysis where Dynamo starts inlining the module. Problem 2: The issue seen in #118795 is that Dynamo will see a dynamically constructed tensor when `m.weight` is called and introduce that to its `tensor_weakref_to_sizes_strides` cache during fake-ification. This tensor is also made to be a graph input, since it's a module parameter. When guards are created for this module parameter input, the logic calls `m.weight` again and tries to look the result up in the cache, but this is a different tensor now, giving the `KeyError` symptom. To fix this: * Replace Dynamo's `tensor_weakref_to_sizes_strides` cache with a `input_source_to_sizes_strides` cache. * This cache was originally introduced in #100128. Pull Request resolved: https://github.com/pytorch/pytorch/pull/121041 Approved by: https://github.com/anijain2305	2024-03-26 23:44:51 +00:00
Jason Ansel	5f7e71c411	[dynamo] Add HASATTR guard for UserDefinedObject attrs (#122555 ) Fixes #111522 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122555 Approved by: https://github.com/Skylion007	2024-03-24 03:41:58 +00:00
Guilherme Leobas	4eaa000acc	Teach dynamo about torch.func.jvp (#119926 ) List of changes: - Replace JVP_NESTING by torch._C._functorch.maybe_current_level() - Remove all increment nesting functions from wrap_fx_proxy_cls - fwAD.make_dual receives the dual_level as keyword argument - Add jvp_increment_nesting, set_fwd_grad_enabled and dual_level context managers to dynamo Pull Request resolved: https://github.com/pytorch/pytorch/pull/119926 Approved by: https://github.com/zou3519	2024-03-22 20:25:47 +00:00
PyTorch MergeBot	0696db8202	Revert "Teach dynamo about torch.func.jvp (#119926 )" This reverts commit `17489784b6`. Reverted https://github.com/pytorch/pytorch/pull/119926 on behalf of https://github.com/peterbell10 due to broken mac jobs on main ([comment](https://github.com/pytorch/pytorch/pull/119926#issuecomment-2010327997))	2024-03-20 18:34:43 +00:00
Guilherme Leobas	17489784b6	Teach dynamo about torch.func.jvp (#119926 ) List of changes: - Replace JVP_NESTING by torch._C._functorch.maybe_current_level() - Remove all increment nesting functions from wrap_fx_proxy_cls - fwAD.make_dual receives the dual_level as keyword argument - Add jvp_increment_nesting, set_fwd_grad_enabled and dual_level context managers to dynamo Pull Request resolved: https://github.com/pytorch/pytorch/pull/119926 Approved by: https://github.com/zou3519	2024-03-20 13:09:19 +00:00
PyTorch MergeBot	36e5c1dcab	Revert "Teach dynamo about torch.func.jvp (#119926 )" This reverts commit `edd04b7c16`. Reverted https://github.com/pytorch/pytorch/pull/119926 on behalf of https://github.com/jeanschmidt due to lots of breakages in pull jobs, checking if reverting this one will help ([comment](https://github.com/pytorch/pytorch/pull/119926#issuecomment-2007915919))	2024-03-19 18:59:46 +00:00
Guilherme Leobas	edd04b7c16	Teach dynamo about torch.func.jvp (#119926 ) List of changes: - Replace JVP_NESTING by torch._C._functorch.maybe_current_level() - Remove all increment nesting functions from wrap_fx_proxy_cls - fwAD.make_dual receives the dual_level as keyword argument - Add jvp_increment_nesting, set_fwd_grad_enabled and dual_level context managers to dynamo Pull Request resolved: https://github.com/pytorch/pytorch/pull/119926 Approved by: https://github.com/zou3519	2024-03-19 13:06:42 +00:00
Animesh Jain	8860c625ea	[dynamo][guards-cpp-refactor] Integrate cpp guard manager with CheckFnManager (#120726 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120726 Approved by: https://github.com/jansel	2024-03-19 03:11:31 +00:00
Oguz Ulgen	7c5e29ae71	Back out "Support `triton.language.dtype` with `torch.compile` (#121690 )" (#122108 ) Summary: Some hard to deal with package import/export related problems. Lets revert and start with clean slate. Test Plan: CI Differential Revision: D55024877 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122108 Approved by: https://github.com/ezyang	2024-03-18 20:50:28 +00:00
Animesh Jain	c568b84794	[dynamo][guards] Move backend match to eval_frame (#121954 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/121954 Approved by: https://github.com/jansel	2024-03-17 06:52:10 +00:00
Oguz Ulgen	79ee6bbde3	Support `triton.language.dtype` with `torch.compile` (#121690 ) Putting this PR as an RFC since I have resorted to some horrible hacks in order to make this work. ``` (Pdb) p triton.language.float32 triton.language.fp32 (Pdb) p str(triton.language.float32) 'fp32' (Pdb) p repr(triton.language.float32) 'triton.language.fp32' ``` This means that we need to "rewrite" them for fx graph and inductor execution. This PR allows Mamba2 to work with `torch.compile`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/121690 Approved by: https://github.com/Skylion007	2024-03-12 23:21:46 +00:00
Jason Ansel	4f19b5f7ef	[dynamo] Remove extra guard for tensor constant attrs (#121106 ) Also deletes some unused code. Pull Request resolved: https://github.com/pytorch/pytorch/pull/121106 Approved by: https://github.com/yanboliang, https://github.com/anijain2305	2024-03-05 17:16:04 +00:00
Nikita Shulga	a3b81666b1	[Dynamo] Fix guards for code objects (#120909 ) By comparing them only by id, and raising an assert if someone calls into `EQUALS_MATCH` Which render following example compileable: ```python import torch @torch.compile() def foo(x, y): code = compile(y, "foo", "exec") exec(y) return x print(foo(torch.rand(3), "print('Hello World')")) ``` Fixes https://github.com/pytorch/pytorch/issues/120647 Pull Request resolved: https://github.com/pytorch/pytorch/pull/120909 Approved by: https://github.com/jansel	2024-03-02 02:17:17 +00:00
cpuhrsch	576c0482a5	Remove hard numpy dependency from guards.py (#119519 ) I'm not sure if this is the ideal behavior / best fix for this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/119519 Approved by: https://github.com/albanD	2024-02-29 14:37:33 +00:00
Avik Chaudhuri	5472923998	derived dim (#118729 ) With the current `Dim`-based dynamic shapes API for export, one can express that shapes of different input shapes must be equal by reusing the same `Dim`. However, non-trivial relationships between such input shapes cannot be expressed. Recently we are seeing more and more examples of code that require this additional expressibility, e.g., where a pair of shapes might differ by one, or a shape might be double another (or simply even). This PR introduces the concept of a "derived" `Dim`, i.e., a linear arithmetic expression over a `Dim`. By using a combination of `Dim`s and derived `Dim`s to specify input shapes, the desired relationships can be expressed naturally. E.g., a pair of shapes might be `dim` and `dim + 1`, or `dim` and `2dim`, or even `2dim` and `dim + 1`. We extend the current infrastructure that translates `Dim`s to deprecated `dynamic_dim`-based constraints to work with derived `Dim`s. As usual, we raise constraint violation errors when shape guards cannot be verified given a dynamic shapes spec; suggest fixes; and raise runtime errors when future inputs violate the spec. Importantly, some guards that used to cause forced specializations in the constraint solver because they were deemed "too complex" now do not do so, because they can now be specified as constraints. Since this was what motivated the introduction of a `disable_constraint_solver` flag to some internal APIs, we may not need that flag any more. Note that shapes of placeholders in exported programs can now contain symbolic expressions and not just symbols. Differential Revision: D53254587 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118729 Approved by: https://github.com/ezyang	2024-02-28 19:48:32 +00:00
Animesh Jain	e9a961f66a	[dynamo][refactor] Use originating_source for HASATTR (#120723 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120723 Approved by: https://github.com/jansel ghstack dependencies: #120520, #120590, #120721	2024-02-28 05:00:59 +00:00
Animesh Jain	5a53c0ff23	[dynamo][refactor] Rename LIST_LENGTH to SEQUENCE_LENGTH, separate DICT_LENGTH (#120721 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120721 Approved by: https://github.com/jansel ghstack dependencies: #120520, #120590	2024-02-28 02:19:10 +00:00
Edward Z. Yang	1a1fc1047d	Add structured trace logs (#120289 ) Overall design: https://docs.google.com/document/d/1CX_hJ0PNy9f3R1y8TJrfkSeLkvGjjjLU84BSXgS2AZ8/edit How to read the diff: * Most files are me augmenting pre-existing logging with structured variants. For the most part it's simple (esp FX graphs, which have a canonical string representation); it gets more complicated when I decided to JSON-ify some data structure instead of keeping the ad hoc printing (notably, guards and dynamo output graph sizes) * torch/_functorch/_aot_autograd/collect_metadata_analysis.py is some unrelated fixes I noticed while auditing artifact logs * torch/_logging/_internal.py has the actual trace log implementation. The trace logger is implement as a logger named torch.__trace which is disconnected from the logging hierarchy. It gets its own handler and formatter (TorchLogsFormatter with _is_trace True). `trace_structured` is the main way to emit a trace log. Unusually, there's a separate "metadata" and "payload" field. The metadata field should not be too long (as it is serialized as a single line) and is always JSON (we put contextual things like compile id in it); the payload field can be long and is emitted after the metadata log line and can span multiple lines. * torch/_logging/structured.py contains some helpers for converting Python data structures into JSON form. Notably, we have a string interning implementation here, which helps reduce the cost of serializing filenames into the log. * test/dynamo/test_structured_trace.py the tests are cribbed from test_logging.py, but all rewritten to use expect tests on munged versions of what we'd actually output. Payloads are never tested, since they tend not be very stable. https://github.com/ezyang/tlparse is a POC Rust program that can interpret these logs. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/120289 Approved by: https://github.com/Skylion007 ghstack dependencies: #120712	2024-02-28 01:01:41 +00:00
PyTorch MergeBot	f3dd2a544c	Revert "Add structured trace logs (#120289 )" This reverts commit `9dfaef962c`. Reverted https://github.com/pytorch/pytorch/pull/120289 on behalf of https://github.com/kit1980 due to breaking internal builds, see D54230697 ([comment](https://github.com/pytorch/pytorch/pull/120289#issuecomment-1967477120))	2024-02-27 19:49:05 +00:00
Animesh Jain	8a59f49da2	[dynamo][compile-time] Collect guard debug stack info only with logs enabled (#120520 ) Reduces backend=eager compile time from 33 to 19 seconds for `MobileBertForQuestionAnswering`. This also helps an internal model where guards.add function is taking 124 seconds. Pull Request resolved: https://github.com/pytorch/pytorch/pull/120520 Approved by: https://github.com/mlazos	2024-02-27 01:51:16 +00:00
Edward Z. Yang	9dfaef962c	Add structured trace logs (#120289 ) Overall design: https://docs.google.com/document/d/1CX_hJ0PNy9f3R1y8TJrfkSeLkvGjjjLU84BSXgS2AZ8/edit How to read the diff: * Most files are me augmenting pre-existing logging with structured variants. For the most part it's simple (esp FX graphs, which have a canonical string representation); it gets more complicated when I decided to JSON-ify some data structure instead of keeping the ad hoc printing (notably, guards and dynamo output graph sizes) * torch/_functorch/_aot_autograd/collect_metadata_analysis.py is some unrelated fixes I noticed while auditing artifact logs * torch/_logging/_internal.py has the actual trace log implementation. The trace logger is implement as a logger named torch.__trace which is disconnected from the logging hierarchy. It gets its own handler and formatter (TorchLogsFormatter with _is_trace True). There's a teensy bit of FB specific code to automatically enable trace logging if a /logs directory exists. `trace_structured` is the main way to emit a trace log. Unusually, there's a separate "metadata" and "payload" field. The metadata field should not be too long (as it is serialized as a single line) and is always JSON (we put contextual things like compile id in it); the payload field can be long and is emitted after the metadata log line and can span multiple lines. * torch/_logging/structured.py contains some helpers for converting Python data structures into JSON form. Notably, we have a string interning implementation here, which helps reduce the cost of serializing filenames into the log. * test/dynamo/test_structured_trace.py the tests are cribbed from test_logging.py, but all rewritten to use expect tests on munged versions of what we'd actually output. Payloads are never tested, since they tend not be very stable. https://github.com/ezyang/tlparse is a POC Rust program that can interpret these logs. Testing that the fbcode detection works at https://www.internalfb.com/mlhub/pipelines/runs/fblearner/534553450 (Meta-only) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/120289 Approved by: https://github.com/Skylion007	2024-02-27 00:04:23 +00:00
Edward Z. Yang	fd3cf88f27	Rewrite docs about why we guard on dynamic dims (#120566 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/120566 Approved by: https://github.com/desertfire	2024-02-26 18:58:30 +00:00
Animesh Jain	c18623b7ed	[dynamo] Reland 120147 - - Use EQUALS_MATCH guard for mod.training (#120578 ) To fix Memory leak discovered in https://github.com/pytorch/pytorch/issues/112090 Pull Request resolved: https://github.com/pytorch/pytorch/pull/120578 Approved by: https://github.com/jansel	2024-02-26 03:49:47 +00:00
Animesh Jain	834c7a1d3e	[dynamo][refactor] Move some helper functions to global scope (#120426 ) This is to prepare for guard C++ refactor work. Pull Request resolved: https://github.com/pytorch/pytorch/pull/120426 Approved by: https://github.com/ezyang	2024-02-25 04:38:20 +00:00
PyTorch MergeBot	722afe6171	Revert "[dynamo] Use EQUALS_MATCH guard for mod.training (#120147 )" This reverts commit `b642a18e80`. Reverted https://github.com/pytorch/pytorch/pull/120147 on behalf of https://github.com/williamwen42 due to memory leak, see https://github.com/pytorch/pytorch/issues/112090 ([comment](https://github.com/pytorch/pytorch/pull/120147#issuecomment-1960522018))	2024-02-22 23:46:55 +00:00
Animesh Jain	b642a18e80	[dynamo] Use EQUALS_MATCH guard for mod.training (#120147 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120147 Approved by: https://github.com/jansel ghstack dependencies: #120132, #120140, #120145	2024-02-18 00:31:36 +00:00
Animesh Jain	0b11b0edd6	[dynamo][refactor] Use existing helper functions for CLOSURE_MATCH (#120145 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120145 Approved by: https://github.com/jansel, https://github.com/Fidget-Spinner ghstack dependencies: #120132, #120140	2024-02-18 00:31:36 +00:00
Animesh Jain	757fc663a8	[dynamo][refactor] Use TYPE_MATCH instead of manually constructing guard (#120140 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120140 Approved by: https://github.com/jansel, https://github.com/yanboliang ghstack dependencies: #120132	2024-02-17 16:03:36 +00:00

1 2 3 4 5 ...

341 Commits