pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Edward Z. Yang	f19e07b056	Memoize local_scalar_dense calls, refactor all memos (#125623 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/125623 Approved by: https://github.com/eellison	2024-05-11 21:12:35 +00:00
PyTorch MergeBot	c6e5d0d2e6	Revert "Memoize local_scalar_dense calls, refactor all memos (#125623 )" This reverts commit `fcbf2b61e6`. Reverted https://github.com/pytorch/pytorch/pull/125623 on behalf of https://github.com/malfet due to Broke ROCM, see https://github.com/pytorch/pytorch/actions/runs/9026074378/job/24804583041 ([comment](https://github.com/pytorch/pytorch/pull/125623#issuecomment-2105444091))	2024-05-11 01:58:39 +00:00
Tugsbayasgalan Manlaibaatar	d7fe3c4123	[RELAND] Switch default behavoir of export IR to be predispatch (#125860 ) This PR switches export IR from aot-dispatch to pre-dispatch IR. What is pre-dispatch IR and why should you care? Currently the default IR returned by torch.export can contain only functional ATen operators after ALL pytorch dispatcher decompositions (for example, CompositeImplicitAutograd) run. In contrast, pre-dispatch IR refers to an IR that can contain all functional ATen operators (i.e., not just from the core subset), before any decomposition happens, as well as operators that manipulate autograd state. Pre-dispatch IR closely resembles eager PyTorch computation, but is still functional and serializable by torch.export. As a result: You can train the pre-dispatch IR in eager mode as the IR contains necessary information for the autograd engine to automatically generate a backward graph. You can write sound graph transformations more easily as the IR is functional. Since it is an ATen IR, it is still normalized. For example, torch.add has multiple overloads, but aten.add.Tensor is unique in this IR. If you want to get the core aten IR out of torch.export, you will need to: ``` ep = torch.export.export(M(), inputs) ep_for_core_aten = ep.run_decompositions() ``` Differential Revision: [D57172986](https://our.internmc.facebook.com/intern/diff/D57172986) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125860 Approved by: https://github.com/zhxchen17	2024-05-10 17:36:53 +00:00
Edward Z. Yang	fcbf2b61e6	Memoize local_scalar_dense calls, refactor all memos (#125623 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/125623 Approved by: https://github.com/eellison	2024-05-10 01:52:55 +00:00
Pian Pawakapan	c9a258e474	[export] handle constant aliasing for export (#125509 ) Summary: Currently export will [error out](`2b5ae2611e/torch/export/_trace.py (L477)`) if a constant is aliased. This PR supports this by modifying ConstantAttrMap to map constants to a list of FQNs instead of a single FQN, populating the ExportedProgram constants dict to contain multiple entries to the same constant. Test Plan: added test case in test_export.py Differential Revision: D56955654 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125509 Approved by: https://github.com/angelayi, https://github.com/ydwu4	2024-05-10 00:14:37 +00:00
angelayi	13545fe68a	[export] Don't create a new fake mode if dynamo tracing (#125185 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/125185 Approved by: https://github.com/mikekgfb	2024-05-09 23:43:08 +00:00
Tugsbayasgalan Manlaibaatar	0e419b9146	Fix graph partitioner and make runtime assertion work with submodules in export (#125793 ) Summary: This fix does three things: 1. When we add inputs from partioner to the top level graph module, we insert in the order of partioner which is not guaranteed to be same as original graph inputs. This PR fixes that. 2. When we replace autograd ops with HOP, we create new submodules and access their outputs via getitem calls. As a result, previous node names associated with getitem gets updated, resulting in the graph being different from produced graph signature. So I just update the graph signature accordingly. 3. We run runtime_assertion pass before autograd HOP pass because the constraints won't be populated correctly. Differential Revision: [D57130314](https://our.internmc.facebook.com/intern/diff/D57130314) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125793 Approved by: https://github.com/zhxchen17	2024-05-09 18:13:46 +00:00
Avik Chaudhuri	1b8891a31d	make torch._check understand Eq commutativity (#125629 ) Summary: Given `torch._check(a == b)` we can still get a data-dependent error needing `b == a`. Simple fix. ``` def forward(self, x1, x2, x3, y): z1 = x1.item() z2 = x2.item() z3 = x3.item() torch._check((z2 + z3) == z1) # torch._check(z1 == (z2 + z3)) didn't work, now does if z2 + z3 == z1: return y * 2 else: return y + 3 ``` Differential Revision: D57014730 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125629 Approved by: https://github.com/ezyang	2024-05-08 21:39:21 +00:00
Pian Pawakapan	f4b2d50fd7	[export] disable_forced_specializations (#124949 ) Summary: By default, some inferred dynamic shapes guards/constraints that are not expressible with the current dynamic shapes language will lead to specialization to the concrete input values provided. If disable_forced_specializations is set to True, we will not specialize, and will not perform runtime checks on such produced guards. Instead, we allow the user to specify arbitrary shapes, and fail during runtime if the inputs are invalid. Constraints expressible with the language (e.g. ranges, linear derived dims) will still be enforced, and behavior for all other guards remains the same. Cases where we typically specialize are reshapes: ``` x: [4, 6] # [s0, s1] x = x.reshape([x.shape[0] - 1, -1]) # this emits a guard Mod(s0s1, s0-1) = 0, we specialize on s0=4, s1=6 x: [4, 6], y: [24] # [s0, s1], [s2] x = x.reshape([-1]) + y # this emits a guard s0s1 = s2, we specialize on s0=4, s1=6, s2=24 ``` For now only applicable for non-strict mode (need to figure out how to pass this flag into dynamo's call of produce_guards). Test Plan: Added test case that checks compilation, runtime, and suggested fixes behavior. Differential Revision: D56361177 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124949 Approved by: https://github.com/avikchaudhuri	2024-05-08 18:42:39 +00:00
angelayi	8be4c1bc2f	[export] Add metadata for nodes insert_deferred_runtime_asserts (#125414 ) Fixes [internal error](https://fb.workplace.com/groups/1075192433118967/permalink/1416709435633930/). The issue is that the asserting nodes added in the `insert_deferred_runtime_assertion` pass do not contain metadata that the ExportedProgram requires the graph to have. One solution to fix this is to retrace the entire module, or another solution is to manually add back this metadata. This diff implements the latter solution (manually add back the metadata) through hooking into fx.graph's `create_node` function, and adding export-specific metadata for every node that is created. The reason I did this is so that the `insert_deferred_runtime_assertion` does not have to know about what metadata export wants. Pull Request resolved: https://github.com/pytorch/pytorch/pull/125414 Approved by: https://github.com/zhxchen17, https://github.com/BoyuanFeng	2024-05-07 23:15:21 +00:00
Angela Yi	4332fc4095	[export] Allow constant attr mutation (#125424 ) Test Plan: CI Differential Revision: D56893728 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125424 Approved by: https://github.com/pianpwk	2024-05-07 00:34:57 +00:00
Pian Pawakapan	3827810453	[export] suggest constant dim values in dynamic shapes fixes (#125458 ) [https://www.internalfb.com/diff/D54924742](https://github.com/pytorch/pytorch/pull/121860) allowed specifying integer values for static dims in dynamic shapes. This changes suggested fixes to suggest the actual value instead of the current "None". Test Plan: existing export tests cover this Differential Revision: D56921142 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125458 Approved by: https://github.com/avikchaudhuri	2024-05-06 17:44:19 +00:00
Pian Pawakapan	ef757a5c00	[export] use tree_map for _flatten_dynamic_shapes (#125415 ) Summary: Fixing the implementation of `_flatten_dynamic_shapes()`, to follow how `_process_dynamic_shapes()` does it. The previous implementation would misinterpret some nested dynamic shapes specs, causing it to miss out on some shapes specs, for example with nested inputs/constant input tuples: ``` inputs = ( (2, 1), ( torch.randn(2, 1), torch.randn(2, 2), torch.randn(2, 3), ) ) dynamic_shapes = ( (None, None), ( None, None, None, ) ) ``` This would get interpreted as 2 shapes specs for 2d and 3d tensors. Fixing so this doesn't happen. Test Plan: Existing export tests Differential Revision: D56894923 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125415 Approved by: https://github.com/angelayi	2024-05-03 04:59:17 +00:00
Avik Chaudhuri	746da8755c	switch tests from constrain_as* to torch._check* (#125253 ) To fix data-dependent errors we want to recommend that people use `torch._check` APIs. The `constrain_as` APIs should be fully subsumed by them, and in the future we should kill them entirely. Differential Revision: D56774333 Pull Request resolved: https://github.com/pytorch/pytorch/pull/125253 Approved by: https://github.com/ezyang	2024-05-01 21:01:27 +00:00
Avik Chaudhuri	e7846447e0	dynamic shapes builder API (#124898 ) This PR introduces a new way of building `dynamic_shapes` for export. The idea is to build up a mapping from input tensors to the dynamic shapes that should be assigned to their corresponding fake tensors. This mapping is automatically converted to the current form of `dynamic_shapes`, which must exactly match the structure of inputs. We do this by using pytree utils. With the current `dynamic_shapes`, we had to be careful about user-defined classes that are registered with pytree, since such classes are not necessarily polymorphic containers; they may be fine containing tensors, but not dynamic shapes. Thus we had decided to allow input instances of such classes to be associated with dynamic shapes in flattened form. This decision needs to be mirrored in this PR as well. To make it easier to keep these code paths in sync, we refactor the current recursive procedure for associating inputs with dynamic shapes to use the same pytree utils. This needs minor fixes to a few tests where `dynamic_shapes` were not exactly matching the structure of inputs. Differential Revision: D56551992 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124898 Approved by: https://github.com/zhxchen17	2024-04-30 03:59:49 +00:00
Pian Pawakapan	946e202c07	[export] Restore user input names to unlifted graph modules (#124765 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/122842 Currently, calling ep.module() on an ExportedProgram leads to a GraphModule with a default forward signature (e.g. arg_0, arg_1, ...). This leads to original placeholder names disappearing for retracing/re-exporting. Fixing this issue by creating a forward_arg_names field (will take renaming suggestions for this), that stores the positional & keyword arg names that are used. These names aren't present in the call_spec currently stored, and requires a major version bump for the ExportedProgram schema. Test Plan: Tests exist for export, but names are now changed from generic (e.g. arg_0, arg_1) to follow user inputs (e.g. x, y) Differential Revision: D56484994 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124765 Approved by: https://github.com/zhxchen17	2024-04-29 20:58:17 +00:00
Zhengxu Chen	7bb89bcaa4	[export] Fix state dict reparametrization in non-strict. (#124847 ) Summary: There are multiple things implemented incorrectly in non strict for reparametrizing state dict: 1. The same fake tensor should be generated for duplicated weights. 2. We should snapshot state dict in the beginning to always hold the invariant that ep.state_dict == mod.state_dict() 3. We will overwrite real weights with fake weights if we don't restore the weights in LIFO ordering. 4. We don't turn on strict checking which could sliently fail on corner cases. This diff aims to solve all these issues at once. Test Plan: CI Differential Revision: D56505020 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124847 Approved by: https://github.com/pianpwk	2024-04-25 22:44:16 +00:00
Zhengxu Chen	02ed2992d9	[export] Capture tensor.to() under export. (#123732 ) Summary: We use to skip tensor.to() during tracing when the device is the same. This will bring some performance improvement in eager but making graph capture losing the semantics from original model. In this diff, we add an additional condition to skip the fast path when we don't have actual data inside a tensor, which is the case when we're using FakeTensor / FunctionalTensor to trace the model. This won't have perf impact on previous eager models while making sure we can capture the _to_copy() node in the graph. Test Plan: buck test mode/opt caffe2/test:test_export -- -r device_to Differential Revision: D55969674 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123732 Approved by: https://github.com/angelayi, https://github.com/tugsbayasgalan	2024-04-24 23:12:19 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	674e15ae07	Back out "Switch to predispatch" (#124860 ) Summary: Original commit changeset: 1f155b3a0bfc Original Phabricator Diff: D56273267 Test Plan: CI Differential Revision: D56526505 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124860 Approved by: https://github.com/angelayi	2024-04-24 17:28:33 +00:00
Tugsbayasgalan Manlaibaatar	c933af2709	Switch to predispatch (#123573 ) This PR switches export IR from aot-dispatch to pre-dispatch IR. What is pre-dispatch IR and why should you care? Currently the default IR returned by torch.export can contain only functional ATen operators after ALL pytorch dispatcher decompositions (for example, CompositeImplicitAutograd) run. In contrast, pre-dispatch IR refers to an IR that can contain all functional ATen operators (i.e., not just from the core subset), before any decomposition happens, as well as operators that manipulate autograd state. Pre-dispatch IR closely resembles eager PyTorch computation, but is still functional and serializable by torch.export. As a result: - You can train the pre-dispatch IR in eager mode as the IR contains necessary information for the autograd engine to automatically generate a backward graph. - You can write sound graph transformations more easily as the IR is functional. - Since it is an ATen IR, it is still normalized. For example, torch.add has multiple overloads, but aten.add.Tensor is unique in this IR. If you want to get the core aten IR out of `torch.export`, you will need to: ``` ep = torch.export.export(M(), inputs) ep_for_core_aten = ep.run_decompositions() ``` Differential Revision: [D56273267](https://our.internmc.facebook.com/intern/diff/D56273267) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123573 Approved by: https://github.com/gmagogsfm	2024-04-24 00:51:09 +00:00
Pian Pawakapan	10b9d4d19c	[export] handle Dim.lower = 0, 1 for ep.run_decompositions() (#123602 ) Summary: With pre-dispatch export and ep.run_decompositions(), range constraints are updated through looking at ShapeEnv.var_to_range. However the lower bounds on these may be incorrect - analysis on un-specialized symbols are done with lower bounds of 2, which mismatch with user-specified bounds (may be 0, 1). This updates `_get_updated_range_constraints()` to use the old range constraints if possible. Test Plan: Existing pre-dispatch/dynamic shapes test case. Differential Revision: D55899872 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123602 Approved by: https://github.com/tugsbayasgalan	2024-04-19 21:29:36 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	a8cf91c395	Fix predispatch tracing for aten::lift_fresh_copy (#124198 ) Differential Revision: D56200666 Previously, when we hit the Functionalize kernel for lift_fresh_copy, we directly dispatch self.clone() to proxy dispatch. As a result, we end up receiving a functional tensor at proxy dispatch. As a work around, I unwrap self manually. Not sure, why it works ok in aot-dispatch tho Pull Request resolved: https://github.com/pytorch/pytorch/pull/124198 Approved by: https://github.com/bdhirsh	2024-04-18 17:02:38 +00:00
Boyuan Feng	aa2da0cdd2	[Export] Add runtime assert to non-strict export (#123681 ) This PR moves insert_deferred_runtime_asserts from dynamo to torch.fx.passes and uses it to add runtime assertion for non-strict export. Differential Revision: D55944267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123681 Approved by: https://github.com/tugsbayasgalan, https://github.com/angelayi	2024-04-18 16:13:27 +00:00
Tugsbayasgalan Manlaibaatar	dd3cea3291	Fix derived dim bugs in ep.run_decomp (#123326 ) Differential Revision: [D55730289](https://our.internmc.facebook.com/intern/diff/D55730289) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123326 Approved by: https://github.com/avikchaudhuri	2024-04-17 04:00:55 +00:00
Pian Pawakapan	90d1720861	[export] Restore original placeholder names (part 3: constant input de/serialization) (#123590 ) Summary: note: breaking the original diff D55225818 into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Stacked PR to restore original names to placeholder nodes, replacing the default names arg0_1, arg1_1, ... This PR supports constant argument placeholder (e.g. forward(self, x, y=1)) names and de/serialization, by adding a name field for ConstantArguments in the graph signature, and ConstantInputSpec in the input specs for serialization. Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55506949 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123590 Approved by: https://github.com/angelayi, https://github.com/zhxchen17	2024-04-15 19:09:41 +00:00
Avik Chaudhuri	5961e23e76	primitive attribute assignment (#123898 ) This PR ensures that assignment of attributes of primitive type work without needing any code changes in non-strict mode. (In a previous PR we banned attribute assignments of tensor type unless such attributes are registered as buffers.) While strict mode errors on (all) attribute assignments, non-strict doesn't care, so one might assume that this kind of attribute assignment should already work in non-strict. However, there's a problem: we run through the program once for metadata collection and then run through it again for tracing, so the values observed during tracing (and potentially burned into the graph) do not reflect what should have been observed had the metadata collection pass not run. So the only thing this PR needs to do is restore values of assigned attributes of primitive type once the metadata collection pass has run. We do this by moving the attribute assignment detecting context manager from the overall `aot_export` call in `_trace.py` to the metadata collection pass in `aot_autograd.py`, and extending it. The rest of the PR moves some utils around. Differential Revision: D56047952 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123898 Approved by: https://github.com/angelayi	2024-04-13 05:27:52 +00:00
Thiago Crepaldi	23dbe2b517	Add test for skipping hf logging during export (#123410 ) https://github.com/pytorch/pytorch/pull/123402 already supports hf logging because HF logger is based on logging module This PR adds a test to guard this against regression, only Pull Request resolved: https://github.com/pytorch/pytorch/pull/123410 Approved by: https://github.com/BowenBao, https://github.com/malfet	2024-04-12 17:42:46 +00:00
Pian Pawakapan	07e0faf3ef	[export] set AOTAutograd ctx to enable_grad with pre-dispatch export (#123671 ) Summary: Currently, torch.export (through AOTAutograd) compiles with a torch.no_grad() wrapper, which affects the presence of `set_grad_enabled` nodes in pre-dispatch export graphs. This changes the wrapper to nullcontext (i.e. enable grad) if `pre_dispatch=True`. An example that previously failed without `with torch.no_grad()` is below: ``` class Model(torch.nn.Module): def forward(self, x, y): with torch.enable_grad(): x = x + y return x model = Model() exported_program = torch.export._trace._export( model, (torch.tensor(2), torch.tensor(3)), dynamic_shapes=None, pre_dispatch=True, strict=False ) ``` The pass would inline the add call, but then try to construct a HOO subgraph with no inputs/outputs: ``` def forward(self): _set_grad_enabled_1 = torch._C._set_grad_enabled(False) ``` Test Plan: Test case checking that nullcontext & no_grad wrappers lead to different export behaviors (regarding set grad subgraphs). Reviewed By: tugsbayasgalan Differential Revision: D55777804 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123671 Approved by: https://github.com/tugsbayasgalan	2024-04-12 16:16:23 +00:00
Pian Pawakapan	d0ccf599cc	[export] Restore original placeholder names (part 2: higher-order-op subgraph naming) (#123587 ) Summary: note: breaking the original diff [D55225818](https://www.internalfb.com/diff/D55225818) into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Stacked PR to restore original names to placeholder nodes, replacing the default names arg0_1, arg1_1, ... This PR propagates node names to higher-order-op subgraph placeholders, retaining the top-level names and handling naming collisions by suffixing other non-placeholder nodes in the subgraph with an index. This is the same handling as in fx.Graph/fx.Node, but implemented separately as a pass. Since the input schemas of HOO subgraphs are very different, they are enumerated in _name_hoo_subgraph_placeholders(). Currently cond, map_impl, and wrap_with_set_grad_enabled are handled, but other ops can be easily added. Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55456749 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123587 Approved by: https://github.com/angelayi	2024-04-11 22:40:46 +00:00
Avik Chaudhuri	10a03c56e5	fix leaky fake tensor on attribute assignment, support buffer assignment (#122337 ) In non-strict, assignment of attributes in a model causes their state to contain fake tensors post-tracing, which leads to incorrect results on running the exported model. We now error when this happens, asking the user to use buffers instead. Next, we add support for assignment of buffers. The final values of the buffers turn into outputs of the graph. Since the buffers are already lifted as inputs and populated with the initial values when the model is run, this leads to a simple programming model where the driver of the model can feed the outputs back as inputs for successive runs. Differential Revision: D55146852 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122337 Approved by: https://github.com/bdhirsh, https://github.com/tugsbayasgalan	2024-04-11 18:08:31 +00:00
Edward Z. Yang	efa36ef092	Natively support int truncation, don't guard on positive/negative (#122827 ) This doesn't entirely fix the original problem that prompted this, but it seems to just be getting stuck in export constraint formatting now which seems like progress to me. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/122827 Approved by: https://github.com/avikchaudhuri	2024-04-11 15:22:32 +00:00
PyTorch MergeBot	cf8139b956	Revert "Fix derived dim bugs in ep.run_decomp (#123326 )" This reverts commit `4322874282`. Reverted https://github.com/pytorch/pytorch/pull/123326 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/123326#issuecomment-2048389042))	2024-04-10 20:35:01 +00:00
Tugsbayasgalan Manlaibaatar	4322874282	Fix derived dim bugs in ep.run_decomp (#123326 ) Differential Revision: [D55730289](https://our.internmc.facebook.com/intern/diff/D55730289) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123326 Approved by: https://github.com/avikchaudhuri	2024-04-10 18:54:03 +00:00
FFFrog	fe4d1aff05	UFMT formatting on test/export (#123520 ) Partially addresses https://github.com/pytorch/pytorch/issues/123062 Ran lintrunner on: test/export Detail: ```Shell $ lintrunner -a --take UFMT --all-files ok No lint issues. Successfully applied all patches. ``` Co-authored-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/123520 Approved by: https://github.com/ezyang	2024-04-10 05:38:42 +00:00
PyTorch MergeBot	786c6db519	Revert "UFMT formatting on test/export (#123520 )" This reverts commit `ec7551d1b7`. Reverted https://github.com/pytorch/pytorch/pull/123520 on behalf of https://github.com/PaliC due to lint is still broken ([comment](https://github.com/pytorch/pytorch/pull/123520#issuecomment-2046223260))	2024-04-10 00:06:30 +00:00
FFFrog	ec7551d1b7	UFMT formatting on test/export (#123520 ) Partially addresses https://github.com/pytorch/pytorch/issues/123062 Ran lintrunner on: test/export Detail: ```Shell $ lintrunner -a --take UFMT --all-files ok No lint issues. Successfully applied all patches. ``` Co-authored-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/123520 Approved by: https://github.com/shink, https://github.com/ezyang	2024-04-09 23:24:13 +00:00
Angela Yi	b287dbbc24	[export] Fix naming if state dict contains colons (#123601 ) Test Plan: buck2 run mode/opt //aps_models/pyper/ads:train\[inplace\] +training.ir_serializer=on_disk https://www.internalfb.com/intern/everpaste/?handle=GICWmAB0g_Z1StMCAMxuhJI6U9pHbsIXAAAz Reviewed By: tugsbayasgalan Differential Revision: D55894742 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123601 Approved by: https://github.com/pianpwk	2024-04-09 21:25:08 +00:00
Thiago Crepaldi	1b5944358e	Ignore logging.Logger.* calls during dynamo export (#123402 ) Follow up for https://github.com/pytorch/pytorch/pull/123368 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123402 Approved by: https://github.com/williamwen42	2024-04-09 18:51:00 +00:00
PyTorch MergeBot	d04957c0c6	Revert "Ignore logging.Logger.* calls during dynamo export (#123402 )" This reverts commit `75933ff523`. Reverted https://github.com/pytorch/pytorch/pull/123402 on behalf of https://github.com/DanilBaibak due to Broken trunk ([comment](https://github.com/pytorch/pytorch/pull/123402#issuecomment-2044236088))	2024-04-09 06:28:12 +00:00
PyTorch MergeBot	b9d2b75bac	Revert "Add test for skipping hf logging during export (#123410 )" This reverts commit `ba55ef8e21`. Reverted https://github.com/pytorch/pytorch/pull/123410 on behalf of https://github.com/DanilBaibak due to Broken trunk ([comment](https://github.com/pytorch/pytorch/pull/123402#issuecomment-2044236088))	2024-04-09 06:28:12 +00:00
Pian Pawakapan	8bd6223730	[export] construct set_grad_enabled HOO subgraph inside other HOO subgraphs (#123391 ) Summary: Reference: https://github.com/pytorch/pytorch/pull/121736 Previously set_grad_enabled nodes in HOO subgraphs (e.g. cond) were inlined and not replaced with their own HOO subgraphs. This diff recursively does that. Example: ``` class Model(torch.nn.Module): def forward(self, x, y): def true_fn(x, y): with torch.enable_grad(): return x - y return torch.cond( x.sum() > 0, true_fn, lambda x, y: x + y, [x, y], ) ``` Before (printing out `ep.graph_module.true_graph_0`): ``` class <lambda>(torch.nn.Module): def forward(self, arg0_1: "i64[]", arg1_1: "i64[]"): # No stacktrace found for following nodes _set_grad_enabled = torch._C._set_grad_enabled(True) sub: "i64[]" = torch.ops.aten.sub.Tensor(arg0_1, arg1_1); arg0_1 = arg1_1 = None _set_grad_enabled_1 = torch._C._set_grad_enabled(False) return (sub,) ``` After: ``` class GraphModule(torch.nn.Module): def forward(self, arg0_1: "i64[]", arg1_1: "i64[]"): # No stacktrace found for following nodes submod_3 = self.submod_1 sub: "i64[]" = torch._higher_order_ops.wrap.wrap_with_set_grad_enabled(True, submod_3, arg0_1, arg1_1); submod_3 = arg0_1 = arg1_1 = None return (sub,) class GraphModule(torch.nn.Module): def forward(self, arg0_1: "i64[]", arg1_1: "i64[]"): # No stacktrace found for following nodes sub: "i64[]" = torch.ops.aten.sub.Tensor(arg0_1, arg1_1); arg0_1 = arg1_1 = None return sub ``` Differential Revision: D55770138 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123391 Approved by: https://github.com/tugsbayasgalan	2024-04-09 02:08:03 +00:00
Thiago Crepaldi	ba55ef8e21	Add test for skipping hf logging during export (#123410 ) https://github.com/pytorch/pytorch/pull/123402 already supports hf logging because HF logger is based on logging module This PR adds a test to guard this against regression, only Pull Request resolved: https://github.com/pytorch/pytorch/pull/123410 Approved by: https://github.com/BowenBao, https://github.com/malfet ghstack dependencies: #123402	2024-04-08 23:20:30 +00:00
Thiago Crepaldi	75933ff523	Ignore logging.Logger.* calls during dynamo export (#123402 ) Follow up for https://github.com/pytorch/pytorch/pull/123368 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123402 Approved by: https://github.com/williamwen42	2024-04-08 22:50:54 +00:00
Pian Pawakapan	d7f23f6826	[export] Restore original placeholder names (part 1: top-level renaming) (#122904 ) Summary: This PR restores original names to placeholder nodes, replacing the default names arg0_1, arg1_1, and so on. User inputs now follow the signature of mod.forward(), for example forward(x, y) produces nodes x, y. If the tensors are nested in dictionaries, lists, tuples, or dataclasses, the names are a concatenation of the path to the tensor, e.g. x = {'a': torch.randn(4), 'b': [torch.randn(4), torch.randn(4)]} produces nodes x_a, x_b_0, x_b_1. Parameters, buffers, constants, and custom objects follow the FQN of the object, prefixed by "p", "b", "c", and "obj" respectively. For example, self.bar.l0.weight gets you p_bar_l0_weight. Effect tokens are named token_1, token_2, and so on, since they are not grounded in model inputs or named attributes. note: breaking the original diff into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Examples: ```python # params, buffers, constants, inputs, torch.cond ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, p_l0_weight: "f32[4, 4]", p_l0_bias: "f32[4]", c_alpha: "f32[4]", b_beta: "f32[4]", x_0_a: "f32[4, 4]", y: "f32[4, 4]"): # No stacktrace found for following nodes mul: "f32[4, 4]" = torch.ops.aten.mul.Tensor(x_0_a, x_0_a) t: "f32[4, 4]" = torch.ops.aten.t.default(p_l0_weight); p_l0_weight = None addmm: "f32[4, 4]" = torch.ops.aten.addmm.default(p_l0_bias, y, t); p_l0_bias = y = t = None return addmm # model code class Bar(torch.nn.Module): def forward(self, x): return x * x class Foo(torch.nn.Module): def __init__(self): super().__init__() self.bar = Bar() self.l0 = torch.nn.Linear(4, 4) self.alpha = torch.randn(4) self.register_buffer('beta', torch.randn(4)) def forward(self, x, y): x = x[0]['a'] mul = self.bar(x) z1 = self.l0(y) return z1 # custom objects, dataclasses, tokens, constant inputs ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, token_1: "f32[0]", obj_attr, data_x: "f32[4, 4]", data_y: "f32[4, 4]", mode): # No stacktrace found for following nodes mul: "f32[4, 4]" = torch.ops.aten.mul.Scalar(data_x, 30); data_x = None div: "f32[4, 4]" = torch.ops.aten.div.Tensor_mode(data_y, 1.0, rounding_mode = 'floor'); data_y = None add: "f32[4, 4]" = torch.ops.aten.add.Tensor(mul, div); mul = div = None with_effects = torch._higher_order_ops.effects.with_effects(token_1, torch.ops._TorchScriptTesting.takes_foo.default, obj_attr, add); token_1 = obj_attr = add = None getitem: "f32[0]" = with_effects[0] getitem_1: "f32[4, 4]" = with_effects[1]; with_effects = None return (getitem, getitem_1) # model code class Foo(torch.nn.Module): def __init__(self): super().__init__() self.attr = torch.classes._TorchScriptTesting._Foo(10, 20) def forward(self, data, a=1.0, mode="floor"): x = self.attr.add_tensor(data.x) + torch.div(data.y, a, rounding_mode=mode) x = torch.ops._TorchScriptTesting.takes_foo(self.attr, x) return x dataclass class DataClass: x: Tensor y: Tensor register_dataclass_as_pytree_node( DataClass, serialized_type_name="test.DataClass" ) args = (DataClass(x=torch.randn(4, 4), y=torch.randn(4, 4)), ) kwargs = {'mode': 'floor'} ep = torch.export.export(Foo(), args, kwargs, strict=False) ``` Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55456418 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122904 Approved by: https://github.com/angelayi, https://github.com/thiagocrepaldi	2024-04-05 18:56:00 +00:00
Pian Pawakapan	4b1b4db231	[export] Add stack_trace for non-strict export (#121034 ) This addresses 2 issues with stack_trace metadata: - stack_trace is currently missing from nodes in non-strict export - in strict mode, stack_trace is populated for placeholder nodes, which may not be well-defined (with multiple uses) We filter the call stack during tracing for calls from forward() methods, or ops in `torch.__init__.py` (e.g. sym_size_int, sym_constrain_range, etc.) to populate stack_trace. A node-level check is also added to _export_non_strict(). Pull Request resolved: https://github.com/pytorch/pytorch/pull/121034 Approved by: https://github.com/angelayi	2024-04-04 22:35:33 +00:00
Tugsbayasgalan Manlaibaatar	1ea6d3a9b4	Fix conv decomp when running to core-aten (#123283 ) Differential Revision: [D55709374](https://our.internmc.facebook.com/intern/diff/D55709374) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123283 Approved by: https://github.com/angelayi	2024-04-04 01:14:09 +00:00
William Wen	9e0838ff27	fix typo in export/test_export.py (#123228 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123228 Approved by: https://github.com/pianpwk	2024-04-03 20:17:22 +00:00
angelayi	ed457c7dbe	[export] Add torch_fn (#122693 ) This PR adds a new metadata, `torch_fn` which is meant to replace `source_fn_stack` as `source_fn_stack` is not entirely well defined between strict/nonstrict. Previous discussion [here](https://docs.google.com/document/d/1sPmmsmh6rZFWH03QBOe49MaXrQkP8SxoG8AOMb-pFk4/edit#heading=h.anmx9qknhvm). `torch_fn` represents the torch function that a particular aten operator came from. For example, `torch.nn.Linear` goes down to the `torch.nn.functional.linear` at the `__torch_function__` layer, and then `aten.t/aten.addmm` in the `__torch_dispatch__` layer. So the nodes `aten.t/aten.addmm` will now have the `torch_fn` metadata containing the `torch.nn.functional.linear`. The `torch_fn` metadata is a tuple of 2 strings: a unique identifier for each torch function call, and the actual torch function `f"{fn.__class__}.{fn.__name__}"`. The purpose of the first value is to distinguish between 2 consecutive calls to the same function. For example, if we had 2 calls to `torch.nn.Linear`, the nodes and corresponding metadata would look something like: ``` aten.t - ("linear_1", "builtin_function_or_method.linear"), aten.addmm - ("linear_1", "builtin_function_or_method.linear"), aten.t - ("linear_2", "builtin_function_or_method.linear"), aten.addmm - ("linear_2", "builtin_function_or_method.linear"), ``` Higher order ops -- currently we can get the torch_fn metadata for nodes within the HOO's subgraph, but after retracing, this becomes the `(cond, higher_order_op.cond)` :( This is because `fx_traceback.set_current_meta` points to the cond node in the toplevel graph, rather than the original node in the subgraph. I think this is because `fx.Interpreter` does not go into the cond subgraphs. (will discuss with Yidi more ab this) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122693 Approved by: https://github.com/tugsbayasgalan	2024-03-30 06:47:15 +00:00
Tugsbayasgalan Manlaibaatar	76d8020e62	Add tests for pre_dispatch + run_decomp flow and taskify failures (#122508 ) Differential Revision: [D55448616](https://our.internmc.facebook.com/intern/diff/D55448616) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122508 Approved by: https://github.com/angelayi, https://github.com/zhxchen17	2024-03-29 01:47:07 +00:00
PyTorch MergeBot	3beb9d85a6	Revert "Add non strict inline constraints and runtime assertions to non-strict exported program (#122722 )" This reverts commit `b693fff5d7`. Reverted https://github.com/pytorch/pytorch/pull/122722 on behalf of https://github.com/BoyuanFeng due to This breaks torchrec.distributed.tests.test_pt2.TestPt2: test_kjt__getitem__ ([comment](https://github.com/pytorch/pytorch/pull/122722#issuecomment-2026078351))	2024-03-28 20:42:35 +00:00

1 2 3 4 5 ...

305 Commits