pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Xuehai Pan	e7eeee473c	[BE][Easy][14/19] enforce style for empty lines in import segments in `torch/_[a-c]/` and `torch/_[e-h]/` and `torch/_[j-z]*/` (#129765 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129765 Approved by: https://github.com/ezyang	2024-07-31 10:42:50 +00:00
PyTorch MergeBot	b9912f31ef	Revert "[export] fix zero arg export in training_ir (#130990 )" This reverts commit `50436d5bdb`. Reverted https://github.com/pytorch/pytorch/pull/130990 on behalf of https://github.com/clee2000 due to failing some executorch and torchrec tests internally D60006710 ([comment](https://github.com/pytorch/pytorch/pull/130990#issuecomment-2243395316))	2024-07-22 16:49:25 +00:00
Yidi Wu	50436d5bdb	[export] fix zero arg export in training_ir (#130990 ) Fixed TrainingIRToRunDecomp failures for test_tensor_attribute_zero_args and also a few re-tracability failures because run_decomposition does a retracing. edit: also remove the eliminate_dead_code() in _unlift because of one onnx test failure: a constant tensor attr was lifted as constant_tensor input but it's not used in the graph after aot_autograd due to a short cut in its decomposition. This causes the setattr to be removed by eliminate_dead_code but the graph signature still contains the name of that buffer, which causes an inconsitency between the transformed graph and ep's original signature after _unlift. And it seems that this has happened a few times where some nodes are accidentally removed and we're in an inconsistent state. The alternative of removing it would be: every time we call elimiate_dead_code, we verify the consistency of the graph with 1. the graph before transformation and 2. all the meta datas but i think this deserves a complete design. Pull Request resolved: https://github.com/pytorch/pytorch/pull/130990 Approved by: https://github.com/pianpwk	2024-07-20 02:35:13 +00:00
Pian Pawakapan	745324e487	[export] turn on hybrid symints by default (#130775 ) Sets `prefer_deferred_runtime_asserts_over_guards=True` for export, so any guards emitted from `SymNode.expect_true` (for example, guards that are implicitly required to be true for an op to succeed) won't lead to constraint violations. Instead these should appear in the graph as runtime asserts, or potentially as replacement expressions for placeholder shapes. For example, this reshape op should emit s0 * s1 = s2, deferred as a runtime assert. ``` x = torch.randn(4, 8) # [s0, s1] y = torch.randn(32) # [s2] out = x.reshape(-1) + y # this emits Eq(s0 * s1, s2), and we represent y's shape as [s0s1] in the graph. ``` However, other complex guards can still cause export to fail, for instance guards emitted from `SymNode.guard_bool/guard_size_oblivious` (e.g. explicit if-else conditions in user code or lower-level op implementations hit during tracing) can still raise constraint violations. These can be deferred with `allow_complex_guards_as_runtime_asserts=True`. We don't yet make this default, because while this makes export more likely to succeed, it results in non-trivial asserts being emitted that often represent specialization to a variant of the op, or checks related to 0/1 specialization. We also remove forced specializations for export and kill the `_disable_forced_specializations` flag - now any guard we can't express with Dims/DerivedDims either are handled with Hybrid SymInts, or should be resolved with rewriting or deferring. Follow up: Currently, `ShapeEnv._set_replacement()` is called for complex equality expressions (e.g. s2 -> s0s1 in the example above), and the ExportedProgram stores `s0*s1` in the input placeholder. This isn't checked for validity when the program is run, so an option is to avoid replacement and/or runtime assert on equality. Pull Request resolved: https://github.com/pytorch/pytorch/pull/130775 Approved by: https://github.com/avikchaudhuri	2024-07-18 17:40:58 +00:00
Yidi Wu	cb4bec311a	Fix nodes has more than one output users after replace_set_grad_with_hop pass (#129716 ) Summary: Previously, when we inline the subgraphs that doesn't have a different require_grad environment, we didn't clean up the nodes's users in subgraph and direcly used them to to replace the output of the call_modules. This records dead depencies in node.users. This PR fixes this. Test Plan: Added a new test. Also see the torchrec tests: Step 1: buck run mode/dev-nosan //aimp/experimental/pt2:pt2_export -- --model-entity-id 934687114 --output /tmp/934687114.zip --use-torchrec-eager-mp --use-manifold Step 2: buck run mode/opt -c python.package_style=inplace -c fbcode.enable_gpu_sections=true aimp/cli:cli -- --platform=aps --template=disagg_gpu_aps_pt2 --pt2 --model-entity-id=934687114 non-request-only-tagging torchrec-shard-and-quantize gpu-disagg-split assign-device materialize-weights script-and-save Differential Revision: D59132214 Pull Request resolved: https://github.com/pytorch/pytorch/pull/129716 Approved by: https://github.com/angelayi	2024-07-09 17:04:03 +00:00
Yidi Wu	dd00f5e78d	Fixes T192448049 (#129146 ) Differential Revision: D58767610 Pull Request resolved: https://github.com/pytorch/pytorch/pull/129146 Approved by: https://github.com/angelayi	2024-06-25 17:50:15 +00:00
Aaron Orenstein	ea614fb2b1	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 ) See #127836 for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127839 Approved by: https://github.com/oulgen	2024-06-08 18:23:08 +00:00
Jiashen Cao	254783ce80	[Fix]: populate input parameter name when convert TorchScript to ExportedProgram (#126787 ) ## Goal As title ## Design Based on the fact that each TorchScript module has a `code` property which provides the original source code for the `forward` function, I implemented a function to extrapolate `forward` function signature by using the AST parser. Some other tradeoff * Directly parsing src code as string --> will be very buggy * Directly using `compile` function in Python to get the function object --> raises a lot of exceptions because of missing packages or undefined variable names Pull Request resolved: https://github.com/pytorch/pytorch/pull/126787 Approved by: https://github.com/angelayi, https://github.com/tugsbayasgalan	2024-05-28 17:33:44 +00:00
Aaron Gokaslan	3cb16ebf08	[BE]: Update ruff to 0.4.5 (#126979 ) Update ruff to 0.4.5 and addresses some false negatives that have been found in the newer version. Pull Request resolved: https://github.com/pytorch/pytorch/pull/126979 Approved by: https://github.com/ezyang	2024-05-24 18:38:35 +00:00
Matthew Hoffman	81277baa0c	Remove removed ruff rule TRY200 (#126256 ) My TOML linter is complaining that "TRY200" is not acceptable for the `tool.ruff.lint` schema. From the ruff docs: https://docs.astral.sh/ruff/rules/reraise-no-cause/ > This rule has been removed and its documentation is only available for historical reasons. > > This rule is identical to [B904](https://docs.astral.sh/ruff/rules/raise-without-from-inside-except/) which should be used instead. and we are currently explicitly ignoring B904. Pull Request resolved: https://github.com/pytorch/pytorch/pull/126256 Approved by: https://github.com/Skylion007	2024-05-17 16:31:05 +00:00
Pian Pawakapan	f4b2d50fd7	[export] disable_forced_specializations (#124949 ) Summary: By default, some inferred dynamic shapes guards/constraints that are not expressible with the current dynamic shapes language will lead to specialization to the concrete input values provided. If disable_forced_specializations is set to True, we will not specialize, and will not perform runtime checks on such produced guards. Instead, we allow the user to specify arbitrary shapes, and fail during runtime if the inputs are invalid. Constraints expressible with the language (e.g. ranges, linear derived dims) will still be enforced, and behavior for all other guards remains the same. Cases where we typically specialize are reshapes: ``` x: [4, 6] # [s0, s1] x = x.reshape([x.shape[0] - 1, -1]) # this emits a guard Mod(s0s1, s0-1) = 0, we specialize on s0=4, s1=6 x: [4, 6], y: [24] # [s0, s1], [s2] x = x.reshape([-1]) + y # this emits a guard s0s1 = s2, we specialize on s0=4, s1=6, s2=24 ``` For now only applicable for non-strict mode (need to figure out how to pass this flag into dynamo's call of produce_guards). Test Plan: Added test case that checks compilation, runtime, and suggested fixes behavior. Differential Revision: D56361177 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124949 Approved by: https://github.com/avikchaudhuri	2024-05-08 18:42:39 +00:00
Pian Pawakapan	90d1720861	[export] Restore original placeholder names (part 3: constant input de/serialization) (#123590 ) Summary: note: breaking the original diff D55225818 into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Stacked PR to restore original names to placeholder nodes, replacing the default names arg0_1, arg1_1, ... This PR supports constant argument placeholder (e.g. forward(self, x, y=1)) names and de/serialization, by adding a name field for ConstantArguments in the graph signature, and ConstantInputSpec in the input specs for serialization. Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55506949 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123590 Approved by: https://github.com/angelayi, https://github.com/zhxchen17	2024-04-15 19:09:41 +00:00
Pian Pawakapan	d0ccf599cc	[export] Restore original placeholder names (part 2: higher-order-op subgraph naming) (#123587 ) Summary: note: breaking the original diff [D55225818](https://www.internalfb.com/diff/D55225818) into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Stacked PR to restore original names to placeholder nodes, replacing the default names arg0_1, arg1_1, ... This PR propagates node names to higher-order-op subgraph placeholders, retaining the top-level names and handling naming collisions by suffixing other non-placeholder nodes in the subgraph with an index. This is the same handling as in fx.Graph/fx.Node, but implemented separately as a pass. Since the input schemas of HOO subgraphs are very different, they are enumerated in _name_hoo_subgraph_placeholders(). Currently cond, map_impl, and wrap_with_set_grad_enabled are handled, but other ops can be easily added. Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55456749 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123587 Approved by: https://github.com/angelayi	2024-04-11 22:40:46 +00:00
Angela Yi	b287dbbc24	[export] Fix naming if state dict contains colons (#123601 ) Test Plan: buck2 run mode/opt //aps_models/pyper/ads:train\[inplace\] +training.ir_serializer=on_disk https://www.internalfb.com/intern/everpaste/?handle=GICWmAB0g_Z1StMCAMxuhJI6U9pHbsIXAAAz Reviewed By: tugsbayasgalan Differential Revision: D55894742 Pull Request resolved: https://github.com/pytorch/pytorch/pull/123601 Approved by: https://github.com/pianpwk	2024-04-09 21:25:08 +00:00
Pian Pawakapan	d7f23f6826	[export] Restore original placeholder names (part 1: top-level renaming) (#122904 ) Summary: This PR restores original names to placeholder nodes, replacing the default names arg0_1, arg1_1, and so on. User inputs now follow the signature of mod.forward(), for example forward(x, y) produces nodes x, y. If the tensors are nested in dictionaries, lists, tuples, or dataclasses, the names are a concatenation of the path to the tensor, e.g. x = {'a': torch.randn(4), 'b': [torch.randn(4), torch.randn(4)]} produces nodes x_a, x_b_0, x_b_1. Parameters, buffers, constants, and custom objects follow the FQN of the object, prefixed by "p", "b", "c", and "obj" respectively. For example, self.bar.l0.weight gets you p_bar_l0_weight. Effect tokens are named token_1, token_2, and so on, since they are not grounded in model inputs or named attributes. note: breaking the original diff into 3 parts (top-level renaming, higher-order-op subgraphs, constant input de/serialization) because of its size. Examples: ```python # params, buffers, constants, inputs, torch.cond ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, p_l0_weight: "f32[4, 4]", p_l0_bias: "f32[4]", c_alpha: "f32[4]", b_beta: "f32[4]", x_0_a: "f32[4, 4]", y: "f32[4, 4]"): # No stacktrace found for following nodes mul: "f32[4, 4]" = torch.ops.aten.mul.Tensor(x_0_a, x_0_a) t: "f32[4, 4]" = torch.ops.aten.t.default(p_l0_weight); p_l0_weight = None addmm: "f32[4, 4]" = torch.ops.aten.addmm.default(p_l0_bias, y, t); p_l0_bias = y = t = None return addmm # model code class Bar(torch.nn.Module): def forward(self, x): return x * x class Foo(torch.nn.Module): def __init__(self): super().__init__() self.bar = Bar() self.l0 = torch.nn.Linear(4, 4) self.alpha = torch.randn(4) self.register_buffer('beta', torch.randn(4)) def forward(self, x, y): x = x[0]['a'] mul = self.bar(x) z1 = self.l0(y) return z1 # custom objects, dataclasses, tokens, constant inputs ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, token_1: "f32[0]", obj_attr, data_x: "f32[4, 4]", data_y: "f32[4, 4]", mode): # No stacktrace found for following nodes mul: "f32[4, 4]" = torch.ops.aten.mul.Scalar(data_x, 30); data_x = None div: "f32[4, 4]" = torch.ops.aten.div.Tensor_mode(data_y, 1.0, rounding_mode = 'floor'); data_y = None add: "f32[4, 4]" = torch.ops.aten.add.Tensor(mul, div); mul = div = None with_effects = torch._higher_order_ops.effects.with_effects(token_1, torch.ops._TorchScriptTesting.takes_foo.default, obj_attr, add); token_1 = obj_attr = add = None getitem: "f32[0]" = with_effects[0] getitem_1: "f32[4, 4]" = with_effects[1]; with_effects = None return (getitem, getitem_1) # model code class Foo(torch.nn.Module): def __init__(self): super().__init__() self.attr = torch.classes._TorchScriptTesting._Foo(10, 20) def forward(self, data, a=1.0, mode="floor"): x = self.attr.add_tensor(data.x) + torch.div(data.y, a, rounding_mode=mode) x = torch.ops._TorchScriptTesting.takes_foo(self.attr, x) return x dataclass class DataClass: x: Tensor y: Tensor register_dataclass_as_pytree_node( DataClass, serialized_type_name="test.DataClass" ) args = (DataClass(x=torch.randn(4, 4), y=torch.randn(4, 4)), ) kwargs = {'mode': 'floor'} ep = torch.export.export(Foo(), args, kwargs, strict=False) ``` Test Plan: verification checks on placeholder names for all export() calls, unit test in test/export/test_export.py Differential Revision: D55456418 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122904 Approved by: https://github.com/angelayi, https://github.com/thiagocrepaldi	2024-04-05 18:56:00 +00:00
Avik Chaudhuri	b3f24b57fb	fix accidental specialization with faketensor input checks (#121460 ) Summary: When fake tensors are passed to a graph module and we do runtime assertions on them, we can accidentally trigger specialization guards. It's better to just relax the checking for these. Test Plan: confirmed that problem in T181400371 is now fixed Differential Revision: D54658960 Pull Request resolved: https://github.com/pytorch/pytorch/pull/121460 Approved by: https://github.com/angelayi	2024-03-08 08:02:37 +00:00
Avik Chaudhuri	5472923998	derived dim (#118729 ) With the current `Dim`-based dynamic shapes API for export, one can express that shapes of different input shapes must be equal by reusing the same `Dim`. However, non-trivial relationships between such input shapes cannot be expressed. Recently we are seeing more and more examples of code that require this additional expressibility, e.g., where a pair of shapes might differ by one, or a shape might be double another (or simply even). This PR introduces the concept of a "derived" `Dim`, i.e., a linear arithmetic expression over a `Dim`. By using a combination of `Dim`s and derived `Dim`s to specify input shapes, the desired relationships can be expressed naturally. E.g., a pair of shapes might be `dim` and `dim + 1`, or `dim` and `2dim`, or even `2dim` and `dim + 1`. We extend the current infrastructure that translates `Dim`s to deprecated `dynamic_dim`-based constraints to work with derived `Dim`s. As usual, we raise constraint violation errors when shape guards cannot be verified given a dynamic shapes spec; suggest fixes; and raise runtime errors when future inputs violate the spec. Importantly, some guards that used to cause forced specializations in the constraint solver because they were deemed "too complex" now do not do so, because they can now be specified as constraints. Since this was what motivated the introduction of a `disable_constraint_solver` flag to some internal APIs, we may not need that flag any more. Note that shapes of placeholders in exported programs can now contain symbolic expressions and not just symbols. Differential Revision: D53254587 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118729 Approved by: https://github.com/ezyang	2024-02-28 19:48:32 +00:00
Max Ren	b2a318d856	[PyTorch][ExportedProgram] add 'is_lifted_tensor_constant' and 'get_lifted_tensor_constant' utils (#120546 ) as title Differential Revision: [D54149274](https://our.internmc.facebook.com/intern/diff/D54149274/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/120546 Approved by: https://github.com/kirklandsign	2024-02-27 07:16:55 +00:00
angelayi	cbbc309cae	[pytree][reland] Require pytree serialized_type_name (#120636 ) Relanding https://github.com/pytorch/pytorch/pull/119718 as the diff which prevents breakages of torchrec [D53857843](https://www.internalfb.com/diff/D53857843) has landed Pull Request resolved: https://github.com/pytorch/pytorch/pull/120636 Approved by: https://github.com/avikchaudhuri	2024-02-27 06:53:33 +00:00
ydwu4	8d81e61fb6	[export] make node_inline_ also inline the get_item calls (#119913 ) As titled. Before the PR, after we split then inline_, there will be getitem calls in the graph while the original graph module doesn't have them. This PR removes the additional get_item calls by inlining. Test Plan: Added new test cases for graphs that return multiple outputs and takes multiple inputs Pull Request resolved: https://github.com/pytorch/pytorch/pull/119913 Approved by: https://github.com/tugsbayasgalan ghstack dependencies: #119732, #119736, #119810	2024-02-17 02:18:27 +00:00
ydwu4	4769e6916a	[export] add node_inline_ to prepare replacing set_grad_enabled with hop (#119736 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/119736 Approved by: https://github.com/tugsbayasgalan ghstack dependencies: #119732	2024-02-17 02:18:11 +00:00
ydwu4	068659ddc2	[export] add sequential_split to prepare replacing set_grad_enabled with hop (#119732 ) This pr is the 1/N pr of transforming the global state mutating ops such as torch._C.set_grad_enabled calls in pre-dispatch graph into a higher order op so that the graph becomes more functional. We make use of split_module to help us do the transformation. This pr preserves the node.name in original module by adding a new kwarg `keep_original_node_name` to split_module. For a graph looks like this: ```python def forward(self, arg_0): arg0_1, = fx_pytree.tree_flatten_spec(([arg_0], {}), self._in_spec) add = torch.ops.aten.add.Tensor(arg0_1, 1); arg0_1 = None sin = torch.ops.aten.sin.default(add); add = None sum_1 = torch.ops.aten.sum.default(sin); sin = None _set_grad_enabled = torch._C._set_grad_enabled(False) add_1 = torch.ops.aten.add.Tensor(sum_1, 1); sum_1 = None _set_grad_enabled_1 = torch._C._set_grad_enabled(True) sub = torch.ops.aten.sub.Tensor(add_1, 1) return pytree.tree_unflatten((add_1, sub), self._out_spec) ``` Before the change, split graph returns the following graphs and subgraphs (notice the change from `add` -> `add_tensor`, `sin` -> `sin_default`: ```python def forward(self, arg_0): arg0_1, = fx_pytree.tree_flatten_spec(([arg_0], {}), self._in_spec) submod_0 = self.submod_0(arg0_1); arg0_1 = None submod_1 = self.submod_1(submod_0); submod_0 = None submod_2 = self.submod_2(submod_1) return pytree.tree_unflatten((submod_1, submod_2), self._out_spec) # submod_0 def forward(self, arg0_1): add_tensor = torch.ops.aten.add.Tensor(arg0_1, 1); arg0_1 = None sin_default = torch.ops.aten.sin.default(add_tensor); add_tensor = None sum_default = torch.ops.aten.sum.default(sin_default); sin_default = None return sum_default # submod_1 def forward(self, sum_1): _set_grad_enabled = torch._C._set_grad_enabled(False) add_tensor = torch.ops.aten.add.Tensor(sum_1, 1); sum_1 = None return add_tensor # submod_2 def forward(self, add_1): _set_grad_enabled = torch._C._set_grad_enabled(True) sub_tensor = torch.ops.aten.sub.Tensor(add_1, 1); add_1 = None return sub_tensor """) ``` After the change, the test produce the following graph, all the node names in original graph module are preserved in sub_modules. ```python def forward(self, arg_0): sub, = fx_pytree.tree_flatten_spec(([arg_0], {}), self._in_spec) submod_0 = self.submod_0(sub); sub = None submod_1 = self.submod_1(submod_0); submod_0 = None submod_2 = self.submod_2(submod_1) return pytree.tree_unflatten((submod_1, submod_2), self._out_spec) # submod_0 def forward(self, arg0_1): add = torch.ops.aten.add.Tensor(arg0_1, 1); arg0_1 = None sin = torch.ops.aten.sin.default(add); add = None sum_1 = torch.ops.aten.sum.default(sin); sin = None return sum_1 # submod_1 def forward(self, sum_1): _set_grad_enabled = torch._C._set_grad_enabled(False) add_1 = torch.ops.aten.add.Tensor(sum_1, 1); sum_1 = None return add_1 # submod_2 def forward(self, add_1): _set_grad_enabled_1 = torch._C._set_grad_enabled(True) sub = torch.ops.aten.sub.Tensor(add_1, 1); add_1 = None return sub ``` Note that currently, we call split_module on the graph after pre-dispatch aot. The difference is even larger if we `split_module` the graph module produced by dynamo, where all the original variables names in user program are preserved after dynamo but lost after `split_module` without this change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/119732 Approved by: https://github.com/tugsbayasgalan	2024-02-17 02:18:04 +00:00
Wilson Hong	3f4dd9bfa4	Back out "[pytree] Require serialized_type_name" (#120041 ) Summary: D53785493 breaks apf.rec.ir.tests.ir_export_deserialize_test.IRExportDeserializeTest: test_export_deserialize_ebc failed: https://www.internalfb.com/sandcastle/workflow/3436246515685789584 Test Plan: buck2 test mode/opt apf/rec/ir/tests:ir_export_deserialize_test Differential Revision: D53834881 Co-authored-by: Wilson Hong <wilsonhong@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/120041 Approved by: https://github.com/ydwu4	2024-02-16 10:02:25 +00:00
angelayi	b4c7afe101	[pytree] Require serialized_type_name (#119718 ) Differential Revision: [D53785493](https://our.internmc.facebook.com/intern/diff/D53785493) Pull Request resolved: https://github.com/pytorch/pytorch/pull/119718 Approved by: https://github.com/suo	2024-02-15 20:32:44 +00:00
Angela Yi	0827510fd3	[export] Remove torch._export.export (#119095 ) XLA changes: https://github.com/pytorch/xla/pull/6486 Test Plan: CI Differential Revision: D53316196 Pull Request resolved: https://github.com/pytorch/pytorch/pull/119095 Approved by: https://github.com/ydwu4, https://github.com/zhxchen17, https://github.com/tugsbayasgalan, https://github.com/avikchaudhuri, https://github.com/jerryzh168	2024-02-08 21:22:04 +00:00
Michael Suo	bf4e171539	[export] support non-persistent buffers (#118969 ) Summary: X-link: https://github.com/pytorch/executorch/pull/1817 Basic support for non-persistent buffers, which are buffers that do not show up in the state dict. One weird twist is that most of our other systems (FX, aot_export, dynamo) have completely buggy handling of non-persistent buffers. I tried to go on a wild goose chase to fix them all, but it got to be too much. So I introduced some sad rewrite passes in `_export` make the final state dict correctly align with the original module's state dict. This exposed some bugs/ambiguous handling of parameters/buffers in existing test code. For example, `TestSaveLoad.test_save_buffer` traced over a module that was not in the root module hierarchy and caused some weird behavior. I think we should error explicitly on use cases like this: https://github.com/pytorch/pytorch/issues/118410. For now I just rewrote the tests or skipped them. As a side effect, this diff tightened up quite a few sloppy behaviors around state dict handling: - Tensor attributes were getting promoted to be buffers—bad! - Tracing through a module not in the children of the root module would add its parameters/buffers to the state dict—bad! This behavior is unlikely to show up in user code since the model would be totally broken, but did show up in a bunch of tests. #buildmore Test Plan: unit tests sandcastle Differential Revision: D53340041 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118969 Approved by: https://github.com/guangy10, https://github.com/huydhn, https://github.com/titaiwangms	2024-02-02 19:16:08 +00:00
Aaron Gokaslan	1562dae62c	[BE]: Apply RUF025 dict.fromkeys preview rule (#118637 ) Simplifies and optimizes dict construction using the `fromkeys` classmethod ctor. This also makes it really obvious when all the keys will have the same static value, which could be a bug if unintentional. It is also significantly faster than using a dict comprehension. The rule is in preview, but I am adding a forward fix for when it becomes stable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118637 Approved by: https://github.com/albanD	2024-01-30 20:46:54 +00:00
suo	4ee8aa6028	[export] adopt KeyPath API in nonstrict mode (#118609 ) This PR rewrites two paths to use the newly-added keypaths API in pytree: First: we were hand-rolling a tree_map during fakification because we wanted to track sources. This PR uses keypaths instead, which can do the same thing without needing custom code. Second: our constraint error formatting was referencing placeholder names in error messages. These placeholder names are not otherwise user-visible, so they are super confusing to users (e.g. "which input does arg1_3 correspond to?"). This diff uses the `keystr` API to format the error message. This necessitated some small refactors—generating the keystr is expensive so doing it in an f-string was very bad. It can also be further improved—we can inspect the signature so that instead of `*args[0]` we can give people the actual argument name, which would be the ideal UX. But leaving that for later. Differential Revision: [D53139358](https://our.internmc.facebook.com/intern/diff/D53139358/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/118609 Approved by: https://github.com/zhxchen17 ghstack dependencies: #118607, #118608	2024-01-30 19:14:11 +00:00
Angela Yi	413a434846	[export] Convert all export tests to .module() (#118425 ) Test Plan: CI Differential Revision: D53075379 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118425 Approved by: https://github.com/suo	2024-01-29 23:06:54 +00:00
Angela Yi	5c56822be2	[export] Various fixes to .module() (#118272 ) Summary: While turning on .module() for all the export tests, I uncovered some bugs with .module() and while fixing them I ended up rewriting some of the code... Some of the bugs were: * bad kwargs support on the unlifted module * no support for user input mutations * (at the commit hash i was working off of) no support for custom objects * there were no tests on unlifting weights from cond/map submodules Test Plan: CI Differential Revision: D53075380 Pull Request resolved: https://github.com/pytorch/pytorch/pull/118272 Approved by: https://github.com/suo	2024-01-26 21:05:07 +00:00
Angela Yi	7dac2f9f2d	[export][ez] Fix getting meta["val"] (#117313 ) Summary: For integer inputs, they do not have a meta["val"]. Test Plan: `buck run @//mode/dev-nosan //executorch/examples/portable/scripts:export -- -m emformer_predict` passes the export step Differential Revision: D52716419 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117313 Approved by: https://github.com/kirklandsign, https://github.com/tugsbayasgalan	2024-01-12 06:17:38 +00:00
Angela Yi	8e2d63cbc3	[export][reland] Remove runtime assertion pass (#115597 ) Summary: Reland of https://github.com/pytorch/pytorch/pull/115196 D52054112 to fix internal failures. Test Plan: CI Differential Revision: D52054110 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115597 Approved by: https://github.com/ydwu4, https://github.com/zhxchen17	2023-12-15 03:22:03 +00:00
PyTorch MergeBot	4186932bac	Revert "[export] Remove runtime assertion pass (#115196 )" This reverts commit `c163b3c035`. Reverted https://github.com/pytorch/pytorch/pull/115196 on behalf of https://github.com/atalman due to Broke internal test ([comment](https://github.com/pytorch/pytorch/pull/115196#issuecomment-1847778344))	2023-12-08 20:07:04 +00:00
angelayi	c163b3c035	[export] Remove runtime assertion pass (#115196 ) Reland of https://github.com/pytorch/pytorch/pull/111949/ Pull Request resolved: https://github.com/pytorch/pytorch/pull/115196 Approved by: https://github.com/avikchaudhuri	2023-12-07 01:44:11 +00:00
Xuehai Pan	2a3d8e50fb	[pytree] test aligned API signature for C++ and Python pytree (#112485 ) Add tests to ensure the C++ and Python pytree provide the same APIs with identical signatures. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112485 Approved by: https://github.com/zou3519	2023-11-30 17:50:06 +00:00
Xuehai Pan	89a1fe6966	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Changes: 1. Add `_private_register_pytree_node` API in both C++ and Python pytree. In C++ pytree, the API will only register pytree node for C++ pytree. In Python pytree, the API will only register pytree node for Python pytree. 2. Do not allow registering a type as pytree node twice in the Python pytree. 3. Add thread lock to the Python pytree node register API. 4. The old `_register_pytree_node` API will call the `_private_register_pytree_node` API and raise a deprecation warning. 5. Add a new `register_pytree_node` API to register node type in both C++ and Python implementations. 6. Add tests to ensure a warning will be raised when the old private function is called. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-28 11:41:38 +00:00
PyTorch MergeBot	01366efcc9	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `4e4a6ad6ec`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1824099658))	2023-11-23 09:59:32 +00:00
Xuehai Pan	4e4a6ad6ec	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Changes: 1. Add `_private_register_pytree_node` API in both C++ and Python pytree. In C++ pytree, the API will only register pytree node for C++ pytree. In Python pytree, the API will only register pytree node for Python pytree. 2. Do not allow registering a type as pytree node twice in the Python pytree. 3. Add thread lock to the Python pytree node register API. 4. The old `_register_pytree_node` API will call the `_private_register_pytree_node` API and raise a deprecation warning. 5. Add a new `register_pytree_node` API to register node type in both C++ and Python implementations. 6. Add tests to ensure a warning will be raised when the old private function is called. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-21 19:53:13 +00:00
Tugsbayasgalan Manlaibaatar	a7b75f586a	[RELAND] Disallow skipping dynamo (#110222 ) Previous discussion: https://github.com/pytorch/pytorch/pull/109476 In this PR, I made following additions to the original PR: 1) Unlifted graph module now runs the runtime assertions in its' forward call. 2) When we retrace, we make sure we run the assertions to make sure user is tracing the module with correct inputs with respect to the assumptions we made during first tracing. The way I do is that I create new graph module type with modified call method. And the runtime assertions happen under torchdynamo.disable so that it is just run in eager directly. The reason is we don't this to be traced part of the graph. 3) Both ep.module and capture_pre_autograd now returns _UnliftedGraphModule. Differential Revision: [D51078056](https://our.internmc.facebook.com/intern/diff/D51078056) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110222 Approved by: https://github.com/zhxchen17	2023-11-14 16:02:01 +00:00
PyTorch MergeBot	2a271a3efa	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `a0d00349ed`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/PaliC due to _private_register_pytree_node now checks for duplicate registering, unfortunately, this breaks composability with torchrec internally :( ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1806130993))	2023-11-10 17:24:40 +00:00
Xuehai Pan	a0d00349ed	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-10 02:41:30 +00:00
Xuehai Pan	5e2adc8650	[pytree] align function signature between C++ and Python pytree (#112482 ) Change the argument name in C++ and Python pytree APIs. Also add a test to ensure the function signatures are the same in the two implementations. - #112485 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112482 Approved by: https://github.com/zou3519	2023-11-10 02:37:48 +00:00
PyTorch MergeBot	66150b29e3	Revert "[pytree] align function signature between C++ and Python pytree (#112482 )" This reverts commit `4893a2814f`. Reverted https://github.com/pytorch/pytorch/pull/112482 on behalf of https://github.com/PaliC due to changing _register_pytree_node's signature is bc breaking, please revert the signature and reland ([comment](https://github.com/pytorch/pytorch/pull/112482#issuecomment-1804909926))	2023-11-10 00:59:23 +00:00
PyTorch MergeBot	9a90989121	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `95f52611c7`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/PaliC due to in the bottom diff in the stack changing _register_pytree_node's signature is bc breaking, please revert the signature and reland ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1804892924))	2023-11-10 00:38:28 +00:00
Xuehai Pan	95f52611c7	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-08 05:02:03 +00:00
Xuehai Pan	4893a2814f	[pytree] align function signature between C++ and Python pytree (#112482 ) Change the argument name in C++ and Python pytree APIs. Also add a test to ensure the function signatures are the same in the two implementations. - #112485 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112482 Approved by: https://github.com/zou3519	2023-11-07 01:26:41 +00:00
angelayi	ff35e1e45b	[pytree] Add custom treespec fqn field (#112428 ) Custom classes that are serialized with pytree are serialized by default with `f”{class.__module__}.{class.__name__}”`. This is a dependency from our serialized program directly into the outer Python environment. If a user moves the class to a different directory, the serialized program will be unable to be loaded. So, we will require users to pass in an FQN if they want to serialize their custom treespec type. Differential Revision: [D50886366](https://our.internmc.facebook.com/intern/diff/D50886366) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112428 Approved by: https://github.com/suo	2023-11-02 00:26:41 +00:00
Xuehai Pan	a7a0955790	[pytree][BE] reorganize imports and format code style and update type hints (#112268 ) Reland PR: - #112109 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112268 Approved by: https://github.com/Skylion007	2023-10-28 16:30:24 +00:00
angelayi	a432f37e49	Serialize pytree to json string (#106116 ) Fixes https://github.com/pytorch/pytorch/pull/102577#issuecomment-1650905536 Serializing to json is more stable, and renamed the API: ``` # Takes in a treespec and returns the serialized treespec as a string. Also optionally takes in a protocol version number. def treespec_dumps(treespec: TreeSpec, protocol: Optional[int] = None) -> str: # Takes in a serialized treespec and outputs a TreeSpec def treespec_loads(data: str) -> TreeSpec: ``` If users want to register their own serialization format for a given pytree, they can go through the `_register_treespec_serializer` API which optionally takes in a `getstate` and `setstate` function. ``` _register_treespec_serializer(type_, *, getstate, setstate) # Takes in the context, and outputs a json-dumpable context def getstate(context: Context) -> DumpableContext: # Takes in a json-dumpable context, and reconstructs the original context def setstate(dumpable_context: DumpableContext) -> Context: ``` We will serialize to the following dataclass, and then json.dump this it to string. ``` class TreeSpec type: Optional[str] # a string name of the type. null for the case of a LeafSpec context: Optional[Any] # optional, a json dumpable format of the context children_specs: List[TreeSpec], } ``` If no getstate/setstate function is registered, we will by default serialize the context using `json.dumps/loads`. We will also serialize the type through `f"{typ.__module__}.{typ.__name__}"`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106116 Approved by: https://github.com/zou3519	2023-08-27 14:34:49 +00:00
Chen Lai	4f2ff1d019	add get buffer from exported program (#107809 ) Summary: We have the util function to get params, for parity we also need util function to get buffer` Test Plan: ``` buck test //caffe2/test:test_export ``` Differential Revision: D48610877 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107809 Approved by: https://github.com/JacobSzwejbka	2023-08-25 05:46:04 +00:00

1 2

52 Commits