pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Yidi Wu	92f125e622	[export] improve error message for deserializing custom triton op (#152029 ) In https://github.com/pytorch/pytorch/issues/151746, users ran into an error where a custom triton op cannot be resolved into an operator from string target. We improve the error message by reminding users to register the same custom operator at de-serialization time. Now the error looks like this: ```python torch._export.serde.serialize.SerializeError: We failed to resolve torch.ops.triton_kernel.add.default to an operator. If it's a custom op/custom triton op, this is usally because the custom op is not registered when deserializing. Please import the custom op to register it before deserializing. Otherwise, please file an issue on github. Unsupported target type for node Node(target='torch.ops.triton_kernel.add.default', inputs=[NamedArgument(name='x', arg=Argument(as_tensor=TensorArgument(name='linear')), kind=1), NamedArgument(name='y', arg=Argument(as_tensor=TensorArgument(name='mul')), kind=1)], outputs=[Argument(as_tensor=TensorArgument(name='add'))], metadata={'stack_trace': 'File "/data/users/yidi/pytorch/test.py", line 50, in forward\n output = triton_add(dense_output, bias)', 'nn_module_stack': 'L__self__,,__main__.SimpleModel', 'torch_fn': 'add.default_1;OpOverload.add.default'}, is_hop_single_tensor_return=None): <class 'str'>.``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/152029 Approved by: https://github.com/jingsh	2025-04-24 20:22:05 +00:00
Pian Pawakapan	fd3d339e17	[dynamic shapes] be less aggressive with runtime assert CSE for bounds (#151590 ) Fixes #150540 Fixes #147772 Stops trying to CSE bound expressions, only does exact deduplication for runtime asserts. Adds the test cases to check that AOTAutograd doesn't data-dependent error out when retracing due to not seeing the asserts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/151590 Approved by: https://github.com/laithsakka	2025-04-23 23:07:00 +00:00
angelayi	6a1b820255	[export] Enable symint inputs for AdditionalInputs and ShapesCollection (#151842 ) With `AdditionalInputs`, the behavior is the same as with tensors: ```python class M(torch.nn.Module): def forward(self, x, y): return x + y additional_inputs = torch.export.AdditionalInputs() additional_inputs.add((5, 5)) additional_inputs.add((3, 5)) additional_inputs.add((5, 4)) ep = torch.export.export( M(), (6, 7), dynamic_shapes=additional_inputs, strict=False ) ``` With `ShapesCollection`, we now need to wrap integer inputs as `_IntWrapper` so that we can have a unique identifier for each integer input. ```python class M(torch.nn.Module): def forward(self, x, y): return x + y from torch.export.dynamic_shapes import _IntWrapper args = (_IntWrapper(5), _IntWrapper(5)) # Or we can do `args = pytree.tree_map_only(int, lambda a: _IntWrapper(a), orig_args)` shapes_collection = torch.export.ShapesCollection() shapes_collection[args[0]] = Dim.DYNAMIC shapes_collection[args[1]] = Dim.DYNAMIC ep = torch.export.export( M(), args, dynamic_shapes=shapes_collection, strict=False ) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/151842 Approved by: https://github.com/pianpwk	2025-04-22 22:29:18 +00:00
Tugsbayasgalan Manlaibaatar	2c27597d6a	Infra for handling builtin ops (min, max, math.pow) (#151348 ) Reapply of https://github.com/pytorch/pytorch/pull/150003 Differential Revision: [D73050801](https://our.internmc.facebook.com/intern/diff/D73050801/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/151348 Approved by: https://github.com/zhxchen17 ghstack dependencies: #151347	2025-04-22 17:20:09 +00:00
Pian Pawakapan	cd1317f92f	[export] suggest dynamic re-export in input constraints hook (#151624 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/151624 Approved by: https://github.com/angelayi	2025-04-21 22:29:46 +00:00
angelayi	d5dda82586	[export] Integrate meta kernel generation with draft-export (#150809 ) If a custom operator does not contain a fake impl, currently draft-export will use the real-tensor propagation to get an output for the operator and continue tracing. However if we retrace the exported model using `ep.run_decompositions`, or `export`, or run the exported program with fake tensors, we'll still fail because there's no fake impl. With this PR, after draft-export we will generate an operator profile for each operator call that we encounter, and store this on the report attached to the exported program `ep._report.op_profiles`. Users can then use `torch._library.fake_profile.register_fake_profile` to temporarily generate and register a fake impl based on these operator profiles. This way future fake tensor retracing will work. The workflow would look something like: ```python class M(torch.nn.Module): def forward(self, a, b): res = torch.ops.mylib.foo8(a, b) # no fake impl return res ep = export(M(), (torch.ones(3, 4), torch.ones(3, 4)) # this fails bc no fake impl ep = draft_export(M(), (torch.ones(3, 4), torch.ones(3, 4)) ep.run_decompositions() # this fails bc no fake impl # this registers fake impls based on the profiles with torch._library.fake_profile.register_fake_profile(ep._report.op_profiles): decomp = ep.run_decompositions() # this works new_inp = ( torch.ones(2, 3, 4), torch.ones(2, 3, 4), ) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/150809 Approved by: https://github.com/zou3519	2025-04-17 20:52:31 +00:00
Pian Pawakapan	6dddd6520d	[dynamic shapes] add sym_and, sym_or (#150456 ) This has been pretty helpful for the size-oblivious rewrite. Wanted the variadic args version to avoid `sym_or(a, sym_or(b, sym_or(c, d)))` in favor of `sym_or(a, b, c, d)`. Happy to change this to ban the 1-arg version. This is better than plain and/or because the whole symbolic expression gets preserved, and if we guard on it or defer as a runtime assert, we preserve all branches. Pull Request resolved: https://github.com/pytorch/pytorch/pull/150456 Approved by: https://github.com/laithsakka	2025-04-14 18:18:06 +00:00
Yiming Zhou	dbcd0b571d	Back out "[AOTI] Always use oss schema for ExternKernelNodes serialization" (#151026 ) Summary: Revert for FC breaking Test Plan: CI Differential Revision: D72802075 Pull Request resolved: https://github.com/pytorch/pytorch/pull/151026 Approved by: https://github.com/hl475	2025-04-10 22:36:35 +00:00
angelayi	e6969c1bd8	[export] Symint support (nonstrict, Dim.DYNAMIC) (#150198 ) Fixes https://github.com/pytorch/pytorch/issues/113682 only in the non-strict export case. Also we only support Dim.DYNAMIC/AUTO, not named-Dims Pull Request resolved: https://github.com/pytorch/pytorch/pull/150198 Approved by: https://github.com/pianpwk	2025-04-10 15:06:23 +00:00
Sherlock Huang	81f60f3880	Expand allowed_getattr_types_for_subgm to torch.Tensor (#150867 ) Summary: att regular weight has the type of torch.nn.parameter.Parameter buffer and tensor constant has the type of torch.Tensor both types are valid. Test Plan: CI Differential Revision: D72657275 Pull Request resolved: https://github.com/pytorch/pytorch/pull/150867 Approved by: https://github.com/zhxchen17	2025-04-09 11:01:45 +00:00
Yiming Zhou	89505f4498	[AOTI] Always use oss schema for ExternKernelNodes serialization (#150197 ) Summary: Added a field `protocol` to `ExternKernelNodes` and all the lowering pass will always use the oss schema to serialize external kernel nodes from now on. Test Plan: CI Differential Revision: D72020444 Pull Request resolved: https://github.com/pytorch/pytorch/pull/150197 Approved by: https://github.com/zhxchen17	2025-04-08 22:35:28 +00:00
Pian Pawakapan	f8b53f4a75	[export] raise when Dim.DYNAMIC 0/1 specializes (#150716 ) Previously we didn't catch this, mark_dynamic() just doesn't allocate a symbol for it Differential Revision: D72486930 Pull Request resolved: https://github.com/pytorch/pytorch/pull/150716 Approved by: https://github.com/angelayi	2025-04-07 18:58:42 +00:00
Yiming Zhou	a3f9e04656	[export] Make aoti_call_delegate hop traceable (#148804 ) Summary: The `aoti_call_delegate` hop now uses a stateless `original_gm` for tracing with fake tensors and the OSS AOTI Runner for running with real tensors Differential Revision: D70738393 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148804 Approved by: https://github.com/SherlockNoMad	2025-04-03 20:44:31 +00:00
angelayi	5314a6fe82	[export] Fix deserialization issue (#150515 ) An internal model was serialized in 2023, and is now breaking while loading with the following error: ``` File "<eval_with_key>.1675", line 4 def forward(self, arg1163_1, arg1164_1, , arg1166_1, , arg1168_1, arg1169_1, arg1170_1, , arg1172_1, arg1173_1, arg1174_1, arg1175_1, arg1176_1, arg1177_1, arg1178_1, arg1179_1, arg1180_1, arg1181_1, arg1182_1, arg1183_1, arg1184_1, arg1185_1, arg1186_1, arg1187_1, arg1188_1, arg1189_1, arg1190_1, arg1191_1, arg1192_1, arg1193_1, arg1194_1, arg1195_1, arg1196_1, arg1197_1, arg1198_1, arg1199_1, arg1200_1, arg1201_1, arg1202_1, arg1203_1, arg1204_1, arg1205_1, arg1206_1, arg1207_1, arg1208_1, arg1209_1, arg1210_1, arg1211_1, arg1212_1, arg1213_1, arg1214_1, arg1215_1, arg1216_1, , arg1218_1, arg1219_1, arg1220_1, arg1221_1, arg1222_1, arg1223_1, arg1224_1, , arg1226_1, arg1227_1, arg1228_1, , arg1230_1, , , , , , , , , , , , , , , ): ^ SyntaxError: invalid syntax ``` The syntax errors are due to inputs that are `None` when exporting. Prior to changes in https://github.com/pytorch/pytorch/pull/123590 (landed 4/2024), input specs for none inputs look like `InputSpec(userInput=UserInputSpec(arg=Argument(asNone=True)))`, and during deserialization when creating a node, we would just use a dummy name `arg`. After to those changes, the input specs for none inputs look like `InputSpec(constantInput=InputToConstantInputSpec(name='y', value=ConstantValue(asNone=True)))`, and when creating a node we would use the name `y` as the name. However the PR didn't handle the case if it's loading an old package which doesn't have this name, so ended up putting empty names in the placeholder nodes. This error was uncovered after https://github.com/pytorch/pytorch/pull/149717, where we now use the GraphModule's python codegen to run the UnflattenedModule instead of going through the interpreter path. The placeholder nodes having empty names caused the python codegen to fail. Pull Request resolved: https://github.com/pytorch/pytorch/pull/150515 Approved by: https://github.com/yushangdi	2025-04-03 15:27:45 +00:00
Pian Pawakapan	90ddb33141	[export] specialize for aten.to (#149235 ) Changes decomposition behavior of `aten.to` to respect the aliasing/non-aliasing behavior in eager, and to specialize to the input/conversion dtype & device. Before change: we always decompose `aten.to` into `_to_copy`, regardless of aliasing behavior. This leads us to ban mutations on the result of `_to_copy` when aliased, since we can't guarantee correct program semantics. This meant users had to explicitly call `.clone()` before mutating. In the special cases where we don’t ban mutations (e.g. dtype conversion), we add runtime assertions on the input & conversion dtype/devices in the decomposed program (see https://github.com/pytorch/pytorch/pull/142420). After change: we decompose to the aliasing/non-aliasing behavior that matches eager, allowing mutations in all cases. We also add dtype/device assertions for all `aten.to` ops, starting in the pre-dispatch graph, basically specializing the program to the dtype/devices. Differential Revision: D71229547 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149235 Approved by: https://github.com/tugsbayasgalan	2025-04-03 05:20:10 +00:00
Avik Chaudhuri	1017927c83	multidimensional slicing (#150104 ) Differential Revision: D71962884 Fixes #150057 Pull Request resolved: https://github.com/pytorch/pytorch/pull/150104 Approved by: https://github.com/angelayi	2025-04-02 20:57:16 +00:00
Xuehai Pan	a10b765bf1	[pytree] add APIs to determine a class is a namedtuple or PyStructSequence (#113257 ) Changes in this PR: 1. Add `is_structseq` and `is_structseq_class` functions to determine a object or a class is PyStructSequence. 2. Add a generic class `structseq` which can be used as the registration key for PyStructSequence types like `namedtuple` for Named Tuple types. 3. Change `is_namedtuple` to accept subclasses of namedtuple to be namedtuple. Before this PR, only namedtuple class directly created by `collections.namedtuple` or `typing.NamedTuple` were namedtuple classes while their subclasses were not. This PR makes `is_namedtuple` return true for subclasses of namedtuple class. Resolves #75982. New tests are included in this PR. - #75982 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113257 Approved by: https://github.com/zou3519	2025-04-01 10:40:43 +00:00
Tugsbayasgalan Manlaibaatar	7e7e5698cc	Suppress more warnings (#149833 ) Differential Revision: [D71702307](https://our.internmc.facebook.com/intern/diff/D71702307) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149833 Approved by: https://github.com/malfet, https://github.com/Skylion007	2025-04-01 05:33:04 +00:00
Pian Pawakapan	925fd4aa2e	[export] min/max ranges for dim hints (#149590 ) Differential Revision: D71522032 Adds min/max ranges to Dim.AUTO/DYNAMIC/STATIC, so users can do `Dim.AUTO(min=2, max=2048)`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/149590 Approved by: https://github.com/tugsbayasgalan	2025-03-31 21:32:20 +00:00
Pian Pawakapan	103bf64a3c	[export] refactor _Dim into Dim (#149891 ) Summary: forward fix T218515233 Test Plan: test_export Differential Revision: D71769231 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149891 Approved by: https://github.com/jingsh, https://github.com/angelayi	2025-03-28 06:19:03 +00:00
Avik Chaudhuri	21bcbbfb5e	fix range constraints for expr (#150103 ) During tracing it is possible for a `s1: VR[2, inf]` to be replaced by a `s0: VR[3, inf]` (note smaller range) by the shape env. But after export, unfortunately we'd previously record `range_constraints[s0] = VR[2, inf]` (note larger range), which is incorrect. This is because we'd map `s1.node.expr` (`s0`) to the `var_to_range` of `s1.node._expr` (`s1`) when creating `range_constraints`. The comment surrounding this code suggests this predated `bound_sympy`, but now we can do better. For users, this means that when using `Dim.DYNAMIC` previously they wouldn't get input constraints checked sufficiently, now they do (shifting errors early). Differential Revision: D71962694 Pull Request resolved: https://github.com/pytorch/pytorch/pull/150103 Approved by: https://github.com/zhxchen17	2025-03-27 22:11:39 +00:00
Avik Chaudhuri	09aa63ea2c	preserve custom meta in placeholders (#149661 ) Fixes #147338 Differential Revision: [D71573533](https://our.internmc.facebook.com/intern/diff/D71573533/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149661 Approved by: https://github.com/junpeiz, https://github.com/angelayi	2025-03-21 19:09:38 +00:00
Shangdi Yu	46dd226702	Fakify torchbind objects in compile_fx and add tests for SigridTransformsInstanceTorchBind (#149529 ) Summary: We need to properly fakify torchbind objects, including the ones in graph module attributes, so the resgitered fake implementation works properly. - _fakify_script_objects in `compile_fx` - Allow fake torchbind objects in `torchbind_constants` Remove `node.meta["unbacked_bindings"]` for `aot_compile` in `compile_fx`. Otherwise `ShapeProp` will fail when trying to resolve the `unbacked_bindings` of `with_effect` tokens. Update `sigrid_transforms_test` to use the latest `torch._inductor.aot_compile` API. Add a test for `Fakify torchbind objects in compile_fx and add tests for SigridTransformsInstanceTorchBind` in `e2e_test`. Test Plan: ``` buck run //caffe2/torch/fb/sparsenn:sigrid_test -- -r test_transform_torch_bind buck run //sigmoid/inference/test:e2e_test_cpu -- -r SigridTransforms buck2 run mode/dev-nosan sigmoid/inference/ts_migration:pt2i_readiness_main -- --model_id 545017754 --test_suite ads_all --mode test_preproc ``` Differential Revision: D70013257 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149529 Approved by: https://github.com/angelayi	2025-03-21 18:58:28 +00:00
Zhengxu Chen	f47aa08130	[export] Support python assertion with symints. (#149444 ) Summary: This diff ports some technique from torch.fx symbolic trace to trace through Python asserts when we run into data dependent symbolic shape assertions, so that we can achieve the same effect as torch dynamo to automatically turn assert into torch.check()s. Test Plan: buck test mode/opt caffe2/test:test_export -- -r test_python_asserts_with_sym_int Differential Revision: D71425360 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149444 Approved by: https://github.com/tugsbayasgalan	2025-03-20 23:07:45 +00:00
Zhengxu Chen	80dfce2cc3	[export] Handle non OpNamespace type during decomposition. (#149431 ) Summary: Turns out we can have non OpNamespace object in torch.ops._dir. We should just throw away those during iteration. Test Plan: eyes Differential Revision: D71417992 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149431 Approved by: https://github.com/tugsbayasgalan	2025-03-20 15:36:15 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	9a184b1074	Monkeypatch fake mode so it errors on invalid custom ops (#149410 ) Internal version: [D71294776](https://www.internalfb.com/diff/D71294776) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149410 Approved by: https://github.com/gmagogsfm	2025-03-20 04:50:57 +00:00
Pian Pawakapan	96828a2155	[export] refactor DimHints for type errors (#149424 ) Differential Revision: D71414367 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149424 Approved by: https://github.com/justinchuby, https://github.com/avikchaudhuri	2025-03-19 18:51:07 +00:00
Angela Yi	ea02aac2ca	[export] Update remove runtime asserts pass (#149198 ) Test Plan: CI -- Removing asserts should be a noop Differential Revision: D69566851 Pull Request resolved: https://github.com/pytorch/pytorch/pull/149198 Approved by: https://github.com/pianpwk	2025-03-18 23:07:25 +00:00
Aaron Gokaslan	a0ac63cbd9	[BE]: Apply ruff PERF403 to use dict comprehensions more often (#149257 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/149257 Approved by: https://github.com/jansel	2025-03-18 00:46:07 +00:00
PyTorch MergeBot	24cfeec2c7	Revert "[BE]: Apply ruff PERF403 to use dict comprehensions more often (#149257 )" This reverts commit `bfee141666`. Reverted https://github.com/pytorch/pytorch/pull/149257 on behalf of https://github.com/malfet due to Let's see if it helps restore compiler benchmark sanity, see `8bc7bd94a5/1` ([comment](https://github.com/pytorch/pytorch/pull/149257#issuecomment-2731133812))	2025-03-17 22:57:00 +00:00
Aaron Gokaslan	bfee141666	[BE]: Apply ruff PERF403 to use dict comprehensions more often (#149257 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/149257 Approved by: https://github.com/jansel	2025-03-16 23:52:58 +00:00
Tugsbayasgalan Manlaibaatar	6b1b95ad2a	Support subclass constructor capturing in export (#147014 ) Notable TODOs: 1. Need to implement AutogradHOP to get rid of subclasses before serializing 2. Need to implement mechanism to figure out what subclasses will be used in export when they are not expressed in the inputs Differential Revision: [D69640673](https://our.internmc.facebook.com/intern/diff/D69640673) Pull Request resolved: https://github.com/pytorch/pytorch/pull/147014 Approved by: https://github.com/bdhirsh	2025-03-16 18:19:19 +00:00
PyTorch MergeBot	f9b4856989	Revert "[pytree] add APIs to determine a class is a namedtuple or PyStructSequence (#113257 )" This reverts commit `c95a6b416b`. Reverted https://github.com/pytorch/pytorch/pull/113257 on behalf of https://github.com/ZainRizvi due to Sorry but this is breaking internally. @zou3519 can you please help land this internally? See the sigmoid tests in D71198793 for details. To validate the fixes internally, you can follow the instructions here: https://fburl.com/fixing-ghfirst-reverts ([comment](https://github.com/pytorch/pytorch/pull/113257#issuecomment-2725982539))	2025-03-14 23:13:34 +00:00
Xuehai Pan	c95a6b416b	[pytree] add APIs to determine a class is a namedtuple or PyStructSequence (#113257 ) Changes in this PR: 1. Add `is_structseq` and `is_structseq_class` functions to determine a object or a class is PyStructSequence. 2. Add a generic class `structseq` which can be used as the registration key for PyStructSequence types like `namedtuple` for Named Tuple types. 3. Change `is_namedtuple` to accept subclasses of namedtuple to be namedtuple. Before this PR, only namedtuple class directly created by `collections.namedtuple` or `typing.NamedTuple` were namedtuple classes while their subclasses were not. This PR makes `is_namedtuple` return true for subclasses of namedtuple class. Resolves #75982. New tests are included in this PR. - #75982 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113257 Approved by: https://github.com/zou3519	2025-03-14 08:50:30 +00:00
Yiming Zhou	15cd6921a5	[export] Fix tensor_constant and buffer naming conflicts in TS converter (#148803 ) Summary: In TS converter, tensor constants are traced as BUFFER and later we will convert them back to CONSTANT_TENSOR. So we need to prevent naming conflicts during lift constant pass. Test Plan: CI Differential Revision: D70826426 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148803 Approved by: https://github.com/angelayi	2025-03-14 00:38:12 +00:00
Shangdi Yu	cf19efd3d9	Support basic TorchBind in aot_compile and aoti_compile_and_package (#148506 ) Summary: Codegen - Skip some codegen parts for torchbind (such as arg decleration) because they are loaded in proxy executor, so we do not need to declare torchbind args in cpp code - Added a helper method to get the schema of CallTorchBind HOP. The returned schema is only the schema of `obj.method()`. Serialization Add support for torchbind object in serialization - For CallTorchBind HOP, we need to handle it specially because of it's schema. The output serialized args is in the format of `(obj, method, args, kwargs)`. - it.TorchBindObject inputs are serialized to `as_custom_obj` Argument. Packaging* Add torchbind objects file and `custom_objs_config.json` file to generated files output of `aot_compile`. The json file is stored in the `data/aotinductor/<model_name>` folder in pt2 archive. The torchbind objects are stored in data/constants/ folder in pt2 archive. The format of torchbind objects are `f"{CUSTOM_OBJ_FILENAME_PREFIX}{custom_obj_idx}"`. e.g. `custom_obj_0`. CustomClassHolder objects implement their own pickle methods. Note that this `custom_objs_config.json` file is different from the `model_constants_config.json` file produced in package_sigmoid(). The keys in `custom_objs_config` directly correspond to the arg name in extern nodes json. The key in `model_constants_config.json` produced by `package_sigmoid` is the attribute name in the user mode code. This is required for both internal and OSS torchbind support. For OSS torchbind support, we also need to package torchbind_constants into the .pt2 output. Work Left We still need to add torchbind support in ProxyExecutor for inductor.aoti_load_package to work. See other diffs in the stack. Test Plan: ``` buck run fbcode//mode/dev-nosan //caffe2/test/inductor:torchbind -- -r schema buck run fbcode//mode/dev-nosan //caffe2/test/inductor:torchbind -- -r aot_compile ``` Differential Revision: D69490718 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148506 Approved by: https://github.com/angelayi	2025-03-11 20:55:18 +00:00
cyy	295f2ed4d1	Fix "invalid application of 'sizeof' to an incomplete type" (#148854 ) Fixes with C++23 and constexpr std::unique_ptr Pull Request resolved: https://github.com/pytorch/pytorch/pull/148854 Approved by: https://github.com/Skylion007	2025-03-11 00:40:00 +00:00
PyTorch MergeBot	ebd087e4b5	Revert "[pytree] add APIs to determine a class is a namedtuple or PyStructSequence (#113257 )" This reverts commit `f08146b67b`. Reverted https://github.com/pytorch/pytorch/pull/113257 on behalf of https://github.com/jovianjaison due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/113257#issuecomment-2711299830))	2025-03-10 17:19:21 +00:00
Avik Chaudhuri	6cf360be04	fix lost input mutations with export_tracepoint (#148709 ) Preserving module call signatures in the presence of input mutation cause incorrect results. The root cause turned out to be that export tracepoints would unwrap / wrap functional args that would lose mutation info on those args. Differential Revision: [D70734821](https://our.internmc.facebook.com/intern/diff/D70734821/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/148709 Approved by: https://github.com/angelayi	2025-03-07 09:36:18 +00:00
Xuehai Pan	f08146b67b	[pytree] add APIs to determine a class is a namedtuple or PyStructSequence (#113257 ) Changes in this PR: 1. Add `is_structseq` and `is_structseq_class` functions to determine a object or a class is PyStructSequence. 2. Add a generic class `structseq` which can be used as the registration key for PyStructSequence types like `namedtuple` for Named Tuple types. 3. Change `is_namedtuple` to accept subclasses of namedtuple to be namedtuple. Before this PR, only namedtuple class directly created by `collections.namedtuple` or `typing.NamedTuple` were namedtuple classes while their subclasses were not. This PR makes `is_namedtuple` return true for subclasses of namedtuple class. Resolves #75982. New tests are included in this PR. - #75982 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113257 Approved by: https://github.com/zou3519	2025-03-06 18:59:02 +00:00
Pian Pawakapan	c677f3251f	[export] don't use unbacked_renamings in export (#147574 ) Plan: avoid the use of unbacked renamings, and introduce a pass run in `_produce_aten_artifact` that recomputes unbacked bindings. Decided to do this because in we don't serialize unbacked renamings (or any ShapeEnv state), so this used to compose poorly with de/serialization. This hopefully establishes the invariant that the unbacked binding keys are always in sync with the example values (i.e. same indices, and removed if the symbol is replaced / specialized). For de/serialization, we don't stored unbacked bindings, and just rerun the pass. Involved a refactor of compute_unbacked_bindings. Pull Request resolved: https://github.com/pytorch/pytorch/pull/147574 Approved by: https://github.com/avikchaudhuri	2025-03-04 21:43:49 +00:00
Zain Rizvi	f30776c37a	[BE] Upgrade to mypy 1.14 (#145966 ) Upgrade mypy version Pull Request resolved: https://github.com/pytorch/pytorch/pull/145966 Approved by: https://github.com/Skylion007	2025-03-04 20:58:26 +00:00
Zhengxu Chen	915b9c80ab	[export] Sync aoti schema to schema.py (#148017 ) Summary: Synchronizing internal AOTI schema to OSS schema.py Test Plan: CI Differential Revision: D70271151 Pull Request resolved: https://github.com/pytorch/pytorch/pull/148017 Approved by: https://github.com/yiming0416	2025-02-27 21:46:11 +00:00
Avik Chaudhuri	8d921eb97f	export method (#147573 ) The `export` API takes a `nn.Module` and traces its `forward` method. However sometimes it is useful to export different methods of a `nn.Module`, either as a one-off for debugging or as a set of methods that are called in some sequence outside `export` (e.g., `encode` / `decode`). When multiple methods of the same module instance are exported, they should share the same of the common module instance. This PR adds a couple of utils in `torch._export.utils` for this workflow. The `wrap_method` util wraps a method as a `nn.Module` that can then be exported. See included test. We recommend using the same module instance to export multiple methods on that instance, in which case they are guaranteed to share state. On serde, this state sharing is lost, so we provide another util, `sync_state`, to re-sync the state. These utils are meant to be eventually replaced by API-level changes, but for now this can unblock users who need this workflow. In particular, in the future we can accept one or multiple method entrypoints, with their own args / kwargs / dynamic shape specifications, which can create a variant of `ExportedProgram` with multiple graphs that share state; then we can automatically ensure that the state sharing is preserved through serde. Differential Revision: [D69960801](https://our.internmc.facebook.com/intern/diff/D69960801/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/147573 Approved by: https://github.com/tugsbayasgalan	2025-02-25 20:58:54 +00:00
Tugsbayasgalan Manlaibaatar	6a6de0e09d	better error message (#147532 ) Differential Revision: [D69939736](https://our.internmc.facebook.com/intern/diff/D69939736) Pull Request resolved: https://github.com/pytorch/pytorch/pull/147532 Approved by: https://github.com/avikchaudhuri, https://github.com/zou3519	2025-02-21 17:08:47 +00:00
Aaron Orenstein	db4ce78d46	PEP585: More UP006 fixes (#146392 ) This should be the final PR before we can enable RUFF UP006. Pull Request resolved: https://github.com/pytorch/pytorch/pull/146392 Approved by: https://github.com/justinchuby, https://github.com/albanD, https://github.com/Skylion007	2025-02-20 06:18:13 +00:00
Yidi Wu	77aa602871	[torchbind] Differentiate ScriptModule and ScriptObject with qualified name (#147399 ) Summary: This pr add a _is_script_object method to differentiate scriptModule and scriptObject, where the formal inherits from ScriptObject in C++ so they both passes the isinstance(obj, torch.ScriptObject) check. The qualified name of ScriptObject (i.e. custom class) would starts with "__torch__.torch.classes", this has been a widely used assumption for dealing with custom class across our code base. Test Plan: Add new test. Differential Revision: D69685316 Pull Request resolved: https://github.com/pytorch/pytorch/pull/147399 Approved by: https://github.com/yushangdi	2025-02-20 04:57:57 +00:00
Avik Chaudhuri	24738768a8	more dist ops in non strict (#147417 ) Summary: Previously we added support for `all_reduce` to non strict. This PR extends this support to other non-functional collectives that are remapped in Dynamo: `all_gather`, `all_gather_into_tensor`, `all_to_all_single`, `reduce_scatter_tensor`. Test Plan: added unit tests Differential Revision: D69813991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/147417 Approved by: https://github.com/angelayi	2025-02-19 21:29:16 +00:00
angelayi	0c8028e877	[export] Loosen symint input serialization (#147237 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/147237 Approved by: https://github.com/avikchaudhuri	2025-02-18 13:03:47 +00:00
Chen Lai	708428704e	patch for block-wise quantization + pt2e (#146946 ) Summary: https://github.com/pytorch/pytorch/pull/144492 was reverted due to duplicate kernel registration. This PR will re-introduce the patch Differential Revision: D69488779 Pull Request resolved: https://github.com/pytorch/pytorch/pull/146946 Approved by: https://github.com/jerryzh168, https://github.com/andrewor14	2025-02-18 01:15:26 +00:00

1 2 3 4 5 ...

898 Commits