pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Thiago Crepaldi	7e941a932b	Store user model to simplify ONNXProgram.{adapt_torch_*,__call__} APIs (#115281 ) Currently (after https://github.com/pytorch/pytorch/pull/114407), the user has must pass the original user ``model`` to APIs such as ``ONNXProgram.__call__``, ``ONNXProgram.adapt_torch_inputs_to_onnx`` and ``ONNXProgram.adapt_torch_outputs_to_onnx`` APIs. This was needed because when the model is fakefied, a version of the non-fakefied model is needed so that the Initializers, buffers and constants can be extracted from a real model (and used as input to the ONNX model). That approach brings an unnecessary usability burden to the user when the model is not fakefied, because the model that was already passed to ``torch.onnx.dynamo_export`` could be used to extract ``state_dict``. This PR adds ``ONNXProgram._model_torch`` attribute to store the user model and demote ``model`` argument of the aforementioned APIs to optional, only (as opposed to required). As a result, for the fakefied model scenario, the user still need to pass the required model, but for non fakefied models, the persisted model is implicitly used to extract the model state_dict, making it easier to use. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115281 Approved by: https://github.com/BowenBao ghstack dependencies: #114407	2023-12-09 07:46:12 +00:00
BowenBao	2962271f58	[ONNX][dynamo_export] Extend expected fx output types for int, float, bool (#115431 ) Fixes exporting ops, such as `aten::_scaled_dot_product_flash_attention` that returns int, float, bool typed outputs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115431 Approved by: https://github.com/titaiwangms, https://github.com/thiagocrepaldi	2023-12-09 03:24:48 +00:00
Thiago Crepaldi	3c54ff6bcd	Update ONNX's IO Adapter to support FakeTensor with ExportedProgram (#114407 ) Currently, the ONNX exporter using torch.nn.Module as input can support FakeTensor because the ONNX model stores all initializers When using torch.export.ExportedProgram as input, the initializers are lifted as inputs. In order to execute the ONNX model, we need to pass a reference to the non-fake model to the ONNXProgram.adapt_torch_inputs_to_onnx API, so that initializers can be fetched from the model and fed to the ONNX model as input ps: https://github.com/pytorch/pytorch/issues/115461 will track the API revision for the cases where additional `model_with_state_dict` are required to produce complete ONNX files exported with fake support. This is also tracked by the umbrella fake tensor issue https://github.com/pytorch/pytorch/issues/105464 FYI @BowenBao Pull Request resolved: https://github.com/pytorch/pytorch/pull/114407 Approved by: https://github.com/BowenBao	2023-12-09 01:48:27 +00:00
titaiwangms	ae5365819d	[ONNX] Extend `test_fx_op_consistency.py` to cover ExportedProgram model type (#114886 ) This PR covers `ExportedProgram` to `test_fx_op_consistency.py`, which helps us identify the necessary but missing io_steps. Next, we should refactor the tests to actually cover all ops supported by registry. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114886 Approved by: https://github.com/thiagocrepaldi	2023-12-07 00:03:23 +00:00
titaiwangms	1aa54bdebf	[ONNX] Fix op level debug on complex dtype support (#114885 ) Previous to this PR, op level debug mismatches whenever it comes to complex dtype matching, because in ONNX, we support real representation. This PR makes sure we use real representation to compare the results. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114885 Approved by: https://github.com/BowenBao	2023-12-01 02:17:27 +00:00
BowenBao	8cfc95368f	[Experimental][ONNX] Export with symbolic shapes in proto (#112179 ) Experimental feature to store symbolic shapes produced by torch dynamo inside the exported onnx model. There is no official ONNX spec to support nodes within FunctionProto to have value info, https://github.com/onnx/onnx/issues/5487. The names for value info are generated uniquely to be retrievable based on the call site and call stack. This requires onnxscript with https://github.com/microsoft/onnxscript/tree/bowbao/export_symbolic_shapes Pull Request resolved: https://github.com/pytorch/pytorch/pull/112179 Approved by: https://github.com/titaiwangms, https://github.com/thiagocrepaldi	2023-11-30 02:03:32 +00:00
titaiwangms	e83c05c833	[ONNX] Add ONNX ExportedProgram tests (#114633 ) Fix #114166 Fix #113705 This PR references tests from `test_export.py` to make sure the exported program from PyTorch can all be successfully exported into ONNX model. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114633 Approved by: https://github.com/thiagocrepaldi	2023-11-28 19:03:13 +00:00
Xuehai Pan	89a1fe6966	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Changes: 1. Add `_private_register_pytree_node` API in both C++ and Python pytree. In C++ pytree, the API will only register pytree node for C++ pytree. In Python pytree, the API will only register pytree node for Python pytree. 2. Do not allow registering a type as pytree node twice in the Python pytree. 3. Add thread lock to the Python pytree node register API. 4. The old `_register_pytree_node` API will call the `_private_register_pytree_node` API and raise a deprecation warning. 5. Add a new `register_pytree_node` API to register node type in both C++ and Python implementations. 6. Add tests to ensure a warning will be raised when the old private function is called. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-28 11:41:38 +00:00
Zhengxu Chen	e0d2a24967	Reland "[export] Support user input mutation. [1/2]" (#114496 ) (#114596 ) Summary: Serialization not implemented yet. Will do in the next diff. Resolving Github issues: https://github.com/pytorch/pytorch/issues/112429 https://github.com/pytorch/pytorch/issues/114142 Test Plan: onnx doc test ``` python -m xdoctest /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/onnx/_internal/exporter.py ONNXProgram.model_signature:0 ``` Differential Revision: D51588558 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114596 Approved by: https://github.com/angelayi	2023-11-27 20:19:04 +00:00
PyTorch MergeBot	01366efcc9	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `4e4a6ad6ec`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/DanilBaibak due to Break internal build ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1824099658))	2023-11-23 09:59:32 +00:00
Thiago Crepaldi	a76bb5d84d	Add support for models with mutated buffer on torch.onnx.dynamo_export (#112272 ) This PR adds a unit test that leverages `torch.export.ExportedProgram` models that mutates registered buffers. Although the exporter already works out of the box in such scenario, the GraphModule and the exported ONNX model have extra outputs containing the mutated buffers. On future runs of the ONNX model, the mutated buffers are used as input to the model. The aforementioned extra inputs and outputs are by design and the `ONNXProgram.model_signature` can be used to fetch detailed input/output schema for the exported model. However, when we want to compare pytorch output to ONNX's, there is a mismatch between the schema because pytorch output does not include the mutated buffers present on the ONNX output. This PR extends `onnx_program.adapt_torch_outputs_to_onnx(torch_outputs)` so that the mutated buffers are prepended to the Pytorch output, matching the ONNX schema. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112272 Approved by: https://github.com/titaiwangms, https://github.com/BowenBao	2023-11-23 09:59:02 +00:00
PyTorch MergeBot	a43edd836c	Revert "Add support for models with mutated buffer on torch.onnx.dynamo_export (#112272 )" This reverts commit `c4a22d6918`. Reverted https://github.com/pytorch/pytorch/pull/112272 on behalf of https://github.com/huydhn due to Sorry for reverting you change but it is failing dynamo test in trunk `c4a22d6918` ([comment](https://github.com/pytorch/pytorch/pull/112272#issuecomment-1823897964))	2023-11-23 07:07:56 +00:00
Thiago Crepaldi	c4a22d6918	Add support for models with mutated buffer on torch.onnx.dynamo_export (#112272 ) This PR adds a unit test that leverages `torch.export.ExportedProgram` models that mutates registered buffers. Although the exporter already works out of the box in such scenario, the GraphModule and the exported ONNX model have extra outputs containing the mutated buffers. On future runs of the ONNX model, the mutated buffers are used as input to the model. The aforementioned extra inputs and outputs are by design and the `ONNXProgram.model_signature` can be used to fetch detailed input/output schema for the exported model. However, when we want to compare pytorch output to ONNX's, there is a mismatch between the schema because pytorch output does not include the mutated buffers present on the ONNX output. This PR extends `onnx_program.adapt_torch_outputs_to_onnx(torch_outputs)` so that the mutated buffers are prepended to the Pytorch output, matching the ONNX schema. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112272 Approved by: https://github.com/titaiwangms, https://github.com/BowenBao	2023-11-23 03:39:18 +00:00
CYuxian	9bab96c78c	[ONNX] Consider negative dim in _index_fill_reshape_helper (#114050 ) Fix export issue of index_copy op with negative dim. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114050 Approved by: https://github.com/thiagocrepaldi	2023-11-22 15:40:57 +00:00
BowenBao	bd44bdb675	[ONNX][dynamo_export] Turn off opmath for type promotion (#113780 ) Although opmath is the right thing to do to retain on-par precision, it inserts upcasts everywhere in the graph. This is particularly hard for backend to optimize since there is no way to differentiate between inserted upcasts and model code casts. Hence we consolidate the input dtype to the result dtype to avoid this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113780 Approved by: https://github.com/titaiwangms, https://github.com/justinchuby	2023-11-22 05:39:00 +00:00
Thiago Crepaldi	3f736c2d77	Add ONNXProgram.__call__ API to run model with ONNX Runtime (#113495 ) Currently the user can use torch.onnx.dynamo_export to export the model. to ONNX. ```python import torch class Model(torch.nn.Module): def forward(self, x): return x + 1.0 onnx_program = torch.onnx.dynamo_export( Model(), torch.randn(1, 1, 2, dtype=torch.float), ) ``` The next step would be instantiating a ONNX runtime to execute it. ```python import onnxruntime # type: ignore[import] onnx_input = self.adapt_torch_inputs_to_onnx(args, *kwargs) options = options or {} providers = options.get("providers", onnxruntime.get_available_providers()) onnx_model = self.model_proto.SerializeToString() ort_session = onnxruntime.InferenceSession(onnx_model, providers=providers) def to_numpy(tensor): return ( tensor.detach().cpu().numpy() if tensor.requires_grad else tensor.cpu().numpy() ) onnxruntime_input = { k.name: to_numpy(v) for k, v in zip(ort_session.get_inputs(), onnx_input) } return ort_session.run(None, onnxruntime_input) ``` This PR provides the `ONNXProgram.__call__` method as facilitator to use ONNX Runtime under the hood, similar to how `torch.export.ExportedProgram.__call__` which allows the underlying `torch.fx.GraphModule` to be executed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113495 Approved by: https://github.com/titaiwangms	2023-11-22 01:48:45 +00:00
CYuxian	b88abb1674	[ONNX] Fix export issue of aten::layer_norm in opset 17 (#114058 ) For torch.nn.LayerNorm, weight and bias could be None(when parameter elementwise_affine is False or bias is False), but for onnx op LayerNormalization from opset 17, weight and bias cannot be None. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114058 Approved by: https://github.com/thiagocrepaldi	2023-11-21 22:45:50 +00:00
Xuehai Pan	4e4a6ad6ec	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Changes: 1. Add `_private_register_pytree_node` API in both C++ and Python pytree. In C++ pytree, the API will only register pytree node for C++ pytree. In Python pytree, the API will only register pytree node for Python pytree. 2. Do not allow registering a type as pytree node twice in the Python pytree. 3. Add thread lock to the Python pytree node register API. 4. The old `_register_pytree_node` API will call the `_private_register_pytree_node` API and raise a deprecation warning. 5. Add a new `register_pytree_node` API to register node type in both C++ and Python implementations. 6. Add tests to ensure a warning will be raised when the old private function is called. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-21 19:53:13 +00:00
titaiwang	957312a4cf	[ONNX] Relax unsupported node analysis on complex dtype (#113785 ) In cases like #113444, users usually stop at UnsupportedNodeAnalysis with unsupported nodes information. Although in SARIF, they can clearly see it's due to lack of COMPLEX support, in screen error message, it's only showing original FX node name, such as `aten.mul.Tensor`. ~~This PR catches the information from diagnostic messages and reveal it to users.~~ The root cause is that UnsupportedNodeAnalysis is leveraging on `onnxfunction_dispatcher.get_function_overloads()` to decide if an ATen is supported or not. However, in `onnxfunction_dispatcher.get_function_overloads()`, lacking of complex function support is considered unsupported. This PR defines Unsupported FX nodes as not in registry. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113785 Approved by: https://github.com/thiagocrepaldi	2023-11-17 20:11:20 +00:00
BowenBao	31459e3e56	[ONNX][dynamo_export] Add 'aten::rsub' type promotion (#113697 ) The logic is the same as 'aten::sub'. Needed by llama2. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113697 Approved by: https://github.com/justinchuby, https://github.com/thiagocrepaldi ghstack dependencies: #113404	2023-11-17 00:50:05 +00:00
Thiago Crepaldi	e19ea53e1d	Add optional torch.export.ExportGraphSignature to ONNXProgram (#113477 ) When the ONNX model is exported from a torch.export.ExportedProgram, a torch.export.ExportedGraphSignature is available with the specification of the model inputs and outputs. ExportedGraphSignature includes information such as the mapping between the exported input/buffer/output ONNX name to the original pytorch input/buffer/output name. It also specifies the kind of the input, such as user_input, parameter, buffer or constant_tensor. Outputs kind can be user_output, loss_output, buffer_mutation, etc Such information can be useful to understand what the ONNX model expects as inputs and how the output will look like when the ONNX input/output differs from the original PyTorch input/output schema. When the ONNX model is exported from a Callable or regular torch.nn.MOdule, such information is not available and ONNXProgram.model_signature will yield NOne Pull Request resolved: https://github.com/pytorch/pytorch/pull/113477 Approved by: https://github.com/BowenBao	2023-11-16 22:04:44 +00:00
PyTorch MergeBot	bbd73c746e	Revert "[ONNX][dynamo_export] Add 'aten::rsub' type promotion (#113697 )" This reverts commit `48800e9bb0`. Reverted https://github.com/pytorch/pytorch/pull/113697 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it is failing in trunk `48800e9bb0`. The failure on the PR is legit https://github.com/pytorch/pytorch/actions/runs/6884783862/job/18728219414, let me take a look on why Dr.CI marks it as flaky ([comment](https://github.com/pytorch/pytorch/pull/113697#issuecomment-1813790907))	2023-11-16 04:59:32 +00:00
BowenBao	48800e9bb0	[ONNX][dynamo_export] Add 'aten::rsub' type promotion (#113697 ) The logic is the same as 'aten::sub'. Needed by llama2. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113697 Approved by: https://github.com/justinchuby, https://github.com/thiagocrepaldi ghstack dependencies: #113404	2023-11-16 03:31:07 +00:00
AllenTiTaiWang	d561654d99	[ONNX] Support more sympy operations in fx-onnx exporter (#112758 ) Fix https://github.com/microsoft/onnx-converters-private/issues/190 This PR retires built-in function mapping by adding built-in ops into torchlib (https://github.com/microsoft/onnxscript/pull/1135), and provide a runtime tests to guard the operation conversion. More built-in ops are supported in torchlib as well. ~~NOTE: `native_batch_norm` regression is caused by https://github.com/microsoft/onnxscript/issues/1140. Will fix it before I merge this.~~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/112758 Approved by: https://github.com/justinchuby, https://github.com/thiagocrepaldi	2023-11-14 03:40:48 +00:00
Thiago Crepaldi	cfee3bcf97	Add inheritance to ONNX's InputAdaptStep and OutputAdaptSet impl (#113476 ) This is a minor compliance change that specifies the InputAdaptStep and OutputAdapStep as the base class for the actual implementations Pull Request resolved: https://github.com/pytorch/pytorch/pull/113476 Approved by: https://github.com/justinchuby	2023-11-13 21:27:44 +00:00
AllenTiTaiWang	e8e3afb784	[ONNX] Refactor MaxPool to support dynamic inputs (#113318 ) In https://github.com/pytorch/pytorch/pull/106270, the solution managed to solve the [`ceil_model` corner issue](https://github.com/onnx/onnx/issues/5711) with the usage of `get_pool_ceil_padding`. However, padding the ceil in converter side only works when we already know the input shapes, therefore, a regression happens when users want to do dynamic inputs. This PR provides (1) refactor codes with torchlib implementation, (2) add dynamic shapes test, and (3) disable the corner tests with comments saying re-enable it when the [real fix from ONNX](https://github.com/onnx/onnx/pull/5741) is merged. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113318 Approved by: https://github.com/thiagocrepaldi	2023-11-10 23:23:49 +00:00
PyTorch MergeBot	2a271a3efa	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `a0d00349ed`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/PaliC due to _private_register_pytree_node now checks for duplicate registering, unfortunately, this breaks composability with torchrec internally :( ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1806130993))	2023-11-10 17:24:40 +00:00
Xuehai Pan	a0d00349ed	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-10 02:41:30 +00:00
PyTorch MergeBot	9a90989121	Revert "[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 )" This reverts commit `95f52611c7`. Reverted https://github.com/pytorch/pytorch/pull/112111 on behalf of https://github.com/PaliC due to in the bottom diff in the stack changing _register_pytree_node's signature is bc breaking, please revert the signature and reland ([comment](https://github.com/pytorch/pytorch/pull/112111#issuecomment-1804892924))	2023-11-10 00:38:28 +00:00
Thiago Crepaldi	16f82198ca	Export ReduleL1/ReduceL2 ONNX ops for aten::linalg_vector_norm(ord={1,2}) (#113173 ) After #84624, aten::linalg_vector_norm started being used instead of aten::norm. In the ONNX exporter, the latter leveraged Reduce{L1,L2} when p={1,2}, which resulted in more optimized code in the ONNX Runtime This PR extends aten::linal_vector_norm to also use Reduce{L1,L2} when ord={1,2}, producing an equivalent ONNX subgraph This PR is a WIP. Pending work include checking argument equivalence between `aten::norm` and `aten::linalg_vector_norm` and maybe re-enable tests disabled by #84624 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113173 Approved by: https://github.com/justinchuby	2023-11-08 19:08:43 +00:00
Xuehai Pan	95f52611c7	[pytree] register pytree node type in both C++ pytree and Python pytree (#112111 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112111 Approved by: https://github.com/zou3519	2023-11-08 05:02:03 +00:00
Thiago Crepaldi	eefe327b11	Rename torch.onnx.ExportOutput* to ONNXProgram* (#112263 ) Since PyTorch 2.1, torch.export API was introduced and the term "export" got overloaded due to the already existing torch.onnx.export API. The torch.onnx.dynamo_export API was introduced on pyTorch 2.0 and it exposed a torch.onnx.ExportOutput which now can be confused with torch.export.export output To prevent such ambiguity and standardize names around the new torch.export.ExportedProgram, this PR renames torch.onnx.ExportOutput to torch.onnx.ONNXProgram Pull Request resolved: https://github.com/pytorch/pytorch/pull/112263 Approved by: https://github.com/BowenBao ghstack dependencies: #112444	2023-11-06 22:27:15 +00:00
Adam Louly	c27a03a4e5	[ONNX] Cast scale back to fp16 after _attention_scale. (#112554 ) ### Description: The problem is that the graph was cast to `fp32` at a certain point but never reverted to `fp16`, causing the rest of the graph to run on `fp32`. This change aims to fix that issue and improve performance. ### Changes Made: - Modified the ONNX exporter code to ensure that the graph is correctly cast back to `fp16` after a necessary cast to `fp32`. ### Why This Change is Necessary: This change is necessary to ensure that the exported ONNX graph remains in `fp16` where appropriate, leading to significant gains in performance and memory savings. Without this fix, the graph would run entirely in `fp32`, causing suboptimal performance. ### Testing: - Performed extensive testing with various models and scenarios to validate the correctness of the changes. ### Benchmarking Results: Experiments Ran on: 8 GPUS - Tesla V100 - 32GB Before Fix: ort + 4 hidden layers + without fix - Train Runtime: 78.7088 seconds - Train Samples per Second: 10.164 - Train Steps per Second: 1.271 - Train Loss: 5.624655108451844 - Epoch: 0.3 After Fix: ort + 4 hidden layers + with fix - Train Runtime: 72.5636 seconds - Train Samples per Second: 11.025 - Train Steps per Second: 1.378 - Train Loss: 5.6252727746963505 - Epoch: 0.3 We can see 7.79% perf gain after this fix. - I only ran it on 4 hidden layers due to GPU constraints, the perf gain is going to be much higher on the full model. - You could see the gain on other models that uses _attention_scale as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112554 Approved by: https://github.com/BowenBao, https://github.com/thiagocrepaldi	2023-11-02 23:18:53 +00:00
Thiago Crepaldi	01e4984bac	Add decomposition for dynamo_export + ExportedProgram and remove None from input (#112444 ) This PR introduces the ability to produce GraphModules with Core ATen IR only through decompositions. It also removes `None` from user inputs as ONNX does not supports them Tests for these features will be executed when #112289 is merged, but for reference, they are as below: ```python def test_log_sigmoid(self): # This produces op as `torch.ops.aten.log_sigmoid_forward`, instead of the more # conventional `torch.ops.aten.log_sigmoid`. class Model(torch.nn.Module): def __init__(self): super().__init__() self.m = torch.nn.LogSigmoid() def forward(self, x): return self.m(x) input = torch.randn(2) self.run_test_with_fx_to_onnx_exporter_and_onnx_runtime( Model(), (input,), model_type=self.model_type ) def test_none_input(self): class NoneInputModel(torch.nn.Module): def forward( self, x: torch.Tensor, y: Optional[torch.Tensor], z: torch.Tensor ): if y is None: return x + z return x + y + z self.run_test_with_fx_to_onnx_exporter_and_onnx_runtime( NoneInputModel(), (torch.randn(1, 2), None, torch.randn(1, 2)), model_type=self.model_type, ) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/112444 Approved by: https://github.com/BowenBao	2023-11-02 02:30:59 +00:00
Wei-Sheng Chin	ab20bab729	[ONNX] Fix partial name matching when searching parameter tensors (#112517 ) Now we remove name in `onnx_input_names` once it's matched by a parameter so that the same name won't be matched twice. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112517 Approved by: https://github.com/thiagocrepaldi	2023-11-01 18:25:26 +00:00
Peter Bell	66c32d099a	Use `pytree.arg_tree_leaves` everywhere (#112394 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112394 Approved by: https://github.com/lezcano ghstack dependencies: #112391, #112392, #112393	2023-10-31 15:57:06 +00:00
Peter Bell	bbd5b935e4	Use `pytree.tree_leaves` everywhere (#112324 ) This changes all the instances I could find of `tree_flatten(...)[0]` or `x, _ = tree_flatten` to use `tree_leaves`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112324 Approved by: https://github.com/lezcano ghstack dependencies: #112327, #112323	2023-10-30 03:39:04 +00:00
BowenBao	b365acba28	[ONNX] A better way to safe guard 2GB model serialization (#111984 ) Summary - faster than previous try-catch. - more stable than previous try-catch. In some circumstances serializing models > 2GB into a single protobuf file ends up with a corrupted file without raising an exception. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111984 Approved by: https://github.com/justinchuby	2023-10-25 19:18:37 +00:00
Thiago Crepaldi	9d4dbebc34	Add support to ExportedProgram as input to torch.onnx.dynamo_export (#111497 ) Fixes #109889 This PR adds `torch.export.export` as another `FXGraphExtractor` implementation. `torch.onnx.dynamo_export` automatically uses this new FX tracer when a `torch.export.ExportedProgram` is specified as `model` Implementation is back compatible, thus non `ExportedProgram` models are handled the exact same way as before Pull Request resolved: https://github.com/pytorch/pytorch/pull/111497 Approved by: https://github.com/BowenBao	2023-10-25 18:11:19 +00:00
PyTorch MergeBot	5120c97f32	Revert "Add support to ExportedProgram as input to torch.onnx.dynamo_export (#111497 )" This reverts commit `4f42edfb6e`. Reverted https://github.com/pytorch/pytorch/pull/111497 on behalf of https://github.com/huydhn due to Sorry for reverting your change, it is failing ONNX test in trunk `4f42edfb6e`, possibly a landrace ([comment](https://github.com/pytorch/pytorch/pull/111497#issuecomment-1778519212))	2023-10-25 05:07:00 +00:00
Thiago Crepaldi	4f42edfb6e	Add support to ExportedProgram as input to torch.onnx.dynamo_export (#111497 ) Fixes #109889 This PR adds `torch.export.export` as another `FXGraphExtractor` implementation. `torch.onnx.dynamo_export` automatically uses this new FX tracer when a `torch.export.ExportedProgram` is specified as `model` Implementation is back compatible, thus non `ExportedProgram` models are handled the exact same way as before Pull Request resolved: https://github.com/pytorch/pytorch/pull/111497 Approved by: https://github.com/BowenBao	2023-10-25 00:17:43 +00:00
Peter Bell	46e80ce58a	[ATen] Support multi dim any and all reductions (#110310 ) This adds a new overload to `all` and `any` with support for multiple reduction dims. ``` all.dims(Tensor self, int[1]? dim=None, bool keepdim=False) -> Tensor any.dims(Tensor self, int[1]? dim=None, bool keepdim=False) -> Tensor ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/110310 Approved by: https://github.com/lezcano, https://github.com/albanD, https://github.com/justinchuby	2023-10-24 21:33:53 +00:00
BowenBao	5eac44bc72	Ignore beartype if its version is 0.16.0 (#111859 ) With this fix, 'beartype' 0.16.0 should be ignored and not crash PyTorch. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111859 Approved by: https://github.com/justinchuby, https://github.com/thiagocrepaldi	2023-10-24 02:11:26 +00:00
Aaron Gokaslan	cb856b08b2	[BE]: Attach cause to some exceptions and enable RUFF TRY200 (#111496 ) Did some easy fixes from enabling TRY200. Most of these seem like oversights instead of intentional. The proper way to silence intentional errors is with `from None` to note that you thought about whether it should contain the cause and decided against it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111496 Approved by: https://github.com/malfet	2023-10-19 21:56:36 +00:00
Thiago Crepaldi	9ab6ac5bc1	[ONNX] Fix aten::new_zeros due to TorchScript behavior change on Pytorch 2.1 Fix #110935 (#110956 ) Fixes #110597 Summary: * Generic code: The `torch._C.Value.node().mustBeNone()` is encapsulated into the high-level API `JitScalarType.from_value` ; `_is_none` was also extended to allow either `None` or `torch._C.Value.node.mustBeNone()`, so users don't manually call into TorchScript API when implementing operators * Specific to `new_zeros` (and ops of ` _like` and `new_`): When checking `dtype`, we always must use ` _is_none`, which will call proposed by #110935 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110956 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2023-10-16 18:28:20 +00:00
veritas-Qiu	a3e9b80082	Fix torch.diagonal for torch.onnx.export when dim1<0 or dim2<0 (#111130 ) in many cases, torch.diagonal will pass (dim1=-2, dim2=-1), onnx export will always fail in these cases this pr try to fix the bug Pull Request resolved: https://github.com/pytorch/pytorch/pull/111130 Approved by: https://github.com/thiagocrepaldi	2023-10-13 22:05:53 +00:00
Kazuaki Ishizaki	f7ce19d40a	Fix typo under torch/onnx directory (#110697 ) This PR fixes typo of comments in files under `torch/onnx` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110697 Approved by: https://github.com/ezyang	2023-10-06 18:21:00 +00:00
Fabrice Pont	053367b1ed	fix: flake8-bugbear code B024 (#107265 ) See #106571 item B024 This fix concerns the addition of `abstractmethod` to methods declared inside abstract classes. Should I also include PEP8 compliant reformatting on the files I had to modify ? Pull Request resolved: https://github.com/pytorch/pytorch/pull/107265 Approved by: https://github.com/kit1980	2023-10-04 23:52:52 +00:00
ruiren	e4414716d5	[onnx] support attn_mask fp16 type (#110306 ) When users define customized `attention mask` using `dtype=torch.float16`, e.g. ``` from torch.nn import functional as F float_min = torch.finfo(torch.float16).min attention_mask_fp16 = (attention_mask * 1.0).masked_fill(attention_mask, float_min).to(torch.float16) attn_output = F.scaled_dot_product_attention( query_layer_, key_layer_, value_layer_, attention_mask_fp16, 0.0, is_causal=False ) ``` the onnx graph cannot be exported. When q, k ,v have the fp16 type, we can support this `attn_mask` to be `fp16` type, by adding ``` elif ( _type_utils.JitScalarType.from_value(attn_mask) == _type_utils.JitScalarType.FLOAT in (_type_utils.JitScalarType.FLOAT, _type_utils.JitScalarType.HALF) ``` This can export `.onnx` graph. Fixes #109336 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110306 Approved by: https://github.com/titaiwangms	2023-10-01 14:50:58 +00:00
PyTorch MergeBot	a5364b12bb	Revert "[ONNX] Remove the depreacated function `_export` (#109763 )" This reverts commit `d7c05bb2e8`. Reverted https://github.com/pytorch/pytorch/pull/109763 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/109763#issuecomment-1734201053))	2023-09-25 17:47:21 +00:00

1 2 3 4 5 ...

1293 Commits