pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
PyTorch MergeBot	3f840cc627	Revert "Ignore shape inference exception from Caffe2 ATen fallback (#90408 )" This reverts commit `1d4e872370`. Reverted https://github.com/pytorch/pytorch/pull/90408 on behalf of https://github.com/huydhn due to Sorry for reverting your PR, but it breaks lint check https://hud.pytorch.org/pr/90408#11855039599. Please fix the error and reland your change	2023-03-08 17:28:21 +00:00
Thiago Crepaldi	1d4e872370	Ignore shape inference exception from Caffe2 ATen fallback (#90408 ) Fixes #87318 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90408 Approved by: https://github.com/BowenBao	2023-03-08 16:57:48 +00:00
Thiago Crepaldi	a9a3a1bd14	Apply peephole for eval mode when constant folding is enabled only (#95801 ) Fixes https://github.com/microsoft/onnx-converters-private/issues/150 Pull Request resolved: https://github.com/pytorch/pytorch/pull/95801 Approved by: https://github.com/BowenBao	2023-03-01 23:07:38 +00:00
Thiago Crepaldi	d978395f55	Deprecate Caffe2 ONNX exporter (#94994 ) Discussed on Weekly meeting with Meta on 2/16/2023 with @kit1980 @malfet Pull Request resolved: https://github.com/pytorch/pytorch/pull/94994 Approved by: https://github.com/Skylion007, https://github.com/BowenBao	2023-02-17 15:41:11 +00:00
Xuehai Pan	b005ec62b9	[BE] Remove dependency on `six` and `future` (#94709 ) Remove the Python 2 and 3 compatibility library [six](https://pypi.org/project/six) and [future](https://pypi.org/project/future) and `torch._six`. We only support Python 3.8+ now. It's time to retire them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94709 Approved by: https://github.com/malfet, https://github.com/Skylion007	2023-02-14 09:14:14 +00:00
Justin Chu	6cef200af9	[ONNX] Wrap symbolic method calls with graph context (#94746 ) This should address #93370 Pull Request resolved: https://github.com/pytorch/pytorch/pull/94746 Approved by: https://github.com/BowenBao	2023-02-13 21:29:39 +00:00
Aaron Gokaslan	9171f7d4cd	[BE] Modernize PyTorch even more for 3.8 with pyupgrade (#94520 ) Applies some more pyupgrade fixits to PyTorch Pull Request resolved: https://github.com/pytorch/pytorch/pull/94520 Approved by: https://github.com/ezyang	2023-02-10 18:02:50 +00:00
AllenTiTaiWang	04b06c9627	[ONNX] Use optional op to keep None in results for ONNX internal tests (#84789 ) All this time, PyTorch and ONNX has different strategy for None in output. And in internal test, we flatten the torch outputs to see if the rest of them matched. However, this doesn't work anymore in scripting after Optional node is introduced, since some of None would be kept. #83184 forces script module to keep all Nones from Pytorch, but in ONNX, the model only keeps the ones generated with Optional node, and deletes those meaningless None. This PR uses Optional node to keep those meaningless None in output as well, so when it comes to script module result comparison, Pytorch and ONNX should have the same amount of Nones. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84789 Approved by: https://github.com/BowenBao	2023-02-08 23:04:47 +00:00
AllenTiTaiWang	b27ac6dc56	[ONNX] Add full checker mode in torch.onnx.export (#83186 ) Fix #82589 Why: 1. full_check works in `onnx::checker::check_model` function as it turns on strict_mode in `onnx::shape_inference::InferShapes()` which I think that was the intention of this part of code. 2. strict_mode catches failed shape type inference (invalid ONNX model from onnx perspective) and ONNXRUNTIME can't run these invalid models, as ONNXRUNTIME actually rely on ONNX shape type inference to optimize ONNX graph. Why we don't set it True for default? >>> some of existing users use other platform, such as caffe2 to run ONNX model which doesn't need valid ONNX model to run. 3. This PR doesn't change the original behavior of `check_onnx_proto`, but add a warning message for those models which can't pass strict shape type inference, saying the models would fail on onnxruntime. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83186 Approved by: https://github.com/justinchuby, https://github.com/thiagocrepaldi, https://github.com/jcwchen, https://github.com/BowenBao	2023-02-08 22:47:25 +00:00
Ram Rachum	77f9b2e8bf	Fix exception causes in fx, nn and onnx packages (#90134 ) This is a continuation of #90118 @kit1980 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90134 Approved by: https://github.com/kit1980	2022-12-06 04:34:58 +00:00
Keval Morabia	3d247a8bcd	Fix unconvertible_ops as per #89261 (#89299 ) Fixes #89261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89299 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-11-21 20:40:04 +00:00
AllenTiTaiWang	c3acb9c885	[ONNX] Add Internal Utils: onnx_proto_utils.py for onnx/onnx-script/onnx_proto (#88376 ) Added `onnx_proto_utils.py` for onnx/onnx-script related process. The idea is like jit_utils.py, and to simplify what we have in `torch/onnx/utils.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/88376 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-11-17 03:08:09 +00:00
AllenTiTaiWang	abe41aee77	[ONNX] Support custom Op with onnx-script local function (#86906 ) Extend `register_custom_op` to support onnx-script local function. The FunctionProto from onnx-script is represented by custom op and inserted into ModelProto for op execution. NOTE: I did experiments on >2GB case of a simple model with large initializers: ```python import torch class Net(torch.nn.Module): def __init__(self, B, C): super().__init__() self.layer_norm = torch.nn.LayerNorm((B, C), eps=1e-3) def forward(self, x): return self.layer_norm(x) N, B, C = 3, 25000, 25000 model = Net(B, C) x = torch.randn(N, B, C) torch.onnx.export(model, x, "large_model.onnx", opset_version=12) ``` And it turns out we won't get model_bytes > 2GB after `_export_onnx` pybind cpp function, as we split initializer in external files in that function, and have serialization before return the model bytes, which protobuf is not allowed to be larger than 2GB at any circumstances. The test cases can be found in the next PR #86907 . Pull Request resolved: https://github.com/pytorch/pytorch/pull/86906 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-11-16 15:08:55 +00:00
Thiago Crepaldi	5f0783bd6d	Fix ATen Fallback for BUILD_CAFFE2=0 for ONNX-only ops (#88504 ) Follow-up for #87735 Once again, because BUILD_CAFFE2=0 is not tested for ONNX exporter, one scenario slipped through. A use case where the model can be exported without aten fallback when operator_export_type=ONNX_ATEN_FALLBACK and BUILD_CAFFE2=0 A new unit test has been added, but it won't prevent regressions if BUILD_CAFFE2=0 is not executed on CI again Fixes #87313 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88504 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-11-11 17:43:46 +00:00
Thiago Crepaldi	2aed670710	Fix ONNX operator_export_type on the new registry (#87735 ) Fixes #87313 Our ONNX pipelines do not run with BUILD_CAFFE2=0, so tests for operator_export_type ONNX_ATEN and ONNX_ATEN_FALLBACK will not be fully tested, allowing regressions to happen again. We need to run the same set of tests for both BUILD_CAFFE2=0 and 1 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87735 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-11-02 15:54:40 +00:00
AllenTiTaiWang	4210cebc16	[ONNX] Add internal node kind parsing (#87638 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87638 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-10-29 11:51:23 +00:00
Justin Chu	c600ce39ed	[ONNX] Refactor UnsupportedOperatorError arguments (#85349 ) Merged the first two arguments because we always use qualified names to identify symbolic functions Pull Request resolved: https://github.com/pytorch/pytorch/pull/85349 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-10-26 00:21:58 +00:00
Justin Chu	56a744bf47	[ONNX] Reland: Update training state logic to support ScriptedModule (#86745 ) In https://github.com/pytorch/pytorch/issues/86325, it was reported that ScriptedModule do not have a training attribute and will fail export because we don't expect them as input. Also - Parameterized the test_util_funs test Thanks @borisfom for the suggestion! Fixes #86325 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86745 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-10-14 19:44:47 +00:00
PyTorch MergeBot	056cfb0464	Revert "[ONNX] Update training state logic to support ScriptedModule (#86745 )" This reverts commit `960b98128e`. Reverted https://github.com/pytorch/pytorch/pull/86745 on behalf of https://github.com/janeyx99 due to `960b98128e` broke onnx tests on trunk	2022-10-14 05:40:20 +00:00
Justin Chu	960b98128e	[ONNX] Update training state logic to support ScriptedModule (#86745 ) In https://github.com/pytorch/pytorch/issues/86325, it was reported that ScriptedModule do not have a training attribute and will fail export because we don't expect them as input. Also - Parameterized the test_util_funs test Thanks @borisfom for the suggestion! Fixes #86325 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86745 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-10-14 01:31:40 +00:00
Justin Chu	69b927701a	[ONNX] Update user documentation (#85819 ) - Remove mentions of `SymbolicContext` in the doc - Comment out the PythonOp example so that it is not shown to users - Updated code blocks and wording - Changed to recommend using `pip` for installing onnx. Now adds a deprecation message to the docs (demo only): ![image](https://user-images.githubusercontent.com/11205048/193327649-f789b369-6b59-49e0-8bba-34a6785eb128.png) Fixes #85608 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85819 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-30 19:35:34 +00:00
BowenBao	8f4edf1e1d	[ONNX] Initial version of diagnostics infrastructure. (#85107 ) This PR introduces a general Python diagnostics infrastructure powered by SARIF, and the exporter diagnostics module that builds on top of it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85107 Approved by: https://github.com/abock, https://github.com/justinchuby	2022-09-30 07:47:26 +00:00
Justin Chu	dc63948dc9	[ONNX] Update behavior for `register_custom_op_symbolic` (#85636 ) Update `register_custom_op_symbolic`'s behavior to _only register the symbolic function at a single version_. This is more aligned with the semantics of the API signature. As a result of this change, opset 7 and opset 8 implementations are now seen as fallback when the opset_version >= 9. Previously any ops internally registered to opset < 9 are not discoverable by an export version target >= 9. Updated the test to reflect this change. The implication of this change is that users will need to register a symbolic function to the exact version when they want to override an existing symbolic. They are not impacted if (1) an implementation does not existing for the op, or (2) they are already registering to the exact version for export. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85636 Approved by: https://github.com/BowenBao	2022-09-29 04:24:06 +00:00
Justin Chu	7cdd39b393	[ONNX] Update `unconvertible_ops` (#85595 ) Update `unconvertible_ops` to create a list of unconvertible ops using the updated registry. - Use fewer passes in the jit process instead to avoid errors during conversion in the ONNX fallback mode - Actually check the registry to find implemented ops - Fix type hints for `_create_jit_graph` and `_jit_pass_onnx_remove_inplace_ops_for_onnx` Pull Request resolved: https://github.com/pytorch/pytorch/pull/85595 Approved by: https://github.com/BowenBao	2022-09-29 00:52:21 +00:00
Justin Chu	85d8441fba	[ONNX] Deprecate setter functions for global variables (#85165 ) `_set_opset_version` and `_set_operator_export_type` are previously deprecated. This PR decorates them with the deprecation decorator, so warnings are emitted. - Remove usage of `_set_opset_version` and `_set_operator_export_type` in favor of setting the globals vars directly in torch.onnx internal - Update `GLOBALS.operator_export_type`'s default to not be None to tighten types - Remove usage of `_set_onnx_shape_inference` Pull Request resolved: https://github.com/pytorch/pytorch/pull/85165 Approved by: https://github.com/BowenBao, https://github.com/AllenTiTaiWang	2022-09-28 22:43:43 +00:00
Justin Chu	c42a408baa	[ONNX] Create decorator to handle symbolic context (#84776 ) - Create decorator to handle old style custom symbolics that require context - Deprecate `torch.onnx.SymbolicContext` in favor of `GraphContext`. Added deprecation message - Remove README reference of SymbolicContext Pull Request resolved: https://github.com/pytorch/pytorch/pull/84776 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-28 22:36:54 +00:00
Justin Chu	3d2316670f	[ONNX] Create GraphContext and load `g.op` method to the class (#84728 ) This PR create the `GraphContext` class and relays all graph methods to _C.Graph as well as implements the `g.op` method. The GraphContext object is passed into the symbolic functions in place of _C.Graph for compatibility with existing symbolic functions. This way (1) we can type annotate all `g` args because the method is defined and (2) we can use additional context information in symbolic functions. (3) no more monkey patching on `_C.Graph` Also - Fix return type of `_jit_pass_fixup_onnx_controlflow_node` - Create `torchscript.py` to house torch.Graph related functions - Change `GraphContext.op` to create nodes in the Block instead of the Graph - Create `add_op_with_blocks` to handle scenarios where we need to directly manipulate sub-blocks. Update loop and if symbolic functions to use this function. ## Discussion Should we put all the context inside `SymbolicContext` and make it an attribute in the `GraphContext` class? This way we only define two attributes `GraphContext.graph` and `GraphContext.context`. Currently all context attributes are directly defined in the class. ### Decision Keep GraphContext flatand note that it will change in the future. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84728 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-28 22:21:55 +00:00
Thiago Crepaldi	6cfe555f4f	[ONNX] Apply Common Subexpression Elimination pass to ONNX export (#85665 ) ## Summary Exporting graphs with Autocast may fail due to a limitation on JIT tracer. By disabling Autocast cache, tracer works, but there can be performance hit when there is reuse of weights in convolution, for example By applying CSE, such performance loss can be reverted. ps: As a comment at #84092 mentioned, disabling Autocast cache is an acceptable workaround and used throughout PyTorch code. Fixes #84092 ## Examples of before and after CSE being applied: ### Example: eliminating `%17` and reusing `%16` instead ```python # BEFORE graph(%0 : Float(requires_grad=0, device=cpu)): %3 : Scalar = aten::ScalarImplicit(%0), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %13 : int = prim::Constant[value=3](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %14 : int = prim::Constant[value=4](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %15 : int[] = prim::ListConstruct(%13, %14), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %16 : NoneType = prim::Constant(), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %17 : NoneType = prim::Constant(), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %18 : Device = prim::Constant[value="cpu"](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %19 : bool = prim::Constant[value=0](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %20 : Float(3, 4, strides=[4, 1], requires_grad=0, device=cpu) = aten::full(%15, %3, %16, %17, %18, %19), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 return (%20) AFTER graph(%0 : Float(requires_grad=0, device=cpu)): %3 : Scalar = aten::ScalarImplicit(%0), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %13 : int = prim::Constant[value=3](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %14 : int = prim::Constant[value=4](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %15 : int[] = prim::ListConstruct(%13, %14), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %16 : NoneType = prim::Constant(), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: %18 : Device = prim::Constant[value="cpu"](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %19 : bool = prim::Constant[value=0](), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 %20 : Float(3, 4, strides=[4, 1], requires_grad=0, device=cpu) = aten::full(%15, %3, %16, %16, %18, %19), scope: test_onnx_opset.TestONNXOpset.test_full.<locals>.MyModule:: # /home/thiagofc/dev/github/pytorch/test/onnx/test_onnx_opset.py:347:0 return (%20) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/85665 Approved by: https://github.com/ngimel, https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-27 21:26:32 +00:00
Justin Chu	2f50d2f685	[ONNX] Update docs on symbolic registration (#85290 ) - Move inline instructions on editing symbolic functions to the README - Add a line on using the symbolic function registration decorator. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85290 Approved by: https://github.com/BowenBao	2022-09-22 13:37:11 +00:00
Justin Chu	9d1155235b	[ONNX] Create decorators for symbolic function registration (#84709 ) This PR creates and tests the decorators proposed in #83787 Pull Request resolved: https://github.com/pytorch/pytorch/pull/84709 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-17 01:01:04 +00:00
Justin Chu	cd7e6d4ad1	[ONNX] New symbolic function registry (#84382 ) ## Summary The change brings the new registry for symbolic functions in ONNX. The `SymbolicRegistry` class in `torch.onnx._internal.registration` replaces the dictionary and various functions defined in `torch.onnx.symbolic_registry`. The new registry - Has faster lookup by storing only functions in the opset version they are defined in - Is easier to manage and interact with due to its class design - Builds the foundation for the more flexible registration process detailed in #83787 Implementation changes - Breaking: Remove `torch.onnx.symbolic_registry` - `register_custom_op_symbolic` and `unregister_custom_op_symbolic` in utils maintain their api for compatibility - Update _onnx_supported_ops.py for doc generation to include quantized ops. - Update code to register python ops in `torch/csrc/jit/passes/onnx.cpp` ## Profiling results -0.1 seconds in execution time. -34% time spent in `_run_symbolic_function`. Tested on the alexnet example in public doc. ### After ``` └─ 1.641 export <@beartype(torch.onnx.utils.export) at 0x7f19be17f790>:1 └─ 1.641 export torch/onnx/utils.py:185 └─ 1.640 _export torch/onnx/utils.py:1331 ├─ 0.889 _model_to_graph torch/onnx/utils.py:1005 │ ├─ 0.478 _optimize_graph torch/onnx/utils.py:535 │ │ ├─ 0.214 PyCapsule._jit_pass_onnx_graph_shape_type_inference <built-in>:0 │ │ │ [2 frames hidden] <built-in> │ │ ├─ 0.190 _run_symbolic_function torch/onnx/utils.py:1670 │ │ │ └─ 0.145 Constant torch/onnx/symbolic_opset9.py:5782 │ │ │ └─ 0.139 _graph_op torch/onnx/_patch_torch.py:18 │ │ │ └─ 0.134 PyCapsule._jit_pass_onnx_node_shape_type_inference <built-in>:0 │ │ │ [2 frames hidden] <built-in> │ │ └─ 0.033 [self] ``` ### Before ![image](https://user-images.githubusercontent.com/11205048/188032302-688d881e-860d-4046-bdba-90da54233576.png) ### Start up time The startup process takes 0.03 seconds. Calls to `inspect` will be eliminated when we switch to using decorators for registration in #84448 ![image](https://user-images.githubusercontent.com/11205048/188208910-250f0434-475d-4872-9abc-781535519305.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84382 Approved by: https://github.com/AllenTiTaiWang, https://github.com/BowenBao	2022-09-16 21:45:16 +00:00
Justin Chu	2fa8142cf9	[ONNX] Rename constants for clarity (#84645 ) Rename constants to make them more clear. Fix styles to upper case. Removed `onnx_stable_opsets` because it can be computed from `ONNX_MIN_OPSET` and `ONNX_MAX_OPSET`. Fixes #84643 Pull Request resolved: https://github.com/pytorch/pytorch/pull/84645 Approved by: https://github.com/BowenBao	2022-09-09 01:22:14 +00:00
Thibault	d9ceda49c4	ONNX: fix default function value in _optimize_graph (#83996 ) The default value for params_dict in _optimize_graph, which is None, throw the following error: > _C._jit_pass_onnx_unpack_quantized_weights( > TypeError: _jit_pass_onnx_unpack_quantized_weights(): incompatible function arguments. The following argument types are supported: > 1. (arg0: torch::jit::Graph, arg1: Dict[str, IValue], arg2: bool) -> Dict[str, IValue] Replacing it by an empty dict fixes the issue (and makes more sense). Pull Request resolved: https://github.com/pytorch/pytorch/pull/83996 Approved by: https://github.com/BowenBao	2022-09-06 23:32:16 +00:00
Justin Chu	388368b699	[ONNX] Fix type annotations and enable type checking for all apis (#84091 ) Enable runtime type checking for all torch.onnx public apis, symbolic functions and most helpers (minus two that does not have a checkable type: `_.JitType` does not exist) by adding the beartype decorator. Fix type annotations to makes unit tests green. Profile: export `torchvision.models.alexnet(pretrained=True)` ``` with runtime type checking: 21.314 / 10 passes without runtime type checking: 20.797 / 10 passes + 2.48% ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/84091 Approved by: https://github.com/BowenBao, https://github.com/thiagocrepaldi	2022-09-03 01:40:18 +00:00
titaiwang	ece0002c4b	[ONNX] Disable autocast cache in exporter (#84219 ) This PR provides a temporary fix on #84092 in exporter to avoid more cases falling into this bug. A long-term fix will be provided later. A simple repro with torch.onnx.export is still under investigation, as torch.jit.trace() is not the API we call inside torch.onnx.export, and it may introduce the difference. Therefore, a test case is provided here only. A specific test one can use, ```python import torch import onnxruntime from onnxruntime.training.ortmodule import DebugOptions, LogLevel from onnxruntime.training.ortmodule import ORTModule class MyModule(torch.nn.Module): def __init__(self): super().__init__() self.cv1 = torch.nn.Conv2d(3, 3, 5, 2, 1) def forward(self, x): x = self.cv1(x) return x x = torch.randn(10, 3, 20, 20) * 2 m = MyModule().eval() x = x.cuda() m = m.cuda() debug_options = DebugOptions(log_level=LogLevel.VERBOSE, save_onnx=True, onnx_prefix="ViT-B") m = ORTModule(m, debug_options=debug_options) with torch.cuda.amp.autocast(dtype=torch.float16, cache_enabled=True): loss = m(x) ``` AND make assertion fail in ORTModule `17ccd6fa02/orttraining/orttraining/python/training/ortmodule/_io.py (L578-L581)` Without the fix, the user will see the weight/bias of Conv node becomes constant. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84219 Approved by: https://github.com/BowenBao, https://github.com/thiagocrepaldi	2022-09-01 00:34:37 +00:00
titaiwang	18264432f7	[ONNX] replace all _C._flatten to torch.jit._flatten (#83598 ) _C._flatten is exactly the same as torch.jit._flatten. Unifying them to reduce confusion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83598 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-09-01 00:31:28 +00:00
BowenBao	806878518f	[ONNX][Reland] Export node and value with scope name (#82040 ) Introduce `_jit_pass_onnx_assign_node_and_value_names` to parse and assign scoped name for nodes and values in exported onnx graph. Module layer information is obtained from `ONNXScopeName` captured in `scope` attribute in nodes. For nodes, the processed onnx node name are stored in attribute `onnx_name`. For values, the processed onnx output name are stored as `debugName`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82040 Approved by: https://github.com/AllenTiTaiWang, https://github.com/justinchuby, https://github.com/abock	2022-08-29 20:10:38 +00:00
PyTorch MergeBot	8e6207bcd8	Revert "[ONNX] Export node and value with scope name (#82040 )" This reverts commit `6a3666282d`. Reverted https://github.com/pytorch/pytorch/pull/82040 on behalf of https://github.com/weiwangmeta due to Diff reverted internally	2022-08-29 06:36:18 +00:00
PyTorch MergeBot	d8cc8368ab	Revert "[ONNX] Fix type annotations and enable type checking for all apis (#84091 )" This reverts commit `6446da1730`. Reverted https://github.com/pytorch/pytorch/pull/84091 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally	2022-08-28 12:28:58 +00:00
Justin Chu	6446da1730	[ONNX] Fix type annotations and enable type checking for all apis (#84091 ) Enable runtime type checking for all torch.onnx public apis, symbolic functions and most helpers (minus two that does not have a checkable type: `_.JitType` does not exist) by adding the beartype decorator. Fix type annotations to makes unit tests green. Profile: export `torchvision.models.alexnet(pretrained=True)` ``` with runtime type checking: 21.314 / 10 passes without runtime type checking: 20.797 / 10 passes + 2.48% ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/84091 Approved by: https://github.com/BowenBao	2022-08-27 04:40:41 +00:00
BowenBao	6a3666282d	[ONNX] Export node and value with scope name (#82040 ) Introduce `_jit_pass_onnx_assign_node_and_value_names` to parse and assign scoped name for nodes and values in exported onnx graph. Module layer information is obtained from `ONNXScopeName` captured in `scope` attribute in nodes. For nodes, the processed onnx node name are stored in attribute `onnx_name`. For values, the processed onnx output name are stored as `debugName`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82040 Approved by: https://github.com/AllenTiTaiWang, https://github.com/justinchuby, https://github.com/abock	2022-08-26 20:59:12 +00:00
Justin Chu	bf25a140f9	[ONNX] Add runtime type checking to `export` (#83673 ) This PR adds an internal wrapper on the [beartype](https://github.com/beartype/beartype) library to perform runtime type checking in `torch.onnx`. It uses beartype when it is found in the environment and is reduced to a no-op when beartype is not found. Setting the env var `TORCH_ONNX_EXPERIMENTAL_RUNTIME_TYPE_CHECK=ERRORS` will turn on the feature. setting `TORCH_ONNX_EXPERIMENTAL_RUNTIME_TYPE_CHECK=DISABLED` will disable all checks. When not set and `beartype` is installed, a warning message is emitted. Now when users call an api with invalid arguments e.g. ```python torch.onnx.export(conv, y, path, export_params=True, training=False) # traning should take TrainingModel, not bool ``` they get ``` Traceback (most recent call last): File "bisect_m1_error.py", line 63, in <module> main() File "bisect_m1_error.py", line 59, in main reveal_error() File "bisect_m1_error.py", line 32, in reveal_error torch.onnx.export(conv, y, cpu_model_path, export_params=True, training=False) File "<@beartype(torch.onnx.utils.export) at 0x1281f5a60>", line 136, in export File "pytorch/venv/lib/python3.9/site-packages/beartype/_decor/_error/errormain.py", line 301, in raise_pep_call_exception raise exception_cls( # type: ignore[misc] beartype.roar.BeartypeCallHintParamViolation: @beartyped export() parameter training=False violates type hint <class 'torch._C._onnx.TrainingMode'>, as False not instance of <protocol "torch._C._onnx.TrainingMode">. ``` when `TORCH_ONNX_EXPERIMENTAL_RUNTIME_TYPE_CHECK` is not set and `beartype` is installed, a warning message is emitted. ``` >>> torch.onnx.export("foo", "bar", "f") <stdin>:1: CallHintViolationWarning: Traceback (most recent call last): File "/home/justinchu/dev/pytorch/torch/onnx/_internal/_beartype.py", line 54, in _coerce_beartype_exceptions_to_warnings return beartyped(args, kwargs) File "<@beartype(torch.onnx.utils.export) at 0x7f1d4ab35280>", line 39, in export File "/home/justinchu/anaconda3/envs/pytorch/lib/python3.9/site-packages/beartype/_decor/_error/errormain.py", line 301, in raise_pep_call_exception raise exception_cls( # type: ignore[misc] beartype.roar.BeartypeCallHintParamViolation: @beartyped export() parameter model='foo' violates type hint typing.Union[torch.nn.modules.module.Module, torch.jit._script.ScriptModule, torch.jit.ScriptFunction], as 'foo' not <protocol "torch.jit.ScriptFunction">, <protocol "torch.nn.modules.module.Module">, or <protocol "torch.jit._script.ScriptModule">. Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/home/justinchu/dev/pytorch/torch/onnx/_internal/_beartype.py", line 63, in _coerce_beartype_exceptions_to_warnings return func(args, **kwargs) File "/home/justinchu/dev/pytorch/torch/onnx/utils.py", line 482, in export _export( File "/home/justinchu/dev/pytorch/torch/onnx/utils.py", line 1422, in _export with exporter_context(model, training, verbose): File "/home/justinchu/anaconda3/envs/pytorch/lib/python3.9/contextlib.py", line 119, in __enter__ return next(self.gen) File "/home/justinchu/dev/pytorch/torch/onnx/utils.py", line 177, in exporter_context with select_model_mode_for_export( File "/home/justinchu/anaconda3/envs/pytorch/lib/python3.9/contextlib.py", line 119, in __enter__ return next(self.gen) File "/home/justinchu/dev/pytorch/torch/onnx/utils.py", line 95, in select_model_mode_for_export originally_training = model.training AttributeError: 'str' object has no attribute 'training' ``` We see the error is caught right when the type mismatch happens, improving from what otherwise would become `AttributeError: 'str' object has no attribute 'training'` Pull Request resolved: https://github.com/pytorch/pytorch/pull/83673 Approved by: https://github.com/BowenBao	2022-08-25 21:24:37 +00:00
BowenBao	daca0ee5e2	[ONNX] Introduce ONNXScopeName (#82038 ) Update `_setup_trace_module_map` to always record module/layer info in `Scope` attribute for nodes. Extend `Scope` name to not only record module typename, but also module object variable name. Both names are formatted and stored as `name` attribute in `Scope`. Introduce `ONNXScopeName` class to manage the formatting and parsing. Updated local function export code adjusting to this update. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82038 Approved by: https://github.com/AllenTiTaiWang, https://github.com/justinchuby, https://github.com/abock, https://github.com/malfet	2022-08-22 20:49:21 +00:00
Justin Chu	e4f74f0891	[ONNX] Update the default opset to version 14 (#83284 ) Update the default opset by running the `update_default_opset_version.py` script. The update is done in a regularly to ensure we are in sync with the onnx updates. All changes are produced by the script. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83284 Approved by: https://github.com/AllenTiTaiWang, https://github.com/malfet, https://github.com/BowenBao	2022-08-18 19:13:38 +00:00
BowenBao	017ecb782d	[ONNX] Update legacy code, initialize onnx_shape_inference=True by default (#82767 ) Legacy code has onnx_shape_inference=False by default, which is misleading as every other export api sets it to True unless otherwise overriden by caller. There is only two tests that need updating according to this change. * test_utility_funs.py::test_constant_fold_shape. The resulting number of nodes in graph is increased by 1, due to that previously the extra constant node was added as initializer. * test_utility_funs.py::test_onnx_function_substitution_pass. Enabling onnx shape inference discovered discrepancy in test input shape and supplied dynamic axes arguments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82767 Approved by: https://github.com/justinchuby, https://github.com/abock	2022-08-10 21:50:13 +00:00
Justin Chu	f5701a1f9a	[ONNX] Remove unused patching methods (#83006 ) ### Description <!-- What did you change and why was it needed? --> Remove unused patching methods: - `torch._C.Graph.constant` - unpatch `torch._C.Node.__getitem__` and move the helper function to `symbolic_helper` Add typing annotations ### Issue <!-- Link to Issue ticket or RFP --> #76254 ### Testing <!-- How did you test your change? --> Unit tested Pull Request resolved: https://github.com/pytorch/pytorch/pull/83006 Approved by: https://github.com/BowenBao	2022-08-09 19:24:03 +00:00
qqaatw	9b4dc56c83	[ONNX] Fix quantization outputs' dtype (#79690 ) Part of #79263 Previously, all quantized PyTorch tensors are all casted to the dtypes which comply with ONNX's definition, i.e. `scale` is casted to `double`, and `zero_point` is casted to `int64`. These casts lead to inconsistent dtypes when comparing PyTorch's outputs and ONNX runtime's outputs. Now, `cast_onnx_accepted` argument is added to `unpack_quantized_tensor` function. When making example inputs for ONNX, we cast them to the ONNX compliant dtypes; otherwise, they are casted to PyTorch default types for quantization. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79690 Approved by: https://github.com/justinchuby, https://github.com/BowenBao	2022-08-09 18:32:03 +00:00
shubhambhokare1	95d873855e	[ONNX] Inline prim::PythonOp for Autograd Function Export (#74765 ) Add flag (inline_autograd) to enable inline export of model consisting of autograd functions. Currently, this flag should only be used in TrainingMode.EVAL and not for training. An example: If a model containing ``autograd.Function`` is as follows ``` class AutogradFunc(torch.autograd.Function): @staticmethod def forward(ctx, i): result = i.exp() result = result.log() ctx.save_for_backward(result) return result ``` Then the model is exported as ``` graph(%0 : Float): %1 : Float = ^AutogradFunc(%0) return (%1) ``` If inline_autograd is set to True, this will be exported as ``` graph(%0 : Float): %1 : Float = onnx::Exp(%0) %2 : Float = onnx::Log(%1) return (%2) ``` If one of the ops within the autograd module is not supported, that particular node is exported as is mirroring ONNX_FALLTHROUGH mode Fixes: #61813 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74765 Approved by: https://github.com/BowenBao, https://github.com/malfet	2022-08-03 23:30:19 +00:00
Huy Do	6ea422dd0b	Format torch/onnx with ufmt (#82137 ) This is the last batch for the new ufmt (black + usort) linter. After this, black linter can finally be replaced. The previous PR to format ONNX tests was #81335 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82137 Approved by: https://github.com/kit1980, https://github.com/AllenTiTaiWang	2022-07-25 22:42:21 +00:00
Justin Chu	d1d2687d34	[ONNX] Fix potentially unbound variables (#79789 ) Pylint alerts that some variables may be unbound. This PR fixes the errors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79789 Approved by: https://github.com/garymm	2022-06-29 17:01:49 +00:00

1 2 3 4 5 ...

293 Commits