pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Aaron Gokaslan	660e8060ad	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-22 23:16:38 +00:00
PyTorch MergeBot	d59a6864fb	Revert "[BE]: Update ruff to 0.285 (#107519 )" This reverts commit `88ab3e4322`. Reverted https://github.com/pytorch/pytorch/pull/107519 on behalf of https://github.com/ZainRizvi due to Sorry, but this PR breaks internal tests. @ezyang, can you please hep them get unblocked? It seems like one of the strings was prob accidentally modified ([comment](https://github.com/pytorch/pytorch/pull/107519#issuecomment-1688833480))	2023-08-22 19:53:32 +00:00
Aaron Gokaslan	88ab3e4322	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-20 01:36:18 +00:00
Jerry Zhang	28be2c674a	[quant][pt2e] Move specific quantizer related things outside of main quant code base (#106806 ) (#107259 ) Summary: Currently in quantizer/quantize_pt2e we import things from specific quantizers (XNNPACKQuantizer, QuantizationConfig) etc. this PR removes them so it's clearer that they are not part of the core quantization code base This PR also removed get_supported_operators from main Quantizer since we haven't seen a clear need for this API Test Plan: CIs Imported from OSS Differential Revision: D48340367 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107259 Approved by: https://github.com/kimishpatel	2023-08-18 21:29:09 +00:00
Jerry Zhang	d3c4ec767b	[quant][pt2e] Fix handling for SharedQuantizationSpec (#106922 ) Summary: Previously if we have: ``` conv1 -> cat conv2 / ``` and configure output of conv1/conv2 to be int8 quantized, and cat also int8 quantized and with shared inputs, it will not produce expected results (input of cat will not be shared) The problem is that there is some missing checks when inserting observers for input for cat This PR fixes the problem. Fixes: https://github.com/pytorch/pytorch/issues/106760 Test Plan: python tes/test_quantization.py TestQuantzePT2E.test_shared_qspec Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106922 Approved by: https://github.com/kimishpatel	2023-08-16 21:16:45 +00:00
Jerry Zhang	4afab40b56	[quant][pt2e] Removed mean/hardtanh annotations and refactored adaptive_avg_pool annotation (#106805 ) Summary: Removed annotations for some ops, since they are handled in torch/ao/quantization/pt2e/_propagate_annotation.py Test Plan: CIs Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106805 Approved by: https://github.com/kimishpatel	2023-08-10 04:51:06 +00:00
Jerry Zhang	97ce979e5d	[quant][pt2e] Add reference representation for quantized conv2d (#105784 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_quantize_dequantize_per_channel Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105784 Approved by: https://github.com/kimishpatel ghstack dependencies: #105783	2023-08-09 22:41:35 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	a44c072c89	Make InternalModel and Resnet work with rexportable flow (#106676 ) Summary: Internal model and Resnet uses "re-export" flow now. Also did some refactoring to make the code little cleaner Some changes for OSS: 1. Correctly use the "cached" fake tensors so that static symbols are still resolved to static 2. Change logic in PassBase to allocate static shapes for parameters 3. Add "is_torch_exported" tag to every node to make it survive during various graph transformations. 4. Added experimental wrapper API for quantization team to get pre_dispatch=True graph. Note that it doesn't actually do that right now. But we plan to switch soon. Test Plan: CI Differential Revision: D47890878 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106676 Approved by: https://github.com/jerryzh168	2023-08-09 20:10:48 +00:00
Jerry Zhang	e1a1780626	[quant][pt2e] Move annotate functions in XNNPACKQuantizer to utils (#106642 ) Summary: This is to allow sharing these annotate functions by other quantizers so that writing a new quantizer is easier note that these annotation functions will be maintained by XNNPACKQuantizer developers instead of AO team Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106642 Approved by: https://github.com/andrewor14	2023-08-09 18:52:39 +00:00
Jerry Zhang	69ecad6f2b	[quant][pt2e] Add reference representation for quantize_per_channel and dequantize_per_channel (#105783 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_quantize_dequantize_per_channel Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105783 Approved by: https://github.com/kimishpatel	2023-08-09 01:39:52 +00:00
Jiaxu Zhu	9e35df4adc	[pytorch][ao] force weight observer/fake_quant to be on the same device as the weight tensor (#106755 ) Summary: As title. There's a corner case where both cpu and gpu are avaiable, although the model is moved to cpu, the newly created PTQ weight observer is still on gpu. Therefore, during the convert, this line will fail https://fburl.com/4rhipfvb Test Plan: CI Differential Revision: D48141494 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106755 Approved by: https://github.com/jerryzh168	2023-08-09 00:22:49 +00:00
Jerry Zhang	2156f0434c	[quant][pt2e] Add reference representation for quantized adaptive_avg_pool2d (#105709 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_adaptive_avg_pool2d Although right now it is not really testing things since there is some problem with dynamo export Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105709 Approved by: https://github.com/andrewor14 ghstack dependencies: #105708	2023-08-04 18:49:14 +00:00
Jerry Zhang	9e301949ec	[quant][pt2e] Add reference representation for quantized max_pool2d (#105708 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_maxpool2d Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105708 Approved by: https://github.com/andrewor14	2023-08-04 08:19:52 +00:00
Jerry Zhang	820e68b58a	[quant][pt2e] Add reference representation for quantized add - relu (#105707 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_add_relu Although right now it is not really testing things since there is some problem with dynamo export Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105707 Approved by: https://github.com/andrewor14	2023-08-03 00:42:06 +00:00
Jerry Zhang	d528a137e0	[quant][pt2e][quantizer] Suppoert set_module_type in XNNPACKQuantizer (#106094 ) Summary: Added support to allow users to set configurations based on module type in XNNPACKQuantizer, can also serve as an example for implementing new quantizers Test Plan: python test/test_quantization.py TestQuantizePT2E.test_xnnpack_quantizer_set_module_type Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106094 Approved by: https://github.com/andrewor14 ghstack dependencies: #106087	2023-08-02 08:33:58 +00:00
Leon	850ad54139	correct spelling mistake (#106309 ) Fixes #ISSUE_NUMBER correct spelling mistake Pull Request resolved: https://github.com/pytorch/pytorch/pull/106309 Approved by: https://github.com/kit1980	2023-08-02 04:38:23 +00:00
Jerry Zhang	92a22a8098	[quant][pt2e][quantizer] Suppoert set_module_name in XNNPACKQuantizer (#106087 ) Summary: Added support to allow users to set configurations based on module name in XNNPACKQuantizer, can also serve as an example for implementing new quantizers Test Plan: python test/test_quantization.py TestQuantizePT2E.test_xnnpack_quantizer_set_module_name Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106087 Approved by: https://github.com/andrewor14	2023-08-02 01:19:23 +00:00
PyTorch MergeBot	93b2036bef	Revert "[quant][pt2e] store scale/zero_point as tensor attributes to support serialization (#105894 )" This reverts commit `3ca71ed735`. Reverted https://github.com/pytorch/pytorch/pull/105894 on behalf of https://github.com/huydhn due to breaking executorch tests internally ([comment](https://github.com/pytorch/pytorch/pull/105894#issuecomment-1654831950))	2023-07-28 01:16:02 +00:00
Edward Z. Yang	7b9d250f06	Change _dynamo.export to be export(f)(args, *kwargs) (#106109 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106109 Approved by: https://github.com/voznesenskym	2023-07-27 21:41:13 +00:00
Jerry Zhang	3ca71ed735	[quant][pt2e] store scale/zero_point as tensor attributes to support serialization (#105894 ) Summary: Currently scale/zero_point for per tensor quant is stored as burnt in literals, this means these values can't be serialized in state_dict, this PR changes them to buffers/Tensors so that they can be serialized Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D47770963](https://our.internmc.facebook.com/intern/diff/D47770963) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105894 Approved by: https://github.com/kimishpatel	2023-07-26 20:15:06 +00:00
Jerry Zhang	3a77f9aaaf	[quant][api] Move torch.ao.quantization.pt2e.quantizer to torch.ao.quantization.quantizer (#105885 ) Summary: moving quantizer to torch.ao.quantization to make it a public api, since pt2e is a folder for implementations Test Plan: CIs sanity check: "buck test //executorch/backends/xnnpack/test:test_xnnpack_quantized_models -- test_resnet18" Differential Revision: D47727838 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105885 Approved by: https://github.com/andrewor14	2023-07-26 18:20:09 +00:00
Jerry Zhang	d767cff7c7	[quant][fx] Fix docs for prepare_fx/prepare_qat_fx (#105979 ) Summary: Fixes: https://github.com/pytorch/pytorch/issues/103661 Test Plan: visual inspectation of docs https://pytorch.org/docs/2.0/generated/torch.ao.quantization.quantize_fx.prepare_fx.html#torch.ao.quantization.quantize_fx.prepare_fx Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105979 Approved by: https://github.com/andrewor14	2023-07-26 09:56:18 +00:00
Aaron Gokaslan	6d43c89f37	[BE]: Update Ruff to 0.0.280 (#105724 ) Removes unusued loop values in python dictionary iteration. Automated fix from Ruff master Pull Request resolved: https://github.com/pytorch/pytorch/pull/105724 Approved by: https://github.com/ezyang, https://github.com/janeyx99	2023-07-22 23:03:34 +00:00
Jerry Zhang	143c83d637	[quant][pt2e][be] Remove unneeded code (#105676 ) Summary: att Test Plan: CIs Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105676 Approved by: https://github.com/andrewor14	2023-07-21 00:51:22 +00:00
Jerry Zhang	dff4e034b8	[quant][pt2e][be] Rename qnnpack quantizer to xnnpack quantizer (#105551 ) Summary: att Test Plan: sandcastle CI and OSS CI Reviewed By: andrewor14 Differential Revision: D47422894 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105551 Approved by: https://github.com/andrewor14	2023-07-20 03:52:40 +00:00
Max Ren	bc6bca9d42	[XNNPACK][QS8] torch.slice (#105252 ) Differential Revision: [D47487423](https://our.internmc.facebook.com/intern/diff/D47487423/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105252 Approved by: https://github.com/digantdesai	2023-07-19 23:36:02 +00:00
leslie-fang-intel	fa6be2fa6f	[Quant][PT2E] Remove x86 inductor pt2e backend config (#105039 ) Summary For the Quantization PT2E path, we recommend to use `X86InductorQuantizer` instead of backend config of `x86_inductor_pt2e_backend_config`. Remove the `x86_inductor_pt2e_backend_config` and the relevant testing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105039 Approved by: https://github.com/jgong5, https://github.com/jerryzh168	2023-07-19 23:18:29 +00:00
Justin Chu	c0d8a4af0a	[BE] Enable ruff's UP rules and autoformat ao/ (#105430 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105430 Approved by: https://github.com/albanD, https://github.com/malfet	2023-07-19 13:44:37 +00:00
Jerry Zhang	554052f321	[quant][pt2e][be] Rename prepare_pt2e_quantizer to prepare_pt2e (#105484 ) Summary: att Test Plan: sandcastle and OSS CI Reviewed By: andrewor14 Differential Revision: D47422892 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105484 Approved by: https://github.com/andrewor14	2023-07-19 04:51:37 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	5666d20bb8	Add unlifting pass under private config (#104897 ) Summary: We wanna do this little by little. For now, I tried only on DissectedPartsModel which needs to use aot_export version. Test Plan: CI Reviewed By: zhxchen17 Differential Revision: D46785735 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104897 Approved by: https://github.com/JacobSzwejbka	2023-07-19 01:16:35 +00:00
maxren	88f1885ec9	[XNNPACK][QS8] torch.cat (#104800 ) Differential Revision: [D47304143](https://our.internmc.facebook.com/intern/diff/D47304143/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104800 Approved by: https://github.com/digantdesai	2023-07-19 00:15:05 +00:00
Nikita Shulga	78829d6e07	Fix `isinstance` check in `quat_utils` (#105476 ) Calling `isinstance(x, Tuple[Node, Node])` would either fail, or raise a type error on a more modern Python, as none of the tuples are actually instances of `Tuple` ```python >>> from typing import Tuple >>> from torch.fx import Node >>> edge_or_node=(Node(None, "foo", "output", "foo", None, None), Node(None, "bar", "output", "bar", None, None)) >>> isinstance(edge_or_node, tuple) and len(edge_or_node) == 2 and all(isinstance(x, Node) for x in edge_or_node) True >>> isinstance(edge_or_node, Tuple[Node, Node]) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/Users/malfet/miniconda3/lib/python3.10/typing.py", line 994, in __instancecheck__ return self.__subclasscheck__(type(obj)) File "/Users/malfet/miniconda3/lib/python3.10/typing.py", line 997, in __subclasscheck__ raise TypeError("Subscripted generics cannot be used with" TypeError: Subscripted generics cannot be used with class and instance checks ``` <!-- copilot:poem --> ### <samp>🤖 Generated by Copilot at 40fa451</samp> > _Fix type annotation_ > _Quantize nodes in the graph_ > _Autumn leaves falling_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/105476 Approved by: https://github.com/jerryzh168	2023-07-18 21:16:05 +00:00
Jerry Zhang	ed2b9f1af1	[quant][pt2e] rename _quantize_pt2e to quantize_pt2e (#105377 ) Summary: att Test Plan: CIs Reviewed By: andrewor14 Differential Revision: D47234357 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105377 Approved by: https://github.com/andrewor14	2023-07-18 16:46:05 +00:00
Nikita Shulga	5837e95d30	[Reland] Update mypy to 1.4.1 (#105227 ) This PR re-lands - [Typing] Fix PEP 484 Violation (#105022) - Update mypy to 1.4.1 (#91983) That were reverted due to the conflict with internal source repo. Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - Add assert it `torch/optim/optimizer.py` that Optional list is not None TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Unrelated, to bypass CI failures due to the gcc9 dependency update in Ubuntu-18.04: - Add hack to squash older libstdc++ from conda environment in favor one from OS to `.ci/docker/install_conda.sh` - Update bazel cuda builds to focal, as with libstdc++-6.0.32 bazel builds loose the ability to catch exceptions (probably because they link with cupti statically, but I could not found where it is done) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105227 Approved by: https://github.com/atalman, https://github.com/albanD, https://github.com/Skylion007	2023-07-15 20:30:20 +00:00
Jerry Zhang	7b4d080496	[quant][pt2e] Rename _pt2e to pt2e (#104668 ) Summary: X-link: https://github.com/pytorch/executorch/pull/3 att Test Plan: Imported from OSS Differential Revision: D47202807 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104668 Approved by: https://github.com/andrewor14	2023-07-15 06:34:17 +00:00
PyTorch MergeBot	15fd1ea118	Revert "[Reland] Update mypy to 1.4.1 (#105227 )" This reverts commit `c9c4f8efc3`. Reverted https://github.com/pytorch/pytorch/pull/105227 on behalf of https://github.com/atalman due to trying to mitigate ci sev #105248 ([comment](https://github.com/pytorch/pytorch/pull/105227#issuecomment-1636510935))	2023-07-14 22:28:35 +00:00
Nikita Shulga	c9c4f8efc3	[Reland] Update mypy to 1.4.1 (#105227 ) This PR re-lands - [Typing] Fix PEP 484 Violation (#105022) - Update mypy to 1.4.1 (#91983) That were reverted due to the conflict with internal source repo. Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - Add assert it `torch/optim/optimizer.py` that Optional list is not None TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105227 Approved by: https://github.com/atalman, https://github.com/albanD, https://github.com/Skylion007	2023-07-14 20:45:12 +00:00
PyTorch MergeBot	3c5a494d7a	Revert "Update mypy to 1.4.1 (#91983 )" This reverts commit `634659e262`. Reverted https://github.com/pytorch/pytorch/pull/91983 on behalf of https://github.com/malfet due to It's dependent change was reverted, so reverting this one as well, to keep CI clean ([comment](https://github.com/pytorch/pytorch/pull/91983#issuecomment-1636059709))	2023-07-14 15:59:16 +00:00
Jerry Zhang	90b50f0303	[quant][pt2e] change internal code to only import from _quantize_pt2e (#105162 ) Summary: This is to make public api clear so that we can make implementation details change easier in the future Test Plan: CIs Differential Revision: D47445767 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105162 Approved by: https://github.com/andrewor14	2023-07-14 05:14:29 +00:00
Tuan Tran	85745cd3d9	Fix bug in fuse_modules (#105069 ) Summary: This diff fixes the issue reported in https://github.com/pytorch/pytorch/issues/105063 and also related to internal caffe2 bug (reproduced error in internal fb pytorch: N3945540) Test Plan: Wait for sandcastle with the added unit test in caffe2/torch/ao/quantization/eager/test_fuse_eager Differential Revision: D47402357 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105069 Approved by: https://github.com/jerryzh168	2023-07-13 23:39:59 +00:00
Nikita Shulga	634659e262	Update mypy to 1.4.1 (#91983 ) Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/91983 Approved by: https://github.com/kit1980, https://github.com/ZainRizvi, https://github.com/huydhn, https://github.com/thiagocrepaldi, https://github.com/aaronenyeshi	2023-07-13 16:30:36 +00:00
Aaron Gokaslan	96b91ab248	Fix merged lintrunner error (#105005 ) Fixes lintrunner linter race condition. Follow up to #104917 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105005 Approved by: https://github.com/malfet, https://github.com/ezyang	2023-07-11 22:04:49 +00:00
Aaron Gokaslan	2f95a3d0fc	[BE]: Apply ruff PERF fixes to torch (#104917 ) Applies automated ruff fixes in the PERF modules and enables all automatic ones. I also updated ruff which applied some additional fixes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104917 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-07-11 20:45:21 +00:00
Andrew Or	4b29829ece	[quant][pt2] Fix QAT convert for mobilenetv2 (#104110 ) Summary: QAT convert for mobilenetv2 was previously not working because we incorrectly applied dropout during eval as well as training. This is because, for exported models, model.eval() does not change the behavior of dropout, unlike models with torch ops. This commit simulates the effects of model.eval() for exported models as well by replacing the aten dropout pattern before eval. As of this commit, end-to-end QAT numerics now match for mobilenetv2 between FX and PT2. Test Plan: python test/test_quantization.py TestQuantizePT2EModels.test_qat_mobilenet_v2 Differential Revision: D46750343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104110 Approved by: https://github.com/jerryzh168	2023-07-11 18:42:42 +00:00
maxren	332f2057df	[XNNPACK][QS8] torch.nn.ELU (#104307 ) Differential Revision: [D47075933](https://our.internmc.facebook.com/intern/diff/D47075933/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104307 Approved by: https://github.com/digantdesai	2023-07-11 00:35:13 +00:00
maxren	c4e084e3c7	[XNNPACK][QS8] torch.nn.ConstantPad2d (#104306 ) Differential Revision: [D47075932](https://our.internmc.facebook.com/intern/diff/D47075932/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104306 Approved by: https://github.com/digantdesai	2023-07-11 00:35:02 +00:00
maxren	2c960c73a3	[XNNPACK][QS8] torch.permute (#104305 ) Differential Revision: [D47075934](https://our.internmc.facebook.com/intern/diff/D47075934/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104305 Approved by: https://github.com/digantdesai	2023-07-11 00:34:58 +00:00
maxren	d41c4a8338	[XNNPACK][QS8] torch.clamp (#104304 ) Differential Revision: [D47075935](https://our.internmc.facebook.com/intern/diff/D47075935/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104304 Approved by: https://github.com/digantdesai	2023-07-11 00:34:58 +00:00
leslie-fang-intel	2a21469a77	[Quant][PT2E] Enable conv2d unary and binary recipe for x86 inductor quantizer (#98826 ) Summary - Recipe to annotate `conv2d_relu` for `X86InductorQuantizer` is added. - Recipe to annotate `conv2d_add` for `X86InductorQuantizer` is added. - Recipe to annotate `conv2d_add_relu` for `X86InductorQuantizer` is added. Test Plan ``` python -u -m pytest -s -v test_x86inductor_quantizer.py -k TestQuantizePT2EX86Inductor ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/98826 Approved by: https://github.com/jerryzh168	2023-07-04 00:01:10 +00:00
Kimish Patel	bd0f0f40a1	[PT2][Quant] Enable symbolic shape in linear quantization (#104473 ) When tracing with symbolic shapes, arbitrary sym_size nodes can appear in the graph. Earlier changes did not account for this and quantizer fails to annotate the right nodes. This diff fixes that by not annotating sym_size nodes, which should really not be relevant for quantization. As next steps, we should validate in quant workflow that a) sym_int nodes are not being quantized and b) add similar support, as this diff, for generic annotations Differential Revision: [D47132050](https://our.internmc.facebook.com/intern/diff/D47132050/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104473 Approved by: https://github.com/jerryzh168	2023-07-01 05:14:30 +00:00
Digant Desai	36c4dad197	[ET][XNNPACK] Add support for quantized LeakyReLU (#104309 ) Summary: Also adds support for backend_config Test Plan: `buck test fbcode//mode/dev-nosan fbcode//executorch/backends/xnnpack/test:` Reviewed By: mcr229 Differential Revision: D47043207 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104309 Approved by: https://github.com/salilsdesai, https://github.com/manuelcandales	2023-06-30 17:42:22 +00:00
Jerry Zhang	ecca9591d5	[quant][pt2e] Add reference representation for quantize/dequantize operators (#104395 ) Summary: Similar to quantized add, in this PR we added the reference represenation for quantize/dequantize operators Test Plan: buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_quantize (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_dequantize (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' Reviewed By: kimishpatel Differential Revision: D46959928 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104395 Approved by: https://github.com/andrewor14	2023-06-30 04:32:18 +00:00
leslie-fang-intel	945a257277	[Quant][PT2E] Supported customized _EQUIVALENT_TYPES in Module Partition API (#102516 ) Summary `Module Partition API` can simplify the pattern match process in Quantization annotation. However, current implementation of `Module Partition API` has hardcoded `_EQUIVALENT_TYPES` `999bae0f54/torch/ao/quantization/_pt2e/graph_utils.py (L13-L20)`. So, PyTorch Extension Libraries such as [intel-extension-for-pytorch](https://github.com/intel/intel-extension-for-pytorch) can't use `Module Partition API` with customized `_EQUIVALENT_TYPES` . In this PR, we plan to enable customized `_EQUIVALENT_TYPES` by pass in parameter. Test Plan ``` python -m pytest test_graph_utils.py -k test_customized_equivalet_types_dict ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/102516 Approved by: https://github.com/jgong5, https://github.com/kimishpatel	2023-06-28 00:20:25 +00:00
Jerry Zhang	c98896b76f	[quant][pt2e] Add more precise representation for quantized add (#104130 ) Summary: The planned e2e for quantization in pytorch 2.0 export is the following: float_model -> prepare_pt2e -> calibration -> convert_pt2e -> ... inside convert_pt2e, we will first produce a q/dq representation of the quantized model, similar to the previous output of convert_to_reference_fx in fx grah mode quantization: ``` torch.ops.quantized_decomposed.dequantize_per_tensor -> torch.ops.aten.add -> torch.ops.quantized_decomopsed.quantize_per_tensor torch.ops.quantized_decomposed.dequantize_per_tensor / ``` Then we'll rewrite the above to a more precise representation that express the intention in a more precise manner, since here we actually want to do int8 addition, instead of simulating the int8 addition with fp32 operations, the representation for quantized add is: ``` def quantized_add(x_i8, x_scale, x_zero_point, y_i8, y_scale, y_zero_point, out_scale, out_zero_point): x = (x_scale / out_scale) * x_i8 y = (y_scale / out_scale) * y_i8 out = x + y out -= (x_zero_point * x_scale - y_zero_point * y_scale) / out_scale out += out_zero_point return out ``` Test Plan: ``` buck2 test caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_representation_add (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' ``` Reviewed By: kimishpatel Differential Revision: D45628032 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104130 Approved by: https://github.com/kimishpatel	2023-06-27 20:11:30 +00:00
Digant Desai	ef285faeba	[ET][XNNPACK] Add support for quantized Multiply (#104134 ) Summary: Also adds support for backend_config with relu fusion since XNNPACK allows it. We should revisit the relu fusion once we gain more clarity on quantSrcPartition or some other way to do these fusion and not having to add all combinations. We should really rename the backend config to et_xnnpack.py or something TODO Test Plan: `buck test fbcode//mode/dev-nosan fbcode//executorch/backends/xnnpack/test:` Differential Revision: D46985169 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104134 Approved by: https://github.com/mcr229, https://github.com/salilsdesai	2023-06-27 16:59:28 +00:00
Digant Desai	bd8841101b	[ET][XNNPACK] Add support for quantized Sub (#104090 ) Summary: Also adds support for backend_config with relu fusion since XNNPACK allows it. We should revisit the relu fusion once we gain more clarity on quantSrcPartition or some other way to do these fusion and not having to add all combinations. We should really rename the backend config to et_xnnpack.py or something TODO Test Plan: `buck test fbcode//mode/dev-nosan fbcode//executorch/backends/xnnpack/test:` Differential Revision: D46924209 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104090 Approved by: https://github.com/mcr229	2023-06-26 16:32:15 +00:00
HDCharles	8176cd8c0f	[ao] fixing quantized prelu workflow (#103455 ) Summary: https://github.com/pytorch/pytorch/issues/100654 noticed prelu was not running its observers when the quantization flow was being run, this was a bug which is now fixed and the relevant prelu tests also now check for this. Also added a corrected observer for PReLU to qconfig_mapping Test Plan: python test/test_quantization.py TestStaticQuantizedModule.test_prelu Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/103455 Approved by: https://github.com/jerryzh168	2023-06-23 16:45:40 +00:00
Andrew Or	7320ef5651	[quant][pt2] Add prepare QAT test for mobilenetv2 (#104068 ) Summary: Prepare QAT for mobilenetv2 has matching numerics with FX. There were two changes needed to achieve this, however. First, this commit adds observer sharing for ReLU6, which is used extensively throughout this model. Second, in the tests we have to use the same manual seed every time we call the models in order to get the same results between FX and PT2. This is because there is a dropout at the end of the model. Test Plan: python test/test_quantization.py TestQuantizePT2EModels.test_qat_mobilenet_v2 Reviewed By: kimishpatel Differential Revision: D46707786 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104068 Approved by: https://github.com/jerryzh168	2023-06-23 16:34:25 +00:00
andrewor14	0d5f1cb666	[quant] Add torch.flatten to executorch backend_config (#103988 ) Summary: This is needed to make the short-term and long-term quantization numerics match for mobilenetv2. Test Plan: python test/test_quantization.py TestQuantizeFx Reviewers: jerryzh, kimishpatel Subscribers: jerryzh, kimishpatel Differential Revision: [D46909962](https://our.internmc.facebook.com/intern/diff/D46909962) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103988 Approved by: https://github.com/jerryzh168	2023-06-22 22:11:48 +00:00
Andrew Or	303ff84b04	[quant][pt2] Update special qspecs after QAT rewrite (#103970 ) Summary: Special qspecs like `SharedQuantizationSpec` and `DerivedQuantizationSpec` refer to other nodes in the graph. However, after subgraph rewriting in QAT, the nodes referred to in these special qspecs may be replaced by new nodes. This could lead to the following error when inserting observers according to these qspecs: ``` AssertionError: please make sure only refer to edge or node that has observer/fake_quant inserted: 'getitem' not in dict_keys([(arg0, convolution_default_1), (mul_tensor, convolution_default_1), getitem_3]) ``` This commit fixes this by keeping track of the nodes that are replaced during subgraph rewriting in QAT, and using this mapping to update the dangling references used in these special qspecs. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_update_shared_qspec Reviewed By: jerryzh168 Differential Revision: D46606614 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103970 Approved by: https://github.com/jerryzh168	2023-06-22 20:05:57 +00:00
Andrew Or	873f772df2	[quant][pt2] Fix QAT convert for resnet18 (#103759 ) Summary: Before this commit, only prepare QAT numerics matched between PT2 and FX for resnet18. Convert numerics diverged, however, for two reasons: (1) Existing patterns did not handle inplace ReLUs. This commit fixes this by adding extra patterns that use these ReLUs instead of the normal ones. (2) Subgraph rewriter could not handle skip connections in quantized models, because the dequantize node is used in both the conv node within the match pattern, and an inplace add node outside of the match pattern. This led the subgraph matcher to filter out the match, complaining that it was not self contained. This commit fixes this problem by duplicating the dequantize nodes, one for each user, such that subsequent matches will be self contained. Test Plan: python test/test_quantization.py TestQuantizePT2EModels.test_qat_resnet18 Reviewed By: jerryzh168 Differential Revision: D46564114 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103759 Approved by: https://github.com/jerryzh168	2023-06-21 15:36:07 +00:00
leslie-fang-intel	9832cfbbfe	Quantization oneDNN backend only support VNNI CPU (#103653 ) Summary - Update the quantization document that default qconfig with oneDNN backend is recommended to be used on CPUs with Vector Neural Network Instruction support. - Add the warning message when user uses default qconfig with oneDNN backend on CPU without Vector Neural Network Instruction support. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103653 Approved by: https://github.com/jgong5, https://github.com/malfet	2023-06-19 09:50:07 +00:00
leslie-fang-intel	dbc8eb2a8f	[Quant][PT2E]Enable x86 inductor quantizer (#98730 ) Summary - Enable `X86InductorQuantizer` basics. - Recipe to annotate conv2d is added. Test Plan ``` python -u -m pytest -s -v test_x86inductor_quantizer.py -k TestQuantizePT2EX86Inductor ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/98730 Approved by: https://github.com/jgong5, https://github.com/jerryzh168	2023-06-17 06:10:23 +00:00
Andrew Or	2bc56bec07	[quant][pt2] Handle literal conv args in convert QAT (#103731 ) Summary: Similar to the prepare case, we need to manually copy over literal conv args such as padding and stride to the new, replaced conv nodes, since these args are not captured by the subgraph rewriter. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_fusion_literal_args Reviewed By: jerryzh168 Differential Revision: D46383130 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103731 Approved by: https://github.com/jerryzh168	2023-06-16 17:15:37 +00:00
Andrew Or	dad29f906b	[quant][pt2] Fix no conv bias in convert QAT (#103298 ) Summary: Previously, the QAT pattern for conv + bn with no conv bias was not actually replaced in convert. This commit adds an extra pattern in the convert path for this case and the numerics now match FX's. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_fusion_no_conv_bias Reviewed By: jerryzh168 Differential Revision: D46382819 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103298 Approved by: https://github.com/jerryzh168	2023-06-16 01:59:48 +00:00
Kimish Patel	90ee6a7354	[PT2][Quant] Update op names for decomposed quantized lib (#103251 ) Summary: Dynamo trace, via dynamo.export, with aten_graph, generates graph with nodes whose target is an isntance of torch._ops.OpOverload. Quantization workflow inserting quantize/dequantize ops which are sometimes instances of torch._ops.OpOverload (quantize_per_tensor.tensor) while other times instances of torch._ops.OpOverloadPacket (quantizer_per_tensor) is a bit inconsistent. Also not sure if it is a valid exported model, if it has nodes with target of type torch._ops.OpOverloadPacket. Without op overload name attached to the 'target', it fails during executorch tracing. Reason is that executorch tracing expects node's targets to be instances of torch._ops.OpOverload and not torch._ops.OpOverloadPacket. So for consistency and tracing reasons, fixing convert pass to insert ops which are torch._ops.OpOverload Test Plan: CI Reviewed By: jerryzh168 Differential Revision: D46342822 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103251 Approved by: https://github.com/andrewor14	2023-06-15 04:37:58 +00:00
Piotr Sebastian Kluska	b4056ba744	chore: Update ModelReportObserver variables to buffers (#97971 ) This commit changes ModelReportObserver variables to buffers similar to other observers. This will allow for gathering data on other device than CPU. Moreover, updates InputWeightEqualizationDetector to compute weight stats that are on GPU Tested with running tests `test/quantization/fx/test_model_report_fx.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/97971 Approved by: https://github.com/vkuzo	2023-06-15 03:15:41 +00:00
Kimish Patel	49dcf48e66	[PT2][Quant] Change quat conv bn fusion code (#103556 ) Summary: Dynamo burn in scalars instead of keeping them on module. This results in quantize_per_tensor and dequantize_per_tensor nodes to have burnt in scale and zero point value, if we trace them scalar. Graph rewrite ignores literals and when match pattern is replaced with replacement pattern, we lose the scale/zp and other values from nodes in original graph and instead get one from replacement graph. This diff fixes that for q/dq per tensor node by manually copying these values over. Note that this is not robust because it works only when there is only a single q/dq node Test Plan: quantization_pt2e Reviewed By: andrewor14 Differential Revision: D46614000 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103556 Approved by: https://github.com/andrewor14	2023-06-14 18:37:43 +00:00
Jerry Zhang	0cd155b042	[reland][quant][pt2e] Annotate GRU module (#103358 ) (#103526 ) Summary: att, we use module partition API to identify the GRU submodule and annotate all necessary patterns Test Plan: buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' Differential Revision: D46689428 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103526 Approved by: https://github.com/andrewor14	2023-06-13 23:43:10 +00:00
PyTorch MergeBot	13777e3391	Revert "[quant][pt2e] Annotate GRU module (#103358 )" This reverts commit `23892d8ee4`. Reverted https://github.com/pytorch/pytorch/pull/103358 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/103358#issuecomment-1588729657))	2023-06-13 07:45:40 +00:00
Jerry Zhang	23892d8ee4	[quant][pt2e] Annotate GRU module (#103358 ) Summary: att, we use module partition API to identify the GRU submodule and annotate all necessary patterns Test Plan: buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' Reviewed By: kimishpatel Differential Revision: D46384329 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103358 Approved by: https://github.com/HDCharles	2023-06-13 04:10:13 +00:00
Yash Vardhan	6ed3c4499a	Fix fuse_custom_config_dict arg from being None (#102154 ) `fuse_custom_config_dict` in [fuse_modules.py](https://github.com/pytorch/pytorch/blob/main/torch/ao/quantization/fuse_modules.py#L164) being passed as None even if a fuse_custom_config_dict is provided. This patch fixes the `fuse_custom_config_dict` from being passed as None. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102154 Approved by: https://github.com/kit1980	2023-06-13 03:45:20 +00:00
maxren	f37be77813	[Quant][XNNPACK] Delegate add_relu fusion (#103266 ) Quantized Resnet currently sees fused add-relu ``` --> dq \ add --> relu --> quant / --> dq ``` Let us support this fusion in the delegate as xnnpack can use the output_min and output_max of the op nodes to clamp the values and perform a fused add - relu operation Differential Revision: [D45258028](https://our.internmc.facebook.com/intern/diff/D45258028/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103266 Approved by: https://github.com/jerryzh168	2023-06-12 04:35:29 +00:00
Andrew Or	89d57f269f	[quant][pt2] Fix convert in Conv + BN + ReLU QAT fusion (#102993 ) Summary: Previously, the QAT pattern for conv + bn + relu was not actually replaced in convert. This is because the quantized QAT pattern used in convert doesn't actually have a relu node. This commit adds this extra pattern in the convert path and the numerics now match FX's. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_relu_numerics Reviewed By: jerryzh168 Differential Revision: D46372411 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102993 Approved by: https://github.com/jerryzh168	2023-06-08 22:10:29 +00:00
Kimish Patel	a49aefdce2	[PT2][Quant] In linear partition include functional.linear (#103186 ) Summary: as title Test Plan: tested in subsequent diff Reviewed By: jerryzh168 Differential Revision: D46342824 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103186 Approved by: https://github.com/jerryzh168	2023-06-08 09:48:09 +00:00
Kimish Patel	471407cf78	[PT2][Quant] Use composble quantizer for embedding + static conv + dynamic (#103116 ) Summary: In this diff we test a module that does a) emedding lookup b) runs 1D (converted to 2D) conv and c) runs linear on the output of 1d conv. a is quantized using embedding quantizer. c is quantized using dynamic quantization. b is quantized using static quantization. We compose quantizer from [a, c, b]. Tested it against similar fx config. Test Plan: test_embedding_conv_linear_quantization Reviewed By: jerryzh168 Differential Revision: D46267688 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103116 Approved by: https://github.com/jerryzh168	2023-06-07 17:34:59 +00:00
Kimish Patel	8e0837cf84	[PT2][Quant] Move embedding quantization to osss (#103088 ) Summary: This is in preperation to enable embeddign quantization on models with embeddings. Test Plan: test_embedding_quantizer Reviewed By: jerryzh168 Differential Revision: D46267689 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103088 Approved by: https://github.com/andrewor14	2023-06-06 23:07:57 +00:00
Xuan Xie	6261055471	dst_bin_of_end_center is defined twice (#102755 ) (line 995 and line 1011) both definations are the same. Delete one of them. Fixes Pull Request resolved: https://github.com/pytorch/pytorch/pull/102755 Approved by: https://github.com/janeyx99	2023-06-06 21:17:07 +00:00
Kimish Patel	8824101fb6	[PT2][Quant] Introduce composable quantizer (#102846 ) Summary: Using composable quantizer, we can now composable two or more quantizers. In the test here we compose quantizer configured with dynamic linear quantization, with quantizer configured for static quantization. Note that composable quantizer has strict order in which annotations are applied Test Plan: test_composable_quantizer* Reviewed By: jerryzh168 Differential Revision: D46267690 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102846 Approved by: https://github.com/andrewor14	2023-06-06 14:01:55 +00:00
Jerry Zhang	5fbbae4283	[quant][pt2e][be] Cleanup prepare function in _pt2e (#103022 ) Summary: att Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' ``` Differential Revision: D46346087 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103022 Approved by: https://github.com/andrewor14	2023-06-06 04:33:05 +00:00
Andrew Or	604a414bfc	[quant][pt2] Fix convert in Conv + BN QAT fusion (#102224 ) Summary: Previously, the test for the convert flow in Conv + BN QAT fusion was not enabled by mistake. However, reenabling this test uncovered several bugs: (1) The replaced nodes returned by subgraph rewriter were not handled correctly. This is because a recent change in the subgraph rewriter (#100556) fixed only the prepare case but not the convert case. This commit brings this fix to the convert case as well and deduplicates some code between the two cases. (2) When folding BN into conv, we used the wrong arg index to get the BN eps value. This resulted in an incorrect conv weight. (3) In FX, we currently do a hack for weighted modules where we observe the weights once in convert in order to ensure we get the right shapes for these weight observers. This caused the numerics to diverge between PT2 and FX. This commit fixes this by skipping this unnecessary hack for `_convert_to_reference_decomposed_fx`. (4) Per channel support was simply missing. This commit adds support for this by matching the quantize_per_channel and dequantize_per_channel ops in addition to the existing ones. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_numerics Reviewed By: jerryzh168 Differential Revision: D46097783 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102224 Approved by: https://github.com/jerryzh168	2023-06-05 18:09:28 +00:00
Jerry Zhang	eb0971cfe9	[quant][pt2e][be] Remove _input_output_share_observers and _reuse_input_obs_or_fq from QuantizationAnnotation (#102854 ) Summary: att, after we support SharedQuantizationSpec we don't need these things anymore, this PR refactors the uses of _input_output_share_observers to SharedQuantizationSpec Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' ``` Reviewed By: andrewor14 Differential Revision: D46301342 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102854 Approved by: https://github.com/andrewor14	2023-06-03 07:31:09 +00:00
Kimish Patel	a53acafd2b	[PT2][Quant] Enable dynamic quantization (#102703 ) Enable dynamic quantization of linear layers. Differential Revision: [D46235070](https://our.internmc.facebook.com/intern/diff/D46235070/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102703 Approved by: https://github.com/andrewor14	2023-06-02 17:52:14 +00:00
Kimish Patel	2301b624ae	[PT2][Quant] Update quconfig to contain input/qoutput activation qspec (#102702 ) As title Differential Revision: [D46342823](https://our.internmc.facebook.com/intern/diff/D46342823/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102702 Approved by: https://github.com/andrewor14	2023-06-02 17:41:46 +00:00
Kimish Patel	6492b7d22e	[PT2][Quant][BE] Refactor qnnpack_quantizer.py (#102701 ) This diff refactors annotate functions so as to couple annotate functions with corresponding quantization configs that they support. This will help in dynamic quantization which is only supported for linear layers Differential Revision: [D46235071](https://our.internmc.facebook.com/intern/diff/D46235071/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102701 Approved by: https://github.com/jerryzh168	2023-06-02 17:14:56 +00:00
Jerry Zhang	ce8d31551b	[quant][be] Change return type for zero_point to be int32 Tensor (#102234 ) Summary: This is probably a typo Test Plan: CI Reviewed By: salilsdesai Differential Revision: D46172706 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102234 Approved by: https://github.com/salilsdesai	2023-06-01 18:30:44 +00:00
Jerry Zhang	d930bfc419	[quant][pt2e][be] Add QuantizationSpecBase (#102582 ) Summary: Make all quantization spec to inherit from the same base class in order to simplify the typing for QuantizationAnnotation Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' ``` Reviewed By: kimishpatel Differential Revision: D46173954 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102582 Approved by: https://github.com/andrewor14	2023-06-01 17:55:22 +00:00
Jerry Zhang	f14ac74fce	[quant][pt2e] Add support for FixedQParamsQuantizationSpec (#102439 ) Summary: This PR adds support for FixedQParamsQuantizationSpec: ``` dataclass(eq=True, frozen=True) class FixedQParamsQuantizationSpec(QuantizationSpecBase): dtype: torch.dtype scale: float zero_point: int quant_min: Optional[int] = None quant_max: Optional[int] = None qscheme: Optional[torch.qscheme] = None ``` This is useful to define quantization spec for operators like sigmoid which has predefined and fixed scale/zero_point Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' buck2 test mode/opt caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_fixed_qparams_qspec (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' ``` Reviewed By: kimishpatel Differential Revision: D46153082 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102439 Approved by: https://github.com/kimishpatel	2023-05-30 21:28:13 +00:00
Kimish Patel	af70fe9f3e	[PT2][Quant] Enable test_qnnpack_quantizer_conv_linear test (#102399 ) Earlier this test was disabled due to pattern matching not working correctly. Enablign this test now since we moved to module partitioner based matching. Differential Revision: [D46130722](https://our.internmc.facebook.com/intern/diff/D46130722/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102399 Approved by: https://github.com/jerryzh168	2023-05-28 06:44:16 +00:00
Kimish Patel	0d876f7d43	[PT2][Quant] Move observer sharing ops to use module partitions (#102398 ) As title Differential Revision: [D46095331](https://our.internmc.facebook.com/intern/diff/D46095331/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102398 Approved by: https://github.com/jerryzh168	2023-05-28 05:50:15 +00:00
Kimish Patel	9fac5afbcc	[PT2][Quant] Move add/add relu pattern via module partitioner (#102397 ) This diff uses module partitioners to find add and add + relu patterns. Differential Revision: [D46095330](https://our.internmc.facebook.com/intern/diff/D46095330/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102397 Approved by: https://github.com/jerryzh168	2023-05-28 05:47:43 +00:00
Kimish Patel	3d8f405022	[PT2][Quant] Move maxpool_2d quant to use module partitioners (#102396 ) As summary Differential Revision: [D46095332](https://our.internmc.facebook.com/intern/diff/D46095332/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102396 Approved by: https://github.com/jerryzh168	2023-05-28 05:44:54 +00:00
Kimish Patel	d997e3aac6	[PT2][Quant] Use module partitions for conv2d and conv2d + relu (#102395 ) In this diff we continue to use source partition for identifying node patterns to annotate. Here we expand the usecase for conv2d+relu and conv2d Differential Revision: [D46095329](https://our.internmc.facebook.com/intern/diff/D46095329/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102395 Approved by: https://github.com/jerryzh168	2023-05-28 05:40:45 +00:00
Kimish Patel	4cb6add471	[PT2][Quant] Use module partition for fused patterns (#102394 ) This diff introduces utility `find_sequential_partitions`. This utility allows one to specify sequential pattern of nn.Module/nn.functional and returns a list. Each item in the list contains a List[SourcePartition] that represents sequentially connected partitions that are of the pattern requested. For example `find_sequential_partitions(model, [nn.Conv2d, nn.ReLU])` will find all nn.Conv2d and nn.ReLU partitions that are sequentially connected. Furthmore, move to using `find_sequential_partitions` for conv_bn/conv_bn_relu for QAT. Differential Revision: [D45948057](https://our.internmc.facebook.com/intern/diff/D45948057/) NOTE FOR REVIEWERS: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D45948057/)! Pull Request resolved: https://github.com/pytorch/pytorch/pull/102394 Approved by: https://github.com/jerryzh168	2023-05-28 05:29:16 +00:00
Jerry Zhang	eda5abf5e0	[quant][pt2e] Fix propagate_annotation after recent refactors (#102422 ) Summary: Recently we changed the annotation from "target_dtype_info" to "quantization_annotation" and introduced QuantizationAnnotation API and SharedQuantizationSpec API for users to convey sharing between input/outputs, this PR updates the _propagate_annotation pass to accommadate the recent changes Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' ``` Reviewed By: kimishpatel Differential Revision: D46153084 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102422 Approved by: https://github.com/kimishpatel	2023-05-27 16:01:47 +00:00
Jerry Zhang	23223402eb	[quant][pt2e] Add Support for DerivedQuantizationSpec (#102282 ) Summary: ``` """ 4. DerivedQuantizationSpec this is the quantization spec for the Tensors whose quantization parameters are derived from other Tensors """ class DerivedQuantizationSpec(QuantizationSpecBase): # specifies which Tensors the quantization parameters are derived from # this can either be an edge from argument to node, or a node derived_from: List[EdgeOrNode] derive_qparams_fn: Callabale[List[ObserverOrFakeQuantize], Tuple[Tensor, Tensor]] ... ``` Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' buck2 test mode/opt caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Reviewed By: kimishpatel Differential Revision: D46097855 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102282 Approved by: https://github.com/andrewor14	2023-05-27 00:24:39 +00:00
Jerry Zhang	ed87508b32	[quant][pt2e] Add support for SharedQuantizationSpec (#102184 ) Summary: This PR adds support for SharedQuantizationSpec, it's used to express the sharing between two Tensors in the prepared graph, the Tensor will either be input of some node (expressed as a Tuple of fx nodes) or output of some node (expressed as an fx Node) Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- 'caffe2/test:quantization_pt2e' buck2 test mode/opt caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Differential Revision: D46043026 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102184 Approved by: https://github.com/kimishpatel, https://github.com/leslie-fang-intel	2023-05-25 17:31:59 +00:00
Riley Dulin	424c930f76	Add quantization lowering for nn.PixelShuffle and nn.PixelUnshuffle (#101926 ) Similar to https://github.com/pytorch/pytorch/pull/96160 but for the modules nn.PixelShuffle and nn.PixelUnshuffle. torch.nn.PixelUnshuffle accepts both float and quantized inputs. However, previously we would unnecessarily dequantize quantized inputs into floats before passing them to the function. This commit fixes this by lowering the pattern [dequant - PixelShuffle - quant]. [dequant - PixelUnshuffle - quant]. Test Plan: python test/test_quantization.py TestQuantizeFxOps.test_pixel_shuffle_module python test/test_quantization.py TestQuantizeFxOps.test_pixel_unshuffle_module Pull Request resolved: https://github.com/pytorch/pytorch/pull/101926 Approved by: https://github.com/jerryzh168	2023-05-24 19:33:26 +00:00
Jerry Zhang	3baa67caee	[quant][pt2e][be] Move annotate helper function to quantizer/utils.py (#102127 ) Summary: att Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Reviewed By: kimishpatel Differential Revision: D46001285 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102127 Approved by: https://github.com/kimishpatel	2023-05-24 16:13:28 +00:00
Matthew Hoffman	29da75cc55	Enable mypy allow redefinition (#102046 ) Related #101528 I tried to enable this in another PR but it uncovered a bunch of type errors: https://github.com/pytorch/pytorch/actions/runs/4999748262/jobs/8956555243?pr=101528#step:10:1305 The goal of this PR is to fix these errors. --- This PR enables [allow_redefinition = True](https://mypy.readthedocs.io/en/stable/config_file.html#confval-allow_redefinition) in `mypy.ini`, which allows for a common pattern: > Allows variables to be redefined with an arbitrary type, as long as the redefinition is in the same block and nesting level as the original definition. `allow_redefinition` allows mypy to be more flexible by allowing reassignment to an existing variable with a different type... for instance (from the linked PR): `4a1e9230ba/torch/nn/parallel/data_parallel.py (L213)` A `Sequence[Union[int, torch.device]]` is narrowed to `Sequence[int]` thru reassignment to the same variable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102046 Approved by: https://github.com/ezyang	2023-05-24 07:05:30 +00:00
Jerry Zhang	94ed26d177	[quant][pt2e] prepare_pt2e use quantization spec directly (#102054 ) Summary: In this PR we aligned with the design of annotation API and uses quantization spec directly for annotation. main change is in prepare, we consume quantization_spec object directly instead of the observer or fake quant constructor, we create the constructor inside prepare, and annotation api users only need to interact with quantization spec object after this PR Test Plan: ``` buck2 test mode/opt caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Reviewed By: kimishpatel Differential Revision: D45934088 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102054 Approved by: https://github.com/kimishpatel	2023-05-23 23:25:56 +00:00
Jerry Zhang	f7c736e1e7	[quant][pt2e] Add observer_or_fake_quant_ctr to QuantizationSpec (#101920 ) Summary: This is the second refactor to align the annotation API with design, next step is to change prepare_pt2e to consume QuantizationSpec object directly Test Plan: ``` buck2 test mode/optcaffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Reviewed By: kimishpatel Differential Revision: D45927416 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101920 Approved by: https://github.com/andrewor14	2023-05-23 05:48:23 +00:00
Jerry Zhang	15495f2d96	[quant][pt2e] Introduce QuantizationAnnotation API (#101708 ) Summary: This diff adds QuantizationAnnotation and also refactors the existing annotation to use this object ``` dataclass class QuantizationAnnotation: # How some input nodes should be quantized, expressed as QuantizationSpec # a map from torch.fx.Node to QuantizationSpec input_qspec_map: Dict[Node, QuantizationSpec] # How the output of this node is quantized, expressed as QuantizationSPec output_qspec: QuantizationSpec class QuantizationSpec: dtype: torch.dtype is_dynamic: bool = False quant_min: Optional[int] = None quant_max: Optional[int] = None qscheme: Optional[torch.qscheme] = None ch_axis: Optional[int] = None # TODO: follow up PR will add this # Kind of observer such as MinMaxObserver, PerChannelHistogramObserver etc. # observer_or_fake_quant_type: Union[ObserverBase, FakeQuantizeBase] ``` Example after full refactor: ``` int8_qspec = QuantizationSpec(dtype=torch.int8, ...) weight_qspec = QuantizationSpec(dtype=torch.int8, ...) conv_node["quantization_annotation"] = QuantizationAnnotation( input_qspec_map={input_node: int8_qspec, weight_node: weight_qspec} output_qspec=int8_qspec, ) ``` Note: right now input_qspec_map and output_qspec map are still using observer and fake quant constructors. Follow up PR: change the input_qspec_map and output_qspec to use QuantizationSpec directly Test Plan: ``` buck2 test mode/optcaffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' ``` Differential Revision: D45895027 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101708 Approved by: https://github.com/andrewor14	2023-05-19 22:54:27 +00:00
Nitin Jain	556bb691fd	[AO]Fix observed LSTM layer setup individually observed LSTM (#101299 ) Summary: We have found that `_get_lstm_with_individually_observed_parts()` is missing setup step which sets up the LSTM layer state initializing weights and biases of this layer. This diff fixes the observed numerical discrepancy seen by CTRL team in using the above API. Test Plan: N3358643 Differential Revision: D45821681 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101299 Approved by: https://github.com/andrewor14	2023-05-18 19:15:01 +00:00
andrewor14	8e51521cee	[quant][pt2] Handle maxpool + conv + bn case in prepare QAT (#100941 ) Summary: This commit fixes a bug where we copy the metadata from the wrong node after replace_pattern. This happened in the case of [maxpool -> getitem1 -> conv -> bn -> getitem2], where `getitem1` is the placeholder node fed into the fused conv + bn pattern, and we incorrectly copied the metadata from `getitem1` instead of from `getitem2`. We fix this bug by filtering out the placeholder nodes before doing the metadata copying. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_fusion_getitem_placeholder Reviewers: jerryzh168, kimishpatel Differential Revision: [D45916751](https://our.internmc.facebook.com/intern/diff/D45916751) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100941 Approved by: https://github.com/jerryzh168	2023-05-17 17:36:32 +00:00
Kimish Patel	07e759eca2	[PT2][Quant] Move to module partitioner for linear pattern quantization (#101122 ) Subgraph matcher is somewhat unreliable as the pattern can vary depending on the dimensionality of input tensor used to trace _and_ what appears before linear Differential Revision: [D45713915](https://our.internmc.facebook.com/intern/diff/D45713915/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101122 Approved by: https://github.com/jerryzh168	2023-05-17 15:47:08 +00:00
Kimish Patel	2c807a4acf	[PT2][Quant] Remove None annotations (#101120 ) None annotations are not needed anymore. Remove them. Differential Revision: [D45713917](https://our.internmc.facebook.com/intern/diff/D45713917/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101120 Approved by: https://github.com/jerryzh168	2023-05-17 14:38:34 +00:00
Angela Yi	9e023e1818	[fx] Better replacements finder in subgraph rewriter (#100556 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100556 Approved by: https://github.com/mcr229	2023-05-16 14:08:44 +00:00
andrewor14	964e61ee95	[quant][pt2] Handle no conv bias in prepare QAT fusion (#100610 ) Summary: This commit adds support for conv + BN fusion for the case where conv has no bias. Since the replacement patterns with and without conv bias are substantially different, we perform the replacement for each of these two cases separately. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_fusion_no_conv_bias Reviewers: jerryzh168, kimishpatel Differential Revision: [D45743510](https://our.internmc.facebook.com/intern/diff/D45743510) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100610 Approved by: https://github.com/jerryzh168	2023-05-16 04:05:53 +00:00
PyTorch MergeBot	13056ca229	Revert "[fx] Better replacements finder in subgraph rewriter (#100556 )" This reverts commit `9842d1ef94`. Reverted https://github.com/pytorch/pytorch/pull/100556 on behalf of https://github.com/izaitsevfb due to Reverting temporarily to unblock diff train, see D45743510 and #100610 ([comment](https://github.com/pytorch/pytorch/pull/100556#issuecomment-1548934932))	2023-05-16 03:50:06 +00:00
Angela Yi	9842d1ef94	[fx] Better replacements finder in subgraph rewriter (#100556 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100556 Approved by: https://github.com/mcr229	2023-05-15 20:00:59 +00:00
andrewor14	4434b9af6a	[quant][pt2] Handle constant conv args in prepare QAT fusion (#100525 ) Summary: Previously, we would only match and replace conv + BN patterns with default constant args for conv (stride, padding, dilation etc.). If the user sets one of these args to values that are different from the default, we would simply not fuse the pattern. This is due to a limitation in the subgraph rewriter: see https://github.com/pytorch/pytorch/issues/100419. This commit works around the above limitation by first configuring the subgraph rewriter to ignore literals when matching, and then manually copy over the constant args to the new subgraph after `replace_pattern`. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_fusion_constant_args Reviewers: jerryzh168, kimishpatel Differential Revision: [D45515437](https://our.internmc.facebook.com/intern/diff/D45515437) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100525 Approved by: https://github.com/jerryzh168	2023-05-12 19:15:47 +00:00
leslie-fang-intel	a66de845de	[Quant][PT2E]Fix pt2e quantization maxpool input observer issue (#100961 ) Summary Fix the issue https://github.com/pytorch/pytorch/issues/100959. The root cause is for node of `torch.ops.aten.max_pool2d_with_indices.default`, there are 2 output node as output tensor and max indices. So in its `node.meta["val"]` is a tuple of `FakeTensors` (For example: `'val': (FakeTensor(..., size=(1, 2, s1, s1)), FakeTensor(..., size=(1, 2, s1, s1), dtype=torch.int64))`). It will fail the check of inserting observer since which only accept one `FakeTensor` case. Test Plan ``` python -m pytest test_quantize_pt2e.py -k test_max_pool2d_quantizer ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/100961 Approved by: https://github.com/jerryzh168, https://github.com/jgong5	2023-05-11 06:14:34 +00:00
Jerry Zhang	058d740f59	[reland][quant][pt2e] Change input act annotation to a map and allow dynamic quantization for non zeroth argument (#101005 ) (#101041 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/101005 Previously the node annotation looks like the following: ``` node.meta["..."] = { "input_act_obs_or_fq_ctr": ..., "weight_obs_or_fq_ctr": ..., "weight_index": 1, } ``` Basically we need specifiy the index for weight and also have a separate key for weight config, in this PR we changed that to: ``` node.meta["..."] = { "input_act_obs_or_fq_ctr_map": {input_node: ..., weight_node: ...}, } ``` This can support specifying the observer/fake quant constructor for any argument of the node Test Plan: buck2 test @//mode/opt //caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' Differential Revision: D45719781 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101041 Approved by: https://github.com/andrewor14	2023-05-10 17:43:21 +00:00
PyTorch MergeBot	2241aaa60c	Revert "[quant][pt2e] Change input act annotation to a map and allow dynamic quantization for non zeroth argument (#101005 )" This reverts commit `f08ddae888`. Reverted https://github.com/pytorch/pytorch/pull/101005 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/101005#issuecomment-1541143426))	2023-05-10 01:27:47 +00:00
Jerry Zhang	f08ddae888	[quant][pt2e] Change input act annotation to a map and allow dynamic quantization for non zeroth argument (#101005 ) Summary: Previously the node annotation looks like the following: ``` node.meta["..."] = { "input_act_obs_or_fq_ctr": ..., "weight_obs_or_fq_ctr": ..., "weight_index": 1, } ``` Basically we need specifiy the index for weight and also have a separate key for weight config, in this PR we changed that to: ``` node.meta["..."] = { "input_act_obs_or_fq_ctr_map": {input_node: ..., weight_node: ...}, } ``` This can support specifying the observer/fake quant constructor for any argument of the node Test Plan: buck2 test @//mode/opt //caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_resnet18_with_quantizer_api (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2EModels)' Reviewed By: kimishpatel Differential Revision: D45553195 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101005 Approved by: https://github.com/kimishpatel	2023-05-10 00:42:25 +00:00
Jerry Zhang	c3f3cb5b0f	[quant][pt2e] Support conv bn fusion in convert step for QAT flow (#100442 ) Summary: This PR adds support for folding bn weights into conv for QAT flow, this is equivalent to the QAT branch of `from_float` in eager mode quantized conv module: https://github.com/pytorch/pytorch/blob/main/torch/ao/nn/quantized/modules/conv.py#L223 Items that needs followup: * there are some workaround I did because quantize_per_tensor is using float/int args and dynamo does not support these args, need to fix after we change the quantized model representation and also change these args to Tensor Test Plan: buck2 test @//mode/opt //caffe2/test:quantization_pt2e -- --exact 'caffe2/test:quantization_pt2e - test_convert_qat_conv_bn_fusion (quantization.pt2e.test_quantize_pt2e.TestQuantizePT2E)' Reviewed By: andrewor14 Differential Revision: D45344281 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100442 Approved by: https://github.com/kimishpatel	2023-05-09 19:43:51 +00:00
Aaron Gokaslan	8769fb854d	[BE] Fix flake8 B027 errors - missing abstractmethod decorator (#100715 ) Enables B027 and applies fixes by adding abstract method decorators. Autofix generated by ruff master. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100715 Approved by: https://github.com/ezyang	2023-05-09 17:28:48 +00:00
andrewor14	4154c8ea15	[quant][pt2] Add Conv + BN + ReLU fusion for prepare QAT (#100283 ) Summary: This follows https://github.com/pytorch/pytorch/pull/98568, which lays all the groundwork for Conv + BN fusion in prepare QAT. Conv + BN + ReLU fusion can reuse the same match and replace patterns and is handled similarly. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_relu_fusion python test/test_quantization.py TestQuantizePT2E.test_prepare_qat_conv_bn_relu_numerics Reviewers: kimishpatel, jerryzh168 Differential Revision: [](https://our.internmc.facebook.com/intern/diff/) Differential Revision: [D45515494](https://our.internmc.facebook.com/intern/diff/D45515494) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100283 Approved by: https://github.com/jerryzh168	2023-05-07 20:35:16 +00:00
Kimish Patel	24e9b8f5f4	[PT2E][Quant] Use subgraph matcher annotate linear pattern (#100566 ) This diff adds subgraph matcher for pattern matching. Furthermore, we also move annotations for the matched subgraph in a way that only input and output nodes of the matched subgraph have quantization related valid annotations. Differential Revision: [D45535539](https://our.internmc.facebook.com/intern/diff/D45535539/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100566 Approved by: https://github.com/jerryzh168	2023-05-04 21:31:59 +00:00
Richard Barnes	6370ac0251	[codemod] Replace hasattr with getattr in caffe2/torch/ao/quantization/stubs.py (#100597 ) Summary: The pattern ``` X.Y if hasattr(X, "Y") else Z ``` can be replaced with ``` getattr(X, "Y", Z) ``` The [getattr](https://www.w3schools.com/python/ref_func_getattr.asp) function gives more succinct code than the [hasattr](https://www.w3schools.com/python/ref_func_hasattr.asp) function. Please use it when appropriate. This diff is very low risk. Green tests indicate that you can safely Accept & Ship. Test Plan: Sandcastle Reviewed By: vkuzo Differential Revision: D44886422 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100597 Approved by: https://github.com/Skylion007	2023-05-04 16:36:23 +00:00
Richard Barnes	6120c5842c	[codemod] Replace hasattr with getattr in caffe2/torch/ao/quantization/utils.py (#100361 ) Summary: The pattern ``` X.Y if hasattr(X, "Y") else Z ``` can be replaced with ``` getattr(X, "Y", Z) ``` The [getattr](https://www.w3schools.com/python/ref_func_getattr.asp) function gives more succinct code than the [hasattr](https://www.w3schools.com/python/ref_func_hasattr.asp) function. Please use it when appropriate. This diff is very low risk. Green tests indicate that you can safely Accept & Ship. Test Plan: Sandcastle Reviewed By: jerryzh168 Differential Revision: D44886493 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100361 Approved by: https://github.com/Skylion007	2023-05-04 14:46:38 +00:00
Kimish Patel	771a9debbe	[PT2E][Quant] Refactor quantizer and qnnpack qantizer code to support dqlinear config (#99399 ) This diff introduces a few refactors: - Move observer creation to utils.py. - Use quantization spec to supply args to observers. - Use annotation function registration corresponding QuantizationConfig. This will be later used in dynamic quantized linear. Differential Revision: [D45073790](https://our.internmc.facebook.com/intern/diff/D45073790/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99399 Approved by: https://github.com/jerryzh168	2023-05-03 03:23:32 +00:00
Kimish Patel	8ec0a939a2	[PT2E][Quant] Fix but in quant spec of symmetric static quant (#99398 ) Activation quant spec should have qscheme = per_tensor_affine Weights quant spec should have ch_axis=0 for per_channel_symmetric Differential Revision: [D45073789](https://our.internmc.facebook.com/intern/diff/D45073789/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99398 Approved by: https://github.com/jerryzh168	2023-05-03 00:36:03 +00:00
Max Ren	151d76cc23	[quant][pt2e] remove dropout from fx quant Differential Revision: D45250152nnPull Request resolved: https://github.com/pytorch/pytorch/pull/99935	2023-04-27 11:22:41 -07:00
andrewor14	6c550bb4d5	[quant][be] Easier way to override default in QConfigMapping (#99888 ) Summary: This commit adds a private helper function to override the default QConfig in the default QConfigMapping. Previously we needed to override all the object_types manually while skipping the fixed qparams ops. This led to duplicate code every time someone wanted a new default QConfig. After this commit, we can just call the same helper function instead. Test Plan: python test/test_quantization.py TestQuantizeFx Reviewers: jerryzh168, vkuzo Pull Request resolved: https://github.com/pytorch/pytorch/pull/99888 Approved by: https://github.com/vkuzo, https://github.com/jerryzh168	2023-04-26 18:14:01 +00:00
Jerry Zhang	df3455b716	[reland][quant][pt2e][refactor] Cleanup the logic for deciding whether to insert observer/fq or not (#99220 ) (#99767 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/99220 Previously we have two places we need to decide whether to insert observer or fake quantizer or not: (1) input arguments of a node (2) output of a node, and right now we have separate code to do this in this PR, the logic is unified in `_needs_obs_or_fq` helper function that takes the target_dtype and is_dynamic from previous output and target_dtype and is_dynamic for the current Tensor we are looking at let's use an example for conv node: ``` conv = convolution(input, weight, bias, ...) ``` let's say we have `input_node` object for argument `input`, and `conv_node` for `conv` node in the graph (1) input arguments, e.g. `input` the target_dtype/is_dynamic from previous output is the node that produces `input`, we get this from input_node.meta["target_dtype_info"]["output_act_obs_or_fq"] the taregt_dtype/is_dynamic for the current argument `input`, comes from conv_node.meta["target_dtype_info"]["input_act_obs_or_fq"] similarly for weight it comes from conv_node.meta["target"]["weightobs_or_fq"] etc. (2) output for conv node the target_dtype/is_dynamic from previous output will be the floating point output from the fp32 convolution operator, so it is hardcoded to be (torch.float, False), however, technically we should get this from node.meta["val"], but since the current code base is shared by fx graph mode quantization and pytorch 2.0 export quantization, we cannot do that, we can revisit after we decide to deprecate fx graph mode quantization the target_dtype/is_dynamic for the current output comes from conv_node.meta["target_dtype_info"]["output_act_obs_or_fq"] there is one caveat here about dynamic quantization, that is explained in the comment, so I won't repeat here Note: also fixed some places in `_get_arg_target_dtype_as_input_to_node` and `_get_arg_target_is_dynamic_as_input_to_node` to make sure "not specified" == specifying a fp32 placeholder observer as well Next: we can merge the two get target dtype and get is_dynamic function to reduce code duplication Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps python test/test_quantization.py TestQuantizeFxModels python test/test_quantization.py TestQuantizePT2E python test/test_quantization.py TestQuantizePT2EModels Imported from OSS Differential Revision: D45198323 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99767 Approved by: https://github.com/kimishpatel	2023-04-25 16:53:02 +00:00
Aaron Gokaslan	e2a3817dfd	[BE] Enable C419 rule for any all shortcircuiting (#99890 ) Apparently https://github.com/pytorch/pytorch/pull/78142 made torch.JIT allow for simple generator expressions which allows us to enable rules that replace unnecessary list comprehensions with generators in any/all. This was originally part of #99280 but I split it off into this PR so that it can be easily reverted should anything break. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99890 Approved by: https://github.com/justinchuby, https://github.com/kit1980, https://github.com/malfet	2023-04-25 15:02:13 +00:00
PyTorch MergeBot	c83e1f517d	Revert "Delete tracing_mode argument to export (#99555 )" This reverts commit `e9786149ab`. Reverted https://github.com/pytorch/pytorch/pull/99555 on behalf of https://github.com/DanilBaibak due to Break internal build	2023-04-24 08:21:41 +00:00
maxren	e63c502baa	[Executorch][XNNPACK] Quantized Max Pool 2d (#99587 ) Adding support for Quantized Max Pool 2d Additions: - Add quantized max pool 2d to executorch backend config - modify max pool node visitors to grab quant params from input/output - Add qmaxpool 2d patterns for partitioners Differential Revision: [D44977783](https://our.internmc.facebook.com/intern/diff/D44977783/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99587 Approved by: https://github.com/jerryzh168	2023-04-22 07:17:13 +00:00
maxren	a964a3dbed	[quant][pt2e] add all convs-relu fusion qat configs (#99586 ) Currently when prepare_qat_fx with executorch backend config we do not properly quantize conv or conv - relu To fix this we add all the necessary qat configs for conv and conv-relu Differential Revision: [D45135947](https://our.internmc.facebook.com/intern/diff/D45135947/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99586 Approved by: https://github.com/jerryzh168	2023-04-22 06:44:23 +00:00
maxren	c139dfd71e	[quant][pt2e] add dropout to executorch backend config (#99585 ) OD Model has a dropout layer in training, In order to match eager mode qat, we also fake quantize the drop out layer in prepare_qat_fx. To do this we add the dropout layer to the default_op_configs in which the observation type uses a different observer from its input Differential Revision: [D45095936](https://our.internmc.facebook.com/intern/diff/D45095936/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99585 Approved by: https://github.com/jerryzh168	2023-04-22 06:41:44 +00:00
PyTorch MergeBot	75e754800f	Revert "[quant][pt2e][refactor] Cleanup the logic for deciding whether to insert observer/fq or not (#99220 )" This reverts commit `d56adb1b54`. Reverted https://github.com/pytorch/pytorch/pull/99220 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally	2023-04-21 18:04:21 +00:00
Jerry Zhang	d56adb1b54	[quant][pt2e][refactor] Cleanup the logic for deciding whether to insert observer/fq or not (#99220 ) Summary: Previously we have two places we need to decide whether to insert observer or fake quantizer or not: (1) input arguments of a node (2) output of a node, and right now we have separate code to do this in this PR, the logic is unified in `_needs_obs_or_fq` helper function that takes the target_dtype and is_dynamic from previous output and target_dtype and is_dynamic for the current Tensor we are looking at let's use an example for conv node: ``` conv = convolution(input, weight, bias, ...) ``` let's say we have `input_node` object for argument `input`, and `conv_node` for `conv` node in the graph (1) input arguments, e.g. `input` the target_dtype/is_dynamic from previous output is the node that produces `input`, we get this from input_node.meta["target_dtype_info"]["output_act_obs_or_fq"] the taregt_dtype/is_dynamic for the current argument `input`, comes from conv_node.meta["target_dtype_info"]["input_act_obs_or_fq"] similarly for weight it comes from conv_node.meta["target"]["weightobs_or_fq"] etc. (2) output for conv node the target_dtype/is_dynamic from previous output will be the floating point output from the fp32 convolution operator, so it is hardcoded to be (torch.float, False), however, technically we should get this from node.meta["val"], but since the current code base is shared by fx graph mode quantization and pytorch 2.0 export quantization, we cannot do that, we can revisit after we decide to deprecate fx graph mode quantization the target_dtype/is_dynamic for the current output comes from conv_node.meta["target_dtype_info"]["output_act_obs_or_fq"] there is one caveat here about dynamic quantization, that is explained in the comment, so I won't repeat here Note: also fixed some places in `_get_arg_target_dtype_as_input_to_node` and `_get_arg_target_is_dynamic_as_input_to_node` to make sure "not specified" == specifying a fp32 placeholder observer as well Next: we can merge the two get target dtype and get is_dynamic function to reduce code duplication Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps python test/test_quantization.py TestQuantizeFxModels python test/test_quantization.py TestQuantizePT2E python test/test_quantization.py TestQuantizePT2EModels Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D45167585](https://our.internmc.facebook.com/intern/diff/D45167585) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99220 Approved by: https://github.com/kimishpatel	2023-04-21 16:58:35 +00:00
Edward Z. Yang	e9786149ab	Delete tracing_mode argument to export (#99555 ) You can have any color you want, as long as it's tracing_mode="symbolic" Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99555 Approved by: https://github.com/voznesenskym	2023-04-21 16:20:51 +00:00
andrewor14	22af604e1b	[quant][pt2] Add Conv + BN fusion for prepare QAT (#98568 ) Summary: This commit adds the `prepare_qat_pt2e` API and the fusion logic for Conv + BN. We use the subgraph rewriter to match and replace the pattern with the existing logic in `nniqat.ConvBn2d`. Note this is not the end-to-end flow yet. In particular, the convert flow needs to swap the new subgraph with another one that merges the batchnorm stats back into conv. The Conv + BN fusion is implemented in the following steps: 1. Annotate all nodes in the pattern `[conv - bn - getitem]` 2. Match and replace this pattern with the fused QAT pattern (note that this is a larger subgraph than the original one) 3. Copy over metadata from the original nodes to the corresponding nodes in the new subgraph, to ensure the stack traces and dtype annotations are preserved 4. Prepare will insert fake quantizes in the right places based on the annotations Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_fusion Reviewers: jerryzh168, kimishpatel, yanboliang Pull Request resolved: https://github.com/pytorch/pytorch/pull/98568 Approved by: https://github.com/kimishpatel	2023-04-20 20:15:28 +00:00
Jerry Zhang	36acad58b6	[quant][pt2e][refactor] Move the annotation for observer sharing ops into separate util (#99384 ) Summary: In order to keep quantizer simple, we want to move the annotation code for operators like flatten, hardtanh etc. to a separate utility function that is called after the quantizer annotation is done, this makes these ops (operator list) not configurable by user, and also makes prepare_pt2e operator aware instead of operator agnostic, this design is not final, we may change it in the future if we find there are use cases that need these to be configurable or if we feel it is important for prepare_pt2e to stay agnostic to operator/operator patterns Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qnnpack_quantizer_obs_sharing_ops Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D45071006](https://our.internmc.facebook.com/intern/diff/D45071006) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99384 Approved by: https://github.com/kimishpatel	2023-04-19 23:49:33 +00:00
Nikita Shulga	8a89eec2f8	[BE] Do not use unicode quotes (#99446 ) They are mostly used in commented code examples, but even Python-3.12 does not recognize `“foobar”` as valid string literal I.e. just `s/[“”]/"/` Pull Request resolved: https://github.com/pytorch/pytorch/pull/99446 Approved by: https://github.com/huydhn, https://github.com/ezyang	2023-04-18 22:59:56 +00:00
Kimish Patel	c0be06667f	[PT2E][Quant] Support for embedding op quantization via ExecuTorchNativeQuantizer (#99106) ExecuTorchNativeQuantizer ExecuTorchNativeQuantizer is a terribly name, I admit, however lets fix it once we align on what the quantized kernel lib within executorch runtime should be called Differential Revision: [D44986258](https://our.internmc.facebook.com/intern/diff/D44986258/) NOTE FOR REVIEWERS: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D44986258/)! Pull Request resolved: https://github.com/pytorch/pytorch/pull/99106 Approved by: https://github.com/jerryzh168	2023-04-18 16:59:37 +00:00
maxren	80eab63587	[Quant][pt2e] torch.mean and ReLU6 (#98984 ) Add nn.Module ReLU6 in addition to functional relu6. Also add torch .mean to quantization config Differential Revision: [D44901038](https://our.internmc.facebook.com/intern/diff/D44901038/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98984 Approved by: https://github.com/jerryzh168	2023-04-17 18:33:04 +00:00
maxren	444a9769ae	[quant][pt2e] QAT Linear (#98897 ) Differential Revision: [D44901039](https://our.internmc.facebook.com/intern/diff/D44901039/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98897 Approved by: https://github.com/tiandiao123, https://github.com/manuelcandales	2023-04-17 18:27:39 +00:00
maxren	568935caca	[quant][pt2e] QAT conv + bn + relu (#98896 ) Differential Revision: [D44901040](https://our.internmc.facebook.com/intern/diff/D44901040/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98896 Approved by: https://github.com/manuelcandales	2023-04-17 18:24:08 +00:00
Kimish Patel	cdab6c8df9	[PT2E][Quant] Support specifying None for obs_or_fq_ctr in target_dtype_info (#99071 ) It is cleaner for quantizer to say what does not need observation instead of putting fp32 observers. This diff add support for that by checking if target_dtype_info contains none for specific observers and if so skip inserting observers for those. Differential Revision: [D44971357](https://our.internmc.facebook.com/intern/diff/D44971357/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99071 Approved by: https://github.com/jerryzh168	2023-04-17 16:37:16 +00:00
Kimish Patel	36a95625da	[PT2E][Quant][BE] Refactor observer code (#99066 ) Combine per channel and per tensor observer code Differential Revision: [D44918494](https://our.internmc.facebook.com/intern/diff/D44918494/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99066 Approved by: https://github.com/jerryzh168	2023-04-17 16:17:36 +00:00
Kimish Patel	31f311a816	[PT2E][Quantization] Refactor Quantizer and QNNPACKQuantizer (#99063 ) This diff renames quantization spec/config and operator config. It moves these datastructures to base quantizer. Base quantizer API now has get_supported_operators that returns list of patterns that a quantizer quantizes. There are two choices being debated for how to convey to user what a particular quantizer will quantize. 1. Modules. We just convey what nn.Modules will be quantized. Of course that does not mean that equivalent functional variants wont be quantized, however for simplifity we just use nn.Module. If certain ops are quatnzied in fused manner then that will considered internal details. Pros and cons of this approach pros: - Simple. Only nn Modules are listed. - User does not have to see fusion patterns. Cons: - confusing perhaps because it is not clear if supported = nn.Conv2d also means that the quantizer supported functional.conv2d - Hiding fusion pattern means user has no say in not fusing. Meaning if conv2d + relu is fused and user configures to quantize only conv, quantizer will also quantize the following relu as if conv2d + relu are fused. 2. Patterns. Be explicit about what is supported and enumerate all possible compbinations. Pros: - it is very clear what quantizer will do. no surprises. Cons: - It is not simple to parse. - It can be argued taht fusion is internal detail of the quantizer. So some quantizer implementation may chose to expose fusion patterns, while others may not and may not even provide any configurability. One option is to move set_supported_operators/modules out of base quantizer and let each quantizer define its own way of communicating what is supported. Issue with this is that when we want to "Compose" multiple quantizers there is no way for user to define the order of composition if user does not know what a quantizer supports. For exampl quantizer A may quantizer conv + relu while B only conv, but B's implementation is fast. In that case you may compose (B, A) such B quantizes conv and A quantizes relu. Not knowning what A and B support, makes such composition harder Differential Revision: [D44895547](https://our.internmc.facebook.com/intern/diff/D44895547/) NOTE FOR REVIEWERS: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D44895547/)! Pull Request resolved: https://github.com/pytorch/pytorch/pull/99063 Approved by: https://github.com/jerryzh168	2023-04-17 00:34:18 +00:00
Jerry Zhang	6a568779b6	[quant][pt2e][improvement] Remove the need to annotate all nodes with default annotation (#99001 ) Summary: This PR changes prepare to use some default observer/fq constructor when "target_dtype_info" is not set, this allows user to not initialize all nodes to default observer/fq constructor. Note we may still need to annotate intermediate node after this PR, there will be a follow up PR to allow users to only annotate things they want to quantize Test Plan: python test/test_quantization.py TestQuantizePT2E python test/test_quantization.py TestQuantizePT2EModels Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/99001 Approved by: https://github.com/kimishpatel, https://github.com/andrewor14	2023-04-13 09:31:51 +00:00
Wyatt Borsos	6361c3debc	Return zero_point from determine_qparams as a int64 (#98746 ) Summary: In some cases, zero_point is returned as an int tensor. We want it to be a long. This fixes a failed assertion in Executorch op_choose_qparams: https://www.internalfb.com/code/fbsource/[4609e7dbbf2e]/fbcode/executorch/kernels/quantized/cpu/op_choose_qparams.cpp?lines=49-52 Test Plan: CI Reviewed By: jerryzh168 Differential Revision: D44764070 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98746 Approved by: https://github.com/jerryzh168	2023-04-11 19:01:05 +00:00
Kazuaki Ishizaki	a13a63ae9a	Fix typos under torch/ao directory (#97679 ) This PR fixes typos in comments and messages of `.py` files under `torch/ao` directory Pull Request resolved: https://github.com/pytorch/pytorch/pull/97679 Approved by: https://github.com/janeyx99, https://github.com/kit1980	2023-04-10 22:25:15 +00:00
Jerry Zhang	c5269ad6c6	[quant][pt2e] Add support for a few ops in QNNPackQuantizer to enable quantizing internal model (#98560 ) Summary: This PR adds support for adaptive_avg_pool2d (traced as mean.dim), mean and hardtanh to QNNPackQuantizer Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qnnpack_quantizer_obs_sharing_ops Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/98560 Approved by: https://github.com/andrewor14	2023-04-07 19:26:45 +00:00
maxren	483fd3351a	[Quant] Add get_symmetric_qnnpack_qat_qconfig_mapping (#98569 ) Differential Revision: [D44776230](https://our.internmc.facebook.com/intern/diff/D44776230/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98569 Approved by: https://github.com/andrewor14	2023-04-07 17:57:56 +00:00

1 2 3 4 5 ...

902 Commits