pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Aaron Gokaslan	29cc293725	[BE]: FURB142 - Remove set mutations. Use set update (#124551 ) Uses set mutation methods instead of manually reimplementing (update, set_difference etc). Pull Request resolved: https://github.com/pytorch/pytorch/pull/124551 Approved by: https://github.com/ezyang	2024-04-21 14:12:33 +00:00
andrewor14	5c0a380bdf	[pt2e][qat] Support conv-transpose-bn[-relu] QAT fusion (#123652 ) Summary: This commit adds support for QAT fusion for the [conv-transpose-bn] and [conv-transpose-bn-relu] patterns. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn1d.test_qat_conv_transpose_bn python test/test_quantization.py TestQuantizePT2EQAT_ConvBn1d.test_qat_conv_transpose_bn_relu python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d.test_qat_conv_transpose_bn python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d.test_qat_conv_transpose_bn_relu Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Tasks: https://github.com/pytorch/pytorch/issues/122224 Differential Revision: [D55930704](https://our.internmc.facebook.com/intern/diff/D55930704) Pull Request resolved: https://github.com/pytorch/pytorch/pull/123652 Approved by: https://github.com/jerryzh168	2024-04-12 17:16:02 +00:00
Zhengxu Chen	dacc73669c	[export] Make quantizer compatible with the standard nn_module_stack. (#122819 ) Summary: When we migrate to torch.export, we won't put L['self'] as the prefix for all the fqn in nn_module_stack. This diff adds the branch to handle the new case. Test Plan: buck test mode/opt caffe2/test/quantization:test_quantization -- -r set_module_name Differential Revision: D55436617 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122819 Approved by: https://github.com/tugsbayasgalan	2024-03-28 19:36:46 +00:00
Riley Dulin	44796682d0	[torch][ao] Fix module name filter for pytorch2 quantization for underscores (#119344 ) Summary: There was a bug in the module name filter for modules that had an underscore already in them, as it was replaced with a "dot" notation. This is because it was thought that underscores always meant a module separator, but this isn't the case for modules whose name contains an underscore. Test Plan: Added a unit test. Before this change, that test failed (due to applying the wrong qscheme). Now it passes. Differential Revision: D53502771 Pull Request resolved: https://github.com/pytorch/pytorch/pull/119344 Approved by: https://github.com/jerryzh168	2024-02-10 00:29:08 +00:00
Jerry Zhang	8f1bc876b2	[quant] Support custom qmin/qmax for activation and weight for xnnpack quantizer (#117305 ) Summary: att, this allows us to experiment with 4 bit quant in xnnpack Test Plan: python test/test_quantization.py -k test_dynamic_linear_int4_weight Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/117305 Approved by: https://github.com/digantdesai	2024-01-17 03:22:49 +00:00
Max Ren	d2033a0639	[quant][pt2e][xnnpack_quantizer] add support for linear_relu (#117052 ) Add support for linear_relu annotation for XNNPACKQuantizer, this allows the input to linear and the output to relu to share the same quantization parameter.s Differential Revision: [D52574086](https://our.internmc.facebook.com/intern/diff/D52574086/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/117052 Approved by: https://github.com/jerryzh168, https://github.com/digantdesai	2024-01-09 23:19:52 +00:00
Jerry Zhang	1474dad28c	[quant][pt2e][xnnpack] Add support for QAT dynamic quantization for linear in XNNPACKQuantizer (#113288 ) Summary: FX graph mode quant workflow and also pt2e flow relies on the `is_dynamic` flag in observer/quantizationspec to convert an observer to dynamic quantization patterns (choose_qparams -> q -> dq), this PR added is_dynamic flag for all observers so that it's possible to convert these observers to the pattern. However, this dynamic quantization pattern (choose_qparams -> q -> dq) is actually only valid for MovingAverageObserver(averaging_constant=1) for the computation before convert and after convert to match in the context of QAT. So we'll have some sanity checks in other observers to make sure the is_dynamic is False. Test Plan: python test/test_quantization.py TestXNNPACKQuantizer.test_qat_dynamic_linear Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D51124725](https://our.internmc.facebook.com/intern/diff/D51124725) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113288 Approved by: https://github.com/kimishpatel	2023-12-04 23:06:38 +00:00
andrewor14	8241fe6edb	[quant][pt2][be] Rewrite QAT annotations using subgraph matcher (#113709 ) Summary: This is the recommended way to write quantizers according to https://pytorch.org/tutorials/prototype/pt2e_quantizer.html#a-note-on-ir-for-pt2e-quantization-flow. It is agnostic to changes in the aten IR and can be easily extended to support conv1d-bn and conv3d-bn fusion patterns in the future. This is the first step towards rewriting XNNPACKQuantizer using this subgraph matcher. Test Plan: python test/test_quantization.py TestQuantizePT2EQAT_ConvBn2d Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar Differential Revision: [D51366525](https://our.internmc.facebook.com/intern/diff/D51366525) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113709 Approved by: https://github.com/jerryzh168	2023-11-16 03:57:37 +00:00
Jerry Zhang	501d118255	[quant][pt2e] Add transform_for_annotation method in Quantizer (#113115 ) Summary: Adding the method so that people can do some transformations before annotation to make the graph easier to annotate Test Plan: python test/test_quantization.py TestQuantizePT2E.test_transform_for_annotation Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D51141080](https://our.internmc.facebook.com/intern/diff/D51141080) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113115 Approved by: https://github.com/kimishpatel	2023-11-09 20:23:29 +00:00
Jerry Zhang	3db0095ea2	[reland][quant][pt2e][be] Cleanup observer insertion logic (#111828 ) (#112453 ) Summary: att, after SharedQuantizationSpec bug fix we are doing some checks before hand, this can simplify the logic when we insert observers Test Plan: contbuild & OSS CI, see `bf998a2c5d` Test plan from GitHub: python test/test_quantization.py TestQuantizePT2E CIs Differential Revision: D50816224 Pulled By: jerryzh168 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112453 Approved by: https://github.com/andrewor14	2023-10-31 17:33:24 +00:00
PyTorch MergeBot	797d7100de	Revert "[quant][pt2e][be] Cleanup observer insertion logic (#111828 )" This reverts commit `bf998a2c5d`. Reverted https://github.com/pytorch/pytorch/pull/111828 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/111828#issuecomment-1782154648))	2023-10-27 01:35:27 +00:00
Jerry Zhang	bf998a2c5d	[quant][pt2e][be] Cleanup observer insertion logic (#111828 ) Summary: att, after SharedQuantizationSpec bug fix we are doing some checks before hand, this can simplify the logic when we insert observers Test Plan: python test/test_quantization.py TestQuantizePT2E CIs Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/111828 Approved by: https://github.com/kimishpatel ghstack dependencies: #111827	2023-10-25 03:48:36 +00:00
Kimish Patel	a8760f1b42	[Quantization] Add a test for QAT + PTQ selective quantization in (#111689 ) xnnpack quantizer Summary: For some workflows you want to quantize some parts of the model via qat and then continue eager mode training. After training, you want to export the whole model and perform PTQ on the rest. Test Plan: test added Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D50510480](https://our.internmc.facebook.com/intern/diff/D50510480) Pull Request resolved: https://github.com/pytorch/pytorch/pull/111689 Approved by: https://github.com/jerryzh168	2023-10-24 23:25:38 +00:00
Jerry Zhang	c9b8e06060	[quant] Enable quantization for wav2letter (#109830 ) Summary: Also added annotation support for conv1d_relu and conv1d in XNNPACKQuantizer, the quantized results still matches fx quant path (didn't quantize conv1d) so tests are not disabled Test Plan: with-proxy buck2 run executorch/examples/quantization:example -- -m=w2l --verify Differential Revision: D49479546 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109830 Approved by: https://github.com/kimishpatel	2023-09-29 00:47:34 +00:00
Jerry Zhang	1b51d29b66	[quant][pt2e] Enable constant folding for quantize ops (#109343 ) Summary: This PR added constant folding for quantize ops so that instead of storing fp32 weight in the quantized model, we'll get int8/int16 etc. weight Test Plan: python test/test_quantization.py TestQuantizePT2E.test_fold_quantize also will verify in executorch later Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D49399210](https://our.internmc.facebook.com/intern/diff/D49399210) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109343 Approved by: https://github.com/kimishpatel, https://github.com/jgong5	2023-09-27 06:04:45 +00:00
Jerry Zhang	b01b934aca	[quant][be] Cleanup xnnpack_quantizer implementation (#108921 ) Summary: att Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/108921 Approved by: https://github.com/andrewor14	2023-09-12 19:28:41 +00:00
Jerry Zhang	b0de6a8002	[quant][executorch] Support inception_v4 in examples (#108382 ) Summary: Verified that pt2e quant flow matches the fx flow with executorch backend config Test Plan: with-proxy buck2 run executorch/examples/quantization:example -- -m=ic4 --verify ``` [INFO 2023-08-31 16:08:06,923 example.py:77] prepare sqnr: inf [INFO 2023-08-31 16:08:06,932 example.py:81] quant diff max: 0.0 [INFO 2023-08-31 16:08:06,936 example.py:85] quant sqnr: inf ``` full output: https://www.internalfb.com/intern/paste/P818520579/ Differential Revision: D48889075 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108382 Approved by: https://github.com/kimishpatel	2023-09-08 17:39:31 +00:00
Kimish Patel	37b0d76e35	[Quantization] Make annotation util functions return annotated nodes (#107106 ) Summary: Having annotation functions return nodes that are annotated is useful specifically for adding "quantization_tag" to those nodes Test Plan: CI Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48488415](https://our.internmc.facebook.com/intern/diff/D48488415) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107106 Approved by: https://github.com/jerryzh168 ghstack dependencies: #107105	2023-09-02 06:19:55 +00:00
Jerry Zhang	a9fe0b5b74	[quant][pt2e] Move propagate_annotation from quant flow to quantizer (#108320 ) Summary: Previously we run propagate_annotation by default in quantization flow to propagate annotations for ops like reshape, view etc. Not all quantizers would need this so we moved this to xnnpack_quantizer_utils for now. Next Step: * make propagate_annotation function configurable with a custom list of ops * remove unneeded ops in `_is_share_obs_or_fq_op` Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48856985](https://our.internmc.facebook.com/intern/diff/D48856985) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108320 Approved by: https://github.com/kimishpatel	2023-09-01 01:49:19 +00:00
Jerry Zhang	9ae3d7ca90	[reland][quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 ) (#107992 ) Summary: att Test Plan: buck2 run executorch/examples/quantization:example -- -m=mv3 --verify Differential Revision: D48588121 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107992 Approved by: https://github.com/digantdesai, https://github.com/mcr229	2023-08-27 14:50:03 +00:00
PyTorch MergeBot	8d44b0f5a5	Revert "[quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 )" This reverts commit `1d1739dc6d`. Reverted https://github.com/pytorch/pytorch/pull/107930 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/107930#issuecomment-1694069330))	2023-08-26 00:37:02 +00:00
Jerry Zhang	1d1739dc6d	[quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 ) Summary: att Test Plan: buck2 run executorch/examples/quantization:example -- -m=mv3 --verify Differential Revision: D48588121 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107930 Approved by: https://github.com/kimishpatel	2023-08-25 23:36:19 +00:00
Jerry Zhang	28be2c674a	[quant][pt2e] Move specific quantizer related things outside of main quant code base (#106806 ) (#107259 ) Summary: Currently in quantizer/quantize_pt2e we import things from specific quantizers (XNNPACKQuantizer, QuantizationConfig) etc. this PR removes them so it's clearer that they are not part of the core quantization code base This PR also removed get_supported_operators from main Quantizer since we haven't seen a clear need for this API Test Plan: CIs Imported from OSS Differential Revision: D48340367 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107259 Approved by: https://github.com/kimishpatel	2023-08-18 21:29:09 +00:00
Jerry Zhang	4afab40b56	[quant][pt2e] Removed mean/hardtanh annotations and refactored adaptive_avg_pool annotation (#106805 ) Summary: Removed annotations for some ops, since they are handled in torch/ao/quantization/pt2e/_propagate_annotation.py Test Plan: CIs Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106805 Approved by: https://github.com/kimishpatel	2023-08-10 04:51:06 +00:00
Jerry Zhang	e1a1780626	[quant][pt2e] Move annotate functions in XNNPACKQuantizer to utils (#106642 ) Summary: This is to allow sharing these annotate functions by other quantizers so that writing a new quantizer is easier note that these annotation functions will be maintained by XNNPACKQuantizer developers instead of AO team Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106642 Approved by: https://github.com/andrewor14	2023-08-09 18:52:39 +00:00
Jerry Zhang	d528a137e0	[quant][pt2e][quantizer] Suppoert set_module_type in XNNPACKQuantizer (#106094 ) Summary: Added support to allow users to set configurations based on module type in XNNPACKQuantizer, can also serve as an example for implementing new quantizers Test Plan: python test/test_quantization.py TestQuantizePT2E.test_xnnpack_quantizer_set_module_type Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106094 Approved by: https://github.com/andrewor14 ghstack dependencies: #106087	2023-08-02 08:33:58 +00:00
Jerry Zhang	92a22a8098	[quant][pt2e][quantizer] Suppoert set_module_name in XNNPACKQuantizer (#106087 ) Summary: Added support to allow users to set configurations based on module name in XNNPACKQuantizer, can also serve as an example for implementing new quantizers Test Plan: python test/test_quantization.py TestQuantizePT2E.test_xnnpack_quantizer_set_module_name Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106087 Approved by: https://github.com/andrewor14	2023-08-02 01:19:23 +00:00
Edward Z. Yang	7b9d250f06	Change _dynamo.export to be export(f)(args, *kwargs) (#106109 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106109 Approved by: https://github.com/voznesenskym	2023-07-27 21:41:13 +00:00
Jerry Zhang	3a77f9aaaf	[quant][api] Move torch.ao.quantization.pt2e.quantizer to torch.ao.quantization.quantizer (#105885 ) Summary: moving quantizer to torch.ao.quantization to make it a public api, since pt2e is a folder for implementations Test Plan: CIs sanity check: "buck test //executorch/backends/xnnpack/test:test_xnnpack_quantized_models -- test_resnet18" Differential Revision: D47727838 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105885 Approved by: https://github.com/andrewor14	2023-07-26 18:20:09 +00:00

29 Commits