pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jerry Zhang	1b51d29b66	[quant][pt2e] Enable constant folding for quantize ops (#109343 ) Summary: This PR added constant folding for quantize ops so that instead of storing fp32 weight in the quantized model, we'll get int8/int16 etc. weight Test Plan: python test/test_quantization.py TestQuantizePT2E.test_fold_quantize also will verify in executorch later Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D49399210](https://our.internmc.facebook.com/intern/diff/D49399210) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109343 Approved by: https://github.com/kimishpatel, https://github.com/jgong5	2023-09-27 06:04:45 +00:00
andrewor14	7da3c938cf	[quant][be] Move QAT tests to its own file (#108061 ) Test Plan: python test/test_quantization.py TestQuantizePT2EQAT python test/test_quantization.py TestQuantizePT2EQATModels Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar Pull Request resolved: https://github.com/pytorch/pytorch/pull/108061 Approved by: https://github.com/jerryzh168	2023-09-15 18:34:44 +00:00
Jerry Zhang	58a883093f	[quant][pt2e] Add test for serialize and deserialize quantized model (#109158 ) Summary: att Test Plan: python test/test_quantization.py TestQuantizePT2E.test_save_load Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/109158 Approved by: https://github.com/andrewor14 ghstack dependencies: #108924, #108925	2023-09-15 00:50:55 +00:00
Jerry Zhang	9187559e75	[quant][be] Remove test/quantization/pt2e/test_quantize_pt2e_fx.py (#108925 ) Summary: this is no longer needed since we have the quantizer api now Test Plan: . Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/108925 Approved by: https://github.com/andrewor14 ghstack dependencies: #108924	2023-09-14 18:35:17 +00:00
Jerry Zhang	41e2189843	[quant] Remove reference representation rewrite for adaptive_avg_pool2d (#108924 ) Summary: integer adaptive_avg_pool2d is not well defined due to different possible ways of rounding fp32 value to integer value, and this op isn't too critical for numerics (since it appears not too often), so we'll skip this for now. we might need to revert the changes that adds integer impl for adaptive_avg_pool op as well Test Plan: python test/test_quantization.py TestQuantizePT2ERepresentation Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/108924 Approved by: https://github.com/kimishpatel	2023-09-14 10:18:36 +00:00
Jerry Zhang	c914ca7577	[quant][be] Add TestPT2ERepresentation test case (#108923 ) Summary: att Test Plan: python test/test_quantization.py TestPT2ERepresentation Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/108923 Approved by: https://github.com/andrewor14	2023-09-14 02:01:38 +00:00
Andrew Or	e8a402c56e	[quant][pt2] Fix and rename `move_model_to_eval` (#108891 ) Summary: This commit fixes two silent correctness problems with the current implementation of `move_model_to_eval`: (1) Previously the user had to manually call `eliminate_dead_code` before calling `move_model_to_eval`, otherwise the dropout pattern won't actually get eliminated. This is because subgraph rewriter complains the match is not self-contained, and so silently does not do the replacement. (2) We wish to error when the user calls `model.train()` or `model.eval()` on an exported model. This error is raised correctly immediately after export today, but no longer raised after the user calls prepare or convert. We fix (1) by moving the `eliminate_dead_code` call into `move_model_to_eval`, and fix (2) by ensuring the respective errors are thrown after prepare and convert as well. Additionally, this commit renames `move_model_to_eval` to `move_exported_model_to_eval` to be more explicit. bypass-github-export-checks Test Plan: python test/test_quantization.py TestQuantizePT2E.test_disallow_eval_train python test/test_quantization.py TestQuantizePT2E.test_move_exported_model_to_eval Imported from OSS Differential Revision: D49097293 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108891 Approved by: https://github.com/jerryzh168	2023-09-11 15:37:01 +00:00
Jerry Zhang	b0de6a8002	[quant][executorch] Support inception_v4 in examples (#108382 ) Summary: Verified that pt2e quant flow matches the fx flow with executorch backend config Test Plan: with-proxy buck2 run executorch/examples/quantization:example -- -m=ic4 --verify ``` [INFO 2023-08-31 16:08:06,923 example.py:77] prepare sqnr: inf [INFO 2023-08-31 16:08:06,932 example.py:81] quant diff max: 0.0 [INFO 2023-08-31 16:08:06,936 example.py:85] quant sqnr: inf ``` full output: https://www.internalfb.com/intern/paste/P818520579/ Differential Revision: D48889075 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108382 Approved by: https://github.com/kimishpatel	2023-09-08 17:39:31 +00:00
Kurt Mohler	3f88e3105f	Reland: Remove remaining global `set_default_dtype` calls from tests (#108088 ) Fixes #68972 Relands #107246 To avoid causing Meta-internal CI failures, this PR avoids always asserting that the default dtype is float in the `TestCase.setUp/tearDown` methods. Instead, the assert is only done if `TestCase._default_dtype_check_enabled == True`. `_default_dtype_check_enabled` is set to True in the `if __name__ == "__main__":` blocks of all the relevant test files that have required changes for this issue Pull Request resolved: https://github.com/pytorch/pytorch/pull/108088 Approved by: https://github.com/ezyang	2023-09-07 03:04:34 +00:00
Jerry Zhang	32a16d4999	[quant][pt2e] Support int16 quantization (#108453 ) Summary: Previously we can only use native pytorch int dtypes that has corresponding quantized dtypes (e.g. quint8, qint8), this PR removes this assumption in observers/fake_quants so that users can use all pytorch native dtypes (except for int64, we can add it later if need) the main addition here is int16. Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/108453 Approved by: https://github.com/kimishpatel	2023-09-06 19:31:20 +00:00
Kimish Patel	ffc0c46092	[Quantization] Add metadata porting for nodes added by quantization (#107107 ) Summary: This diff adds adding metadata to q-dq nodes by inferring the quatization intent from node annotations. Annotations on the node are way for user to specify how a node or subgraph is supposed to be quantized. We continue to use that information to copy metadata on Q/DQ node from appropriate nodes. Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48488416](https://our.internmc.facebook.com/intern/diff/D48488416) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107107 Approved by: https://github.com/jerryzh168 ghstack dependencies: #107105, #107106, #107899, #107900	2023-09-02 06:38:14 +00:00
Kimish Patel	eb67c452c8	[Quant] Add DQ duplication pass (#107900 ) Summary: During convert step observers are first replaced by Q-DQ pair. In some scenarios like following output DQ has a fan out. ---> OP2 -> Q -> DQ / OP -> Q -> DQ - \ ---> OP3 -> Q -> DQ If either op OP2 or OP3 are configured to be quantized, then the input is expected to quantized. In this case quantized equivalent of some pattern, that quantizer asked to be quantized, should look like: [DQ -> {pattern} -> Q]. However, in scenario like above where DQ node is shared between multiple "quantized" patterns, boundary of "quantized" pattern is not clear because DQ now belongs to multiple quantized patterns. This poses challenge for: - Porting metadata: which "quantized" partition this DQ node belongs - Quantized representation, equivalently, needs to identify self-contained quantized pattern that is replaced by its equivalent pattern that captures compute in the quantized precision. Test Plan: test_duplicate_dq_pass Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48663147](https://our.internmc.facebook.com/intern/diff/D48663147) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107900 Approved by: https://github.com/jerryzh168, https://github.com/andrewor14, https://github.com/leslie-fang-intel ghstack dependencies: #107105, #107106, #107899	2023-09-02 06:20:03 +00:00
leslie-fang-intel	fb808c30c7	x86_inductor_quantizer switches to new graph capture API (#108214 ) Summary Update `X86InductorQuantizer` and related testcase to the new graph capture API `capture_pre_autograd_graph`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108214 Approved by: https://github.com/jgong5, https://github.com/jerryzh168	2023-09-01 00:43:45 +00:00
andrewor14	057b807178	[quant] Move dropout replacement to `move_model_to_eval` (#108184 ) Summary: This commit adds a public facing `torch.ao.quantization.move_model_to_eval` util function for QAT users. Instead of calling model.eval() on an exported model (which doesn't work, see https://github.com/pytorch/pytorch/issues/103681), the user would call this new util function instead. This ensures special ops such as dropout and batchnorm (not supported yet) will have the right behavior when the graph is later used for inference. Note: Support for an equivalent `move_model_to_train` will be added in the future. This is difficult to do for dropout currently because the eval pattern of dropout is simply a clone op, which we cannot just match and replace with a dropout op. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_move_model_to_eval Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar Differential Revision: [D48814735](https://our.internmc.facebook.com/intern/diff/D48814735) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108184 Approved by: https://github.com/jerryzh168	2023-08-30 16:33:17 +00:00
Jerry Zhang	147b3495e2	[quant][pt2e] Add reference representation for dynamic quantized linear (#108073 ) Summary: att Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_dynamic_linear buck2 test 'fbcode//mode/opt' fbcode//caffe2/test:quantization_pt2e -- 'test_representation_dynamic_linear' Reviewed By: kimishpatel Differential Revision: D48703076 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108073 Approved by: https://github.com/andrewor14	2023-08-29 07:12:55 +00:00
andrewor14	199e23bc3a	[quant][be] Clean up QAT tests in test_quantize_pt2e.py (#107991 ) Summary: This commit does 4 main things: 1. When verifying QAT numerics, automatically check both the per tensor and the per channel cases, and automatically verify convert numerics 2. When verifying the QAT graph, automatically check both the per tensor and the per channel cases 3. Merge verify graph and verify numerics tests for conv-bn 4. Fix `test_prepare_qat_conv_bn_fusion_getitem_placeholder`, which was no longer testing the right thing recent capture changes, since the maxpool op is no longer followed by a getitem node. However, we do still need this test for other ops that are followed by getitem nodes (e.g. standalone BN). Items (1) - (3) make the QAT tests significantly less verbose and easier to read. Test Plan: python test/test_quantization.py TestQuantizePT2E python test/test_quantization.py TestQuantizePT2EModels Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel, supriyar Pull Request resolved: https://github.com/pytorch/pytorch/pull/107991 Approved by: https://github.com/jerryzh168	2023-08-28 21:12:00 +00:00
Jerry Zhang	9ae3d7ca90	[reland][quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 ) (#107992 ) Summary: att Test Plan: buck2 run executorch/examples/quantization:example -- -m=mv3 --verify Differential Revision: D48588121 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107992 Approved by: https://github.com/digantdesai, https://github.com/mcr229	2023-08-27 14:50:03 +00:00
Xia, Weiwen	e9b0f62a19	[Quant][PT2E] Enable linear and linear-unary post-op quant recipe for x86 inductor quantizer (#106781 ) Summary Add linear and linear-unary post-op quantization recipe to x86 inductor quantizer. For PT2E with Inductor. With this, the quantization path will add `quant-dequant` pattern for linear and linear-unary post op. Test plan python test/test_quantization.py -k test_linear_with_quantizer_api python test/test_quantization.py -k test_linear_unary_with_quantizer_api Pull Request resolved: https://github.com/pytorch/pytorch/pull/106781 Approved by: https://github.com/leslie-fang-intel, https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #105818	2023-08-27 10:50:17 +00:00
Xia, Weiwen	a6d3da1835	[Quant] Add int8 linear op impl for quantization PT2E with Inductor. input is an int8 CPU tensor; weight is an int8 MdkldnnCPU tensor. (#105818 ) Summary Add a new onednn qlinear op for quantization PT2E with Inductor. input is an int8 CPU tensor and weight is an int8 MkldnnCPU tensor. Test plan python test/test_quantization.py -k test_qlinear_pt2e Pull Request resolved: https://github.com/pytorch/pytorch/pull/105818 Approved by: https://github.com/jgong5, https://github.com/leslie-fang-intel, https://github.com/jerryzh168	2023-08-27 08:13:12 +00:00
leslie-fang-intel	1147a28b0b	[Quant][PT2E] Add cat and avg_pool2d recipe into x86InductorQuantizer (#106836 ) Summary Add `cat` and `avg_pool2d` quantization recipe as input output share observer into `x86InductorQuantizer`. Test Plan ``` clear && python -m pytest test_x86inductor_quantizer.py -k test_cat_recipe clear && python -m pytest test_x86inductor_quantizer.py -k test_cat_recipe_same_inputs clear && python -m pytest test_x86inductor_quantizer.py -k test_cat_recipe_single_input clear && python -m pytest test_x86inductor_quantizer.py -k test_avg_pool2d_recipe ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/106836 Approved by: https://github.com/jgong5, https://github.com/jerryzh168	2023-08-26 16:51:13 +00:00
Jerry Zhang	15d4dedbbf	[quant][pt2e] Add reference representation rewrite for statically quantized linear (#107994 ) Summary: att Test Plan: ``` python test/test_quantization.py TestQuantizePT2E.test_representation_linear buck2 test 'fbcodemode/opt' fbcodecaffe2/test:quantization_pt2e -- 'test_representation_linear' ``` Reviewed By: kimishpatel Differential Revision: D48674862 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107994 Approved by: https://github.com/mcr229, https://github.com/guangy10	2023-08-26 15:39:52 +00:00
leslie-fang-intel	9319dd1c7c	[Quant][Inductor] Enable the lowering of quantized maxpool2d (#105906 ) Summary Enable the `dq-maxpool2d-q` pattern match and lower into `torch.ops.quantized.max_pool2d`. Test Plan ``` python -m pytest test_mkldnn_pattern_matcher.py -k test_qmaxpool2d python -m pytest test_quantized_op.py -k test_max_pool2d_pt2e ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105906 Approved by: https://github.com/jgong5, https://github.com/eellison ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456, #105639	2023-08-26 08:36:47 +00:00
leslie-fang-intel	70ca18f8a0	[Quant][PT2E] Enable X86InductorQuantizer single quantizable op(maxpool2d) (#105639 ) Summary In this PR, we mainly enable 2 things. - Enable the skeleton of quantization recipe for single quantizable operators in `X86InductorQuantizer`. - Add quantization recipe of `maxpool2d` and annotate it as input./output share observer. Test Plan ``` python -m pytest test_x86inductor_quantizer.py -k test_maxpool2d_recipe ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105639 Approved by: https://github.com/jgong5, https://github.com/jerryzh168 ghstack dependencies: #104580, #104581, #104588, #104590, #105455, #105456	2023-08-26 08:34:15 +00:00
andrewor14	240bdbea61	[quant][pt2e] Fix annotation for conv no bias case (#107971 ) Summary: This fixes the no bias case for conv annotations. Previously this would result in an index out of bounds, since the new aten.conv2d op may not have the bias arg (unlike the old aten.convolution op). This was not caught because of a lack of test cases, which are added in this commit. Test Plan: python test/test_quantization.py TestQuantizePT2E.test_qat_conv_no_bias python test/test_quantization.py TestQuantizePT2E.test_qat_conv_bn_relu_fusion_no_conv_bias Reviewers: jerryzh168, kimishpatel Subscribers: jerryzh168, kimishpatel Differential Revision: [D48696874](https://our.internmc.facebook.com/intern/diff/D48696874) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107971 Approved by: https://github.com/jerryzh168	2023-08-26 01:01:54 +00:00
Jerry Zhang	f92f69dbfb	[quant][pt2e] Enable testing for reference quant model representations (#107474 ) Summary: Previously these tests were disabled due to time out in dynamo export in fbcode, this might have been resolved, so trying to enable again Test Plan: python test/test_quantization.py TestQuantizePT2E Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48619072](https://our.internmc.facebook.com/intern/diff/D48619072) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107474 Approved by: https://github.com/andrewor14	2023-08-26 00:37:45 +00:00
PyTorch MergeBot	8d44b0f5a5	Revert "[quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 )" This reverts commit `1d1739dc6d`. Reverted https://github.com/pytorch/pytorch/pull/107930 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/107930#issuecomment-1694069330))	2023-08-26 00:37:02 +00:00
Jerry Zhang	1d1739dc6d	[quant][pt2e][xnnpack_quantizer] Add support for mul and mul_relu (#107930 ) Summary: att Test Plan: buck2 run executorch/examples/quantization:example -- -m=mv3 --verify Differential Revision: D48588121 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107930 Approved by: https://github.com/kimishpatel	2023-08-25 23:36:19 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	2b7271c703	Support cond and out_dtype for predispatch (#107941 ) Summary: Title Test Plan: CI Differential Revision: D48675742 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107941 Approved by: https://github.com/jerryzh168	2023-08-25 17:37:16 +00:00
leslie-fang-intel	8ef057255d	[Quant][PT2E] Enable qconv for quantization 2.0 export (#104580 ) Summary Enable `qconv1d/2d/3d`, `qconv2d_relu`, `qconv2d_add`, and `qconv2d_add_relu` operator for quantization 2.0 export with oneDNN library. Test Plan ``` python -u -m pytest -s -v test_quantized_op.py -k test_qconv1d_pt2e python -u -m pytest -s -v test_quantized_op.py -k test_qconv2d_pt2e python -u -m pytest -s -v test_quantized_op.py -k test_qconv3d_pt2e python -u -m pytest -s -v test_quantized_op.py -k test_qconv2d_relu_pt2e python -u -m pytest -s -v test_quantized_op.py -k test_qconv2d_add_pt2e python -u -m pytest -s -v test_quantized_op.py -k test_qconv2d_add_relu_pt2e ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/104580 Approved by: https://github.com/jgong5, https://github.com/jerryzh168	2023-08-25 17:34:45 +00:00
Jerry Zhang	a0cfaf0688	[quant][pt2e] Make sure XNNPACKQuantizer works with the pre_dispatch=True (#107872 ) Summary: att Test Plan: ``` buck test //executorch/backends/xnnpack/test:test_xnnpack_quantized_models -- test_resnet18 buck2 test 'fbcode//mode/opt' fbcode//caffe2/test:quantization_pt2e ``` Reviewed By: andrewor14, tugsbayasgalan Differential Revision: D48415977 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107872 Approved by: https://github.com/andrewor14	2023-08-25 05:04:01 +00:00
Jerry Zhang	16fcb07846	[quant][pt2e] Add support for channel in DerivedQuantizationSpec (#107833 ) Summary: att Test Plan: python test/test_quantization.py TestQuantizePT2E.test_derived_qspec_per_channel Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D48630535](https://our.internmc.facebook.com/intern/diff/D48630535) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107833 Approved by: https://github.com/andrewor14	2023-08-24 07:45:13 +00:00
vasiliy	61fe49b8ed	pt2: make aot_eager backend handle basic float8 operations (#107783 ) Summary: Reland of https://github.com/pytorch/pytorch/pull/107642 with a fix for tests on Windows. Makes aot_eager backend of torch.compile handle basic float8 operations. This is useful for float8 training UX. Test Plan: ``` python test/test_quantization.py -k test_pt2_traceable_aot_eager ``` Reviewers: Subscribers: Tasks: Tags: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/107783 Approved by: https://github.com/albanD	2023-08-23 18:10:53 +00:00
Aaron Gokaslan	660e8060ad	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-22 23:16:38 +00:00
PyTorch MergeBot	5025fb9213	Revert "pt2: make aot_eager backend handle basic float8 operations (#107642 )" This reverts commit `24147a8e1c`. Reverted https://github.com/pytorch/pytorch/pull/107642 on behalf of https://github.com/huydhn due to Sorry for reverting this, but it is failing Windows CPU test in trunk. The Windows failures on your PR looks related I think ([comment](https://github.com/pytorch/pytorch/pull/107642#issuecomment-1688999380))	2023-08-22 22:17:36 +00:00
PyTorch MergeBot	d59a6864fb	Revert "[BE]: Update ruff to 0.285 (#107519 )" This reverts commit `88ab3e4322`. Reverted https://github.com/pytorch/pytorch/pull/107519 on behalf of https://github.com/ZainRizvi due to Sorry, but this PR breaks internal tests. @ezyang, can you please hep them get unblocked? It seems like one of the strings was prob accidentally modified ([comment](https://github.com/pytorch/pytorch/pull/107519#issuecomment-1688833480))	2023-08-22 19:53:32 +00:00
vasiliy	24147a8e1c	pt2: make aot_eager backend handle basic float8 operations (#107642 ) Summary: Makes aot_eager backend of torch.compile handle basic float8 operations. This is useful for float8 training UX. Test Plan: ``` python test/test_quantization.py -k test_pt2_traceable_aot_eager ``` Reviewers: Subscribers: Tasks: Tags: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/107642 Approved by: https://github.com/albanD	2023-08-22 18:57:14 +00:00
Tugsbayasgalan Manlaibaatar	ee72071fc7	Avoid executing side-effectful graph_module as validation step (#107271 ) Dynamo currently runs the real graph module with real inputs as a way to match the return result of graph module with the eager return type. This is unsafe when graph module is side effectful. In the long term, we will get rid of this step. But in the short term, we just fakify the graph module again and run it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107271 Approved by: https://github.com/ezyang	2023-08-22 04:22:31 +00:00
Aaron Gokaslan	88ab3e4322	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-20 01:36:18 +00:00
Jerry Zhang	28be2c674a	[quant][pt2e] Move specific quantizer related things outside of main quant code base (#106806 ) (#107259 ) Summary: Currently in quantizer/quantize_pt2e we import things from specific quantizers (XNNPACKQuantizer, QuantizationConfig) etc. this PR removes them so it's clearer that they are not part of the core quantization code base This PR also removed get_supported_operators from main Quantizer since we haven't seen a clear need for this API Test Plan: CIs Imported from OSS Differential Revision: D48340367 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107259 Approved by: https://github.com/kimishpatel	2023-08-18 21:29:09 +00:00
Jerry Zhang	d3c4ec767b	[quant][pt2e] Fix handling for SharedQuantizationSpec (#106922 ) Summary: Previously if we have: ``` conv1 -> cat conv2 / ``` and configure output of conv1/conv2 to be int8 quantized, and cat also int8 quantized and with shared inputs, it will not produce expected results (input of cat will not be shared) The problem is that there is some missing checks when inserting observers for input for cat This PR fixes the problem. Fixes: https://github.com/pytorch/pytorch/issues/106760 Test Plan: python tes/test_quantization.py TestQuantzePT2E.test_shared_qspec Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106922 Approved by: https://github.com/kimishpatel	2023-08-16 21:16:45 +00:00
Jiaxu Zhu	152203d3c3	[pytorch][ao] Add `torch.matmul` in FloatFunctional/QFunctional (#106831 ) Summary: As title Test Plan: new unit tests Differential Revision: D48172841 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106831 Approved by: https://github.com/jerryzh168	2023-08-10 22:43:36 +00:00
Jerry Zhang	79449e6272	[quant][pt2e][fix] Remove the requirement of using no_grad for reference model that contains quantized conv2d (#106924 ) Summary: att we don't actually need gradient for conv2d, just need it to run without error, so we delayed the error of out_dtype gradient to the time when user actually requested it Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_conv2d Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106924 Approved by: https://github.com/zou3519, https://github.com/kimishpatel	2023-08-10 19:16:10 +00:00
Jerry Zhang	97ce979e5d	[quant][pt2e] Add reference representation for quantized conv2d (#105784 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_quantize_dequantize_per_channel Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105784 Approved by: https://github.com/kimishpatel ghstack dependencies: #105783	2023-08-09 22:41:35 +00:00
Tugsbayasgalan (Tugsuu) Manlaibaatar	a44c072c89	Make InternalModel and Resnet work with rexportable flow (#106676 ) Summary: Internal model and Resnet uses "re-export" flow now. Also did some refactoring to make the code little cleaner Some changes for OSS: 1. Correctly use the "cached" fake tensors so that static symbols are still resolved to static 2. Change logic in PassBase to allocate static shapes for parameters 3. Add "is_torch_exported" tag to every node to make it survive during various graph transformations. 4. Added experimental wrapper API for quantization team to get pre_dispatch=True graph. Note that it doesn't actually do that right now. But we plan to switch soon. Test Plan: CI Differential Revision: D47890878 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106676 Approved by: https://github.com/jerryzh168	2023-08-09 20:10:48 +00:00
Jerry Zhang	69ecad6f2b	[quant][pt2e] Add reference representation for quantize_per_channel and dequantize_per_channel (#105783 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_quantize_dequantize_per_channel Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105783 Approved by: https://github.com/kimishpatel	2023-08-09 01:39:52 +00:00
Jason Lu	bc88028e8e	Back out "Reland "Make adding buffers more like adding parameters (#104069 )" (#106224 )" (#106743 ) Summary: Original commit changeset: 81319beb97f3 Original Phabricator Diff: D47961182 Test Plan: revert to maintain backward compat with legacy ads_dper3 production package. Read details in: S357822 Reviewed By: atuljangra Differential Revision: D48131623 @diff-train-skip-merge (D48131623 landed internally) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106743 Approved by: https://github.com/malfet	2023-08-08 15:27:34 +00:00
Jerry Zhang	2156f0434c	[quant][pt2e] Add reference representation for quantized adaptive_avg_pool2d (#105709 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_adaptive_avg_pool2d Although right now it is not really testing things since there is some problem with dynamo export Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105709 Approved by: https://github.com/andrewor14 ghstack dependencies: #105708	2023-08-04 18:49:14 +00:00
Jerry Zhang	9e301949ec	[quant][pt2e] Add reference representation for quantized max_pool2d (#105708 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_maxpool2d Although right now it is not really testing things since there is some problem with dynamo export Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105708 Approved by: https://github.com/andrewor14	2023-08-04 08:19:52 +00:00
Jerry Zhang	820e68b58a	[quant][pt2e] Add reference representation for quantized add - relu (#105707 ) Summary: Implementing reference representation for quantized ops we decided in https://docs.google.com/document/d/17h-OEtD4o_hoVuPqUFsdm5uo7psiNMY8ThN03F9ZZwg/edit#heading=h.ov8z39149wy8 Test Plan: python test/test_quantization.py TestQuantizePT2E.test_representation_add_relu Although right now it is not really testing things since there is some problem with dynamo export Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/105707 Approved by: https://github.com/andrewor14	2023-08-03 00:42:06 +00:00
Jerry Zhang	ba387b8830	[easy][be] operator_config -> quantization_config renaming (#106479 ) Summary: att Test Plan: CIs Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/106479 Approved by: https://github.com/andrewor14	2023-08-03 00:36:44 +00:00

1 2 3 4 5 ...

1529 Commits