pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Xuehai Pan	30293319a8	[BE][Easy][19/19] enforce style for empty lines in import segments in `torch/[o-z]*/` (#129771 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129771 Approved by: https://github.com/justinchuby, https://github.com/janeyx99	2024-08-01 17:07:14 +00:00
Edward Z. Yang	3bf922a6ce	Apply UFMT to low traffic torch modules (#106249 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106249 Approved by: https://github.com/Skylion007	2023-07-29 23:37:30 +00:00
Vasiliy Kuznetsov	f15ab8a7f2	AO migration: replace torch internal callsites (#94170 ) Summary: Do the following renames: `torch.quantization` -> `torch.ao.quantization` `torch.nn.quantized` -> `torch.ao.nn.quantized` `torch.nn.quantizable` -> `torch.ao.nn.quantizable` `torch.nn.qat` -> `torch.ao.nn.qat` `torch.nn.intrinsic` -> `torch.ao.nn.intrinsic` And then, do `torch.ao.nn.quantized._reference` -> `torch.ao.nn.quantized.reference` to clean up the aftermath of https://github.com/pytorch/pytorch/pull/84974 Then, manually update `test/test_module_init.py` to fix hanging whitespace due to the replace. Run this script to do the replacements: https://gist.github.com/vkuzo/7f7afebf8c31b9ba48306223e68a1c82 This is for https://github.com/pytorch/pytorch/issues/81667 Test plan: CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/94170 Approved by: https://github.com/jerryzh168	2023-02-07 02:32:23 +00:00
HDCharles	f286cbebce	[ao][fx] fixing public v private graph_module.py (#88395 ) Summary: made _is_observed_module, _is_observed_standalone_module private Test Plan: python test/test_public_bindings.py Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D41015545](https://our.internmc.facebook.com/intern/diff/D41015545) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88395 Approved by: https://github.com/jcaip	2022-12-15 02:15:04 +00:00
HDCharles	258860fa3a	[ao][fx] fixing public v private for pattern_utils.py (#88397 ) Summary: made _DEFAULT_FUSION_PATTERNS, _register_fusion_pattern, _DEFAULT_QUANTIZATION_PATTERNS, _DEFAULT_OUTPUT_FAKE_QUANTIZE_MAP, _DEFAULT_OUTPUT_OBSERVER_MAP, _register_quant_pattern, _sorted_patterns_dict private Test Plan: python test/test_public_bindings.py Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D41015537](https://our.internmc.facebook.com/intern/diff/D41015537) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88397 Approved by: https://github.com/jcaip	2022-12-14 03:40:02 +00:00
HDCharles	79156c11c3	[ao][fx] fixing public v private match_utils.py (#88396 ) Summary: made _is_match, _find_matches, _MatchResult private also added __all__ to lower_to_qnnpack.py Test Plan: python test/test_public_bindings.py Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D41015540](https://our.internmc.facebook.com/intern/diff/D41015540) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88396 Approved by: https://github.com/jcaip	2022-12-13 20:16:55 +00:00
andrewor14	13fcc412be	[Quant][fx][bc-breaking] Remove unused functions in fx/utils.py (#90025 ) Summary and BC-breaking notes: This commit removes the following unused functions from both the `torch.quantization` and the `torch.ao.quantization` namespaces: ``` graph_pretty_str get_per_tensor_qparams quantize_node get_qconv_op create_qparam_nodes node_return_type_is_int is_get_tensor_info_node ``` Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps python test/test_quantization.py TestAOMigrationQuantizationFx Reviewers: jerryzh168, vkuzo Subscribers: jerryzh168, vkuzo Pull Request resolved: https://github.com/pytorch/pytorch/pull/90025 Approved by: https://github.com/HDCharles	2022-12-07 01:31:28 +00:00
andrewor14	d80056312a	[Quant][fx][bc-breaking] Rename fx/patterns.py (#89872 ) Summary: This commit renames fx/quantization_patterns.py to fx/quantize_handler.py, and fx/fusion_patterns.py to fx/fuse_handler.py. This is because these files contain only QuantizeHandler and FuseHandler respectively, so the new names are more descriptive. A future commit will further break BC by removing all the empty QuantizeHandler classes. BC-breaking notes: The following classes under the `torch.ao.quantization.fx.quantization_patterns` namespace are migrated to the `torch.ao.quantization.fx.quantize_handler` namespace: ``` QuantizeHandler BinaryOpQuantizeHandler CatQuantizeHandler ConvReluQuantizeHandler LinearReLUQuantizeHandler BatchNormQuantizeHandler EmbeddingQuantizeHandler RNNDynamicQuantizeHandler DefaultNodeQuantizeHandler FixedQParamsOpQuantizeHandler CopyNodeQuantizeHandler GeneralTensorShapeOpQuantizeHandler CustomModuleQuantizeHandler StandaloneModuleQuantizeHandler ``` The following classes under the `torch.ao.quantization.fx.fusion_patterns` namespace are migrated to the `torch.ao.quantization.fx.fuse_handler` namespace: ``` DefaultFuseHandler FuseHandler ``` Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Reviewers: jerryzh168, vkuzo Subscribers: jerryzh168, vkuzo Pull Request resolved: https://github.com/pytorch/pytorch/pull/89872 Approved by: https://github.com/jerryzh168	2022-12-01 17:37:07 +00:00
HDCharles	25476f2e4b	[ao] fixing public v private for quantization_types (#86031 ) Summary: the main problem with this was that the different objects defined simply as 'Any' should theoretically be public but making them public either A) results in an error about the module being 'typing' rather than whatever module it should be or B) you set the module manually, thereby changing the module for the original 'Any' class. note: QuantizeHandler has a similar issue where its simply defined as 'Any' Pattern was defined in multiple places which was causing issues so i just moved it to a single place given the note at the top of quantization_types.py indicating these definitions should be moved to utils at some point anyway. Finally i changed any references to these objects to point at the correct locations. Note: i didn't see any fb internal references to NodePattern or QuantizerCls that would cause issues. Test Plan: python test/test_public_bindings.py Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/86031 Approved by: https://github.com/jerryzh168	2022-10-12 20:06:30 +00:00
Hao Li	aa40503954	Add Custom Module Support List (#82606 ) Summary: Add a global custon module support list for the users to specify the modules they want the equalization process support. To use this list, import it from the _equalize.py file and append module in it. Unittest passed to check global support list: https://pxl.cl/28RKG Test Plan: buck1 test mode/dev //on_device_ai/odai/tests/transforms:test_transforms -- --exact 'on_device_ai/odai/tests/transforms:test_transforms - test_custom_support_list (on_device_ai.odai.tests.transforms.test_input_weight_for_turing.TestInputWeight)' Reviewed By: jerryzh168 Differential Revision: D38264244 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82606 Approved by: https://github.com/HDCharles	2022-08-03 17:48:51 +00:00
Vasiliy Kuznetsov	7b4e92acef	fx quant: refactor qconfig setting out of find_matches Summary: Refactors `find_matches` function to only find subgraph matches and not assign qconfigs to them. Moves the qconfig assignment outside of the function. No logic change. This will useful for prototyping future tools for quantizing parts of the model. These tools will need to know the matches and will reuse the `find_matches` function, but they will assign their own qconfigs to them using a different strategy. Test plan: ``` python test/test_quantization.py -k Fx ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/79713 Approved by: https://github.com/jerryzh168	2022-06-17 18:52:00 +00:00
Jerry Zhang	74454bdb46	[quant][fx] Move backend_config folder to torch.ao.quantization Summary: Following https://github.com/pytorch/rfcs/blob/master/RFC-0019-Extending-PyTorch-Quantization-to-Custom-Backends.md we implemented the backend configuration for fbgemm/qnnpack backend, currently it was under fx folder, but we'd like to use this for all different workflows, including eager, fx graph and define by run quantization, this PR moves it to torch.ao.quantization namespace so that it can be shared by different workflows Also moves some utility functions specific to fx to fx/backend_config_utils.py and some files are kept in fx folder (quantize_handler.py and fuse_handler.py) Test Plan: python test/teset_quantization.py TestQuantizeFx python test/teset_quantization.py TestQuantizeFxOps python test/teset_quantization.py TestQuantizeFxModels python test/test_quantization.py TestAOMigrationQuantization python test/test_quantization.py TestAOMigrationQuantizationFx Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75823 Approved by: https://github.com/vkuzo	2022-04-19 15:38:57 +00:00
Jerry Zhang	975c9f15bd	[quant] Rename _convert_do_not_use.py to convert.py (#74322 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/74322 att, also change all references to _convert_do_not_use Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestAOMigrationQuantizationFx Imported from OSS Reviewed By: andrewor14 Differential Revision: D34936430 fbshipit-source-id: c96fb887847383bf47f0ec4219127e96e2b63b2d (cherry picked from commit 8ad5a9e031e6ca4ede2656d9b2f7906a82b57c1c)	2022-03-17 18:57:08 +00:00
Jerry Zhang	a6bed4deaa	[quant][fx] Remove convert.py since it is not used now (#74276 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/74276 Removing convert.py since we have rerouted the traffic to _convert_do_not_use, we'll do a rename in the follow up PR Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: vkuzo Differential Revision: D34914261 fbshipit-source-id: 09ad520d95fa91c525222a69474930efb3571088 (cherry picked from commit 8aeb33206f3572132356fe78395aa3ce6aff11cd)	2022-03-17 18:57:08 +00:00
Charles David Hernandez	c1d070d0f0	[ao] Fixing obs insertion through dtype propagation (#73274 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73274 As noticed in https://discuss.pytorch.org/t/calibration-of-model-in-post-training-static-quantization-using-fx-api/143661/6 and related to https://github.com/pytorch/pytorch/issues/72698 when using fx quantizaiton, if an op like view was used in a model and the index parameters were passed in to the ops with a variable rather than hard coded, fx would mistakenly insert observers for them, leading to an error when the observer tried to do tensor only operations on a non-tensor. To fix this, an API was added to specify non tensor arguments for various ops to enable better dtype propagation. NON_TENSOR_ARG_DICT is a nested dict whose first key is a named tuple which contains matching parameters for ops with nontensor args, the inner dict's keys are dtypes and the values are a list of those arg indices that take use such dtypes. Alternatively, instead of a list, the inner dict value can also be a function that takes the node as an argument and returns the list of arg indices. Theoretically this api can support arbitrary functions but the current implmentation is limited to simpler functions given the particular issue this fixes seems to be rare. Note: although torch.unsqueeze and torch.transpose are listed in quantization_patterns.py, those ops appear to be untraceable by fx. I've included tests for their cases but fixing this issue is beyond the scope of this PR Test Plan: python test/test_quantization.py test_non_reference_size ... python test/test_quantization.py test_non_reference_<op> Imported from OSS Reviewed By: jerryzh168 Differential Revision: D34410122 fbshipit-source-id: fc09949ca8a2d6473876a4b6c214eb91e9a9dae2 (cherry picked from commit 3a1375d677b7c98d62b1f5c839645698c39b32b9)	2022-03-16 01:41:17 +00:00
Jerry Zhang	d39ad0543a	[quant][fx] Remove Fuser class in fusion implementation (#73470 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73470 att, this does not affect user apis since we are only exposing fuse_fx as a public api Test Plan: python test/test_quantization.py TestFuseFx Imported from OSS Reviewed By: vkuzo Differential Revision: D34495260 fbshipit-source-id: 3aa253bc7190e50acc7229186f210901ebc5481b (cherry picked from commit a88517ff6feff7abbece2234d82fd53e33702237)	2022-03-01 09:29:21 +00:00
Vasiliy Kuznetsov	b999f87503	fx quant: move _parent_name to common utils (#69720 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69720 This function is also useful for DBR quant, moving it from FX utils to common utils. Test Plan: ``` python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeDBR ``` Reviewed By: jerryzh168 Differential Revision: D33003473 Pulled By: vkuzo fbshipit-source-id: 20360682c69d614a645c14fc29d3ee023d6b2623	2021-12-17 05:59:46 -08:00
Jerry Zhang	a73c6a45b6	[reland][quant][graphmode][fx] Enable fuse handler for sequence of 3 ops (#70006 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/70006 reland: fixing some mypy errors that was missed before This PR enables fuse handler for sequence of three ops, and merges all fuse handlers into one TODO: we can also move this to backend_config_dict folder Test Plan: regression fusion test ``` python test/test_quantization.py TestFuseFx ``` Imported from OSS Imported from OSS Reviewed By: supriyar Differential Revision: D33144606 fbshipit-source-id: ca34f282018a0fb4d04c7e35119eaf2d64258e78	2021-12-16 15:04:16 -08:00
Alban Desmaison	6f9844693f	Revert D32974907: [quant][graphmode][fx] Enable fuse handler for sequence of 3 ops Test Plan: revert-hammer Differential Revision: D32974907 (`bf089840ac`) Original commit changeset: ba205e74b566 Original Phabricator Diff: D32974907 (`bf089840ac`) fbshipit-source-id: e47838f3008ba014d884aef53460df654f0cf731	2021-12-15 05:46:49 -08:00
Jerry Zhang	bf089840ac	[quant][graphmode][fx] Enable fuse handler for sequence of 3 ops (#69658 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69658 This PR enables fuse handler for sequence of three ops, and merges all fuse handlers into one TODO: we can also move this to backend_config_dict folder Test Plan: regression fusion test ``` python test/test_quantization.py TestFuseFx ``` Imported from OSS Reviewed By: vkuzo Differential Revision: D32974907 fbshipit-source-id: ba205e74b566814145f776257c5f5bb3b24547c1	2021-12-14 19:04:21 -08:00
Vasiliy Kuznetsov	d549c8de78	fx quant: enable linear-bn1d fusion for PTQ (#66484 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66484 https://github.com/pytorch/pytorch/pull/50748 added linear - bn1d fusion in Eager mode, for PTQ only. This PR also enables this in FX graph mode. We reuse the existing conv-bn-relu fusion handler, renaming `conv` to `conv_or_linear` for readability. The QAT version is saved for a future PR, for both eager and FX graph. Test Plan: ``` python test/test_quantization.py TestFuseFx.test_fuse_linear_bn_eval ``` Imported from OSS Reviewed By: bdhirsh Differential Revision: D31575392 fbshipit-source-id: f69d80ef37c98cbc070099170e335e250bcdf913	2021-10-18 10:14:28 -07:00
Jerry Zhang	508845f2b5	[quant] AO migration of the `torch/quantization/quantize_fx.py` and `torch/quantization/fx/` (#65033 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65033 1. Move the file: ``` hg mv caffe2/torch/quantization/fx caffe2/torch/ao/quantization/fx hg mv caffe2/torch/quantization/quantize_fx.py caffe2/torch/ao/quantization/quantize_fx.py ``` 2. Create new files ``` touch caffe2/torch/quantization/quantize_fx.py touch caffe2/torch/quantization/fx/__init__.py ``` 3. import things in the new files 4. add tests to test/quantization/ao_migration/test_quantization_fx.py this is because we have some fx import in quantize_fx and fx/.py Test Plan: buck test mode/dev //caffe2/test:quantization Reviewed By: vkuzo, z-a-f Differential Revision: D30949749 fbshipit-source-id: 9e5d4d039c8a0a0820bc9040e224f0d2c26886d3	2021-09-22 09:29:15 -07:00
Jerry Zhang	14347d0dd5	[quant][fx][graphmode] Fix a bug for sub (#65109 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65109 Previously for sub we set the dtype for sub with qconfig since it's matched with a QuantizeHandler, however this is incorrect, the dtype for sub is decided by whether the output is quantized or not, so we added a check of is_output_quantized while deciding the dtype for the output of sub. Later: is_output_quantized now depends on is_reference, which is pretty confusing and it may cause problems down the road, we should remove this dependency in the future. Test Plan: python test/test_quantization.py TestQuantizeFx.test_sub_scalar Imported from OSS Reviewed By: vkuzo Differential Revision: D30977826 fbshipit-source-id: 551fd63bd61b43b3c3415944ff73174e3a21cc8a	2021-09-20 10:36:09 -07:00
Jerry Zhang	670853295a	[quant][tensorrt] Add tensorrt backend config (#64623 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64623 The config api will change, but we'll add configs gradually for TensorRT to unblock experimentation Test Plan: python torch/fx/experimental/fx2trt/example/unittests.py Imported from OSS Reviewed By: vkuzo Differential Revision: D30800474 fbshipit-source-id: 3c4640de1205a0f19b62943ab84f386d80394ec2	2021-09-14 15:27:33 -07:00
Vasiliy Kuznetsov	1577c106dc	torch.ao migration: numeric suite, eager and fx (#64817 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64817 This migrates `torch.quantization._numeric_suite` to `torch.ao.ns._numeric_suite`, and `torch.quantization._numeric_suite_fx` to `torch.ao.ns._numeric_suite_fx`. 1. move the files ``` HG: move eager mode hg mv caffe2/torch/quantization/_numeric_suite.py caffe2/torch/ao/ns/ HG: move fx hg mv caffe2/torch/quantization/_numeric_suite_fx.py caffe2/torch/ao/ns/ hg mv caffe2/torch/quantization/ns/* caffe2/torch/ao/ns/fx/ ``` 2. create new versions of `_numeric_suite.py` and `_numeric_suite_fx.py` with imports 3. update all FB callsites Test Plan: buck test mode/dev //caffe2/test:quantization Reviewed By: z-a-f Differential Revision: D30867538 fbshipit-source-id: 120ee830434ca490c1183a187a518eebcbbaf22c	2021-09-12 12:00:45 -07:00
Jerry Zhang	d4a86c1f3b	[quant][fx2trt] Add lowering support for reference linear/conv modules (#64368 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64368 Test Plan: python torch/fx/experimental/fx2trt/example/quantized_resnet_test.py Imported from OSS Reviewed By: 842974287 Differential Revision: D30708738 fbshipit-source-id: 88142b7ce43ed96093597112dab03a2d277de993	2021-09-10 22:25:27 -07:00
Jerry Zhang	ef2c9d7d8a	[quant][fix] Fix quantization for sub_scalar (#64603 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64603 We'll insert observer only when both the operator and dtype is supported Test Plan: python test/test_quantization.py TestQuantizeFx.test_sub_scalar Imported from OSS Reviewed By: vkuzo Differential Revision: D30797025 fbshipit-source-id: a77c21e2749405534fc245374cf33a0657a3d2c8	2021-09-09 17:18:31 -07:00
Zafar Takhirov	9cc44aad21	[quant] AO migration of the `quantize.py` (resubmission) (#64445 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64445 AO Team is migrating the existing torch.quantization into torch.ao.quantization. We are doing it one file at a time to make sure that the internal callsites are updated properly. This migrates the quantize.py from torch.quantization to torch.ao.quantization. At this point both locations will be supported. Eventually the torch.quantization will be deprecated. Test Plan: `buck test mode/dev //caffe2/test:quantization` Reviewed By: HDCharles Differential Revision: D30734870 fbshipit-source-id: dc204f3cc46bff2cc81c95159eab9d333b43bb4b	2021-09-08 04:58:47 -07:00
Zafar Takhirov	046ed57a4d	Revert D30055886: [quant] AO migration of the `quantize.py` Test Plan: revert-hammer Differential Revision: D30055886 (`44e3ed88c9`) Original commit changeset: 8ef7470f9fa6 fbshipit-source-id: c5bd3ead43a2d44b9e56872ec5bd7a195bdac725	2021-09-02 16:59:59 -07:00
Jerry Zhang	ed89937d2c	[quant][graphmode][fx] Add fbgemm backend_config_dict (#64288 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64288 This is just to setup the file structure and unblock experimentation. The format for backend_config_dict will change in the future Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: zou3519 Differential Revision: D30699457 fbshipit-source-id: 28211a4def05d34757850c045a36e311f54760fe	2021-09-01 16:32:43 -07:00
Jerry Zhang	7ffcf15503	[quant][graphmode][api] Add backend_config_dict to prepare_fx api (#64135 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64135 We want to start aligning the api with the design in https://github.com/pytorch/pytorch/wiki/Extending-PyTorch-Quantization-to-Custom-Backends We plan to gradually move things from `prepare_custom_config_dict` and `convert_custom_config_dict` to `backend_config_dict` and allow custom backend developer to define their own way of quantizing operators. Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: zou3519 Differential Revision: D30699456 fbshipit-source-id: e3c068da8d3da2270f57719f7159cc71cafa8598	2021-09-01 15:32:47 -07:00
Jerry Zhang	8f88f797db	[quant][graphmode][fx] Add reference quantized conv module (#63828 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63828 Added reference quantized conv module for the custom backend flow, the reference quantized module will have the following code: ``` w(float) -- quant - dequant \ x(float) ------------- F.conv2d --- ``` In the full model, we will see ``` w(float) -- quant - dequant \ x -- quant --- dequant -- F.conv2d --- quant - dequant ``` and the backend should be able to fuse the ops with `*` into a quantized linear Test Plan: python test/test_quantization.py TestQuantizeFx.test_conv_linear_reference Imported from OSS Reviewed By: vkuzo Differential Revision: D30504749 fbshipit-source-id: e1d8c43a0e0d6d9ea2375b8ca59a9c0f455514fb	2021-08-30 14:23:17 -07:00
Zafar Takhirov	44e3ed88c9	[quant] AO migration of the `quantize.py` (#64086 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64086 AO Team is migrating the existing torch.quantization into torch.ao.quantization. We are doing it one file at a time to make sure that the internal callsites are updated properly. This migrates the `quantize.py` from torch.quantization to `torch.ao.quantization`. At this point both locations will be supported. Eventually the torch.quantization will be deprecated. Test Plan: `buck test mode/opt //caffe2/test:quantization` Reviewed By: jerryzh168, raghuramank100 Differential Revision: D30055886 fbshipit-source-id: 8ef7470f9fa640c0042bef5bb843e7a05ecd0b9f	2021-08-29 20:30:01 -07:00
Jerry Zhang	0d0605eaa9	[quant][graphmode][fx] Add reference quantized linear module (#63627 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63627 Added reference quantized linear module for the custom backend flow, the reference quantized module will have the following code: ``` w(float) -- quant - dequant \ x(float) ------------- F.linear --- ``` In the full model, we will see ``` w(float) -- quant - dequant \ x -- quant --- dequant -- F.linear --- quant - dequant ``` and the backend should be able to fuse the ops with `*` into a quantized linear Test Plan: python test/test_quantization.py TestQuantizeFx.test_conv_linear_reference Imported from OSS Reviewed By: vkuzo Differential Revision: D30504750 fbshipit-source-id: 5729921745c2b6a0fb344efc3689f3b170e89500	2021-08-27 22:53:24 -07:00
Supriya Rao	294db0603f	[quant] Add support for linear_relu fusion for FP16 dynamic quant (#63826 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63826 Support the conversion of the intrinsic linearRelu module to the quantized dynamic LinearReLU module Verify the support works for both linear module and functional linear fusion Test Plan: python test/test_quantization.py test_dynamic_with_fusion Imported from OSS Reviewed By: iramazanli Differential Revision: D30503513 fbshipit-source-id: 70446797e9670dfef7341cba2047183d6f88b70f	2021-08-26 21:12:06 -07:00
Supriya Rao	c7027f19ef	[quant][fx] Add support for dynamic linear + relu fusion (INT8) (#63799 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63799 Add a new module that can be used for module swap with the nni.LinearReLU module in convert function. Supports INT8 currently (since FP16 op doesn't have relu fusion yet). Fixes #55393 Test Plan: python test/test_quantization.py test_dynamic_fusion Imported from OSS Reviewed By: heitorschueroff Differential Revision: D30502812 fbshipit-source-id: 3668e4f001a0626d469e17ac323acf582ee28a51	2021-08-26 21:10:46 -07:00
Jerry Zhang	0301c3bc01	[quant][graphmode][fx] Make maxpool and flatten produce the reference pattern (#63501 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63501 Currently some of the ops are considered as working with both float and quantized input, so we may have things like "quant - some_op - dequant" this might not work well with the backend, we may consider change everything to produce "quant - dequant - some_op - quant - dequant" instead in the future, this PR fixes it for maxpool and flatten only to unblock resnet benchmarking on TensorRT Test Plan: python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: mruberry Differential Revision: D30402788 fbshipit-source-id: 892c5ff6552775070e2c1453f65846590fb12735	2021-08-24 21:31:01 -07:00
Jerry Zhang	c8527bc398	[qunat][graphmode][fx] Add a separate lower_to_native_backend function for relu (#62861 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62861 This PR adds a lower_to_native_backend function to lower a quantized reference model to a model that uses fbgemm/qnnpack ops. We'll gradually add support and remove the fbgemm/qnnpack specific handling in quantization_patterns.py Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: vkuzo Differential Revision: D30165828 fbshipit-source-id: de1149cd7e7c1840c17c251cd4d35004afd015b7	2021-08-24 21:07:03 -07:00
Jerry Zhang	5b28e3c183	[quant][graphmode][fx] Add reference option support for binary ops (#62698 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62698 We also removed the special handling in match_utils for binary ops Test Plan: python test/test_quantize.py TestQuantizeFx python test/test_quantize.py TestQuantizeFxOps Imported from OSS Reviewed By: vkuzo Differential Revision: D30093781 fbshipit-source-id: 58cc972de8211a80dd4d111e25dc4ad36057933f	2021-08-24 18:22:11 -07:00
Charles David Hernandez	6c3ebccc00	Updating the names of these functions (#63513 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63513 updating these names per Jerry's nits in the previous pr Test Plan: Imported from OSS Reviewed By: jerryzh168 Differential Revision: D30406710 fbshipit-source-id: a9f1577a2b8c4a93f5005e0f6278b7d7348d8b66	2021-08-19 13:34:34 -07:00
Charles David Hernandez	877e6f2be3	Bugfix for fuse qconfig comparison (#63384 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63384 In some cases the changes to qconfig on module would cause the fusions to fail. This bugfix solves that problem by adding a qconfig_function_comparison that compares the functions within the qconfig rather than the modules the qconfigs are on. The comparison looks at the partial object within QConfig.activation/weight.p and compares args, keywords and func. This is necessary to do mannually because partial doesn't have __eq__ implemented and so == reverts to is. Test Plan: python test/test_quantization.py TestFuseFx.test_problematic_fuse_example Imported from OSS Reviewed By: supriyar, ejguan Differential Revision: D30386264 fbshipit-source-id: 51e358c021c39d6f48dc12ad2a82b2838677b9de	2021-08-18 13:31:56 -07:00
Jerry Zhang	cd5e9dcc1d	[quant][graphmode][fx][fix] Fix quantization for tuple arguments (#63376 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63376 Previously when tuple is an argument for a quantizable op it would be transformed to a list by mistake, this PR fixes that. Test Plan: python test/test_quantization.py TestQuantizeFx.test_preserve_tuple Imported from OSS Reviewed By: raghuramank100 Differential Revision: D30357642 fbshipit-source-id: 82d10805d9c00c003cc99983dca68b6455ff7b2e	2021-08-17 17:01:24 -07:00
Jerry Zhang	bcddc71f26	[quant][graphmode][fx][bc-breaking] Support for reference pattern for fixqparam ops in eval mode (#62608 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62608 Insert extra fixeqparam fake quant in the output of fixed qparam ops in fbgemm e.g. sigmoid so that we can produce reference patterns for these ops Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: iramazanli Differential Revision: D30053978 fbshipit-source-id: c527944b6e791bb4d45ebe96265af52794203695	2021-08-17 14:42:40 -07:00
Supriya Rao	b0396e39f4	[quant][fx] Ensure qconfig works for QAT with multiple modules (#63343 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63343 The previous implementation had a bug where we were trying to modify an ordered dict value while iterating through it. This fixes it by creating a copy before modifying it. Test Plan: python test/test_quantization.py TestQuantizeFx.test_qconfig_qat_module_type Imported from OSS Reviewed By: raghuramank100 Differential Revision: D30346116 fbshipit-source-id: 0e33dad1163e8bff3fd363bfd04de8f7114d7a3a	2021-08-17 11:40:51 -07:00
Jerry Zhang	990c2190d1	[quant][graphmode] Reference pattern support for elu (#62607 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62607 Removing the quantize handler for elu since it can be covered by DefaultNodeQuantizeHandler Test Plan: python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: iramazanli Differential Revision: D30053977 fbshipit-source-id: 426789443e928bb01a88907de616cbda5866f621	2021-08-10 14:00:39 -07:00
Jerry Zhang	cb7f35d47a	[quant][refactor] Checking activation_dtype instead of activation_post_process (#62489 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62489 Addressing comment from previous PR: https://github.com/pytorch/pytorch/pull/62374#discussion_r679354145 Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: iramazanli Differential Revision: D30053980 fbshipit-source-id: 79c216410282eccd6f0a8f24e38c55c4d18ec0d0	2021-08-10 12:17:36 -07:00
Jerry Zhang	3c1d1170a4	[quant][graphmode][fx] Attach a weight qparam dict to linear and conv in reference quantized model (#62488 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62488 Instead of attaching weight observer/fake_quant to the float linear and conv, we can compute the quantization parameters and attach that as a dictionary to these modules so that we can reduce the model size and make the reference module clearer TODO: the numerics for linear and conv in reference quantized model is still not correct since we did not quantize weight, we may explore things like parameterization to implement this support Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps Imported from OSS Reviewed By: vkuzo Differential Revision: D30053979 fbshipit-source-id: b5f8497cf6cf65eec924df2d8fb10a9e154b8cab	2021-08-09 16:55:14 -07:00
Angela Yi	d9154b9b26	[quant] Input-Weight Equalization - allow logical evaluation (#61603 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61603 Test Plan: Imported from OSS Reviewed By: supriyar Differential Revision: D29686878 fbshipit-source-id: 67ca4cab98b3d592ff2bb8db86499789b85bd582	2021-08-06 15:10:32 -07:00
Angela Yi	836b2431dc	[quant] Input-Weight Equalization - selective equalization (#61916 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61916 Functions used to run selective equalization based on the SQNR obtained from running the Numeric Suite. After running the Numeric Suite between the equalized and float model, we will get the SQNR between the two models and construct an equalization_qconfig_dict that specifies to only equalize the layers with the highest quantization errors. How to run: ``` layer_to_sqnr_dict = get_layer_sqnr_dict(float_model, equalized_model, input) eq_qconfig_dict = get_equalization_qconfig_dict(layer_to_sqnr_dict, equalized_model, num_layers_to_equalize) prepared = prepare_fx(float_model, qconfig_dict, eq_qconfig_dict) ... ``` Test Plan: `python test/test_quantization.py TestEqualizeFx.test_selective_equalization` Imported from OSS Reviewed By: supriyar Differential Revision: D29796950 fbshipit-source-id: 91f0f8427d751beaea32d8ffc2f3b8aa8ef7ea95	2021-08-06 09:29:03 -07:00
Angela Yi	91ef19309e	[quant] Input-weight equalization - branch support (#62366 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62366 In the case of models with branches, we are unable to equalize the branching part in the graph. For example, given this graph: ``` conv2 / \ x -> conv1 -> add ``` After prepare, we will ignore the branched layers (conv1 and conv2) and will not insert the equalization observers. A warning message will also be printed with the layers that are unable to be equalized. ``` conv2 -> out_quant_obs2 / \ x -> input_quant_obs -> conv1 -> out_quant_obs1 -> add ``` Test Plan: `python test/test_quantization.py TestEqualizeFx.test_input_weight_equalization_prepare` Imported from OSS Reviewed By: malfet, supriyar Differential Revision: D29982585 fbshipit-source-id: 706297e7f1861975998dfa83e7ca59af09d80618	2021-08-03 12:45:25 -07:00

1 2 3 4 5 ...

339 Commits