This reverts commit 59071ab1e7.
It breaks `quantization.jit.test_ondevice_quantization.TestOnDeviceDynamicPTQFinalize`, which is not run in OSS, but is mandatory for internal CI.
Summary:
This PR deprecates the `compute_dtype` field on observers, and replaces
it with the `is_dynamic` field on observers. This is better aligned
with the reference model spec.
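For illustration, a minimal sketch of what the change means for a dynamic-quantization activation observer (observer class and argument names follow the current `torch.ao.quantization` observer API and are meant as an example, not the exact diff):
```
import torch
from torch.ao.quantization.observer import PlaceholderObserver

# Before: dynamic activation quantization was signalled via compute_dtype.
old_dynamic_act_observer = PlaceholderObserver.with_args(
    dtype=torch.float, compute_dtype=torch.quint8)

# After: the same intent is expressed with is_dynamic, which matches the
# reference model spec (dtype is the dtype passed to the quantize op).
new_dynamic_act_observer = PlaceholderObserver.with_args(
    dtype=torch.quint8, is_dynamic=True)
```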
Test plan:
```
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85431
Approved by: https://github.com/jerryzh168
Summary: In both eager and FX graph mode quantization,
`torch.ao.nn.quantizable.LSTM` is used as an observed custom module,
which is responsible for inserting its own observers. By default,
the user specifies a single QConfig for the custom module (either
through QConfigMapping or by setting the "qconfig" attribute),
and all inner ops will [inherit this
QConfig](dc00bb51b8/torch/ao/nn/quantizable/modules/rnn.py (L366-L378))
and use the same observer/fake_quantize constructors.
Today, users who wish to override this behavior must extend
`torch.ao.nn.quantizable.LSTM` and write a lot of custom code
to manually assign the QConfigs to the inner ops. This commit
alleviates this burden on the user by providing a helper function
to assign QConfigs with custom observers. An example use case of
this is providing a reference implementation for a backend kernel
that hardcodes qparams for efficiency.
Example usage:
```
import torch
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.observer import FixedQParamsObserver
from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx
from torch.ao.quantization.fx.custom_config import (
    PrepareCustomConfig,
    ConvertCustomConfig,
)

class MyModel(torch.nn.Module):
    ...

class UserLSTM(torch.ao.nn.quantizable.LSTM):
    @classmethod
    def from_float(cls, other):
        assert isinstance(other, cls._FLOAT_MODULE)
        linear_output_obs_ctr = FixedQParamsObserver.with_args(
            scale=2 ** -11, zero_point=2 ** 15, dtype=torch.qint32)
        sigmoid_obs_ctr = FixedQParamsObserver.with_args(
            scale=2 ** -16, zero_point=0, dtype=torch.qint32)
        tanh_obs_ctr = FixedQParamsObserver.with_args(
            scale=2 ** -15, zero_point=2 ** 15, dtype=torch.qint32)
        cell_state_obs_ctr = FixedQParamsObserver.with_args(
            scale=2 ** -11, zero_point=0, dtype=torch.qint32)
        hidden_state_obs_ctr = FixedQParamsObserver.with_args(
            scale=2 ** -7, zero_point=2 ** 7, dtype=torch.quint8)
        return torch.ao.quantization.utils._get_lstm_with_individually_observed_parts(
            float_lstm=other,
            linear_output_obs_ctr=linear_output_obs_ctr,
            sigmoid_obs_ctr=sigmoid_obs_ctr,
            tanh_obs_ctr=tanh_obs_ctr,
            cell_state_obs_ctr=cell_state_obs_ctr,
            hidden_state_obs_ctr=hidden_state_obs_ctr,
        )

qconfig_mapping = get_default_qconfig_mapping()
example_inputs = (torch.rand(5, 3, 50), torch.rand(1, 3, 50), torch.randn(1, 3, 50))
prepare_custom_config = PrepareCustomConfig() \
    .set_float_to_observed_mapping(torch.nn.LSTM, UserLSTM)
convert_custom_config = ConvertCustomConfig() \
    .set_observed_to_quantized_mapping(UserLSTM, torch.ao.nn.quantized.LSTM)
model = MyModel()
model = prepare_fx(model, qconfig_mapping, example_inputs, prepare_custom_config=prepare_custom_config)
model(*example_inputs)  # calibrate
model = convert_fx(model, convert_custom_config=convert_custom_config)
model(*example_inputs)
```
Test Plan:
python test/test_quantization.py TestQuantizeFx.test_static_lstm_with_custom_fixed_qparams
Reviewers: jerryzh168, vkuzo
Subscribers: jerryzh168, vkuzo
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88456
Approved by: https://github.com/jerryzh168, https://github.com/vkuzo
Summary:
`_convert_to_reference_decomposed` is a private convert function in the FX graph mode quantization flow that converts
a calibrated/trained model to a reference quantized model with a decomposed quantized tensor representation.
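A minimal usage sketch (assuming the helper is exposed as `_convert_to_reference_decomposed_fx` in `torch.ao.quantization.quantize_fx`, matching the test name below):
```
import torch
from torch.ao.quantization import get_default_qconfig_mapping
from torch.ao.quantization.quantize_fx import (
    prepare_fx,
    _convert_to_reference_decomposed_fx,
)

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(5, 5)

    def forward(self, x):
        return self.linear(x)

example_inputs = (torch.randn(1, 5),)
model = prepare_fx(M().eval(), get_default_qconfig_mapping(), example_inputs)
model(*example_inputs)  # calibrate
# Produces a reference quantized model whose quantize/dequantize steps use the
# decomposed representation (ops on plain tensors instead of quantized tensors).
model = _convert_to_reference_decomposed_fx(model)
```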
Test Plan:
python test/test_quantization.py TestQuantizeFx.test__convert_to_reference_decomposed_fx
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87094
Approved by: https://github.com/andrewor14
Summary: the main problem here was that the various objects defined simply as
'Any' should theoretically be public, but making them public either A) results
in an error about the module being 'typing' rather than whatever module it
should be, or B) requires setting the module manually, thereby changing the
module for the original 'Any' class.
Note: QuantizeHandler has a similar issue, as it is simply defined as 'Any'.
Pattern was defined in multiple places, which was causing issues, so I moved it to a single
place, given that the note at the top of quantization_types.py indicates
these definitions should be moved to utils at some point anyway.
Finally, I changed any references to these objects to point at the
correct locations. Note: I didn't see any fb internal references to
NodePattern or QuantizerCls that would cause issues.
Test Plan: python test/test_public_bindings.py
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/86031
Approved by: https://github.com/jerryzh168
Summary: include `torch.qint32` in `activation_is_statically_quantized` and `get_quant_type` so that a fake quantize with `dtype=torch.qint32` won't be skipped
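For example, a qconfig along the lines of the sketch below (scale/zero_point values are illustrative only) is now treated as statically quantized instead of being skipped:
```
import torch
from torch.ao.quantization import QConfig, default_weight_observer
from torch.ao.quantization.observer import FixedQParamsObserver

# Activation observer with dtype=torch.qint32; after this change
# activation_is_statically_quantized/get_quant_type recognize it as static.
qint32_act_qconfig = QConfig(
    activation=FixedQParamsObserver.with_args(
        scale=2 ** -16, zero_point=0, dtype=torch.qint32),
    weight=default_weight_observer,
)
```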
Test Plan: updated `test_custom_module_class`
Differential Revision: D40128178
Pull Request resolved: https://github.com/pytorch/pytorch/pull/86345
Approved by: https://github.com/jerryzh168
Summary:
Before this PR, the `dtype` attribute of observers was not clearly
defined. It originally meant `interface_dtype` in the eager mode
workflow, which is how the codebase before this PR was using it.
In the new reference model spec, the `dtype` attribute of an observer
represents the `dtype` value that needs to be passed into a `quantize`
function. This PR aligns the codebase to this definition of dtype. In detail:
1. change util functions to interpret `dtype` using the reference model definition
2. change `prepare` to interpret `dtype` using the reference model definition
3. change observers for dynamic quantization to interpret `dtype` using the reference model definition
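As a concrete illustration of item 3, a sketch for a dynamic-quantization activation observer (observer and argument names follow the `torch.ao.quantization` observer API; values are illustrative):
```
import torch
from torch.ao.quantization.observer import PlaceholderObserver

# Old interpretation: dtype was the interface dtype seen at the op boundary,
# so a dynamically quantized activation observer carried dtype=torch.float.
old_interpretation = PlaceholderObserver.with_args(
    dtype=torch.float, compute_dtype=torch.quint8)

# New interpretation (reference model spec): dtype is what gets passed to the
# quantize function, so the same observer now carries dtype=torch.quint8.
new_interpretation = PlaceholderObserver.with_args(
    dtype=torch.quint8, compute_dtype=torch.quint8)
```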
A future PR (left out of this one to keep LOC small) will deprecate the
`compute_dtype` field and instead expose `is_dynamic` on observers.
"
Test plan:
```
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
```
Differential Revision: [D39675209](https://our.internmc.facebook.com/intern/diff/D39675209)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85345
Approved by: https://github.com/z-a-f, https://github.com/jerryzh168
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79066
Following https://github.com/pytorch/pytorch/pull/78452,
this commit replaces the following config dicts with python objects:
- prepare_custom_config_dict -> PrepareCustomConfig
- convert_custom_config_dict -> ConvertCustomConfig
- fuse_custom_config_dict -> FuseCustomConfig
This leads to better type safety and better user experience in
notebook settings due to improved auto completion. The new APIs
are as follows:
```
from torch.ao.quantization.fx.custom_config import (
    PrepareCustomConfig,
    ConvertCustomConfig,
)
from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx

prepare_custom_config = PrepareCustomConfig() \
    .set_float_to_observed_mapping(float_class, observed_class) \
    .set_non_traceable_module_names(["mod1", "mod2"]) \
    .set_non_traceable_module_classes([class1, class2]) \
    .set_input_quantized_indexes([0, 1]) \
    .set_output_quantized_indexes([0]) \
    .set_preserved_attributes(["attr1", "attr2"])

convert_custom_config = ConvertCustomConfig() \
    .set_observed_to_quantized_mapping(observed_class, quantized_class) \
    .set_preserved_attributes(["attr1", "attr2"])

model = prepare_fx(
    model,
    qconfig_mapping,
    example_inputs,
    prepare_custom_config=prepare_custom_config)
model(data)
model = convert_fx(model, convert_custom_config=convert_custom_config)
```
For backwards compatibility, prepare_fx, prepare_qat_fx, and
convert_fx will continue to accept Dicts, which will be converted
to the relevant *CustomConfig object internally.
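For instance, a sketch of the backwards-compatibility path (assuming the config classes expose a `from_dict` constructor, which is how the internal conversion is expected to work):
```
from torch.ao.quantization.fx.custom_config import PrepareCustomConfig

# A legacy prepare_custom_config_dict is converted into the new object form.
legacy_dict = {"preserved_attributes": ["attr1", "attr2"]}
prepare_custom_config = PrepareCustomConfig.from_dict(legacy_dict)
assert prepare_custom_config.preserved_attributes == ["attr1", "attr2"]
```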
Note that this commit does not modify existing tests to use the
new API; they will continue to pass in Dicts as before, which still
works but triggers a deprecation warning. This will be handled in
a future commit.
Differential Revision: [D37088095](https://our.internmc.facebook.com/intern/diff/D37088095/)
Approved by: https://github.com/jerryzh168
Summary:
After https://github.com/pytorch/pytorch/pull/77608, `example_inputs` is a required input for `prepare_fx` and `prepare_qat_fx`.
This makes quantizing submodules harder, so we added this utility function to get a dictionary from fqn to submodule example_inputs.
Example Call:
```
example_inputs = (tensor0,)
get_fqn_to_example_inputs(m, example_inputs)
```
Example output:
```
{
"linear1": (tensor1,),
"linear2": (tensor2,),
"sub": (tensor3,),
"sub.linear1": (tensor4,),
...
}
```
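A sketch of a model shape that would produce a mapping like the one above (assuming the helper lives in `torch.ao.quantization.utils`):
```
import torch
from torch.ao.quantization.utils import get_fqn_to_example_inputs

class Sub(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear1 = torch.nn.Linear(5, 5)

    def forward(self, x):
        return self.linear1(x)

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear1 = torch.nn.Linear(5, 5)
        self.linear2 = torch.nn.Linear(5, 5)
        self.sub = Sub()

    def forward(self, x):
        return self.sub(self.linear2(self.linear1(x)))

example_inputs = (torch.randn(1, 5),)
# Keys are fully qualified names; values are the positional args each
# submodule received during a forward pass with example_inputs.
fqn_to_example_inputs = get_fqn_to_example_inputs(M(), example_inputs)
```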
Test Plan:
python test/test_quantization.py TestUtils
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78286
Approved by: https://github.com/dzdang
Summary:
After https://github.com/pytorch/pytorch/pull/77608, `example_inputs` is a required input for `prepare_fx` and `prepare_qat_fx`.
This makes quantizing submodules harder, so we added this utility function to get a dictionary from fqn to submodule example_inputs.
Example Call:
```
example_inputs = (tensor0,)
get_fqn_to_example_inputs(m, example_inputs)
```
Example output:
```
{
"linear1": (tensor1,),
"linear2": (tensor2,),
"sub": (tensor3,),
"sub.linear1": (tensor4,),
...
}
```
Test Plan:
python test/test_quantization.py TestUtils
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78146
Approved by: https://github.com/vkuzo
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74845
This PR adds support for the quantization flow to detect
parametrized modules and match them using their original module types.
This mainly involved using the new type_before_parametrizations function rather than
type to check for module matching.
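A sketch of why the original type check breaks (this assumes `type_before_parametrizations` is importable from `torch.nn.utils.parametrize`):
```
import torch
from torch.nn.utils import parametrize
from torch.nn.utils.parametrize import type_before_parametrizations

class Identity(torch.nn.Module):
    def forward(self, w):
        return w

linear = torch.nn.Linear(5, 5)
parametrize.register_parametrization(linear, "weight", Identity())

# type() now reports a dynamically generated Parametrized* class, which would
# not match nn.Linear in the quantization pattern tables...
print(type(linear))
# ...while type_before_parametrizations recovers the original class.
assert type_before_parametrizations(linear) is torch.nn.Linear
```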
Test Plan:
python test/test_ao_sparsity.py TestComposability
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D35240274
fbshipit-source-id: 7294d89c9c2e069e51d8b9bafa45c15f92bed124
(cherry picked from commit ed5cdb7b636c42e040d1b4a67b6b94604d06e1ff)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74717
Currently the weight maps to 0 and max_float to 65535 due to incorrect qmin/qmax in the customized qint16 qrange.
The expectation from the set observers is that the integer representation is a signed int16, i.e. -32768 to 32767.
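For reference, a sketch of an observer configured with the intended signed int16 range (this assumes an observer with a customized quantization range on a wider storage dtype; values follow directly from the int16 bounds stated above):
```
import torch
from torch.ao.quantization.observer import MinMaxObserver

# Signed int16: qmin = -2**15 = -32768, qmax = 2**15 - 1 = 32767.
int16_range_observer = MinMaxObserver.with_args(
    dtype=torch.qint32,       # storage dtype; the range below customizes it
    quant_min=-(2 ** 15),
    quant_max=2 ** 15 - 1,
)
```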
Test Plan: Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D35129924
fbshipit-source-id: 924902dd7e64c1218971422ba2451c2a484fd2f4
(cherry picked from commit 95659cdeeec7b3a01a64355244847e211c6dd2a6)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74581
As titled: currently the quant_min/quant_max of the FakeQuantize are not propagated to the observer. We plan to populate them when both are not None.
To do this we need to:
1. Remove the current default quant_min/quant_max values (0/255), as they are not universal across dtypes.
2. Move the upper bound/lower bound check to before the observer is created.
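A sketch of the resulting behavior (values illustrative): when both bounds are given on the fake quantize, they are forwarded to the observer it constructs.
```
import torch
from torch.ao.quantization import FakeQuantize, MovingAverageMinMaxObserver

# quant_min/quant_max specified on FakeQuantize are passed through to the
# underlying observer instead of the observer keeping its 0/255 default.
act_fake_quant = FakeQuantize.with_args(
    observer=MovingAverageMinMaxObserver,
    quant_min=0,
    quant_max=127,
    dtype=torch.quint8,
)()
assert act_fake_quant.activation_post_process.quant_max == 127
```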
Test Plan:
```
[jiaxuzhu@devvm3400.frc0 /data/users/jiaxuzhu/fbsource/fbcode] buck test mode/dev //caffe2/test:quantization -- --exact 'caffe2/test:quantization - test_quant_min_max_override (quantization.core.test_workflow_module.TestFakeQuantize)'
Parsing buck files: finished in 0.8 sec
Downloaded 0/2 artifacts, 0.00 bytes, 100.0% cache miss (for updated rules)
Building: finished in 9.5 sec (100%) 18535/84579 jobs, 2/84579 updated
Total time: 10.3 sec
More details at https://www.internalfb.com/intern/buck/build/1cab97ef-0788-4d06-92ed-a828995e3bde
BUILD SUCCEEDED
Tpx test run coordinator for Facebook. See https://fburl.com/tpx for details.
Running with tpx session id: 24be645e-eebc-45d6-8111-052ef1225fa0
Trace available for this run at /tmp/tpx-20220323-094106.724238-24be645e-eebc-45d6-8111-052ef1225fa0/trace.log
RemoteExecution session id: reSessionID-24be645e-eebc-45d6-8111-052ef1225fa0-tpx
Started reporting to test run: https://www.internalfb.com/intern/testinfra/testrun/5066549674998735
✓ ListingSuccess: caffe2/test:quantization : 483 tests discovered (20.179)
✓ Pass: caffe2/test:quantization - test_quant_min_max_override (quantization.core.test_workflow_module.TestFakeQuantize) (18.896)
Summary
Pass: 1
ListingSuccess: 1
If you need help understanding your runs, please follow the wiki: https://fburl.com/posting_in_tpx_users
Finished test run: https://www.internalfb.com/intern/testinfra/testrun/5066549674998735
```
Reviewed By: jerryzh168
Differential Revision: D34971236
fbshipit-source-id: 4407fd03116a296053256b333f7ce6d28dcc9c42
(cherry picked from commit f6980bccea802f220cc5b6dfe1bf3a3a3eef0a34)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73863
This PR fully aligns the convert function with the design: https://github.com/pytorch/rfcs/blob/master/RFC-0019-Extending-PyTorch-Quantization-to-Custom-Backends.md
and simplifies the implementation of the convert function by always producing a reference quantized model (with reference patterns) first,
and then lowering the model to a quantized model that is runnable with the PyTorch native backends (fbgemm/qnnpack).
This PR makes convert.py much easier to understand than the previous implementation, and we are able to remove the majority of the code
in quantization_patterns.py as well (in follow-up PRs).
Test Plan:
```
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
python test/test_quantization.py TestFXNumericSuiteCoreAPIs
python test/test_quantization.py TestFXNumericSuiteCoreAPIsModels
```
and other internal/oss regression tests
Imported from OSS
Reviewed By: andrewor14
Differential Revision: D34778506
fbshipit-source-id: 0678b66addf736039a8749b352f6f569caca962b
(cherry picked from commit 33ec9caf23f3ab373d827117efbd9db0668b2437)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73493
This PR enables basic support for reference modules in DBR quant.
For now, the support is limited to:
1. modules that have reference versions defined only (no functions)
2. torch.qint32 dtype only
Currently, the reference module logic is enabled whenever dtype is
torch.qint32. This is done because this is needed the earliest for
the first use case. A future PR will support more dtypes and also
add the `is_reference` flag to the API.
Test Plan:
```
python test/test_quantization.py TestQuantizeDBR.test_conv_int32_reference_model
```
Reviewed By: jerryzh168
Differential Revision: D34520759
Pulled By: vkuzo
fbshipit-source-id: 363db715315c5c7c20962a1818330ce288948778
(cherry picked from commit 6ccdfe2889c252211f191edc49f4147f66e803a4)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73344
Not user facing as of now, since we haven't advertised the backend_config_dict API.
We need this in fuser_method_mapping.py; this is to avoid a circular dependency.
Test Plan:
python test/test_quantization.py TestQuantizeFx
Imported from OSS
Reviewed By: VitalyFedyunin
Differential Revision: D34441778
fbshipit-source-id: 7a01c359e4b21e9e98345dc7781f735628209a20
(cherry picked from commit 758537094c5a98a17a8825b3f240c8d5acdd72b0)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71168
In this PR we want to enable the reference path by default for CopyNodeQuantizeHandler
Test Plan:
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeFxOps
Imported from OSS
Reviewed By: andrewor14
Differential Revision: D33715995
fbshipit-source-id: eda44892fcea3a1cba54ac75bc020f73e1becc8c
(cherry picked from commit a2cf63f68d)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70257
Makes dynamic quantization for linear module work in DBR quant.
Coverage for more ops and functionals will be in future PRs.
Test Plan:
```
python test/test_quantization.py -k DBR
```
Reviewed By: jerryzh168
Differential Revision: D33262300
Pulled By: vkuzo
fbshipit-source-id: c1cb0f9dd3f42216ad6ba19f4222b171ff170174
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/69720
This function is also useful for DBR quant, moving it from FX utils
to common utils.
Test Plan:
```
python test/test_quantization.py TestQuantizeFx
python test/test_quantization.py TestQuantizeDBR
```
Reviewed By: jerryzh168
Differential Revision: D33003473
Pulled By: vkuzo
fbshipit-source-id: 20360682c69d614a645c14fc29d3ee023d6b2623
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/68770
Previous fusion only works for a sequence of ops, which is not general enough for fusion patterns
that are defined by a subgraph; this PR refactors fusion to make it more general.
Test Plan:
```
python test/test_quantization.py TestFuseFx
```
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D32602637
fbshipit-source-id: a7897c62081b9d71c67fb56e78484cf68deaacf6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/68769
As titled; we want to use this type in fuser_method_mapping in later PRs.
Test Plan:
No change to logic; just regression tests on CI.
```
python test/test_quantization.py
```
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D32602636
fbshipit-source-id: 15b95241431dfca9b1088d0920bf75705b37aa9a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64919
AO Team is migrating the existing torch.quantization into torch.ao.quantization. We are doing it one file at a time to make sure that the internal callsites are updated properly. This migrates the quantization utilities.
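The practical effect on imports (a sketch; the old `torch.quantization.utils` module is kept as a thin forwarding wrapper so existing callsites keep working):
```
# New canonical location after the migration.
from torch.ao.quantization import utils as ao_quant_utils

# Old location still imports and forwards to the torch.ao namespace.
from torch.quantization import utils as legacy_quant_utils
```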
ghstack-source-id: 138303325
Test Plan: `buck test mode/dev //caffe2/test:quantization`
Reviewed By: jerryzh168
Differential Revision: D30899082
fbshipit-source-id: 85eb38c419e417147e71758b682cd095308dd0c9