Summary:
As titled: add DeprecationWarnings for the torch.ao.quantization APIs (see the test plan below).
Test Plan:
```
(ao) $ PYTHONWARNINGS='default' python
Python 3.10.14 | packaged by conda-forge | (main, Mar 20 2024, 12:45:18) [GCC 12.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from torch.ao.quantization.quantizer.xnnpack_quantizer import XNNPACKQuantizer
printing warning
*/anaconda3/envs/ao/lib/python3.10/site-packages/torch/ao/quantization/__init__.py:36: DeprecationWarning: torch.ao.quantization is deprecated. Plan is to
1. Remove eager mode quantization (torch.ao.quantization.quantize, torch.ao.quantization.quantize_dynamic), please migrate to use torchao eager mode quantize_ API instead
2. Remove fx graph mode quantization (torch.ao.quantization.quantize_fx.prepare_fx, torch.ao.quantization.quantize_fx.convert_fx, please migrate to use torchao pt2e quantization API instead (prepare_pt2e, convert_pt2e)
3. pt2e quantization has been migrated to torchao (https://github.com/pytorch/ao/tree/main/torchao/quantization/pt2e)
see https://dev-discuss.pytorch.org/t/torch-ao-quantization-migration-plan/2810 for more details
warnings.warn(
>>> a = XNNPACKQuantizer()
*/anaconda3/envs/ao/lib/python3.10/site-packages/torch/ao/quantization/quantizer/xnnpack_quantizer.py:281: DeprecationWarning: XNNPACKQuantizer is deprecated! Please use xnnpack quantizer in ExecuTorch (https://github.com/pytorch/executorch/tree/main/backends/xnnpack/quantizer) instead
warnings.warn(f"{self.__class__.__name__} is deprecated! Please use xnnpack quantizer in ExecuTorch (https://github.com/pytorch/executorch/tree/main/backends/xnnpack/quantizer) instead", DeprecationWarning)
>>>
```
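For context, a minimal sketch of the warning mechanism exercised above (illustrative, not the exact upstream code); DeprecationWarning is suppressed by default, which is why the session sets PYTHONWARNINGS='default':
```
import warnings

# Minimal sketch: emit a DeprecationWarning once when the deprecated package
# is imported. DeprecationWarning is hidden by default outside __main__,
# hence PYTHONWARNINGS='default' in the session above.
warnings.warn(
    "torch.ao.quantization is deprecated; please migrate to the torchao APIs",
    DeprecationWarning,
    stacklevel=2,
)
```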
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/153892
Approved by: https://github.com/Skylion007
Summary:
If a module being quantized contains some meta tensors and some tensors on an actual device, we should not fail quantization.
Quantization should also not fail if the new quantized module is created on a meta device (see the sketch below).
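A minimal sketch of the intended guard (the helper name is hypothetical, not the function changed in this PR): skip observer calls for meta tensors, since they carry no data to calibrate on.
```
import torch

def observe_if_real(observer, tensor):
    # Hypothetical helper: meta tensors have no data to calibrate on, so
    # observing them should be skipped rather than raising an error.
    if tensor.is_meta:
        return tensor
    return observer(tensor)

# A weight materialized on the meta device is passed through untouched.
meta_weight = torch.empty(4, 4, device="meta")
print(observe_if_real(lambda t: t, meta_weight).is_meta)  # True
```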
Differential Revision: D66895899
Pull Request resolved: https://github.com/pytorch/pytorch/pull/142262
Approved by: https://github.com/iamzainhuda
Summary:
### Motivation
In D65283170, we need a subclass of the quantizable LSTM to enable split_gates. This is also required for tests.
### What's the change?
Since the subclass is not part of the no_observer() set, an improper observer is added after the quantizable LSTM module. Here, we change the exact class check to an issubclass check against the no_observer set (see the sketch below).
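A sketch of the change, assuming the no_observer set that mirrors torch.ao.quantization.quantization_mappings.no_observer_set():
```
import torch.ao.nn.quantizable as nnqa

# Mirrors no_observer_set(): modules that should not get observers inserted.
no_observer = {nnqa.LSTM, nnqa.MultiheadAttention}

def skips_observer(module):
    # Before: exact type match, so a subclass of the quantizable LSTM still
    # got an (improper) observer attached after it.
    # After: issubclass check, so subclasses are treated like the base class.
    return issubclass(type(module), tuple(no_observer))
```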
Test Plan:
- N6206576
- CI.
Reviewed By: andrewor14
Differential Revision: D65989314
Pull Request resolved: https://github.com/pytorch/pytorch/pull/140818
Approved by: https://github.com/andrewor14
Summary:
After QAT has completed, or when a pre-tuned weight observer is supplied by a tunable PTQ algorithm, the weight observer should not be overwritten by re-observing the weight; for static QAT this should never happen.
Dynamic QAT likewise does not require re-running the weight observer, by design.
This is a fix for that behavior (a sketch of the intent follows).
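A hedged sketch of the intended behavior; the names below are illustrative, not the code changed in this diff:
```
import torch
from torch.ao.quantization.observer import MinMaxObserver

def quantize_weight(weight, weight_observer, observer_already_calibrated):
    # Illustrative only: if QAT or a tuning algorithm already calibrated the
    # weight observer, reuse its qparams instead of re-observing the weight.
    if not observer_already_calibrated:
        weight_observer(weight)
    scale, zero_point = weight_observer.calculate_qparams()
    return torch.quantize_per_tensor(weight, float(scale), int(zero_point), torch.qint8)

w = torch.randn(8, 8)
obs = MinMaxObserver(dtype=torch.qint8, qscheme=torch.per_tensor_symmetric)
obs(w)  # pre-tuned / QAT-calibrated observer
qw = quantize_weight(w, obs, observer_already_calibrated=True)
```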
Test Plan: Signals
Differential Revision: D57747749
Pull Request resolved: https://github.com/pytorch/pytorch/pull/127309
Approved by: https://github.com/jerryzh168
Summary: Tests are failing because code packaged with trained models calls the now-defunct function name (is_activation_post_process).
This diff maintains backward compatibility temporarily until the cached code can be refreshed.
Test Plan: no functional change
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89260
Approved by: https://github.com/jerryzh168
Summary: The same function existed in both observer and quantize; it has been
consolidated into a single function. Note that the two definitions were slightly
different; I've changed the definition to be maximally inclusive so that the name
of the function is more accurate (see the sketch below).
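The consolidated helper is likely along these lines (a sketch; "maximally inclusive" here means accepting both observers and fake-quantize modules):
```
from torch.ao.quantization.observer import ObserverBase
from torch.ao.quantization.fake_quantize import FakeQuantizeBase

def is_activation_post_process(module):
    # Maximally inclusive: any observer or fake-quantize module counts as an
    # activation post-process, matching what the function name implies.
    return isinstance(module, (ObserverBase, FakeQuantizeBase))
```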
Test Plan: python test/test_public_bindings.py
python test/test_quantization.py
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: [D40709276](https://our.internmc.facebook.com/intern/diff/D40709276)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87520
Approved by: https://github.com/jcaip
Summary: This commit fixes the bug where `non_leaf_module_list`
was not respected for activation modules like `torch.nn.Sigmoid`
and `torch.nn.Tanh`. Today, these modules default to
`default_fixed_qparams_range_0to1_fake_quant`, and there is no
way to configure them to use any other activation_post_process
(e.g. FixedQParamsObserver) (see this [mapping](dc00bb51b8/torch/ao/quantization/quantization_mappings.py (L188-L193))).
`non_leaf_module_list` is a "list of non-leaf modules we want
to add observer" (see prepare docstring). If the user explicitly
asked to insert observers for these modules, we should respect
that instead of continuing to use the default (see the usage sketch below).
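A minimal usage sketch with the eager-mode prepare API (the model and qconfig are illustrative):
```
import torch
import torch.nn as nn
from torch.ao.quantization import prepare, get_default_qconfig

class M(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x):
        return self.sigmoid(self.linear(x))

model = M().eval()
model.qconfig = get_default_qconfig("fbgemm")
# With this fix, asking for observers on nn.Sigmoid is respected instead of
# silently falling back to the fixed-qparams default.
prepared = prepare(model, observer_non_leaf_module_list=[nn.Sigmoid])
```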
Test Plan:
python test/test_quantization.py TestQuantizeEagerPTQStatic.test_activations_in_non_leaf_module_list
Reviewers: vkuzo, jerryzh168
Subscribers: vkuzo, jerryzh168
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88498
Approved by: https://github.com/jerryzh168
Summary:
Currently we expect users to provide custom modules for LSTM and MHA. However, since we almost always ask users to use those modules in the custom-module context, it is better to make this behavior the default. Here we try to align with the base quantization API: if the user specifies a custom_config_dict, that is used; if the value is left as None, the default is used. A user who wants to both use the default and modify it has to do so manually, but the default is accessible via get_default_custom_config_dict (see below).
Additionally, Numeric Suite (NS), which uses prepare to insert custom observers for
its own purposes, had to be slightly modified to pass in an empty
custom_config_dict in order to avoid modifying the custom modules.
Due to weird CI issues with the previous PR, this is a resubmission;
the previous discussion can be found at: https://github.com/pytorch/pytorch/pull/71192
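A quick sketch of inspecting the new default (the keys shown are the ones used by the eager-mode custom-module config; exact import path may vary by release):
```
from torch.ao.quantization.quantize import get_default_custom_config_dict

# The default now wires up nn.LSTM and nn.MultiheadAttention as custom
# modules; it is used whenever custom_config_dict is left as None.
default_cfg = get_default_custom_config_dict()
print(default_cfg["float_to_observed_custom_module_class"])
print(default_cfg["observed_to_quantized_custom_module_class"])
```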
Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79960
Approved by: https://github.com/z-a-f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74846
This PR primarily allows the PTQ convert function to work with
parametrized modules. Given that the parametrized weight is what is used
by default in convert, as long as sparsifier.step() has already been
called, the converted model will use the sparsified weights. There is
currently no way to handle things if sparsifier.step() has not been
called. Lastly, this adds the is_leaf_or_only_parametrized function, because
parametrized modules no longer look like leaves due to the
parametrizations module attached to them (sketch below).
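A sketch of the idea behind such a check (not necessarily the exact implementation):
```
import torch.nn as nn

def is_leaf_or_only_parametrized(module: nn.Module) -> bool:
    # A parametrized module gains a "parametrizations" child, so it no longer
    # has zero children; treat it as a leaf if that is its only child.
    # (all() over an empty iterator is True, covering ordinary leaves.)
    return all(name == "parametrizations" for name, _ in module.named_children())
```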
Test Plan:
python test/test_ao_sparsity.py TestComposability
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D35240275
fbshipit-source-id: 48529f2a83edfe6d8a2d2dff8ca3d08a3fb0d553
(cherry picked from commit 9d6361482e2885db964e02b0222cd23c9f4d469e)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74845
This PR adds support for the quantization flow to detect
parametrized modules and match them using their original module types.
This mainly involved using the new type_before_parametrizations function rather than
type to check for module matching (illustrated below).
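For illustration, how the two differ on a parametrized module:
```
import torch.nn as nn
from torch.nn.utils import parametrize

class Double(nn.Module):
    def forward(self, w):
        return 2 * w

linear = nn.Linear(4, 4)
parametrize.register_parametrization(linear, "weight", Double())

# type() reports the auto-generated parametrized class, while
# type_before_parametrizations() recovers nn.Linear, which is what the
# quantization flow should match against.
print(type(linear))
print(parametrize.type_before_parametrizations(linear))
```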
Test Plan:
python test/test_ao_sparsity.py TestComposability
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D35240274
fbshipit-source-id: 7294d89c9c2e069e51d8b9bafa45c15f92bed124
(cherry picked from commit ed5cdb7b636c42e040d1b4a67b6b94604d06e1ff)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74277
See issue: https://github.com/pytorch/pytorch/issues/74240
This fixes that issue by skipping the children of untraceable modules during
propagate_qconfig. This required extending said function to take
prepare_custom_config_dict as an optional argument (usage sketch below).
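A sketch against the FX prepare API of that era (dict-style configs; newer releases also require example_inputs and spell this via PrepareCustomConfig.set_non_traceable_module_classes):
```
import torch
import torch.nn as nn
from torch.ao.quantization import get_default_qat_qconfig
from torch.ao.quantization.quantize_fx import prepare_qat_fx

class NonTraceable(nn.Module):
    def forward(self, x):
        # Data-dependent control flow: symbolic tracing cannot handle this.
        return x * 2 if x.sum() > 0 else x

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.untraceable = NonTraceable()
        self.linear = nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(self.untraceable(x))

model = Model().train()
qconfig_dict = {"": get_default_qat_qconfig("fbgemm")}
# With this fix, children of NonTraceable are skipped by propagate_qconfig too.
custom_config_dict = {"non_traceable_module_class": [NonTraceable]}
prepared = prepare_qat_fx(model, qconfig_dict, prepare_custom_config_dict=custom_config_dict)
```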
Test Plan:
python test/test_quantization.py
python test/test_quantization.py TestQuantizeFx.test_qat_skip_untraced
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D34916074
fbshipit-source-id: 11caba2cbf78566fb51adf698b01bbba0275de28
(cherry picked from commit 5324c48e4c3277bb12a716a4408151c86006ee47)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71104
This shouldn't change any functionality, given that those
variables were not used. It should be noted that a similar variable is
used in add_observer, which is why it wasn't removed from there.
ghstack-source-id: 146940043
Test Plan:
python test/test_quantization.py
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D33510352
fbshipit-source-id: c66ed72c2b71a6e1822f9311467adaa1f4b730d0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/69864
As titled; a follow-up PR will remove QConfigDynamic from the API.
Test Plan:
regression tests
```
python test/test_quantization.py TestPostTrainingStatic
python test/test_quantization.py TestPostTrainingDynamic
python test/test_quantization.py TestQuantizeFx
```
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D33073235
fbshipit-source-id: 6c1a1647032453803c55cdad7c04154502f085db
Summary:
**Summary:** This commit adds the `torch.nn.qat.dynamic.modules.Linear`
module, the dynamic counterpart to `torch.nn.qat.modules.Linear`.
Functionally these are very similar, except the dynamic version
expects a memoryless observer and is converted into a dynamically
quantized module before inference.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67325
Test Plan:
`python3 test/test_quantization.py TestQuantizationAwareTraining.test_dynamic_qat_linear`
**Reviewers:** Charles David Hernandez, Jerry Zhang
**Subscribers:** Charles David Hernandez, Supriya Rao, Yining Lu
**Tasks:** 99696812
**Tags:** pytorch
Reviewed By: malfet, jerryzh168
Differential Revision: D32178739
Pulled By: andrewor14
fbshipit-source-id: 5051bdd7e06071a011e4e7d9cc7769db8d38fd73
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65674
Before this PR, users had to use the eager mode static quantization APIs to quantize Embedding/EmbeddingBag modules.
With this PR, they can use either the static or dynamic quantization APIs for Embedding quantization.
The only qconfig supported for embedding quantization is float_qparams_weight_only_qconfig, which is currently enforced in the from_float
method of the quantized Embedding/EmbeddingBag modules.
To combine embedding quantization with Linear dynamic quantization, users can use the qconfig_dict to specify a different qconfig for each module type (see the example below).
The prepare/convert APIs can still be used to quantize Embeddings, with the caveat that users need to ensure inputs to Embedding ops are FP32.
Addresses Issue #65185
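For example, combining EmbeddingBag quantization with dynamic Linear quantization through a per-module-type qconfig dict (a minimal sketch using current torch.ao.quantization names; the model is illustrative):
```
import torch
import torch.nn as nn
from torch.ao.quantization import (
    quantize_dynamic,
    float_qparams_weight_only_qconfig,
    default_dynamic_qconfig,
)

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = nn.EmbeddingBag(num_embeddings=10, embedding_dim=12)
        self.fc = nn.Linear(12, 4)

    def forward(self, indices, offsets):
        return self.fc(self.emb(indices, offsets))

model = Model().eval()
# Different qconfig per module type: weight-only float-qparams quantization
# for the EmbeddingBag, standard dynamic quantization for the Linear.
quantized = quantize_dynamic(
    model,
    qconfig_spec={
        nn.EmbeddingBag: float_qparams_weight_only_qconfig,
        nn.Linear: default_dynamic_qconfig,
    },
)
```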
ghstack-source-id: 139935419
Test Plan:
python test/test_quantization.py
Imported from OSS
Reviewed By: gchanan
Differential Revision: D31211199
fbshipit-source-id: 8c747881caee5ccbf8b93c6704b08d132049dea4