pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Vasiliy Kuznetsov	bfc75fe078	quant: add QAT fused Linear-Bn1d [1/x]: prepared module (#72431 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72431 Adds support for a fused QAT observed module for `Linear` followed by `BatchNorm1d`. In this PR, only the support for prepared module with fake_quants in the right places is added. A future PR will add support for `convert`, and tests for eager and FX graph mode workflows. Similar to conv-bn, we rescale the weight before applying the fake quant, and undo the rescaling after the linear operation. Test Plan: ``` python test/test_quantization.py TestQuantizeEagerQATNumerics.test_linear_bn ``` Imported from OSS Reviewed By: jerryzh168, raghuramank10000 Differential Revision: D34044427 fbshipit-source-id: 47a519173939ca4824d2c6e6ea7a599764a8ed10	2022-02-18 05:10:28 -08:00
Anjali Chourdia	a637e69569	Reland torch.ops API change machinery with the core functionality disabled (#71785 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71785 see https://github.com/pytorch/pytorch/pull/67254 ghstack-source-id: 147648699 Test Plan: github CI Reviewed By: albanD Differential Revision: D33777229 fbshipit-source-id: 517b36be9743025eb40d708d380dae62e3663184	2022-02-02 08:03:28 -08:00
Terry Chen	0b8b178d26	Fix nnq.dropout in vision mobilenetv3 pretrain model (#71438 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71438 Fix issue https://github.com/pytorch/vision/issues/5198 skip observer for nn.dropout to load pretrain model Test Plan: python -c "import torchvision; torchvision.models.quantization.mobilenet_v3_large(pretrained=True, quantize=True)" Imported from OSS Reviewed By: HDCharles Differential Revision: D33641707 fbshipit-source-id: 14ea26557c4ff3b942cf46bf06610db0b8f06b05	2022-01-21 15:59:58 -08:00
Yan Li	03ab65023a	backout D33469839 (#71443 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71443 cogwheel test inline_cvr_infer_canary_pyper_model_publish is timing out. The convert_fx call takes > 20 mins for local and local_ro sub modules, which used to take ~ 2 mins. Test Plan: Fblearn flow run * the following cmd took 1113 seconds before the diff and 5002 seconds after. flow-cli clone-locally 320014219 --run-as-secure-group pytorch_at_scale --operators pyper_model_publish_workflow.pyper_model_publish_workflow.process_torch_package_model_files.process_non_sparse_parameters[0] Cogwheel test * Cogwheel test with packages in B3588 (the last good run) took 4694.48s * Cogwheel test with packages in B3590 (the first timeout) took 13975.83s * Cogwheel test with the following packages took 4535.04s * all packages in B3588 except the model publish * the model publish built with D33469839 (`043e84b3d2`) reversed (created D33633570) Reviewed By: albanD, jerryzh168 Differential Revision: D33633570 fbshipit-source-id: dc5e777c48a90c551641a3f79126461f6a60449e	2022-01-18 15:49:02 -08:00
Terry Chen	e7c87e8b44	[quant] fix dropout in FX graph mode quantization (#71043 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71043 fix issue #68250 dropout break fx graph model quantization Test Plan: python test/test_quantization.py TestStaticQuantizedModule Imported from OSS Reviewed By: vkuzo Differential Revision: D33490176 fbshipit-source-id: 155546505b28ffc635ada65a1464b9d622dbc235	2022-01-13 15:59:59 -08:00
Charles David Hernandez	83b45fe166	[ao] disabling dynamic conv/convT ops (#71110 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71110 as mentioned in https://github.com/pytorch/pytorch/issues/70480 the dynamic conv ops are currently missing a key feature to bring their performance in line with other dynamic ops, this diff disables conv/convT from being automatically quantized with convert dynamic Test Plan: buck test //caffe2/test:quantization --test-selectors test_quantized_module#TestDynamicQuantizedModule Reviewed By: vkuzo Differential Revision: D33511152 fbshipit-source-id: 50618fbe734c898664c390f896e70c68f1df3208	2022-01-13 11:28:02 -08:00
anjali411	043e84b3d2	Per-overload torch.ops API (#67254 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67254 Fixes https://github.com/pytorch/pytorch/issues/65997 BC breaking: `output = torch.ops._test.leaky_relu(self=torch.tensor(-1.0))` now fails with the error `TypeError: __call__() got multiple values for argument 'self'` since we call into `OpOverloadBundle`'s `__call__` method that has `self` bound to it as its first argument. Follow up work: 1. disallow `default` as an overload name for aten operators. 2. Add a method to obtain a list of all overloads (exclude the ones registered by JIT) 3. Add methods/properties to `OpOverload` to access more schema information (types of input and output args etc) cc ezyang gchanan Test Plan: Imported from OSS Reviewed By: pbelevich Differential Revision: D33469839 Pulled By: anjali411 fbshipit-source-id: c3fc43460f1c7c9651c64b4d46337be21c400621	2022-01-10 17:29:06 -08:00
Michael Suo	402f2934bf	Revert D33262228: Per-overload torch.ops API Test Plan: revert-hammer Differential Revision: D33262228 (`8e6d1738a4`) Original commit changeset: 600dbf511514 Original Phabricator Diff: D33262228 (`8e6d1738a4`) fbshipit-source-id: 238fa88ea9c4f26c7511334765c07452fbca9655	2022-01-05 22:10:11 -08:00
anjali411	8e6d1738a4	Per-overload torch.ops API (#67254 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67254 Fixes https://github.com/pytorch/pytorch/issues/65997 TODO: disallow `default` as an overload name for aten operators. BC breaking: `output = torch.ops._test.leaky_relu(self=torch.tensor(-1.0))` now fails with the error `TypeError: __call__() got multiple values for argument 'self'` since we call into `OpOverloadBundle`'s `__call__` method that has `self` bound to it as its first argument. cc ezyang gchanan Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D33262228 Pulled By: anjali411 fbshipit-source-id: 600dbf511514ea9b41aea3e6b1bc1102dab08909	2022-01-05 15:17:41 -08:00
Zafar	07932e2735	[sparsity] Convert function for sparse kernels without a context manager (#66778 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66778 This removes the hack of the context manager that would communicate the zeros block shape to the quantization convert. The conversion will assume that the converted modules have `sparse_params` (which is added by the sparsifier). Test Plan: Imported from OSS Reviewed By: mrshenli Differential Revision: D31835721 Pulled By: z-a-f fbshipit-source-id: c5fd2da3b09a728a2296765c00ca69275dbca3b1	2021-12-09 02:58:57 -08:00
Ben Koopman	6c9cf5e6ea	[quant][embedding qat] eager mode QAT for Embeddings (#66429 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66429 Test Plan: Imported from OSS Reviewed By: HDCharles, supriyar Differential Revision: D31618284 Pulled By: b-koopman fbshipit-source-id: 0c0e2e86b98da9f29e9b2fc2a35c59424f94cbba	2021-11-18 05:57:11 -08:00
Charles David Hernandez	09615cd0b0	Adding Dynamic Conv and ConvT ops/modules (#68176 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68176 it should be noted that for the modules, reduce_range is set to true by default in a similar fashion to linear_dynamic. Test Plan: python test/test_quantization.py TestDynamicQuantizedModule python test/test_quantization.py TestDynamicQuantizedConv python test/test_quantization.py TestQuantizedConv Imported from OSS Reviewed By: kimishpatel Differential Revision: D32374003 fbshipit-source-id: 011562bd0f4d817387d53bb113df2600aa60a7a3	2021-11-15 16:42:25 -08:00
Charles David Hernandez	e795315c63	Changes and fixes to prepare for dynamic conv (#68175 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68175 This slightly alters the way from_float works so it will work with placeholder observers. It also fixes a but with ConvTranspose3d and ConvTranspose1d where the parameters like kernel_size, stride...etc weren't set properly. New tests were added to check for this type of issue as well. Test Plan: python test/test_quantization.py TestQuantizedOps python test/test_quantization.py TestStaticQuantizedModule Imported from OSS Reviewed By: z-a-f Differential Revision: D32374004 fbshipit-source-id: caaa548d12d433d9c1fa0abc8597a7d31bb4e8af	2021-11-11 23:55:04 -08:00
andrewor	4a8f27445d	[Quant] Add dynamic QAT Linear module (#67325 ) Summary: Summary: This commit adds the `torch.nn.qat.dynamic.modules.Linear` module, the dynamic counterpart to `torch.nn.qat.modules.Linear`. Functionally these are very similar, except the dynamic version expects a memoryless observer and is converted into a dynamically quantized module before inference. Pull Request resolved: https://github.com/pytorch/pytorch/pull/67325 Test Plan: `python3 test/test_quantization.py TestQuantizationAwareTraining.test_dynamic_qat_linear` Reviewers: Charles David Hernandez, Jerry Zhang Subscribers: Charles David Hernandez, Supriya Rao, Yining Lu Tasks: 99696812 Tags: pytorch Reviewed By: malfet, jerryzh168 Differential Revision: D32178739 Pulled By: andrewor14 fbshipit-source-id: 5051bdd7e06071a011e4e7d9cc7769db8d38fd73	2021-11-08 10:24:25 -08:00
Ben Koopman	a58ff186e8	[quant][embedding qat] Add basic EmbeddingBag QAT fakeQuant workflow (#65443 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65443 Test Plan: Imported from OSS Reviewed By: dagitses, supriyar Differential Revision: D31456445 Pulled By: b-koopman fbshipit-source-id: 0edda6e272d9005fce65f2ba6a5e6abc831836de	2021-10-07 20:19:29 -07:00
Supriya Rao	8a974a482c	[quant] Add support for quantization of Embedding{Bag} in dynamic quant APIs (#65674 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65674 Before this PR user had to use the eager mode static quantization APIs to quantize Embedding/EmbeddingBag modules. With this PR they can use either the static or dynamic quantization APIs for Embedding quantization The only qconfig supported for embedding quantization is float_qparams_weight_only_qconfig whcih is currently enforced in the from_float method of the quantized Embedding/Embedding modules. To combine embedding quantization with Linear dynamic quantization, user can use the qconfig_dict to specify different qconfig for each module type. The prepare/convert APIs can still be used to quantize Embeddings, with the caveat that user need to ensure input to Embedding ops are FP32. Addresses Issue #65185 ghstack-source-id: 139935419 Test Plan: python test/test_quantization.py Imported from OSS Reviewed By: gchanan Differential Revision: D31211199 fbshipit-source-id: 8c747881caee5ccbf8b93c6704b08d132049dea4	2021-10-06 23:19:38 -07:00
Zafar Takhirov	02dec91212	[quant] AO migration of the `torch/quantization/utils.py` (phase 1) (#64919 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64919 AO Team is migrating the existing torch.quantization into torch.ao.quantization. We are doing it one file at a time to make sure that the internal callsites are updated properly. This migrates the quantization utilities. ghstack-source-id: 138303325 Test Plan: `buck test mode/dev //caffe2/test:quantization` Reviewed By: jerryzh168 Differential Revision: D30899082 fbshipit-source-id: 85eb38c419e417147e71758b682cd095308dd0c9	2021-09-16 21:30:18 -07:00
Charles David Hernandez	8a094e3270	[quant]ao migration for quantization mappings and fuser method mappings hg mv (#64985 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64985 moving quantization_mappings.py and fuser_method_mappings.py to the ao folder while retaining backwards compatibility also added dict test ghstack-source-id: 138215312 Test Plan: buck test mode/dev //caffe2/test:quantization https://www.internalfb.com/intern/testinfra/testrun/7036874471986444 buck test mode/dev //caffe2/test:quantization -- TestAOMigrationQuantization https://www.internalfb.com/intern/testinfra/testrun/5348024625792701 Reviewed By: z-a-f Differential Revision: D30982551 fbshipit-source-id: 00f53bd44009d6012a7de852000aad6885131edb	2021-09-16 12:59:20 -07:00

18 Commits