Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71656
The customized `__getstate__`/`__setstate__` didn't call super (`torch.nn.Module`) and therefore didn't restore attributes (e.g. `_modules`) after the module was serialized and deserialized via torch.package.
After a few iterations it turned out that packing/unpacking the linear params is already supported by the TorchBind class, so there is no need to hack the torch module anymore.
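For illustration only (not code from this diff), a minimal sketch of the pitfall on a hypothetical module: a custom `__getstate__`/`__setstate__` must preserve `nn.Module`'s own `__dict__` (which holds `_modules`, `_parameters`, `_buffers`), otherwise the deserialized module is missing those attributes.
```
import torch

class SafePickling(torch.nn.Module):
    # Hypothetical module, only to illustrate the pattern described above.
    def __init__(self):
        super().__init__()
        self.inner = torch.nn.Linear(2, 2)

    def __getstate__(self):
        # Start from the full __dict__ so _modules, _parameters, _buffers survive;
        # returning only custom state here is what broke the torch.package round trip.
        state = self.__dict__.copy()
        state["extra_tag"] = "custom"
        return state

    def __setstate__(self, state):
        state = dict(state)
        state.pop("extra_tag", None)
        self.__dict__.update(state)
```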
Test Plan: `buck test caffe2/test/:quantization -- test_linear_api`
Reviewed By: jerryzh168
Differential Revision: D33711086
fbshipit-source-id: 3a36d10c64b7da414d3657d2ef766bb9a9290ea9
(cherry picked from commit 6337b6c207)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65723
Example lowering reference linear module to fbgemm/qnnpack quantized linear module
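Roughly, the lowering amounts to packing the reference module's quantized weight for the active backend and dispatching to the `quantized::linear` op; a hedged sketch with a hypothetical helper (the real code lives in the quantized Linear module):
```
import torch

def lower_reference_linear(qweight, bias, qx, out_scale, out_zero_point):
    # Pack the qint8 weight for the active backend (fbgemm or qnnpack) ...
    packed = torch.ops.quantized.linear_prepack(qweight, bias)
    # ... and run the backend's quantized linear kernel.
    return torch.ops.quantized.linear(qx, packed, out_scale, out_zero_point)

qweight = torch.quantize_per_tensor(torch.randn(4, 3), 0.02, 0, torch.qint8)
qx = torch.quantize_per_tensor(torch.randn(2, 3), 0.05, 64, torch.quint8)
y = lower_reference_linear(qweight, torch.zeros(4), qx, out_scale=0.1, out_zero_point=64)
```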
Test Plan: Imported from OSS
Reviewed By: vkuzo
Differential Revision: D31567461
fbshipit-source-id: 0b8fffaf8e742ec15cb07bf6a4672cf3e856db2d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65900
This changes the imports in `caffe2/torch/nn/quantized` to use the new import locations.
```
codemod -d torch/nn/quantized --extensions py 'torch.quantization' 'torch.ao.quantization'
```
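For example (illustrative import, not a specific line from this diff), the rewrite turns:
```
# Before
from torch.quantization import default_qconfig
# After
from torch.ao.quantization import default_qconfig
```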
Test Plan: `python test/run_test.py`
Reviewed By: jerryzh168
Differential Revision: D31301193
fbshipit-source-id: 58efb1ad51a8b441e2a3bd5b91af11eab6b9331f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64367
This is the same thing as https://github.com/pytorch/pytorch/pull/56154
but for quantized linear. It fixes the behavior of `copy.deepcopy` on
these modules. Before this PR, copied instances of `LinearPackedParams`
were not properly initialized, and inspecting them raised errors of
missing `_modules`. After this PR, inspecting and using the copies
works.
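A quick illustration of the behavior being fixed (sketch using the public quantized Linear module):
```
import copy
import torch

qlinear = torch.nn.quantized.Linear(3, 4)
qlinear_copy = copy.deepcopy(qlinear)

# Before this PR, inspecting the copy could raise because internals such as
# `_modules` on the copied LinearPackedParams were never initialized.
print(qlinear_copy)
print(qlinear_copy.weight().shape)
```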
Test Plan:
```
python test/test_quantization.py TestStaticQuantizedModule.test_linear_api
```
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D30702667
fbshipit-source-id: 38c26d1e72663416eeb989985b77ffc2052c12b9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49964
`torch.nn.modules.linear._LinearWithBias` is only used in the transformer modules and is completely identical to `torch.nn.Linear`.
This PR creates a mapping so that this module is treated the same as Linear.
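Conceptually the added mapping entry looks like this (hedged sketch; the actual table and its name live in torch.quantization's default module mappings):
```
import torch.nn as nn
import torch.nn.quantized as nnq

# `_LinearWithBias` is an internal helper used by nn.Transformer; fall back to
# nn.Linear here so the sketch also imports on versions where it was removed.
_LinearWithBias = getattr(nn.modules.linear, "_LinearWithBias", nn.Linear)

static_quant_module_mappings = {
    nn.Linear: nnq.Linear,
    _LinearWithBias: nnq.Linear,  # treated exactly the same as Linear during convert
}
```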
Test Plan:
```
python test/test_quantization.py TestDynamicQuantizedModule TestStaticQuantizedModule
```
Differential Revision: D25731589
Reviewed By: jerryzh168
Pulled By: z-a-f
fbshipit-source-id: 1b2697014e250e97d3010cdb542f9d130b71fbc3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48192
This is to allow producing a backend-independent quantized module, since some backends don't have a packed weight format for linear.
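For context, the backend-independent (reference) pattern keeps the quantized weight unpacked and spells the computation out with generic ops; a hedged sketch, not the module code itself:
```
import torch
import torch.nn.functional as F

def reference_quantized_linear(qx, qweight, bias, out_scale, out_zero_point):
    # No backend-specific packed weight: dequantize, run the float kernel, requantize.
    y = F.linear(qx.dequantize(), qweight.dequantize(), bias)
    return torch.quantize_per_tensor(y, out_scale, out_zero_point, torch.quint8)
```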
Test Plan:
test_quantized_module.py
Imported from OSS
Reviewed By: raghuramank100
Differential Revision: D25061645
fbshipit-source-id: a65535e53f35af4f2926af0ee330fdaae6dae996
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48038
nn.ReLU works for both float and quantized input, so we don't want to define an nn.quantized.ReLU
that does the same thing as nn.ReLU; the same applies to nn.quantized.functional.relu.
This also removes the numerical inconsistency for models that quantize nn.ReLU independently in QAT mode.
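For example, the same module instance handles both tensor kinds:
```
import torch

relu = torch.nn.ReLU()
x = torch.randn(2, 3)
qx = torch.quantize_per_tensor(x, scale=0.1, zero_point=64, dtype=torch.quint8)

print(relu(x))   # float path
print(relu(qx))  # quantized path -- no separate nn.quantized.ReLU needed
```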
Test Plan:
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D25000462
fbshipit-source-id: e3609a3ae4a3476a42f61276619033054194a0d2
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47415
nn.ReLU works for both float and quantized input, so we don't want to define an nn.quantized.ReLU
that does the same thing as nn.ReLU; the same applies to nn.quantized.functional.relu.
This also removes the numerical inconsistency for models that quantize nn.ReLU independently in QAT mode.
Test Plan: Imported from OSS
Reviewed By: z-a-f
Differential Revision: D24747035
fbshipit-source-id: b8fdf13e513a0d5f0c4c6c9835635bdf9fdc2769
Summary:
To avoid conflicts, this PR does not remove all imports. More are coming in further PRs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43808
Reviewed By: wanchaol
Differential Revision: D23436675
Pulled By: ailzhang
fbshipit-source-id: ccc21a1955c244f0804277e9e47e54bfd23455cd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43088
Create a quantized module that users can use to perform embedding bag quantization.
The module uses EmbeddingPackedParams to store the weights, which can be serialized/deserialized
using TorchBind custom classes (C++ get/setstate code).
A follow-up PR will add support for `from_float` to convert a float module to a quantized one.
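Because the packed params implement get/setstate in C++, an ordinary save/load round trip of the module just works; a minimal sketch using the quantized Linear module as a stand-in (same TorchBind mechanism, stable API):
```
import io
import torch

qmod = torch.nn.quantized.Linear(4, 4)   # packed params are held by a TorchBind class

buf = io.BytesIO()
torch.save(qmod, buf)                    # pickling goes through the C++ __getstate__
buf.seek(0)

qmod_restored = torch.load(buf)          # ...and is rebuilt through the C++ __setstate__
print(qmod_restored)
```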
Test Plan:
python test/test_quantization.py TestDynamicQuantizedModule.test_embedding_bag_api
Imported from OSS
Reviewed By: vkuzo
Differential Revision: D23167519
fbshipit-source-id: 029d7bb44debf78c4ef08bfebf267580ed94d033
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33852
This fixes an issue for QAT models: during eval, if we call `prepare_qat` and `convert` before calling `load_state_dict`, it throws an error because the weight info (number of channels) has not been updated in the observer module.
This is not an issue for the per-tensor case.
Fixes issue #33830
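The eval-time flow this unblocks, roughly (hedged sketch with a toy model and a hypothetical checkpoint path):
```
import torch
from torch.quantization import get_default_qat_qconfig, prepare_qat, convert, QuantStub, DeQuantStub

model = torch.nn.Sequential(QuantStub(), torch.nn.Conv2d(3, 8, 3), DeQuantStub())
model.qconfig = get_default_qat_qconfig('fbgemm')   # per-channel weight fake-quant/observer

prepare_qat(model, inplace=True)
convert(model.eval(), inplace=True)                 # previously errored for per-channel weights,
                                                    # since the observers had seen no data yet
model.load_state_dict(torch.load('qat_checkpoint.pt'))  # hypothetical path to a converted checkpoint
```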
Test Plan:
python test/test_quantization.py EagerModePostTrainingQuantTest.test_eval_after_train
python test/test_quantization.py EagerModeQuantizationAwareTrainingTest.test_eval_after_train
Imported from OSS
Differential Revision: D20212996
fbshipit-source-id: a04af8fe4df2e555270ae4d6693f5777d86f8a46
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26666
Changes:
- Introduce a `ConcreteModuleType` concept. This acts both as the key into the type
cache, and as the source of truth for `ModuleValue::attr` queries. It needs
to do both jobs because that's how we ensure correctness (if the types are
different, it's because `ModuleValue::attr` would return different things).
- Now `recursive_script` will first construct a `ConcreteModuleType` and search for a
pre-existing type before starting compilation.
- All previous paths to creating a `ScriptModule` (including inheriting from
`ScriptModule`) are now rewritten to go through `create_script_module`, so
that we have only a single place where construction happens.
Behavioral changes:
- Big change to `torch.jit.ScriptModule` inheritance: all attributes are now
recursively scripted if possible, matching recursive scripting semantics.
This makes it hard to keep something from being scripted (for example, a
Python submodule). Possibly we'll need an `ignore()` type thing for
attributes. In particular, this adds `self.training` to *every* ScriptModule, since
it's present on every `nn.Module`.
- I believe this change to be transparent to existing users of the inheritance API (see the sketch below): if you had an unscriptable attribute that you never used, there is no error. In some cases we will create new attributes (even if they are unused), which will increase serialized model size compared to before.
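A minimal sketch of the inheritance API after this change:
```
import torch

class MyScriptModule(torch.jit.ScriptModule):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(2, 2)   # attributes are now recursively scripted

    @torch.jit.script_method
    def forward(self, x):
        if self.training:                     # `training` exists on every ScriptModule
            x = x + 1.0
        return self.linear(x)

m = MyScriptModule()
print(m(torch.randn(1, 2)))
```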
Test Plan: Imported from OSS
Differential Revision: D17551196
Pulled By: suo
fbshipit-source-id: b476d1c9feb3ddfd63406d90989aaf9dfe890591
Summary:
Most of this was old cruft left over from special handling of `training` before we had a `bool` type. This makes all modules have a `training` attribute that is true by default and removes all other special handling.
Fixes #26884
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27109
Pulled By: driazati
Differential Revision: D17728129
fbshipit-source-id: 8ddc9fbb07a953dd05529538bfdd01ed88b5cb57
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26457
Enhancement to the fuse module to support Sequentials; the fuse list can now use names just like the state dict.
Also adds support for Conv-ReLU and Linear-ReLU fusion (example below).
Also supports in-place and out-of-place fusion of models.
ghstack-source-id: 91076386
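A minimal usage sketch (toy Sequential, standard `fuse_modules` API; illustration only, not part of this diff):
```
import torch
from torch.quantization import fuse_modules

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()

# Fuse-list entries are module names, just like state_dict keys; for a Sequential
# these are the child indices. Conv-ReLU fuses into a single intrinsic module.
fused = fuse_modules(model, [['0', '1']], inplace=False)
print(fused)
```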
Test Plan:
buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train \(test_quantization\.FusionTest\)' --print-passing-details
buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval \(test_quantization\.FusionTest\)' --print-passing-details
Differential Revision: D17466382
fbshipit-source-id: 0a548f8f4c366f3ecc59db693bac725ccd62328e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26516
Integrate per-channel support into conv and linear modules.
ghstack-source-id: 90982010
Test Plan:
The following tests pass:
buck test caffe2/test:quantized -- 'test_linear_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
buck test caffe2/test:quantized -- 'test_conv_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
buck test caffe2/test:quantized -- 'test_float_quant_compare_per_channel \(test_quantized_models\.ModelNumerics\)' --print-passing-details
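For reference, a minimal eager-mode sketch of turning on per-channel weight quantization (toy model, standard observers; illustration only, not part of this diff):
```
import torch
from torch.quantization import QConfig, MinMaxObserver, PerChannelMinMaxObserver, prepare, convert

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
model.qconfig = QConfig(
    activation=MinMaxObserver.with_args(dtype=torch.quint8),
    weight=PerChannelMinMaxObserver.with_args(dtype=torch.qint8,
                                              qscheme=torch.per_channel_symmetric),
)
prepare(model, inplace=True)
model(torch.randn(1, 3, 8, 8))   # calibration pass
convert(model, inplace=True)     # conv weight is now quantized per output channel
```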
Differential Revision: D17342622
fbshipit-source-id: f0d618928e3d9348672c589a6b7a47049c372a2e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26828
Pickle serialization for quantized modules is currently broken by https://github.com/pytorch/pytorch/issues/24045, so let's be loud and fail if the user tries to do it
Test Plan: Imported from OSS
Differential Revision: D17579127
Pulled By: jamesr66a
fbshipit-source-id: 3deccac7e4590c6f648f22bb79c57badf3bf0487
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26574
Since we also have `quantized::linear`, `quantize_linear` sounds
confusing, so we plan to rename it before the branch cut
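For reference, the rename later landed as `torch.quantize_per_tensor`; a minimal usage example:
```
import torch

x = torch.randn(2, 3)
qx = torch.quantize_per_tensor(x, scale=0.1, zero_point=0, dtype=torch.quint8)
print(qx.int_repr())
```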
Test Plan:
ci
Imported from OSS
Differential Revision: D17514876
fbshipit-source-id: 01d9005e6ec8cb9950b9d8bba122109c389641d3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25428
Added bias as an optional param to the quantized_linear_prepack function.
Bias is quantized at runtime using the input scale and the weight scale.
ghstack-source-id: 89601399
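Numerically, the runtime bias handling boils down to the following (sketch; zero point fixed at 0, int32 accumulation):
```
import torch

input_scale, weight_scale = 0.05, 0.02
bias = torch.randn(8)

# bias_scale = input_scale * weight_scale, so the int32 bias adds directly into
# the int32 accumulator of the quantized matmul before requantization.
q_bias = torch.quantize_per_tensor(
    bias, scale=input_scale * weight_scale, zero_point=0, dtype=torch.qint32)
print(q_bias.int_repr())
```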
Test Plan: python test/run_test.py --exclude nn --verbose --bring-to-front quantization quantized quantized_tensor quantized_nn_mods quantizer
Differential Revision: D17121304
fbshipit-source-id: 8adb0e55e4aed0a5430aaa2c8639c8ad1639c85a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25678
As an effort to unify fbgemm and qnnpack at the dispatcher level, we need a generic name for the quantized backend ops.
Currently FBGEMM is guarded by the USE_FBGEMM macro and QNNPACK uses USE_QNNPACK.
ghstack-source-id: 89518961
Test Plan: buck test caffe2/test:quantized
Differential Revision: D17194364
fbshipit-source-id: 5960aedff6b8cb89eb3872c39b74caf54c0fbf20
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25338
As an effort to unify fbgemm and qnnpack at the dispatcher level, we need a generic name for the quantized backend ops.
Currently FBGEMM is guarded by the USE_FBGEMM macro and QNNPACK uses USE_QNNPACK.
TBD: use a compile-time macro or a runtime switch to choose between fbgemm and qnnpack.
ghstack-source-id: 89454244
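With a single `quantized::` op namespace, the backend choice can be made at runtime; a sketch of the switch as it eventually surfaced (the exact mechanism was still TBD in this diff):
```
import torch

print(torch.backends.quantized.supported_engines)     # engines compiled into this build
if 'fbgemm' in torch.backends.quantized.supported_engines:
    torch.backends.quantized.engine = 'fbgemm'         # same quantized:: ops, different kernels
```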
Test Plan: buck test caffe2/test:quantized
Differential Revision: D17097735
fbshipit-source-id: 447112a7a421387724d3e29b8fd8412dfb1c373a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25212
In eager mode, all modules need to work with input tensors whose qparams can change dynamically. Issue https://github.com/pytorch/pytorch/issues/23874 will address this via FBGEMM modifications; this is a workaround until then.
ghstack-source-id: 89118038
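Concretely, a quantized module has to accept inputs whose qparams change from call to call; a sketch with the current quantized Conv2d module (illustration only):
```
import torch

qconv = torch.nn.quantized.Conv2d(3, 8, kernel_size=3)
x1 = torch.quantize_per_tensor(torch.randn(1, 3, 8, 8), scale=0.10, zero_point=0, dtype=torch.quint8)
x2 = torch.quantize_per_tensor(torch.randn(1, 3, 8, 8), scale=0.03, zero_point=5, dtype=torch.quint8)
qconv(x1)
qconv(x2)   # same module, different input qparams
```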
Test Plan:
buck test caffe2/test:quantized -- 'test_conv_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
Summary (total time 65.86s):
PASS: 1
FAIL: 0
SKIP: 0
FATAL: 0
TIMEOUT: 0
OMIT: 0
Differential Revision: D17064471
fbshipit-source-id: 3c192442b19bf2d9d88d4e52de6c24dc134a846f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24789
In eager mode, all modules need to work with input tensors whose qparams can change dynamically. Issue https://github.com/pytorch/pytorch/issues/23874 will address this via FBGEMM modifications; this is a workaround until then.
ghstack-source-id: 89003798
Test Plan:
buck test caffe2/test:quantized -- 'test_conv_api \(test_quantized_nn_mods\.ModuleAPITest\)' --print-passing-details
Summary (total time 65.86s):
PASS: 1
FAIL: 0
SKIP: 0
FATAL: 0
TIMEOUT: 0
OMIT: 0
Differential Revision: D16852280
fbshipit-source-id: 988f8ff91616eddf511e71926aa7d2d0f1938188
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23753
Add intrinsic (fused) module mappings in quantize.py to enable mapping fused modules
in both QAT and post-training quantization (PTQ).
Differential Revision: D16820749
fbshipit-source-id: 07de76a4f09b44bde8b193c103eac02c22b875b6