Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24418
Fixes #24394
The observer was not added correctly because one of the required conditions was not met.
Test Plan: Imported from OSS
Differential Revision: D16833951
Pulled By: zafartahirov
fbshipit-source-id: bb4699e6a1cf6368c7278272a68e5e7c6d3f59a8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24431
Pickle's fully-qualified name lookup would fail when trying to serialize QConfig_dynamic, since the `__name__` on the instance would refer to the wrong class name.
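For context, a minimal sketch (not the actual PyTorch code) of how pickle's fully-qualified name lookup fails when a class's `__name__` does not match the name it is bound to in its module:
```
import collections
import pickle

# Hypothetical illustration: the class's __name__ is "Pt", but it is bound to
# the module-level name "Point", so pickle's lookup of __main__.Pt fails.
Point = collections.namedtuple("Pt", ["x", "y"])

try:
    pickle.dumps(Point(1, 2))
except pickle.PicklingError as e:
    print(e)  # Can't pickle <class '__main__.Pt'>: it's not found as __main__.Pt
```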
Test Plan: Imported from OSS
Differential Revision: D16835705
Pulled By: jamesr66a
fbshipit-source-id: e146835cbe10b08923d77298bc93b0f5b0ba37c5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23753
Add intrinsic (fused) module mappings in quantize.py to enable mapping fused modules in both QAT and post-training quantization (PTQ).
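A hedged sketch of what such a mapping might look like; the exact module paths under `torch.nn._intrinsic` are assumptions based on this commit, not the literal dict in quantize.py:
```
# Hypothetical mapping from float fused modules to their QAT / quantized
# counterparts; string keys/values are used so the sketch stands alone.
FUSED_MODULE_MAPPING = {
    "torch.nn._intrinsic.ConvBn2d":     "torch.nn._intrinsic.qat.ConvBn2d",
    "torch.nn._intrinsic.ConvBnReLU2d": "torch.nn._intrinsic.qat.ConvBnReLU2d",
    "torch.nn._intrinsic.LinearReLU":   "torch.nn._intrinsic.quantized.LinearReLU",
}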
Differential Revision: D16820749
fbshipit-source-id: 07de76a4f09b44bde8b193c103eac02c22b875b6
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24232
As suggested in https://github.com/pytorch/pytorch/pull/23128#discussion_r306650311, we make `torch.nn.Linear` the key of default_qconfig_dict. That is, we dynamically quantize `torch.nn.Linear` modules by default when the user just calls `torch.quantize_dynamic(model)`.
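A minimal usage sketch of the default behavior described above; `torch.quantization.quantize_dynamic` is the name in current releases, while the commit text refers to `torch.quantize_dynamic`:
```
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(16, 8), nn.ReLU(), nn.Linear(8, 4))

# With torch.nn.Linear as the default key, every Linear layer is dynamically
# quantized without listing the layers by name.
qmodel = torch.quantization.quantize_dynamic(model)
print(qmodel)  # Linear layers replaced by dynamic quantized Linear modules
```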
ghstack-source-id: 88287089
Differential Revision: D16781191
fbshipit-source-id: 991a5e151a9ea32b879d6897cd9862855d747135
Summary:
We want to use the Module type as the key in qconfig_dict for module replacement during quantization.
Before this Diff, to dynamically quantize the BERT model we had to specify each layer:
```
qconfig_dict = {
    'encoder.layer.0.attention.self.query': default_qconfig,
    'encoder.layer.0.attention.self.key': default_qconfig,
    'encoder.layer.0.attention.self.value': default_qconfig,
    'encoder.layer.0.attention.output.dense': default_qconfig,
    'encoder.layer.0.intermediate.dense': default_qconfig,
    'encoder.layer.0.output.dense': default_qconfig,
    'encoder.layer.1.attention.self.query': default_qconfig,
    'encoder.layer.1.attention.self.key': default_qconfig,
    'encoder.layer.1.attention.self.value': default_qconfig,
    'encoder.layer.1.attention.output.dense': default_qconfig,
    'encoder.layer.1.intermediate.dense': default_qconfig,
    'encoder.layer.1.output.dense': default_qconfig,
    ...
}
```
After this Diff, we only need the following:
```
qconfig_dict = {
    torch.nn.Linear: default_qconfig
}
```
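A hedged sketch of passing the type-keyed dict explicitly; the `qconfig_spec` argument name and `default_dynamic_qconfig` match the current torch.quantization API and are assumptions about this commit's exact signature (the text above uses `default_qconfig`):
```
import torch
import torch.nn as nn
from torch.quantization import quantize_dynamic, default_dynamic_qconfig

model = nn.Sequential(nn.Linear(16, 8), nn.Linear(8, 4))

# One type-keyed entry covers every Linear layer in the model.
qmodel = quantize_dynamic(model, qconfig_spec={nn.Linear: default_dynamic_qconfig})
```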
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23212
ghstack-source-id: 88287091
Reviewed By: zafartahirov
Differential Revision: D16436542
fbshipit-source-id: 11fbe68ee460560c1a7cdded63581eb7a00e5a89
Summary:
- ~~Add a unit test for the Dynamic Quantized Linear operator (```torch.fbgemm_linear_quantize_weight```, ```torch.fbgemm_pack_quantized_matrix```, and ```torch.fbgemm_linear_int8_weight```) in ```test_quantized.py```.~~ Move this to D16404027 for a separate review.
- Add the Dynamic Quantized Linear module in ```torch/nn/quantized/modules/linear.py``` (see the usage sketch after this list). ~~This is in a rudimentary stage. Will add more functions later~~.
- Add the torch.quantize logic (prepare, eval, convert) for dynamic quantization.
- Add a unit test for the Dynamic Quantized Linear module in ```test_nn_quantized.py```.
- Add a unit test for the model-level quantization API.
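A minimal sketch of using the dynamic quantized Linear module directly; the `torch.nn.quantized.dynamic` path and `from_float` follow the current API and are assumptions about this commit's exact layout:
```
import torch
import torch.nn.quantized.dynamic as nnqd
from torch.quantization import default_dynamic_qconfig

fp32_linear = torch.nn.Linear(16, 8)
fp32_linear.qconfig = default_dynamic_qconfig

# Weights are quantized to int8 up front; activations stay fp32 and are
# quantized on the fly inside the op.
dq_linear = nnqd.Linear.from_float(fp32_linear)
out = dq_linear(torch.randn(2, 16))
```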
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23128
ghstack-source-id: 88257232
Differential Revision: D16258664
fbshipit-source-id: 4be3ac39ee27c088b341c741d3f09f51d5a23ef0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24196
The observer returns its input unchanged for post-training quantization. This unifies observer semantics for QAT and PTQ.
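A minimal sketch (not the actual implementation) of the observer contract described above: record statistics in forward, but return the input unchanged so inserting observers does not alter the numerics of a PTQ calibration run:
```
import torch

class PassThroughObserverSketch(torch.nn.Module):
    """Records min/max of the observed tensor; forward is the identity."""

    def __init__(self):
        super().__init__()
        self.min_val = float("inf")
        self.max_val = float("-inf")

    def forward(self, x):
        self.min_val = min(self.min_val, x.min().item())
        self.max_val = max(self.max_val, x.max().item())
        return x  # pass-through: output equals input
```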
ghstack-source-id: 88140887
Differential Revision: D16768277
fbshipit-source-id: fae7c94e3dc0eeda363e9982b3865a15113e11bd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23635
Adding new modules through a base class turns out to be no more complex than using a generation script.
Test Plan: Imported from OSS
Differential Revision: D16593364
Pulled By: zafartahirov
fbshipit-source-id: 852dcf41f3dfa2a89152042b8e61d0b6defa8feb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23465
We decided not to allow users to use qconfig_dict for quantization, since that API is not robust.
Differential Revision: D16611504
fbshipit-source-id: b0d1d311b32c990a165c480f50e9ce3d68b785b5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23003
Add torch.quantization.fuse_module and the torch.nn._intrinsic convRelu and LinearRelu modules.
The fusion function combines specific module sequences: (conv, bn) and (conv, bn, relu).
In all cases the modules are replaced in place: the first module is replaced with the _intrinsic fused module and the remaining modules are replaced by nn.Identity.
Both training and eval are supported. For training, the modules are "fused" into a sequential container, which allows further module swaps for quantization aware training.
Also add torch.nn._intrinsic modules for convRelu and LinearRelu.
TODO: Add tests for _intrinsic modules.
Conv+BN fusion code is based on DsKhudia's implementation.
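A minimal fusion sketch; `fuse_modules` (plural) is the name in current torch.quantization and is assumed here, and the model is put in eval mode since Conv+BN fusion folds BN statistics into the conv:
```
import torch
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.BatchNorm2d(8), nn.ReLU())
model.eval()

# "0", "1", "2" are the child names inside the Sequential; after fusion,
# child "0" holds the fused Conv+BN+ReLU and "1"/"2" become nn.Identity.
fused = torch.quantization.fuse_modules(model, [["0", "1", "2"]])
print(fused)
```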
Differential Revision: D16199720
fbshipit-source-id: 95fb9ffe72b361d280313b2ec57de2acd4f9dda2
Summary:
Add support for quantization aware training in eager mode
Modifications to the post training flow:
## Prepare
* Fusion: e.g. (Conv, Bn) → ConvBn (float)
* Swapping: To insert fake_quant for the weight, we need to swap the float modules that have weight with qat modules, e.g. Conv → torch.nn.qat.Conv, ConvBn → torch.nn._intrinsic.qat.ConvBn
* Previously we were thinking about modifying the weight in a forward_pre hook and changing it back in a forward hook:
```
def forward_pre_hook(self, input):
    self.float_weight = self.weight
    self.weight = self.fake_quantize(self.float_weight)

def forward_hook(self, input):
    self.weight = self.float_weight
```
* Assignments to self.weight are needed because we can't change the forward function, and the forward function uses self.weight.
* But we would need to keep two copies of the weight in this case, so it's better to just swap the module.
* So we just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear.
* The qat modules have fake_quant for outputs and weights inserted in their forward function.
## Convert
* The flow is identical to PTQ, but the swapping dictionary is slightly different since the modules were changed in the prepare step (see the sketch below).
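A hedged sketch of the eager-mode QAT flow described above, using current torch.quantization names (`get_default_qat_qconfig`, `prepare_qat`, `convert`), which are assumptions relative to this commit; fusion is omitted for brevity:
```
import torch
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU())
model.train()
model.qconfig = torch.quantization.get_default_qat_qconfig("fbgemm")

# Prepare: swap float modules for qat modules with fake_quant on weights/outputs.
torch.quantization.prepare_qat(model, inplace=True)

# ... training loop runs here with fake-quantized weights and activations ...

# Convert: swap qat modules for real quantized modules.
model.eval()
qmodel = torch.quantization.convert(model)
```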
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23082
ghstack-source-id: 86824650
Differential Revision: D16379374
fbshipit-source-id: 7d16d1acd87025065a24942ff92abf18e9fc8070
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22732
Add support for quantization aware training in eager mode
Modifications to the post training flow:
## Prepare
* Fusion: e.g. (Conv, Bn) → ConvBn (float)
* Swapping: To insert fake_quant for the weight, we need to swap the float modules that have weight with qat modules, e.g. Conv → torch.nn.qat.Conv, ConvBn → torch.nn._intrinsic.qat.ConvBn
* Previously we were thinking about modifying the weight in a forward_pre hook and changing it back in a forward hook:
```
def forward_pre_hook(self, input):
    self.float_weight = self.weight
    self.weight = self.fake_quantize(self.float_weight)

def forward_hook(self, input):
    self.weight = self.float_weight
```
* Assignments to self.weight are needed because we can't change the forward function, and the forward function uses self.weight.
* But we would need to keep two copies of the weight in this case, so it's better to just swap the module.
* So we just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear.
* The qat modules have fake_quant for outputs and weights inserted in their forward function.
## Convert
* The flow is identical to PTQ, but the swapping dictionary is slightly different since the modules were changed in the prepare step.
Reviewed By: zafartahirov
Differential Revision: D16199356
fbshipit-source-id: 62aeaf47c12c62a87d9cac208f25f7592e245d6c