pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jianyu Huang	584c6986fd	Add the type matching rule for qconfig_dict (#23212) Summary: We want to use the Module type as the key in qconfig_dict for module replacement during quantization. Before this Diff, to dynamically quantize the BERT model, we had to specify each layer: ``` qconfig_dict = { 'encoder.layer.0.attention.self.query': default_qconfig, 'encoder.layer.0.attention.self.key': default_qconfig, 'encoder.layer.0.attention.self.value': default_qconfig, 'encoder.layer.0.attention.output.dense': default_qconfig, 'encoder.layer.0.intermediate.dense': default_qconfig, 'encoder.layer.0.output.dense': default_qconfig, 'encoder.layer.1.attention.self.query': default_qconfig, 'encoder.layer.1.attention.self.key': default_qconfig, 'encoder.layer.1.attention.self.value': default_qconfig, 'encoder.layer.1.attention.output.dense': default_qconfig, 'encoder.layer.1.intermediate.dense': default_qconfig, 'encoder.layer.1.output.dense': default_qconfig, ... } ``` After this Diff, we only need the following: ``` qconfig_dict = { torch.nn.Linear : default_qconfig } ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/23212 ghstack-source-id: 88287091 Reviewed By: zafartahirov Differential Revision: D16436542 fbshipit-source-id: 11fbe68ee460560c1a7cdded63581eb7a00e5a89	2019-08-14 13:07:36 -07:00
Jianyu Huang	e94ba742b0	Dynamic Quantized Linear Module (#23128 ) Summary: - ~~Add a unit test for the Dynamic Quantized Linear operator (```torch.fbgemm_linear_quantize_weight```, ```torch.fbgemm_pack_quantized_matrix```, and ```torch.fbgemm_linear_int8_weight```) in ```test_quantized.py```.~~ Move this to D16404027 for a separate review. - Add the Dynamic Quantized Linear module in ```torch/nn/quantized/modules/linear.py```. ~~This is in a rudimentary stage. Will add more functions later~~. - Add the torch.quantize logic (prepare, eval, convert) for dynamic quantization. - Add a unit test for the Dynamic Quantized Linear module in ```test_nn_quantized.py```. - Add a unit test for the Model-level Quantization API Pull Request resolved: https://github.com/pytorch/pytorch/pull/23128 ghstack-source-id: 88257232 Differential Revision: D16258664 fbshipit-source-id: 4be3ac39ee27c088b341c741d3f09f51d5a23ef0	2019-08-13 21:01:23 -07:00
Zafar Takhirov	4cc16782f3	Removing the make_module script. (#23635) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23635 Adding new modules with a base class appears to be no more complex than adding them with a generation script. Test Plan: Imported from OSS Differential Revision: D16593364 Pulled By: zafartahirov fbshipit-source-id: 852dcf41f3dfa2a89152042b8e61d0b6defa8feb	2019-08-13 09:58:28 -07:00
Jerry Zhang	89956374c3	Remove qconfig_dict from API (#23465) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23465 We decided not to allow users to use qconfig_dict for quantization, since that API is not robust. Differential Revision: D16611504 fbshipit-source-id: b0d1d311b32c990a165c480f50e9ce3d68b785b5	2019-08-02 10:28:48 -07:00
Zafar Takhirov	9c549dfdc1	make_module: First version Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23288 Test Plan: Imported from OSS Differential Revision: D16455390 Pulled By: zafartahirov fbshipit-source-id: 4352f0a17cd0382b48502b93e51574cc3acdfdcc	2019-07-30 22:14:44 -07:00
Jerry Zhang	bc64324da9	Change condition in swap module Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23561 Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D16570928 Pulled By: jerryzh168 fbshipit-source-id: 70f36f577ac657d015f3d7738819867742088e5a	2019-07-30 17:25:02 -07:00
Jerry Zhang	7364aa796d	skip nn.Identity in add_observer Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23500 Test Plan: e2e test in quantizing resnext 101 Imported from OSS Differential Revision: D16550190 Pulled By: jerryzh168 fbshipit-source-id: 6128d7c3419235152b43739fcc5cade34342ba3d	2019-07-30 11:00:36 -07:00
Jerry Zhang	d7448c7812	quantized conv module (#23178) Summary: As the title says. Pull Request resolved: https://github.com/pytorch/pytorch/pull/23178 ghstack-source-id: 86973164 Differential Revision: D16426871 fbshipit-source-id: a2ebb38997acfeb61b7dfd6b11dd8ee9b3a7a8ed	2019-07-22 20:47:40 -07:00
Jerry Zhang	77353636de	Conv module (#23084) Summary: Added Conv module for qat Pull Request resolved: https://github.com/pytorch/pytorch/pull/23084 ghstack-source-id: 86862445 Differential Revision: D16379417 fbshipit-source-id: 742cc8b8e0f132070ca4943a1c2e3db60c2b5bdc	2019-07-19 18:49:52 -07:00
Jerry Zhang	7cc029cb75	Quantization aware training in eager mode (#23082) Summary: Add support for quantization aware training in eager mode. Modifications to the post-training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant on the weight, we need to swap the float modules that have weights with the corresponding qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modifying the weight in the forward_pre hook and changing it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change the forward function, and the forward function uses self.weight. * But we would need to keep two copies of the weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in the forward function ## Convert * The flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in the prepare step. Pull Request resolved: https://github.com/pytorch/pytorch/pull/23082 ghstack-source-id: 86824650 Differential Revision: D16379374 fbshipit-source-id: 7d16d1acd87025065a24942ff92abf18e9fc8070	2019-07-19 14:57:25 -07:00
Soumith Chintala	84c2c89e2c	Revert D16199356: [qat] Quantization aware training in eager mode Differential Revision: D16199356 Original commit changeset: 62aeaf47c12c fbshipit-source-id: d06a96b0a617ae38029ffb246173ec065454b666	2019-07-19 03:18:48 -07:00
Soumith Chintala	f19aa12ae5	Revert D16274792: [qat] Conv module Differential Revision: D16274792 Original commit changeset: 1da10194123b fbshipit-source-id: 71b34774b463f2350289bd39b8cfd798e095ffa5	2019-07-19 03:18:45 -07:00
Jerry Zhang	12d9d768b8	Conv module (#22899) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22899 Added Conv module for qat Reviewed By: zafartahirov Differential Revision: D16274792 fbshipit-source-id: 1da10194123b2759a6a35c60d1c2d2c0b569ccdc	2019-07-18 18:58:07 -07:00
Jerry Zhang	65ef671d11	Quantization aware training in eager mode (#22732) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22732 Add support for quantization aware training in eager mode. Modifications to the post-training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant on the weight, we need to swap the float modules that have weights with the corresponding qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modifying the weight in the forward_pre hook and changing it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change the forward function, and the forward function uses self.weight. * But we would need to keep two copies of the weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in the forward function ## Convert * The flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in the prepare step. Reviewed By: zafartahirov Differential Revision: D16199356 fbshipit-source-id: 62aeaf47c12c62a87d9cac208f25f7592e245d6c	2019-07-18 18:58:03 -07:00
Jerry Zhang	b984b0ab4b	fix print (#22689) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22689 As the title says. Reviewed By: Lucaskabela Differential Revision: D16184260 fbshipit-source-id: 1a6ad51a37918d0c81d6e3baa0ca0baa32cb9673	2019-07-10 11:26:34 -07:00
Jerry Zhang	5040d52a5a	torch.quantization conversion utilities, observers for eager mode quantization (#22010) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22010 torch.quantization module with observers and conversion routines Reviewed By: zafartahirov Differential Revision: D15554183 fbshipit-source-id: 05a3fabe28dd701978b8ecebf5bfc3a4c044ba5c	2019-07-09 10:51:38 -07:00
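
Note on commit 584c6986fd: the type matching rule it describes (keying qconfig_dict by Module type instead of by each layer's dotted name) can be illustrated with a small, dependency-free sketch. This is not PyTorch's implementation. The classes `Module`, `Linear`, and `LayerNorm` and the helper `resolve_qconfigs` are hypothetical stand-ins, and the rule that a name key takes precedence over a type key is an assumption made for the example.

```python
# Hypothetical stand-ins for a module tree; these are NOT torch.nn classes.
class Module:
    def __init__(self, **children):
        self.children = children  # child name -> child module


class Linear(Module):
    pass


class LayerNorm(Module):
    pass


def resolve_qconfigs(root, qconfig_dict, prefix=""):
    """Map each matched submodule's dotted name to its qconfig.

    Keys of qconfig_dict may be dotted names (str) or module types.
    Assumption for this sketch: a name match wins over a type match.
    """
    resolved = {}
    for name, child in root.children.items():
        path = f"{prefix}.{name}" if prefix else name
        if path in qconfig_dict:            # per-layer key
            resolved[path] = qconfig_dict[path]
        elif type(child) in qconfig_dict:   # type key
            resolved[path] = qconfig_dict[type(child)]
        # Recurse so nested layers are matched as well.
        resolved.update(resolve_qconfigs(child, qconfig_dict, path))
    return resolved


default_qconfig = "default_qconfig"  # placeholder value for illustration

model = Module(
    encoder=Module(query=Linear(), key=Linear(), norm=LayerNorm()),
    classifier=Linear(),
)

# One type key covers every Linear layer; LayerNorm is left untouched.
by_type = resolve_qconfigs(model, {Linear: default_qconfig})

# The equivalent name-keyed dict has to list each layer explicitly.
by_name = resolve_qconfigs(
    model,
    {
        "encoder.query": default_qconfig,
        "encoder.key": default_qconfig,
        "classifier": default_qconfig,
    },
)
```

In this sketch both dictionaries resolve to the same set of layers, which is the reduction in boilerplate the commit message describes for the BERT example.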

116 Commits