pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Alban Desmaison	25db74bf5e	Revert D24486972: [quant][graphmode][fx] Support sigmoid/hardsigmoid/tanh in qat Test Plan: revert-hammer Differential Revision: D24486972 (`e927b62e73`) Original commit changeset: c9f139bfdd54 fbshipit-source-id: 2a75f5ec93d55a62b40d1cdd49adcf65436058f7	2020-10-26 12:47:05 -07:00
Jerry Zhang	e927b62e73	[quant][graphmode][fx] Support sigmoid/hardsigmoid/tanh in qat (#46738 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/46738 Test Plan: Imported from OSS Reviewed By: raghuramank100 Differential Revision: D24486972 fbshipit-source-id: c9f139bfdd54973da1a93a45e32937595dbe67fc	2020-10-26 12:04:42 -07:00
Jerry Zhang	13decddae2	[reland][quant] Add FixedQParamsFakeQuantize module (#45538 ) (#46657 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/46657 This is used to simulate fake quantize operation for ops with fixed quantization parameters e.g. hardsigmoid Test Plan: Imported from OSS Imported from OSS Reviewed By: vkuzo Differential Revision: D24451406 fbshipit-source-id: 26cc140c00f12bdec9a8f9dc880f4c425f4d4074	2020-10-21 16:47:11 -07:00
Ashkan Aliabadi	2181449068	Revert D24004795: [quant] Add FixedQParamsFakeQuantize module Test Plan: revert-hammer Differential Revision: D24004795 (`253918ec55`) Original commit changeset: fc4797f80842 fbshipit-source-id: 663169e90a2f58e5a89e4d382291ae41c24d0fee	2020-10-20 19:40:21 -07:00
Jerry Zhang	253918ec55	[quant] Add FixedQParamsFakeQuantize module (#45538 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45538 This is used to simulate fake quantize operation for ops with fixed quantization parameters e.g. hardsigmoid Test Plan: Imported from OSS Reviewed By: vkuzo Differential Revision: D24004795 fbshipit-source-id: fc4797f80842daacd3b3584c5b72035774634edd	2020-10-20 17:43:25 -07:00
Jerry Zhang	0da6730f02	[quant][graphmode][fx][eagermode] Add leaky relu support in quantization workflows (#45712 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45712 Eager mode will still be able to use functional leaky relu, but it will be less accurate than LeakyReLU module. FX graph mode will support both leaky relu functional and module Test Plan: Imported from OSS Reviewed By: z-a-f Differential Revision: D24069961 fbshipit-source-id: 8d91c3c50c0bcd068ba3072378ebb4da9549be3b	2020-10-06 12:16:04 -07:00
Supriya Rao	6013a29fc0	[quant] Support quantization of embedding lookup operators (#44207 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44207 Use existing embedding_bag operator but set offsets to [0, 1, .. len(indices)] Test Plan: python test/test_quantization.py TestEmbeddingOps.test_embedding_byte Imported from OSS Reviewed By: vkuzo Differential Revision: D23547385 fbshipit-source-id: ccce348bc192c6a4a65a8eca4c8b90f99f40f1b1	2020-09-08 19:03:59 -07:00
Jerry Zhang	5a1aa0e21e	[reland][quant][graphmode][fx] Add e2e test on torchvision (#43587 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43587 Add tests for graph mode quantization on torchvision and make sure it matches current eager mode quantization Test Plan: Imported from OSS Imported from OSS Reviewed By: z-a-f Differential Revision: D23331253 fbshipit-source-id: 0445a44145d99837a2c975684cd0a0b7d965c8f9	2020-08-27 10:12:07 -07:00
Mikhail Zolotukhin	be637fd5f6	Revert D23306683: [quant][graphmode][fx] Testing torchvision Test Plan: revert-hammer Differential Revision: D23306683 (`62dcd253e3`) Original commit changeset: 30d27e225d45 fbshipit-source-id: e661334d187d3d6756facd36f2ebdb3ab2cd2e26	2020-08-25 15:24:02 -07:00
Jerry Zhang	62dcd253e3	[quant][graphmode][fx] Testing torchvision (#43526 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43526 Add tests for graph mode quantization on torchvision and make sure it matches current eager mode quantization Test Plan: Imported from OSS Reviewed By: vkuzo Differential Revision: D23306683 fbshipit-source-id: 30d27e225d4557bfc1d9aa462086e416aa9a9c0e	2020-08-25 13:02:14 -07:00
Edmund Williams Jr	17f9edda42	Bias Correction Implementation (#41845 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/41845 Test Plan: Imported from OSS Reviewed By: jerryzh168 Differential Revision: D22661503 Pulled By: edmundw314 fbshipit-source-id: a88c349c6cc15b1c66aa6dee7593ef3df588eb85	2020-08-20 21:40:33 -07:00
Jerry Zhang	b0ec336477	[quant][graphmode][fx][test] Add per op test for graph mode quant on fx (#43229 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43229 Test Plan: Imported from OSS Reviewed By: supriyar Differential Revision: D23201692 fbshipit-source-id: 37fa54dcf0a9d5029f1101e11bfd4ca45b422641	2020-08-20 17:32:02 -07:00
Jerry Zhang	dae2973fae	[quant][graphmode][fx] Add graph mode quantization on fx (#43175 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43175 This PR added graph mode quantization on fx: https://github.com/pytorch/pytorch/pull/42741 Currently it matches eager mode quantization for torchvision with static/dynamic/qat ddp/synbn test is still wip Test Plan: python test/test_quantization.py TestQuantizeFx Imported from OSS Reviewed By: vkuzo Differential Revision: D23178602 fbshipit-source-id: 8e7e0322846fbda2cfa79ad188abd7235326f879	2020-08-20 14:50:09 -07:00
Mike Ruberry	b7a9bc0802	Revert D22217029: Add fake quantize operator that works in backward pass Test Plan: revert-hammer Differential Revision: D22217029 (`48e978ba18`) Original commit changeset: 7055a2cdafcf fbshipit-source-id: f57a27be412c6fbfd5a5b07a26f758ac36be3b67	2020-08-07 23:04:40 -07:00
Presley Graham	48e978ba18	Add fake quantize operator that works in backward pass (#40532 ) Summary: This diff adds FakeQuantizeWithBackward. This works the same way as the regular FakeQuantize module, allowing QAT to occur in the forward pass, except it has an additional quantize_backward parameter. When quantize_backward is enabled, the gradients are fake quantized as well (dynamically, using hard-coded values). This allows the user to see whether there would be a significant loss of accuracy if the gradients were quantized in their model. Pull Request resolved: https://github.com/pytorch/pytorch/pull/40532 Test Plan: The relevant test for this can be run using `python test/test_quantization.py TestQATBackward.test_forward_and_backward` Reviewed By: supriyar Differential Revision: D22217029 Pulled By: durumu fbshipit-source-id: 7055a2cdafcf022f1ea11c3442721ae146d2b3f2	2020-08-07 17:47:01 -07:00
Edmund Williams Jr	fd62847eb2	cross_layer_equalization (#41685 ) Summary: The goal is to implement cross layer equalization as described in section 4.1 in this paper: https://arxiv.org/pdf/1906.04721.pdf Given two adjacent submodules in a trained model, A,B quantization might hurt one of the submodules more than the other. The paper poses the idea that a loss in accuracy from quantizing can be due to a difference in the channel ranges between the two submodules (the output channel range of A can be small, while the input channel range of B can be large). To minimize this source of error, we want to scale the tensors of A,B s.t. their channel ranges are equal (them being equal means no difference in ranges and minimizes this source of error). Pull Request resolved: https://github.com/pytorch/pytorch/pull/41685 Test Plan: Imported from OSS Reviewed By: z-a-f Differential Revision: D22630219 Pulled By: edmundw314 fbshipit-source-id: ccc91ba12c10b652d7275222da8b85455b8a7cd5	2020-07-22 08:39:23 -07:00
Radhakrishnan Venkataramani	f41173b975	[PyPer][quant] Add quantized embedding operators to OSS. (#40076 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40076 Pull Request resolved: https://github.com/pytorch/glow/pull/4606 [PyPer][quant] Add quantized embedding operators to OSS. This is the first step in supporting Graph Mode Quantization for EmbeddingBag. At a high level, the next steps would be a) Implementation of Embedding prepack/unpack operators, b) Implementation of torch.nn.quantized.dynamic.EmbeddingBag Module, c) Implementation of torch.nn.quantized.EmbeddingBag Module, d) Implementation (modification) of IR passes to support graph quantization of EmbeddingBag module. More in-depth details regarding each step will be in the follow up diffs. Consider this as an initial diff that moves operators to respective places that's required for us to proceed. Test Plan: ```buck test mode/no-gpu caffe2/test:quantization -- --stress-runs 100 test_embedding_bag``` Reviewed By: supriyar Differential Revision: D21949828 fbshipit-source-id: cad5ed0a855db7583bddb1d93e2da398c128024a	2020-06-25 12:01:49 -07:00
Jerry Zhang	9f9e7c1d71	[quant][refactor] Tests for torch.jit.quantized (#40330 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40330 Test Plan: Imported from OSS Differential Revision: D22149707 fbshipit-source-id: 44e7545bf9277d9245b5e9c2d9461f664fff0426	2020-06-22 10:41:31 -07:00
Jerry Zhang	b2f489dc57	[quant][graphmode] Rename graph mode quantization API to `quantize_jit` (#40212 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40212 Test Plan: Imported from OSS Reviewed By: z-a-f Differential Revision: D22144745 fbshipit-source-id: 38a19b5afdddbbce262eea8ddf5b68458e6017b3	2020-06-19 18:13:37 -07:00
Edmund Williams Jr	465138ec39	refactoring TestQuantizeScript (#39677 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39677 Test Plan: Moved a test class suite between files, wanted to have same functionality (simple code refactor) so tested to make sure the test output was the same before/after the refactor. Image below shows the output of TestGraphModePostTrainingStatic before refactor {F239676498} This image shows the output of TestQuantizeScript (renamed version that is in test_quantize_script.py instead of test_quantize.py) {F239676509} Differential Revision: D21940638 Pulled By: edmundw314 fbshipit-source-id: 54160a5151aadf3a34bdac2bcaeb52904e6653ed	2020-06-19 11:47:00 -07:00
Wanchao Liang	442ec1dd4e	[test] split remaining quantization tests out of test_jit (#40144 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40144 as title, split remaining quantization tests out of test_jit to reduce the size of test_jit Test Plan: Imported from OSS Differential Revision: D22085034 Pulled By: wanchaol fbshipit-source-id: 0c8639da01ffc3e6a72e6f470837786c73a6b3f0	2020-06-18 13:39:13 -07:00
Supriya Rao	f6739ec8e8	[quant][graphmode] Refactor dynamic quant tests (#40127 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40127 Reland PR. Similar to static quant, break it up into op level tests and tests for jit passes Test Plan: python test/test_quantization.py TestQuantizeScriptPTDQOps python test/test_quantization.py TestDynamicQuantizeScriptJitPasses Imported from OSS Differential Revision: D22081259 fbshipit-source-id: cef8f78f89ef8789683b52508379ae1b9ad00700	2020-06-17 13:40:19 -07:00
Supriya Rao	b5d54db6f4	Revert D22071278: [quant][graphmode] Refactor dynamic quant tests Test Plan: revert-hammer Differential Revision: D22071278 Original commit changeset: 54292addcfbc fbshipit-source-id: 20ffbea0fd05e974b31381437c61040b5b24c993	2020-06-16 15:01:05 -07:00
Supriya Rao	ddeaa74382	[quant][graphmode] Refactor dynamic quant tests (#40039 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40039 Similar to static quant, break it up into op level tests and tests for jit passes Test Plan: python test/test_quantization.py TestQuantizeScriptPTDQOps python test/test_quantization.py TestDynamicQuantizeScriptJitPasses Imported from OSS Differential Revision: D22071278 fbshipit-source-id: 54292addcfbc00f7af960fb333921db2ff9fda04	2020-06-16 13:14:48 -07:00
Kimish Patel	bb12e4dca0	Add JIT fusion pass to fuse quantized add and relu. (#38897 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38897 Quantized ops support add_relu. This pass enables finding quantized add + relu pattern and fuse them to add_relu. Test Plan: buck run caffe2/test:quantization -- test_quantization.TestFusionPasses Reviewed By: jerryzh168 Differential Revision: D21690909 fbshipit-source-id: 607cf72dde535df15eb7638841543ab2156af464	2020-05-27 14:16:57 -07:00
Vasiliy Kuznetsov	b57c8b720e	[wip] Make quantization modules work with DataParallel (#37032 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37032 DataParallel requires all params and buffers of child modules to be updated in place because of how it implements model replication during the forward pass (see https://github.com/pytorch/pytorch/pull/12671 for context). Any params or buffers not updated in place are lost and not propagated back to the master. This diff updates (some quantized modules) (TBD: all quantized modules? determine a good cut point) to do their parameter update in-place. This will enable static quant and QAT to work correctly with DataParallel. TODO: https://github.com/pytorch/pytorch/pull/32684 needs to land before we can fix the graph mode test failures on this PR. Test Plan: script failed before and passes after the diff: https://gist.github.com/vkuzo/78b06c01f23f98ee2aaaeb37e55f8d40 TODO before land: add integration testing Imported from OSS Differential Revision: D21206454 fbshipit-source-id: df6b4b04d0ae0f7ef582c82d81418163019e96f7	2020-05-05 13:06:43 -07:00
Zafar Takhirov	a09cb5f2f5	[quant] quantized reflection_pad1d (#37452 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37452 Test Plan: Imported from OSS Differential Revision: D21286659 Pulled By: z-a-f fbshipit-source-id: f9f4de497a790b296149313562d09f8ead5facee	2020-04-30 18:45:38 -07:00
Zafar	297cc5512e	[quant] Enable convolution tests (#37494 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37494 Test Plan: Imported from OSS Differential Revision: D21299442 Pulled By: z-a-f fbshipit-source-id: 68513b52aaef852278f28031866f85123b016486	2020-04-29 12:24:45 -07:00
Jerry Zhang	facdd15cc6	[quant] Finishing refactor for quantization test files (#37366 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37366 - we can put both fake quant module and observer module tests in the test_workflow_module.py - added test_quantized_functional.py - moved tests in test_numerics.py to test_quantize.py and removed test_numerics.py Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D21282198 fbshipit-source-id: 60107cee7d1ed2cd14a45650e91ec28b8a262c52	2020-04-28 21:40:57 -07:00
Jerry Zhang	230b68168b	[quant] Refactor test files (#36964 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36964 Rename and restructure quantization related tests https://github.com/pytorch/pytorch/issues/31625 Test Plan: . Imported from OSS Differential Revision: D21192509 fbshipit-source-id: 148c93e86e0ea68ab18a067fe74a8035a29a1e4e	2020-04-23 10:28:56 -07:00
Jerry Zhang	ab26dfb44e	[quant] Move quantization tests into test/quantization (#35812 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35812 Test Plan: . Imported from OSS Differential Revision: D20795329 fbshipit-source-id: 42cc905c44ce7b86720aeef512d747ff6788d7a2	2020-04-01 12:44:19 -07:00
Michael Suo	319aee1afb	Revert D20771828: [quant] Move quantization tests into test/quantization Test Plan: revert-hammer Differential Revision: D20771828 Original commit changeset: 5f1df5e86c29 fbshipit-source-id: d14f915f291ae8a90026c5b65624459211495f47	2020-03-31 23:01:00 -07:00
Jerry Zhang	fef6c617d4	[quant] Move quantization tests into test/quantization (#35688 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35688 Test Plan: . Imported from OSS Differential Revision: D20771828 fbshipit-source-id: 5f1df5e86c29f7bdfbdc6563450e909b3bfdc07a	2020-03-31 20:30:57 -07:00
Supriya Rao	a090de380c	[quant][graph] Add quant fusion for dynamic quantization (#35586 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35586 This pass fuses the choose_qparams-quant-dequant sequence Fusion for weight tensor is the same as static quant. Test Plan: python test/test_quantize_script.py Imported from OSS Differential Revision: D20755680 fbshipit-source-id: b7443770642b6e6fa0fa9da8a44637e9b2d4df70	2020-03-30 23:34:56 -07:00
Jerry Zhang	6fc2403951	[quant][graphmode] qconfig_dict support None (#35336 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35336 Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D20655302 fbshipit-source-id: b453f3240ac487aa29629953b4d71274dbbc25fc	2020-03-29 12:47:47 -07:00
Supriya Rao	daba68c601	[quant][graph] Add a new observer type for dynamic quantization (#35455 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35455 In graph mode we need to observer the activation tensor for dynamic quantization. This observer should behave the same way as the quantization functions called in the dynamic operator. Currently for qlinear_dynamic we call quant_utils::ChooseQuantizationParams which has its own logic for calculating scale and zero_point. We mimic those calculations in the new observer. Test Plan: python test/test_quantization.py ObserverTest Imported from OSS Differential Revision: D20664586 fbshipit-source-id: e987ea71fff777c21e00c498504e6586e92568a2	2020-03-26 17:38:21 -07:00
Supriya Rao	b4b8b3c0ca	Revert D20630988: [quant][graph] Add a new observer type for dynamic quantization Test Plan: revert-hammer Differential Revision: D20630988 Original commit changeset: 7e7aca77590f fbshipit-source-id: 6bc67ca322c1703004e0053f8eba9b8f6a3a5f67	2020-03-25 18:52:21 -07:00
Supriya Rao	7e24ab8c4a	[quant][graph] Add a new observer type for dynamic quantization (#35265 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35265 In graph mode we need to observer the activation tensor for dynamic quantization. This observer should behave the same way as the quantization functions called in the dynamic operator. Currently for qlinear_dynamic we call quant_utils::ChooseQuantizationParams which has its own logic for calculating scale and zero_point. We mimic those calculations in the new observer. Test Plan: python test/test_quantization.py ObserverTest Imported from OSS Differential Revision: D20630988 fbshipit-source-id: 7e7aca77590f965dcb423a705e68d030aaf98550	2020-03-25 16:50:05 -07:00
Lingyi Liu	fddcd72a31	Add the more fusion (conv3d and batchnorm)support in pytorch quantization flow (#33540 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33540 Differential Revision: D19994498 Pulled By: lly-zero-one fbshipit-source-id: e5e13eab6924bd2ce1b57b16b672844b8b9638f5	2020-03-23 20:36:03 -07:00
Jerry Zhang	90ca7a1feb	[quant][graphmode] Add Finalize function that inlines graph and produce quantized ops (#33927 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33927 Test Plan: test will be added in later PRs Imported from OSS Differential Revision: D20354879 fbshipit-source-id: 03976f4b86c46dbdc4e45764a1e72f1a3855a404	2020-03-12 14:52:58 -07:00
James Reed	8a17dc65af	[quantization] Make FP16 RNN use new prepack op (#34339 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34339 Test Plan: Imported from OSS Differential Revision: D20297194 Pulled By: jamesr66a fbshipit-source-id: 8bf6d0f2cb047e90bbdd184aaad337b143040d10	2020-03-07 10:04:01 -08:00
Supriya Rao	e236e15934	[quant] Run weight_post_process for QAT (#33852 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33852 This fixes an issue for QAT models. During eval if we call `prepare_qat` and `convert` before calling `load_state_dict` it throws an error because the weight info (num channels) is not updated in the observer module. It is not an issue for per-tensor case Fixes issue #33830 Test Plan: python test/test_quantization.py EagerModePostTrainingQuantTest.test_eval_after_train python test/test_quantization.py EagerModeQuantizationAwareTrainingTest.test_eval_after_train Imported from OSS Differential Revision: D20212996 fbshipit-source-id: a04af8fe4df2e555270ae4d6693f5777d86f8a46	2020-03-04 14:01:32 -08:00
Dmytro Dzhulgakov	a8fc3d8c2a	Fix HistogramObserver to not do detach on input (#34114 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/33545, added a unittest Pull Request resolved: https://github.com/pytorch/pytorch/pull/34114 Differential Revision: D20224719 Pulled By: dzhulgakov fbshipit-source-id: 053d3b3b0c86340027ba1b95b5f3c247aa151aee	2020-03-03 13:15:22 -08:00
Jianyu Huang	5ef1c2c5d2	Back out "[pt][quant] RNN debug test" (#33750 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33750 Original commit changeset: 8c38d8f067e5 ghstack-source-id: 98911215 Test Plan: CI Differential Revision: D20090521 fbshipit-source-id: 73df43ad60574e44e80b36ebf6392030c3efb66e	2020-02-25 09:28:00 -08:00
Jianyu Huang	5b031d961d	[pt][quant] RNN debug test (#33621 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33621 ghstack-source-id: 98746093 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details Differential Revision: D20036968 fbshipit-source-id: 7cbb027a6afbe28bc250fc663089c6a9406e880b	2020-02-24 16:15:17 -08:00
Supriya Rao	c2d736cefb	Add support for Dynamic LSTM quantization on Mobile (#32757 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32757 This PR updates the main quantize_dynamic API to use QNNPACK backend for mobile Test Plan: python test/test_quantization.py PostTrainingDynamicQuantTest.test_quantized_rnn Imported from OSS Differential Revision: D19632220 fbshipit-source-id: b4c51485c281d088524101b97c84dd806438b597	2020-01-29 20:55:48 -08:00
Jianyu Huang	6f7d5bb3e1	Temporarily disable the test_quantized_rnn test (#32742 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32742 As Title says (Check https://github.com/pytorch/pytorch/issues/32644). ghstack-source-id: 97352793 Test Plan: CI Differential Revision: D19611029 fbshipit-source-id: 9f4a155c909f419e41c1d7078eb2796dd17cedd2	2020-01-28 16:50:59 -08:00
James Reed	812b1ad869	[quantization] FP16 dynamic quantized Linear Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32331 Test Plan: Imported from OSS Differential Revision: D19441158 Pulled By: jamesr66a fbshipit-source-id: c04247ffe707be68718c486c31bc6c6040f7dc11	2020-01-27 15:45:32 -08:00
Jerry Zhang	4cd6b5cda6	[quant] Re-enable test_nested that has different qconfig for shared ClassType (#32206 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32206 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D19508028 fbshipit-source-id: 5de3c2ef17de146feca03d7135a7e04f393de398	2020-01-23 15:32:57 -08:00
Pritam Damania	f050b16dd9	Move pytorch distributed tests to separate folder for contbuild. (#30445 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30445 Create distributed and rpc directories under caffe/test for better management of unit tests. Differential Revision: D18702786 fbshipit-source-id: e9daeed0cfb846ef68806f6decfcb57c0e0e3606	2020-01-22 21:16:59 -08:00
Jerry Zhang	4314620ba0	[jit] Module clone work with shared ClassType (#31970 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31970 Now that the ClassType can be shared among different module instances, we'll preserve the sharing in clone as well, that is if the original module has a ClassType that is shared, we'll clone this ClassType once and share it between different module instances as well. Test Plan: build/test/test_jit Imported from OSS Differential Revision: D19406251 fbshipit-source-id: 2881c695f6e718e5432040a3817cf187a62017bf	2020-01-15 11:24:53 -08:00
James Reed	a3cdb7eca3	Fix default instantation of dynamic quantized LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31433 Test Plan: Imported from OSS Differential Revision: D19164539 Pulled By: jamesr66a fbshipit-source-id: 7045817ab3dfb530c4480a10523c4c6bcdbfc7eb	2019-12-18 16:59:00 -08:00
Michael Suo	62b10721fb	Actually make flake8 do something (#30892 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30892 Fixes all outstanding lints and actually installs a properly configured flake8 Test Plan: Imported from OSS Differential Revision: D18862825 Pulled By: suo fbshipit-source-id: 08e9083338a7309272e17bb803feaa42e348aa85	2019-12-06 17:50:50 -08:00
James Reed	4fd20c0816	Kill hypothesis deadline testing (#30890 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30890 We've received way too many complaints about this functionality making tests flaky, and it's not providing value to us anyway. Let's cut the shit and kill deadline testing Test Plan: Imported from OSS Differential Revision: D18857597 Pulled By: jamesr66a fbshipit-source-id: 67e3412795ef2fb7b7ee896169651084e434d2f6	2019-12-06 13:36:14 -08:00
Jerry Zhang	58cdf1429c	Add tests for quantizing traced models (#30476 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30476 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18795724 fbshipit-source-id: 9253e102bf458d9185f68848071a4e4eff9f9b08	2019-12-05 23:03:45 -08:00
Jerry Zhang	1fa4908ac0	Refactor test_quantization.py and enable `test_nested` (#30475 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30475 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18795727 fbshipit-source-id: c9942c5361e0a34e91a08b8fc27405799db7ff4f	2019-12-05 21:56:03 -08:00
Brian Wignall	e7fe64f6a6	Fix typos (#30606 ) Summary: Should be non-semantic. Uses https://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines to find likely typos. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30606 Differential Revision: D18763028 Pulled By: mrshenli fbshipit-source-id: 896515a2156d062653408852e6c04b429fc5955c	2019-12-02 20:17:42 -08:00
Lingyi Liu	59ca9b7430	Graph-mode quantization for convolution from traced model (#30245 ) Summary: In the PR, we enhance the graph-mode quantization for aten::_convolution, which could be generated from tracing path. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30245 Differential Revision: D18671597 Pulled By: lly-zero-one fbshipit-source-id: 78a2470fbb0fe0def55d63c6bda7cbb5c89f7848	2019-11-23 01:24:50 -08:00
Lingyi Liu	7d3afc4186	enable the per channel dynamic quantization (#30122 ) Summary: The PR tried to enable the per-channel(row-wise) dynamic quantization for linear operator. Given we have seen some accuracy drop due to the per-tensor quantization, we expect the per-channel could help improve the accuracy. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30122 Differential Revision: D18630541 Pulled By: lly-zero-one fbshipit-source-id: d52685deec5e7de46cd686ae649a8c8765b9cacf	2019-11-21 10:12:05 -08:00
Jerry Zhang	b2291d4600	Make PerChannelMinMaxObserver scriptable using `torch.jit.ignore` (#29416 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29416 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18580906 fbshipit-source-id: 5370300b89e26c2b4662b17e51284e8708cb5843	2019-11-19 19:12:55 -08:00
James Reed	20fb8a814c	PackedSequence support for quantized LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29585 Test Plan: Imported from OSS Differential Revision: D18436569 Pulled By: jamesr66a fbshipit-source-id: 0f32c0fcc897894e30d8e7ff203392c1a961ce60	2019-11-12 20:13:38 -08:00
Jerry Zhang	4bcf4796aa	Make HistogramObserver scriptable with `@torch.jit.ignore` (#27950 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27950 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18360139 fbshipit-source-id: 5459ae49c087886e4990de136198773a75b1c572	2019-11-07 18:02:44 -08:00
James Reed	821f8bfc2f	Fix tracing for dynamic quantized LSTM (#29331 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331 Closes #27954 This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance: 1) Each variadic parameter on the module has its own Module. That Module defines the `__getstate__` and __setstate__` method s.t. packed weights are properly re-done on model load. 2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (`9c43b16df9/torch/csrc/jit/tracer.cpp (L285)`) can see these parameters and create a `Value*` for them in the traced graph. 3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` does its business. The `forward` of the traced LSTM before and after this change are as follows: Before ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: hx, hx0, = argument_2 _0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_0, (_1, _2)) ``` After ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: _0 = self.cell._all_weight_values _1 = getattr(_0, "0").param _2 = getattr(_0, "1").param hx, hx0, = argument_2 _3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_3, (_4, _5)) ``` Test Plan: Imported from OSS Differential Revision: D18374904 Pulled By: jamesr66a fbshipit-source-id: f1a9b58998bc365b9baad38c21fd4bb510dd639c	2019-11-07 13:45:39 -08:00
Mike Ruberry	84a6583ba1	Revert D18359880: Fix tracing for dynamic quantized LSTM Test Plan: revert-hammer Differential Revision: D18359880 Original commit changeset: 0ff2cad294a1 fbshipit-source-id: 834cd43b39fb754f90c8b18b8ab9b837f2b511ab	2019-11-06 21:10:33 -08:00
James Reed	f17e02fd94	Fix tracing for dynamic quantized LSTM (#29331 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331 Closes #27954 This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance: 1) Each variadic parameter on the module has its own Module. That Module defines the `__getstate__` and __setstate__` method s.t. packed weights are properly re-done on model load. 2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (`9c43b16df9/torch/csrc/jit/tracer.cpp (L285)`) can see these parameters and create a `Value*` for them in the traced graph. 3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` does its business. The `forward` of the traced LSTM before and after this change are as follows: Before ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: hx, hx0, = argument_2 _0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_0, (_1, _2)) ``` After ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: _0 = self.cell._all_weight_values _1 = getattr(_0, "0").param _2 = getattr(_0, "1").param hx, hx0, = argument_2 _3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_3, (_4, _5)) ``` Test Plan: Imported from OSS Differential Revision: D18359880 Pulled By: jamesr66a fbshipit-source-id: 0ff2cad294a1871123015dfc704eaf73a7ac1d9e	2019-11-06 17:02:12 -08:00
Xiang Gao	25e261d6d5	assertEquals is deprecated, use assertEqual instead Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28335 Differential Revision: D18263456 Pulled By: ngimel fbshipit-source-id: c0f79071feaa5a4c3c4b20505013bf7c4b5455d5	2019-11-05 09:52:21 -08:00
Jerry Zhang	d690521cf6	Add e2e test for conv+bn (#27348 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27348 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18182920 fbshipit-source-id: 40edc4d85903f979cd4755d6785d2842faa4d566	2019-11-01 11:28:47 -07:00
Jerry Zhang	59c5de4d0e	Don't permute in quantized::conv2d pattern (#27347 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27347 it's already done in the op, we don't need to permute again Test Plan: test_jit.py we'll test in e2e tests Imported from OSS Differential Revision: D18182919 fbshipit-source-id: 04dd2a19a719828fbc7b62e451b81752187e0fcb	2019-10-31 15:58:28 -07:00
Jerry Zhang	1c436ded44	Remove `test_quantizer.py` and reuse one of its test in `test_quantization.py` (#27269 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27269 Remove `test_quantizer.py`, add and rewrite one of the tests in `test_quantizer` in `test_quantization.py` The conv test is removed for now since conv pattern is still broken, we'll add another test later ghstack-source-id: 92869823 Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18182916 fbshipit-source-id: 325b5d8e877228d6a513e3ddf52c974479250d42	2019-10-29 19:04:21 -07:00
Zafar Takhirov	a5ac7f6387	Changing observer name Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27779 Test Plan: Imported from OSS Differential Revision: D17886605 Pulled By: z-a-f fbshipit-source-id: 68c50b482e65015336ff27171fd730da493525b6	2019-10-17 11:36:03 -07:00
Raghuraman Krishnamoorthi	ac0f18437f	MovingAverage Observer (#27396 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27396 Observer that estimates moving averages of min and max values per batch, more suited for quantization aware training instead of minmax observers that track extremal values across batches ghstack-source-id: 91369018 Test Plan: buck test caffe2/test:quantization -- 'test_per_tensor_observers $test_quantization\.ObserverTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_per_channel_observers $test_quantization\.ObserverTest$' --print-passing-details Differential Revision: D17727213 fbshipit-source-id: 024a890bf3dd0bf269d8bfe61f19871d027326f0	2019-10-04 16:28:59 -07:00
Zafar Takhirov	6bb7433ad5	Replacing the skip_list with white_list in the qconfig propagation Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27183 Test Plan: Imported from OSS Differential Revision: D17700548 Pulled By: zafartahirov fbshipit-source-id: 18e6ffbda496b14ac1da1783f928ad539cdb1d16	2019-10-03 20:40:17 -07:00
James Reed	1affa7c32c	Allow set for qconfig for dynamic_quantize Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27181 Test Plan: Imported from OSS Differential Revision: D17717482 Pulled By: jamesr66a fbshipit-source-id: f3930fc87831cbdcf4390cd769c594bb13f5cd81	2019-10-02 19:55:45 -07:00
Zafar Takhirov	27dc595215	Rename _intrinsic to intrinsic Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27194 Test Plan: Imported from OSS Differential Revision: D17704957 Pulled By: zafartahirov fbshipit-source-id: 46f02d129aa77c3047b2a6c606bfadd831a6b0fc	2019-10-02 18:53:06 -07:00
Raghuraman Krishnamoorthi	4abfb5493e	Handle uninitialized min/max values in histogram observer (#27151 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27151 We need to be ab le to handle observers with no min/max data correctly as models sometimes have modules that do not get any data. ghstack-source-id: 91113403 Test Plan: buck test caffe2/test:quantization -- test_minmax_observer buck test caffe2/test:quantization -- test_per_channel_minmax_observer buck test caffe2/test:quantization --test_histogram_observer Reviewed By: csummersea Differential Revision: D17690828 fbshipit-source-id: e95709333ea0f66d79ddb8141b7cba5a83347dbd	2019-10-01 14:56:37 -07:00
Jerry Zhang	98c02e6df3	Enable tests (#27103 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27103 att Test Plan: python test/test_quantization.py 'GraphModePostTrainingQuantTest' Imported from OSS Differential Revision: D17678261 fbshipit-source-id: 5caa7512c6ff4a613980c86b5b221e0cfbe0a173	2019-10-01 12:10:21 -07:00
Raghuraman Krishnamoorthi	dddae3f854	Fuse module enhancements (#26457 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26457 Enhancement to fuse module to support sequentials, fuse list can now be just like the state dict. Also add support for Conv-Relu and linear-relu fusion Also support inplace and out of place fusion of models. ghstack-source-id: 91076386 Test Plan: buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train $test_quantization\.FusionTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval $test_quantization\.FusionTest$' --print-passing-details Differential Revision: D17466382 fbshipit-source-id: 0a548f8f4c366f3ecc59db693bac725ccd62328e	2019-09-30 22:00:20 -07:00
Raghuraman Krishnamoorthi	bdcaf6334b	Support for add relu functional module (#26612 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26612 Add support for add relu functional module, this allows for fusion of add and relu quantized operations ghstack-source-id: 91055976 Test Plan: buck test caffe2/test:quantization -- 'test_functional_module $test_quantization\.FunctionalModuleTest$' --print-passing-details Differential Revision: D17518268 fbshipit-source-id: e1e8b4655d6b32405863ab9d1c7da111fb4343cc	2019-09-30 18:16:58 -07:00
James Reed	4d7bec5f3e	Improve repr for quantized modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27008 Test Plan: Imported from OSS Differential Revision: D17649174 Pulled By: jamesr66a fbshipit-source-id: e3e6c4bb31e1ad8ed1ebe27f803f90d564ecfe53	2019-09-28 15:15:14 -07:00
Raghuraman Krishnamoorthi	2ccbdb79c8	Per-channel baseline (#26516 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26516 ghstack-source-id: 90982010 Test Plan: Integrate per-channel support into conv and linear modules. The following tests pass: buck test caffe2/test:quantized -- 'test_linear_api $test_quantized_nn_mods\.ModuleAPITest$' --print-passing-details buck test caffe2/test:quantized -- 'test_conv_api $test_quantized_nn_mods\.ModuleAPITest$' --print-passing-details buck test caffe2/test:quantized -- 'test_float_quant_compare_per_channel $test_quantized_models\.ModelNumerics$' --print-passing-details Differential Revision: D17342622 fbshipit-source-id: f0d618928e3d9348672c589a6b7a47049c372a2e	2019-09-28 14:05:06 -07:00
Jerry Zhang	09f0e949cd	PyTorch Graph Mode Quantization API (#26390 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26390 `quantize_script`: top level API for graph mode quantization Test Plan: there are some known issues, we can enable test after all known issues are fixed. Imported from OSS Differential Revision: D17645132 fbshipit-source-id: 61f261d5607409d493b39a2f4e05ebd017279f6b	2019-09-27 19:23:51 -07:00
Dmytro Dzhulgakov	764bf826e3	Remove fbgemm_is_cpu_supported in favor of torch.backends.quantized.supported_qengines (#26840 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26840 Cleaning up top-level namespace. Also cosmetic changes to torch.backends.quantized Test Plan: Imported from OSS Differential Revision: D17604403 Pulled By: dzhulgakov fbshipit-source-id: c55af277ea7319d962a82a6120f65ccd47a60abc	2019-09-27 13:45:15 -07:00
Raghuraman Krishnamoorthi	b0a2f6f2f5	Serialization and range reduction support for Fake Quant/Observer (#26519 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26519 ghstack-source-id: 90895631 Test Plan: buck test caffe2/test:quantization -- 'test_histogram_observer $test_quantization\.ObserverTest$' --print-passing-details and buck test caffe2/test:fake_quant -- 'test_fq_serializable $test_fake_quant\.TestFakeQuantizePerTensorAffine$' --print-passing-details Differential Revision: D17217408 fbshipit-source-id: 0da7efdcdae0c065dd035c5dd2b6a78231545ece	2019-09-27 10:09:39 -07:00
Dmytro Dzhulgakov	0a8a779abe	Add more inplace arguments to quantization top level API (#26782 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26782 At least we should be consistent on top-level APIs and prepare/convert/etc. Logic is inplace=False by default but top-level APIs take care of doing fewer copies. Also renames always-inplace methods like add_observer to have underscore in the end. One fix for MinMaxObserver was triggered by deepcopy surfacing that we were accidentally keeping autograd around Test Plan: Imported from OSS Differential Revision: D17595956 Pulled By: dzhulgakov fbshipit-source-id: 801f9f5536b553f24c7a660064dd6fce685edd65	2019-09-26 00:07:07 -07:00
Dmytro Dzhulgakov	128a65e2e0	Use noop observer to pass dtype for dynamic quantization (#26709 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26709 Polishes implementation from #25975. Primarily, we use NoopObserver to communicate that weights need to be quantized to float16. The very top-level API (quantize_dynamic) stays the same with `dtype` argument but the implementation follows the common flow. One can argue that dynamic fp16 quantization doesn't really fit into the 'observer' mechanism. It's in fact not ideal, but it's better to have the same flow than branching on both dtype and qconfig. Test Plan: Imported from OSS Differential Revision: D17544103 Pulled By: dzhulgakov fbshipit-source-id: 6af3f18c35929a1a53ea734079c005f656e4925f	2019-09-24 09:24:39 -07:00
Dmytro Dzhulgakov	a79b3685db	Simplify observers declaration with functools.partial (#26492 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26492 Previous definition of observers was quite clumsy - with things like `default_observer()()`. This PR strips a way a lot of craft and allows to pass just class names directly. In order to override default arguments either `functools.partial` can be used or convenient wrapper `MyObserver.with_args(x=1)` is provided. Also rename `QConfig_dynamic` to `QConfigDynamic` because it violates the naming convention. Test Plan: Imported from OSS Differential Revision: D17521265 Pulled By: dzhulgakov fbshipit-source-id: ba9df19b368641acf4093c43df9990796284fd9e	2019-09-23 10:15:59 -07:00
Jerry Zhang	254122dd4e	quantize_linear -> quantize_per_tensor (#26574 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26574 Since we also have `quantized::linear`, `quantize_linear` sounds confusing, so we plan to rename it before the branch cut Test Plan: ci Imported from OSS Differential Revision: D17514876 fbshipit-source-id: 01d9005e6ec8cb9950b9d8bba122109c389641d3	2019-09-20 21:58:48 -07:00
Lingyi Liu	11f9fe2433	Fix the API for record observer (#26413 ) Summary: Mainly want to resolve comments from https://github.com/pytorch/pytorch/pull/25830. Overall, we want to provide a recording observer for recording the runtime tensor values of activation path in order to debug the numerical accuracy loss offline. According to the feedback from https://github.com/pytorch/pytorch/issues/25830, it might be better to record all the observers in a dict and query the dict to get corresponding tensor values. hx89 is working on how to insert the recording observers into model under debug. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26413 Differential Revision: D17506502 Pulled By: llyfacebook fbshipit-source-id: 3ab90dc78920e7ec3fa572c2a07327a9991c530a	2019-09-20 14:27:56 -07:00
Jianyu Huang	f433ee1499	Add the FP16 weight support for LSTM in dynamic_quantize (#25975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25975 We would like to add the FP16 weight support for the dynamic quantized LSTM. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm794.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details Building: finished in 13.4 sec (100%) 8134/8134 jobs, 81 updated Total time: 13.9 sec Trace available for this run at /tmp/testpilot.20190910-210241.2092790.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision c86e65add357582accb6ec0be23b92c8a2c510bd fbpkg ca46e8f5b26c451a8b0b2462c11bb61d at Mon Sep 9 22:16:37 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/696/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 0.183 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 0.184s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 Summary (total time 4.35s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D17299116 fbshipit-source-id: 7fe91ece25867f2c0496f1b63fb1041e6b815166	2019-09-19 22:19:22 -07:00
Haixin Liu	dcbfc3bdbf	Add per channel observer (#25887 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25887 ghstack-source-id: 90383258 Add per channel observer to compute the qparams for each channel. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer' buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer_scriptable' Differential Revision: D17137226 fbshipit-source-id: 0b1c93e3cbcda86f5c4e30f7cd94c670f2665063	2019-09-18 22:16:45 -07:00
Haixin Liu	f2e9622ed8	Add l2 norm minimization (#24022 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24022 In histogram observer add an approximation for L2 error minimization for selecting min/max. By selecting new min/max, we filter out outliers in input distribution. This follows the implementation of NormMinimization::NonlinearQuantizationParamsSearch in caffe2/quantization/server/norm_minimization.cc ghstack-source-id: 90298789 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_histogram_observer' Differential Revision: D16713239 fbshipit-source-id: 82631ba47974e25689c9c66bc3088117090e26d4	2019-09-18 00:07:10 -07:00
Sebastian Messmer	9f6b6b8101	Back out "[quant][observer] Add histogram observer" (#26236 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26236 Original diff broke oss CI. Reverting. Original commit changeset: 0f047d3349cb ghstack-source-id: 90125990 Test Plan: testinprod Reviewed By: hx89 Differential Revision: D17385490 fbshipit-source-id: 4258502bbc0e3a6dd6852c8ce01ed05eee618b1a	2019-09-14 12:48:46 -07:00
Haixin Liu	1563fdb591	Add histogram observer (#23959 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23959 Add histogram observer that records the running histogram of tensor values along with min/max values. ghstack-source-id: 90076996 Test Plan: Added a test test_histogram_observer buck test mode/dev caffe2/test:quantization -- 'test_histogram_observer' buck test mode/dev caffe2/test:quantization -- 'test_observer_scriptable' Differential Revision: D16692835 fbshipit-source-id: 0f047d3349cb9770fad4a2b6cb346c51d9e99cd4	2019-09-13 19:24:04 -07:00
James Reed	bdc656da70	TorchScript Serialization for dynamic LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26084 Test Plan: Imported from OSS Differential Revision: D17339315 Pulled By: jamesr66a fbshipit-source-id: 03a2674edcf779becfe3b8ec96f1bae23c74b11c	2019-09-12 11:04:47 -07:00
James Reed	83ecdf76da	Revert "TorchScript Serialization for dynamic LSTM module" (#26079 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26079 This reverts commit `e3039612d8`. Test Plan: Imported from OSS Differential Revision: D17337585 Pulled By: jamesr66a fbshipit-source-id: 4b93a4c5ca2fe491d609da889a42d22be8e52889	2019-09-11 21:23:19 -07:00
Jianyu Huang	ead14a6bd4	Use BytesIO instead of tempfile (#25976 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25976 As recommended in https://github.com/pytorch/pytorch/pull/25877/files#r322956051: > We should move more of these toward using BytesIO. Using files in tests is generally considered bad practice because it introduces syscalls and dependencies on the execution environment, and thus can cause test flakiness/instability. ghstack-source-id: 89929947 Test Plan: CI Differential Revision: D17310441 fbshipit-source-id: ba97cce4224225df45ff44062f1bc8ebefb25922	2019-09-11 19:35:49 -07:00
James Reed	e3039612d8	TorchScript Serialization for dynamic LSTM module Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25877 Test Plan: Imported from OSS Reviewed By: jianyuh Differential Revision: D17275746 Pulled By: jamesr66a fbshipit-source-id: db2f38ddd99f02ccb4fb754fa1c1e6cad4425fa8	2019-09-11 19:17:25 -07:00
Lingyi Liu	62767077c3	add the tensor_observer to record the runtime tensor for quantization … (#25830 ) Summary: …accuracy analsyis Pull Request resolved: https://github.com/pytorch/pytorch/pull/25830 Differential Revision: D17327147 Pulled By: llyfacebook fbshipit-source-id: 095d5537a31b8d7541081000eaeb8b8474dfb8d0	2019-09-11 13:36:28 -07:00
James Reed	79bcf6e5ba	Test scripting and tracing for dynamic linear modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25870 Test Plan: Imported from OSS Differential Revision: D17275747 Pulled By: jamesr66a fbshipit-source-id: ed8eaf7e9af3127c987e56d17d60b52d039d5ae8	2019-09-09 19:00:35 -07:00
Raghuraman Krishnamoorthi	17c1b2c715	Relax scale to prevent saturation in conv/linear. Add test to verify precision of numerics of quantized model with updated observer. This test catches errors in (#25667 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25667 Relax scale and zero-point for activations to ensure that fbgemm implementations of conv and linear do not saturate due to 16 bit intermediate accumulation. Add test to verify precision of numerics of quantized model with updated observer. This test catches errors in handling layouts for quantized ops in addition to saturation/quantization errors. ghstack-source-id: 89587942 Test Plan: buck test caffe2/test:quantized -- 'test_float_quant_compare $test_quantized_models\.ModelNumerics$' --print-passing-details Passes when SQNR > 35 dB buck test caffe2/test:quantization -- 'test_minmax_observer $test_quantization\.ObserverTest$' --print-passing-details Passes with additional coverage for observer changes Differential Revision: D17140498 fbshipit-source-id: 42c58e726bb0b0f51890590ee2525428f9a8d24e	2019-09-06 17:18:01 -07:00
Jianyu Huang	0483d537ab	Add the dynamic quantized LSTM module (#25157 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25157 Add the dynamic quantized LSTM module. TODO (separate PRs): - Serialization. - Bias can be Null. ghstack-source-id: 89443731 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm2816.prn3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_q uantization\.PostTrainingDynamicQuantTest$' --print-passing-details Action graph will be rebuilt because files have been added or removed. Parsing buck files: finished in 1.4 sec Building: finished in 4.0 sec (100%) 8122/8122 jobs, 2 updated Total time: 5.5 sec Trace available for this run at /tmp/testpilot.20190902-164918.1275502.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision b61bc0e3b71033578eddfe0a28b0739bc685663f fbpkg 3b1c1aed1c534c0cb161a981eca6e2f0 at Sun Sep 1 20:58:52 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/690/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 1.048 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 1.049s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 Summary (total time 5.53s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D16955662 fbshipit-source-id: 61cf1a74913105fa02e44b3941813eabac0006b5	2019-09-03 19:18:28 -07:00
Zafar Takhirov	e44c09ecae	making quant utilities inplace Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25054 Test Plan: Imported from OSS Differential Revision: D16974198 Pulled By: zafartahirov fbshipit-source-id: 54befc8429990adafe746d1255d117fca5f12e11	2019-08-29 16:03:13 -07:00
Zafar Takhirov	e8acc2ebb1	Removing future imports from the test fixtures. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25296 Test Plan: Imported from OSS Differential Revision: D17090201 Pulled By: zafartahirov fbshipit-source-id: 5a4f6ac0ea475b55d2c610e2f9f4f0cef8690e8f	2019-08-29 01:39:59 -07:00
Haixin Liu	06757acb30	Refactor MinMax observer (#23902 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23902 Copied from Daya's diff in pytorch/pytorch #23191 Refactor MinMax observer and create the base observer class to prepare for future observers such as histogram observer. ghstack-source-id: 89146014 Test Plan: Added a test test_minmax_observer buck test mode/dev caffe2/test:quantization -- 'test_minmax_observer' ``` Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/2533274797931635 ✓ caffe2/test:quantization - test_minmax_observer (test_quantization.ObserverTest) 0.055 1/1 (passed) Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/2533274797931635 Summary (total time 4.26s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` buck test mode/dev caffe2/test:quantization -- 'test_observer_scriptable' ``` Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/5348024563344195 ✓ caffe2/test:quantization - test_observer_scriptable (test_quantization.ObserverTest) 1.762 1/1 (passed) Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/5348024563344195 Summary (total time 6.02s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D16663221 fbshipit-source-id: 3d0e1aa9e4d27808e61b10604782606de067a34a	2019-08-28 13:12:38 -07:00
Raghuraman Krishnamoorthi	9d06a984f8	Serialization for nn.quantized.functional modules (#25220 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25220 Add load_from_state_dict and save_from_state_dict for quantized functional modules ghstack-source-id: 89070836 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_scriptability_serialization\ $test_quantization.ScriptabilityTest$' --print-passing-details Differential Revision: D17065243 fbshipit-source-id: 413ce0a95d0c27fedb23894f1483e3da2f60f417	2019-08-27 18:56:10 -07:00
Raghuraman Krishnamoorthi	f622ec8084	Update mapping dictionary to support functionalmodules and pooling operations (#25216 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25216 ghstack-source-id: 89045562 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_resnet_base\ $test_quantization.PostTrainingQuantTest$' --print-passing-details Differential Revision: D17065029 fbshipit-source-id: b248abf6de162f38e35e6bace17bde1be9e38c57	2019-08-26 23:00:01 -07:00
Raghuraman Krishnamoorthi	a9fdc1923b	Revert D16879132: Update mapping dictionary to support functionalmodules and pooling operations Test Plan: revert-hammer Differential Revision: D16879132 Original commit changeset: cd8c10182aa7 fbshipit-source-id: 9b67ccf73f43d15ef50bf0331d3df4d57835931b	2019-08-26 16:25:25 -07:00
Raghuraman Krishnamoorthi	c3c36a5b68	Revert D16923651: Serialization for nn.quantized.functional modules Test Plan: revert-hammer Differential Revision: D16923651 Original commit changeset: eb1234be1941 fbshipit-source-id: c80d0b50db0f949cc293dbc2f825404bbc8cb86c	2019-08-26 15:36:21 -07:00
Raghuraman Krishnamoorthi	95a3ffc2f1	Serialization for nn.quantized.functional modules (#24924 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24924 Add load_from_state_dict and save_from_state_dict for quantized functional modules ghstack-source-id: 89001576 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_scriptability_serialization\ $test_quantization.ScriptabilityTest$' --print-passing-details Differential Revision: D16923651 fbshipit-source-id: eb1234be1941ccf268a2fc5b756540ab973f3ffb	2019-08-26 12:16:57 -07:00
Raghuraman Krishnamoorthi	794f63fe92	Update mapping dictionary to support functionalmodules and pooling operations (#24804 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24804 ghstack-source-id: 89003799 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_resnet_base\ $test_quantization.PostTrainingQuantTest$' --print-passing-details Differential Revision: D16879132 fbshipit-source-id: cd8c10182aa732ddf655bcda17f72ea08033a633	2019-08-26 12:16:49 -07:00
James Reed	049284e14d	Make observer scriptable Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24996 Test Plan: Imported from OSS Differential Revision: D16952938 Pulled By: jamesr66a fbshipit-source-id: 3d08e0c746603d0fe090fb3dbf13c5fc9dc022f4	2019-08-22 11:28:45 -07:00
Raghuraman Krishnamoorthi	696cabae9b	Baseline observer module, ensuring that (min,max) range includes zero. (#24297 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24297 ghstack-source-id: 88252409 Differential Revision: D16635637 fbshipit-source-id: fcef20b9c88b2c3bd97e311514e5b2d0339ff28a	2019-08-15 15:25:23 -07:00
Jerry Zhang	761ae8e9b6	Add intrinsic module mappings (#23753 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23753 Add intrinsic(fused) module mappings in quantize.py to enable mapping fused modules in both QAT and post PTQ Differential Revision: D16820749 fbshipit-source-id: 07de76a4f09b44bde8b193c103eac02c22b875b6	2019-08-15 09:37:24 -07:00
Jianyu Huang	f66c90469b	Fix Lint (#24381 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24381 As pointed out in https://github.com/pytorch/pytorch/pull/24299#issuecomment-521471089, the previous PR broke the Lint. ghstack-source-id: 88339887 Reviewed By: jamesr66a Differential Revision: D16822443 fbshipit-source-id: 3aed5b9404b0f0fcf453c05b59189974243b0df2	2019-08-14 19:22:09 -07:00
Jianyu Huang	0f64043b49	Remove the activation observer for default_qconfig (#24299 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24299 As suggested in https://github.com/pytorch/pytorch/pull/24232, we will remove the activation observer for dynamic quantization path. ghstack-source-id: 88287094 Differential Revision: D16798590 fbshipit-source-id: 07a245d5584b5b15c6895d9b09deef4a0605073a	2019-08-14 17:21:50 -07:00
Jianyu Huang	584c6986fd	Add the type matching rule for qconfig_dict (#23212 ) Summary: We want to use the Module type as the key for the qconfig_dict for the module replacement during the quantization. Before this Diff, to dynamic quantize the BERT model, we have to specify each layer: ``` qconfig_dict = { 'encoder.layer.0.attention.self.query': default_qconfig, 'encoder.layer.0.attention.self.key': default_qconfig, 'encoder.layer.0.attention.self.value': default_qconfig, 'encoder.layer.0.attention.output.dense': default_qconfig, 'encoder.layer.0.intermediate.dense': default_qconfig, 'encoder.layer.0.output.dense': default_qconfig, 'encoder.layer.1.attention.self.query': default_qconfig, 'encoder.layer.1.attention.self.key': default_qconfig, 'encoder.layer.1.attention.self.value': default_qconfig, 'encoder.layer.1.attention.output.dense': default_qconfig, 'encoder.layer.1.intermediate.dense': default_qconfig, 'encoder.layer.1.output.dense': default_qconfig, ... } ``` After this Diff, we only need the following ``` qconfig_dict = { torch.nn.Linear : default_qconfig } ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/23212 ghstack-source-id: 88287091 Reviewed By: zafartahirov Differential Revision: D16436542 fbshipit-source-id: 11fbe68ee460560c1a7cdded63581eb7a00e5a89	2019-08-14 13:07:36 -07:00
Jianyu Huang	e94ba742b0	Dynamic Quantized Linear Module (#23128 ) Summary: - ~~Add a unit test for the Dynamic Quantized Linear operator (```torch.fbgemm_linear_quantize_weight```, ```torch.fbgemm_pack_quantized_matrix```, and ```torch.fbgemm_linear_int8_weight```) in ```test_quantized.py```.~~ Move this to D16404027 for a separate review. - Add the Dynamic Quantized Linear module in ```torch/nn/quantized/modules/linear.py```. ~~This is in a rudimentary stage. Will add more functions later~~. - Add the torch.quantize logic (prepare, eval, convert) for dynamic quantization. - Add a unit test for the Dynamic Quantized Linear module in ```test_nn_quantized.py```. - Add a unit test for the Model-level Quantization API Pull Request resolved: https://github.com/pytorch/pytorch/pull/23128 ghstack-source-id: 88257232 Differential Revision: D16258664 fbshipit-source-id: 4be3ac39ee27c088b341c741d3f09f51d5a23ef0	2019-08-13 21:01:23 -07:00
Zafar Takhirov	4cc16782f3	Removing the make_module script. (#23635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23635 It appears it is the same complexity to add new modules using a base class and using a generation script. Test Plan: Imported from OSS Differential Revision: D16593364 Pulled By: zafartahirov fbshipit-source-id: 852dcf41f3dfa2a89152042b8e61d0b6defa8feb	2019-08-13 09:58:28 -07:00
Daya Khudia	f510409281	Enable FBGEMM tests under UBSAN as well (#23570 ) Summary: Enabling tests under UBSAN as well Pull Request resolved: https://github.com/pytorch/pytorch/pull/23570 Test Plan: buck test mode/dev caffe2/test:quantized ``` Running 29 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/3940649677415136 ✓ caffe2/test:quantized - test_qtensor (test_quantized_tensor.TestQuantizedTensor) 0.536 1/29 (passed) ✓ caffe2/test:quantized - test_qtensor_per_channel_affine (test_quantized_tensor.TestQuantizedTensor) 0.453 2/29 (passed) ✓ caffe2/test:quantized - test_qtensor_reshape (test_quantized_tensor.TestQuantizedTensor) 0.302 3/29 (passed) ✓ caffe2/test:quantized - test_qadd_relu_same_qparams (test_quantized.TestQuantizedOps) 0.332 4/29 (passed) ✓ caffe2/test:quantized - test_qtensor_view (test_quantized_tensor.TestQuantizedTensor) 0.351 5/29 (passed) ✓ caffe2/test:quantized - test_qadd_relu_different_qparams (test_quantized.TestQuantizedOps) 0.348 6/29 (passed) ✓ caffe2/test:quantized - test_qtensor_dequantize_linear (test_quantized_tensor.TestQuantizedTensor) 0.338 7/29 (passed) ✓ caffe2/test:quantized - test_qtensor_copy (test_quantized_tensor.TestQuantizedTensor) 0.267 8/29 (passed) ✓ caffe2/test:quantized - test_qtensor_clone (test_quantized_tensor.TestQuantizedTensor) 0.330 9/29 (passed) ✓ caffe2/test:quantized - test_qrelu (test_quantized.TestQuantizedOps) 1.774 10/29 (passed) ✓ caffe2/test:quantized - test_pool_api (test_nn_quantized.ModuleAPITest) 0.418 11/29 (passed) ✓ caffe2/test:quantized - test_qtensor_load_save (test_quantized_tensor.TestQuantizedTensor) 0.724 12/29 (passed) ✓ caffe2/test:quantized - test_relu_api (test_nn_quantized.FunctionalAPITest) 1.013 13/29 (passed) ✓ caffe2/test:quantized - test_qtensor_quant_dequant (test_quantized_tensor.TestQuantizedTensor) 1.055 14/29 (passed) ✓ caffe2/test:quantized - test_qtensor_permute (test_quantized_tensor.TestQuantizedTensor) 0.696 15/29 (passed) ✓ caffe2/test:quantized - test_qtensor_dtypes (test_quantized_tensor.TestQuantizedTensor) 0.841 16/29 (passed) ✓ caffe2/test:quantized - test_quant_dequant_api (test_nn_quantized.ModuleAPITest) 0.616 17/29 (passed) ✓ caffe2/test:quantized - test_qtensor_creation (test_quantized_tensor.TestQuantizedTensor) 0.698 18/29 (passed) ✓ caffe2/test:quantized - test_qconv (test_quantized.TestQuantizedConv) 4.743 19/29 (passed) ✓ caffe2/test:quantized - test_cat (test_quantized.TestQuantizedOps) 6.992 20/29 (passed) ✓ caffe2/test:quantized - test_linear_api (test_nn_quantized.ModuleAPITest) 8.970 21/29 (passed) ✓ caffe2/test:quantized - test_conv_api (test_quantized_conv.QuantizedConvTest) 9.403 22/29 (passed) ↷ caffe2/test:quantized - test_qnnpack_linear (test_quantized.TestQNNPackOps) 0.000 23/29 (skipped) Test output: > Skipped: QNNPACK does not play well with UBSAN at the moment, so we skip the test if we are in a UBSAN environment. > test_qnnpack_linear (test_quantized.TestQNNPackOps) ... skipped 'QNNPACK does not play well with UBSAN at the moment, so we skip the test if we are in a UBSAN environment.' > > ---------------------------------------------------------------------- > Ran 1 test in 0.000s > > OK (skipped=1) ↷ caffe2/test:quantized - test_qnnpack_relu (test_quantized.TestQNNPackOps) 0.000 24/29 (skipped) Test output: > Skipped: QNNPACK does not play well with UBSAN at the moment, so we skip the test if we are in a UBSAN environment. > test_qnnpack_relu (test_quantized.TestQNNPackOps) ... skipped 'QNNPACK does not play well with UBSAN at the moment, so we skip the test if we are in a UBSAN environment.' > > ---------------------------------------------------------------------- > Ran 1 test in 0.000s > > OK (skipped=1) ✓ caffe2/test:quantized - test_max_pool2d (test_quantized.TestQuantizedOps) 8.453 25/29 (passed) ✓ caffe2/test:quantized - test_qlinear_unpack (test_quantized.TestQuantizedLinear) 0.664 26/29 (passed) ✓ caffe2/test:quantized - test_qconv_unpack (test_quantized.TestQuantizedConv) 2.965 27/29 (passed) ✓ caffe2/test:quantized - test_qlinear (test_quantized.TestQuantizedLinear) 1.915 28/29 (passed) ✓ caffe2/test:quantized - test_conv_api (test_nn_quantized.ModuleAPITest) 60.804 29/29 (passed) ✓ caffe2/test:quantized - main 0.000 (passed) Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/3940649677415136 Summary (total time 68.66s): PASS: 28 FAIL: 0 SKIP: 2 caffe2/test:quantized - test_qnnpack_linear (test_quantized.TestQNNPackOps) caffe2/test:quantized - test_qnnpack_relu (test_quantized.TestQNNPackOps) FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Reviewed By: jianyuh Differential Revision: D16569166 Pulled By: dskhudia fbshipit-source-id: 53522b4162eb1ebb35b408a1503d9664305c85b0	2019-08-12 17:59:22 -07:00
James Reed	a35d2902ef	jit.script() testing and fixes (#23891 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23891 This adds an initial set of testing coverage for quantization that checks if the modules can be scripted. Testing for tracing and serialization is forthcoming Test Plan: Imported from OSS Differential Revision: D16698045 Pulled By: jamesr66a fbshipit-source-id: 96d80d938b816220af72359165a7b96d998a30c9	2019-08-08 12:06:18 -07:00
James Reed	40f0b1c844	Enable OSS quantization tests (#23858 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23858 Pull Request resolved: https://github.com/pytorch/pytorch/pull/23718 Changes: - Enable tests for quantization test files in `run_tests.py` - Remove `__future__` imports from `torch/nn/qat/modules/__init__.py`, since `unicode_literals` messes up imports on python2 because the elements in `__all__` will be Unicode and not string - Skip PostTrainingQuantTests if the build doesn't have FBGEMM (only a small subset of targets in tests) or if testing under UBSAN (the suppression file doesn't seem to work) Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D16639467 Pulled By: jamesr66a fbshipit-source-id: 532766797c216976dd7e07d751f768ff8e0fc207	2019-08-06 11:20:30 -07:00
Jerry Zhang	89956374c3	Remove qconfig_dict from API (#23465 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23465 We decided not to allow user to use qconfig_dict to do quantization since that API is not robust. Differential Revision: D16611504 fbshipit-source-id: b0d1d311b32c990a165c480f50e9ce3d68b785b5	2019-08-02 10:28:48 -07:00
Zafar Takhirov	058645acb1	Fusion and _intrinsic modules (#23003 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23003 torch.quantization.fuse_module and torch.nn._intrinsic convRelu and LinearRelu Fusion function to combine specific modules: (conv,bn) and (conv,bn,relu). In all cases, replace modules in place. The first module is replaced with the _intrinsic fused module and the remaining modules are replaced by nn.Identity. Support both training and eval. For training, the modules are "fused" with a sequential container. This is to allow for further module swaps for quantization aware training. Also add: torch.nn._intrinsic for convRelu and LinearRelu. TODO: Add tests for _intrinsic modules. Conv BN fusion code is based on DsKhudia's implementation Differential Revision: D16199720 fbshipit-source-id: 95fb9ffe72b361d280313b2ec57de2acd4f9dda2	2019-07-23 14:54:19 -07:00
Jerry Zhang	77353636de	Conv module (#23084 ) Summary: Added Conv module for qat Pull Request resolved: https://github.com/pytorch/pytorch/pull/23084 ghstack-source-id: 86862445 Differential Revision: D16379417 fbshipit-source-id: 742cc8b8e0f132070ca4943a1c2e3db60c2b5bdc	2019-07-19 18:49:52 -07:00
Jerry Zhang	7cc029cb75	Quantization aware training in eager mode (#23082 ) Summary: Add support for quantization aware training in eager mode Modifications to Post training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant to weight, we need to swap the float modules that has weight with different qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modify the weight in forward_pre hook and change it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change forward function and in forward function they are using self.weight. * But we will need to keep two copies of weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in forward function ## Convert * flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in prepare step. Pull Request resolved: https://github.com/pytorch/pytorch/pull/23082 ghstack-source-id: 86824650 Differential Revision: D16379374 fbshipit-source-id: 7d16d1acd87025065a24942ff92abf18e9fc8070	2019-07-19 14:57:25 -07:00
Soumith Chintala	84c2c89e2c	Revert D16199356: [qat] Quantization aware training in eager mode Differential Revision: D16199356 Original commit changeset: 62aeaf47c12c fbshipit-source-id: d06a96b0a617ae38029ffb246173ec065454b666	2019-07-19 03:18:48 -07:00
Soumith Chintala	f19aa12ae5	Revert D16274792: [qat] Conv module Differential Revision: D16274792 Original commit changeset: 1da10194123b fbshipit-source-id: 71b34774b463f2350289bd39b8cfd798e095ffa5	2019-07-19 03:18:45 -07:00
Jerry Zhang	12d9d768b8	Conv module (#22899 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22899 Added Conv module for qat Reviewed By: zafartahirov Differential Revision: D16274792 fbshipit-source-id: 1da10194123b2759a6a35c60d1c2d2c0b569ccdc	2019-07-18 18:58:07 -07:00
Jerry Zhang	65ef671d11	Quantization aware training in eager mode (#22732 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22732 Add support for quantization aware training in eager mode Modifications to Post training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant to weight, we need to swap the float modules that has weight with different qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modify the weight in forward_pre hook and change it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change forward function and in forward function they are using self.weight. * But we will need to keep two copies of weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in forward function ## Convert * flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in prepare step. Reviewed By: zafartahirov Differential Revision: D16199356 fbshipit-source-id: 62aeaf47c12c62a87d9cac208f25f7592e245d6c	2019-07-18 18:58:03 -07:00
Lucas Kabela	86fc417147	Move Quantization Models to common_quantization (#22706 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22706 Moved the models used for quantization test from the test_quantization.py file to common_quantization.py Reviewed By: jerryzh168 Differential Revision: D16189865 fbshipit-source-id: 409b43454b6b3fe278ac16b1affb9085d6ed6835	2019-07-10 15:05:49 -07:00
Lucas Kabela	3e3e6ee335	Add common_quantized test case utilities (#22694 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22694 Move quantization and quantized utility functions for testing to common_quantized.py and common_quantization.py. Addditionally, add a quantized test case base class which contains common methods for checking the results of quantization on modules. As a consequence of the move, fixed the import at the top of test_quantized.py, and test_quantization to use the new utility Reviewed By: jerryzh168 Differential Revision: D16172012 fbshipit-source-id: 329166af5555fc829f26bf1383d682c25c01a7d9	2019-07-10 12:23:36 -07:00
Jerry Zhang	164388150a	fix lint (#22654 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22654 att Reviewed By: bddppq Differential Revision: D16168219 fbshipit-source-id: db1a5e2161e7be70b2f6e6b4beaa27ea91f853f2	2019-07-09 16:41:26 -07:00
Jerry Zhang	5040d52a5a	torch.quantization conversion utilities, observers for eager mode quantization (#22010 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22010 torch.quantization module with observers and conversion routines Reviewed By: zafartahirov Differential Revision: D15554183 fbshipit-source-id: 05a3fabe28dd701978b8ecebf5bfc3a4c044ba5c	2019-07-09 10:51:38 -07:00

1 2 3 4 5

233 Commits