pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Edmund Williams Jr	e9e6cc8c83	Added Prehook option to prepare method (#41863 ) Summary: Added a logic so that if a prehook is passed into the prepare method during quantization, then the hook will be added as a prehook to all leaf nodes (and modules specified in the non_leaf_module_list). Pull Request resolved: https://github.com/pytorch/pytorch/pull/41863 Test Plan: Small demo, made simple module then called prepare with prehook parameter set to the numeric suite logger, printed the results to verify its what we wanted {F245156246} Reviewed By: jerryzh168 Differential Revision: D22671288 Pulled By: edmundw314 fbshipit-source-id: ce65a00830ff03360a82c0a075b3b6d8cbc4362e	2020-07-24 10:26:39 -07:00
emil	0c77bd7c0b	Quantization: preserving pre and post forward hooks (#37233 ) Summary: 1. While do convert() preserve module's pre and post forward hooks 2. While do fusion preserve only module's pre forward hooks (because after fusion output no longer the same) Pull Request resolved: https://github.com/pytorch/pytorch/pull/37233 Differential Revision: D22425141 Pulled By: jerryzh168 fbshipit-source-id: e69b81821d507dcd110d2ff3594ba94b9593c8da	2020-07-13 12:41:24 -07:00
Edward Leardi	733b8c23c4	Fix several quantization documentation typos (#40567 ) Summary: This PR fixes several typos I noticed in the docs here: https://pytorch.org/docs/master/quantization.html. In one case there was a misspelled module [torch.nn.instrinsic.qat](https://pytorch.org/docs/master/quantization.html#torch-nn-instrinsic-qat) which I corrected and am including screenshots of below just in case. <img width="1094" alt="before" src="https://user-images.githubusercontent.com/54918401/85766765-5cdd6280-b6e5-11ea-93e6-4944cf820b71.png"> <img width="1093" alt="after" src="https://user-images.githubusercontent.com/54918401/85766769-5d75f900-b6e5-11ea-8850-0d1f5ed67b16.png"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/40567 Differential Revision: D22311291 Pulled By: ezyang fbshipit-source-id: 65d1f3dd043357e38a584d9e30f31634a5b0995c	2020-07-07 09:45:23 -07:00
Jerry Zhang	59ca1d31ca	[quant][graphmode] docstrings for top level APIs (#40328 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40328 Test Plan: Imported from OSS Differential Revision: D22149708 fbshipit-source-id: 63a1cd229d9e4668fba0ef3977e894cb8984318b	2020-06-19 22:20:23 -07:00
Haixin Liu	d9c804ce22	[PyTorch Numeric Suite] Add support for dynamic quantization of linear module (#39024 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39024 Add support for dynamic quantization of linear module. ghstack-source-id: 106205450 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_compare_weights_conv_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_weights_linear_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_weights_linear_dynamic' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_stub_conv_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_stub_linear_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_stub_submodule_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_stub_functional_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_stub_linear_dynamic' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_outputs_conv_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_outputs_linear_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_outputs_functional_static' buck test mode/dev caffe2/test:quantization -- 'test_compare_model_outputs_linear_dynamic' Differential Revision: D21675971 fbshipit-source-id: c9562744dc59b61cf47f2787a934e6a5a53e12fd	2020-06-19 10:58:56 -07:00
Raghuraman Krishnamoorthi	3258cb61b1	Dynamic quantization support for LSTMCell, RNNCell and GRUCell [Remove randomness in weights] (#40102 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40102 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105997236 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell $quantization\.test_quantize\.TestPostTrainingDynamic$' Differential Revision: D22071017 fbshipit-source-id: 3fe1eac39db9c1e0566838eb8b969bbb1fa983c9	2020-06-16 21:29:50 -07:00
Raghuraman Krishnamoorthi	e55e0cb1a9	Revert D20978736: Dynamic quantization support for LSTMCell, RNNCell and GRUCell Test Plan: revert-hammer Differential Revision: D20978736 Original commit changeset: 8f303ba1d7f8 fbshipit-source-id: bcd300819616d6536f582fcd3c90decd543c4657	2020-06-16 10:11:32 -07:00
Raghuraman Krishnamoorthi	48db06e39a	Dynamic quantization support for LSTMCell, RNNCell and GRUCell (#37159 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37159 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105946183 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell $quantization\.test_quantize\.TestPostTrainingDynamic$' Differential Revision: D20978736 fbshipit-source-id: 8f303ba1d7f8e0c646ac73e862d2c1e735b7ff61	2020-06-16 09:14:59 -07:00
Vasiliy Kuznetsov	6a60a8c1da	add_observer: respect device affinity for ReLU (#39337 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39337 In #39031 we made fake quantize respect device affinity of the original module. However, that PR only handled modules with parameters or buffers, and did not work properly for `ReLU`. Fixing the logic to also work for `ReLU` by passing the parent's device when adding observers. Test Plan: ``` python test/test_quantization.py TestDistributed.test_device_affinity ``` Imported from OSS Differential Revision: D21821243 fbshipit-source-id: cc6abda3694b80ce8ba0440dc6c1b5b58f3c0066	2020-06-03 09:31:36 -07:00
Vasiliy Kuznetsov	c193bd41f5	fake_quantize: respect device affinity (#39031 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39031 Makes the eager mode QAT prepare logic respect device affinity. This fixes the issue where a module is on `cuda:0`, and running the QAT prepare script would add observers on `cpu`. Now it will add them on the original device. Test Plan: ``` python test/test_quantization.py TestDistributed.test_device_affinity ``` Imported from OSS Differential Revision: D21729272 fbshipit-source-id: 5537bf3977ddc23412184941978bf0d1cc6fb479	2020-06-01 08:55:14 -07:00
Supriya Rao	530d48e93a	[quant] Support for fused ConvBn1d and ConvBnRelu1d modules (#38452 ) (#38749 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38749 Test Plan: python test/test_quantization.py TestFused Differential Revision: D21654659 Pulled By: supriyar fbshipit-source-id: 301be24083e794f4e71ff1d6d842e1aaefa640f0	2020-05-19 22:48:05 -07:00
Natalia Gimelshein	b995540a01	Revert D21632878: [quant] Support for fused ConvBn1d and ConvBnRelu1d modules Test Plan: revert-hammer Differential Revision: D21632878 Original commit changeset: 0d73398b95d7 fbshipit-source-id: c4dd18a4220d175237f31f741a782f2596228009	2020-05-19 15:22:16 -07:00
Supriya Rao	7d38db0f9a	[quant] Support for fused ConvBn1d and ConvBnRelu1d modules (#38452 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38452 Test Plan: python test/test_quantization.py TestFused Imported from OSS Differential Revision: D21632878 fbshipit-source-id: 0d73398b95d72a0a23b42ef36f3ede1bfcc35eda	2020-05-19 09:53:56 -07:00
Supriya Rao	f6626aaf43	[quant] Add support for Quantized Conv1d and ConvRELU1d (#38283 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38283 Adds support for the modules and tests Test Plan: python test/test_quantization.py TestStaticQuantizedModule.test_conv1d_api Imported from OSS Differential Revision: D21553665 fbshipit-source-id: 7ea28da024bdf59f87f300d616c266f2b41f0bcd	2020-05-13 16:59:13 -07:00
Haixin Liu	cc0f1b22a2	[PyTorch Numeric Suite] Add module output comparison (#36701 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36701 Add module output comparison API. ghstack-source-id: 103368194 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_compare_model_outputs' Differential Revision: D21053197 fbshipit-source-id: cabcafbeeac1b604db069833a0f17ebce506ba65	2020-05-03 00:04:35 -07:00
Lingyi Liu	fddcd72a31	Add the more fusion (conv3d and batchnorm)support in pytorch quantization flow (#33540 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33540 Differential Revision: D19994498 Pulled By: lly-zero-one fbshipit-source-id: e5e13eab6924bd2ce1b57b16b672844b8b9638f5	2020-03-23 20:36:03 -07:00
James Reed	812b1ad869	[quantization] FP16 dynamic quantized Linear Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32331 Test Plan: Imported from OSS Differential Revision: D19441158 Pulled By: jamesr66a fbshipit-source-id: c04247ffe707be68718c486c31bc6c6040f7dc11	2020-01-27 15:45:32 -08:00
Jerry Zhang	f995ec2076	Remove qconfig_dict in top level eager mode quantization API (#31972 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31972 Since eager mode quantization requires many user modifications, we can't consistently quantize a given model by just changing qconfig_dict, therefore the top level `qconfig_dict` is not that useful. fixes: https://github.com/pytorch/pytorch/issues/31549 Test Plan: . Imported from OSS Differential Revision: D19330691 fbshipit-source-id: 8aee6e5249e0c14e8a363ac1a83836e88887cd7d	2020-01-10 11:04:37 -08:00
Xiaomeng Yang	c12f9a12a8	Fix quantized ConvReLU3d test (#30266 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30266 Fix quantized ConvReLU3d test Test Plan: buck test mode/dev-nosan //caffe2/test:quantized -- "conv" Reviewed By: hl475 Differential Revision: D18645717 fbshipit-source-id: bbe93f9daf5046f2aa05363efc7d0e59eaff37bf	2019-11-25 14:52:32 -08:00
Raghuraman Krishnamoorthi	94757e035d	Do not insert observers for empty sequential modules (#28384 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28384 ghstack-source-id: 92340259 Test Plan: buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train $test_quantization\.FusionTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval $test_quantization\.FusionTest$' --print-passing-details Differential Revision: D18047293 fbshipit-source-id: 7e18b1aa76cc0fd26e8ee48a70c3a45688e73549	2019-10-21 20:32:13 -07:00
Zafar Takhirov	07b5666a87	Add default arg to `prepare_qat` mapping. (#28193 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28193 Fixes #28015 Test Plan: Imported from OSS Differential Revision: D17973121 Pulled By: z-a-f fbshipit-source-id: 03b3f70c70b89060c1f03d7ed8ab6002fe60bd49	2019-10-17 14:11:54 -07:00
Zafar Takhirov	a5ac7f6387	Changing observer name Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27779 Test Plan: Imported from OSS Differential Revision: D17886605 Pulled By: z-a-f fbshipit-source-id: 68c50b482e65015336ff27171fd730da493525b6	2019-10-17 11:36:03 -07:00
Zafar Takhirov	dc8785a022	Refactoing names for consistency Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27670 Test Plan: Imported from OSS Differential Revision: D17846269 Pulled By: z-a-f fbshipit-source-id: ed3c7441c185bf11b2e62879aa3ecbc654aa2d4e	2019-10-16 12:18:26 -07:00
zou3519	e5d6b75319	Bag of documentation fixes; fix more sphinx warnings (#27850 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27850 Many of these are real problems in the documentation (i.e., link or bullet point doesn't display correctly). Test Plan: - built and viewed the documentation for each change locally. Differential Revision: D17908123 Pulled By: zou3519 fbshipit-source-id: 65c92a352c89b90fb6b508c388b0874233a3817a	2019-10-15 07:31:14 -07:00
Chris Gottbrath	a96b003b39	docstring only formatting changes: quantize.py, fake_quantize.py, observer.py Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27415 Reviewed By: zafartahirov Differential Revision: D17783101 Pulled By: gottbrath fbshipit-source-id: a7acbc55edfaa75fdbd17fd30d530710a401b22f	2019-10-08 09:21:03 -07:00
Zafar Takhirov	6bb7433ad5	Replacing the skip_list with white_list in the qconfig propagation Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27183 Test Plan: Imported from OSS Differential Revision: D17700548 Pulled By: zafartahirov fbshipit-source-id: 18e6ffbda496b14ac1da1783f928ad539cdb1d16	2019-10-03 20:40:17 -07:00
Zafar Takhirov	111da77912	Factored out the default mappings Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27164 Test Plan: Imported from OSS Differential Revision: D17694475 Pulled By: zafartahirov fbshipit-source-id: df8df5f7d66062ed35da957064a31344e1d3c961	2019-10-03 11:52:21 -07:00
James Reed	a423817055	Fix reprs for _intrinsic modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27184 Test Plan: Imported from OSS Differential Revision: D17717481 Pulled By: jamesr66a fbshipit-source-id: 4bd72bcd42191d9b21d03f5bb6698198dbffffda	2019-10-02 19:55:49 -07:00
James Reed	1affa7c32c	Allow set for qconfig for dynamic_quantize Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27181 Test Plan: Imported from OSS Differential Revision: D17717482 Pulled By: jamesr66a fbshipit-source-id: f3930fc87831cbdcf4390cd769c594bb13f5cd81	2019-10-02 19:55:45 -07:00
Zafar Takhirov	27dc595215	Rename _intrinsic to intrinsic Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27194 Test Plan: Imported from OSS Differential Revision: D17704957 Pulled By: zafartahirov fbshipit-source-id: 46f02d129aa77c3047b2a6c606bfadd831a6b0fc	2019-10-02 18:53:06 -07:00
Dmytro Dzhulgakov	0a8a779abe	Add more inplace arguments to quantization top level API (#26782 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26782 At least we should be consistent on top-level APIs and prepare/convert/etc. Logic is inplace=False by default but top-level APIs take care of doing fewer copies. Also renames always-inplace methods like add_observer to have underscore in the end. One fix for MinMaxObserver was triggered by deepcopy surfacing that we were accidentally keeping autograd around Test Plan: Imported from OSS Differential Revision: D17595956 Pulled By: dzhulgakov fbshipit-source-id: 801f9f5536b553f24c7a660064dd6fce685edd65	2019-09-26 00:07:07 -07:00
Raghuraman Krishnamoorthi	bc4519dc27	Handle DeQuantStub() for QAT (#26518 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26518 Skip Dequantize() modules for QAT alone. For fake quant insertion, DeQuantize() is a no-op and we should not be inserting fake-quant. ghstack-source-id: 90704220 Test Plan: buck test caffe2/test:quantization -- --print-passing-details Tests in test_quantization pass with changes: Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/281475121296989 Summary (total time 73.03s): PASS: 28 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 Differential Revision: D17439333 fbshipit-source-id: f716c23500324ae08c8d104ee2c9587fa6926571	2019-09-25 00:35:34 -07:00
Dmytro Dzhulgakov	128a65e2e0	Use noop observer to pass dtype for dynamic quantization (#26709 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26709 Polishes implementation from #25975. Primarily, we use NoopObserver to communicate that weights need to be quantized to float16. The very top-level API (quantize_dynamic) stays the same with `dtype` argument but the implementation follows the common flow. One can argue that dynamic fp16 quantization doesn't really fit into the 'observer' mechanism. It's in fact not ideal, but it's better to have the same flow than branching on both dtype and qconfig. Test Plan: Imported from OSS Differential Revision: D17544103 Pulled By: dzhulgakov fbshipit-source-id: 6af3f18c35929a1a53ea734079c005f656e4925f	2019-09-24 09:24:39 -07:00
Lingyi Liu	11f9fe2433	Fix the API for record observer (#26413 ) Summary: Mainly want to resolve comments from https://github.com/pytorch/pytorch/pull/25830. Overall, we want to provide a recording observer for recording the runtime tensor values of activation path in order to debug the numerical accuracy loss offline. According to the feedback from https://github.com/pytorch/pytorch/issues/25830, it might be better to record all the observers in a dict and query the dict to get corresponding tensor values. hx89 is working on how to insert the recording observers into model under debug. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26413 Differential Revision: D17506502 Pulled By: llyfacebook fbshipit-source-id: 3ab90dc78920e7ec3fa572c2a07327a9991c530a	2019-09-20 14:27:56 -07:00
Jianyu Huang	f433ee1499	Add the FP16 weight support for LSTM in dynamic_quantize (#25975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25975 We would like to add the FP16 weight support for the dynamic quantized LSTM. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm794.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details Building: finished in 13.4 sec (100%) 8134/8134 jobs, 81 updated Total time: 13.9 sec Trace available for this run at /tmp/testpilot.20190910-210241.2092790.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision c86e65add357582accb6ec0be23b92c8a2c510bd fbpkg ca46e8f5b26c451a8b0b2462c11bb61d at Mon Sep 9 22:16:37 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/696/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 0.183 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 0.184s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 Summary (total time 4.35s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D17299116 fbshipit-source-id: 7fe91ece25867f2c0496f1b63fb1041e6b815166	2019-09-19 22:19:22 -07:00
Lingyi Liu	62767077c3	add the tensor_observer to record the runtime tensor for quantization … (#25830 ) Summary: …accuracy analsyis Pull Request resolved: https://github.com/pytorch/pytorch/pull/25830 Differential Revision: D17327147 Pulled By: llyfacebook fbshipit-source-id: 095d5537a31b8d7541081000eaeb8b8474dfb8d0	2019-09-11 13:36:28 -07:00
Jianyu Huang	9b4f3fd7d3	Add torch.nn.LSTM into the default dynamic quantize mappings (#25954 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25954 Add torch.nn.LSTM into the default dynamic quantize mappings. We will by default dynamic quantize LSTM when we apply the quantize_dynamic API. ghstack-source-id: 89839673 Test Plan: CI Differential Revision: D17294958 fbshipit-source-id: 824aceef821276b3e28c52ce3bebafaf9b0a0833	2019-09-10 21:03:12 -07:00
Haixin Liu	9c10f729de	Add Dropout to blacklist (#25881 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25881 Add Dropout to blacklist to avoid the error in eager mode quantization. ghstack-source-id: 89759536 Test Plan: Test locally in python notebook. Reviewed By: jianyuh Differential Revision: D17270826 fbshipit-source-id: bcf43483976740564d7f407838f25c2dbb67b016	2019-09-10 10:57:38 -07:00
Jianyu Huang	0483d537ab	Add the dynamic quantized LSTM module (#25157 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25157 Add the dynamic quantized LSTM module. TODO (separate PRs): - Serialization. - Bias can be Null. ghstack-source-id: 89443731 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm2816.prn3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_q uantization\.PostTrainingDynamicQuantTest$' --print-passing-details Action graph will be rebuilt because files have been added or removed. Parsing buck files: finished in 1.4 sec Building: finished in 4.0 sec (100%) 8122/8122 jobs, 2 updated Total time: 5.5 sec Trace available for this run at /tmp/testpilot.20190902-164918.1275502.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision b61bc0e3b71033578eddfe0a28b0739bc685663f fbpkg 3b1c1aed1c534c0cb161a981eca6e2f0 at Sun Sep 1 20:58:52 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/690/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 1.048 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 1.049s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 Summary (total time 5.53s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D16955662 fbshipit-source-id: 61cf1a74913105fa02e44b3941813eabac0006b5	2019-09-03 19:18:28 -07:00
Zafar Takhirov	e44c09ecae	making quant utilities inplace Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25054 Test Plan: Imported from OSS Differential Revision: D16974198 Pulled By: zafartahirov fbshipit-source-id: 54befc8429990adafe746d1255d117fca5f12e11	2019-08-29 16:03:13 -07:00
Raghuraman Krishnamoorthi	f5a3d59254	Handle empty qconfig for functional Modules (#25215 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25215 ghstack-source-id: 89044252 Test Plan: Test implemented in D16879132/ Differential Revision: D17064670 fbshipit-source-id: 08d3d566aa123bedf318ab5a8bc9b71457930ff2	2019-08-27 12:31:26 -07:00
Raghuraman Krishnamoorthi	f622ec8084	Update mapping dictionary to support functionalmodules and pooling operations (#25216 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25216 ghstack-source-id: 89045562 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_resnet_base\ $test_quantization.PostTrainingQuantTest$' --print-passing-details Differential Revision: D17065029 fbshipit-source-id: b248abf6de162f38e35e6bace17bde1be9e38c57	2019-08-26 23:00:01 -07:00
Raghuraman Krishnamoorthi	17f69eff22	Revert D16879133: Handle empty qconfig for functional Modules Test Plan: revert-hammer Differential Revision: D16879133 Original commit changeset: 230f5204cfbd fbshipit-source-id: 29b4bfe066b173797f3d9f2fcf7cbf5ee21ff8fb	2019-08-26 16:25:29 -07:00
Raghuraman Krishnamoorthi	a9fdc1923b	Revert D16879132: Update mapping dictionary to support functionalmodules and pooling operations Test Plan: revert-hammer Differential Revision: D16879132 Original commit changeset: cd8c10182aa7 fbshipit-source-id: 9b67ccf73f43d15ef50bf0331d3df4d57835931b	2019-08-26 16:25:25 -07:00
Raghuraman Krishnamoorthi	794f63fe92	Update mapping dictionary to support functionalmodules and pooling operations (#24804 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24804 ghstack-source-id: 89003799 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_resnet_base\ $test_quantization.PostTrainingQuantTest$' --print-passing-details Differential Revision: D16879132 fbshipit-source-id: cd8c10182aa732ddf655bcda17f72ea08033a633	2019-08-26 12:16:49 -07:00
Raghuraman Krishnamoorthi	d7f6ac1dbb	Handle empty qconfig for functional Modules (#24803 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24803 ghstack-source-id: 89003797 Test Plan: Test implemented in D16879132/ Differential Revision: D16879133 fbshipit-source-id: 230f5204cfbd149fea1c0985578a2572a0e0f2a8	2019-08-26 12:16:46 -07:00
Zafar Takhirov	1a74bd407d	Fixes the adding of the observer to the FloatFunctional (#24418 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24418 Fixes #24394 The observer is not added correctlty, because one of the conditions is not met. Test Plan: Imported from OSS Differential Revision: D16833951 Pulled By: zafartahirov fbshipit-source-id: bb4699e6a1cf6368c7278272a68e5e7c6d3f59a8	2019-08-15 17:27:00 -07:00
Jerry Zhang	761ae8e9b6	Add intrinsic module mappings (#23753 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23753 Add intrinsic(fused) module mappings in quantize.py to enable mapping fused modules in both QAT and post PTQ Differential Revision: D16820749 fbshipit-source-id: 07de76a4f09b44bde8b193c103eac02c22b875b6	2019-08-15 09:37:24 -07:00
Jianyu Huang	0f64043b49	Remove the activation observer for default_qconfig (#24299 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24299 As suggested in https://github.com/pytorch/pytorch/pull/24232, we will remove the activation observer for dynamic quantization path. ghstack-source-id: 88287094 Differential Revision: D16798590 fbshipit-source-id: 07a245d5584b5b15c6895d9b09deef4a0605073a	2019-08-14 17:21:50 -07:00
Jianyu Huang	e8d2ddc2c4	Make the default qconfig_dict (#24232 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24232 As suggested in https://github.com/pytorch/pytorch/pull/23128#discussion_r306650311, we will make the keys of default_qconfig_dict as `torch.nn.Linear`. That is, we will do the dynamic quantization on the `torch.nn.Linear` by default, if the user just specify `torch.quantize_dynamic(model)`. ghstack-source-id: 88287089 Differential Revision: D16781191 fbshipit-source-id: 991a5e151a9ea32b879d6897cd9862855d747135	2019-08-14 15:12:55 -07:00
Jianyu Huang	584c6986fd	Add the type matching rule for qconfig_dict (#23212 ) Summary: We want to use the Module type as the key for the qconfig_dict for the module replacement during the quantization. Before this Diff, to dynamic quantize the BERT model, we have to specify each layer: ``` qconfig_dict = { 'encoder.layer.0.attention.self.query': default_qconfig, 'encoder.layer.0.attention.self.key': default_qconfig, 'encoder.layer.0.attention.self.value': default_qconfig, 'encoder.layer.0.attention.output.dense': default_qconfig, 'encoder.layer.0.intermediate.dense': default_qconfig, 'encoder.layer.0.output.dense': default_qconfig, 'encoder.layer.1.attention.self.query': default_qconfig, 'encoder.layer.1.attention.self.key': default_qconfig, 'encoder.layer.1.attention.self.value': default_qconfig, 'encoder.layer.1.attention.output.dense': default_qconfig, 'encoder.layer.1.intermediate.dense': default_qconfig, 'encoder.layer.1.output.dense': default_qconfig, ... } ``` After this Diff, we only need the following ``` qconfig_dict = { torch.nn.Linear : default_qconfig } ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/23212 ghstack-source-id: 88287091 Reviewed By: zafartahirov Differential Revision: D16436542 fbshipit-source-id: 11fbe68ee460560c1a7cdded63581eb7a00e5a89	2019-08-14 13:07:36 -07:00
Jianyu Huang	e94ba742b0	Dynamic Quantized Linear Module (#23128 ) Summary: - ~~Add a unit test for the Dynamic Quantized Linear operator (```torch.fbgemm_linear_quantize_weight```, ```torch.fbgemm_pack_quantized_matrix```, and ```torch.fbgemm_linear_int8_weight```) in ```test_quantized.py```.~~ Move this to D16404027 for a separate review. - Add the Dynamic Quantized Linear module in ```torch/nn/quantized/modules/linear.py```. ~~This is in a rudimentary stage. Will add more functions later~~. - Add the torch.quantize logic (prepare, eval, convert) for dynamic quantization. - Add a unit test for the Dynamic Quantized Linear module in ```test_nn_quantized.py```. - Add a unit test for the Model-level Quantization API Pull Request resolved: https://github.com/pytorch/pytorch/pull/23128 ghstack-source-id: 88257232 Differential Revision: D16258664 fbshipit-source-id: 4be3ac39ee27c088b341c741d3f09f51d5a23ef0	2019-08-13 21:01:23 -07:00
Zafar Takhirov	4cc16782f3	Removing the make_module script. (#23635 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23635 It appears it is the same complexity to add new modules using a base class and using a generation script. Test Plan: Imported from OSS Differential Revision: D16593364 Pulled By: zafartahirov fbshipit-source-id: 852dcf41f3dfa2a89152042b8e61d0b6defa8feb	2019-08-13 09:58:28 -07:00
Jerry Zhang	89956374c3	Remove qconfig_dict from API (#23465 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23465 We decided not to allow user to use qconfig_dict to do quantization since that API is not robust. Differential Revision: D16611504 fbshipit-source-id: b0d1d311b32c990a165c480f50e9ce3d68b785b5	2019-08-02 10:28:48 -07:00
Zafar Takhirov	9c549dfdc1	make_module: First version Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23288 Test Plan: Imported from OSS Differential Revision: D16455390 Pulled By: zafartahirov fbshipit-source-id: 4352f0a17cd0382b48502b93e51574cc3acdfdcc	2019-07-30 22:14:44 -07:00
Jerry Zhang	bc64324da9	Change condition in swap module Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23561 Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D16570928 Pulled By: jerryzh168 fbshipit-source-id: 70f36f577ac657d015f3d7738819867742088e5a	2019-07-30 17:25:02 -07:00
Jerry Zhang	7364aa796d	skip nn.Identity in add_observer Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23500 Test Plan: e2e test in quantizing resnext 101 Imported from OSS Differential Revision: D16550190 Pulled By: jerryzh168 fbshipit-source-id: 6128d7c3419235152b43739fcc5cade34342ba3d	2019-07-30 11:00:36 -07:00
Jerry Zhang	d7448c7812	quantized conv module (#23178 ) Summary: att Pull Request resolved: https://github.com/pytorch/pytorch/pull/23178 ghstack-source-id: 86973164 Differential Revision: D16426871 fbshipit-source-id: a2ebb38997acfeb61b7dfd6b11dd8ee9b3a7a8ed	2019-07-22 20:47:40 -07:00
Jerry Zhang	77353636de	Conv module (#23084 ) Summary: Added Conv module for qat Pull Request resolved: https://github.com/pytorch/pytorch/pull/23084 ghstack-source-id: 86862445 Differential Revision: D16379417 fbshipit-source-id: 742cc8b8e0f132070ca4943a1c2e3db60c2b5bdc	2019-07-19 18:49:52 -07:00
Jerry Zhang	7cc029cb75	Quantization aware training in eager mode (#23082 ) Summary: Add support for quantization aware training in eager mode Modifications to Post training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant to weight, we need to swap the float modules that has weight with different qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modify the weight in forward_pre hook and change it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change forward function and in forward function they are using self.weight. * But we will need to keep two copies of weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in forward function ## Convert * flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in prepare step. Pull Request resolved: https://github.com/pytorch/pytorch/pull/23082 ghstack-source-id: 86824650 Differential Revision: D16379374 fbshipit-source-id: 7d16d1acd87025065a24942ff92abf18e9fc8070	2019-07-19 14:57:25 -07:00
Soumith Chintala	84c2c89e2c	Revert D16199356: [qat] Quantization aware training in eager mode Differential Revision: D16199356 Original commit changeset: 62aeaf47c12c fbshipit-source-id: d06a96b0a617ae38029ffb246173ec065454b666	2019-07-19 03:18:48 -07:00
Soumith Chintala	f19aa12ae5	Revert D16274792: [qat] Conv module Differential Revision: D16274792 Original commit changeset: 1da10194123b fbshipit-source-id: 71b34774b463f2350289bd39b8cfd798e095ffa5	2019-07-19 03:18:45 -07:00
Jerry Zhang	12d9d768b8	Conv module (#22899 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22899 Added Conv module for qat Reviewed By: zafartahirov Differential Revision: D16274792 fbshipit-source-id: 1da10194123b2759a6a35c60d1c2d2c0b569ccdc	2019-07-18 18:58:07 -07:00
Jerry Zhang	65ef671d11	Quantization aware training in eager mode (#22732 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22732 Add support for quantization aware training in eager mode Modifications to Post training flow: ## Prepare * Fusion: e.g. (Conv, Bn) → ConvBn (float) * Swapping: To insert fake_quant to weight, we need to swap the float modules that has weight with different qat modules, e.g. Conv → torch.nn.qat.Conv , ConvBn → torch.nn._intrinsic.qat.ConvBn ``` * previously we were thinking about modify the weight in forward_pre hook and change it back in forward_hook: * def forward_pre_hook(self, input): self.float_weight = self.weight self.weight = self.fake_quantize(self.float_weight) def forward_hook(self, input): self.weight = self.float_weight ``` * Assignments to self.weight are needed because we can’t change forward function and in forward function they are using self.weight. * But we will need to keep two copies of weight in this case, so it’s probably better to just swap the module * So we want to just swap Conv to torch.nn.qat.Conv and Linear to torch.nn.qat.Linear * qat modules will have fake_quant for output and weights inserted in forward function ## Convert * flow should be identical to ptq, but the swapping dictionary is slightly different since modules are changed in prepare step. Reviewed By: zafartahirov Differential Revision: D16199356 fbshipit-source-id: 62aeaf47c12c62a87d9cac208f25f7592e245d6c	2019-07-18 18:58:03 -07:00
Jerry Zhang	b984b0ab4b	fix print (#22689 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22689 att Reviewed By: Lucaskabela Differential Revision: D16184260 fbshipit-source-id: 1a6ad51a37918d0c81d6e3baa0ca0baa32cb9673	2019-07-10 11:26:34 -07:00
Jerry Zhang	5040d52a5a	torch.quantization conversion utilities, observers for eager mode quantization (#22010 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22010 torch.quantization module with observers and conversion routines Reviewed By: zafartahirov Differential Revision: D15554183 fbshipit-source-id: 05a3fabe28dd701978b8ecebf5bfc3a4c044ba5c	2019-07-09 10:51:38 -07:00

1 2 3

116 Commits