pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jerry Zhang	b2291d4600	Make PerChannelMinMaxObserver scriptable using `torch.jit.ignore` (#29416 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29416 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18580906 fbshipit-source-id: 5370300b89e26c2b4662b17e51284e8708cb5843	2019-11-19 19:12:55 -08:00
Vitaly Fedyunin	877c96cddf	explicitly provide memory format when calling to *_like operators Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30008 Test Plan: Imported from OSS Differential Revision: D18575981 Pulled By: VitalyFedyunin fbshipit-source-id: ec3418257089ad57913932be1a8608cd20ce054c	2019-11-19 16:19:29 -08:00
Xiaomeng Yang	510ef4b63a	Add nn.quantized.Conv3d (#29813 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29813 Add nn.quantized.Conv3d Test Plan: buck test mode/dev-nosan //caffe2/test:quantized -- "conv" Reviewed By: jianyuh Differential Revision: D18467749 fbshipit-source-id: 892f708179e9e836ad902851ac1838847009da15	2019-11-15 04:33:40 -08:00
Zafar Takhirov	09d359dfd9	Changed default args in quantization observers Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29640 Test Plan: Imported from OSS Differential Revision: D18447297 Pulled By: z-a-f fbshipit-source-id: 7c86a5bb467a2fad8fe30c935d9c031c69868296	2019-11-12 23:32:05 -08:00
Jianyu Huang	bbff06ee96	Convert conv_prepack to conv2d_prepack and conv_unpack to conv2d_unpack (#29529 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29529 Pull Request resolved: https://github.com/pytorch/glow/pull/3771 We would like to replace `conv_prepack` with `conv2d_prepack` and `conv_unpack` with `conv2d_unpack`. This makes the naming consistent between 2D and 3D conv: ``` torch.ops.quantized.conv2d_prepack torch.ops.quantized.conv2d_unpack torch.ops.quantized.conv2d torch.ops.quantized.conv3d_prepack torch.ops.quantized.conv3d_unpack torch.ops.quantized.conv3d ``` We should do this earlier rather than later when we have more users for the quantized conv2d ops, for better engineering. The replacement bash command is as the follows: ``` find ./ -type f -exec sed -i -e 's/quantized::conv_prepack/quantized::conv2d_prepack/g' {} \; find ./ -type f -exec sed -i -e 's/quantized::conv_unpack/quantized::conv2d_unpack/g' {} \; find ./ -type f -exec sed -i -e 's/torch.ops.quantized.conv_prepack/torch.ops.quantized.conv2d_prepack/g' {} \; find ./ -type f -exec sed -i -e 's/torch.ops.quantized.conv_unpack/torch.ops.quantized.conv2d_unpack/g' {} \; ``` ghstack-source-id: 93661879 Test Plan: CI Reviewed By: jackm321 Differential Revision: D18421079 fbshipit-source-id: 17ae8b1ee79223bd2c5d4bbccd57af6580c4ab12	2019-11-11 21:54:10 -08:00
Jerry Zhang	4bcf4796aa	Make HistogramObserver scriptable with `@torch.jit.ignore` (#27950 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27950 att Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D18360139 fbshipit-source-id: 5459ae49c087886e4990de136198773a75b1c572	2019-11-07 18:02:44 -08:00
Jerry Zhang	5ac3df7712	Minor fix and turn off fold_convbn (#27403 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27403 In fold_convbn pass, we need to recompute the parameter(weight, bias) for conv, update the attribute of conv and update the access of bias in conv because if the original conv have no bias, the `self.bias` access will be inline and replaced by Constant node `None = prim::Constant()`, we need to update this to use `GetAttr[name="bias"]` to make this work. But there is also some work going on the handle constants, so we'll fix this pass after that is done. Test Plan: . Imported from OSS Differential Revision: D18182918 fbshipit-source-id: bba510bc41ab58e0eb76f7b77335b6e3ffe2862d	2019-11-01 12:15:38 -07:00
Jerry Zhang	0eeda56632	Add nn.ReLU6 to default mapping (#28516 ) Summary: https://discuss.pytorch.org/t/quantized-hard-sigmoid/59013 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28516 Differential Revision: D18128717 Pulled By: jerryzh168 fbshipit-source-id: 4d06d1b54cf9f84a610d79fbadde2c8ef38c33f8	2019-10-25 14:52:44 -07:00
なるみ	d83389d327	Ignore F401 in all __init__.py without putting noqa (#25823 ) Summary: By adding `per-file-ignores = __init__.py: F401` into `.flake8` with `flake8>=3.7`, we can ignore F410 in all `__init__.py` without putting `# noqa: F401` line by line. http://flake8.pycqa.org/en/latest/user/options.html?highlight=per-file-ignores#cmdoption-flake8-per-file-ignores Pull Request resolved: https://github.com/pytorch/pytorch/pull/25823 Differential Revision: D17252182 Pulled By: soumith fbshipit-source-id: 87b174075b79e4078953a7521bd1a8f82405646b	2019-10-23 15:28:13 -07:00
Jerry Zhang	e280f93e31	Prepack folding for conv2d (#27119 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27119 att Test Plan: python test/test_jit.py 'TestJit.test_fold_prepack' Imported from OSS Differential Revision: D17717636 fbshipit-source-id: 97e9f8d927f7eacedf09f47b8ae1bf8216b8cad4	2019-10-23 09:03:14 -07:00
Raghuraman Krishnamoorthi	94757e035d	Do not insert observers for empty sequential modules (#28384 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28384 ghstack-source-id: 92340259 Test Plan: buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train $test_quantization\.FusionTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval $test_quantization\.FusionTest$' --print-passing-details Differential Revision: D18047293 fbshipit-source-id: 7e18b1aa76cc0fd26e8ee48a70c3a45688e73549	2019-10-21 20:32:13 -07:00
Zafar Takhirov	783c9c8445	Adding docstring to the observers (#27791 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27791 This is the first part of the change. The next ones will amend more :) Test Plan: Imported from OSS Differential Revision: D17889913 Pulled By: z-a-f fbshipit-source-id: ff74007903dd789d4c68684e83b50c0c86a25149	2019-10-21 19:09:50 -07:00
Zafar Takhirov	07b5666a87	Add default arg to `prepare_qat` mapping. (#28193 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28193 Fixes #28015 Test Plan: Imported from OSS Differential Revision: D17973121 Pulled By: z-a-f fbshipit-source-id: 03b3f70c70b89060c1f03d7ed8ab6002fe60bd49	2019-10-17 14:11:54 -07:00
Zafar Takhirov	a5ac7f6387	Changing observer name Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27779 Test Plan: Imported from OSS Differential Revision: D17886605 Pulled By: z-a-f fbshipit-source-id: 68c50b482e65015336ff27171fd730da493525b6	2019-10-17 11:36:03 -07:00
Zafar Takhirov	dc8785a022	Refactoing names for consistency Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27670 Test Plan: Imported from OSS Differential Revision: D17846269 Pulled By: z-a-f fbshipit-source-id: ed3c7441c185bf11b2e62879aa3ecbc654aa2d4e	2019-10-16 12:18:26 -07:00
zou3519	e5d6b75319	Bag of documentation fixes; fix more sphinx warnings (#27850 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27850 Many of these are real problems in the documentation (i.e., link or bullet point doesn't display correctly). Test Plan: - built and viewed the documentation for each change locally. Differential Revision: D17908123 Pulled By: zou3519 fbshipit-source-id: 65c92a352c89b90fb6b508c388b0874233a3817a	2019-10-15 07:31:14 -07:00
zou3519	23bffc4f14	Fix most documentation warnings (#27782 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27782 Warnings show up when running `make html` to build documentation. All of the warnings are very reasonable and point to bugs in our docs. This PR attempts to fix most of those warnings. In the future we will add something to the CI that asserts that there are no warnings in our docs. Test Plan: - build and view changes locally Differential Revision: D17887067 Pulled By: zou3519 fbshipit-source-id: 6bf4d08764759133b20983d6cd7f5d27e5ee3166	2019-10-13 10:34:01 -07:00
Michael Suo	341262754f	module dedupe (#26666 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26666 Changes: - Introduce a `ConcreteModuleType` concept. This acts both as the key into the type cache, and as the source of truth for `ModuleValue::attr` queries. It needs to do both jobs because that's how we ensure correctness (if the types are different, it's because `ModuleValue::attr` would return different things). - Now `recursive_script` will first construct a `ConcreteModuleType` and search for a pre-existing type before starting compilation. - All previous paths to creating a `ScriptModule` (including inheriting from `ScriptModule`) are now rewritten to go through `create_script_module`, so that we have only a single place where construction happens. Behavioral changes: - Big change to `torch.jit.ScriptModule` inheritance: all attributes are now recursively scripted if possible, matching recursive scripting semantics. This makes it hard to keep something from being scripted (for example, a Python submodule). Possibly we'll need an `ignore()` type thing for attributes. In particular, this adds `self.training` to every ScriptModule, since it's present on every `nn.Module`. - I believe this change to be transparent to existing users of the inheritance API, since if you had an attribute that is unscriptable that you never used, there is no error. In some cases, we will create new attributes (even if they are unused), which will increase serialized model size from before. Test Plan: Imported from OSS Differential Revision: D17551196 Pulled By: suo fbshipit-source-id: b476d1c9feb3ddfd63406d90989aaf9dfe890591	2019-10-12 09:51:57 -07:00
Chris Gottbrath	a96b003b39	docstring only formatting changes: quantize.py, fake_quantize.py, observer.py Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27415 Reviewed By: zafartahirov Differential Revision: D17783101 Pulled By: gottbrath fbshipit-source-id: a7acbc55edfaa75fdbd17fd30d530710a401b22f	2019-10-08 09:21:03 -07:00
davidriazati	0046092178	Reduce special casing around 'training' (#27109 ) Summary: Most of this was old cruft left over from special handling of `training` before we had a `bool` type. This makes all modules have a `training` attribute that is true by default and removes all other special handling. Fixes #26884 ](https://our.intern.facebook.com/intern/diff/17728129/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/27109 Pulled By: driazati Differential Revision: D17728129 fbshipit-source-id: 8ddc9fbb07a953dd05529538bfdd01ed88b5cb57	2019-10-07 13:52:59 -07:00
Raghuraman Krishnamoorthi	ac0f18437f	MovingAverage Observer (#27396 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27396 Observer that estimates moving averages of min and max values per batch, more suited for quantization aware training instead of minmax observers that track extremal values across batches ghstack-source-id: 91369018 Test Plan: buck test caffe2/test:quantization -- 'test_per_tensor_observers $test_quantization\.ObserverTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_per_channel_observers $test_quantization\.ObserverTest$' --print-passing-details Differential Revision: D17727213 fbshipit-source-id: 024a890bf3dd0bf269d8bfe61f19871d027326f0	2019-10-04 16:28:59 -07:00
Zafar Takhirov	6bb7433ad5	Replacing the skip_list with white_list in the qconfig propagation Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27183 Test Plan: Imported from OSS Differential Revision: D17700548 Pulled By: zafartahirov fbshipit-source-id: 18e6ffbda496b14ac1da1783f928ad539cdb1d16	2019-10-03 20:40:17 -07:00
Zafar Takhirov	111da77912	Factored out the default mappings Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27164 Test Plan: Imported from OSS Differential Revision: D17694475 Pulled By: zafartahirov fbshipit-source-id: df8df5f7d66062ed35da957064a31344e1d3c961	2019-10-03 11:52:21 -07:00
James Reed	a423817055	Fix reprs for _intrinsic modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27184 Test Plan: Imported from OSS Differential Revision: D17717481 Pulled By: jamesr66a fbshipit-source-id: 4bd72bcd42191d9b21d03f5bb6698198dbffffda	2019-10-02 19:55:49 -07:00
James Reed	1affa7c32c	Allow set for qconfig for dynamic_quantize Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27181 Test Plan: Imported from OSS Differential Revision: D17717482 Pulled By: jamesr66a fbshipit-source-id: f3930fc87831cbdcf4390cd769c594bb13f5cd81	2019-10-02 19:55:45 -07:00
Zafar Takhirov	27dc595215	Rename _intrinsic to intrinsic Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27194 Test Plan: Imported from OSS Differential Revision: D17704957 Pulled By: zafartahirov fbshipit-source-id: 46f02d129aa77c3047b2a6c606bfadd831a6b0fc	2019-10-02 18:53:06 -07:00
Raghuraman Krishnamoorthi	4abfb5493e	Handle uninitialized min/max values in histogram observer (#27151 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27151 We need to be ab le to handle observers with no min/max data correctly as models sometimes have modules that do not get any data. ghstack-source-id: 91113403 Test Plan: buck test caffe2/test:quantization -- test_minmax_observer buck test caffe2/test:quantization -- test_per_channel_minmax_observer buck test caffe2/test:quantization --test_histogram_observer Reviewed By: csummersea Differential Revision: D17690828 fbshipit-source-id: e95709333ea0f66d79ddb8141b7cba5a83347dbd	2019-10-01 14:56:37 -07:00
Jerry Zhang	98c02e6df3	Enable tests (#27103 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27103 att Test Plan: python test/test_quantization.py 'GraphModePostTrainingQuantTest' Imported from OSS Differential Revision: D17678261 fbshipit-source-id: 5caa7512c6ff4a613980c86b5b221e0cfbe0a173	2019-10-01 12:10:21 -07:00
Jerry Zhang	f742ceaa46	API - add more passes to graph mode (#27093 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27093 Add `insert_prepack_unpack` and `fold_prepack` to `convert_script` Test Plan: . Imported from OSS Differential Revision: D17678262 fbshipit-source-id: 4bfd6681af6fce226cc77aed8dd84066cbd8ed17	2019-10-01 11:26:02 -07:00
Raghuraman Krishnamoorthi	dddae3f854	Fuse module enhancements (#26457 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26457 Enhancement to fuse module to support sequentials, fuse list can now be just like the state dict. Also add support for Conv-Relu and linear-relu fusion Also support inplace and out of place fusion of models. ghstack-source-id: 91076386 Test Plan: buck test caffe2/test:quantization -- 'test_fusion_sequential_model_train $test_quantization\.FusionTest$' --print-passing-details buck test caffe2/test:quantization -- 'test_fusion_sequential_model_eval $test_quantization\.FusionTest$' --print-passing-details Differential Revision: D17466382 fbshipit-source-id: 0a548f8f4c366f3ecc59db693bac725ccd62328e	2019-09-30 22:00:20 -07:00
Raghuraman Krishnamoorthi	9e3ba35500	Add control for observers in Fake-quantize module (#27113 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27113 Fix bug in fake quant control of observer and fake-quantize operations. Add test to ensure that features work as expected ghstack-source-id: 91071181 Test Plan: buck test mode/dev-nosan caffe2/test:fake_quant -- test_fake_quant_control Differential Revision: D17678875 fbshipit-source-id: 2912ad8b6e674daa1d129f7a7c6f27d8c1b4f93b	2019-09-30 18:23:26 -07:00
Raghuraman Krishnamoorthi	d5298b6e66	Default observer and fake-quant for backends (#26627 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26627 ghstack-source-id: 91008337 Test Plan: buck test caffe2/test:quantization -- --print-passing-details Differential Revision: D17518194 fbshipit-source-id: 1eb8a7a85dc811c4ee5228d68563abb157613ceb	2019-09-30 00:37:11 -07:00
Raghuraman Krishnamoorthi	32b0e8c980	Emulate weight and activation only quant with fake quant, numerics test (#26625 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26625 ghstack-source-id: 91008296 Test Plan: buck test caffe2/test:quantized -- 'test_weight_only_activation_only_fakequant $test_quantized_models\.ModelNumerics$' --print-passing-details Differential Revision: D17520342 fbshipit-source-id: 26e148d3299afcfdfb1187aff6ab80687ed8df47	2019-09-30 00:37:07 -07:00
Raghuraman Krishnamoorthi	7dc7075795	Per channel fake quant (#26623 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26623 Per-channel fake quant cpu and cuda operators, per-channel support in fake quant module, tests for per-channel fake-quant and serializability of fake quant modules ghstack-source-id: 91008299 ghstack-source-id: 91008299 Test Plan: buck test mode/dev caffe2/test:fake_quant -- Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1970324848875929 ✓ caffe2/test:fake_quant - test_backward_per_tensor (test_fake_quant.TestFakeQuantizePerTensor) 0.242 1/10 (passed) ✓ caffe2/test:fake_quant - test_numerical_consistency_per_tensor (test_fake_quant.TestFakeQuantizePerTensor) 0.204 2/10 (passed) ✓ caffe2/test:fake_quant - test_fq_serializable (test_fake_quant.TestFakeQuantizePerTensor) 0.174 3/10 (passed) ✓ caffe2/test:fake_quant - test_numerical_consistency_per_channel (test_fake_quant.TestFakeQuantizePerChannel) 0.279 4/10 (passed) ✓ caffe2/test:fake_quant - test_forward_per_tensor (test_fake_quant.TestFakeQuantizePerTensor) 0.241 5/10 (passed) ✓ caffe2/test:fake_quant - test_forward_per_channel (test_fake_quant.TestFakeQuantizePerChannel) 0.353 6/10 (passed) ✓ caffe2/test:fake_quant - test_fq_module (test_fake_quant.TestFakeQuantizePerTensor) 0.354 7/10 (passed) ✓ caffe2/test:fake_quant - test_backward_per_channel (test_fake_quant.TestFakeQuantizePerChannel) 0.334 8/10 (passed) ✓ caffe2/test:fake_quant - test_fq_serializable (test_fake_quant.TestFakeQuantizePerChannel) 0.168 9/10 (passed) ✓ caffe2/test:fake_quant - test_fq_module (test_fake_quant.TestFakeQuantizePerChannel) 0.429 10/10 (passed) ✓ caffe2/test:fake_quant - main 0.000 (passed) Differential Revision: D17439406 fbshipit-source-id: 64bfff5e4f40bc2ab8af2b432c7bc33805418077	2019-09-30 00:21:25 -07:00
Raghuraman Krishnamoorthi	2ccbdb79c8	Per-channel baseline (#26516 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26516 ghstack-source-id: 90982010 Test Plan: Integrate per-channel support into conv and linear modules. The following tests pass: buck test caffe2/test:quantized -- 'test_linear_api $test_quantized_nn_mods\.ModuleAPITest$' --print-passing-details buck test caffe2/test:quantized -- 'test_conv_api $test_quantized_nn_mods\.ModuleAPITest$' --print-passing-details buck test caffe2/test:quantized -- 'test_float_quant_compare_per_channel $test_quantized_models\.ModelNumerics$' --print-passing-details Differential Revision: D17342622 fbshipit-source-id: f0d618928e3d9348672c589a6b7a47049c372a2e	2019-09-28 14:05:06 -07:00
Jerry Zhang	09f0e949cd	PyTorch Graph Mode Quantization API (#26390 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26390 `quantize_script`: top level API for graph mode quantization Test Plan: there are some known issues, we can enable test after all known issues are fixed. Imported from OSS Differential Revision: D17645132 fbshipit-source-id: 61f261d5607409d493b39a2f4e05ebd017279f6b	2019-09-27 19:23:51 -07:00
Raghuraman Krishnamoorthi	8fa9900c28	control of observer/fake-quant operations (#26520 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26520 Hooks to enable control of observer and fake quant that can be used by model.apply() to control fake quant during QAT ghstack-source-id: 90897063 Test Plan: buck test caffe2/test:quantization -- --print-passing-details Differential Revision: D17491155 fbshipit-source-id: 80ff0d7a1ac35c96e054b4f0165a73c56c2f53cc	2019-09-27 11:01:34 -07:00
Raghuraman Krishnamoorthi	102a148641	Default histogram observer (#26622 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26622 ghstack-source-id: 90897064 Test Plan: buck test caffe2/test:quantization -- --print-passing-details Differential Revision: D17508787 fbshipit-source-id: ae733ab35ec9b0233264014b8054d4d870fb05e1	2019-09-27 10:39:21 -07:00
Raghuraman Krishnamoorthi	b0a2f6f2f5	Serialization and range reduction support for Fake Quant/Observer (#26519 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26519 ghstack-source-id: 90895631 Test Plan: buck test caffe2/test:quantization -- 'test_histogram_observer $test_quantization\.ObserverTest$' --print-passing-details and buck test caffe2/test:fake_quant -- 'test_fq_serializable $test_fake_quant\.TestFakeQuantizePerTensorAffine$' --print-passing-details Differential Revision: D17217408 fbshipit-source-id: 0da7efdcdae0c065dd035c5dd2b6a78231545ece	2019-09-27 10:09:39 -07:00
Raghuraman Krishnamoorthi	9a5e2e80b8	Fake quantization enhancements for QAT/PTQ support- fix tests (#26876 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26876 Add ability to turn fake quantization and observers independently. ghstack-source-id: 90892132 Test Plan: buck test caffe2/test:quantized -- 'test_conv_bn_relu $test_qat\.IntrinsicQATModuleTest$' --print-passing-details Differential Revision: D17592961 fbshipit-source-id: 24c60c94ed7c6c9fa55c634a8545731614e4f52f	2019-09-27 08:59:29 -07:00
Dmytro Dzhulgakov	0a8a779abe	Add more inplace arguments to quantization top level API (#26782 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26782 At least we should be consistent on top-level APIs and prepare/convert/etc. Logic is inplace=False by default but top-level APIs take care of doing fewer copies. Also renames always-inplace methods like add_observer to have underscore in the end. One fix for MinMaxObserver was triggered by deepcopy surfacing that we were accidentally keeping autograd around Test Plan: Imported from OSS Differential Revision: D17595956 Pulled By: dzhulgakov fbshipit-source-id: 801f9f5536b553f24c7a660064dd6fce685edd65	2019-09-26 00:07:07 -07:00
Richard Zou	be93d30e37	Revert D17458232: Fake quantization enhancements for QAT/PTQ support Test Plan: revert-hammer Differential Revision: D17458232 Original commit changeset: f44380c60f1a fbshipit-source-id: 64a244c720b61fa912bacbb23fcbf9faed0757c2	2019-09-25 04:56:30 -07:00
Raghuraman Krishnamoorthi	e2c3d7e52c	Fake quantization enhancements for QAT/PTQ support (#26420 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26420 Flags for enabling/disabling observer and fake quant independently. Improve repr for fake quant. ghstack-source-id: 90704254 Test Plan: buck test caffe2/test:fake_quant -- --print-passing-details buck test caffe2/test:quantization -- --print-passing-details Differential Revision: D17458232 fbshipit-source-id: f44380c60f1a10a8ea09bca8ab79ba5d1867ed62	2019-09-25 02:02:00 -07:00
Raghuraman Krishnamoorthi	bc4519dc27	Handle DeQuantStub() for QAT (#26518 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26518 Skip Dequantize() modules for QAT alone. For fake quant insertion, DeQuantize() is a no-op and we should not be inserting fake-quant. ghstack-source-id: 90704220 Test Plan: buck test caffe2/test:quantization -- --print-passing-details Tests in test_quantization pass with changes: Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/281475121296989 Summary (total time 73.03s): PASS: 28 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 Differential Revision: D17439333 fbshipit-source-id: f716c23500324ae08c8d104ee2c9587fa6926571	2019-09-25 00:35:34 -07:00
Dmytro Dzhulgakov	128a65e2e0	Use noop observer to pass dtype for dynamic quantization (#26709 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26709 Polishes implementation from #25975. Primarily, we use NoopObserver to communicate that weights need to be quantized to float16. The very top-level API (quantize_dynamic) stays the same with `dtype` argument but the implementation follows the common flow. One can argue that dynamic fp16 quantization doesn't really fit into the 'observer' mechanism. It's in fact not ideal, but it's better to have the same flow than branching on both dtype and qconfig. Test Plan: Imported from OSS Differential Revision: D17544103 Pulled By: dzhulgakov fbshipit-source-id: 6af3f18c35929a1a53ea734079c005f656e4925f	2019-09-24 09:24:39 -07:00
Dmytro Dzhulgakov	a79b3685db	Simplify observers declaration with functools.partial (#26492 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26492 Previous definition of observers was quite clumsy - with things like `default_observer()()`. This PR strips a way a lot of craft and allows to pass just class names directly. In order to override default arguments either `functools.partial` can be used or convenient wrapper `MyObserver.with_args(x=1)` is provided. Also rename `QConfig_dynamic` to `QConfigDynamic` because it violates the naming convention. Test Plan: Imported from OSS Differential Revision: D17521265 Pulled By: dzhulgakov fbshipit-source-id: ba9df19b368641acf4093c43df9990796284fd9e	2019-09-23 10:15:59 -07:00
Lingyi Liu	11f9fe2433	Fix the API for record observer (#26413 ) Summary: Mainly want to resolve comments from https://github.com/pytorch/pytorch/pull/25830. Overall, we want to provide a recording observer for recording the runtime tensor values of activation path in order to debug the numerical accuracy loss offline. According to the feedback from https://github.com/pytorch/pytorch/issues/25830, it might be better to record all the observers in a dict and query the dict to get corresponding tensor values. hx89 is working on how to insert the recording observers into model under debug. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26413 Differential Revision: D17506502 Pulled By: llyfacebook fbshipit-source-id: 3ab90dc78920e7ec3fa572c2a07327a9991c530a	2019-09-20 14:27:56 -07:00
Jianyu Huang	f433ee1499	Add the FP16 weight support for LSTM in dynamic_quantize (#25975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25975 We would like to add the FP16 weight support for the dynamic quantized LSTM. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm794.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details Building: finished in 13.4 sec (100%) 8134/8134 jobs, 81 updated Total time: 13.9 sec Trace available for this run at /tmp/testpilot.20190910-210241.2092790.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision c86e65add357582accb6ec0be23b92c8a2c510bd fbpkg ca46e8f5b26c451a8b0b2462c11bb61d at Mon Sep 9 22:16:37 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/696/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 0.183 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 0.184s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 Summary (total time 4.35s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D17299116 fbshipit-source-id: 7fe91ece25867f2c0496f1b63fb1041e6b815166	2019-09-19 22:19:22 -07:00
Haixin Liu	dcbfc3bdbf	Add per channel observer (#25887 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25887 ghstack-source-id: 90383258 Add per channel observer to compute the qparams for each channel. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer' buck test mode/dev caffe2/test:quantization -- 'test_per_channel_minmax_observer_scriptable' Differential Revision: D17137226 fbshipit-source-id: 0b1c93e3cbcda86f5c4e30f7cd94c670f2665063	2019-09-18 22:16:45 -07:00
Haixin Liu	f2e9622ed8	Add l2 norm minimization (#24022 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24022 In histogram observer add an approximation for L2 error minimization for selecting min/max. By selecting new min/max, we filter out outliers in input distribution. This follows the implementation of NormMinimization::NonlinearQuantizationParamsSearch in caffe2/quantization/server/norm_minimization.cc ghstack-source-id: 90298789 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_histogram_observer' Differential Revision: D16713239 fbshipit-source-id: 82631ba47974e25689c9c66bc3088117090e26d4	2019-09-18 00:07:10 -07:00

1 2 3

101 Commits