pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Guilherme Leobas	63a0bb0ab9	Add typing annotations for torch.nn.quantized.dynamic.modules.rnn (#43186 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/43185 xref: [gh-43072](https://github.com/pytorch/pytorch/issues/43072) Pull Request resolved: https://github.com/pytorch/pytorch/pull/43186 Reviewed By: ezyang Differential Revision: D23441259 Pulled By: malfet fbshipit-source-id: 80265ae7f3a70f0087e620969dbd4aa8ca17c317	2020-09-01 10:25:10 -07:00
Yanan Cao	bdcf320bed	Support custom exception message (#41907 ) Summary: Raise and assert used to have a hard-coded error message "Exception". User provided error message was ignored. This PR adds support to represent user's error message in TorchScript. This breaks backward compatibility because now we actually need to script the user's error message, which can potentially contain unscriptable expressions. Such programs can break when scripting, but saved models can still continue to work. Increased an op count in test_mobile_optimizer.py because now we need aten::format to form the actual exception message. This is built upon an WIP PR: https://github.com/pytorch/pytorch/pull/34112 by driazati Pull Request resolved: https://github.com/pytorch/pytorch/pull/41907 Reviewed By: ngimel Differential Revision: D22778301 Pulled By: gmagogsfm fbshipit-source-id: 2b94f0db4ae9fe70c4cd03f4048e519ea96323ad	2020-08-01 13:03:45 -07:00
Raghuraman Krishnamoorthi	480851ad2c	Docstring changes for dynamic quantized classes (#40931 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40931 Fix docstrings for dynamic quantized Linear/LSTM and associated classes ghstack-source-id: 107064446 Test Plan: Docs show up in correctly Differential Revision: D22360787 fbshipit-source-id: 8e357e081dc59ee42fd7f12ea5079ce5d0cc9df2	2020-07-03 21:04:12 -07:00
Raghuraman Krishnamoorthi	d7d75e37bb	Add state dict for LSTM and RNNCell and helper functions for accessing weights and bias (#40333 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40333 Add state_dict support for dynamic quantized LSTM/GRU/RNNCell. Add helper functions get_weight and get_bias for LSTM and RNNCells ghstack-source-id: 106364749 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_lstm_api $quantization\.test_quantized_module\.TestDynamicQuantizedModule$' --print-passing-details buck test caffe2/test:quantization -- 'test_cell_api $quantization\.test_quantized_module\.TestDynamicQuantizedModule$' --print-passing-details Differential Revision: D22151020 fbshipit-source-id: 2eb54062f6c6a35ffe4dbe8e8cfbf7ede0e92ba1	2020-06-22 17:41:07 -07:00
Raghuraman Krishnamoorthi	3258cb61b1	Dynamic quantization support for LSTMCell, RNNCell and GRUCell [Remove randomness in weights] (#40102 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40102 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105997236 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell $quantization\.test_quantize\.TestPostTrainingDynamic$' Differential Revision: D22071017 fbshipit-source-id: 3fe1eac39db9c1e0566838eb8b969bbb1fa983c9	2020-06-16 21:29:50 -07:00
Raghuraman Krishnamoorthi	15758bca55	Refactor LSTM tests, [Remove randomness in weights] (#40101 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40101 Create three tests for LSTMs: 1. test_qlstm: Test to check numerics of quantized LSTM operator. 2. test_lstm_api: To check the LSTM module and compare it with the quantized LSTM op 3. test_quantized_rnn: Check the dynamic quantization workflow, scriptability and serialization of quantized LSTM ghstack-source-id: 105997268 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_lstm_api $quantization\.test_quantized_module\.TestDynamicQuantizedModule$' --print-passing-details buck test caffe2/test:quantization -- 'test_quantized_rnn $quantization\.test_quantize\.TestPostTrainingDynamic$' buck test caffe2/test:quantization -- 'test_qlstm $quantization\.test_quantized_op\.TestDynamicQuantizedRNNOp$' --print-passing-details Differential Revision: D22070826 fbshipit-source-id: 46c333e19b9eab8fa5cab6f132e89b80a635791a	2020-06-16 17:24:07 -07:00
Raghuraman Krishnamoorthi	5add2e861c	Revert D21628596: Refactor LSTM tests Test Plan: revert-hammer Differential Revision: D21628596 Original commit changeset: 4aeda899f2e5 fbshipit-source-id: ab6544b87404863e054172aa9ec7ada51fad8e5e	2020-06-16 10:14:15 -07:00
Raghuraman Krishnamoorthi	e55e0cb1a9	Revert D20978736: Dynamic quantization support for LSTMCell, RNNCell and GRUCell Test Plan: revert-hammer Differential Revision: D20978736 Original commit changeset: 8f303ba1d7f8 fbshipit-source-id: bcd300819616d6536f582fcd3c90decd543c4657	2020-06-16 10:11:32 -07:00
Raghuraman Krishnamoorthi	48db06e39a	Dynamic quantization support for LSTMCell, RNNCell and GRUCell (#37159 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37159 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105946183 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell $quantization\.test_quantize\.TestPostTrainingDynamic$' Differential Revision: D20978736 fbshipit-source-id: 8f303ba1d7f8e0c646ac73e862d2c1e735b7ff61	2020-06-16 09:14:59 -07:00
Raghuraman Krishnamoorthi	655f1ea176	Refactor LSTM tests (#38851 ) Summary: Create three tests for LSTMs: 1. test_qlstm: Test to check numerics of quantized LSTM operator. 2. test_lstm_api: To check the LSTM module and compare it with the quantized LSTM op 3. test_quantized_rnn: Check the dynamic quantization workflow, scriptability and serialization of quantized LSTM Pull Request resolved: https://github.com/pytorch/pytorch/pull/38851 ghstack-source-id: 105945574 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_lstm_api $quantization\.test_quantized_module\.TestDynamicQuantizedModule$' --print-passing-details buck test caffe2/test:quantization -- 'test_quantized_rnn $quantization\.test_quantize\.TestPostTrainingDynamic$' buck test caffe2/test:quantization -- 'test_qlstm $quantization\.test_quantized_op\.TestDynamicQuantizedRNNOp$' --print-passing-details Differential Revision: D21628596 fbshipit-source-id: 4aeda899f2e5f14bfbe3d82096cb4ce89c725fa1	2020-06-16 00:41:24 -07:00
Supriya Rao	e1392922f2	[quant] Enable per-channel quantization for LSTM Modules (#39666 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39666 Test Plan: python test/test_quantization.py TestPostTrainingDynamic.test_per_channel_lstm_quantize Imported from OSS Differential Revision: D21977601 fbshipit-source-id: 1333259e75782e54864ab444e05397b86cd9b9aa	2020-06-10 23:19:08 -07:00
Supriya Rao	425927bb2b	[quant] Add reduce_range params for quantized_lstm (#39604 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39604 This change preserves BC for older models that are saved with reduce_range set to false. Newer models will use the version information in RNN module to toggle reduce_range parameter Internally this is implemented using a new CellParams type that calls the linear functions with reduce_range option set to true. New models serialized will use the CellParams struct for the `__getstate__` and `__setstate__` calls. Older models using QuantizedCellParamsDynamic will continue to use their original serialization/de-serialization methods tested using LSTM BC test and test_quantized_rnn Test Plan: python test/test_quantization.py Imported from OSS Differential Revision: D21977600 fbshipit-source-id: 0cb0e098b87207b537574d3beeab1f341c41c0d2	2020-06-10 23:16:57 -07:00
Supriya Rao	1d1f16079d	[quant] Add save/load state_dict to quantized dynamic RNNs (#39105 ) Summary: Previously dynamic LSTM modules weren't able to save/load from state_dict since PackedParameter used in RNNs isn't serializable from python Pull Request resolved: https://github.com/pytorch/pytorch/pull/39105 Test Plan: python test/test_quantization.py TestSerialization Reviewed By: jerryzh168 Differential Revision: D21752256 Pulled By: supriyar fbshipit-source-id: ef82cf21ce21a3a1304d147ed0da538c639f952d	2020-05-28 10:37:38 -07:00
James Reed	d44573a6dc	Remove _all_weight_values again (#38504 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38504 Test Plan: Imported from OSS Differential Revision: D21579530 Pulled By: jamesr66a fbshipit-source-id: 4449c92142200eaadc68b59d6f5f964ba60b1c80	2020-05-15 11:55:09 -07:00
Nikita Shulga	3e9b4332d2	Fix @skipIfNoFBGEMM for types (#38432 ) Summary: Return unmodified type from decorator if fbgemm is present. Fix `Tried to trace <__torch__.torch.classes.rnn.CellParamsBase object at 0x55f504c56b40> but it is not part of the active trace. Modules that are called during a trace must be registered as submodules of the thing being traced` thrown from `TestPostTrainingDynamic.test_quantized_rnn` by preserving modules in returned qRNNBase (i.e. by partially reverting https://github.com/pytorch/pytorch/pull/38134 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/38432 Differential Revision: D21567333 Pulled By: malfet fbshipit-source-id: 364fa2c8fc6e400b4f2e425b922a977756aec1d8	2020-05-14 08:27:29 -07:00
James Reed	41572116f6	Dont store redundant packed params in dynamic quantized RNN (#38134 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38134 Test Plan: Imported from OSS Reviewed By: ailzhang Differential Revision: D21479289 Pulled By: jamesr66a fbshipit-source-id: 11d9ad034396ce75c5a93d1f7ebca587205089ee	2020-05-08 13:52:52 -07:00
James Reed	c1e7758b5e	Back out "Revert D20229168: [quantization] Use torchbind for Linear PackedParams" (#38101 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38101 Original commit changeset: 29e8a4d3b8bf ghstack-source-id: 103730417 Test Plan: waitforsadcastle Differential Revision: D21471381 fbshipit-source-id: a922cdf31ba32021e7264ae1454c646c0bfd7ef4	2020-05-08 10:53:06 -07:00
Nikita Shulga	4bc0a7f86a	Revert D20229168: [quantization] Use torchbind for Linear PackedParams Test Plan: revert-hammer Differential Revision: D20229168 Original commit changeset: 3607cac9aa5b fbshipit-source-id: 29e8a4d3b8bffd95ff6a58b46c4f1c1e23770304	2020-05-07 19:47:45 -07:00
James Reed	eaf9b28c55	[quantization] Use torchbind for Linear PackedParams (#34140 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34140 Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D20229168 Pulled By: jamesr66a fbshipit-source-id: 3607cac9aa5b4b044572329742baed03350491c6	2020-05-07 19:03:44 -07:00
Raghuraman Krishnamoorthi	34bf868ebc	Fix weight quantization in RNNs (#35961 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35961 Weight quantization was done incorrectly for LSTMs, the statistics for all weights (across layers) were combined in the observer. This meant that weights for later layers in a LSTM would use sub-optimal scales impacting accuracy. The problem gets worse as the number of layers increases. ghstack-source-id: 103511725 Test Plan: Will be updated Differential Revision: D20842145 fbshipit-source-id: a622b012d393e0755970531583950b44f1964413	2020-05-05 16:40:16 -07:00
James Reed	fd4a09ea73	[WIP] Bind in CellParams for RNN (#35787 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35787 Test Plan: Imported from OSS Differential Revision: D20784118 Pulled By: jamesr66a fbshipit-source-id: 5d8f7e1502f707bff9a9aefa90e3edfb3429549b	2020-04-28 21:47:06 -07:00
Kjell Schubert	a2e059cfa6	add missing 'import warnings' (#35313 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35313 The intention of D16955662 was to print a warning when a single-layer LSTM has an (ignored) dropout specified. I ran into this warning with one of our models, but instead of a warning I got "name 'warnings' is not defined". The linter could have called out that problem on the original diff, not sure why it didn't. Test Plan: Before this diff JITing a particular model in f176977725 yielded "name 'warnings' is not defined". After this diff f176980937 gets past that point (failing in an unrelated downstream workflow). Reviewed By: jianyuh Differential Revision: D20611822 fbshipit-source-id: 99d90f4830f3b15ddbf1e2146e2cc014ef26c2ab	2020-04-13 08:41:44 -07:00
Zafar Takhirov	463f7920bd	repr and _*state_dict for qRNN (#31540 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31540 Fixes #31468 Test Plan: Imported from OSS Differential Revision: D19205894 Pulled By: z-a-f fbshipit-source-id: 80c36f74aa20a125ea8d74a54e9905576f1bc6d7	2020-03-19 20:49:50 -07:00
James Reed	8a17dc65af	[quantization] Make FP16 RNN use new prepack op (#34339 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34339 Test Plan: Imported from OSS Differential Revision: D20297194 Pulled By: jamesr66a fbshipit-source-id: 8bf6d0f2cb047e90bbdd184aaad337b143040d10	2020-03-07 10:04:01 -08:00
Elias Ellison	fddf73250d	[JIT] fix resolving of functions in torch/functional. fix compilation of torch.stft (#33504 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33504 Fix resolution fo functions that are bound onto torch in torch/functional.py. This does not fix compilation of all of those functions, those will be done in follow ups. Does torch.stft as a start. Fixes #21478 Test Plan: Imported from OSS Differential Revision: D20014591 Pulled By: eellison fbshipit-source-id: bb362f1b5479adbb890e72a54111ef716679d127	2020-02-26 18:35:43 -08:00
James Reed	a3cdb7eca3	Fix default instantation of dynamic quantized LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31433 Test Plan: Imported from OSS Differential Revision: D19164539 Pulled By: jamesr66a fbshipit-source-id: 7045817ab3dfb530c4480a10523c4c6bcdbfc7eb	2019-12-18 16:59:00 -08:00
James Reed	20fb8a814c	PackedSequence support for quantized LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29585 Test Plan: Imported from OSS Differential Revision: D18436569 Pulled By: jamesr66a fbshipit-source-id: 0f32c0fcc897894e30d8e7ff203392c1a961ce60	2019-11-12 20:13:38 -08:00
James Reed	821f8bfc2f	Fix tracing for dynamic quantized LSTM (#29331 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331 Closes #27954 This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance: 1) Each variadic parameter on the module has its own Module. That Module defines the `__getstate__` and __setstate__` method s.t. packed weights are properly re-done on model load. 2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (`9c43b16df9/torch/csrc/jit/tracer.cpp (L285)`) can see these parameters and create a `Value*` for them in the traced graph. 3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` does its business. The `forward` of the traced LSTM before and after this change are as follows: Before ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: hx, hx0, = argument_2 _0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_0, (_1, _2)) ``` After ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: _0 = self.cell._all_weight_values _1 = getattr(_0, "0").param _2 = getattr(_0, "1").param hx, hx0, = argument_2 _3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_3, (_4, _5)) ``` Test Plan: Imported from OSS Differential Revision: D18374904 Pulled By: jamesr66a fbshipit-source-id: f1a9b58998bc365b9baad38c21fd4bb510dd639c	2019-11-07 13:45:39 -08:00
Mike Ruberry	84a6583ba1	Revert D18359880: Fix tracing for dynamic quantized LSTM Test Plan: revert-hammer Differential Revision: D18359880 Original commit changeset: 0ff2cad294a1 fbshipit-source-id: 834cd43b39fb754f90c8b18b8ab9b837f2b511ab	2019-11-06 21:10:33 -08:00
James Reed	f17e02fd94	Fix tracing for dynamic quantized LSTM (#29331 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29331 Closes #27954 This fixes the hard-coding of packed parameter values for the dynamic quantized LSTM by orchestrating the following dance: 1) Each variadic parameter on the module has its own Module. That Module defines the `__getstate__` and __setstate__` method s.t. packed weights are properly re-done on model load. 2) Each of these modules is wrapped into a `torch.nn.ModuleList`, s.t. the parameters appear as attributes in the hierarchy. Then, `gatherParametersAndBuffers` (`9c43b16df9/torch/csrc/jit/tracer.cpp (L285)`) can see these parameters and create a `Value*` for them in the traced graph. 3) In forward, we need to convert from ModuleList -> Module -> Parameter to a simple TensorList of the parameters. We just use a loop here. In tracing, we simply record a `ListConstruct` with each of the proper parameter values. In scripting, the `ModuleList` is const, so it can be unrolled into the graph and a subsequent `ListConstruct` does its business. The `forward` of the traced LSTM before and after this change are as follows: Before ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: hx, hx0, = argument_2 _0, _1, _2 = torch.quantized_lstm(input, [hx, hx0], [CONSTANTS.c0, CONSTANTS.c1], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_0, (_1, _2)) ``` After ``` def forward(self, input: Tensor, argument_2: Tuple[Tensor, Tensor]) -> Tuple[Tensor, Tuple[Tensor, Tensor]]: _0 = self.cell._all_weight_values _1 = getattr(_0, "0").param _2 = getattr(_0, "1").param hx, hx0, = argument_2 _3, _4, _5 = torch.quantized_lstm(input, [hx, hx0], [_1, _2], True, 1, 0., True, False, False, dtype=12, use_dynamic=True) return (_3, (_4, _5)) ``` Test Plan: Imported from OSS Differential Revision: D18359880 Pulled By: jamesr66a fbshipit-source-id: 0ff2cad294a1871123015dfc704eaf73a7ac1d9e	2019-11-06 17:02:12 -08:00
Zafar Takhirov	dc8785a022	Refactoing names for consistency Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27670 Test Plan: Imported from OSS Differential Revision: D17846269 Pulled By: z-a-f fbshipit-source-id: ed3c7441c185bf11b2e62879aa3ecbc654aa2d4e	2019-10-16 12:18:26 -07:00
Michael Suo	341262754f	module dedupe (#26666 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26666 Changes: - Introduce a `ConcreteModuleType` concept. This acts both as the key into the type cache, and as the source of truth for `ModuleValue::attr` queries. It needs to do both jobs because that's how we ensure correctness (if the types are different, it's because `ModuleValue::attr` would return different things). - Now `recursive_script` will first construct a `ConcreteModuleType` and search for a pre-existing type before starting compilation. - All previous paths to creating a `ScriptModule` (including inheriting from `ScriptModule`) are now rewritten to go through `create_script_module`, so that we have only a single place where construction happens. Behavioral changes: - Big change to `torch.jit.ScriptModule` inheritance: all attributes are now recursively scripted if possible, matching recursive scripting semantics. This makes it hard to keep something from being scripted (for example, a Python submodule). Possibly we'll need an `ignore()` type thing for attributes. In particular, this adds `self.training` to every ScriptModule, since it's present on every `nn.Module`. - I believe this change to be transparent to existing users of the inheritance API, since if you had an attribute that is unscriptable that you never used, there is no error. In some cases, we will create new attributes (even if they are unused), which will increase serialized model size from before. Test Plan: Imported from OSS Differential Revision: D17551196 Pulled By: suo fbshipit-source-id: b476d1c9feb3ddfd63406d90989aaf9dfe890591	2019-10-12 09:51:57 -07:00
James Reed	4d7bec5f3e	Improve repr for quantized modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27008 Test Plan: Imported from OSS Differential Revision: D17649174 Pulled By: jamesr66a fbshipit-source-id: e3e6c4bb31e1ad8ed1ebe27f803f90d564ecfe53	2019-09-28 15:15:14 -07:00
Dmytro Dzhulgakov	128a65e2e0	Use noop observer to pass dtype for dynamic quantization (#26709 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26709 Polishes implementation from #25975. Primarily, we use NoopObserver to communicate that weights need to be quantized to float16. The very top-level API (quantize_dynamic) stays the same with `dtype` argument but the implementation follows the common flow. One can argue that dynamic fp16 quantization doesn't really fit into the 'observer' mechanism. It's in fact not ideal, but it's better to have the same flow than branching on both dtype and qconfig. Test Plan: Imported from OSS Differential Revision: D17544103 Pulled By: dzhulgakov fbshipit-source-id: 6af3f18c35929a1a53ea734079c005f656e4925f	2019-09-24 09:24:39 -07:00
Jerry Zhang	254122dd4e	quantize_linear -> quantize_per_tensor (#26574 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26574 Since we also have `quantized::linear`, `quantize_linear` sounds confusing, so we plan to rename it before the branch cut Test Plan: ci Imported from OSS Differential Revision: D17514876 fbshipit-source-id: 01d9005e6ec8cb9950b9d8bba122109c389641d3	2019-09-20 21:58:48 -07:00
Jianyu Huang	f433ee1499	Add the FP16 weight support for LSTM in dynamic_quantize (#25975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25975 We would like to add the FP16 weight support for the dynamic quantized LSTM. Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm794.ftw3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details Building: finished in 13.4 sec (100%) 8134/8134 jobs, 81 updated Total time: 13.9 sec Trace available for this run at /tmp/testpilot.20190910-210241.2092790.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision c86e65add357582accb6ec0be23b92c8a2c510bd fbpkg ca46e8f5b26c451a8b0b2462c11bb61d at Mon Sep 9 22:16:37 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/696/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 0.183 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 0.184s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/1125900050322971 Summary (total time 4.35s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D17299116 fbshipit-source-id: 7fe91ece25867f2c0496f1b63fb1041e6b815166	2019-09-19 22:19:22 -07:00
James Reed	bdc656da70	TorchScript Serialization for dynamic LSTM Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26084 Test Plan: Imported from OSS Differential Revision: D17339315 Pulled By: jamesr66a fbshipit-source-id: 03a2674edcf779becfe3b8ec96f1bae23c74b11c	2019-09-12 11:04:47 -07:00
James Reed	83ecdf76da	Revert "TorchScript Serialization for dynamic LSTM module" (#26079 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26079 This reverts commit `e3039612d8`. Test Plan: Imported from OSS Differential Revision: D17337585 Pulled By: jamesr66a fbshipit-source-id: 4b93a4c5ca2fe491d609da889a42d22be8e52889	2019-09-11 21:23:19 -07:00
James Reed	e3039612d8	TorchScript Serialization for dynamic LSTM module Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25877 Test Plan: Imported from OSS Reviewed By: jianyuh Differential Revision: D17275746 Pulled By: jamesr66a fbshipit-source-id: db2f38ddd99f02ccb4fb754fa1c1e6cad4425fa8	2019-09-11 19:17:25 -07:00
Supriya Rao	9d2d31e626	Store bias in PackedLinearWeight struct in fbgemm (#25428 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25428 Added bias as an optional param to the quantized_linear_prepack function. Bias is quantized during runtime using input scale and weight scale. ghstack-source-id: 89601399 Test Plan: python test/run_test.py --exclude nn --verbose --bring-to-front quantization quantized quantized_tensor quantized_nn_mods quantizer Differential Revision: D17121304 fbshipit-source-id: 8adb0e55e4aed0a5430aaa2c8639c8ad1639c85a	2019-09-06 08:37:34 -07:00
Supriya Rao	61819260f7	Rename FBGEMM quantized operators to generic quantized ops (#25678 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25678 As an effort to unify fbgemm and qnnpack at the dispatcher level, we need to have a generic name for the quantized backed ops. Currently FBGEMM is guarded by the USE_FBGEMM macro and QNNPACK uses USE_QNNPACK. ghstack-source-id: 89518961 Test Plan: buck test caffe2/test:quantized Differential Revision: D17194364 fbshipit-source-id: 5960aedff6b8cb89eb3872c39b74caf54c0fbf20	2019-09-05 10:13:08 -07:00
Jianyu Huang	0483d537ab	Add the dynamic quantized LSTM module (#25157 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25157 Add the dynamic quantized LSTM module. TODO (separate PRs): - Serialization. - Bias can be Null. ghstack-source-id: 89443731 Test Plan: buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_quantization\.PostTrainingDynamicQuantTest$' --print-passing-details ``` [jianyuhuang@devvm2816.prn3.facebook.com: ~/fbsource/fbcode/caffe2/test] $ buck test mode/dev caffe2/test:quantization -- 'test_quantized_rnn $test_q uantization\.PostTrainingDynamicQuantTest$' --print-passing-details Action graph will be rebuilt because files have been added or removed. Parsing buck files: finished in 1.4 sec Building: finished in 4.0 sec (100%) 8122/8122 jobs, 2 updated Total time: 5.5 sec Trace available for this run at /tmp/testpilot.20190902-164918.1275502.log TestPilot test runner for Facebook. See https://fburl.com/testpilot for details. Testpilot build revision b61bc0e3b71033578eddfe0a28b0739bc685663f fbpkg 3b1c1aed1c534c0cb161a981eca6e2f0 at Sun Sep 1 20:58:52 2019 by twsvcscm from /usr/local/fbprojects/packages/testinfra.testpilot/690/t.par Discovering tests Running 1 tests Started new test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 ✓ caffe2/test:quantization - test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) 1.048 1/1 (passed) Test output: > test_quantized_rnn (test_quantization.PostTrainingDynamicQuantTest) ... ok > > ---------------------------------------------------------------------- > Ran 1 test in 1.049s > > OK Finished test run: https://our.intern.facebook.com/intern/testinfra/testrun/2251799823877227 Summary (total time 5.53s): PASS: 1 FAIL: 0 SKIP: 0 FATAL: 0 TIMEOUT: 0 OMIT: 0 ``` Differential Revision: D16955662 fbshipit-source-id: 61cf1a74913105fa02e44b3941813eabac0006b5	2019-09-03 19:18:28 -07:00

42 Commits