pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Richard Barnes	6838ecefb6	Clean up some type annotations in torch/jit (#49939 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49939 Upgrades type annotations from Python2 to Python3 Test Plan: Sandcastle tests Reviewed By: xush6528 Differential Revision: D25717573 fbshipit-source-id: 7d5c98fafaa224e0504b73dc69b1e4a6410c0494	2021-01-06 16:39:57 -08:00
Nikita Shulga	b60ffcdfdd	Enable typechecks for torch.nn.quantized.modules.linear (#44154 ) Summary: Also import `Optional` directly from `typing` rather than from `_jit_internal` Pull Request resolved: https://github.com/pytorch/pytorch/pull/44154 Reviewed By: seemethere Differential Revision: D23511833 Pulled By: malfet fbshipit-source-id: f78c5fd679c002b218e4d287a9e56fa198171981	2020-09-03 19:52:49 -07:00
Meghan Lele	7816d53798	[JIT] Add mypy type annotations for JIT (#43862 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43862 Test Plan: Imported from OSS Reviewed By: eellison Differential Revision: D23491151 Pulled By: SplitInfinity fbshipit-source-id: 88367b89896cf409bb9ac3db7490d6779efdc3a4	2020-09-03 15:09:24 -07:00
Yanan Cao	bdcf320bed	Support custom exception message (#41907 ) Summary: Raise and assert used to have a hard-coded error message "Exception". User provided error message was ignored. This PR adds support to represent user's error message in TorchScript. This breaks backward compatibility because now we actually need to script the user's error message, which can potentially contain unscriptable expressions. Such programs can break when scripting, but saved models can still continue to work. Increased an op count in test_mobile_optimizer.py because now we need aten::format to form the actual exception message. This is built upon an WIP PR: https://github.com/pytorch/pytorch/pull/34112 by driazati Pull Request resolved: https://github.com/pytorch/pytorch/pull/41907 Reviewed By: ngimel Differential Revision: D22778301 Pulled By: gmagogsfm fbshipit-source-id: 2b94f0db4ae9fe70c4cd03f4048e519ea96323ad	2020-08-01 13:03:45 -07:00
Raghuraman Krishnamoorthi	3258cb61b1	Dynamic quantization support for LSTMCell, RNNCell and GRUCell [Remove randomness in weights] (#40102 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40102 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105997236 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell \(quantization\.test_quantize\.TestPostTrainingDynamic\)' Differential Revision: D22071017 fbshipit-source-id: 3fe1eac39db9c1e0566838eb8b969bbb1fa983c9	2020-06-16 21:29:50 -07:00
Raghuraman Krishnamoorthi	e55e0cb1a9	Revert D20978736: Dynamic quantization support for LSTMCell, RNNCell and GRUCell Test Plan: revert-hammer Differential Revision: D20978736 Original commit changeset: 8f303ba1d7f8 fbshipit-source-id: bcd300819616d6536f582fcd3c90decd543c4657	2020-06-16 10:11:32 -07:00
Raghuraman Krishnamoorthi	48db06e39a	Dynamic quantization support for LSTMCell, RNNCell and GRUCell (#37159 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37159 Enable dynamic quantization for LSTMCell, RNNCell and GRUCell ghstack-source-id: 105946183 (Note: this ignores all push blocking failures!) Test Plan: buck test caffe2/test:quantization -- 'test_quantized_rnn_cell \(quantization\.test_quantize\.TestPostTrainingDynamic\)' Differential Revision: D20978736 fbshipit-source-id: 8f303ba1d7f8e0c646ac73e862d2c1e735b7ff61	2020-06-16 09:14:59 -07:00
James Reed	c1e7758b5e	Back out "Revert D20229168: [quantization] Use torchbind for Linear PackedParams" (#38101 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38101 Original commit changeset: 29e8a4d3b8bf ghstack-source-id: 103730417 Test Plan: waitforsadcastle Differential Revision: D21471381 fbshipit-source-id: a922cdf31ba32021e7264ae1454c646c0bfd7ef4	2020-05-08 10:53:06 -07:00
Nikita Shulga	4bc0a7f86a	Revert D20229168: [quantization] Use torchbind for Linear PackedParams Test Plan: revert-hammer Differential Revision: D20229168 Original commit changeset: 3607cac9aa5b fbshipit-source-id: 29e8a4d3b8bffd95ff6a58b46c4f1c1e23770304	2020-05-07 19:47:45 -07:00
James Reed	eaf9b28c55	[quantization] Use torchbind for Linear PackedParams (#34140 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34140 Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D20229168 Pulled By: jamesr66a fbshipit-source-id: 3607cac9aa5b4b044572329742baed03350491c6	2020-05-07 19:03:44 -07:00
James Reed	fd4a09ea73	[WIP] Bind in CellParams for RNN (#35787 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35787 Test Plan: Imported from OSS Differential Revision: D20784118 Pulled By: jamesr66a fbshipit-source-id: 5d8f7e1502f707bff9a9aefa90e3edfb3429549b	2020-04-28 21:47:06 -07:00
Elias Ellison	fddf73250d	[JIT] fix resolving of functions in torch/functional. fix compilation of torch.stft (#33504 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33504 Fix resolution fo functions that are bound onto torch in torch/functional.py. This does not fix compilation of all of those functions, those will be done in follow ups. Does torch.stft as a start. Fixes #21478 Test Plan: Imported from OSS Differential Revision: D20014591 Pulled By: eellison fbshipit-source-id: bb362f1b5479adbb890e72a54111ef716679d127	2020-02-26 18:35:43 -08:00
davidriazati	53429680d5	Remove stray `@script` (#32235 ) Summary: This should be covered under recursive script now Pull Request resolved: https://github.com/pytorch/pytorch/pull/32235 Pulled By: driazati Differential Revision: D19414889 fbshipit-source-id: 85f8132401dbe44c9dbaef7c0350110f90eb9843	2020-01-17 19:22:09 -08:00
Igor Fedan	75309b45f3	explicitly provide memory format when calling to clone() at Indexing.cpp Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28660 Test Plan: Imported from OSS Differential Revision: D18333346 Pulled By: ifedan fbshipit-source-id: 06590205d883a5096388a4ae318389244130972d	2019-11-07 05:38:32 -08:00
Jianyu Huang	9e64c54c01	Add the warning message for API with linear modules (#28766 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28766 Add the warning message to explicitly ask the users to upgrade the deprecated `torch.jit.quantized` API to the new `torch.quantization.quantize_dynamic` API. ghstack-source-id: 92711620 Test Plan: CI Differential Revision: D18164903 fbshipit-source-id: e6aff2527f335c2d9f362e6856ce8597edb52aaa	2019-10-28 14:24:44 -07:00
Michael Suo	ffa422a8b3	kill _parameter_list (#27399 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27399 This was devised in a time when we didn't have module attributes. They are essentially just tensor lists, so represent them that way. This has the additional benefit of making the RNN forward pass faster because we effectively cache the flattened weights. The only complication part is that someone may come along and do: ``` my_rnn_mod.w_ih_l0 = torch.nn.Parameter(...) ``` This means we need to override setattr to keep the flattened weights cache up to date. Test Plan: Imported from OSS Differential Revision: D17785658 Pulled By: suo fbshipit-source-id: 7789cd1d0d4922bfd5eba1716976442fbf150766	2019-10-12 09:51:53 -07:00
Halil Akin	31960e8872	Add missing argument for failing function call (#26311 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26311 We are currently unable to deploy models due to D16955662 changing function signature of ```quantized_lstm(``` but the function call here (https://fburl.com/diffusion/e4wrmx83) not passing the newly added ```use_dynamic``` param. Here is the details of the error: P111215482 ``` E0916 12:36:16.423516 1149877 ExceptionTracer.cpp:214] exception stack complete terminate called after throwing an instance of 'torch::jit::script::ErrorReport' what(): Arguments for call are not valid. The following operator variants are available: aten::quantized_lstm(Tensor input, Tensor[] hx, Tensor[] params, bool has_biases, int num_layers, float dropout, bool train, bool bidirectional, bool batch_first, *, int? dtype=None) -> (Tensor, Tensor, Tensor): Keyword argument use_dynamic unknown. ``` This diff fixes that. Test Plan: Running quantization tests after. ```buck test mode/dev caffe2/test:jit -- 'test_quantization_modules \(test_jit\.TestScript\)'``` https://our.intern.facebook.com/intern/testinfra/testrun/5910974518872494 Also, currently building a package (language_technology.translation.jedi.scripts:35c3643) and testing this (f138747078). f138771702 Reviewed By: jhcross Differential Revision: D17404451 fbshipit-source-id: 390d2ce1ecbdd63a07a8f16c80e4c3ac25ab0a99	2019-09-16 17:04:14 -07:00
Pieter Noordhuis	14ab44f834	Fix flake8 issues in ./torch/jit Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24240 Differential Revision: D16783395 Pulled By: ezyang fbshipit-source-id: 8427b7cd7d0552820cbbf20ebfca86898f3f53f7	2019-08-13 11:50:02 -07:00
James Reed	442b3512d4	Simplified nnq.Linear class (#24046 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/24046 `nnq.Linear` was a confusing mess of buffers/attributes and Tensor/not tensor members. This PR reworks it to consistently have only Python attributes, with the conversions handled explicitly by state_dict or __{get,set}state__ methods (added in PRs further up the stack Test Plan: Imported from OSS Reviewed By: driazati Differential Revision: D16728345 Pulled By: jamesr66a fbshipit-source-id: 47468b776b428fca2409bb55c8b161afb68a3379	2019-08-09 17:14:48 -07:00
Jianyu Huang	2635b6262e	Remove K and N function arguments for fbgemm_pack_quantized_matrix (#22956 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22956 As Title says: remove the extra function arguments for better engineering. Differential Revision: D16297724 fbshipit-source-id: a31be17708d13508c4ce9a3ce7eb5238e8d17984	2019-08-07 08:50:13 -07:00
Jianyu Huang	78cc9b92a5	Change fbgemm_linear_{int8,fp16}_weight to fbgemm_linear_{int8,fp16}_weight_fp32_activation (#22955 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22955 Following the comment in https://github.com/pytorch/pytorch/pull/22891, change the fbgemm wrapper function name to indicate whether it is dynamic quantization or static quantization. Differential Revision: D16297512 fbshipit-source-id: 498678e2af27070628be11a6d724ce17c2a3cde5	2019-08-06 23:19:26 -07:00
Wanchao Liang	8fb0d198e9	make nn.LSTM accept PackedSequence instead of Tuples Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23643 Differential Revision: D16615531 fbshipit-source-id: af508838cac21d271d3470f0f16fd75473a6e68d	2019-08-05 17:16:18 -07:00
Mingzhe Li	29881c7f02	Fix LSTM int8 quantization model size issue (#23577 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23577 This diff is fixing a model size issue introduced in #23291. After that PR, the model size after in8 quantization is the same as that of the original unquantized model. The reason is that we save original weight for int8 quantization even when that's not needed anymore. This diff fixes that by only saving original weight for fp16 quantization path. Reviewed By: llyfacebook Differential Revision: D16557619 fbshipit-source-id: f924ae8d155a0d525b86a7440b3c7147d5bead0a	2019-08-02 13:38:30 -07:00
Mingzhe Li	b3980f46a2	Replace uint8 with int8 in Linear and LSTM quantization path (#23347 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23347 This diff replaces uint8 with int8 to match with the underlying kernel implementation. When we do int8 quantization, we are computing with uint8 (input activation) * int8 (weight) -> uint8 (output activation). The weight is quantized into int8. Reviewed By: jianyuh Differential Revision: D16469435 fbshipit-source-id: a697655b0e97833fc601e5980970aec4dba53c39	2019-07-24 22:25:12 -07:00
Mingzhe Li	8fdbe1e10b	Support LSTM with FP16 weight (#23291 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23291 This diff implements LSTM with FP16 weights based on FBGEMM. At a high level, here are the steps: 1. Quantize and pack weight in every layer of LSTM 2. Pass weights from step 1 to the ATen `quantized_lstm` function which does matrix multiplication with FP16 weight. The following code shows the dtype of each variable used in MM: Y = X * W + B (fp32, fp32, fp16, fp32) Reviewed By: jianyuh Differential Revision: D16389595 fbshipit-source-id: c26ae4e153c667a941f4af64e9d07fc251403cee	2019-07-24 12:40:11 -07:00
Lingyi Liu	1a93b96815	Revert `da315a4` Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22837 Differential Revision: D16239667 Pulled By: llyfacebook fbshipit-source-id: 1a625d78d633927129dd2791e65b333b3902f94f	2019-07-13 01:54:20 -07:00
Karl Ostmo	da315a4e2a	Revert D16037021: Support GRU module quantization in Pytorch Differential Revision: D16037021 Original commit changeset: 71145c67d869 fbshipit-source-id: 33cd2e57eba30ea33cc4f3116732a721c26f6efb	2019-07-12 21:05:34 -07:00
Lingyi Liu	d8c1b86135	Support GRU module quantization in Pytorch Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22498 Reviewed By: BIT-silence Differential Revision: D16037021 fbshipit-source-id: 71145c67d8696e525b686cd3313033e5b6771718	2019-07-12 18:31:08 -07:00
Mingzhe Li	9eb039334f	Use Linear Operator with fp16 weights in JIT (#22323 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22323 This diff adds an interface to use quantized Linear op in JIT. Reviewed By: jamesr66a Differential Revision: D16040724 fbshipit-source-id: 90e90aff9973c96ea076ed6a21ae02c349ee2bcf	2019-07-12 15:59:17 -07:00
Lingyi Liu	2c556a9489	fix the input/output type mismatch (#20829 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20829 as title Reviewed By: jamesr66a Differential Revision: D15461937 fbshipit-source-id: 02c7150c0e8d020030ae8898008f718c74850dca	2019-05-23 11:08:21 -07:00
Edward Yang	fdb923996d	Revert D15445092: Some minor fix to unblock the Bert model quantization Differential Revision: D15445092 Original commit changeset: 22da41a56ecb fbshipit-source-id: eca9a85900bf48fe6a9da5cfff61606a10f0c3de	2019-05-22 14:25:14 -07:00
Lingyi Liu	70ecddfd76	Some minor fix to unblock the Bert model quantization (#20787 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20787 Set requires_grad=False for bias: this will block the jit tracing. The as_type fix: The input tensor shape and output tensor shape will be different, which will trigger the assertion failure at https://fburl.com/0m8xy7tc. Reviewed By: jamesr66a Differential Revision: D15445092 fbshipit-source-id: 22da41a56ecb9ac092585d0cc1ff0658fb9d631b	2019-05-21 23:13:08 -07:00
James Cross	a4ae689636	quantize_rnn_modules in ensemble_export (#20365 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20365 Enable quantization of `torch.nn.LSTM` module in the decoder of PyTorch natively exported beam search models. Reviewed By: jmp84 Differential Revision: D15260631 fbshipit-source-id: bbdd3a30c2c2110986eb7aa7ff11ce1c9090ddf4	2019-05-10 15:33:41 -07:00
James Reed	2179d5b32b	Dynamic quantized full LSTM module (#20249 ) Summary: Previously we only had a Python wrapper for `torch.quantized_lstm_cell`. We had the op `torch.quantized_lstm`, but it didn't have a wrapper. This makes the wrapper Pull Request resolved: https://github.com/pytorch/pytorch/pull/20249 Reviewed By: driazati Differential Revision: D15250023 Pulled By: jamesr66a fbshipit-source-id: f05ad784d903e0ef3a62633c8bf80bad79de48ae	2019-05-08 19:46:59 -07:00
Edward Yang	173f224570	Turn on F401: Unused import warning. (#18598 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18598 ghimport-source-id: c74597e5e7437e94a43c163cee0639b20d0d0c6a Stack from [ghstack](https://github.com/ezyang/ghstack): * #18598 Turn on F401: Unused import warning. This was requested by someone at Facebook; this lint is turned on for Facebook by default. "Sure, why not." I had to noqa a number of imports in __init__. Hypothetically we're supposed to use __all__ in this case, but I was too lazy to fix it. Left for future work. Be careful! flake8-2 and flake8-3 behave differently with respect to import resolution for # type: comments. flake8-3 will report an import unused; flake8-2 will not. For now, I just noqa'd all these sites. All the changes were done by hand. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D14687478 fbshipit-source-id: 30d532381e914091aadfa0d2a5a89404819663e3	2019-03-30 09:01:17 -07:00
Elias Ellison	561037aef8	use flake8-mypy (#17721 ) Summary: Use flake8 installed with mypy checks so that our linter matches fbcode. Mypy type errors also provide valuable signal Pull Request resolved: https://github.com/pytorch/pytorch/pull/17721 Differential Revision: D14357778 Pulled By: eellison fbshipit-source-id: d8c9ea3fe3b5f550c3b70fe259e0eabf95e4c92d	2019-03-07 09:15:54 -08:00
Elias Ellison	c2be9f1487	Remove unneeded manual unwrap optionals (#16245 ) Summary: Remove calls to torch.jit._unwrap_optional that are no longer needed. The remaining instances would require control flow logic for exceptions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16245 Differential Revision: D13804292 Pulled By: eellison fbshipit-source-id: 08c5cbe4b956519be2333de5cf4e202488aff626	2019-01-24 15:48:01 -08:00
James Reed	7f1397acef	Quantized RNNCell modules (#15469 ) Summary: Similarly to https://github.com/pytorch/pytorch/pull/13777, we apply post-processing quantization to RNN cell modules (`RNNCell`, `LSTMCell`, and `GRUCell`). A further follow-up PR will involve quantizing the full `RNN`, `GRU`, and `LSTM` modules. This depends on those modules being scriptable as part of the standard library scripting effort, though. Note that infrastructure in this pr such as `gather_quantized_params` is currently unused but should be used in the future when we can port over the full RNN modules. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15469 Differential Revision: D13545802 Pulled By: jamesr66a fbshipit-source-id: ad3b694517842893ea619438e9f5e88fd7b96510	2019-01-15 10:40:51 -08:00
James Reed	acbd9c49b0	Direct FBGEMM integraton into ATen (#13777 ) Summary: This PR implements infrastructure for post-processing a model to apply int8 quantization to its `nn.Linear` modules. Highlights of the implementation: 1) Inputs and outputs are `float` (quantized and packed internally), but the weight is quantized and packed ahead of time for efficiency. This implementation performs well in small-batch size GEMM calls. It should not be considered a general-purpose quantized GEMM kernel. 2) Weight packing is dependent on machine architecture (e.g. vector register width), so it is done just-in-time. Concretely, it is done on model load for the weights and it is done during operator execution for the input value. 3) Biases are unquantized 4) We fail loudly if we are attempting to run this on a machine that does not support FBGEMM. This is because we do not want a model's numerics to differ based on which machine it is run on. A model containing these FBGEMM ops must be run with FBGEMM The API can be seen in the added test case. Highlights are: 1) `torch.jit.quantized.quantize_linear_modules` walks the module hierarchy of the passed-in Module and replaces all `nn.Linear` modules with a new `QuantizedLinear` module, which encapsulates the behavior described above. 2) `_pack()` and `_unpack()` script methods are present on `QuantizedLinear` modules. These methods should be called before serialization and after deserialization, respectively. This ensures that the weight matrix is properly packed for the running machine's architecture. Note that in the long term, we would like to move toward a more Pickle-style serialization technique, rather than having these explicit methods that mutate member values. This is blocked on being able to assign attributes in a ScriptMethod, among other things. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13777 Differential Revision: D13383276 Pulled By: jamesr66a fbshipit-source-id: 00f29c9f34544add2b90107e3cf55a287802c344	2018-12-21 10:35:51 -08:00

39 Commits