Commit Graph

199 Commits

Author SHA1 Message Date
Linus
7201edc0a5 Fix RNN class constructor signature (#115341)
Fixes #114617

Pull Request resolved: https://github.com/pytorch/pytorch/pull/115341
Approved by: https://github.com/mikaylagawarecki
2023-12-07 19:46:33 +00:00
zabboud
53e7de4b65 Issue 112599 - fix pydocstyle errors (#113177)
Fixes #112599

Fixed pydocstyle errors in the following files. The remaining errors relate to missing docstrings at the module level and on methods within each module, such as `forward()`, `reset_parameters`, and `__init__`.
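
As an illustration (a hedged sketch, not a diff from this PR), a D102 "Missing docstring in public method" error is typically resolved by adding a one-line docstring:

```python
# Stand-in class for illustration; the real fixes live in the listed modules.
class MaxPool2dSketch:
    def forward(self, input):
        """Run the forward pass."""  # a one-line docstring resolves D102
        return input
```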

pydocstyle torch/nn/modules/pooling.py --count
before: 49
after: 29

**remaining errors:**
```
torch/nn/modules/pooling.py:1 at module level:
        D100: Missing docstring in public module
torch/nn/modules/pooling.py:90 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:163 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:240 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:315 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:321 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:402 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:408 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:472 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:478 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:541 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:550 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:620 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:630 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:706 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:716 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:720 in public method `__setstate__`:
        D105: Missing docstring in magic method
torch/nn/modules/pooling.py:774 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:792 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:845 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pooling.py:863 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:925 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:979 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1026 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1068 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1111 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1150 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1189 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pooling.py:1228 in public method `forward`:
        D102: Missing docstring in public method
```

pydocstyle torch/nn/modules/upsampling.py --count
before: 14
after: 7

**remaining:**
```
torch/nn/modules/upsampling.py:1 at module level:
        D100: Missing docstring in public module
torch/nn/modules/upsampling.py:142 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/upsampling.py:156 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/upsampling.py:160 in public method `__setstate__`:
        D105: Missing docstring in magic method
torch/nn/modules/upsampling.py:166 in public method `extra_repr`:
        D102: Missing docstring in public method
torch/nn/modules/upsampling.py:216 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/upsampling.py:263 in public method `__init__`:
        D107: Missing docstring in __init__
```

pydocstyle torch/nn/modules/rnn.py --count
before: 47
after: 40

**remaining**
```
torch/nn/modules/rnn.py:1 at module level:
        D100: Missing docstring in public module
torch/nn/modules/rnn.py:59 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:160 in public method `__setattr__`:
        D105: Missing docstring in magic method
torch/nn/modules/rnn.py:225 in public method `reset_parameters`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:230 in public method `check_input`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:242 in public method `get_expected_hidden_size`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:256 in public method `check_hidden_size`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:272 in public method `check_forward_args`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:278 in public method `permute_hidden`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:284 in public method `extra_repr`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:305 in public method `__getstate__`:
        D105: Missing docstring in magic method
torch/nn/modules/rnn.py:313 in public method `__setstate__`:
        D105: Missing docstring in magic method
torch/nn/modules/rnn.py:355 in public method `all_weights`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:471 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:478 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:481 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:503 in public method `forward` (skipping F811):
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:762 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:768 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:771 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:774 in public method `get_expected_cell_size`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:786 in public method `check_forward_args`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:798 in public method `permute_hidden`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:809 in public method `forward` (skipping F811):
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:820 in public method `forward` (skipping F811):
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1030 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1036 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1039 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1046 in public method `forward` (skipping F811):
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1054 in public method `forward` (skipping F811):
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1123 in public class `RNNCellBase`:
        D101: Missing docstring in public class
torch/nn/modules/rnn.py:1134 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1152 in public method `extra_repr`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1160 in public method `reset_parameters`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1224 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1230 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1327 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1332 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/rnn.py:1422 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/rnn.py:1427 in public method `forward`:
        D102: Missing docstring in public method
```

pydocstyle torch/nn/modules/pixelshuffle.py --count
before: 13
after: 8

**remaining:**
```
torch/nn/modules/pixelshuffle.py:1 at module level:
        D100: Missing docstring in public module
torch/nn/modules/pixelshuffle.py:52 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pixelshuffle.py:56 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:59 in public method `extra_repr`:
        D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:105 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/pixelshuffle.py:109 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/pixelshuffle.py:112 in public method `extra_repr`:
        D102: Missing docstring in public method
```

pydocstyle torch/nn/modules/sparse.py --count
before: 14
after: 8

**remaining errors:**
```
torch/nn/modules/sparse.py:1 at module level:
        D100: Missing docstring in public module
torch/nn/modules/sparse.py:124 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/sparse.py:153 in public method `reset_parameters`:
        D102: Missing docstring in public method
torch/nn/modules/sparse.py:162 in public method `forward`:
        D102: Missing docstring in public method
torch/nn/modules/sparse.py:167 in public method `extra_repr`:
        D102: Missing docstring in public method
torch/nn/modules/sparse.py:320 in public method `__init__`:
        D107: Missing docstring in __init__
torch/nn/modules/sparse.py:350 in public method `reset_parameters`:
        D102: Missing docstring in public method
torch/nn/modules/sparse.py:396 in public method `extra_repr`:
        D102: Missing docstring in public method
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113177
Approved by: https://github.com/ezyang
2023-11-14 20:55:22 +00:00
Federico Galatolo
d118531733 Use \odot everywhere instead of mixing \odot and * for the Hadamard product (#111763)
This pull request addresses an inconsistency in the representation of the Hadamard product across PyTorch documentation. Currently, the notation varies among different modules:

- In `torch.nn.LSTM` documentation the Hadamard product is represented with $\odot$
- In `torch.nn.GRU` documentation the Hadamard product is represented with $*$
- In `torch.nn.LSTMCell` documentation the Hadamard product is represented with $*$
- In `torch.nn.GRUCell` documentation the Hadamard product is represented with $*$
- In `torch.ao.nn.quantized.dynamic.GRU` documentation the Hadamard product is represented with $*$

This PR proposes consistently representing the Hadamard product throughout the documentation to enhance clarity and align with established standards.
The notation $\odot$ will be uniformly adopted, following the convention in the [Deep Learning Book](https://www.deeplearningbook.org/contents/linear_algebra.html).

**Changes Made:**

- Modified `torch.nn.GRU` documentation to represent the Hadamard product with $\odot$
- Modified `torch.nn.LSTMCell` documentation to represent the Hadamard product with $\odot$
- Modified `torch.nn.GRUCell` documentation to represent the Hadamard product with $\odot$
- Modified `torch.ao.nn.quantized.dynamic.GRU` documentation to represent the Hadamard product with $\odot$
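
For illustration, with the adopted notation the LSTM cell and hidden-state updates read (a sketch of the standard formulation, not a verbatim quote of the docs):

$$c_t = f_t \odot c_{t-1} + i_t \odot g_t$$
$$h_t = o_t \odot \tanh(c_t)$$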

Pull Request resolved: https://github.com/pytorch/pytorch/pull/111763
Approved by: https://github.com/albanD
2023-10-22 21:01:35 +00:00
FFFrog
003c5bb156 Add checks to num_layers for RNN, LSTM, GRU (#108853)
Fixes #108223

As the title shows.
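
A minimal sketch of the kind of validation added (hypothetical helper; the real checks live in the RNN base constructor):

```python
# Hypothetical sketch, not the exact PyTorch code.
def check_num_layers(num_layers) -> None:
    if not isinstance(num_layers, int):
        raise TypeError(f"num_layers should be of type int, got: {type(num_layers).__name__}")
    if num_layers <= 0:
        raise ValueError(f"num_layers must be greater than zero, got: {num_layers}")
```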

Pull Request resolved: https://github.com/pytorch/pytorch/pull/108853
Approved by: https://github.com/mikaylagawarecki
2023-09-09 19:33:52 +00:00
hasteinmetz
b535ed2c1a Update to RNN documentation (issue #106085) (#106222)
Addresses [issue #106085](https://github.com/pytorch/pytorch/issues/106085).

In `torch/nn/modules/rnn.py`:
- Adds a documentation string to the RNNBase class.
- Adds parameters to the __init__ methods of the RNN, LSTM, and GRU classes.
- Adds type annotations to the __init__ methods of RNN, LSTM, and GRU.

In `torch/ao/nn/quantized/dynamic/modules/rnn.py`:
- Adds type specifications to `_FLOAT_MODULE` attributes in RNNBase, RNN, LSTM, and GRU classes.
> This resolves a `mypy` assignment error `Incompatible types in assignment (expression has type "Type[LSTM]", base class "RNNBase" defined the type as "Type[RNNBase]")` that seemed to result from the fully specified type annotations in `torch/nn/modules/rnn.py`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106222
Approved by: https://github.com/mikaylagawarecki
2023-08-31 00:50:32 +00:00
Aaron Gokaslan
660e8060ad [BE]: Update ruff to 0.285 (#107519)
This updates ruff to 0.285, which is faster, better, and fixes a bunch of false negatives with regard to f-strings.

I also enabled RUF017, which looks for accidental quadratic list summation. Luckily, there seem to be no instances of it in our codebase, so I am enabling the rule to keep it that way. :)
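
A small example of the pattern RUF017 guards against (illustrative, not from the codebase):

```python
import itertools

lists = [[1, 2], [3], [4, 5]]

# Accidental quadratic: each addition copies the accumulated list.
flat_slow = sum(lists, [])

# Linear-time alternative:
flat_fast = list(itertools.chain.from_iterable(lists))
assert flat_slow == flat_fast
```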

Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519
Approved by: https://github.com/ezyang
2023-08-22 23:16:38 +00:00
PyTorch MergeBot
d59a6864fb Revert "[BE]: Update ruff to 0.285 (#107519)"
This reverts commit 88ab3e4322.

Reverted https://github.com/pytorch/pytorch/pull/107519 on behalf of https://github.com/ZainRizvi due to Sorry, but this PR breaks internal tests. @ezyang, can you please help them get unblocked? It seems like one of the strings was probably accidentally modified ([comment](https://github.com/pytorch/pytorch/pull/107519#issuecomment-1688833480))
2023-08-22 19:53:32 +00:00
Aaron Gokaslan
88ab3e4322 [BE]: Update ruff to 0.285 (#107519)
This updates ruff to 0.285, which is faster, better, and fixes a bunch of false negatives with regard to f-strings.

I also enabled RUF017, which looks for accidental quadratic list summation. Luckily, there seem to be no instances of it in our codebase, so I am enabling the rule to keep it that way. :)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519
Approved by: https://github.com/ezyang
2023-08-20 01:36:18 +00:00
박종의 (PARK, Jongeui)
af0ed25ea8 Change >= in the GRU and the LSTM document to \ge (#107379)
Change >= in the GRU and the LSTM documents to \ge
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107379
Approved by: https://github.com/ezyang
2023-08-18 20:44:51 +00:00
FFFrog
2d2d43d9fb add more check on LSTMCell (#107380)
Just like #107223, the ``LSTMCell`` operator has the same problems as ``GRUCell``; this adds the corresponding checks and tests to fix them.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107380
Approved by: https://github.com/ezyang
2023-08-18 20:44:17 +00:00
FFFrog
a4229690e3 Add Some Checks about dim (#107223)
Fixes #106769

As mentioned in [GRUCell](https://pytorch.org/docs/stable/generated/torch.nn.GRUCell.html#grucell), `hidden` should have the same dimension as `input`, and the dimension should be either `1D` or `2D`.

Other aspects are already verified in C++, e.g. that the batch sizes of `input` and `hidden` match, that `input`'s dim 1 equals `input_size`, and that `hidden`'s dim 1 equals `hidden_size`.
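
A hedged sketch of the kind of check this adds (names and messages are illustrative, not the exact PyTorch code):

```python
import torch

def check_grucell_dims(input: torch.Tensor, hx: torch.Tensor) -> None:
    if input.dim() not in (1, 2):
        raise ValueError(f"GRUCell: expected input to be 1D or 2D, got {input.dim()}D instead")
    if hx.dim() != input.dim():
        raise ValueError(f"GRUCell: expected hidden to have the same dimension as input ({input.dim()}D), got {hx.dim()}D")
```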
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107223
Approved by: https://github.com/albanD
2023-08-16 22:03:31 +00:00
Danni Li
1a6f1d816d [Doc] Add proj_size < hidden_size in LSTM (#106364)
Summary:
Add parameter constraint: `proj_size` has to be smaller than `hidden_size` in RNNBase doc.

Ref:
ceea08a986/torch/nn/modules/rnn.py (L83)

ceea08a986/torch/nn/modules/rnn.py (L458)
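
Usage illustration of the documented constraint (standard nn.LSTM behavior):

```python
import torch.nn as nn

lstm = nn.LSTM(input_size=10, hidden_size=20, proj_size=5)  # OK: 5 < 20
# nn.LSTM(input_size=10, hidden_size=20, proj_size=20)      # rejected: proj_size must be < hidden_size
```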

Test Plan: Please see GitHub Actions.

Differential Revision: D47943365

Fix: #105628

Pull Request resolved: https://github.com/pytorch/pytorch/pull/106364
Approved by: https://github.com/mikaylagawarecki
2023-08-01 18:58:27 +00:00
chunyuan
cb6c3cbc91 inductor: enable weight prepack for LSTM (#103071)
- Enabled LSTM weight prepack in inductor.
- Added an mkldnn decomposition for LSTM that does not change across different `seq_lens`. With the previous decomposition, in the dynamic-shapes use case where `seq_lens` changes, the graph would differ.
- Extended several inductor utility functions to support `List[Tensor]` as input. Previously those functions only supported `Tensor` input.

**Update 2023-07-26:**
- https://github.com/pytorch/pytorch/pull/103851 moved CPU weight packing to after AOTAutograd. Updated this PR to follow the same approach (mainly in 3b207f7f1c (diff-6dffed1ade0ba3e887f9a4eafa3bfcec267ab2365b8adcb91bd391f49b3fd2e3)).
LSTM is decomposed into `aten.mkldnn_rnn_layer` by layer and by direction. The weight prepack is done at the `mkldnn_rnn_layer` level.
- Added a fix in the rnn `__getstate__` function in case we need to recompile an `LSTM` module.
When compiling the module, the weight tensors, which are the `named_parameters` of the module, are converted to `functional_tensor` here:
76fb72e24a/torch/nn/utils/stateless.py (L125-L128)
The forward function of LSTM will be called:
76fb72e24a/torch/_functorch/aot_autograd.py (L3379-L3381)
In the forward function, the `_flat_weights` are updated to be the same as the weights, thus becoming `functional_tensor`:
76fb72e24a/torch/nn/modules/rnn.py (L775-L778)
The weight tensors are converted back to the original tensors (which are no longer `functional_tensor`) before exiting the `_reparametrize_module` context here:
76fb72e24a/torch/nn/utils/stateless.py (L130-L142)
But since `_flat_weights` is not in the `named_parameters` of the module, it's still `functional_tensor` ([link of the parameters that will be converted to functional and reverted back](76fb72e24a/torch/_functorch/aot_autograd.py (L3695-L3698))).
At this moment, if we need to recompile the model, `deepcopy` will be called:
76fb72e24a/torch/_dynamo/utils.py (L915-L917)
And it will report `UnImplemented` since we have a `functional_tensor` (`_flat_weights`) and will trigger a graph break, which is not what we expect:
76fb72e24a/torch/_subclasses/meta_utils.py (L514)
Added a fix in `__getstate__` to update the `_flat_weights` if the weights have changed, to fix this issue. The fix is covered in the `test_lstm_packed` UT.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/103071
Approved by: https://github.com/jgong5, https://github.com/jansel
2023-07-28 13:54:32 +00:00
Aaron Gokaslan
6d43c89f37 [BE]: Update Ruff to 0.0.280 (#105724)
Removes unused loop values in Python dictionary iteration. Automated fix from Ruff master.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/105724
Approved by: https://github.com/ezyang, https://github.com/janeyx99
2023-07-22 23:03:34 +00:00
Justin Chu
4cc1745b13 [BE] f-stringify torch/ and scripts (#105538)
This PR is a follow-up in the pyupgrade series, converting more strings to f-strings using `flynt`.

- https://docs.python.org/3/reference/lexical_analysis.html#f-strings
- https://pypi.org/project/flynt/

Command used:

```
flynt torch/ -ll 120
flynt scripts/ -ll 120
flynt tools/ -ll 120
```

and excluded `collect_env.py`
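
An example of the kind of rewrite flynt performs (illustrative):

```python
name, count = "rnn", 3
before = "module %s has %d layers" % (name, count)
after = f"module {name} has {count} layers"
assert before == after
```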

Pull Request resolved: https://github.com/pytorch/pytorch/pull/105538
Approved by: https://github.com/ezyang, https://github.com/malfet
2023-07-21 19:35:24 +00:00
Justin Chu
79c5e33349 [BE] Enable ruff's UP rules and autoformat nn/ mps/ and torch/ (#105436)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105436
Approved by: https://github.com/malfet, https://github.com/albanD
2023-07-21 07:38:46 +00:00
Thomas Findlay
8399cf9bfe Rnn base hidden size type check (#105659)
Fixes #105631

Added a type and value check on `hidden_size` to align behaviour between GPU and CPU modes and alert users when the wrong type is supplied.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/105659
Approved by: https://github.com/albanD, https://github.com/mikaylagawarecki
2023-07-20 22:45:43 +00:00
Danni Li
b33d63d97b [BE] Use ValueError for input.dim check in torch.nn.modules (#105127)
Summary: Use ValueError for the input.dim check instead of AssertionError.
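
A hedged sketch of the pattern (illustrative helper, not the exact call sites):

```python
import torch

def check_input_dim(input: torch.Tensor, expected_dim: int) -> None:
    # An assert would be stripped under `python -O` and raises the less
    # informative AssertionError; an explicit ValueError is kept and clearer.
    if input.dim() != expected_dim:
        raise ValueError(f"expected {expected_dim}D input, got {input.dim()}D instead")
```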

Fix: #104839

Test Plan: Please see GitHub actions.

Differential Revision: D47427998

Pull Request resolved: https://github.com/pytorch/pytorch/pull/105127
Approved by: https://github.com/albanD, https://github.com/Skylion007
2023-07-13 23:20:46 +00:00
Mikayla Gawarecki
b93ed8164e Add non-recursive module.to_empty option (#104197)
Fixes https://github.com/pytorch/pytorch/issues/97049, related to https://github.com/pytorch/pytorch/issues/104187
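
Usage sketch of the new option (`recurse` is the keyword added to `Module.to_empty`; the model here is illustrative):

```python
import torch
import torch.nn as nn

with torch.device("meta"):
    model = nn.Sequential(nn.Linear(4, 4), nn.ReLU())

# Materialize only the module's own direct tensors, leaving submodules on meta:
model.to_empty(device="cpu", recurse=False)
```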

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104197
Approved by: https://github.com/albanD
2023-06-26 21:47:22 +00:00
chunyuan
7012600abe fix cpu autocast check in rnn (#100621)
https://github.com/pytorch/pytorch/pull/100100 added type checking, but `torch.is_autocast_enabled()` always returns `False` on CPU. This PR fixes the autocast check for CPU.
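
A hedged illustration of the pitfall (API behavior at the time of this PR):

```python
import torch

with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    # Historically reported only the CUDA autocast state, hence False here:
    cuda_flag = torch.is_autocast_enabled()
    # CPU autocast has its own query:
    cpu_flag = torch.is_autocast_cpu_enabled()  # True
```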

Pull Request resolved: https://github.com/pytorch/pytorch/pull/100621
Approved by: https://github.com/albanD
2023-05-09 04:34:18 +00:00
Danni Li
4a90deb137 [Doc] Add GRU new gate calculation difference (#100646)
Summary: Add a note on the difference in the calculation of the GRU new gate `n_t` between PyTorch and the original paper.
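
The difference being documented, sketched in the two formulations:

$$\text{PyTorch: } n_t = \tanh(W_{in} x_t + b_{in} + r_t \odot (W_{hn} h_{t-1} + b_{hn}))$$
$$\text{Original paper: } n_t = \tanh(W_{in} x_t + b_{in} + W_{hn} (r_t \odot h_{t-1}) + b_{hn})$$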

Fix: #99531

Test Plan: Please see GitHub pipelines.

Differential Revision: D45579790

Pull Request resolved: https://github.com/pytorch/pytorch/pull/100646
Approved by: https://github.com/mikaylagawarecki
2023-05-05 22:18:54 +00:00
TamirCohen
0221198790 Added Typechecking to input tensor in RNN (#100100)
The input tensor of the RNN forward must have the same type as the weights.
When passing a tensor of type long, the error is:
    `RuntimeError: expected scalar type Long but found Float`
which is misleading, because it suggests converting something to Long, while the correct solution is to convert the input to Float (the type of the weights).

The new error:
    `RuntimeError: input must have the type torch.float32, got type torch.int64`

is correct and more verbose.
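
A hedged sketch of the added check (the exact call site in torch/nn/modules/rnn.py may differ):

```python
import torch

def check_input_dtype(input: torch.Tensor, expected: torch.dtype) -> None:
    if input.dtype != expected:
        raise RuntimeError(f"input must have the type {expected}, got type {input.dtype}")
```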

Fixes #99998

Pull Request resolved: https://github.com/pytorch/pytorch/pull/100100
Approved by: https://github.com/drisspg
2023-04-27 23:36:57 +00:00
Boris Fomitchev
96c745dfdc Fix int() casting in torch.nn.RNN to have correctly traced JIT and ONNX graph. (#92970)
Signed-off-by: Boris Fomitchev <bfomitchev@nvidia.com>

Fixes #91351

As for unit tests: in this PR I only fixed the LSTM unit test to properly use dynamic axes and to expose the export issue by running the test with the same ONNX model on additional inputs.
If the changes are approved, we should also fix the rest of the tests (RNN/GRU and beyond).

I have verified that the following updated tests pass with the new code and fail with the old code:
test/onnx/test_pytorch_onnx_onnxruntime.py::TestONNXRuntime_opset_version_14_is_script_False_keep_initializers_as_inputs_True::test_rnn_name_lstm_nonlinearity_None_unilayer_bidirectional_no_initial_state_with_variable_length_sequences_with_dropout
test/onnx/test_pytorch_onnx_onnxruntime.py::TestONNXRuntime_opset_version_14_is_script_False_keep_initializers_as_inputs_True::test_rnn_name_lstm_nonlinearity_None_unilayer_bidirectional_with_initial_state_with_variable_length_sequences_with_dropout

Pull Request resolved: https://github.com/pytorch/pytorch/pull/92970
Approved by: https://github.com/titaiwangms, https://github.com/kit1980
2023-03-15 05:33:41 +00:00
Xuehai Pan
ef731cdaf0 [2/3] Update .pyi Python stub files: Prettify rnn.py by using type annotated NamedTuple (#95267)
Changes:

- #95200

1. Recognize `.py.in` and `.pyi.in` files as Python in VS Code for a better development experience.
2. Fix deep setting merge in `tools/vscode_settings.py`.

- => this PR: #95267

3. Use `NamedTuple` rather than `namedtuple + __annotations__` for `torch.nn.utils.rnn.PackedSequence_`:

    `namedtuple + __annotations__`:

    ```python
    PackedSequence_ = namedtuple('PackedSequence_',
                                 ['data', 'batch_sizes', 'sorted_indices', 'unsorted_indices'])

    # type annotation for PackedSequence_ to make it compatible with TorchScript
    PackedSequence_.__annotations__ = {'data': torch.Tensor, 'batch_sizes': torch.Tensor,
                                       'sorted_indices': Optional[torch.Tensor],
                                       'unsorted_indices': Optional[torch.Tensor]}
    ```

    `NamedTuple`: Python 3.6+

    ```python
    class PackedSequence_(NamedTuple):
        data: torch.Tensor
        batch_sizes: torch.Tensor
        sorted_indices: Optional[torch.Tensor]
        unsorted_indices: Optional[torch.Tensor]
    ```

- #95268

4. Sort import statements and remove unnecessary imports in `.pyi`, `.pyi.in` files.
5. Format `.pyi`, `.pyi.in` files and remove unnecessary ellipsis `...` in type stubs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95267
Approved by: https://github.com/janeyx99
2023-03-01 19:37:23 +00:00
Aaron Gokaslan
67d9790985 [BE] Apply almost all remaining flake8-comprehension checks (#94676)
Applies the remaining flake8-comprehensions fixes and checks. This change replaces all remaining unnecessary generator expressions with list/dict/set comprehensions, which are more succinct, more performant, and better supported by our torch.jit compiler. It also removes useless generators such as `set(a for a in b)`, resolving them into just the set call.
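
Examples of the rewrites applied (illustrative):

```python
data = [1, 2, 2, 3]

# Unnecessary generator passed to set()...
s_before = set(x for x in data)
# ...rewritten as a set comprehension:
s_after = {x for x in data}
assert s_before == s_after
```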

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676
Approved by: https://github.com/ezyang
2023-02-12 01:01:25 +00:00
Xuehai Pan
5b1cedacde [BE] [2/3] Rewrite super() calls in functorch and torch (#94588)
Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied.

- #94587
- #94588
- #94592

Also, methods with only a `super()` call are removed:

```diff
class MyModule(nn.Module):
-   def __init__(self):
-       super().__init__()
-
    def forward(self, ...):
        ...
```

Some cases where the rewrite would change the semantics are kept unchanged, e.g.:

f152a79be9/caffe2/python/net_printer.py (L184-L190)

f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94588
Approved by: https://github.com/ezyang, https://github.com/albanD
2023-02-10 21:16:33 +00:00
Richard Zou
5d01277fea Deprecate torch.nn.utils.stateless.functional_call (#92280)
This PR:
- Updates the docs to say it is deprecated
- Raises a UserWarning
- Changes most of the callsites inside PyTorch to use
torch.func.functional_call, minus the test_stateless testing.

The motivation behind this is that we can now align behind a single
functional_call API in PyTorch.
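
Usage sketch of the API being aligned on (torch.func.functional_call is the real replacement; the module and parameters here are illustrative):

```python
import torch
import torch.nn as nn

model = nn.Linear(3, 2)
params = {name: torch.zeros_like(p) for name, p in model.named_parameters()}
x = torch.randn(1, 3)

# Run the module with the supplied parameters instead of its registered ones:
out = torch.func.functional_call(model, params, (x,))
assert torch.all(out == 0)
```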

Test Plan:
- existing tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92280
Approved by: https://github.com/albanD
2023-01-18 14:26:25 +00:00
joncrall
ad782ff7df Enable xdoctest runner in CI for real this time (#83816)
Builds on #83317 and enables running the doctests. Just need to figure out what is causing the failures.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/83816
Approved by: https://github.com/ezyang, https://github.com/malfet
2022-12-29 05:32:42 +00:00
Joel Schlosser
85e393bade Fix for RNN/LSTM/GRU modules to work with stateless.functional_call() (#91111)
Fixes #90500

The change here checks for parameter changes at the beginning of each `forward()` call; if the parameters are found to be different tensors than last time, `self._flat_weights` is re-initialized with the new values. Thus, swapping parameter values using `stateless.functional_call()` will re-initialize `self._flat_weights` during the `forward()` call, and the provided parameters will be used for module computation as expected.
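
A hedged sketch of the described check (hypothetical helper; `_flat_weights_names` is the real attribute, but the actual logic lives inside the module's `forward`):

```python
def maybe_refresh_flat_weights(module) -> None:
    current = [getattr(module, name, None) for name in module._flat_weights_names]
    # If any parameter tensor was swapped (e.g. by functional_call),
    # rebuild the cached flat-weights list from the new tensors.
    if any(a is not b for a, b in zip(module._flat_weights, current)):
        module._flat_weights = current
```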

NB: There are still some changes needed for symbolic shapes to work with `nn.GRU` (will address in a follow-up PR).

Pull Request resolved: https://github.com/pytorch/pytorch/pull/91111
Approved by: https://github.com/ezyang, https://github.com/albanD
2022-12-21 23:40:08 +00:00
jyx-su
70c46d32e2 Fix input dimension issue in RNN, LSTM, GRU error message (#87442)
Fixes #86576

Pull Request resolved: https://github.com/pytorch/pytorch/pull/87442
Approved by: https://github.com/albanD
2022-10-21 16:28:32 +00:00
joncrall
4618371da5 Integrate xdoctest - Rebased (#82797)
This is a new version of #15648 based on the latest master branch.

Unlike the previous PR where I fixed a lot of the doctests in addition to integrating xdoctest, I'm going to reduce the scope here. I'm simply going to integrate xdoctest, and then I'm going to mark all of the failing tests as "SKIP". This will let xdoctest run on the dashboards, provide some value, and still let the dashboards pass. I'll leave fixing the doctests themselves to another PR.

In my initial commit, I do the bare minimum to get something running with failing dashboards. The few tests that I marked as skip are causing segfaults. Running xdoctest results in 293 failed, 201 passed tests. The next commits will be to disable those tests. (unfortunately I don't have a tool that will insert the `#xdoctest: +SKIP` directive over every failing test, so I'm going to do this mostly manually.)

Fixes https://github.com/pytorch/pytorch/issues/71105

@ezyang
Pull Request resolved: https://github.com/pytorch/pytorch/pull/82797
Approved by: https://github.com/ezyang
2022-08-12 02:08:01 +00:00
PyTorch MergeBot
9db3c517de Add __all__ for torch.nn.modules, torch.distributed.elastic, torch.nn.utils submodules (#80240)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80240
Approved by: https://github.com/rohan-varma
2022-06-27 17:11:12 +00:00
Kirin Xiao
436396eddd fix GRU document string (#79380)
Fixes #78711

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79380
Approved by: https://github.com/albanD
2022-06-13 14:03:33 +00:00
Jeff Daily
de86146c61 rocblas alt impl during backward pass only (#71881)
In preparation for adopting future rocblas library options, it is necessary to track when the backward pass of training is executing. The scope-based helper class `BackwardPassGuard` is provided to toggle this state.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71881
Approved by: https://github.com/albanD
2022-05-18 19:42:58 +00:00
Yashika Jain
182b28fe65 Fix math notation for linear projections in nn.RNN docs (#77082)
Updating line number 316.

Fixes #76658

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77082
Approved by: https://github.com/jbschlosser
2022-05-10 16:35:08 +00:00
Clémentine Fourrier
fafa54a940 Update LSTM documentation
Makes explicit what the model outputs in h_n and c_n in the case of a bidirectional model.

The layout of h_n and c_n differs from that of output, and it can be confusing if not made explicit.
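
An illustration of the layout being clarified (standard nn.LSTM shapes; the exact doc wording is in the PR):

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=4, hidden_size=8, num_layers=2, bidirectional=True)
x = torch.randn(5, 3, 4)  # (seq_len, batch, input_size)
output, (h_n, c_n) = lstm(x)
print(output.shape)  # (5, 3, 16): both directions concatenated at each time step
print(h_n.shape)     # (4, 3, 8): (num_layers * num_directions, batch, hidden_size)
```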

Pull Request resolved: https://github.com/pytorch/pytorch/pull/74291
Approved by: https://github.com/jbschlosser
2022-03-17 16:48:32 +00:00
kshitij12345
b372be4211 [nn] lstm : no batch dim support (#71056)
Summary:
Reference: https://github.com/pytorch/pytorch/issues/60585

TODO:
* [x] Update docs
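
Usage sketch of the new unbatched support:

```python
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=4, hidden_size=8)
x = torch.randn(5, 4)  # (seq_len, input_size) -- no batch dimension
output, (h_n, c_n) = lstm(x)
print(output.shape)  # (5, 8)
```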

Pull Request resolved: https://github.com/pytorch/pytorch/pull/71056

Reviewed By: samdow

Differential Revision: D33638643

Pulled By: jbschlosser

fbshipit-source-id: c0949829de8a8e6e7b2873f459a8d7da597a3be3
(cherry picked from commit f94d5849f6)
2022-01-24 15:13:40 +00:00
kshitij12345
c8b897333c [rnn/gru] no batch dim (#70977)
Summary:
Reference https://github.com/pytorch/pytorch/issues/60585

Reland: https://github.com/pytorch/pytorch/pull/70442

Pull Request resolved: https://github.com/pytorch/pytorch/pull/70977

Reviewed By: dagitses, george-qi

Differential Revision: D33477256

Pulled By: jbschlosser

fbshipit-source-id: 2035c2d00b2f627c7046fd9b13c71b9360cd6fad
2022-01-07 13:14:41 -08:00
Michael Suo
a311cfa800 Revert D33460427: [pytorch][PR] [rnn/gru] : no batch dim
Test Plan: revert-hammer

Differential Revision:
D33460427 (6eba936082)

Original commit changeset: c64d9624c305

Original Phabricator Diff: D33460427 (6eba936082)

fbshipit-source-id: 9a5000e202c5f383b03dd6caad9399e46e4ce80e
2022-01-06 23:37:28 -08:00
kshitij12345
6eba936082 [rnn/gru] no batch dim (#70442)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/60585

TODO:
* [x] Doc updates

Pull Request resolved: https://github.com/pytorch/pytorch/pull/70442

Reviewed By: zou3519

Differential Revision: D33460427

Pulled By: jbschlosser

fbshipit-source-id: c64d9624c305d90570c79d11a28557f9ec667b27
2022-01-06 18:39:09 -08:00
Jake Tae
b7742b437a Allow RNN hidden_size to be 0 (#70556)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/56767.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/70556

Reviewed By: ngimel

Differential Revision: D33455156

Pulled By: jbschlosser

fbshipit-source-id: 5dc57b09d7beb6ae81dfabc318e87c109bb4e6ae
2022-01-06 14:18:36 -08:00
kshitij12345
2c67621a19 [rnn,gru,lstm]cell : no batch dim (#70236)
Summary:
Reference: https://github.com/pytorch/pytorch/issues/60585

Pull Request resolved: https://github.com/pytorch/pytorch/pull/70236

Reviewed By: mikaylagawarecki

Differential Revision: D33338774

Pulled By: jbschlosser

fbshipit-source-id: 7d8d00272e543b3e67060136b5d98a4baefbedd5
2021-12-29 09:27:32 -08:00
Nicolas Weber
529ebae0ac Bugfix for TorchScript RNN RELU and TANH (#61274)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/28418
Related https://github.com/pytorch/pytorch/issues/32976 but has already been fixed before.

TorchScript handling of GRU and LSTM has been working, but not of RNN (Tanh and ReLU). The reason is that ```Union[Tensor, PackedSequence]``` is not supported by TorchScript. Using ```torch._jit_internal._overload_method``` in ```RNNBase::forward``` does not work, as TorchScript does not seem to apply it correctly when the method is inherited by ```RNN```. My solution is to move ```RNNBase::forward``` to ```RNN``` and annotate it using ```torch._jit_internal._overload_method```. LSTM and GRU use their own ```forward``` methods anyway, so there seems to be no problem related to this fix.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/61274

Reviewed By: anjali411

Differential Revision: D32374452

Pulled By: malfet

fbshipit-source-id: 77bab2469c01c5dfa5eaab229429724a4172445d

Co-authored-by: Nikita Shulga <nshulga@fb.com>
2021-11-15 16:20:58 -08:00
Rodrigo Berriel
83878e19ff Improve LSTM documentation for proj_size > 0 (#65102)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/65053. Although the documentation states that:

fe0f9d1daf/torch/nn/modules/rnn.py (L500-L506)

It seems that the definition of `weight_ih_l[k]` could be improved by specifying what happens when `k > 0` and `proj_size > 0`. As `proj_size` is only used in LSTM, no changes are needed for the other RNNs.
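
An illustration of the resulting shapes (standard nn.LSTM behavior with `proj_size > 0`):

```python
import torch.nn as nn

lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=2, proj_size=5)
print(lstm.weight_ih_l0.shape)  # (80, 10): (4*hidden_size, input_size)
print(lstm.weight_ih_l1.shape)  # (80, 5): for k > 0, the input is the projected hidden state
print(lstm.weight_hh_l0.shape)  # (80, 5): (4*hidden_size, proj_size)
```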

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65102

Reviewed By: supriyar

Differential Revision: D30975781

Pulled By: jbschlosser

fbshipit-source-id: 12df06e5e6a8d5de0ad10fb15e33c3e6311c11d3
2021-09-16 06:35:27 -07:00
Thomas J. Fan
1c97c3e3a4 DOC Adds LSTM docs for defined variables when bidirectional=True (#60120)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/59332

Pull Request resolved: https://github.com/pytorch/pytorch/pull/60120

Reviewed By: gchanan

Differential Revision: D29240245

Pulled By: jbschlosser

fbshipit-source-id: acad9c24f41f7253a7d42cd940e54bb66e083ecf
2021-06-18 17:28:44 -07:00
Joel Schlosser
36a77580f5 [docs] Clarify batch_first behavior for nn.LSTM, nn.RNN, and nn.GRU (#58809)
Summary:
Fixes the high-pri doc component of https://github.com/pytorch/pytorch/issues/4145.

To make the input / output shapes more readable for both `batch_first` states, this PR also introduces short dim names. Opinions welcome on the readability of the restructured docs!

Screenshot for `nn.LSTM`:
<img width="791" alt="Screen Shot 2021-05-24 at 5 11 39 PM" src="https://user-images.githubusercontent.com/75754324/119408130-389e5300-bcb3-11eb-9a4f-1df96a0a4d70.png">

Pull Request resolved: https://github.com/pytorch/pytorch/pull/58809

Reviewed By: gchanan

Differential Revision: D28685415

Pulled By: jbschlosser

fbshipit-source-id: e8c92e3d7e052071a505b55dca976fd2ef5a8307
2021-05-25 15:27:17 -07:00
Hameer Abbasi
46e4b2dbda Convert assert -> cast. (#57458)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/55868.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/57458

Reviewed By: mruberry

Differential Revision: D28365745

Pulled By: walterddr

fbshipit-source-id: 35cc3fa85f87b0ef98cf970f620ab909d240c7be
2021-05-12 13:54:16 -07:00
Richard Zou
0787d781c5 Fix compatibility problem with LSTMs and torch.save (#57558)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57558

Fixes #53359

If someone directly saves an nn.LSTM in PyTorch 1.7 and then loads it in PyTorch
1.8, it errors out with the following:
```
(In PyTorch 1.7)
import torch
model = torch.nn.LSTM(2, 3)
torch.save(model, 'lstm17.pt')

(In PyTorch 1.8)
model = torch.load('lstm17.pt')
AttributeError: 'LSTM' object has no attribute 'proj_size'
```

Although we do not officially support this (directly saving modules via
torch.save), it used to work and the fix is very simple. This PR adds an
extra line to `__setstate__`: if the state we are passed does not have
a `proj_size` attribute, we assume it was saved from PyTorch 1.7 and
older and set `proj_size` equal to 0.
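
A hedged sketch of the fix (simplified; the real `__setstate__` in torch/nn/modules/rnn.py does more):

```python
def __setstate__(self, d):
    self.__dict__.update(d)
    if "proj_size" not in d:
        # State saved by PyTorch 1.7 and older predates projections.
        self.proj_size = 0
```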

Test Plan:
I wrote a test that tests `__setstate__`. But also,

Run the following:
```
(In PyTorch 1.7)
import torch
x = torch.ones(32, 5, 2)
model = torch.nn.LSTM(2, 3)
torch.save(model, 'lstm17.pt')
y17 = model(x)

(Using this PR)
model = torch.load('lstm17.pt')
x = torch.ones(32, 5, 2)
y18 = model(x)
```
and finally compare y17 and y18.

Reviewed By: mrshenli

Differential Revision: D28198477

Pulled By: zou3519

fbshipit-source-id: e107d1ebdda23a195a1c3574de32a444eeb16191
2021-05-05 07:36:13 -07:00
Joel Schlosser
febff45900 Support factory kwargs in torch.nn modules (#54508)
Summary:
Continuation of https://github.com/pytorch/pytorch/pull/53144
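
Usage sketch of the feature (`device`/`dtype` are the factory kwargs accepted by torch.nn module constructors after this change):

```python
import torch
import torch.nn as nn

layer = nn.Linear(3, 2, device="cpu", dtype=torch.float64)
print(layer.weight.dtype)  # torch.float64
```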

Pull Request resolved: https://github.com/pytorch/pytorch/pull/54508

Reviewed By: albanD

Differential Revision: D27939544

Pulled By: jbschlosser

fbshipit-source-id: 4bf517e5f74f093e27ca38a85e732da65e44d805
2021-04-22 16:16:53 -07:00
Joel Schlosser
12b2bc94d7 Revert D27909732: [pytorch][PR] Support factory kwargs in torch.nn modules
Test Plan: revert-hammer

Differential Revision:
D27909732 (5a09def9b0)

Original commit changeset: d8684b2403ab

fbshipit-source-id: d00d69fae4fa4ed58d9e97e70b27a06a0dcb39e4
2021-04-21 13:44:03 -07:00