pytorch/cudnn_persistent_rnn.rst at f2f285c240efa2743f54653b83c03cc236a1fb27 - pytorch - Carlos Sousa's Git

OSSForks/pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Natalia Gimelshein 134b5d62e8 don't copy weight gradients in rnn (#12600 )

Summary:
This PR gets rid of unnecessary copy of weight gradients in cudnn rnn. Also removes unnecessary check for  input size when deciding whether to use persistent rnn, and adds doc string explaining when persistent rnn can be used. cc ezyang
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12600

Differential Revision: D10359981

Pulled By: soumith

fbshipit-source-id: 0fce11b527d543fabf21e6e9213fb2879853d7fb

2018-10-12 13:34:10 -07:00

10 lines

310 B

ReStructuredText

Raw Blame History

 .. note::
     If the following conditions are satisfied:
 ) cudnn is enabled,
 ) input data is on the GPU
 ) input data has dtype ``torch.float16``
 ) V100 GPU is used,
 ) input data is not in ``PackedSequence`` format
     persistent algorithm can be selected to improve performance.