Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23837
This is a temporary workaround for an issue in MKL-DNN's convolution backward implementation: https://github.com/pytorch/pytorch/issues/23825
It is used only to unblock quantization testing.
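For context, a hedged sketch of how a test could route around the affected MKL-DNN backward kernel while the upstream issue is open (this assumes the `torch.backends.mkldnn.flags` context manager; the actual mechanism used by this PR may differ):
```python
import torch

# Hypothetical usage: disable MKL-DNN so convolution backward falls back to
# the native ATen implementation, avoiding the buggy kernel.
with torch.backends.mkldnn.flags(enabled=False):
    conv = torch.nn.Conv2d(3, 8, kernel_size=3)
    out = conv(torch.randn(1, 3, 32, 32, requires_grad=True))
    out.sum().backward()  # backward runs on the non-MKL-DNN path
```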
Test Plan: Imported from OSS
Differential Revision: D16659081
Pulled By: jamesr66a
fbshipit-source-id: de18ebe98dec2a042f28b23373e20da2b44a42a2
Summary:
This PR aims to improve BERT performance on CPU by using the `mkldnn` inner product for `nn.Linear()`.
The current logic uses `mkldnn` only when the `input` tensor has mkldnn layout. This PR loosens that condition: `mkldnn` is also used for `nn.Linear()` when the `input` tensor has dense layout. The ATen tensor is viewed in place as an `mkldnn` tensor, without an additional memory copy. Two cases are handled (see the sketch after this list):
1. when `input.dim() >= 3`, it is viewed as a 2d tensor, e.g. `[T, N, C]` is treated as `[TN, C]`;
2. when `input` is not contiguous, it is first copied to contiguous memory, since the `mkldnn` inner product can't handle non-contiguous memory.
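A minimal Python sketch of the logic above (a hypothetical standalone helper for illustration; the actual dispatch happens inside ATen's C++ `linear` path):
```python
import torch

def linear_via_2d_view(input, weight, bias=None):
    # Case 2: the mkldnn inner product needs contiguous memory,
    # so copy only if the input is non-contiguous.
    x = input.contiguous()
    # Case 1: collapse the leading dims, e.g. [T, N, C] -> [TN, C].
    leading = x.shape[:-1]
    x2d = x.view(-1, x.shape[-1])
    out = torch.nn.functional.linear(x2d, weight, bias)
    # Restore the leading dims: [TN, K] -> [T, N, K].
    return out.view(*leading, out.shape[-1])
```
Called with an `input` of shape `[T, N, C]` and a `weight` of shape `[K, C]`, this returns `[T, N, K]`, matching `nn.Linear` semantics.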
With this PR, BERT inference on `glue/MRPC` (batch size = 1) on a single Xeon 6148 socket (20 cores @ 2.5GHz) improves by `44%`:
1. before (unit: iterations/sec):
```bash
408/408 [00:24<00:00, 16.69it/s]
```
2. after (unit: iterations/sec):
```bash
408/408 [00:16<00:00, 24.06it/s]
```
Correspondingly, per-iteration latency drops from `59.92 ms` to `41.56 ms`.
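As a quick sanity check on these numbers (simple arithmetic, not taken from the PR itself):
```python
# Per-iteration latency is the inverse of throughput.
print(1000 / 16.69)       # ≈ 59.92 ms before
print(1000 / 24.06)       # ≈ 41.56 ms after
print(24.06 / 16.69 - 1)  # ≈ 0.44, i.e. the reported 44% speedup
```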
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21851
Differential Revision: D16056334
Pulled By: dzhulgakov
fbshipit-source-id: 9b70ed58323b5e2f3f4e3ebacc766a74a8b68a8a