pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
XiaobingSuper	324c6aada1	BFloat16: enable prepacked weights's inference (#48922 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48922 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D25537188 Pulled By: VitalyFedyunin fbshipit-source-id: ab6eb1ba8cffb5ba9d00d05db8ef616628f8c932	2021-02-17 11:20:00 -08:00
Xiang Gao	20ac736200	Remove py2 compatible future imports (#44735 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44735 Reviewed By: mruberry Differential Revision: D23731306 Pulled By: ezyang fbshipit-source-id: 0ba009a99e475ddbe22981be8ac636f8a1c8b02f	2020-09-16 12:55:57 -07:00
Zhang, Xiaobing	2b8db35c7e	[reland][DNNL]:enable batchnorm3d (#40995 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40995 Test Plan: Imported from OSS Differential Revision: D22440765 Pulled By: VitalyFedyunin fbshipit-source-id: b4bf427bbb7010ee234a54e81ade371627f9e82c	2020-07-15 13:56:47 -07:00
Zhang, Xiaobing	b48ee175e6	[reland][DNNL]:enable conv3d (#40691 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40691 Test Plan: Imported from OSS Differential Revision: D22296548 Pulled By: VitalyFedyunin fbshipit-source-id: 8e2a7cf14e8bdfa2f29b735a89e8c83f6119e68d	2020-07-15 13:54:41 -07:00
Yanli Zhao	016cf7d66e	Revert D22102408: DNNL: enable conv3d Test Plan: revert-hammer Differential Revision: D22102408 Original commit changeset: 1e95cede429f fbshipit-source-id: a20b725164177e8571320007548a58cc4779d669	2020-06-22 15:41:51 -07:00
Yanli Zhao	17fe0e2b8a	Revert D22102407: DNNL: enable batchnorm3d Test Plan: revert-hammer Differential Revision: D22102407 Original commit changeset: c9dbb61d0538 fbshipit-source-id: d40976aa8120d2d0839624bf02c082d7d1eb610d	2020-06-22 15:39:37 -07:00
Zhang, Xiaobing	8df35fd755	DNNL: enable batchnorm3d (#35663 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35663 Test Plan: Imported from OSS Differential Revision: D22102407 Pulled By: VitalyFedyunin fbshipit-source-id: c9dbb61d0538ab9e1e76b2815564030b5f89d33e	2020-06-22 11:57:09 -07:00
Zhang, Xiaobing	6ba807cb43	DNNL: enable conv3d (#35662 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35662 Test Plan: Imported from OSS Differential Revision: D22102408 Pulled By: VitalyFedyunin fbshipit-source-id: 1e95cede429f1a950f26bc7052ab33d198857df3	2020-06-22 11:55:04 -07:00
Mingfei Ma	9ad14f6b43	cover nn.Conv1d in mkldnn model conversion logic (#38528 ) Summary: current `to_mkldnn` model conversion logic under `torch.utils.mkldnn` does not cover `nn.Conv1d`. This patch fills the gap, using similar logic to `nn.Conv2d`. The model conversion will remove unnecessary memory format reorders of input/output tensors and thus speedup the model. Pull Request resolved: https://github.com/pytorch/pytorch/pull/38528 Differential Revision: D21640325 Pulled By: albanD fbshipit-source-id: c3340153b5c524e020c097eb4b9e2ffcbde8896d	2020-05-19 13:04:18 -07:00
Dmytro Dzhulgakov	67608cc018	Fix MKLDNN conv2d 5d weight handling (#34115 ) Summary: Effectively backporting `c5c00c119f` before that PR lands The bug didn't manifesting itself earlier because MkldnnConv2d constructor didn't reorder the weights. So the issue was arising only on second serialization/deserialization. This also fixes the constructor to deliver better perf right away. Note, that I still serialize 5d tensor - it was the previous behavior, we have to handle it anyway and with https://github.com/pytorch/pytorch/issues/32422 the output of `mkldnn_reorder_conv2d_weight` will always be 4d. cc pinzhenx Pull Request resolved: https://github.com/pytorch/pytorch/pull/34115 Reviewed By: wanchaol Differential Revision: D20224685 Pulled By: dzhulgakov fbshipit-source-id: 24ca9227c4eb4c139096a64ae348808d7478d7dc	2020-03-04 11:26:38 -08:00
Michael Suo	341262754f	module dedupe (#26666 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26666 Changes: - Introduce a `ConcreteModuleType` concept. This acts both as the key into the type cache, and as the source of truth for `ModuleValue::attr` queries. It needs to do both jobs because that's how we ensure correctness (if the types are different, it's because `ModuleValue::attr` would return different things). - Now `recursive_script` will first construct a `ConcreteModuleType` and search for a pre-existing type before starting compilation. - All previous paths to creating a `ScriptModule` (including inheriting from `ScriptModule`) are now rewritten to go through `create_script_module`, so that we have only a single place where construction happens. Behavioral changes: - Big change to `torch.jit.ScriptModule` inheritance: all attributes are now recursively scripted if possible, matching recursive scripting semantics. This makes it hard to keep something from being scripted (for example, a Python submodule). Possibly we'll need an `ignore()` type thing for attributes. In particular, this adds `self.training` to every ScriptModule, since it's present on every `nn.Module`. - I believe this change to be transparent to existing users of the inheritance API, since if you had an attribute that is unscriptable that you never used, there is no error. In some cases, we will create new attributes (even if they are unused), which will increase serialized model size from before. Test Plan: Imported from OSS Differential Revision: D17551196 Pulled By: suo fbshipit-source-id: b476d1c9feb3ddfd63406d90989aaf9dfe890591	2019-10-12 09:51:57 -07:00
davidriazati	0046092178	Reduce special casing around 'training' (#27109 ) Summary: Most of this was old cruft left over from special handling of `training` before we had a `bool` type. This makes all modules have a `training` attribute that is true by default and removes all other special handling. Fixes #26884 ](https://our.intern.facebook.com/intern/diff/17728129/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/27109 Pulled By: driazati Differential Revision: D17728129 fbshipit-source-id: 8ddc9fbb07a953dd05529538bfdd01ed88b5cb57	2019-10-07 13:52:59 -07:00
jiayisun	b9bf91feb8	Add torch.backends.mkldnn.enabled flag (#25459 ) Summary: This PR is about add torch.backends.mkldnn.enabled flag said in https://github.com/pytorch/pytorch/issues/25186 which can be used disable mkldnn at runtime step as torch.backends.cudnn.enabled. Pull Request resolved: https://github.com/pytorch/pytorch/pull/25459 Differential Revision: D17258926 Pulled By: ezyang fbshipit-source-id: e179ad364cc608fdaa7d0f37e2e762ceb5eda598	2019-09-11 12:09:40 -07:00
James Reed	6ba60ec9b0	Add flag to temporarily disable MKL-DNN conv (#23837 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23837 This is a temporary workaround to an issue in MKL-DNN's Convolution backwards implementation: https://github.com/pytorch/pytorch/issues/23825 It is only used to enable testing quantization Test Plan: Imported from OSS Differential Revision: D16659081 Pulled By: jamesr66a fbshipit-source-id: de18ebe98dec2a042f28b23373e20da2b44a42a2	2019-08-06 11:20:26 -07:00
Mingfei Ma	25f0dc3490	BERT CPU performance optimization: use mkldnn for nn.Linear() when input is dense layout (#21851 ) Summary: This PR aims at improving BERT performance on CPU by using `mkldnn` inner product for `nn.Linear()`. The current logic is to use `mkldnn` only when `input` tensor is of mkldnn layout. This PR loosens this condition, `mkldnn` will be used for `nn.Linear()` when `input` tensor is of dense layout. The aten tensor is viewed inplace in `mkldnn` without additional memory copy. 1. when `input.dim() >= 3` , it is viewed as 2d tensor. e.g. `[T, N, C]` is treated as `[TN, C]`; 2. when `input` is not contiguous, it is copied so as to be contiguous. `mkldnn` inner product can't handle non-contiguous memory. With this PR, BERT on `glue/MRPC` inference (batch size = 1) on Xeon 6148 single socket (20 cores@2.5GHz) improves by `44%`: 1. before (unit: iterations/sec): ```bash 408/408 [00:24<00:00, 16.69it/s] ``` 2. after (unit: iterations/sec): ```bash 408/408 [00:16<00:00, 24.06it/s] ``` The latency reduces from `59.92 ms` to `41.56ms` correspondingly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21851 Differential Revision: D16056334 Pulled By: dzhulgakov fbshipit-source-id: 9b70ed58323b5e2f3f4e3ebacc766a74a8b68a8a	2019-07-19 00:54:29 -07:00
Junjie Bai	63585c3b81	Add support for save and load mkldnn modules Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20799 Reviewed By: wanchaol Differential Revision: D15447891 fbshipit-source-id: e34de946c79282fb934a5c52ff1def41c7993c75	2019-05-23 12:51:50 -07:00
Junjie Bai	c9f380df02	Add aten mkldnn linear operator Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19210 Reviewed By: dzhulgakov Differential Revision: D14901641 fbshipit-source-id: 8fa68b9941fd93cea0f313a828cba34c5c81ae11	2019-04-26 13:41:57 -07:00
Junjie Bai	3445020ca3	Add aten mkldnn conv2d operator Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19204 Reviewed By: dzhulgakov Differential Revision: D14857513 fbshipit-source-id: 1172c9785e5a17a7d7360474551bdc7a511b3f2f	2019-04-26 13:41:41 -07:00

18 Commits