pytorch/caffe2/queue
Junjie Bai 246f5c412e Revert "Tensor construction codemod(raw_mutable_data) (#16373)" (#18680)
Summary:
This reverts commit d73c830e23.

We have observed a significant perf drop when training ResNext101 with multiple AMD GPUs:

Before:
https://ci.pytorch.org/jenkins/job/caffe2-builds/job/py2-clang7-rocmdeb-ubuntu16.04-bench/1636/console
2 GPUs ResNext training got 150~160 imgs/sec
4 GPUs ResNext training got 270~280 imgs/sec

After:
https://ci.pytorch.org/jenkins/job/caffe2-builds/job/py2-clang7-rocmdeb-ubuntu16.04-bench/1637/console
Both 2 and 4 GPUs ResNext training drop to 110~120 imgs/sec

Similar perf drops are seen on ResNet50 training jobs as well.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18680

Differential Revision: D14702941

Pulled By: bddppq

fbshipit-source-id: 828141805afc23f25c08d4a2eb6d4b99f817c128
2019-04-01 14:39:13 -07:00
blobs_queue_db.cc Add/fallback some operators for mkl-dnn (#11696) 2019-01-11 12:53:06 -08:00
blobs_queue_db.h Remove template parameter from Tensor (#9939) 2018-07-27 10:56:39 -07:00
blobs_queue.cc fix -Wsign-compare warnings for some files inside c2 (#18123) 2019-03-19 10:39:20 -07:00
blobs_queue.h build changes to make cpu unified build working. (#10504) 2018-08-15 17:22:36 -07:00
CMakeLists.txt Change hip filename extension to .hip (#14036) 2018-11-16 11:55:59 -08:00
queue_ops_gpu.cc Eanble python tests on ROCM (#9616) 2018-07-24 11:37:58 -07:00
queue_ops.cc Remove Apache headers from source. 2018-03-27 13:10:18 -07:00
queue_ops.h Tensor construction codemod - 2/3 (#14836) 2018-12-10 19:30:56 -08:00
rebatching_queue_ops.cc Remove Apache headers from source. 2018-03-27 13:10:18 -07:00
rebatching_queue_ops.h Remove template parameter from Tensor (#9939) 2018-07-27 10:56:39 -07:00
rebatching_queue.cc Revert "Tensor construction codemod(raw_mutable_data) (#16373)" (#18680) 2019-04-01 14:39:13 -07:00
rebatching_queue.h Remove Apache headers from source. 2018-03-27 13:10:18 -07:00