Commit Graph

35 Commits

Author SHA1 Message Date
Junjie Bai
246f5c412e Revert "Tensor construction codemod(raw_mutable_data) (#16373)" (#18680)
Summary:
This reverts commit d73c830e23.

We have observed significant perf drop when training ResNext101 with multiple amd GPUs:

Before:
https://ci.pytorch.org/jenkins/job/caffe2-builds/job/py2-clang7-rocmdeb-ubuntu16.04-bench/1636/console
2 GPUs ResNext training got 150~160 imgs/sec
4 GPUs ResNext training got 270~280 imgs/sec

After:
https://ci.pytorch.org/jenkins/job/caffe2-builds/job/py2-clang7-rocmdeb-ubuntu16.04-bench/1637/console
Both 2 and 4 GPU ResNext training runs drop to 110~120 imgs/sec

Similar perf drops are seen on ResNet50 training jobs as well.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18680

Differential Revision: D14702941

Pulled By: bddppq

fbshipit-source-id: 828141805afc23f25c08d4a2eb6d4b99f817c128
2019-04-01 14:39:13 -07:00
Jerry Zhang
d73c830e23 Tensor construction codemod(raw_mutable_data) (#16373)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16373

motivation: https://github.com/pytorch/pytorch/pull/12407
This is a manual diff.
Most of the fixes follow this pattern:

```
auto* Y = Output(0);
Y->Resize(dims);
Y->raw_mutable_data(dtype);
```
becomes:
```
auto* Y = Output(0, dims, at::dtype(dtype));
```
But there might be other cases.

Reviewed By: dzhulgakov

Differential Revision: D13725460

fbshipit-source-id: 649a4b0e42f62cda1a60171dd9fa3e440dc9dca1
2019-03-29 18:36:46 -07:00
Sebastian Messmer
f6fda4409b refactor caffe2 operator constructors - 10/9 (#17659)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17659

clangr codemod

Reviewed By: ezyang

Differential Revision: D14304675

fbshipit-source-id: 45fbd84c50651a70ae29bf46df3322715e99d225
2019-03-06 15:11:47 -08:00
Jerry Zhang
ae91156e5d Tensor method rename dims()->sizes() - 1/2
Summary: Codemod generated with clangr shard mode, 25 files per diff.

Reviewed By: BIT-silence

Differential Revision: D13581782

fbshipit-source-id: b16b4198e100617769d84aa599bf141117cfbe5b
2019-01-04 16:02:22 -08:00
Jerry Zhang
fb8487d708 Tensor construction codemod(ResizeLike) - 3/7 (#15122)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15122

Codemod generated with clangr shard mode, 25 files per diff.
Motivation: https://github.com/pytorch/pytorch/pull/12407

Reviewed By: dzhulgakov

Differential Revision: D13419643

fbshipit-source-id: 65b5a037b94d458b944d51f790ba2829db1fb530
2018-12-14 02:08:37 -08:00
Jerry Zhang
1c2ed4eb23 Tensor construction: combine Resize+mutable_data - 1/4 (#13942)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13942

Codemod generated with clangr shard mode, 25 files per diff.
Motivation: https://github.com/pytorch/pytorch/pull/12407

Reviewed By: smessmer

Differential Revision: D13054770

fbshipit-source-id: a9e86e5dfcb4f7cebf5243e1d359fad064561bed
2018-11-19 15:33:50 -08:00
Junjie Bai
fbd50bbfb9 Revert D13007246: [codemod][caffe2] Tensor construction: combine Resize+mutable_data - 1/4
Differential Revision:
D13007246

Original commit changeset: 230de42a3843

fbshipit-source-id: 40ce266826f00d320f7215169188ef4ead232660
2018-11-13 16:41:52 -08:00
Jerry Zhang
9d36c37bdb Tensor construction: combine Resize+mutable_data - 1/4 (#13853)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13853

Codemod generated with clangr shard mode, 25 files per diff.
Motivation: https://github.com/pytorch/pytorch/pull/12407

Reviewed By: smessmer

Differential Revision: D13007246

fbshipit-source-id: 230de42a3843d71599e812d5511f52f3af47f59b
2018-11-13 12:26:02 -08:00
Jerry Zhang
3c32f897ca Rename ndim() -> dim() - 3/6
Summary:
Codemod generated with clangr shard mode, 50 files per diff.
clangr code (ndim() -> dim()): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: dzhulgakov

Differential Revision: D12935748

fbshipit-source-id: fccec04e28ec049789f772e70d691382cb8927e0
2018-11-05 23:21:40 -08:00
Jerry Zhang
e7242cbaf2 Rename dim(i) -> size(i) - 1/2
Summary:
Codemod generated with clangr shard mode, 50 files per diff.
clangr code (dim -> size): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: ezyang

Differential Revision: D12896712

fbshipit-source-id: 909731691fab7799efbcfc3b5dcc9e531831c2d4
2018-11-05 07:27:04 -08:00
Jerry Zhang
edd902594a Renaming meta() to dtype() - 1/2 (#13333)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13333

Codemod generated with clangr shard mode, 50 files per diff.
clangr code (meta -> dtype): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: ezyang

Differential Revision: D12845168

fbshipit-source-id: 492091963d2211ea80215200e981965767566135
2018-10-31 17:14:08 -07:00
Jerry Zhang
dcbca53e58 Renaming size() to numel() - 1/6
Summary: Codemod generated with clangr shard mode, 50 files per diff

Reviewed By: li-roy

Differential Revision: D10866373

fbshipit-source-id: 589194164d4fea93b74d83fa7fc4c59558c41f4a
2018-10-29 11:11:19 -07:00
Jerry Zhang
07c0f4a097 Tensor dims() -> sizes() (caffe2/operators) - 1/5 (#13028)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13028

Codemod generated with clangr shard mode, 25 files per diff, for renaming dims() to sizes()

Reviewed By: ezyang

Differential Revision: D10476220

fbshipit-source-id: 3c3b3d5e2082cd6a1f0ff4a3c8641b30e6f16896
2018-10-24 14:18:18 -07:00
Christian Puhrsch
a6630e25af Remove many caffe2::TIndex and replace them with int64_t (#11943)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11943

See title

Reviewed By: ezyang

Differential Revision: D9992645

fbshipit-source-id: e8f80d6ea762971513e5e8072975ceea53e1f11a
2018-09-22 18:11:04 -07:00
Jerry Zhang
aebf3b47ae Remove template parameter from Tensor (#9939)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/9939

Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13

Pull Request resolved: https://github.com/pytorch/translate/pull/166

Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125

Closes https://github.com/pytorch/pytorch/pull/9125

Use inheritance for polymorphism, and remove the template parameter.
This changes the templating at call sites; the core implementations will change later.

Previously, the Caffe2 Tensor class was compile-time bound to a particular device/context. With this change, we make the device a runtime property (stored inside the tensor) while preserving the same semantics. For example, one must still specify a device type to create a Tensor; there are no uninitialized tensors. More specifically, the changes are:

1. We added an extra *DeviceType* argument to most Tensor constructors, e.g. Tensor(DeviceType type).
2. The semantics of the constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context) have changed: the second context is passed in so that we can call the templated Copy function. Previously it could be on a different device than the source and target; now we enforce that the context, if provided, has the same device type as src.
3. To preserve the 'get-or-construct' semantics of Blob, we added a specialized getter, Blob::GetMutableTensor, which verifies both that the Blob contains a Tensor and that it is of the correct type.
4. Notably, the Tensor type is no longer default-constructible (since there are no unknown-device tensors), so some of the code handling STL containers needs to change.

Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s.
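The "no uninitialized tensors" rule described above can be sketched as a plain-Python analogy (the `Tensor` and `DeviceType` names below are hypothetical stand-ins for illustration, not the actual Caffe2 C++ API):

```python
# Illustrative analogy only: device type as a required runtime property,
# mirroring "there are no uninitialized tensors". Hypothetical names.

class DeviceType:
    CPU = "cpu"
    CUDA = "cuda"

class Tensor:
    def __init__(self, device_type):
        # No default construction: a tensor must know its device up front.
        if device_type not in (DeviceType.CPU, DeviceType.CUDA):
            raise ValueError("a concrete device type is required")
        self.device_type = device_type

t = Tensor(DeviceType.CPU)
print(t.device_type)  # "cpu"
```

The design choice is that device dispatch moves from a compile-time template parameter to a value stored in the object, so one Tensor type can flow through device-agnostic code.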

Reviewed By: ezyang, houseroad

Differential Revision: D9024330

fbshipit-source-id: e0b8295d2dc6ebe2963383ded5af799ad17164ba
2018-07-27 10:56:39 -07:00
Jerry Zhang
969b62f276 Revert D8121878: Remove template parameter from Tensor
Differential Revision:
D8121878

Original commit changeset: 4a5e9a677ba4

fbshipit-source-id: d8e2c0bb145b52fbcca323b22d1d3346f0b3249e
2018-07-26 14:02:04 -07:00
Jerry Zhang
cd5adc7b5f Remove template parameter from Tensor (#13)
Summary:
Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13

Pull Request resolved: https://github.com/pytorch/translate/pull/166

Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125

Closes https://github.com/pytorch/pytorch/pull/9125

Use inheritance for polymorphism, and remove the template parameter.
This changes the templating at call sites; the core implementations will change later.

Previously, the Caffe2 Tensor class was compile-time bound to a particular device/context. With this change, we make the device a runtime property (stored inside the tensor) while preserving the same semantics. For example, one must still specify a device type to create a Tensor; there are no uninitialized tensors. More specifically, the changes are:

1. We added an extra *DeviceType* argument to most Tensor constructors, e.g. Tensor(DeviceType type).
2. The semantics of the constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context) have changed: the second context is passed in so that we can call the templated Copy function. Previously it could be on a different device than the source and target; now we enforce that the context, if provided, has the same device type as src.
3. To preserve the 'get-or-construct' semantics of Blob, we added a specialized getter, Blob::GetMutableTensor, which verifies both that the Blob contains a Tensor and that it is of the correct type.
4. Notably, the Tensor type is no longer default-constructible (since there are no unknown-device tensors), so some of the code handling STL containers needs to change.

Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s.

Reviewed By: xw285cornell

Differential Revision: D8121878

fbshipit-source-id: 4a5e9a677ba4ac82095df959851a054c81eccf81
2018-07-26 10:25:23 -07:00
Nathan Inkawhich
38dbe6e605 Updates to caffe2 operator documentation (#7917)
* Significant updates to the operator docs in prep for merge
2018-05-29 14:38:56 -07:00
Orion Reblitz-Richardson
1d5780d42c Remove Apache headers from source.
* LICENSE file contains details, so removing from individual source files.
2018-03-27 13:10:18 -07:00
Orion Reblitz-Richardson
b0d09dd8d7 Cleanup operator docs for catalog generation.
Summary:
* We likely need tests for this so that bad formatting can't be added in the future, but this cleans all operators so we at least have good examples.
* Formatting between our internal Facebook operator catalog and the external caffe2.ai catalog is still slightly different. We'll work on this.
Closes https://github.com/caffe2/caffe2/pull/1846

Reviewed By: pjh5

Differential Revision: D6848570

Pulled By: orionr

fbshipit-source-id: b9bc0bfccb243d0440bd7b2406858cad8dc37e92
2018-02-02 16:36:05 -08:00
James Cross
96c6212513 repeat sequence mask for data dims
Summary: Allow the application of sequence-length masking to be replicated along one or more minor axes. See task for details.

Reviewed By: jamesr66a

Differential Revision: D6090835

fbshipit-source-id: 9064232aa9b93246c582b6e0bae73be5dbe09e98
2017-10-18 18:08:08 -07:00
Uthsav Chitra
964d740ede adding batch support to SequenceMaskOps
Summary:
Added support for batching to SequenceMaskOp.

Let b be the batch dim and k be the axis dim (we enforce that b < k). Write the dimensions of the input tensor as [a_1, ..., a_b, ..., a_k, ..., a_n]. We first collapse the tensor down to 3D, with dimensions [P, Q, D], where P = a_1 * ... * a_b, Q = a_{b+1} * ... * a_{k-1}, and D = a_k * a_{k+1} * ... * a_n. Then we mask each slice [i, :, :] of this 3D tensor (each slice is a Q x D matrix).
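The dimension bookkeeping above can be sketched in plain Python (a sketch only; the function name is hypothetical, and b and k follow the 1-indexed convention of the summary):

```python
from math import prod

def collapse_to_3d(dims, b, k):
    """Collapse dims [a_1, ..., a_n] to (P, Q, D) around batch dim b and
    axis dim k (both 1-indexed, b < k), per the scheme described above:
    P = a_1*...*a_b, Q = a_{b+1}*...*a_{k-1}, D = a_k*...*a_n."""
    assert b < k
    P = prod(dims[:b])
    Q = prod(dims[b:k - 1])
    D = prod(dims[k - 1:])
    return P, Q, D

# A 4-D tensor [2, 3, 4, 5] with b=1, k=3 collapses to (2, 3, 20);
# the total element count (120) is preserved.
print(collapse_to_3d([2, 3, 4, 5], 1, 3))  # (2, 3, 20)
```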

Reviewed By: jamesr66a

Differential Revision: D5733382

fbshipit-source-id: e7a314d9fe6e6691a75112edbee8ba6e8ea8e396
2017-10-04 15:18:09 -07:00
Yangqing Jia
8286ce1e3a Re-license to Apache
Summary: Closes https://github.com/caffe2/caffe2/pull/1260

Differential Revision: D5906739

Pulled By: Yangqing

fbshipit-source-id: e482ba9ba60b5337d9165f28f7ec68d4518a0902
2017-09-28 16:22:00 -07:00
Yangqing Jia
0f0829d88e Strict bound check for SequenceFunctor
Summary:
This exposes a problem in NMT training where some out-of-bounds data appears to
have been silently written past the buffer, causing random segfaults elsewhere in the
code. This change does not itself fix the problem, but it will surface the out-of-bounds
accesses so that we can fix them.

Differential Revision: D5832646

fbshipit-source-id: 5eb259e4584e5341ef3f19362f98f0a9554e9aec
2017-09-14 01:30:58 -07:00
Yangqing Jia
01a1cf1e07 small fix for pointer initialization.
Summary: A bit safer, and also suppresses a compiler warning.

Reviewed By: bddppq

Differential Revision: D5803080

fbshipit-source-id: d8c782c936a8fdaded4ae209b212378e78606ffb
2017-09-11 01:41:35 -07:00
Luke Yeager
1cf94854a4 fp16: SequenceMask
Summary:
Was https://github.com/caffe2/caffe2/pull/1151
Closes https://github.com/caffe2/caffe2/pull/1178

Reviewed By: bddppq

Differential Revision: D5794641

Pulled By: akyrola

fbshipit-source-id: c3bd99dde74317280a65af7cc7a36a6a734822f6
2017-09-09 13:02:38 -07:00
Jingfei Du
b3029df1d0 Added window mode for caffe2 sequence operator
Summary: This can be used for local attention to mask elements outside of a window
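A minimal plain-Python sketch of the window idea (assumptions: a 2-D score matrix, one window center per row, and masked entries overwritten with a fill value; all names here are hypothetical, not the operator's actual signature):

```python
def window_mask(scores, centers, radius, fill=float("-inf")):
    """Mask entries of each row that fall outside [center - radius, center + radius],
    as in local attention, where only a window around each position is visible."""
    out = []
    for row, c in zip(scores, centers):
        out.append([v if c - radius <= j <= c + radius else fill
                    for j, v in enumerate(row)])
    return out

scores = [[1, 2, 3, 4], [5, 6, 7, 8]]
# Row 0 keeps columns 0-1 (window around 0); row 1 keeps columns 2-3.
print(window_mask(scores, centers=[0, 3], radius=1, fill=0))
# [[1, 2, 0, 0], [0, 0, 7, 8]]
```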

Reviewed By: jamesr66a

Differential Revision: D5643677

fbshipit-source-id: 92b33866258ccc7307d5bcf08234610aa3fb152d
2017-08-16 21:34:29 -07:00
James Reed
a985355935 Gradient for SequenceMaskOp
Summary: Implement backward pass for a SequenceMaskOp to replace https://github.com/caffe2/caffe2/blob/master/caffe2/python/attention.py#L54-L72.
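The backward pass of a masking op is simple: masked entries were overwritten with a constant, which contributes no gradient, so gradients flow through unmasked positions and are zero elsewhere. A plain-Python sketch (hypothetical names; the kept positions are given as a boolean matrix):

```python
def sequence_mask_grad(grad_output, keep):
    """Gradient of a masking op that overwrote masked entries with a constant:
    pass the upstream gradient through where keep is True, zero elsewhere."""
    return [[g if k else 0.0 for g, k in zip(grow, krow)]
            for grow, krow in zip(grad_output, keep)]

g = [[0.1, 0.2], [0.3, 0.4]]
keep = [[True, False], [False, True]]
print(sequence_mask_grad(g, keep))  # [[0.1, 0.0], [0.0, 0.4]]
```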

Reviewed By: akyrola

Differential Revision: D5618373

fbshipit-source-id: b831fa69f51d9468c858961f922564159e12b46f
2017-08-12 14:34:29 -07:00
James Reed
0a828768e9 Implement SequenceMaskOp forward pass
Summary:
Implement forward pass for a SequenceMaskOp to replace https://github.com/caffe2/caffe2/blob/master/caffe2/python/attention.py#L54-L72.

This implements two modes: a sequence-length based mode and a matrix triangle mode.
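The two modes can be sketched in plain Python (a sketch under assumed semantics, not the operator's actual signature; `fill` plays the role of the masked-out value):

```python
def sequence_length_mask(scores, seq_lens, fill=float("-inf")):
    """Sequence-length mode: mask columns at or beyond each row's length."""
    return [[v if j < n else fill for j, v in enumerate(row)]
            for row, n in zip(scores, seq_lens)]

def upper_triangle_mask(scores, fill=float("-inf")):
    """Matrix-triangle mode: mask strictly upper-triangular entries
    (column > row), as used for causal attention."""
    return [[v if j <= i else fill for j, v in enumerate(row)]
            for i, row in enumerate(scores)]

s = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(sequence_length_mask(s, [3, 1, 2], fill=0))  # [[1, 2, 3], [4, 0, 0], [7, 8, 0]]
print(upper_triangle_mask(s, fill=0))              # [[1, 0, 0], [4, 5, 0], [7, 8, 9]]
```

In attention code the fill value is typically -inf, so that masked positions receive zero weight after a softmax.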

Reviewed By: akyrola

Differential Revision: D5615493

fbshipit-source-id: a2ce4a8e655d9b720049010a7856be052c5567eb
2017-08-12 14:34:28 -07:00
Junjie Bai
efe2d01a3e Fix some bugs in CPU version of BooleanMask and add GPU version
Reviewed By: akyrola

Differential Revision: D5397208

fbshipit-source-id: 0314cc181e315f3b6cda846292b2e2ea73bb015b
2017-07-21 11:38:49 -07:00
Kun Han
2ec2d23f88 booleanmask support indices sorting
Summary: BooleanMask now supports an additional output containing the sorted indices of the selected elements.
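A plain-Python sketch of the idea (the function name is hypothetical, and the real operator works on tensors rather than lists):

```python
def boolean_mask_with_indices(data, mask):
    """Return the elements of data selected by mask, plus a second output:
    the indices of those elements, in ascending (sorted) order."""
    values = [d for d, m in zip(data, mask) if m]
    indices = [i for i, m in enumerate(mask) if m]
    return values, indices

print(boolean_mask_with_indices([10, 20, 30, 40], [True, False, True, True]))
# ([10, 30, 40], [0, 2, 3])
```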

Differential Revision: D4984255

fbshipit-source-id: becb10d7fe989bb2f6488c901766a45369613eb7
2017-06-06 13:32:51 -07:00
Shenxiu Liu
dd923cf052 Unmask operator in Caffe2
Summary:
A CPU implementation of the Unmask operator in Caffe2.
There was also a small bug in the mask operator, which is fixed as well.
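One plausible reading of "unmask", sketched in plain Python: given several (mask, values) pairs whose masks are complementary and together cover every position, scatter the values back into a single sequence. This is a hedged reconstruction from the summary, not the operator's documented contract:

```python
def boolean_unmask(pairs):
    """pairs: list of (mask, values), where exactly one mask is True at each
    position. Scatters each values list back to its masked positions."""
    length = len(pairs[0][0])
    out = [None] * length
    for mask, values in pairs:
        it = iter(values)
        for i, m in enumerate(mask):
            if m:
                out[i] = next(it)
    return out

mask_a = [True, False, False, True]
mask_b = [False, True, True, False]
# The two masks are complementary, so every position is filled exactly once.
print(boolean_unmask([(mask_a, [1, 4]), (mask_b, [2, 3])]))  # [1, 2, 3, 4]
```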

Reviewed By: ender-wieczorek

Differential Revision: D4896351

fbshipit-source-id: 887d1beb66fe93ea2da1c4e165fce2e026907726
2017-04-16 11:23:19 -07:00
Yangqing Jia
b23e51d467 chunky sync 2016-09-06 15:55:19 -07:00
Yangqing Jia
05512d1e10 sync 2016-08-10 11:02:15 -07:00
Yangqing Jia
bcea409c82 sync 2016-07-28 15:06:43 -07:00