pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-08 07:39:33 +01:00

Author	SHA1	Message	Date
Edward Yang	65bb34d885	Remove TensorImpl::is_variable, deprecate Tensor::is_variable (#29653 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29653 I didn't remove is_variable from Tensor for BC reasons, but I did remove as many uses as I could from the codebase. at::impl::variable_excluded_from_dispatch got moved to TensorBody.h so that it's more widely accessible. This diff is NOT semantics preserving. Here are the major differences: - In a number of native operator implementations, we tested that arguments are not variable. I replaced these with asserts that variable is excluded from dispatch. I actually don't think these asserts are really necessary now (they should certainly be true, but it's hard to get it wrong), but I've kept them for old time's sake. At least, they'll detect if you call these functions before you've processed variable (indicating a bug in your kernel.) - There are a number of places where we do a per-tensor test for being a variable, for better error reporting when someone commits Tensor/Variable confusion. Although these tests are substantively the same as the tests above, in these cases I decided to delete the test entirely. The reasoning is that in these cases, we didn't really care about dispatch (also, see above; I'm not too sure we really need the dispatch asserts), we cared about Tensor/Variable confusion. Since Tensor/Variable confusion is impossible now, we don't need the tests. One of the key factors which pushed me one way or another was whether or not a function was doing per-tensor validation; if I kept the assert in such functions, I'd repeatedly access the TLS. Even if we want to bring back the asserts, they would have to go somewhere else. Another similar idiom is the number of places we do !x.defined() \|\| x.is_variable(); I treated this equivalently. - nuclear_norm's computation of compute_uv is a bit weird, but I think it's OK to just delete the is_variable case (I suspect that it is always the case that self.is_variable(), but it doesn't really matter.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18496168 Pulled By: ezyang fbshipit-source-id: 5a1ded931e0c10a6b758ba64a8380d34110e0c3e	2019-11-14 11:41:02 -08:00
Sebastian Messmer	d9de2e0ba9	Back out "Revert D17936166: [wip] Constexpr type ids" (#28155 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28155 Original commit changeset: 92c63a96dedd ghstack-source-id: 92051874 Test Plan: unit tests Differential Revision: D17964410 fbshipit-source-id: 1d989d28b3e1de6d43c915f122f2b65a77a332eb	2019-10-16 18:24:04 -07:00
Lu Fang	1819fade35	Revert D17936166: [wip] Constexpr type ids Test Plan: revert-hammer Differential Revision: D17936166 Original commit changeset: 68cfa926c721 fbshipit-source-id: 92c63a96dedd8764e342c6437c6ea308d93d29b2	2019-10-16 06:47:10 -07:00
Sebastian Messmer	9cc4405dc9	Constexpr type ids (#28023 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28023 ghstack-source-id: 91987335 Test Plan: waitforsandcastle Differential Revision: D17936166 fbshipit-source-id: 68cfa926c721e5fbc96e083eb47e784bf34a9df4	2019-10-15 21:21:20 -07:00
Sebastian Messmer	ef8bcfe2c7	Revert D17488861: constexpr type ids Test Plan: revert-hammer Differential Revision: D17488861 Original commit changeset: ce7b059d7c86 fbshipit-source-id: 426fca9abe7122190fc17ac6976bc6bcbd5718df	2019-10-15 09:59:21 -07:00
Sebastian Messmer	6f865c1e37	constexpr type ids (#26502 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26502 Create type ids at compile time instead of incrementing a counter at runtime. This is done by computing a compile time crc64 on the type name. We couldn't do this before, because we still used GCC4 and that compiler didn't support the use of `__PRETTY_FUNCTION__` in a constexpr context. However, since GCC5 this is possible and we can use this trick. This does not change the semantics of preallocated type ids. I actually think we don't need to preallocate anymore, but I split the removal of preallocation into a separate diff to be able to test it separately. ghstack-source-id: 91896920 Test Plan: unit tests Differential Revision: D17488861 fbshipit-source-id: ce7b059d7c8686b69cb091a4a8beaf4b96391343	2019-10-15 08:47:09 -07:00
Edward Yang	0b6186d778	Remove Tensor.h, TensorMethods.h from src/core. (#27086 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27086 This is a major source of merge conflicts, and AFAICT isn't necessary anymore (it may have been necessary for some mobile build stuff in the past). This is a commandeer of #25031 Test Plan: Imported from OSS Reviewed By: ljk53 Differential Revision: D17687345 Pulled By: ezyang fbshipit-source-id: bf6131af835ed1f9e3c10699c81d4454a240445f	2019-10-06 09:37:50 -07:00
Will Feng	3a12520844	Pass Variable into Caffe2 ops, by requiring that the Variable doesn't require grad (#22473 ) Summary: As part of the Variable/Tensor merge, we want to be able to pass Variables into Caffe2 without doing extra shallow copy, to improve performance and also allow for in-place mutations in Caffe2 ops. There are a few approaches outlined in https://github.com/pytorch/pytorch/pull/22418, and this PR is the chosen approach. Specifically, we can have the assumption that we won't be connecting autograd to C2 gradients at any point (as it's too tricky and not that useful). Therefore, we can pass Variable into Caffe2 ops by requiring that all Variables in Caffe2 don't require grad. For code paths in Caffe2 that might potentially track gradients (e.g. `ScriptModuleOp` and `call_caffe2_op_from_c10`), we use the `torch::NoGradGuard` to make sure gradients are not tracked. This supersedes https://github.com/pytorch/pytorch/pull/22418. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22473 Differential Revision: D16099042 Pulled By: yf225 fbshipit-source-id: 57efc3c7cfb3048d9abe90e63759acc14ebd2972	2019-07-08 11:31:10 -07:00
Yinghai Lu	cf7ef5e631	Add onnxifi support for Int8FCDNNLowPPackedWeightBlob (#20564 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20564 Reviewed By: bddppq Differential Revision: D15106712 fbshipit-source-id: 428db9c23cfd36ddedc8d79121fbbb3bb484c993	2019-05-20 16:57:11 -07:00
Rui Zhu	c129ab06e9	Change onnxifi workflow to support multi-group quantized & Add multi quantization info to caffe2.proto (#20439 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20439 This is the QTensorProto workflow for multi group quantization in C2 side. No DNNLOWP Tensor related thing is included in this pr, so once we finished glow side, we should be able to test this pr using resnet50. Reviewed By: yinghai Differential Revision: D15096919 fbshipit-source-id: 741eecd59eb79d24d9fe2b035f6246d42422d25c	2019-05-15 19:24:08 -07:00
Jerry Zhang	40a54bf2f1	Change ReinitializeTensor to use C10_LOG_FIRST_N (#18531 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18531 Currently we use C10_LOG_EVERY_MS to log the data type change, but it pollutes the log of some service, we would like to change it to C10_LOG_FIRST_N to prevent that. Reviewed By: dzhulgakov Differential Revision: D14647704 fbshipit-source-id: b84e4002bd4aa94d616133cd1049c3d4ab05386e	2019-04-02 21:03:37 -07:00
Rui Zhu	19fe2b9db4	Adding quantized tensor shape/type info support for caffe2=>glow in caffe2 side (#18621 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18621 This diff added caffe2 support for onnxifi quantization. Reviewed By: yinghai Differential Revision: D14648767 fbshipit-source-id: 4ddb492cacbba6142305866e6dbb875880acaea3	2019-03-31 17:42:27 -07:00
Edward Yang	4404762d7d	Rename IntList to IntArrayRef. (#16751 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16751 This was made more complicated by the fact that ivalue::IntList is a thing. So I had to fix all of the sites where we referring to IValue post facto. The following codemods were run, in this order: ``` codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in IntList IntArrayRef codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in IntArrayRef::create IntList::create codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in ivalue::IntArrayRef ivalue::IntList codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in Tag::IntArrayRef Tag::IntList codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in isIntArrayRef isIntList codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in toIntArrayRef toIntList codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in 'Shared<IntArrayRef>' 'Shared<IntList>' codemod -m -d . --extensions cc,cpp,cu,cuh,h,hpp,py,cwrap,yaml,in 'intrusive_ptr<IntArrayRef>' 'intrusive_ptr<IntList>' ``` Some manual fixups were done afterwards; they can be reviewed separately at https://github.com/pytorch/pytorch/pull/16752 Reviewed By: dzhulgakov Differential Revision: D13954363 fbshipit-source-id: b5c40aacba042402155a2f5a229fa6db7992ac64	2019-02-05 14:54:34 -08:00
Dmytro Dzhulgakov	a061e3fd77	Back out "Revert D13596031: Improve c2-aten tensor interop and add proper testing" (#16514 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16514 Original commit changeset: dc371697f14b Relanding https://github.com/pytorch/pytorch/pull/15860 - the problem was that layer_norm was using at::empty which is not yet on mobile Reviewed By: ezyang Differential Revision: D13861480 fbshipit-source-id: e2116da32bc117175c96b9151b1beba9b31eff36	2019-01-31 13:38:55 -08:00
Edward Yang	3b337e7892	Revert D13596031: Improve c2-aten tensor interop and add proper testing Differential Revision: D13596031 Original commit changeset: d20b601e06ba fbshipit-source-id: dc371697f14b3893a9164380a39e7a49d8d68ecf	2019-01-29 07:14:57 -08:00
Dmytro Dzhulgakov	5e21e0fe75	Improve c2-aten tensor interop and add proper testing (#15860 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15860 Few changes (which are harder to split in separate diffs, so together): - make conversion explicit (as they can throw to avoid surprises) - fix tensor legacy dispatch not initialized when tensor is created on C2 side - add a bunch of invariants to enforce Reviewed By: ezyang Differential Revision: D13596031 fbshipit-source-id: d20b601e06ba47aeff2f6e8e15769840e2d46108	2019-01-28 23:41:50 -08:00
Jerry Zhang	0a3932acb2	Fix comparison in ReinitializeTensor (#16294 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16294 In `ReinitializeTensor`, we compare `tensor->GetDevice()` and `options.device()`, but in the callsite, we actually just provide an option with `device_type`, which means the `device_id` will always be default(-1) for `options`, but for tensor, although it is passed a `device` with default `device_id`, when we allocate the data, the `device` of the `tensor` is the `device` of `Storage`, which is the `device` of underlying `DataPtr`, which is the same as the `device` of the `Context` of the operator, which has a non-default `device_id`. Therefore everytime we do `ReinitializeTensor`, we'll find the `device` does not match, and after the `ReinitializeTensor` call, the `device` still does not match. That's why everytime we'll allocate a new Tensor and cause perf regressions for ops that uses `ReinitializeTensor` on multiple GPUs. Reviewed By: BIT-silence Differential Revision: D13795635 fbshipit-source-id: 24d6afa1a0196a32eb0134ee08b4280244cdb0c3	2019-01-23 19:29:29 -08:00
Dmytro Dzhulgakov	da9e49e586	Remove Context dependency from Tensor class (#14269 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14269 Removes reference to Context proper and instead adds a bool argument for async copy (the same as `copy_`) For CopyFrom - I haven't tweaked all callsites yet. Instead I rely on a terrible hack that pointer to context is implicitly converted to bool when passed, haha :) It's not a good code and I propose to fix it in a follow up diff (maybe using clangr tooling). Reviewed By: ezyang Differential Revision: D13117981 fbshipit-source-id: 7cb1dc2ba6a4c50ac26614f45ab8318ea96e3138	2018-11-28 15:45:38 -08:00
Jerry Zhang	1c2ed4eb23	Tensor construction: combine Resize+mutable_data - 1/4 (#13942 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13942 Codemod generated with clangr shard mode, 25 files per diff, motivation: https://github.com/pytorch/pytorch/pull/12407 Reviewed By: smessmer Differential Revision: D13054770 fbshipit-source-id: a9e86e5dfcb4f7cebf5243e1d359fad064561bed	2018-11-19 15:33:50 -08:00
Jerry Zhang	0f59dcb317	Remove partially initialized Tensor + CopyFrom (#13629 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13629 Previously we have a Tensor which has a initialized storage(therefore a known device_type) and then we'll call CopyFrom on it to initialize the sizes and data. We want to eliminate partially initialized Tensor by replacing the pattern of calling CopyFrom with a partially initialized Tensor with either splitting that to undefined Tensor + initialization API(1)(3) or combine all the initialization in the same step(2). 1. member variable initialization + CopyFrom Previously we have a tensor that is initialized with device_type, and then use CopyFrom to populate the content, now we remove the partial initialization by make the original member variable an undefined Tensor and use ReinitializeFrom to copy from another Tensor. 2. Output + CopyFrom Previously, we first get a tensor with device_type, and then CopyFrom another Tensor, We changed it two combining these two operations into OperatorBase::OutputTensor. 3. Output + custom functions Example can be found in TransformGPU function. In this case we move the part that initializes the tensor outside of the function, and do that explicitly outside so that we could reuse the Output functions to make a fully initialized Tensor. Note that to keep the original semantics, both of the APIs has a caching effect based on device_type, which means we only create a Tensor object when device_type does not match or the Tensor is undefined, otherwise, we will reuse the original Tensor object. Reviewed By: dzhulgakov Differential Revision: D12848855 fbshipit-source-id: 37bb4ddc1698ebea533b73006eeb1218faa8ddf8	2018-11-07 11:31:03 -08:00
Jerry Zhang	ebaabfbbd5	ReinitializeTensor function for refactoring Tensor as member variable (#13147 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13147 We want to refactor ``` class A { void func() { x_.Resize(dims); auto* data = x_.mutable_data<T>(); } Tensor x(CPU); }; ``` to ``` class A { void func() { ReinitializeTensor(&x_, dims, at::dtype<T>().device(CPU)); auto* data = x_.mutable_data<T>(); } Tensor x_; // Undefined Tensor }; ``` This diff adds the ReinitializeTensor function. Reviewed By: dzhulgakov Differential Revision: D10861298 fbshipit-source-id: 9f432297d07a4890e29bb68436364e0b2e2545e7	2018-11-05 19:13:55 -08:00
Jerry Zhang	edd902594a	Renaming meta() to dtype() - 1/2 (#13333 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13333 Codemod generated with clangr shard mode, 50 files per diff, clangr code(meta->dtype): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp Reviewed By: ezyang Differential Revision: D12845168 fbshipit-source-id: 492091963d2211ea80215200e981965767566135	2018-10-31 17:14:08 -07:00
Roy Li	b818d31a3e	use TypeMeta instead of ScalarType in TensorOptions (#13172 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13172 reland D10419671 Reviewed By: ezyang Differential Revision: D12143282 fbshipit-source-id: 43504d06a901af30130ebe97fb0b33def45cdc9a	2018-10-29 11:15:37 -07:00
Jerry Zhang	eea2ee6d29	Renaming size() to numel() - 1/17 Summary: Codemod generated with clangr shard mode, 25 files per diff Reviewed By: li-roy Differential Revision: D10866237 fbshipit-source-id: 020fcfdf52083430c5b674eda8e07ad3adfcc838	2018-10-26 15:36:59 -07:00
Peter Goldsborough	8797bb1d30	Revert D10419671: use TypeMeta instead of ScalarType in TensorOptions Differential Revision: D10419671 Original commit changeset: 9cc8c5982fde fbshipit-source-id: c870ecdd3730cf695007ebb110d362996da05e5d	2018-10-26 11:09:58 -07:00
Roy Li	a70573b589	use TypeMeta instead of ScalarType in TensorOptions (#12768 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12768 Note: DefaultTensorOptions no longer fits in 64-bits. I kept functions that take ScalarType as input to minimize changes for now. Reviewed By: ezyang Differential Revision: D10419671 fbshipit-source-id: 9cc8c5982fde9ff243e03d55c0c52c2aa2c7efd8	2018-10-26 09:27:12 -07:00
Jerry Zhang	dd7c2d4284	Change the function signature for caffe2::empty (#13015 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13015 att Reviewed By: ezyang Differential Revision: D10469310 fbshipit-source-id: f4621fe5d17bb4663192860f81effe6bdfe21bea	2018-10-24 13:14:24 -07:00
Jerry Zhang	353fdefdd6	dims() -> sizes() (caffe2/core) (#13014 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13014 Tensor method renaming using clangr Reviewed By: ezyang Differential Revision: D10467556 fbshipit-source-id: 7d7eaf5fc59bbb493c057d5b8bfdda03b140c97e	2018-10-24 12:49:28 -07:00
Jerry Zhang	ab1a25aa9b	caffe2::empty for Resize+mutable_data refactor (#12407 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12407 We want to use tensor factory to refactor the caffe2's old way of initialize Tensor by Resize and mutable_data in order to eliminate uninitialized Tensor. Previously when we want to create a Tensor in caffe2, we'll do the following ``` Tensor x(CPU); // device type provided x.Resize({1, 2, 3}); // size provided x.mutable_data<float>(); // data type provided and memory allocated ``` This leaves Tensor in not fully initialized state during the process, to eliminate this, we want to provide all the needed information in the begining. ATen already has its TensorFactories: https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/TensorFactories.cpp, and there is a TensorOption, we want to adopt the same interface to ease future refactoring. In the callsite, we used to have `Output(i)` that returns a `Blob` that contains an uninitialized `Tensor` and we'll call Resize and mutable_data afterwards to provide dimension and data type, ``` // uninitialized tensor auto* Y = Output(0); // set dimensions Y->Resize({1, 2, 3}); // actually allocate the data auto* data = Y->mutable_data<float>(); // After this step, Tensor is fully initialized. ``` We want to change it to the following: ``` // provide dimensions and TensorOptions which include device type and data type. // This will set all the information of Tensor properly and also allocate memory. auto* Y = Output(0, {1, 2, 3}, at::device({context_.device_type()}).template dtype<T>()); // Tensor is fully initialized after this step // following `mutable_data` call won't allocate memory. auto* data = Y->mutable_data<float>(); ``` microbenchmarks ``` ============================================================================ caffe2/caffe2/fb/benchmarks/core_overhead_benchmark.ccrelative time/iter iters/s ============================================================================ OperatorNewOutputTensorAPI 3.27us 306.05K OperatorOldOutputTensorAPI 3.55us 281.54K ============================================================================ ``` Reviewed By: ezyang Differential Revision: D10207890 fbshipit-source-id: f54ddacaa057b7c6bc7d5a8290171f35e9e40e29	2018-10-17 13:03:06 -07:00
Jerry Zhang	7724807551	Remove ExtractDeviceOption from StaticContext (#12304 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12304 - make ExtractDeviceOption to be a free function. - Add a Strorage(at::Device) constructor in order to preserve the device_id. Reviewed By: dzhulgakov Differential Revision: D10069839 fbshipit-source-id: a5f3994a39bdf1b7503b39bb42c228e438b52bfa	2018-10-10 14:12:16 -07:00
Sebastian Messmer	6f664d3917	Improve TypeMeta (#11502 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11502 TypeMeta now is only a pointer to a TypeMetaData structure, of which there is exactly one global instance per type. This reduces the size of everything storing a TypeMeta (Tensor, Blob, ...) and potentially improves performance. Also, this diff gets rid of the type name registry in favor of static strings. Experiments (summary: 1-3% perf gain) - Service Lab: https://our.intern.facebook.com/intern/servicelab/30712497/ -> No significant results found. - Mobile Lab c10bench.json: https://our.intern.facebook.com/intern/fblearner/details/75984908/ -> 1-3% perf gain - Mobile Lab c10bench default: https://our.intern.facebook.com/intern/fblearner/details/75984999/ -> 2-3% perf gain - adindexer canary: https://our.intern.facebook.com/intern/ads/canary/413002142824203076 -> no significant changes (benchmark too noisy) - adfinder canary: https://our.intern.facebook.com/intern/ads/canary/413002166737860362 -> no significant changes (benchmark too noisy) Reviewed By: dzhulgakov Differential Revision: D9763422 fbshipit-source-id: fc08937f114af5ff9f3ddbe7c7e396942868cdf5	2018-10-06 14:09:28 -07:00
Edward Yang	54d9823d00	Make caffe2::Tensor::dims() return an IntList instead of a const vector& (#12180 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12180 I had to fix a lot of call sites, because a lot of places assume that you can actually get a const vector&, and if the internal representation of sizes in a tensor is NOT a vector, it's not possible to fulfill this API contract. Framework changes: - I deleted TensorImpl::dims(); caffe2::Tensor::dims() just forwards to sizes() now. - De-templatized SetDims; now it is an explicit list of ArrayRef and variadic overloads. This makes implicit conversions work again, so I don't need to explicitly list the std::vector cases too. - As a knock-on effect, this causes Reset() to accept at::IntList as well as const std::vector<int64_t>& - Edited variadic overloads of SetDims to all forward to the underlying arbitrary-dim implementation, reducing code duplication. (It's probably marginally less efficient in the new world.) - Replace Tensor constructor accepting const std::vector<int64_t>& with at::IntList - Make MKLTensor accept ArrayRef along with vector in constructor and Reset (unfortunately, no implicit conversions here, since it's templated on index type.) - There are a few other places, like cudnn, where I changed functions that previously took const std::vector<int64_t>& to take at::IntList instead. Classification of call site changes: - 'const std::vector<int64_t>& x_dims = x.dims()' ==> 'at::IntList x_dims = x.dims()' - 'std::vector<int64_t> x_dims = x.dims()' ==> 'std::vector<int64_t> x_dims = x.dims().vec()' (we need a copy!) Usually this is because we're about to mutably modify the vector to compute some new dimension. However, it also very commonly occurs in the form: 'x_dims_ = x.dims()' because we frequently cache sizes in operators. - Instead of constructing std::vector<int64_t>{blah, blah}, construct an at::IntList directly ArrayRef changes: - cbegin()/cend() iterators, they operate the same aas begin()/end() because everything on ArrayRef is const. - Moved operator<< into ArrayRef.h, so that it's always available when working with ArrayRef. I also templated it, so it now works on an ArrayRef of any type. - Add operator== overload for ArrayRef, and also add variants to permit comparison of ArrayRef with std::vector, a very common operation. (The non-templated version of operator== can get these automatically via implicit conversion, but with templates C++ refuses to do any explicit conversions.) I'm planning to audit all dims() call sites to make sure they don't expect 'auto x = t.dims()' to give you an x whose lifetime can validly outlive the tensor. I opted not to do a dims() to sizes() rename, because dims() also matches the protobufs accessor. Bad news! Reviewed By: jerryzh168 Differential Revision: D10111759 fbshipit-source-id: a2a81dc4b92c22ad4b3b8ef4077a7e97b6479452	2018-10-05 15:57:41 -07:00
Edward Yang	c5fc2f1105	Merge UndefinedTensorImpl. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11972 Reviewed By: gchanan, Yangqing, jerryzh168 Differential Revision: D9995633 fbshipit-source-id: 6b4645c9d4bb0bc4301cd4bcfa76cf85331b8379	2018-09-27 18:40:16 -07:00
Christian Puhrsch	a9e6a673ae	Remove caffe2::Tensor::capacity_nbytes, at::Tensor::to##name##Data, (#11876 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11876 Modern C++ api instead of macros, item() is aligned with Python frontend. caffe2::Tensor::capacity_nbytes is effecitvely unused and confusing w.r.t. caffe2::Tensor::nbytes(). codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCByte "item<uint8_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCLong "item<int64_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCInt "item<int32_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCDouble "item<double>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCFloat "item<float>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toByteData "data<uint8_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toLongData "data<int64_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toIntData "data<int32_t>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toDoubleData "data<double>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toFloatData "data<float>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCByte "item<uint8_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCLong "item<int64_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCInt "item<int32_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCDouble "item<double>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCFloat "item<float>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toByteData "data<uint8_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toLongData "data<int64_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toIntData "data<int32_t>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toDoubleData "data<double>" codemod -d hphp --extensions cc,cpp,cu,cuh,h,py,hpp,mm toFloatData "data<float>" codemod -d caffe2 --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCComplexDouble "item<std::complex<double>>" codemod -d tc --extensions cc,cpp,cu,cuh,h,py,hpp,mm toCFloat "item<float>" Reviewed By: ezyang Differential Revision: D9948572 fbshipit-source-id: 70c9f5390d92b82c85fdd5f8a5aebca338ab413c	2018-09-24 10:40:10 -07:00
Christian Puhrsch	a6630e25af	Remove many caffe2::TIndex and replace them with int64_t (#11943 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11943 See title Reviewed By: ezyang Differential Revision: D9992645 fbshipit-source-id: e8f80d6ea762971513e5e8072975ceea53e1f11a	2018-09-22 18:11:04 -07:00
Alexander Sidorov	eb039dc92c	Add CHECKs into GetTensorInfo and ExtractDeviceOption (#11597 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11597 We should always CHECK pointers which we plan to dereference if they are inputs to the function. Nobody knows how the function will be called in the future. Reviewed By: yinghai Differential Revision: D9800002 fbshipit-source-id: 7fd05f4717f2256d1b09a9e75475b12de6685b03	2018-09-14 09:40:27 -07:00
Edward Yang	912d3626c8	Split tensor.h into tensor_impl.h and tensor.h (#11642 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11642 This is just a preparatory change to help with future refactoring: - I want to reduce the number of includes that tensor_impl.h depends on, but - I need to keep tensor.h providing all Caffe2 headers, because users may be relying on tensor.h transitively providing those headers. Introducing a level of indirection lets me do both at the same time. Reviewed By: jerryzh168 Differential Revision: D9810823 fbshipit-source-id: 8dfaac4b8768051a22898be8fcaf787ecc57eb13	2018-09-13 12:26:20 -07:00
Christian Puhrsch	36fc1a0a58	Merge caffe2::/at::Storage Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11637 Reviewed By: gchanan Differential Revision: D9806425 Pulled By: ezyang fbshipit-source-id: e20ec93bff6dc7fb22ca9b7e7348d060b3876b67	2018-09-13 09:40:48 -07:00
Yinghai Lu	316c167940	Add checking of nullptrs in GetTensorInfo (#11587 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11587 To help debug the issue in T33295362, we add some checks in the function. Possible crashing site in `GetTensorInfo` 1. tc is nullptr, which is checked. 2. tc->capacity_nbytes() hits nullptr, this is unlikely because storage is not a pointer and compute of capacity_nbytes doesn't involve pointers. It's numel * itermsize(). 3. tc->ExtractDeviceOption hits nullpt. One possibility raw_data() is nullptr because tc->ExtractDeviceOption will use that. This is checked. 4. Tensor itself which is not a reference. This is also checked. Reviewed By: salexspb Differential Revision: D9793484 fbshipit-source-id: 3fc72746fc310a23ae45553bbe0d269a4b9edb72	2018-09-12 16:25:46 -07:00
Edward Yang	82ddeb7f2b	Using shared implementation in Tensor (#10619 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10619 Pull Request resolved: https://github.com/pytorch/pytorch/pull/9047 Reviewed By: jerryzh168 Differential Revision: D8417101 fbshipit-source-id: 98e0a3275864283c2f06d28f4c9b859b5827ed4d	2018-08-23 13:39:53 -07:00
Edward Yang	145eb330ad	Back out "Back out "Move typeid.h to move to ATen/core"" (#10465 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10465 Original commit changeset: 7050fe845e65 Reviewed By: jerryzh168 Differential Revision: D9296375 fbshipit-source-id: cb8161440ba809dcec5027858a29cd026d537fc3	2018-08-13 11:20:01 -07:00
Edward Yang	1dbdc5a93d	Back out "Move typeid.h to move to ATen/core" Summary: Original commit changeset: 21f2c89e58ca Reviewed By: yinghai Differential Revision: D9282171 fbshipit-source-id: 7050fe845e6524b965bdd45794a6fa1665b83e34	2018-08-10 21:39:25 -07:00
Edward Yang	3667d029b4	Move typeid.h to move to ATen/core (#10163 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10163 - Remove dependency on caffe2/core/common.h for ATen/core/typeid.h Unfortunately, Windows seems to rely on typeid.h including this header, so it is still included from the forwarding header caffe2/core/typeid.h - Deduplicate Demangle/DemangleType with their ATen equivalents Reviewed By: smessmer Differential Revision: D9132432 fbshipit-source-id: 21f2c89e58ca1e795f1b2caa316361b729a5231b	2018-08-10 08:45:44 -07:00
Dmytro Dzhulgakov	7bc87172ea	Kill Tensor::shares_data (#10217 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10217 It's only used in debug printing and is not that reliable anyway. If we want to implement it later - we should do it proper accounting for shared storages. Reviewed By: jerryzh168 Differential Revision: D9155685 fbshipit-source-id: 48320d41a0c4155645f3ba622ef88730a4567895	2018-08-03 17:40:39 -07:00
Edward Yang	5765549155	codemod -d caffe2 --extensions cc,h CaffeTypeId TypeIdentifier (#10166 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10166 TypeIdentifier is still easy to codemod away from Reviewed By: smessmer Differential Revision: D9132840 fbshipit-source-id: bc83a8b17b2e7c19c9d2c9cfe5c7ce6ec1d8cec5	2018-08-02 11:54:30 -07:00
Jerry Zhang	aebf3b47ae	Remove template parameter from Tensor (#9939 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/9939 Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13 Pull Request resolved: https://github.com/pytorch/translate/pull/166 Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125 Closes https://github.com/pytorch/pytorch/pull/9125 Use inheritance for polymorphism, and remove template parameter This is to change the templating in call sites, the core implementations will change later Before Caffe2 Tensor class was compile-time fixed to bind to a particular device/context. With this change, we're making it a runtime property (stored inside the tensor), but preserve the same semantics. For example, one has to specify device type in order to create a Tensor - there are no uninitialized tensors. More specifically the changes are: 1. We added an extra argument DeviceType to most of the constructors of the tensor, e.g. (Tensor(DeviceType type)), 2. Semantics of constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context); is changed, in this constructor, the second context is passed in to enable us to call the templated Copy function, it could be in a different context as source and target previously, now we'll enforce that the context should have same device type as src, if it is provided. 3. To preserve 'get-or-construct' semantics of Blob, we added specialized getter Blob::GetMutableTensor that verifies both that Blob contains a Tensor and that it's of a correct type 4. Specifically, Tensor type is not default-constructible any more (as we don't have unknown device tensors) and thus some of the code handling STL containers needs to change Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s. Reviewed By: ezyang, houseroad Differential Revision: D9024330 fbshipit-source-id: e0b8295d2dc6ebe2963383ded5af799ad17164ba	2018-07-27 10:56:39 -07:00
Jerry Zhang	969b62f276	Revert D8121878: Remove template parameter from Tensor Differential Revision: D8121878 Original commit changeset: 4a5e9a677ba4 fbshipit-source-id: d8e2c0bb145b52fbcca323b22d1d3346f0b3249e	2018-07-26 14:02:04 -07:00
Jerry Zhang	cd5adc7b5f	Remove template parameter from Tensor (#13 ) Summary: Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13 Pull Request resolved: https://github.com/pytorch/translate/pull/166 Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125 Closes https://github.com/pytorch/pytorch/pull/9125 Use inheritance for polymorphism, and remove template parameter This is to change the templating in call sites, the core implementations will change later Before Caffe2 Tensor class was compile-time fixed to bind to a particular device/context. With this change, we're making it a runtime property (stored inside the tensor), but preserve the same semantics. For example, one has to specify device type in order to create a Tensor - there are no uninitialized tensors. More specifically the changes are: 1. We added an extra argument DeviceType to most of the constructors of the tensor, e.g. (Tensor(DeviceType type)), 2. Semantics of constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context); is changed, in this constructor, the second context is passed in to enable us to call the templated Copy function, it could be in a different context as source and target previously, now we'll enforce that the context should have same device type as src, if it is provided. 3. To preserve 'get-or-construct' semantics of Blob, we added specialized getter Blob::GetMutableTensor that verifies both that Blob contains a Tensor and that it's of a correct type 4. Specifically, Tensor type is not default-constructible any more (as we don't have unknown device tensors) and thus some of the code handling STL containers needs to change Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s. Reviewed By: xw285cornell Differential Revision: D8121878 fbshipit-source-id: 4a5e9a677ba4ac82095df959851a054c81eccf81	2018-07-26 10:25:23 -07:00
Sebastian Messmer	d2610fb379	Constexpr Type Ids -> 6.5% caffe2 perf improvement (#9603 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/9603 Using constexpr for some heavily queried type ids gives us a 6.5% perf improvement for caffe2. Benchmark results: P59829647 Also ad canaries (but they don't show a significant difference): - adfinder: - https://our.intern.facebook.com/intern/ads/canary/411346509423301481 - https://our.intern.facebook.com/intern/ads/canary/411346563021753557 - adindexer: - https://our.intern.facebook.com/intern/ads/canary/411346517006038367 - https://our.intern.facebook.com/intern/ads/canary/411346571387258927 - multifeed_predictor: - https://our.intern.facebook.com/intern/ads/canary/411346526631282941 - https://our.intern.facebook.com/intern/ads/canary/411346583141009531 Reviewed By: dzhulgakov Differential Revision: D8841577 fbshipit-source-id: 1a0ce7f2bee1ae54b723caefe5bc7f85a20935b4	2018-07-24 17:24:55 -07:00
Orion Reblitz-Richardson	d1bdb3b10a	Remove core and util warnings (#8239 ) * Fix some signed/unsigned mismatches * Skip unused result warning * Explict fallthrough for murmur hash * Enable aligned new support to eliminate warning * Switch to int instead of unsigned in some cases	2018-06-07 09:10:33 -07:00

1 2

64 Commits