Add `TensorImpl::sym_strides`, bind it to python with `torch.ops.aten.sym_strides`, and use it in `ProxyTensor` and `FakeTensor`.
Before, `ProxyTensor` was generating `ProxySymInt`s for the sizes but not for the strides. Internally we still represent strides with a `SymIntArrayRef`, though, so I ran into some weird issues where sizes were showing up as `ProxySymInt`s but strides were plain `PySymInt`s.
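As a minimal illustration (a sketch, not this PR's diff) of why the consistency matters at the C++ level: with `sym_strides()` available alongside `sym_sizes()`, layout-inspecting code can stay entirely in the `SymInt` domain instead of mixing symbolic sizes with concrete strides.

```cpp
#include <iostream>
#include <c10/core/TensorImpl.h>

// Minimal sketch, not the PR's actual diff: read sizes and strides from the
// same symbolic representation rather than mixing SymInt sizes with
// concrete int64_t strides.
void dump_symbolic_layout(const c10::TensorImpl& impl) {
  c10::SymIntArrayRef sizes = impl.sym_sizes();
  c10::SymIntArrayRef strides = impl.sym_strides();
  TORCH_INTERNAL_ASSERT(sizes.size() == strides.size());
  for (size_t i = 0; i < sizes.size(); ++i) {
    std::cout << "dim " << i << ": size=" << sizes[i]
              << " stride=" << strides[i] << "\n";
  }
}
```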
Differential Revision: [D38594558](https://our.internmc.facebook.com/intern/diff/D38594558)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81300
Approved by: https://github.com/ezyang
This PR relands sym_numel #82374 and fixes the iOS build break in commit 8cbd0031c5, which was a type mismatch in an equality.
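For context, a hypothetical illustration (not the actual commit) of the kind of mismatch that only surfaces on strict builds, where a signed/unsigned comparison warning is promoted to an error:

```cpp
#include <cstdint>
#include <vector>

// Hypothetical illustration, not the actual iOS fix: strict builds with
// -Werror=sign-compare reject equalities that mix signed and unsigned types.
bool rank_matches(const std::vector<int64_t>& sizes, int64_t expected_rank) {
  // sizes.size() is size_t (unsigned), expected_rank is int64_t (signed);
  // comparing them directly warns, and the warning becomes a build error.
  return static_cast<int64_t>(sizes.size()) == expected_rank;
}
```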
Pull Request resolved: https://github.com/pytorch/pytorch/pull/82731
Approved by: https://github.com/malfet
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71196
`caffe2` headers contain code that can elicit warnings when built with strict compiler flags. Rather than force downstream/consuming code to weaken their compiler flags, suppress those warnings in the header using `#pragma clang diagnostic` suppressions.
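A minimal sketch of the suppression pattern (hypothetical header code, not a specific caffe2 header):

```cpp
// Hypothetical header snippet illustrating the pattern: suppress the warning
// locally so downstream code built with strict flags doesn't see it, then
// restore the previous diagnostic state.
#pragma clang diagnostic push
#pragma clang diagnostic ignored "-Wunused-parameter"

// Header-only function whose unused parameter would otherwise warn in every
// consuming translation unit.
inline int example_header_only(int used, int reserved_for_future_use) {
  return used;
}

#pragma clang diagnostic pop
```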
Test Plan: CI Pass
Reviewed By: malfet
Differential Revision: D33536233
fbshipit-source-id: 74404e7a5edaf244f79f7a0addd991a84442a31f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53389
Resize was written to take its arguments by value, which was totally fine when they were ArrayRef or a series of integers, but not so fine when they are std::vector.
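A hedged sketch of the issue (hypothetical signatures, not the actual caffe2 `Tensor::Resize`):

```cpp
#include <cstdint>
#include <vector>

// Hypothetical signatures, not the actual caffe2 code: a by-value parameter
// is cheap for ArrayRef or a handful of ints but copies the whole vector
// when a std::vector<int64_t> is passed; a const reference avoids the copy.
struct ExampleTensor {
  void ResizeByValue(std::vector<int64_t> dims);       // copies at every call site
  void ResizeByRef(const std::vector<int64_t>& dims);  // no copy
};
```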
ghstack-source-id: 123212128
Test Plan:
Existing CI should make sure it builds
Inspected assembly for ios_caffe.cc and saw no more vector copy before
calling Resize
Reviewed By: smessmer
Differential Revision: D26852105
fbshipit-source-id: 9c3b9549d50d32923b532bbc60d0246e2c2b5fc7
Summary:
Since caffe2 and torch have been consolidated, CAFFE2_API should be merged with TORCH_API. Addresses a TODO.
Manually edited some references of the removed `CAFFE2_API`:
* `CONTRIBUTING.md`
* `caffe2/proto/CMakeLists.txt`
* `cmake/ProtoBuf.cmake`
* `c10/macros/Export.h`
* `torch/csrc/WindowsTorchApiMacro.h`
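For context, a condensed sketch of the export-macro pattern these headers implement (the real `c10/macros/Export.h` handles more compilers and build configurations):

```cpp
// Condensed sketch of the export-macro pattern; the real c10/macros/Export.h
// covers more compilers and build configurations.
#if defined(_WIN32)
#define EXAMPLE_EXPORT __declspec(dllexport)
#define EXAMPLE_IMPORT __declspec(dllimport)
#else
#define EXAMPLE_EXPORT __attribute__((__visibility__("default")))
#define EXAMPLE_IMPORT EXAMPLE_EXPORT
#endif

// A single TORCH_API-style macro resolves to "export" while building the
// library and to "import" while consuming it, so one annotation serves both.
#ifdef EXAMPLE_BUILD_MAIN_LIB
#define EXAMPLE_API EXAMPLE_EXPORT
#else
#define EXAMPLE_API EXAMPLE_IMPORT
#endif
```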
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49496
Reviewed By: malfet, samestep
Differential Revision: D25600726
Pulled By: janeyx99
fbshipit-source-id: 7e068d959e397ac183c097d7e9a9afeca5ddd782
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44062
Previously, BackendSelect kernels were still written in the legacy way, i.e. they took one TensorOptions argument instead of scattered dtype, layout, device, pin_memory, and they used hacky_wrapper to be callable. This caused a re-wrapping step: calling into a BackendSelect kernel required taking the individual scattered arguments and packing them into a TensorOptions, and the kernel itself then gathered them again for redispatch.
Now with this PR, BackendSelect kernels are written in the new way and no hacky_wrapper or rewrapping is needed for them.
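A hedged sketch of the two signature styles (hypothetical factory op, not a real BackendSelect kernel):

```cpp
#include <ATen/ATen.h>

// Hypothetical factory-op signatures, not a real BackendSelect kernel.
// Legacy style: options gathered into a single TensorOptions argument.
at::Tensor example_factory_legacy(
    at::IntArrayRef size,
    const at::TensorOptions& options);

// New style: the same fields passed scattered, matching how the dispatcher
// provides them, so no gather/re-wrap round trip is needed.
at::Tensor example_factory_scattered(
    at::IntArrayRef size,
    c10::optional<at::ScalarType> dtype,
    c10::optional<at::Layout> layout,
    c10::optional<at::Device> device,
    c10::optional<bool> pin_memory);
```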
ghstack-source-id: 112825789
Test Plan:
vs master: https://www.internalfb.com/intern/fblearner/details/216117032/
vs previous diff: https://www.internalfb.com/intern/fblearner/details/216170194/
Reviewed By: ezyang
Differential Revision: D23484192
fbshipit-source-id: e8fb49c4692404b6b775d18548b990c4cdddbada
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41461
capacity is misleading, and we have many wrong uses internally. Let's rename it to nbytes to avoid confusion in the future. Ultimately, we could remove this parameter if possible; so far I haven't seen any case where this capacity is necessary.
Test Plan: oss ci
Differential Revision: D22544189
fbshipit-source-id: f310627f2ab8f4ebb294e0dd5eabc380926991eb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37776
* Remove type-specific size tracking in favor of byte size tracking in Storage and StorageImpl
* Changed numel() and set_numel() to nbytes() and set_nbytes()
* Added enum argument to Storage/StorageImpl constructor to indicate new meaning of the size parameter
* Update all callers of the changed API
Part of issue https://github.com/pytorch/pytorch/issues/33950
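A hypothetical sketch of the tag-enum pattern described above (not the real Storage/StorageImpl code):

```cpp
#include <cstddef>

// Hypothetical sketch, not the real StorageImpl: the extra enum argument
// makes it explicit at every call site that the size parameter now means
// bytes rather than an element count.
class ExampleStorage {
 public:
  enum class use_byte_size_t {};

  ExampleStorage(use_byte_size_t /*tag*/, size_t size_bytes)
      : size_bytes_(size_bytes) {}

  size_t nbytes() const { return size_bytes_; }
  void set_nbytes(size_t size_bytes) { size_bytes_ = size_bytes; }

 private:
  size_t size_bytes_;
};

// Call sites then read: ExampleStorage s(ExampleStorage::use_byte_size_t(), 64);
```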
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37028
Differential Revision: D21171334
Pulled By: ezyang
fbshipit-source-id: 37329a379de9a3a83cc5e9007e455a3e1c2d10b8
Summary:
To speed up compilation
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34811
Test Plan: CI
Differential Revision: D20476992
Pulled By: malfet
fbshipit-source-id: 922cde93783fbfc04854851d7a05a635d5239792
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29653
I didn't remove is_variable from Tensor for BC reasons, but I did
remove as many uses as I could from the codebase.
at::impl::variable_excluded_from_dispatch got moved to TensorBody.h
so that it's more widely accessible.
This diff is NOT semantics preserving. Here are the major differences:
- In a number of native operator implementations, we tested that arguments
are not variables. I replaced these with asserts that variable is
excluded from dispatch (see the sketch after this list). I actually don't
think these asserts are really necessary now (they should certainly be
true, but it's hard to get it wrong), but I've kept them for old time's
sake. At least, they'll detect if you call these functions before you've
processed variable (indicating a bug in your kernel).
- There are a number of places where we do a per-tensor test for being a
variable, for better error reporting when someone commits Tensor/Variable
confusion. Although these tests are substantively the same as the
tests above, in these cases I decided to *delete* the test entirely.
The reasoning is that in these cases, we didn't really care about
dispatch (also, see above; I'm not too sure we really need the dispatch
asserts), we cared about Tensor/Variable confusion. Since Tensor/Variable
confusion is impossible now, we don't need the tests. One of the key
factors which pushed me one way or another was whether or not a function
was doing per-tensor validation; if I kept the assert in such functions,
I'd repeatedly access the TLS. Even if we want to bring back the asserts,
they would have to go somewhere else.
Another similar idiom shows up in the number of places we do `!x.defined() ||
x.is_variable()`; I treated those equivalently.
- nuclear_norm's computation of compute_uv is a bit weird, but I think
it's OK to just delete the is_variable case (I *suspect* that it is
always the case that self.is_variable(), but it doesn't really matter.)
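A hedged sketch of the first kind of change (hypothetical kernel, not an actual diff hunk from this commit):

```cpp
#include <ATen/core/Tensor.h>

// Hypothetical native kernel, not an actual diff hunk: a per-tensor
// "is not a variable" check becomes an assert that Variable/autograd handling
// has already been excluded from dispatch by the time the kernel runs.
at::Tensor example_native_kernel(const at::Tensor& self) {
  // Before: TORCH_CHECK(!self.is_variable(), "expected a non-variable tensor");
  TORCH_INTERNAL_ASSERT(at::impl::variable_excluded_from_dispatch());
  return self.clone();
}
```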
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D18496168
Pulled By: ezyang
fbshipit-source-id: 5a1ded931e0c10a6b758ba64a8380d34110e0c3e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28024
We preallocated type ids to align them with ScalarType. At that point, the maximum type id was 10 and we used 11 to specify undefined type id.
However, since then, ScalarType got more additions, 11 isn't undefined anymore, and numbers 11-15 have meaning.
caffe2::TypeIdentifier also got its own separate additions; 12 and upwards have meanings that differ from ScalarType.
I'm going with the (CI-tested) assumption that caffe2::TypeIdentifier and ScalarType actually don't need to be aligned
and remove the functionality for preallocated type ids. This simplifies our type ids.
ghstack-source-id: 92051872
Test Plan: unit tests
Differential Revision: D17936165
fbshipit-source-id: 2c9df2b9b3f35b3e319641c96638321ac3433d5c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26509
We preallocated type ids to align them with ScalarType. At that point, the maximum type id was 10 and we used 11 to specify undefined type id, see https://github.com/pytorch/pytorch/pull/10139.
However, since then, ScalarType got more additions, 11 isn't undefined anymore, and numbers 11-15 have meaning.
caffe2::TypeIdentifier also got its own separate additions; 12 and upwards have meanings that differ from ScalarType.
I'm going with the (CI-tested) assumption that caffe2::TypeIdentifier and ScalarType actually don't need to be aligned
and remove the functionality for preallocated type ids. This simplifies our type ids.
ghstack-source-id: 91896918
Test Plan: unit tests
Differential Revision: D17490109
fbshipit-source-id: 800c340d9d3556a99f6e3ffc33af14ad68d7cc59
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27086
This is a major source of merge conflicts, and AFAICT isn't necessary anymore (it may have been necessary for some mobile build stuff in the past).
This is a commandeer of #25031
Test Plan: Imported from OSS
Reviewed By: ljk53
Differential Revision: D17687345
Pulled By: ezyang
fbshipit-source-id: bf6131af835ed1f9e3c10699c81d4454a240445f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23668
- The eager mode frontend now calls operators that are defined in native_functions.yaml with `use_c10_dispatcher: True` through the c10 dispatcher and no longer through globalATenDispatch().
- These operators aren't registered with globalATenDispatch() anymore, only with c10 now.
- Backend extensions calling globalATenDispatch().registerOp() to add their own kernels still work; this function will forward the registration to the c10 dispatcher for them.
ghstack-source-id: 90130455
Test Plan: benchmarks at https://docs.google.com/document/d/1gpzKZcFf1JJameY1vKxF7Cloul9s6D8HKIK2_Pp1hFo/edit#
Differential Revision: D16603133
fbshipit-source-id: 991f17b355e9c78c5e86fee4fa381df7ab98ac82
Summary:
As part of the Variable/Tensor merge, we want to be able to pass Variables into Caffe2 without doing an extra shallow copy, to improve performance and also to allow for in-place mutations in Caffe2 ops. There are a few approaches outlined in https://github.com/pytorch/pytorch/pull/22418, and this PR is the chosen approach.
Specifically, we work under the assumption that we won't be connecting autograd to C2 gradients at any point (as it's too tricky and not that useful). Therefore, we can pass Variables into Caffe2 ops by requiring that all Variables in Caffe2 don't require grad. For code paths in Caffe2 that might potentially track gradients (e.g. `ScriptModuleOp` and `call_caffe2_op_from_c10`), we use `torch::NoGradGuard` to make sure gradients are not tracked.
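A hedged sketch of the guard pattern (hypothetical call path, not the actual `ScriptModuleOp` / `call_caffe2_op_from_c10` code):

```cpp
#include <torch/torch.h>

// Hypothetical call path, not the actual ScriptModuleOp/call_caffe2_op_from_c10
// code: enforce the "Variables in Caffe2 never require grad" invariant and
// disable gradient tracking for the region that runs torch code from Caffe2.
torch::Tensor run_inside_caffe2(const torch::Tensor& input) {
  TORCH_CHECK(!input.requires_grad(), "Caffe2 inputs must not require grad");
  torch::NoGradGuard no_grad;  // no gradient tracking in this scope
  return input * 2;
}
```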
This supersedes https://github.com/pytorch/pytorch/pull/22418.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22473
Differential Revision: D16099042
Pulled By: yf225
fbshipit-source-id: 57efc3c7cfb3048d9abe90e63759acc14ebd2972
Summary:
#19975 was split into 2 PRs.
This one:
Introduces the MemoryFormat argument to the `x.is_contiguous(memory_format=torch.channels_last)` and `y = x.contiguous(memory_format=torch.channels_last)` functions.
At this moment both functions just operate on strides and don't store any tensor state.
(Original RFC #19092)
-----
Expands the functionality of two tensor functions, `.is_contiguous` and `.contiguous` (both Python and C++ API).
Note: We had several complaints about the `.to(memory_format)` function, and decided not to support it.
1. `.contiguous` now supports an optional keyword-only argument, `memory_format`, which can be either `torch.contiguous_format` or `torch.channels_last`.
- Using `torch.contiguous_format` preserves the existing `.contiguous()` behavior.
- Calling `x.contiguous(memory_format=torch.channels_last)` returns a new tensor which maintains the same semantic layout (NCHW) but has a different memory allocation pattern.
`x.contiguous(memory_format=torch.channels_last)` expects the input tensor to be 3d, 4d or 5d, and fails otherwise.
2. `.is_contiguous` now supports an optional keyword-only argument, `memory_format`, which can be either `torch.contiguous_format` or `torch.channels_last`.
- `x.is_contiguous(memory_format=torch.contiguous_format)` preserves the same functionality as `x.is_contiguous()` and remains unchanged.
- `x.is_contiguous(memory_format=torch.channels_last)` returns true if A) the input tensor is contiguous in memory AND B) it is allocated in memory in NHWC (or similar for 3d, 5d) format.
Note: By the end of phase one, `x.is_contiguous(memory_format=torch.channels_last)` will calculate the state of the Tensor on every call. This functionality is going to be updated later.
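For reference, a short sketch of the corresponding C++ calls as exposed in ATen (the Python `memory_format=` keyword maps to an `at::MemoryFormat` value):

```cpp
#include <ATen/ATen.h>

// Short sketch of the C++ side of the same API; the Python memory_format
// keyword corresponds to an at::MemoryFormat value.
void memory_format_example() {
  at::Tensor x = at::randn({2, 3, 4, 5});  // NCHW semantics, default layout

  // Re-lay-out the data as channels_last while keeping NCHW semantics.
  at::Tensor y = x.contiguous(at::MemoryFormat::ChannelsLast);

  bool a = x.is_contiguous(at::MemoryFormat::Contiguous);    // true
  bool b = y.is_contiguous(at::MemoryFormat::ChannelsLast);  // true
  (void)a;
  (void)b;
}
```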
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20455
Differential Revision: D15341577
Pulled By: VitalyFedyunin
fbshipit-source-id: bbb6b4159a8a49149110ad321109a3742383185d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19388
The old implementation forced a refcount bump when converting at::Tensor to caffe2::Tensor.
Now, it is possible to move it without a refcount bump.
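A hypothetical sketch of the general pattern (not the actual caffe2::Tensor API): accepting the handle by value lets callers move it in, transferring ownership of the underlying intrusive pointer without touching the refcount.

```cpp
#include <utility>

// Hypothetical wrapper, not the actual caffe2::Tensor API: a by-value
// constructor parameter can be moved into the member, so a caller that
// passes std::move(handle) avoids the refcount bump that a copy would cause.
template <typename Handle>
class ExampleWrapper {
 public:
  explicit ExampleWrapper(Handle handle) : handle_(std::move(handle)) {}

 private:
  Handle handle_;
};

// Usage sketch: ExampleWrapper<at::Tensor> w(std::move(tensor));  // no refcount bump
```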
Reviewed By: dzhulgakov
Differential Revision: D14986815
fbshipit-source-id: 92b4b0a6f323ed38376ffad75f960cad250ecd9b