Part 2 of the implementation for general [subclass view fake-ification](https://docs.google.com/document/d/1C5taWiplmX7nKiURXDOAZG2W5VNJ2iV0fQFq92H0Cxw).
Details:
* Codegen `rev_view_func()` alongside `view_func()`
* Reverse view_func gives you a "base" from a "view": `rev_view_func(new_view) -> new_base` AKA it plays the original view backwards
* Utilizes the functional inverses defined in `FunctionalInverses.cpp`, passing `InverseReturnMode::AlwaysView`
* Manually implements functional inverses for `narrow()` and `chunk()`
* **NB: Multi-output views now set view_func() / rev_view_func() for each of the output views!**
* Due to this, the `as_view()` overload that operates on a list of views is scrapped in favor of iteration via codegen
Example codegen in `ADInplaceOrViewTypeN.cpp`:
```cpp
at::Tensor narrow(c10::DispatchKeySet ks, const at::Tensor & self, int64_t dim, c10::SymInt start, c10::SymInt length) {
auto _tmp = ([&]() {
at::AutoDispatchBelowADInplaceOrView guard;
return at::_ops::narrow::redispatch(ks & c10::after_ADInplaceOrView_keyset, self, dim, start, length);
})();
std::function<at::Tensor(const at::Tensor&)> func=nullptr;
std::function<at::Tensor(const at::Tensor&)> rev_func=nullptr;
if (false || !self.unsafeGetTensorImpl()->support_as_strided() ||
c10::AutogradState::get_tls_state().get_view_replay_enabled()) {
func = [=](const at::Tensor& input_base) {
return at::_ops::narrow::call(input_base, dim, start, length);
};
rev_func = [=](const at::Tensor& input_view) {
// NB: args from narrow() signature are passed along to the inverse
return at::functionalization::FunctionalInverses::narrow_copy_inverse(self, input_view, at::functionalization::InverseReturnMode::AlwaysView, dim, start, length);
};
}
auto result = as_view(/* base */ self, /* output */ _tmp, /* is_bw_differentiable */ true, /* is_fw_differentiable */ true, /* view_func */ func, /* rev_view_func */ rev_func, /* creation_meta */ InferenceMode::is_enabled() ? CreationMeta::INFERENCE_MODE : (at::GradMode::is_enabled() ? CreationMeta::DEFAULT : CreationMeta::NO_GRAD_MODE));
return result;
}
```
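For intuition, here is a rough Python sketch of the contract the generated `func` / `rev_func` pair captures for `narrow()`. The lambdas below are illustrative stand-ins only: they do not call the real `FunctionalInverses` machinery (which returns an actual view, per `InverseReturnMode::AlwaysView`), and `slice_scatter` is used purely as a stand-in for `narrow_copy_inverse`.
```python
import torch

base = torch.randn(4, 6)
dim, start, length = 1, 2, 3

# view_func: replay the original view op on a (possibly different) base
view_func = lambda new_base: new_base.narrow(dim, start, length)

# rev_view_func: play the view backwards -- produce a base-shaped tensor from a view
# (illustrative stand-in; the real inverse returns an actual view of its input)
rev_view_func = lambda new_view: torch.zeros_like(base).slice_scatter(
    new_view, dim=dim, start=start, end=start + length
)

view = view_func(base)            # shape (4, 3)
recovered = rev_view_func(view)   # shape (4, 6)
assert view.shape == (4, 3) and recovered.shape == base.shape
```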
Pull Request resolved: https://github.com/pytorch/pytorch/pull/115894
Approved by: https://github.com/soulitzer
Decorates all NT tests with `@markDynamoStrictTest` to ensure we get the correct signal. Adds xfails where needed to get things passing.
Includes a fix in meta_utils.py for a bug that was breaking several Python 3.11 tests. In particular, a dense tensor graph input that is a view of a strided NT would slip past Dynamo's check and break during meta-ification.
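For reference, a minimal sketch of the decoration pattern (the class and test names below are hypothetical, and the import path assumes the decorator lives in `torch.testing._internal.common_utils`; real tests additionally add xfails, e.g. via `unittest.expectedFailure`, for cases known to fail under Dynamo strict mode):
```python
import torch
from torch.testing._internal.common_utils import TestCase, markDynamoStrictTest, run_tests

@markDynamoStrictTest  # run this class's tests with strict Dynamo signal
class TestNestedTensorExample(TestCase):
    def test_jagged_dim(self):
        nt = torch.nested.nested_tensor(
            [torch.randn(2, 5), torch.randn(3, 5)], layout=torch.jagged
        )
        self.assertEqual(nt.dim(), 3)

if __name__ == "__main__":
    run_tests()
```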
Pull Request resolved: https://github.com/pytorch/pytorch/pull/116111
Approved by: https://github.com/soulitzer, https://github.com/zou3519
ghstack dependencies: #115192
This PR removes the need for passing `ragged_size` into the `NestedTensor` constructor. This was an artifact of fake-ification, where sometimes we needed the NT to have a symbolic singleton symint shape for the ragged dimension. The new way of achieving this is to also store mappings from fake / functional tensors to symbolic symints in the ragged structure registry. Now the `NestedTensor` constructor can simply query this registry for the `ragged_size`.
Old: `NestedTensor(values, offsets, *, ragged_size=None, **kwargs)`
New: `NestedTensor(values, offsets, **kwargs)`
This makes it possible to have a `_nested_view_from_values_offsets(values, offsets)` without needing to pass a `ragged_size`.
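A hedged sketch of what the change looks like at the call site; `NestedTensor` here is the private class from `torch.nested._internal.nested_tensor` (internal API, subject to change), and the concrete values/offsets are illustrative only:
```python
import torch
from torch.nested._internal.nested_tensor import NestedTensor

values = torch.randn(6, 3)            # packed values for components of lengths 2, 3, 1
offsets = torch.tensor([0, 2, 5, 6])  # component boundaries along the ragged dim

# Old: NestedTensor(values, offsets, ragged_size=<symbolic ragged size>, ...)
# New: the ragged size is looked up in the ragged structure registry keyed by `offsets`.
nt = NestedTensor(values, offsets)
assert nt.shape[0] == 3               # batch dim = number of components
```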
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113491
Approved by: https://github.com/ezyang, https://github.com/soulitzer
Summary:
Add split and layer_norm_backward.
Note: it is non-trivial to support the backward of split_with_sizes, so the split operation is added instead to support the use case in the model.
Test Plan: unit tests
Differential Revision: D51052966
Pull Request resolved: https://github.com/pytorch/pytorch/pull/113108
Approved by: https://github.com/soulitzer
This PR:
* Adds support for the `layout` kwarg to `torch.nested.as_nested_tensor()`
* Fixes `torch.nested.nested_tensor()`
* It should accept a list of lists of scalars
* It should not preserve autograd history
* Adds extensive testing for these two functions
Semantics for the two functions follow those of the strided layout (a short usage sketch follows this list):
* `torch.nested.nested_tensor(tensor_list, layout=torch.jagged)`: Creates a new jagged layout NT **with no autograd history**
* `tensor_list` can be a list of Tensors or list of lists of scalars
* `torch.nested.as_nested_tensor(tensor_list, layout=torch.jagged)`: Creates a new jagged layout NT **preserving autograd history of `tensor_list`**
* `tensor_list` must be a list of Tensors
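A minimal usage sketch of these semantics (shapes and values are illustrative):
```python
import torch

ts = [torch.randn(2, 5, requires_grad=True), torch.randn(3, 5, requires_grad=True)]

# New jagged NT with no autograd history; also accepts a list of lists of scalars.
nt_new = torch.nested.nested_tensor(ts, layout=torch.jagged)
nt_scalars = torch.nested.nested_tensor([[1.0, 2.0, 3.0], [4.0]], layout=torch.jagged)
assert not nt_new.requires_grad

# New jagged NT preserving the autograd history of `ts`.
nt_view = torch.nested.as_nested_tensor(ts, layout=torch.jagged)
assert nt_view.requires_grad
```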
Pull Request resolved: https://github.com/pytorch/pytorch/pull/112304
Approved by: https://github.com/cpuhrsch, https://github.com/soulitzer
This PR has a number of changes that improve subclass support for AOTAutograd/Inductor in general:
- Previously, if a subclass did extra aliasing between graph outputs/inputs, the partitioner would complain because grad_outputs are the outputs reused as-is. Now we do a `view_as(self)` to work around this.
- Use dense -> dense metadata when working with fwd_output_strides during the backward. This is important since the stride information comes from Inductor, which sees the dense-to-dense graph.
- Inductor requires the inputs to the compiled backward to match certain expected strides computed during compilation. We now make the inner tensors of the subclass contiguous (previously, we only made the subclass itself contiguous).
Changes specific to NestedTensor that are relevant to compilation (a rough end-to-end sketch follows this list):
- Properly handle the case where `__tensor_unflatten__` is passed non-symbolic dense tensors along with metadata extracted from fake subclasses.
- Skip var_to_range logic for singleton int
- Skip size hint logic in inductor for singleton int
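For context, a rough end-to-end sketch of the workflow these fixes target: compiling a simple pointwise function over a jagged-layout NT and running backward through it. The function and shapes below are made up, and this assumes a build recent enough to include the changes above.
```python
import torch

ts = [torch.randn(2, 4, requires_grad=True), torch.randn(3, 4, requires_grad=True)]
nt = torch.nested.as_nested_tensor(ts, layout=torch.jagged)

@torch.compile(backend="inductor")
def f(x):
    return x.sin() * 2.0

out = f(nt)                          # jagged NT output
out.backward(torch.ones_like(out))   # backward through the compiled graph
assert ts[0].grad is not None and ts[0].grad.shape == (2, 4)
```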
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110529
Approved by: https://github.com/bdhirsh
In this PR:
- Adds support for strides for jagged tensor (design doc for this coming soon)
- NestedTensor skips automatic dynamic
- Makes use of @bdhirsh's subclass fakification logic by adding the `__tensor_{un,}flatten__` functions (a schematic sketch of these follows this list).
- Additional logic for fakification: the existing subclass fakification logic does not handle the case where the outer tensor has an additional dimension, so we insert one-off logic to (1) add an extra SingletonSymInt onto the fakified NestedTensor and (2) make sure we call track_symint on the sizes of both the inner and outer tensors during guard creation.
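A schematic sketch of what the `__tensor_flatten__` / `__tensor_unflatten__` pair looks like for NestedTensor; the attribute names, `ctx` contents, and the exact `__tensor_unflatten__` signature below are illustrative and may differ across versions:
```python
import torch

class NestedTensor(torch.Tensor):
    _values: torch.Tensor   # packed dense values
    _offsets: torch.Tensor  # ragged-dim boundaries

    def __tensor_flatten__(self):
        # names of the inner dense tensors to fake-ify recursively, plus metadata
        ctx = {"requires_grad": self.requires_grad, "ragged_idx": self._ragged_idx}
        return ["_values", "_offsets"], ctx

    @staticmethod
    def __tensor_unflatten__(inner_tensors, ctx, outer_size, outer_stride):
        # rebuild the subclass around (possibly fake) inner tensors + metadata
        return NestedTensor(
            inner_tensors["_values"],
            inner_tensors["_offsets"],
            requires_grad=ctx["requires_grad"],
        )
```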
Remaining things that are weird:
- Still need to skip some logic in meta utils for some reason (I was going to write this up more, but decided not to since we're not able to do this anyway for an immediate reason: we cannot arbitrarily compare singleton ints; for now I'm just following Brian's advice from [here](https://github.com/pytorch/pytorch/pull/109171#discussion_r1328137070)).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/109171
Approved by: https://github.com/ezyang, https://github.com/bdhirsh
We want to be able to use SingletonSymNode to represent strides for jagged layout tensors. The following is for 3D, but is easily generalizable to higher dimensions.
Constraints:
- [B, x, D] (where x represents the "variably lengthed" dim) can be strided in two ways: [x, 1, sum(x)] and [D * x, D, 1]. We need two different placeholder values depending on how the jagged tensor is strided.
- When doing operations, we need the strides of output tensors to be expressible in terms of the strides and sizes of the inner tensors. Given [B, x, D] @ [D, D'], the output strides are [x * D', D', 1] rather than some opaque [x2, D', 1]. This constraint exists because if I'm tracing, I need a symint to represent the output stride. This symint needs to come from somewhere; there are several ways to get one: (1) create a constant, (2) create an unbacked symint, (3) create a new input using a source, (4) compute it as the output of an operation on an existing symint. It is clear that (4) is what we want here, which brings us to the design below.
Design:
Given the two constraints, the most straightforward way to implement this is to update SingletonSymNode to include a scalar factor, i.e., morally, SingletonSymNode represents `factor * [s_0, s_1, …, s_n]`. This enables us to symbolically compute strides from sizes.
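As a concrete sanity check of the arithmetic (with plain integers standing in for the symbolic ragged dim `x`): each component of a row-major packed `[sum(x), D]` values buffer, viewed as `[x_i, D]`, has strides `[D, 1]`, and after matmul with a `[D, D']` weight the per-component result has strides `[D', 1]`, consistent with the NT-level rule `[x * D', D', 1]` once the batch dimension is stacked on top.
```python
import torch

D, Dp = 4, 3                           # D' is written Dp here
lengths = [2, 5]                       # concrete stand-ins for the ragged sizes
values = torch.randn(sum(lengths), D)  # packed jagged values
weight = torch.randn(D, Dp)

offset = 0
for x in lengths:
    item = values[offset:offset + x]   # [x, D] view into the packed buffer
    assert item.stride() == (D, 1)
    out = item @ weight                # [x, D']
    assert out.stride() == (Dp, 1)
    offset += x
```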
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110369
Approved by: https://github.com/ezyang
ghstack dependencies: #110044