pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Wanchao Liang	a29b9101fa	[dynamo] fix dynamo + DTensor to work with 2d (#108329 ) pair debugged with @wconstab and we found some issue in both dynamo and the TP's fsdp extension side. This PR fixes the dynamo + DTensor integration so that the current graph break FSDP can work with tensor parallel by moving the torch.compile after FSDP wrapping. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108329 Approved by: https://github.com/Skylion007, https://github.com/wconstab	2023-08-31 22:46:26 +00:00
Yanbo Liang	dabdb97087	[Dynamo] Graph break on functions using tensor out variants (#108182 ) Fixes #108021 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108182 Approved by: https://github.com/eellison	2023-08-31 17:49:14 +00:00
Yukio Siraichi	6ad5568cbc	Break graph on `manual_seed`. (#107594 ) Fix: #107187 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107594 Approved by: https://github.com/eellison	2023-08-30 17:24:11 +00:00
PyTorch MergeBot	4e47ea5131	Revert "Break graph on `manual_seed`. (#107594 )" This reverts commit `6c28de2437`. Reverted https://github.com/pytorch/pytorch/pull/107594 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it seems to cause failures in trunk on inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_uniform_cuda_float, likely a landrace ([comment](https://github.com/pytorch/pytorch/pull/107594#issuecomment-1697783965))	2023-08-29 16:38:01 +00:00
Yukio Siraichi	6c28de2437	Break graph on `manual_seed`. (#107594 ) Fix: #107187 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107594 Approved by: https://github.com/eellison	2023-08-29 12:59:57 +00:00
Jason Ansel	73235d08c3	[dynamo] Graph break on pack_padded_sequence (#108096 ) This is to workaround #93501. Fixes errors in: ``` ./benchmarks/dynamo/torchbench.py --inference --performance --no-skip --inductor --freezing --only tacotron2 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/108096 Approved by: https://github.com/davidberard98	2023-08-29 00:08:11 +00:00
lezcano	db39a81e1e	Add a flag that allows breaking on NumPy ops (#107687 ) This was removed in `63d406a6a9` Resotiring, as it's rather useful for debugging. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107687 Approved by: https://github.com/larryliu0820	2023-08-23 01:21:22 +00:00
lezcano	612c8a8c84	Guard numpy imports in the dynamo folder (#107299 ) Fixes https://github.com/pytorch/pytorch/issues/107228 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107299 Approved by: https://github.com/atalman	2023-08-21 19:07:20 +00:00
Will Constable	eee2f57257	Raise TypeError for calling moduletype in dynamo (#107393 ) Fixes #107314 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107393 Approved by: https://github.com/williamwen42	2023-08-19 20:04:33 +00:00
lezcano	a9dca53438	NumPy support in torch.compile (#106211 ) RFC: https://github.com/pytorch/rfcs/pull/54 First commit is the contents of https://github.com/Quansight-Labs/numpy_pytorch_interop/ We have already been using this in core for the last few months as a external dependency. This PR pulls all these into core. In the next commits, I do a number of things in this order - Fix a few small issues - Make the tests that this PR adds pass - Bend backwards until lintrunner passes - Remove the optional dependency on `torch_np` and simply rely on the upstreamed code - Fix a number dynamo tests that were passing before (they were not tasting anything I think) and are not passing now. Missing from this PR (but not blocking): - Have a flag that deactivates tracing NumPy functions and simply breaks. There used to be one but after the merge stopped working and I removed it. @lezcano to investigate. - https://github.com/pytorch/pytorch/pull/106431#issuecomment-1667079543. @voznesenskym to submit a fix after we merge. All the tests in `tests/torch_np` take about 75s to run. This was a work by @ev-br, @rgommers @honno and I. I did not create this PR via ghstack (which would have been convenient) as this is a collaboration, and ghstack doesn't allow for shared contributions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106211 Approved by: https://github.com/ezyang	2023-08-11 00:39:32 +00:00
kshitij12345	cce2c52b0b	[pt2] support vmap (#101707 ) Teach dynamo about `vmap` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101707 Approved by: https://github.com/zou3519	2023-08-09 03:39:33 +00:00
Tugsbayasgalan Manlaibaatar	df50f91571	Support fx_pytree in dynamo (#105574 ) This PR does two things: 1. Make dynamo trace through fx_pytree (on top of torch.utils._pytree) so that generated graph modules can be retraced. 2. Fix bug where unflatten not returning dynamo VariableTracker. Differential Revision: [D47734623](https://our.internmc.facebook.com/intern/diff/D47734623) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105574 Approved by: https://github.com/yanboliang, https://github.com/ydwu4	2023-07-29 05:08:15 +00:00
Jason Ansel	099345f1e5	[Compiled Autograd] Handle aten.sym_size/aten.sym_stride (#105814 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105814 Approved by: https://github.com/voznesenskym	2023-07-28 21:42:51 +00:00
kshitij12345	920b446da9	dynamo: support disable_saved_tensors_hooks (#104869 ) Functorch transforms use this context manager which will lead to graph-breaks. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104869 Approved by: https://github.com/zou3519	2023-07-26 07:27:37 +00:00
Wanchao Liang	c76c84bde4	[dynamo] make ProcessGroupVariable a DistributedVariable (#105593 ) This PR move the ProcessGroupVariable from UDO to DistributedVT so that Distributed VTs are consolidated together Pull Request resolved: https://github.com/pytorch/pytorch/pull/105593 Approved by: https://github.com/voznesenskym	2023-07-26 06:42:50 +00:00
Michael Voznesensky	bf693f2000	Strengthen ConstantVariable invariants (#105796 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105796 Approved by: https://github.com/ezyang	2023-07-24 20:41:12 +00:00
Wanchao Liang	f139aab2f4	[dynamo] add initial dynamo support for DTensor (#103146 ) This PR adds initial dynamo support for DTensor, in particular, it: - allows DTensor be passed into a compiled function, and allow fakify DTensor during dynamo tracing by turning the inner local tensor to meta tensor. - We use `allow_in_graph` to include `DTensor` and `DTensor.from_local` to be represented as `TorchVariable` - The dtensor created becomes a normal `TensorVariable` and it would insert any tensor operations to the output graph just like torch.Tensor - note that dtensor have a new instance method `redistribute` compare to plain tensor, and we currently special handle it in `TensorVariable` `from_local` and `redistribute` both accepts some non-trival metadata as arguments (i.e. DeviceMesh, Placement) which fx.Graph does not support. In order to let these two APIs appear in the dynamo captured graph, we encoded the metadata into a new_function (like `functools.partial`) and the new function only accepts prim args (i.e. tensor), then we put `call_function` with this new_function to the graph. This is suggested by @ezyang. The underlying rationale here is that the metadata will not change across the graph invocations so it's safe to encode them. Captured graph: ``` def forward(self, L_x_ : torch.Tensor): l_x_ = L_x_ # File: /scratch/wanchaol/work/pytorch/test/distributed/_tensor/test_dtensor.py:685, code: dt = DTensor.from_local(x, mesh, [Shard(0)], run_check=False) prim_from_local = torch__dynamo_variables_torch_prim_from_local(l_x_, run_check = False); l_x_ = None # File: /scratch/wanchaol/work/pytorch/test/distributed/_tensor/test_dtensor.py:686, code: return dt.redistribute(mesh, [Replicate()]).to_local() + 2 prim_redistribute = torch__dynamo_variables_tensor_prim_redistribute(prim_from_local); prim_from_local = None to_local = prim_redistribute.to_local(); prim_redistribute = None add = to_local + 2; to_local = None return (add,) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/103146 Approved by: https://github.com/voznesenskym	2023-07-19 16:01:12 +00:00
David Berard	ad6dad810e	[dynamo][profiler] More verbose profiler warning (#105362 ) torch.profiler.record_function and torch.profiler.profile are ignored by dynamo. In the common case, users have `record_function` in the middle of their program in order to annotate a section of the profile. The previous error message was `Profiler will be ignored`. Users would think that profiling would be completely ignored. Now the message will look like `Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105362 Approved by: https://github.com/yanboliang, https://github.com/aaronenyeshi	2023-07-18 04:42:13 +00:00
Michael Lazos	05eea20eb9	[dynamo] Simulate torch function enablement state (#105091 ) Part of https://github.com/pytorch/pytorch/issues/93723 Pull Request resolved: https://github.com/pytorch/pytorch/pull/105091 Approved by: https://github.com/voznesenskym, https://github.com/anijain2305	2023-07-13 17:42:20 +00:00
Michael Lazos	0433cb0596	[dynamo] simulate tracing tree_map_only (#104815 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/104815 Approved by: https://github.com/voznesenskym	2023-07-10 18:05:35 +00:00
Animesh Jain	4005152b92	[dynamo] Organize higherorderops variable trackers (#104565 ) The main change is moving the higherorderops from torch.py to higher_order_ops.py. And creating smaller subclasses of HigherOrderOp for cond, map etc Pull Request resolved: https://github.com/pytorch/pytorch/pull/104565 Approved by: https://github.com/zou3519	2023-07-05 22:19:26 +00:00
William Wen	76a91075ea	propagate pred guards in TorchHigherOrderOperatorVariable call_function for cond (#104379 ) Fixes https://github.com/pytorch/pytorch/issues/104372 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104379 Approved by: https://github.com/voznesenskym, https://github.com/ydwu4, https://github.com/zou3519	2023-06-29 20:47:00 +00:00
Animesh Jain	c0aa442cb5	[dynamo][higher order op] Relaxing too restrictive check for output to be a list/tuple of tensors (#104221 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104221 Approved by: https://github.com/ydwu4, https://github.com/zou3519	2023-06-28 00:30:43 +00:00
Animesh Jain	75dab587ef	[dynamo] FSDP + AC + torch.compile (#103953 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103953 Approved by: https://github.com/wanchaol	2023-06-24 01:40:56 +00:00
Michael Voznesensky	ec24f1e4cc	Simulate treespec flattening/unflattening (#101896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101896 Approved by: https://github.com/jansel, https://github.com/anijain2305	2023-06-23 10:53:15 +00:00
kshitij12345	d552c271db	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 10:13:09 +00:00
PyTorch MergeBot	e737a8486f	Revert "[pt2] grad support (#102264 )" This reverts commit `85b83954c8`. Reverted https://github.com/pytorch/pytorch/pull/102264 on behalf of https://github.com/huydhn due to This is failing in trunk `85b83954c8` and looks like a landrace ([comment](https://github.com/pytorch/pytorch/pull/102264#issuecomment-1600001309))	2023-06-21 03:02:55 +00:00
kshitij12345	85b83954c8	[pt2] grad support (#102264 ) Teach dynamo about grad Pull Request resolved: https://github.com/pytorch/pytorch/pull/102264 Approved by: https://github.com/zou3519	2023-06-21 01:37:08 +00:00
Tugsbayasgalan Manlaibaatar	d4b85f3031	Support params/buffers inside cond and map (#102310 ) With #102022, params and buffers are always treated as special case of free variables. In this PR, I switch cond and map implementation to the this method and deprecate the old tracing mechanism. Differential Revision: [D46746202](https://our.internmc.facebook.com/intern/diff/D46746202) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102310 Approved by: https://github.com/avikchaudhuri, https://github.com/zou3519	2023-06-20 05:33:10 +00:00
PyTorch MergeBot	2087d32811	Revert "Support params/buffers inside cond and map (#102310 )" This reverts commit `766f236bad`. Reverted https://github.com/pytorch/pytorch/pull/102310 on behalf of https://github.com/huydhn due to The test is failing in trunk `766f236bad` ([comment](https://github.com/pytorch/pytorch/pull/102310#issuecomment-1592159710))	2023-06-15 00:29:20 +00:00
Tugsbayasgalan Manlaibaatar	766f236bad	Support params/buffers inside cond and map (#102310 ) With #102022, params and buffers are always treated as special case of free variables. In this PR, I switch cond and map implementation to the this method and deprecate the old tracing mechanism. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102310 Approved by: https://github.com/avikchaudhuri, https://github.com/zou3519	2023-06-14 22:32:33 +00:00
David Berard	df83fe5bf7	[dynamo] graph break on nn.Parameter construction (#103262 ) Fixes #99569 nn.Parameter construction appears to run into FakeTensor / tracing issues during AOT Autograd. We could try to fix this; but nn.Parameter construction _inside_ the compiled region isn't a common scenario, so it's reasonable to just graph break on nn.Parameter construction. For reference, see #99569 for the errors/issues that appear from tracing through nn.Parameter construction with AOT Autograd. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103262 Approved by: https://github.com/williamwen42	2023-06-12 16:41:56 +00:00
Edward Z. Yang	5987c52082	Delete is_dynamic_shapes test (#103291 ) Simplified version of https://github.com/pytorch/pytorch/pull/102106/ Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103291 Approved by: https://github.com/voznesenskym	2023-06-10 01:27:14 +00:00
shibo19	c24b61bc20	Enable torch._C._get_privateuse1_backend_name in Dynamo tracing (#103141 ) Fixes https://github.com/pytorch/pytorch/issues/103125 torch._C._get_privateuse1_backend_name() will cause graph break, so I add it to the functions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103141 Approved by: https://github.com/yanboliang	2023-06-09 09:19:33 +00:00
gmagogsfm	b4f3a6f58f	[Dynamo Hackathon] Add support for hasattr on TorchVariable (#103177 ) Fixes #101154 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103177 Approved by: https://github.com/yanboliang	2023-06-08 19:34:44 +00:00
fduwjj	52e310f7a8	Enable torch.nn.init._calculate_correct_fan in dynamo tracing (#103182 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103182 Approved by: https://github.com/yanboliang	2023-06-08 06:13:49 +00:00
fduwjj	c454534d25	Enable torch.get_autocast_gpu_dtype in Dynamo tracing (#103166 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103166 Approved by: https://github.com/williamwen42, https://github.com/yanboliang	2023-06-07 21:31:45 +00:00
fduwjj	b5021ba981	Enable torch.is_complex in Dynamo tracing (#103154 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103154 Approved by: https://github.com/yanboliang	2023-06-07 20:56:46 +00:00
Mengwei Liu	c304fddf68	[dynamo][numpy] Support graph break for numpy ndarray (#100839 ) Issue: #93684 In previous PRs #95849 #99560 we redirect `numpy.`, `<tensor>.numpy()` calls to `torch_np.` methods and attributes, by creating `NumpyNdarrayVariable` for those calls. We need to handle `NumpyNdarrayVariable` when graph break happens. This PR did 2 things: 1. In `codegen.py` we made sure we can reconstruct the value wrapped by `NumpyNdarrayVariable`, to be `torch_np.ndarray` in the stack whenerver we recompiles the subgraph. 2. In `builder.py` we can wrap the value to be `NumpyNdarrayVariable` and save it as graph input. ----- Starting from commit 6: ## A new design for supporting numpy in dynamo In short the core concept doesn't change: we still convert `numpy` API calls to `torch_np` API calls. However, instead of wrapping a `torch_np.ndarray` in `NumpyNdarrayVariable`, the new design wraps a `torch.Tensor`. The reason for doing this change is because we need to keep `torch.Tensor` everywhere in the captured graph, so that it works well with the backend of dynamo. See discussions in https://github.com/Quansight-Labs/numpy_pytorch_interop/issues/142 for details. ### Flow This is an example showing how do we think about dynamo working on a simple function: ```python def f(x: torch.Tensor, y: torch.Tensor): a, b = x.numpy(), y.numpy() c = np.add(x, y) return torch.from_numpy(c) ``` ``` +------------+ +------------+ torch.Tensor \| \|numpy.ndarray\| \| -------------- .numpy() --------------\| \| \| \| \| \| +------------------+ +------------+ \| numpy.add \|numpy.ndarray\| \|torch.Tensor +------------+ \| --------------\| torch.from_numpy -------------- torch.Tensor \| \|numpy.ndarray\| \| \| \| -------------- .numpy() --------------\| \| +------------------+ \| \| \| \| +------------+ +------------+ +------------+ +----------------+ torch.Tensor \| \|torch.Tensor \| \| -------------- .detach() --------------\| \| \| \| \| \| +----------------+ +------------+ +------------+ \| \|torch_np.ndarray\| \|torch.Tensor\| \|torch.Tensor \| torch_np.add -----------------\| util.to_tensor -------------\| .detach() -------------- +------------+ \| \| \| \| \| \| torch.Tensor \| \|torch.Tensor \| \| +----------------+ +------------+ -------------- .detach() --------------\| \| \| \| \| \| +------------+ \| +----------------+ \| \| wrapper on torch_np.add \| +--------------------------------------------------------+ ``` ### Approach `torch_np` APIs can take both `torch_np.ndarray` as well as `torch.Tensor`. What we need to do is to have a wrapper for these APIs to convert the return value back to `torch.Tensor`. This way only the wrapper is showing up in the captured graph, with `torch.Tensor`s as input and `torch.Tensor` as output. If we have a graph break or we've traced to the end of the program, we need to inspect all the `NumpyNdarrayVariable` in the stack and convert them back to `numpy.ndarray`, to make sure the compiled version is still behaving the same as the eager version. ### Examples Here's an example of the graph generated: ```python def fn(x: np.ndarray, y: np.ndarray): a = x.real b = y.real torch._dynamo.graph_break() return np.add(a, 1), np.add(b, 1) ``` Graph generated: ``` [2023-05-16 10:31:48,737] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH __compiled_fn_0 <eval_with_key>.0 opcode name target args kwargs ------------- -------------- ---------------------------------------------------------- ---------------------- -------- placeholder l_x_ L_x_ () {} placeholder l_y_ L_y_ () {} call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_x_,) {} call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_y_,) {} call_function attr_wrapper <function attr_wrapper at 0x12e8693a0> (from_numpy, 'real') {} call_function attr_wrapper_1 <function attr_wrapper at 0x12e8693a0> (from_numpy_1, 'real') {} output output output ((),) {} [2023-05-16 10:31:48,908] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH __compiled_fn_2 <eval_with_key>.1 opcode name target args kwargs ------------- ------------- ---------------------------------------------------------- ------------------------------- -------- placeholder l_a_ L_a_ () {} placeholder l_b_ L_b_ () {} call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_a_,) {} call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_b_,) {} call_function wrapped_add <Wrapped function <original add>> (from_numpy, 1) {} call_function wrapped_add_1 <Wrapped function <original add>> (from_numpy_1, 1) {} output output output ((wrapped_add, wrapped_add_1),) {} ``` ### Changes * `codegen.py`: reconstruct `numpy.ndarray` from `NumpyNdarrayVariable` by adding bytecode to call `utils.to_numpy_helper()`. * `output_graph.py`: getting rid of legacy code that does exactly what `codegen.py` does, which only handling return case but not graph break case. * `utils.py`: added helpers to convert `numpy.ndarray` to `torch.Tensor` and vice versa. Also adding a wrapper class that takes in a function. In `__call__` it calls the function and converts its out to `torch.Tensor` (or a list of it). * `builder.py`: add method to wrap `numpy.ndarray` graph inputs into `NumpyNdarrayVariable`, by calling `torch.numpy` in the proxy. * `misc.py`: `numpy` API calls goes into `NumpyVariable` and we find the function with the same name in `torch_np` module, then wrap it with the wrapper defined in `utils.py`. * `tensor.py`, `torch.py`: proxy `tensor.numpy()` to be `torch.detach()` but wrap it with `NumpyNdarrayVariable`. Similarly, `torch.from_numpy()` -> `torch.detach()` but wrap it with `TensorVariable`. In `NumpyNdarrayVariable`, do the similar `torch_np.ndarray` to `torch.Tensor` wrapping for attributes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100839 Approved by: https://github.com/ezyang	2023-06-03 00:54:25 +00:00
Animesh Jain	c3c1496143	[dynamo][higher order op] Bugfixes to pass graph.lint (#102448 ) This PR ensures that the subgraphs use the newly created placeholder for the primary inputs and free variables. Earlier, this was not happening, and graph.lint() was failing. I need `graph.lint()` in the followup PRs where I run an `Interpreter` on the subgraph to preserve the metadata information to AOT Autograd. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102448 Approved by: https://github.com/zou3519	2023-05-31 00:29:29 +00:00
Will Constable	e344ff4113	Support dynamo tracing collectives with processgroup arg (#102222 ) Previously, other types of rank descriptors worked but pg caused dynamo to break down when tracing the internal func that converts pg to rank list. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102222 Approved by: https://github.com/wanchaol, https://github.com/voznesenskym	2023-05-27 03:01:49 +00:00
Animesh Jain	5d6810a4ee	[dynamo][higher order op] Support nn.Module calls (#102022 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102022 Approved by: https://github.com/zou3519	2023-05-24 21:39:58 +00:00
ydwu4	7e58891ca0	Support list output for HigherOrderOperators (#101986 ) Fixes the issue in #100278: support list output for HigherOrderOperator. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101986 Approved by: https://github.com/zou3519	2023-05-23 21:36:04 +00:00
Michael Voznesensky	4c1bc91f42	Support autograd.Function w/ grad (#99483 ) This PR adds support for tracing autograd.Function with grad. A few important bullet points outlining our approach: 1) Our goal is to verify soundness in order to add a call_function to the autograd.Function's `apply` to the graph. 2) We achieve (1) by either verifying soundness or rejecting soundness, by ensuring that both forward and backward of the autograd.Function are sound. 3) For the forward, if we verify soundness, we install its guards into the graph. 4) For the backward, if we verify soundness, we throw it out. However, backwards soundness verification is more onerous, and has a config driven set of banned attrs and methods for tensors. 1-4 above are achieved by turning the forward and backward into UserDefinedFunctionVariables, and inlining through them, relying on dynamo's soundness detection. If we graph break in these, we raise and treat them as unsound. As noted above, backwards is stricter yet. For the tracing, the safety comes from dynamo's HigherOrderOperator system. That system ensures that not only do we trace soundly, but that no new variables are lifted into inputs during the tracing, and that the forward and backwards are entirely self contained. Whenever we reject a function as unsound, we restore back, as usual. Due to some limitations in the lifting logic, we have an escape hatch we implemented for tensors that are known in forward, but cross into backwards through save_tensors (save) /saved_tensors (load). We escape hatch here to avoid having the known saved tensors coming from forward end up being accidentally treated as lifted variables (and rejected). This is sound, but a little hacky feeling. Additionally, due to some limitations in fx node removal, combined with how we produce subgraphs for the traces installed from HigherOrderOperators, we had to improve our node removal logic. In the event of a restore, we remove the old nodes from the graph, as usual in dynamo. However, because the references to these nodes may exist in subgraphs, we traverse any nodes users and remove them first if and only if they are in another graph. This is always sound, because removal should only be downstream of restoration at this point. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99483 Approved by: https://github.com/zou3519	2023-05-19 01:26:21 +00:00
Animesh Jain	2fa1b563da	[dynamo] Activation checkpoint higher order ops - Reland 101028 (#101790 ) https://github.com/pytorch/pytorch/pull/101028 was reverted due to internal breakage. Relanding. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101790 Approved by: https://github.com/zou3519	2023-05-18 19:09:14 +00:00
ydwu4	2e08c68564	Avoid cond prefix when naming subgraph of HigherOrderOperators (#101439 ) Fixes the issue in #100278: HigherOrderOperator body functions should not all be named "cond_body". Pull Request resolved: https://github.com/pytorch/pytorch/pull/101439 Approved by: https://github.com/zou3519	2023-05-16 21:28:40 +00:00
PyTorch MergeBot	d0db7d624d	Revert "[dynamo] Activation checkpointing as higher order op (#101028 )" This reverts commit `de15e740a1`. Reverted https://github.com/pytorch/pytorch/pull/101028 on behalf of https://github.com/jeanschmidt due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/101028#issuecomment-1548280970))	2023-05-15 17:47:08 +00:00
Animesh Jain	de15e740a1	[dynamo] Activation checkpointing as higher order op (#101028 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101028 Approved by: https://github.com/voznesenskym, https://github.com/zou3519	2023-05-12 03:17:41 +00:00
Yanbo Liang	8a20ea0a1f	[Dynamo] Fix torch.{cuda/cpu}.amp.autocast arguments binding bug (#101052 ) Fixes Meta internal user case. Repro: ``` import torch import torch._dynamo def fn(x): with torch.cuda.amp.autocast(False): x = torch.sin(x + 1) return x x = torch.randn([2, 3]) ref = fn(x) print(ref) opt_fn = torch._dynamo.optimize(backend="inductor")(fn) print(opt_fn(x)) ``` Error: ``` Traceback (most recent call last): File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/convert_frame.py", line 425, in _compile out_code = transform_code_object(code, transform) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/bytecode_transformation.py", line 1000, in transform_code_object transformations(instructions, code_options) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/convert_frame.py", line 410, in transform tracer.run() File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 2010, in run super().run() File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 703, in run and self.step() File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 663, in step getattr(self, inst.opname)(inst) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 385, in wrapper return inner_fn(self, inst) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 1095, in CALL_FUNCTION self.call_function(fn, args, {}) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 554, in call_function self.push(fn.call_function(self, args, kwargs)) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/variables/torch.py", line 381, in call_function return AutocastModeVariable.create(target_values=args, kwargs=kwargs) File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/variables/ctx_manager.py", line 198, in create bound_args = inspect.signature(torch.autocast).bind(target_values, *kwargs) File "/scratch/ybliang/work/env/lib/python3.9/inspect.py", line 3045, in bind return self._bind(args, kwargs) File "/scratch/ybliang/work/env/lib/python3.9/inspect.py", line 2984, in _bind raise TypeError( TypeError: multiple values for argument 'device_type' from user code: File "/scratch/ybliang/work/repos/debug/debug6.py", line 10, in fn with torch.cuda.amp.autocast(False): ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/101052 Approved by: https://github.com/anijain2305	2023-05-10 21:19:18 +00:00
Bartosz Szmelczynski	44e73da444	Extend assert statement to include ListVariable (#100841 ) Fixes #100697 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100841 Approved by: https://github.com/lezcano, https://github.com/anijain2305	2023-05-10 01:57:10 +00:00

1 2 3

131 Commits