pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Yanbo Liang	0dad85b402	[Dynamo] Fix torch.tensor call with tuple (#115713 ) Land #114383 on behalf of @ezyang since he is on recharge and this is an high priority issue. Fix #114231 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115713 Approved by: https://github.com/angelayi, https://github.com/voznesenskym	2023-12-13 04:08:12 +00:00
ydwu4	8a58af2a9f	[Reland][HigherOrderOp] make MapHigherOrder create map_impl (#115561 ) This is a reland of #115205, which gets reverted due to internal test failure. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115561 Approved by: https://github.com/angelayi	2023-12-12 20:45:01 +00:00
Bin Bao	19c67a9db5	[dynamo] Fix a closure cell empty error (#115541 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/97115. The solution given by @jansel in that issue works. Checking in the code so it won't get lost. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115541 Approved by: https://github.com/jansel	2023-12-12 00:01:51 +00:00
Yanbo Liang	274fdc81f8	[Dynamo][6.3/N] Further cleanup torch.py (#114669 ) A follow-up PR to clean up what I found during the refactor of torch.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/114669 Approved by: https://github.com/jansel	2023-12-11 07:16:03 +00:00
PyTorch MergeBot	6c1e75e646	Revert "[HigherOrderOp] make MapHigherOrder create map_impl call_function node instead of map (#115205 )" This reverts commit `8b74735878`. Reverted https://github.com/pytorch/pytorch/pull/115205 on behalf of https://github.com/atalman due to ghfirst broke internal tests ([comment](https://github.com/pytorch/pytorch/pull/115205#issuecomment-1848995376))	2023-12-10 15:25:55 +00:00
PyTorch MergeBot	08d63a75a4	Revert "[HigherOrderOp] Remove additional get item calls in MapHigherOrder. (#115207 )" This reverts commit `dd6ae6d3b4`. Reverted https://github.com/pytorch/pytorch/pull/115207 on behalf of https://github.com/atalman due to ghfirst broke internal tests ([comment](https://github.com/pytorch/pytorch/pull/115207#issuecomment-1848991919))	2023-12-10 15:12:12 +00:00
Michael Lazos	fbeca60b1f	Remove replace_all and make VTs mutable (#113725 ) 1. Removes calls to `replace_all` and `clone` and makes VTs mutable. 2. Properly handles Tuple Iterator mutation. Previously TupleIterator variables would only be properly reconstructed if they were advanced at least once in a frame. On calls to `next`, the source information would be lost (due to constructing a new iterator without using builder), which would ensure that during codegen the variable would be reconstructed from scratch. Now that VTs are mutated, the source is never lost, so we need to properly track mutation and handle it by replaying calls to `next` at the end of the modified bytecode. 3. Added test for checking iadd side effects, this was missing in our unit test coverage. 4. Fixed two incorrect sources, DelayGraphBreakVariable, and UserMethodVariable both relied on setting the source to AttrSource(parent, name) at the callsite of `var_getattr`. 5. Fixed a bug in inplace adding for lists, it would set the resulting VariableTracker's source to `None` which would utilize a different reconstruct path in codegen. Now this is handled explicitly by reconstructing vars when allow_cache=`False`, so that during side effect replay, the mutated var is correctly updated. In subsequent PRs: * Refactoring side effect tracking to be significantly simpler (I think we only need an `is_modified` flag) * Refactor `next_variables` iterator to match the signature of `next` * Remove all references to `options` in the code * Refactor VTs representing mutable collections to implement their own mutation update handling * Remove clone and/or make it specific to lists for creating slices * Add mutation tracking/replay for sets * Add mutation tracking/replay for iter.py * Removing setting source in builder (it's set at the top level after a var is returned) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113725 Approved by: https://github.com/jansel	2023-12-10 09:31:21 +00:00
Yanbo Liang	eb3aa424ce	[Reland][Dynamo] Added support for math.radians on ints with dynamic shapes (#115477 ) Reland #114507 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115477 Approved by: https://github.com/larryliu0820	2023-12-09 08:58:18 +00:00
Yanbo Liang	da341d0d48	[Dynamo][6.1/N] Refactor out TorchInGraphFunctionVariable and improve heuristic (#113432 ) This is splitted from #113009, please check https://github.com/pytorch/pytorch/pull/113009#issuecomment-1804417925 for more details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113432 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-12-09 05:11:44 +00:00
PyTorch MergeBot	e8e4141773	Revert "[Dynamo][6.1/N] Refactor out TorchInGraphFunctionVariable and improve heuristic (#113432 )" This reverts commit `e61d6b42f0`. Reverted https://github.com/pytorch/pytorch/pull/113432 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it is failing dynamo tests in trunk `e61d6b42f0`, landrace? ([comment](https://github.com/pytorch/pytorch/pull/113432#issuecomment-1847787981))	2023-12-08 20:15:39 +00:00
Michael Lazos	1c3a4a864c	Remove always restore (#115317 ) Removes always restore, assuming that a HOP will cleanup any leftover state from tracing fwd + bwd This required a minor change to the autograd fn variable higher order op. If we are tracing forward DON'T add the call_function node into the main graph, since we are only tracing it for the purposes of speculation. Instead return the result directly to be passed to the backward for speculation. This was the only observable side effect on the output graph that I found. Test plan: test_smoke_from_test_autograd in test_autograd_function.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/115317 Approved by: https://github.com/voznesenskym, https://github.com/jansel	2023-12-08 18:17:37 +00:00
Yanbo Liang	e61d6b42f0	[Dynamo][6.1/N] Refactor out TorchInGraphFunctionVariable and improve heuristic (#113432 ) This is splitted from #113009, please check https://github.com/pytorch/pytorch/pull/113009#issuecomment-1804417925 for more details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113432 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-12-08 17:15:14 +00:00
Iris Zhang (PyTorch)	23fa9621e4	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) (#115193 ) Summary: Rename _device_mesh.py to device_mesh.py, update all callsites, add documentation. We created stubs for public class and methods in torch.distributed.device_mesh so that torch.distributed.device_mesh can be imported with or without distributed is available(). Original diff reverted: D51629761 Original PR reverted: https://github.com/pytorch/pytorch/pull/115099 Prior to landing, CI signals are all passed. Shipit added the "ci/trunk" label to the PR and DID NOT wait for it and went ahead committing. More context can be found in the reverted PR above. Test Plan: CI. Differential Revision: D51861018 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115193 Approved by: https://github.com/fegin	2023-12-08 08:44:32 +00:00
voznesenskym	2c84616a94	Move the shape env symint cache to a symbol cache, better routing for subclass fakification [re-pr 115227] (#115396 ) * Context: Joel sees that unless he manually writes to the fake tensor memo, fakification seems to produce spurious symbols! Voz (me) objects, saying that not only is directly writing to memo a bad pattern, recursively invoking fakification on tensor subclass elements in dynamo should suffice! Joel says that while he morally agrees, he has a test proving otherwise, a most perplexing situation. Digging in, I figured out that while we were making fake tensors correctly, with properly cached symbols and the like, we were also incorrectly creating spurious symbols, leading the test to fail. Before this PR, we would only cache source->symint. This was generally fine, but meant that you would create a symbol, then potentially throw it out due to symint cache. For example, the cache hit flow was: make a symbol (ex: s2) -> use it to make a symint -> hit the cache (my_source-s1) Now, in this example, you have a symbol in your val_to_var/var_to_val (s2) that is unused. This is sound, but wasteful, and furthermore, misleading. This was causing a test added in a PR in this stack to fail, specifically, because the test was using ``` curr_var_to_val = { str(k): v for k, v in context.fake_mode.shape_env.var_to_val.items() } ```` To validate that no new symbols were being created (that is, that recursively creating fake tensors for subclasses was working). The test is correct, but the implementation of caching would make (by this method of observation) cache hits look like cache misses. So, the fix here is to move the cache up to be a general symbol cache, rather than only a cache for symints. The initial implementation did that! But then, it ran into some interesting errors when it came to replay. When replaying symbol creation, behaviors would diverge in the new shape env! How could that be? The answer is because creating a new shape_env resulted in us replaying symbol creation... but with a cache from a different shape env! This was short circuiting symbol creation - and so, adding an extra layer to the cache for id(shape_env) fixes the problem. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115396 Approved by: https://github.com/mlazos	2023-12-08 05:02:21 +00:00
Michael Lazos	18d57dde2d	Remove remaining uses of copy_graphstate (#115321 ) After auditing higher_order_ops.py, the graph checkpoints were only getting used in the event of an exception, so it is safe to remove because we restart analysis in this case now. To make this clearer the current state is the following: Checkpoint side effects Capture subgraph if graph break: restore as usual else: throw away inlining translator and subgraph tracer Restore side effects This will change to the following after this change: Checkpoint side effects Capture subgraph: if graph break: restart analysis else: throw away inlining translator and subgraph tracer Restore side effects Pull Request resolved: https://github.com/pytorch/pytorch/pull/115321 Approved by: https://github.com/jansel, https://github.com/zou3519	2023-12-07 22:35:02 +00:00
ydwu4	dd6ae6d3b4	[HigherOrderOp] Remove additional get item calls in MapHigherOrder. (#115207 ) As titled, this PR removes the unnessecary getitem call from the graph that's manipulated in MapHigherOrder, where we want to get the first dim slice of original tensor for specualtion but using call_method will accidentally create a get_item call in the graph, so want to avoid it by calling unpack_var_sequence on input tensor. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115207 Approved by: https://github.com/yanboliang ghstack dependencies: #115115, #115204, #115205	2023-12-07 17:06:44 +00:00
ydwu4	8b74735878	[HigherOrderOp] make MapHigherOrder create map_impl call_function node instead of map (#115205 ) We want to remove the map_wrapper and replace it with dynamo always on. This is the first step of this plan. In this PR, we make dynamo directly generates a map_impl nodes. This hasn't touch the eager logic yet. So the execution path after this PR looks like 1. `dynamo -> map_impl` when torch.compile is on. (Before this PR, it's `dynamo -> map_wrapper -> map_impl` and 2. `map_wrapper -> map_impl` (This PR did't touch the logic here). The added TODO(yidi) is addressed in the following pr. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115205 Approved by: https://github.com/yanboliang ghstack dependencies: #115115, #115204	2023-12-07 17:06:44 +00:00
ydwu4	be3efbebb6	[HigherOrderOp] make MapHigherOrder use should_flatten_output=True (#115204 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115204 Approved by: https://github.com/yanboliang ghstack dependencies: #115115	2023-12-07 17:06:35 +00:00
ydwu4	998c87f93c	[BE][HigherOrderOp] extract redundant code that unflattens the output (#115115 ) We need this function to unflatten the variable tracker for HOPs that want pytree output support, e.g. map. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115115 Approved by: https://github.com/yanboliang	2023-12-07 17:06:28 +00:00
Michael Lazos	3c882925da	Make subclass type instances constants (like UserDefinedClasses) (#115323 ) As title Pull Request resolved: https://github.com/pytorch/pytorch/pull/115323 Approved by: https://github.com/oulgen	2023-12-07 08:10:59 +00:00
Joel Schlosser	3a18211622	Guard on subclass inner tensors (#114965 ) This PR introduces guarding on subclass inner tensors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114965 Approved by: https://github.com/voznesenskym ghstack dependencies: #114311, #115212	2023-12-07 01:47:48 +00:00
Jon Chuang	83cb6a75ad	[dynamo] add list iterator contains (#115237 ) Fixes https://github.com/pytorch/pytorch/issues/115236 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115237 Approved by: https://github.com/jansel	2023-12-06 22:26:16 +00:00
rzou	67c8ad7285	Fix autograd.Function x enum input x torch.compile (#115206 ) Fixes https://github.com/pytorch/pytorch/issues/114777. We treat Enums like we do ConstantVariable. Test Plan: New test Pull Request resolved: https://github.com/pytorch/pytorch/pull/115206 Approved by: https://github.com/yanboliang ghstack dependencies: #115185, #115186, #115187	2023-12-06 15:18:25 +00:00
Jason Ansel	f4c67ffff4	[dynamo] Improve support for dynamic shapes str.format and _assert (#115203 ) This removes a graph break in vision_maskrcnn. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115203 Approved by: https://github.com/yanboliang	2023-12-06 04:54:45 +00:00
rzou	b0b190f7c0	More descriptive error message for unsupported inputs to HOP (#115187 ) Test Plan: See updated tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/115187 Approved by: https://github.com/ydwu4, https://github.com/yanboliang ghstack dependencies: #115185, #115186	2023-12-06 01:29:03 +00:00
rzou	b5b011a5cd	Expand input types for HOPs that use manually_set_subgraph_inputs=False (#115186 ) Previously we only supported Tensor, Constants, and SymNode. We lift that restriction (there's not really a good reason for it). HOPs like torch.cond, torch.map already do input validation (those are the ones that can only support Tensor, Constant, and SymNode inputs). Test Plan: New test for `wrap`, which is a HOP that has manually_set_subgraph_inputs=False Pull Request resolved: https://github.com/pytorch/pytorch/pull/115186 Approved by: https://github.com/ydwu4, https://github.com/yanboliang ghstack dependencies: #115185	2023-12-06 01:29:03 +00:00
rzou	bc46347152	Refactor how HOPs create new args to subgraphs (#115185 ) This PR combines the logic for Tensor and SymNode. Test Plan: - Existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/115185 Approved by: https://github.com/ydwu4, https://github.com/yanboliang	2023-12-06 01:29:03 +00:00
Yanbo Liang	4620170008	[Dynamo] Revert multiple PRs since they triggered compilation stuck internally (#115126 ) Revert the following PRs to mitigate internal compilation stuck: #113432 #114016 #114507 #114196 #114739 #114669 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115126 Approved by: https://github.com/xush6528	2023-12-05 22:35:37 +00:00
Joel Schlosser	22704426c3	Expand dynamic dims support for traceable subclasses (#114311 ) Continuation of #112185, following the design in this [doc](https://docs.google.com/document/d/1ipSxcTzEMMOAPvxP-YJlD5JBZZmIGgh8Q34ixtOUCRo). Summary: * Introduce `SubclassSymbolicPolicy` containing separate dynamic dim / constraint policies for the outer and inner tensors * Expand the automatic dynamic algorithm to recurse into inner tensors and produce one of these for a subclass instance * Maintain legacy behavior for subclasses by recursively calling `mark_dynamic()` on inner tensors of the same dim as outer when `mark_dynamic(outer, ...)` is called * Addresses this: `6a86cf00ad/torch/_dynamo/variables/builder.py (L1750)` * Add `outer_size` and `outer_stride` arguments to `__tensor_unflatten__()` so that you can find out what symbols were allocated for the outer size / stride (you are expected to return a tensor that compares equal to the outer symbols) * Signatures now: ```python # attrs is a list of inner tensor attributes on x; inner_tensor = getattr(x, attr) # ctx is anything useful for rebuilding the class we want to guard on attrs, ctx = x.__tensor_flatten__() ... # inner_tensors is a dict of {attr -> tensor} # ctx is taken unmodified from flattening and (eventually) guarded on # outer_size is the expected size of the output; possibly symbolic # outer_stride is the expected strides of the output; possibly symbolic y = MySubclass.__tensor_unflatten__(inner_tensors, ctx, outer_size, outer_stride) # at the __tensor_unflatten__() call-site in PT2, we assert y.shape == outer_size and y.stride() == outer_stride # the assert simplifies symbols when there are relationships between outer and inner symbols ``` * Size info needed for `NestedTensor` at least, stride info needed for `DTensor` at least * Punting on `outer_storage_offset` because storage_offset handling is horribly broken in PT2 right now * ~~Add new `__tensor_mark_dynamic__()` to allow overriding the behavior of mark_dynamic on a per-subclass basis~~ (booted to future work) * ~~Add guards for tensor subclasses by calling `__tensor_flatten__()` in the guard to test equality on `ctx`~~ * Now handled in #114469 * Next PR: add TENSOR_MATCH guards on inner tensors Pull Request resolved: https://github.com/pytorch/pytorch/pull/114311 Approved by: https://github.com/ezyang, https://github.com/drisspg, https://github.com/voznesenskym, https://github.com/bdhirsh	2023-12-05 21:09:25 +00:00
Jason Ansel	4b8ddbbc7e	[dynamo] Improve graph break message for copy.deepcopy (#115120 ) I was curious what hf_T5_generate was trying to deepcopy, so I updated the errror message: Before: ``` STATS graph_break ("'skip function deepcopy in file /home/jansel/conda/envs/pytorch/lib/python3.10/copy.py'', skipped according skipfiles.SKIP_DIRS'", 3) ... ``` After: ``` STATS graph_break ('copy.deepcopy UserDefinedObjectVariable(GenerationConfig)', 3) ... ``` Related issue: #115122 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115120 Approved by: https://github.com/oulgen ghstack dependencies: #115095, #115046, #115057, #115119	2023-12-05 19:01:31 +00:00
Jason Ansel	522bae20df	[dynamo] Support any() on SymNodeVariable (#115119 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115119 Approved by: https://github.com/yanboliang ghstack dependencies: #115095, #115046, #115057	2023-12-05 19:01:31 +00:00
Jason Ansel	88642d44d9	[dynamo] Add RestrictedListSubclassVariable (#115057 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115057 Approved by: https://github.com/yanboliang ghstack dependencies: #115095, #115046	2023-12-05 19:01:23 +00:00
Jason Ansel	a97ed2470a	[dynamo] Support hasattr on dataclass (#115046 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115046 Approved by: https://github.com/yanboliang ghstack dependencies: #115095	2023-12-05 19:01:14 +00:00
Nikita Shulga	a827ac71f2	Revert "[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 )" This reverts commit `eaa64339d6`.	2023-12-05 08:59:36 -08:00
Iris Zhang (PyTorch)	eaa64339d6	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#115099 ) Summary: Rename _device_mesh.py to device_mesh.py, update all callsites, adds documentation. Original diff reverted: D51629761 Original PR reverted: https://github.com/pytorch/pytorch/pull/114991 It was failing because failing a public module binding tests in MacOS, and this is due to the change in import order for torch/distributed/fsdp/_common_utils.py. Since this original import would still work, we remove the changes in this file. Test Plan: CI. Differential Revision: D51825114 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115099 Approved by: https://github.com/wanchaol, https://github.com/fegin	2023-12-05 05:44:52 +00:00
Jason Ansel	3d0bbb24a1	[dynamo] Improve support for list subclasses (#115052 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115052 Approved by: https://github.com/oulgen, https://github.com/eellison ghstack dependencies: #114830, #115047, #115048	2023-12-05 01:31:33 +00:00
Jason Ansel	fe690f430a	[dynamo] Fix dict.get with no default (#115048 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115048 Approved by: https://github.com/eellison, https://github.com/oulgen ghstack dependencies: #114830, #115047	2023-12-05 01:31:33 +00:00
Yanbo Liang	8ef44e6110	[autograd.Function] Fix torch.compile w/ once_differentiable leads to opaque graph break (#113625 ) Fixes #106893 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113625 Approved by: https://github.com/zou3519	2023-12-04 21:37:06 +00:00
Jason Ansel	a70c85ce90	[dynamo] Improve support for inspect.signature().parameters (#115047 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115047 Approved by: https://github.com/oulgen ghstack dependencies: #114830	2023-12-04 19:08:36 +00:00
Xuehai Pan	3fbfa8cd0a	[dynamo] support `dict.copy()` / `OrderedDict.copy()` / `defaultdict.copy()` (#115012 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115012 Approved by: https://github.com/jansel ghstack dependencies: #115010, #115011	2023-12-04 01:50:10 +00:00
Xuehai Pan	917a52d2a2	[dynamo] support `dict.update(seq2)` / `OrderedDict.update(seq2)` / `defaultdict.update(seq2)` (#115011 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/115011 Approved by: https://github.com/jansel ghstack dependencies: #115010	2023-12-04 01:50:10 +00:00
Xuehai Pan	2e8ac5ea93	[dynamo] support `dict.fromkeys()` / `OrderedDict.fromkeys()` / `defaultdict.fromkeys()` (#115010 ) Add support for `dict.fromkeys`, `OrderedDict.fromkeys`, and `defaultdict.fromkeys`. Fixes #114963 - #114963 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115010 Approved by: https://github.com/jansel	2023-12-04 01:49:59 +00:00
Tugsbayasgalan Manlaibaatar	7f49603ed3	Fix https://github.com/pytorch/pytorch/issues/114899 (#114985 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/114985 Approved by: https://github.com/ydwu4	2023-12-03 05:24:02 +00:00
PyTorch MergeBot	3a2e2044cd	Revert "[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#114710 ) (#114991 )" This reverts commit `729ac7317a`. Reverted https://github.com/pytorch/pytorch/pull/114991 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/114991#issuecomment-1837214567))	2023-12-02 17:55:51 +00:00
Iris Zhang (PyTorch)	729ac7317a	[DeviceMesh] Rename _device_mesh.py to device_mesh.py to prepare for beta (#114710 ) (#114991 ) Summary: Same content of changes as https://github.com/pytorch/pytorch/pull/114710 Rename _device_mesh.py to device_mesh.py, update all callsites, adds documentation. ghstack-source-id: 208980207 exported-using-ghexport Test Plan: CI. Reviewed By: wanchaol Differential Revision: D51629761 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114991 Approved by: https://github.com/wanchaol, https://github.com/fduwjj, https://github.com/fegin	2023-12-02 04:39:41 +00:00
voznesenskym	4cfe997490	[dynamo] handle setting .data on a tensor (#113080 ) Dynamo We don't want setattr in the graph. Setting data has interesting implications on both aliasing and on the autograd engine. The safe recipe is: 1) Disable grad 2) Call set_() 3) Manually lower the version counter on the object to hide it from the autograd engine This is effectively the same exact thing as setting .data, and it composes properly with aot_autograd and inductor. aot_autograd For aot_autograd, there's another snag. Specifically, when we invoke aot_autograd, we call `fake_mode.from_tensor()`, relying on memo to get the right tensor out. For .data mutations, this doesn't work, because the memoized fake_tensor is in the state it will be in at the end of the trace, not at the beginning. This means that the .data call is already applied, and the tensor shape (as in the case of these tests) mismatches. aot_autograd produces an invalid graph, with illegal calls like `torch.ops.aten.view.default(primals_2, [0])` where primals is actually sized `([6])` on input. The new plan here is to: 1) Record tensor fakification policy in dynamo 2) provide a fresh fake mode to all backends 3) Invoke from_tensor with the stored policy to get fresh new fake tensors in aot_autograd Pull Request resolved: https://github.com/pytorch/pytorch/pull/113080 Approved by: https://github.com/bdhirsh	2023-12-02 00:35:44 +00:00
David Berard	3fc58a6bbe	Revert "Make offsets dynamic by default (#113734 )" (#114889 ) This reverts commit `7c38b76efe`. if a graph has a lot of inputs which are views (with nonzero storage offset), then the check for overlapping tensor views will add a lot of guards (n^2?) `b35ca2cb94/torch/_functorch/_aot_autograd/input_output_analysis.py (L256-L260)` this was causing very slow compilations on an internal model. Differential Revision: [D51733774](https://our.internmc.facebook.com/intern/diff/D51733774) Pull Request resolved: https://github.com/pytorch/pytorch/pull/114889 Approved by: https://github.com/ckluk2, https://github.com/YuqingJ, https://github.com/aaronenyeshi	2023-12-01 16:49:42 +00:00
Yanbo Liang	ab5385fc50	[Dynamo][6.3/N] Further cleanup torch.py (#114669 ) A follow-up PR to clean up what I found during the refactor of torch.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/114669 Approved by: https://github.com/jansel	2023-12-01 04:08:29 +00:00
Yanbo Liang	7f40640342	[Dynamo] Support torch.amp.autocast as decorator (#114845 ) Fixes #114818 Pull Request resolved: https://github.com/pytorch/pytorch/pull/114845 Approved by: https://github.com/jansel	2023-11-30 23:54:57 +00:00
vfdev	f93ea14309	[dynamo] Added support for math ops on ints with dynamic shapes (#114507 ) Fixes #114218 ``` import math import torch def func(x, a): b = math.floor(a + 0.5) b = math.radians(a) + b y = x + b return y cfunc = torch.compile(func, dynamic=True, fullgraph=True, backend="eager") x = torch.tensor([0, 1, 2, 3], dtype=torch.float32) a = 12 out = cfunc(x, a) ``` ``` [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] TRACED GRAPH [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] ===== __compiled_fn_0 ===== [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] <eval_with_key>.0 class GraphModule(torch.nn.Module): [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] def forward(self, L_a_ : torch.SymInt, s1 : torch.SymInt, L_x_ : torch.Tensor): [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] l_a_ = L_a_ [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] l_x_ = L_x_ [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:7, code: b = math.floor(a + 0.5) [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add = l_a_ + 0.5 [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] floor = math_floor(add); add = None [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: /pytorch/torch/_dynamo/polyfill.py:28, code: return math.pi / 180.0 * x [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] mul = 0.017453292519943295 * l_a_; l_a_ = None [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:9, code: b = math.radians(a) + b [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] add_1 = mul + floor; mul = floor = None [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] # File: check_math_ops.py:13, code: y = x + b [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] y = l_x_ + add_1; l_x_ = add_1 = None [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] return (y,) [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] [2023-11-29 18:10:08,385] [0/0] torch._dynamo.output_graph.__graph_code: [DEBUG] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/114507 Approved by: https://github.com/lezcano	2023-11-30 14:11:57 +00:00

1 2 3 4 5 ...

868 Commits