**Motivation:**
We are trying to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle closures.
Before this PR, the following code only produced a warning unless the raise_on_backend_change config was turned on (turning it on upgrades the warning to an error):
```python
def foo():
    ...

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(...)

# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(...)
```
This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True).
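As a minimal sketch of the observable behavior (assuming both backends are available in the environment):
```python
import torch

def foo(x):
    return x.sin()

x = torch.randn(4)
# First compile caches code guarded on the "eager" backend.
torch.compile(foo, backend="eager")(x)
# The BACKEND_MATCH guard fails once the backend changes, so this call
# recompiles for "inductor" instead of reusing the "eager" cache entry.
torch.compile(foo, backend="inductor")(x)
```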
**Implementation:**
1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with the cached code. We also remove the raise_on_backend_change flag.
2. The newly added context manager and guard add more lines to the debug log, so we raise the upper limit from 50 to 55.
**Test Plan:**
Removed the original tests that raise on a backend change and added a new test checking whether the BACKEND_MATCH guard can guard against backend changes.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337
Approved by: https://github.com/jansel
Currently numel only supports static shapes, but this PR expands it to support generating the symbolic arithmetic into the graph, e.g.
```
# x.size().numel with x.size() = [s0, 1, s1]
size = l_x_.size()
getitem = size[0]
getitem_2 = size[2]; size = None
mul = getitem * getitem_2; getitem = getitem_2 = None
```
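A user-level sketch of the kind of code this enables (assuming dynamic shapes are enabled; the exact graph produced may differ):
```python
import torch

@torch.compile(dynamic=True, backend="eager")
def f(x):
    # x.size().numel() is now traced as symbolic arithmetic (e.g. s0 * s1)
    # instead of requiring static sizes.
    return x.new_ones(x.size().numel())

f(torch.randn(3, 1, 5))
```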
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108239
Approved by: https://github.com/ezyang
**Background**: "TorchDynamo Cache Lookup" events appear in traces to indicate a dynamo cache lookup; it's useful to check when cache lookups are taking a long time. To add a profiler event, one can use the `torch.profiler.record_function` context manager, or the C++ equivalent. Previously, the python version was used; first, when the profiler was enabled, callbacks for record_function_enter and record_function_exit were registered; then those would be called before and after every cache lookup.
**This PR**: Instead of calling the python bindings for `torch.profiler.record_function`, directly call the C++ implementation. This simplifies a lot of the code for binding C/C++. It also improves performance; previously there was a lot of overhead in the "TorchDynamo Cache Lookup" event, making the event artificially take a long time. After this change the events now appear shorter, because there's less overhead in starting/stopping the event: in other words, the profiler no longer distorts the results as much.
**Performance results**:
I ran the script below on a CPU-only 1.6 GHz machine. I report the median time (from 100 measurements) of a "TorchDynamo Cache Lookup" event before and after this PR. I think it is reasonable to consider the difference to be due to a reduction in overhead.
<details>
<summary>Benchmarking script</summary>
```python
import torch

def fn(x, y):
    return (x * y).relu()

a, b = [torch.rand((4, 4), requires_grad=True) for _ in range(2)]

opt_fn = torch.compile(fn)
opt_fn(a, b)
opt_fn(a, b)

with torch.profiler.profile() as prof:
    opt_fn(a, b)
```
</details>
Median before PR: 198-228 us (median of 100, measured 5 times)
Median after PR: 27 us
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108436
Approved by: https://github.com/anijain2305, https://github.com/jansel
In https://github.com/pytorch/pytorch/pull/106673 , I created a private API `_debug_get_cache_entry_list` to help pull out cache entries from compiled functions.
Recently, I found that @anijain2305 commented in the code that this API should be revisited, so I created this PR.
First, this API cannot be removed even though cache entries are now a first-class Python class, `torch._C._dynamo.eval_frame._CacheEntry`. Because `extra_index` is static and `get_extra_state` is inline static, they are not accessible elsewhere, so `_debug_get_cache_entry_list` remains the only way for users to get all the cache entries from a code object.
Second, since `torch._C._dynamo.eval_frame._CacheEntry` is a Python class, I simplified the C-side code and removed the need to create a namedtuple for it in the Python code.
Third, I also added a small improvement: if the argument is a function, we automatically pass its `__code__` to the API.
These changes slightly alter the output, from a list of namedtuples to a list of `torch._C._dynamo.eval_frame._CacheEntry` objects. I will update the corresponding docs that use this API.
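A hedged usage sketch, assuming the helper still lives in `torch._dynamo.eval_frame`:
```python
import torch
from torch._dynamo.eval_frame import _debug_get_cache_entry_list

def fn(x):
    return x + 1

opt_fn = torch.compile(fn, backend="eager")
opt_fn(torch.randn(3))

# The helper now accepts the function itself (its __code__ is extracted
# automatically) and returns _CacheEntry objects instead of namedtuples.
entries = _debug_get_cache_entry_list(fn)
print(entries)
```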
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108335
Approved by: https://github.com/jansel, https://github.com/anijain2305
# Summary
## PR Dependencies
I don't use ghstack :( and this is a PR where it would have been helpful. That being said, I am going to peel off some PRs to make reviewing this easier:
- [x] Separate build flags for Flash and MemEff: #107985
### Description
This pull request updates _scaled_dot_product_flash_attention from version 1 to version 2. The changes are based on the flash attention code originally authored by @tridao.
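For context, a hedged sketch of exercising the flash path through SDPA (this assumes a CUDA sm80+ device, per the build guards listed below):
```python
import torch
import torch.nn.functional as F

q, k, v = (torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16) for _ in range(3))
# Restrict SDPA to the flash kernel so the FlashAttention-2 path is exercised.
with torch.backends.cuda.sdp_kernel(enable_flash=True, enable_math=False, enable_mem_efficient=False):
    out = F.scaled_dot_product_attention(q, k, v)
```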
### Changes Made
The majority of the changes in this pull request involve:
- Copying over the flash_attention sources.
- Updating header files.
- Removing padding and slicing code from within the flash_attention kernel and relocating it to the composite implicit region of the SDPA. This was needed to make the kernel functional and appease autograd.
- Introducing a simple kernel generator to generate different instantiations of the forward and backward flash templates.
- Adding conditional compilation (ifdef) to prevent building when nvcc is invoked with gencode < sm80.
- Introducing a separate dependent option for mem_eff_attention, as flash_attention v2 lacks support for Windows and cannot be built for sm50 generation codes.
- Modifying build.sh to reduce parallelization on sm86 runners and to lower the maximum parallelization on the manywheel builds. This adjustment was made to address out-of-memory issues during the compilation of FlashAttentionV2 sources.
- Adding/Updating tests.
### Notes for Reviewers
This is not a fun review, and I apologize in advance.
Most of the files changed are in the flash_attn/ folder. The only files of interest here, IMO, are:
- aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp
- aten/src/ATen/native/transformers/cuda/flash_attn/kernels/generate_kernels.py ( this has been incorporated upstream to flash-attention github)
There are a number of files all related to avoiding OOMs in CI/CD. These are typically shell scripts.
### Follow up items
- Include the updates from e07aa036db and 9e5e8bc91e | https://github.com/pytorch/pytorch/issues/108108
### Work Items
- [x] I don't think Windows will be supported for 3.1.0 - need to update CMake
- [x] Let multi_query/attention pass through and test | UPDATE: I have the fast path implemented here: https://github.com/pytorch/pytorch/pull/106730 but since this will require changes to semantics of math to call repeat_interleave, I think this should be done as a followup.
- [x] Had to drop cutlass back to 3.0.0 to get it to compile. Need to figure out how to upgrade to 3.1.0 and later. Spoke with Tri and he is going to be taking a look. Note: compiling with clang currently errors for the cute headers.
- [x] Update tests to exercise the above codepath
- [x] Still need to disable on seq_len % 128 != 0 for backward (Tri beat me to it: a4f148b6ab)
- [x] Add determinism warning to BWD, Tri got to this one as well: 1c41d2b
- [x] Update dispatcher to universally prefer FlashV2
- [x] Update tests to exercise new head_dims
- [x] Move the head_dim padding from kernel to top level composite implicit function in order to make it purely functional
- [x] Create template generator script
- [x] Initial cmake support for building kernels/ folder
- [x] Replay CudaGraph changes
### Results
#### Forward only
The TFLOPs reported here are on an A100 that is underclocked.

#### Forward+Backward
Ran a sweep and for large compute bound sizes we do see a ~2x performance increase for forw+back.
<img width="1684" alt="Screenshot 2023-07-20 at 3 47 47 PM" src="https://github.com/pytorch/pytorch/assets/32754868/fdd26e07-0077-4878-a417-f3a418b6fb3b">
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105602
Approved by: https://github.com/huydhn, https://github.com/cpuhrsch
This pattern shows up in torchrec KeyedJaggedTensor. Most of the change in this PR is mechanical: whenever we failed an unbacked symint test purely due to error checking, we replace the conditional with something that calls expect_true (e.g., torch._check or TORCH_SYM_CHECK).
Some of the changes are a bit more nuanced, I've commented on the PR
accordingly.
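A minimal sketch of the mechanical rewrite (illustrative names, not the torchrec code):
```python
import torch

def take_prefix(x, n):
    # Before: a plain `if n < 0: raise ...` forces the unbacked SymInt `n`
    # to be evaluated and fails; torch._check records the condition as a
    # runtime assert via expect_true instead.
    torch._check(n >= 0, lambda: "n must be non-negative")
    return x.narrow(0, 0, n)
```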
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106788
Approved by: https://github.com/lezcano
ghstack dependencies: #106720
RFC: https://github.com/pytorch/rfcs/pull/54
First commit is the contents of https://github.com/Quansight-Labs/numpy_pytorch_interop/
We have already been using this in core for the last few months as a external dependency. This PR pulls all these into core.
In the next commits, I do a number of things, in this order:
- Fix a few small issues
- Make the tests that this PR adds pass
- Bend backwards until lintrunner passes
- Remove the optional dependency on `torch_np` and simply rely on the upstreamed code
- Fix a number of dynamo tests that were passing before (they were not testing anything, I think) and are not passing now.
Missing from this PR (but not blocking):
- Have a flag that deactivates tracing NumPy functions and simply breaks. There used to be one, but it stopped working after the merge and I removed it. @lezcano to investigate.
- https://github.com/pytorch/pytorch/pull/106431#issuecomment-1667079543. @voznesenskym to submit a fix after we merge.
All the tests in `tests/torch_np` take about 75s to run.
This was joint work by @ev-br, @rgommers, @honno, and me. I did not create this PR via ghstack (which would have been convenient) because this is a collaboration, and ghstack doesn't allow for shared contributions.
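A small sketch of the kind of program dynamo can now trace through the upstreamed layer (assuming NumPy tracing is enabled):
```python
import numpy as np
import torch

@torch.compile(backend="eager")
def f(x):
    # numpy.* calls are traced via the compatibility layer instead of
    # causing a graph break.
    a = x.numpy()
    return torch.from_numpy(np.add(a, 1))

f(torch.ones(3))
```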
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106211
Approved by: https://github.com/ezyang
This PR makes Z3 expressions easier to read and understand by creating a custom printer
for them.
Z3 expressions can be printed in 2 forms:
1. Using the builtin `str(e)` function
2. Using the `e.sexpr()` method
The problem is that (1) is a bit hard to read because its line breaks are not very intuitive. (2) is a bit nicer, but the `to_int` and `to_real` functions clutter things up.
The custom printer is an improved `sexpr()` function:
- Keeps everything on one line
- Gets rid of the `to_int` and `to_real` functions
- Reconstructs floor division operations
- Merges chains of commutative operations
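For reference, a hedged illustration of the builtin printers that this custom printer improves on:
```python
import z3

a, b = z3.Ints("a b")
e = z3.ToReal(a) / z3.ToReal(b) >= 1
print(str(e))     # builtin printer; line breaks get hard to read on large expressions
print(e.sexpr())  # s-expression form, but to_real conversions clutter the output
```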
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106643
Approved by: https://github.com/ezyang
When inlining a function that loads a closure, its direct parent may not load that closure, so we cannot find the closure name in the parent's symbolic locals. In this PR, we fix it by recursively searching the parent instruction translator stack to resolve the closure.
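A minimal sketch of the recursive lookup, assuming a `parent` link between translators (illustrative names, not the actual dynamo code):
```python
def resolve_closure_cell(tx, name):
    # Walk up the chain of inlining instruction translators until one of
    # them actually has the closure name in its symbolic locals.
    while tx is not None:
        if name in tx.symbolic_locals:
            return tx.symbolic_locals[name]
        tx = getattr(tx, "parent", None)
    raise KeyError(name)
```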
**Background**
When developing https://github.com/pytorch/pytorch/pull/105679, this corner case was triggered. A small repro is added in the test of this PR, where `outer` is loaded by `deep2` but not by `deep`.
```python
def test_inline_closure_not_loaded_by_parent(self):
    def outer(a):
        return a + 1

    def indirect(x):
        return direct(x)

    def direct(x):
        def deep2(c):
            return outer(c)

        def deep(c):
            return deep2(c)

        return deep(x)

    x = torch.randn(3)
    eager = indirect(x)
    counter = CompileCounter()
    compiled = torch._dynamo.optimize(counter)(indirect)(x)
```
Running the test, we have the following error before the PR:
```
Traceback (most recent call last):
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6584, in test_inline_closure_not_loaded_by_parent
compiled = torch._dynamo.optimize(counter)(indirect)(x)
File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 321, in _fn
return fn(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 481, in catch_errors
return callback(frame, cache_size, hooks, frame_state)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 543, in _convert_frame
result = inner_convert(frame, cache_size, hooks, frame_state)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 130, in _fn
return fn(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 362, in _convert_frame_assert
return _compile(
File "/home/yidi/local/pytorch/torch/_dynamo/utils.py", line 194, in time_wrapper
r = func(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 531, in _compile
raise InternalTorchDynamoError(str(e)).with_traceback(e.__traceback__) from None
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 432, in _compile
out_code = transform_code_object(code, transform)
File "/home/yidi/local/pytorch/torch/_dynamo/bytecode_transformation.py", line 1028, in transform_code_object
transformations(instructions, code_options)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 417, in transform
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2067, in run
super().run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 261, in call_function
return super().call_function(tx, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2227, in inline_call_
sub_locals, closure_cells = func.bind_args(parent, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 471, in bind_args
result[name] = parent.symbolic_locals[name]
torch._dynamo.exc.InternalTorchDynamoError: outer
from user code:
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6570, in indirect
return direct(x)
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6579, in direct
return deep(x)
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6577, in deep
return deep2(c)
Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information
You can suppress this exception and fall back to eager by setting:
import torch._dynamo
torch._dynamo.config.suppress_errors = True
To execute this test, run the following from the base repo dir:
python test/dynamo/test_misc.py -k test_inline_closure_not_loaded_by_parent
This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
---------------------------------------------------------------------------------------------------------------------------- Captured stdout call -----------------------------------------------------------------------------------------------------------------------------
frames [('total', 1)]
inline_call []
---------------------------------------------------------------------------------------------------------------------------- Captured stderr call -----------------------------------------------------------------------------------------------------------------------------
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping helper /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping enable_dynamic /home/yidi/local/pytorch/torch/_dynamo/eval_frame.py
[2023-08-02 15:48:36,561] torch._dynamo.symbolic_convert: [INFO] Step 1: torchdynamo start tracing indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569
TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569
def indirect(x):
[2023-08-02 15:48:36,591] torch._dynamo.variables.builder: [DEBUG] wrap_to_fake L['x'] (3,) [<DimDynamic.STATIC: 2>] [None]
TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6570
return direct(x)
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF direct []
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [UserFunctionVariable()]
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572>
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6572 (inline depth: 1)
def direct(x):
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6573 (inline depth: 1)
def deep2(c):
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE outer []
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [InlinedClosureVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep2 at 0x7fbe4d3666b0, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6573> [TupleVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep2 [TupleVariable(), ConstantVariable(code)]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF deep2 [NestedUserFunctionVariable()]
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 1)
def deep(c):
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE deep2 []
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576> [TupleVariable()]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep [TupleVariable(), ConstantVariable(code)]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST deep [NestedUserFunctionVariable()]
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6579 (inline depth: 1)
return deep(x)
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST deep []
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [NestedUserFunctionVariable()]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576>
TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 2)
def deep(c):
TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6577 (inline depth: 2)
return deep2(c)
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF deep2 []
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST c [NestedUserFunctionVariable()]
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576>
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572>
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
```
Test Plan:
Add a new test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106491
Approved by: https://github.com/williamwen42, https://github.com/jansel, https://github.com/zou3519
Fix: #105074
This PR makes dynamo handle NumPy global variables the same way as PyTorch tensor global variables, by tracking them as side effects.
In summary, we add `NumpyNdarrayVariable` to the
`VariableBuilder._can_lift_attrs_to_inputs` function.
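An illustrative sketch of the pattern this covers (not the exact repro from #105074):
```python
import numpy as np
import torch

g = np.arange(3)  # a NumPy global, now handled like a tensor global

@torch.compile(backend="eager")
def f(x):
    global g
    g = np.add(g, 1)  # the mutation of the NumPy global is tracked as a side effect
    return x + 1

f(torch.zeros(3))
```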
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105959
Approved by: https://github.com/ezyang
Since Python 3.11 bytecode contains end line and column information, for each bytecode we attribute the corresponding source code in a more accurate way. For example, we can highlight a function call in a series of nested function calls, or highlight a function call spanning multiple lines.
Sample:
```python
import torch
import torch._dynamo
from functorch.experimental.control_flow import cond

def h(x):
    return x * 5

def true_fn(x):
    return x * 2

def false_fn(x):
    return x * 3

def f(pred, x):
    x = h(
        h(h(x))
    )
    x = x[1:][:2]
    torch._dynamo.graph_break()
    x = cond(pred, true_fn, false_fn, [x])

opt_f = torch.compile(f, backend="eager")
opt_f(torch.tensor(True), torch.randn(3, 3, 3, 3))
```
Output:
```
$ TORCH_LOGS="trace_call" python playground9.py
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16
h(h(x))
~^^^
TRACE FX call mul from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16
h(h(x))
~^^^^^^
TRACE FX call mul_1 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:15
x = h(
~^
h(h(x))
^^^^^^^
)
^
TRACE FX call mul_2 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE FX call getitem from f /scratch/williamwen/work/pytorch/playground9.py:18
x = x[1:][:2]
~^^^^
TRACE FX call getitem_1 from f /scratch/williamwen/work/pytorch/playground9.py:18
x = x[1:][:2]
~~~~~^^^^
TRACE inlined call true_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TRACE FX call mul from true_fn /scratch/williamwen/work/pytorch/playground9.py:9 (inline depth: 1)
return x * 2
~~^~~
TRACE inlined call false_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TRACE FX call mul from false_fn /scratch/williamwen/work/pytorch/playground9.py:12 (inline depth: 1)
return x * 3
~~^~~
TRACE FX call cond from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104676
Approved by: https://github.com/ezyang
Fixes: #105143
In summary, the changes are:
- Check if Z3 is installed when the module is loaded
- Naming consistently as "translation validation" (not "validator")
- Skipping tests if Z3 is not installed
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105168
Approved by: https://github.com/ezyang
Original PR: #103546
Trying to support numpy function call in dynamo, with numpy dtype as argument.
For example:
```
def fn(x: int):
    return np.empty_like(x, dtype=np.float64)
```
This currently doesn't work because `NumpyVariable` doesn't implement `as_proxy()`. The idea in `as_proxy()` for now is to convert `np.float64` and other `np.<dtype>`s into `str` and then feed that into the corresponding `torch_np` method. The assumption here is that all `torch_np` methods that take a `dtype` kwarg can also take a `str` as `dtype`. This assumption holds for `numpy`.
For the previous example, we convert `np.float64` to `"float64"` in `as_proxy()` and then feed it into the `torch_np.empty_like()` method.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105034
Approved by: https://github.com/voznesenskym
## Problem
Trying to support numpy function call in dynamo, with numpy dtype as argument.
For example:
```
def fn(x: int):
    return np.empty_like(x, dtype=np.float64)
```
## Solution
This currently doesn't work because `NumpyVariable` doesn't implement `as_proxy()`. The idea in `as_proxy()` for now is to convert `np.float64` and other `np.<dtype>`s into `torch.dtype` and then feed that into the corresponding `torch_np` method.
For the previous example, we convert `np.float64` to `torch.float64` in `as_proxy()` and then feed it into the `torch_np.empty_like()` method.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103546
Approved by: https://github.com/ezyang
Python's `mod` semantics are not the same as the mathematical modulus operation. According to
the Python reference: `a = floor(a / b) * b + a % b`.
In other words: `a % b = a - floor(a / b) * b`.
This PR fixes the old implementation which used SMT-LIB2 semantics for `mod`. In short, it
only worked with integers and had the following guarantee: `0 <= a % b < b`.
In summary, the changes are:
- `a % b = a - floordiv(a, b) * b`
- `a` and `b` can be both integer or real
- The result will be real if any of the arguments is real. Otherwise, it will be integer
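A quick illustration of the difference in Python itself:
```python
# Python's % follows floored division, so the result takes the divisor's sign;
# under SMT-LIB2 semantics the result would always be non-negative.
print(-7 % 3)   # 2:  -7 - floor(-7 / 3) * 3 = -7 - (-3) * 3
print(7 % -3)   # -2: the result has the sign of the divisor
```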
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104827
Approved by: https://github.com/lezcano
Some notes:
* I now manually turn off `_generate` jobs from running with cudagraphs, as it is unrealistic to expect to cudagraph autoregressive generation up to the max sequence length; this would imply compiling the entire unrolled sequence generation. Concretely, cm3leon_generate was timing out after this change, likely due to the compile-time slowdown of dynamic shapes on top of accidentally unrolling all the loops.
* A few `torch._dynamo.reset()` calls were tactically inserted to force recompiles in tests that expected them.
* `expectedFailureAutomaticDynamic` was flipped into patching `automatic_dynamic_shapes=False`.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103623
Approved by: https://github.com/voznesenskym
This PR turns translation validation on by default for tests and accuracy benchmark
runs. It also installs Z3 on CI.
The main changes are:
- Add `--no-translation-validation` as an option in _test/run_tests.py_
- Set `PYTORCH_TEST_WITH_TV` environment variable
- Add `TEST_WITH_TV` variable in _torch/testing/_internal/common_utils.py_
- Turn translation validation on for accuracy benchmarks in _benchmarks/dynamo/common.py_
- Add Z3 installation on CI scripts
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103611
Approved by: https://github.com/ezyang
Fix https://github.com/pytorch/pytorch/issues/99639 by handling the case in `InliningInstructionTranslator`'s `LOAD_CLOSURE` definition when the requested cell is not in `self.closure_cells`.
My intuition is that the behavior of `LOAD_DEREF` and `STORE_DEREF` on a cell/freevar should not depend on whether or not we called `LOAD_CLOSURE` (that is, we shouldn't create a new cell var in `LOAD_CLOSURE` like in https://github.com/pytorch/pytorch/pull/101357). But we need a way to push cells created by the inlined function that were not present in the caller - `InlinedClosureVariable` is used to differentiate these cells from other cells.
Adding this test causes an error though (EDIT: this test is not relevant to this PR and instead just reveals that `cond` with Python side effects is still broken):
```python
def test_closure_out_of_scope_cell_with_cond(self):
    from functorch.experimental.control_flow import cond
    cell1 = torch.rand(3, 3)
    cell2 = torch.rand(3, 3)
    orig3 = torch.rand(3, 3)

    def test(x):
        cell3 = orig3.clone()

        def then():
            nonlocal cell3
            cell3 += cell1
            return cell3

        def els():
            nonlocal cell3
            cell3 += cell2
            return cell3

        return cond(x > 0, then, els, [])

    opt_fn = torch._dynamo.optimize("eager")(test)
    result1 = opt_fn(1)
    self.assertTrue(torch.allclose(result1, orig3 + cell1))
    result2 = opt_fn(-1)
    self.assertTrue(torch.allclose(result1, orig3 + cell1 + cell2))
```
```
Traceback (most recent call last):
File "/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1768, in test_closure_out_of_scope_cell_with_cond
result1 = opt_fn(1)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/eval_frame.py", line 295, in _fn
return fn(*args, **kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/eval_frame.py", line 448, in catch_errors
return callback(frame, cache_size, hooks, frame_state)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 526, in _convert_frame
result = inner_convert(frame, cache_size, hooks, frame_state)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 127, in _fn
return fn(*args, **kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 360, in _convert_frame_assert
return _compile(
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/utils.py", line 180, in time_wrapper
r = func(*args, **kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 430, in _compile
out_code = transform_code_object(code, transform)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/bytecode_transformation.py", line 1000, in transform_code_object
transformations(instructions, code_options)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 415, in transform
tracer.run()
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2029, in run
super().run()
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 708, in run
and self.step()
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 668, in step
getattr(self, inst.opname)(inst)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 391, in wrapper
return inner_fn(self, inst)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 1100, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 559, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 1061, in call_function
(false_r, false_graph, false_lifted_freevars) = speculate_branch(False)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 1044, in speculate_branch
ret_val, ret_graph, ret_lifted_freevars = speculate_subgraph(
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 850, in speculate_subgraph
output = f.call_function(tx, args, {})
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/functions.py", line 121, in call_function
return tx.inline_user_function_return(
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 595, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2134, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2231, in inline_call_
tracer.run()
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 708, in run
and self.step()
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 668, in step
getattr(self, inst.opname)(inst)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 162, in impl
self.push(fn_var.call_function(self, self.popn(nargs), {}))
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/builtin.py", line 497, in call_function
proxy = tx.output.create_proxy(
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 345, in create_proxy
return self.current_tracer.create_proxy(*args, **kwargs)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1109, in create_proxy
new_arg = self.lift_tracked_freevar_to_input(arg)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1226, in lift_tracked_freevar_to_input
self.parent.lift_tracked_freevar_to_input(proxy)
File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1219, in lift_tracked_freevar_to_input
assert (
AssertionError: lift_tracked_freevar_to_input on root SubgraphTracer
from user code:
File "/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1766, in test
return cond(x > 0, then, els, [])
File "/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1764, in els
cell3 += cell2
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104222
Approved by: https://github.com/jansel
Added two signpost_event calls to torch.fx.experimental.symbolic_shapes: one for produce_guards (where we can give stats like how many free symbols there are and how many guards were produced), and the other for evaluate_expr after freeze (so we can look for cases where we're improperly discarding guards in backwards).
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103882
Approved by: https://github.com/Skylion007
Before this PR, when compiling a function whose signature takes a symint/symintlist/intlist, we hit a runtime error like ```argument 'shifts' must be tuple of ints, not FakeTensor```. See the newly added unit test in test/dynamo/test_misc.py for a repro.
After this PR, for a FakeTensor with empty size and numel() == 1, we try to convert it into a symint/symintlist. We will likely see the expected exception
```torch._subclasses.fake_tensor.DataDependentOutputException / aten._local_scalar_dense.default``` during conversion.
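An illustrative repro sketch (not the exact unit test added in test/dynamo/test_misc.py):
```python
import torch

def f(x, t):
    # t is a 0-dim tensor (numel() == 1) flowing into an int-list argument;
    # under dynamo this previously raised
    # "argument 'shifts' must be tuple of ints, not FakeTensor".
    return torch.roll(x, shifts=t)

opt_f = torch.compile(f, backend="eager")
opt_f(torch.randn(4), torch.tensor(1))
```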
Reference PRs:
* we handle FakeTensor for symintlist as the 1st vararg: https://github.com/pytorch/pytorch/pull/97508
* we handle FakeTensor for intlist in a similar way: https://github.com/pytorch/pytorch/pull/85759/files
* call local_scalar_dense on a FakeTensor:
f7365eca90
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103448
Approved by: https://github.com/yanboliang
First, infra improvements: a new combinator `expectedFailureDynamic`, which subsumes the expectedFailure calls in test_dynamic_shapes.py. It's just nicer to have these right next to the test. The implementation is in torch/_dynamo/testing.py; it works by putting an attr on the test, which is then converted into a real expectedFailure when we actually generate the dynamic shapes test class.
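An illustrative sketch of the mechanism (not the exact torch/_dynamo/testing.py code):
```python
import unittest

def expectedFailureDynamic(fn):
    # The decorator only tags the test...
    fn._expected_failure_dynamic = True
    return fn

def make_dynamic_cls(cls):
    # ...and the tag becomes a real expectedFailure when the dynamic
    # shapes test class is generated.
    for name, fn in list(vars(cls).items()):
        if getattr(fn, "_expected_failure_dynamic", False):
            setattr(cls, name, unittest.expectedFailure(fn))
    return cls
```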
Next, some housekeeping:
* test/dynamo/test_unspec.py accidentally was running mostly statically due to the `assume_static_by_default` config flip. Don't assume static by default and xfail some tests which regressed in that time.
* New test file test/dynamo/test_config.py, for testing permutations of configuration options. `test_dynamic_shapes` got moved there.
Finally, grinding through tests in a way that will make them more compatible with dynamic by default:
* If the test explicitly requires dynamic_shapes=False, remove that patch (and probably xfail it)
* If the test checks dynamic_shapes internally, remove that test and patch the test so it ALWAYS runs with dynamic_shapes (this is not coverage loss because we're going to switch the default)
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103542
Approved by: https://github.com/anijain2305
The main concept behind this refactor is this: if we know that a size/stride/etc is constant, do NOT trace it into the graph, EXCEPT for any preexisting special cases that applied for static shapes. The refactor unfolds like this:
1. Delete the `dynamic_shapes` branches in torch/_dynamo/variables/builder.py which accept int/float/bool outputs. This is over-aggressive and we don't want to allow this (because if the operator returns a constant, we shouldn't have called wrap_fx_proxy in the first place.) This causes a bunch of failures because we are blindly feeding the result of size() call to wrap_fx_proxy when dynamic shapes is enabled.
2. Modify TensorVariable.call_method in torch/_dynamo/variables/tensor.py to avoid sending constant ints to wrap_fx_proxy. After normal specialization (which should be deleted, see https://github.com/pytorch/pytorch/pull/103434) we consult the fake tensor to see if the values in question have free variables or not. If they don't, we short-circuit tracing into the graph. We only trace into the graph if the operation in question is truly symbolic. Note that there is a near miss here: it's OK to trace an x.size() call entirely into the graph, even if it doesn't have all dynamic shapes, because operator.getitem with int output is special-cased in builder.py. This is a preexisting special case and I don't try to get rid of it.
3. It turns out that the change here also breaks torch_np compatibility layer. So I completely rewrite getattr handling in torch/_dynamo/variables/tensor.py to follow the same pattern (only trace into graph if truly dynamic).
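An illustrative sketch of the "only trace if truly symbolic" check (not the actual tensor.py code):
```python
import torch

def maybe_constant_size(fake_tensor, dim):
    size = fake_tensor.size(dim)
    if isinstance(size, torch.SymInt) and size.node.expr.free_symbols:
        return None   # truly symbolic: the caller should trace the call into the graph
    return int(size)  # constant: short-circuit and do not put it in the graph
```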
There's some minor housekeeping in torch/fx/experimental/symbolic_shapes.py and some test files.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103438
Approved by: https://github.com/larryliu0820
Fixes: #101979
This PR adds support for dictionaries with torch object as keys in dynamo.
The main problem was that, for example, the source built for `d[torch.float]` (`d` being a
dictionary) was `ODictGetItemSource(GlobalSource('d'), index=torch.float)`. When
`Source.name` method was called, we got `odict_getitem(G['d'], torch.float)`. Evaluating
that string raised an error, since `torch` was only available in the global dictionary `G`
as `G["torch"]`.
Instead, this PR builds the source:
`ODictGetItemSource(GlobalSource('d'), index=AttrSource(GlobalSource('torch'), 'float'))`.
The to-be-evaluated string is correctly generated as:
`odict_getitem(G['d'], G['torch'].float)`.
Here's a minimal example that reproduces the error, before this PR:
```python
import torch
d = {
    torch.float16: torch.float32,
}

@torch.compile
def f():
    return torch.randn(3, dtype=d[torch.float16])
f()
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103158
Approved by: https://github.com/mlazos
Fixes #102315
The root cause: for `UnspecializedNNModuleVariable`, which extends `UserDefinedObjectVariable`, if `__bool__` is missing we should use `__len__` to infer a truth value.
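This mirrors Python's own truth-value protocol: when `__bool__` is absent, `bool(obj)` falls back to `__len__`.
```python
class Holder:
    def __init__(self, items):
        self.items = items

    def __len__(self):
        return len(self.items)

assert bool(Holder([1]))      # truthy via __len__
assert not bool(Holder([]))   # falsy via __len__
```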
Pull Request resolved: https://github.com/pytorch/pytorch/pull/102583
Approved by: https://github.com/jansel
Issue: #93684
In previous PRs #95849 and #99560 we redirect `numpy.*` and `<tensor>.numpy()` calls to `torch_np.*` methods and attributes, by creating a `NumpyNdarrayVariable` for those calls.
We need to handle `NumpyNdarrayVariable` when graph break happens.
This PR did 2 things:
1. In `codegen.py` we made sure we can reconstruct the value wrapped by `NumpyNdarrayVariable` to be a `torch_np.ndarray` on the stack whenever we recompile the subgraph.
2. In `builder.py` we can wrap the value to be `NumpyNdarrayVariable` and save it as graph input.
-----
Starting from commit 6:
## A new design for supporting numpy in dynamo
In short the core concept doesn't change: we still convert `numpy` API calls to `torch_np` API calls. However, instead of wrapping a `torch_np.ndarray` in `NumpyNdarrayVariable`, the new design wraps a `torch.Tensor`.
The reason for this change is that we need to keep `torch.Tensor` everywhere in the captured graph, so that it works well with dynamo's backends. See discussions in https://github.com/Quansight-Labs/numpy_pytorch_interop/issues/142 for details.
### Flow
This is an example showing how do we think about dynamo working on a simple function:
```python
def f(x: torch.Tensor, y: torch.Tensor):
    a, b = x.numpy(), y.numpy()
    c = np.add(a, b)
    return torch.from_numpy(c)
```
```
+------------+ +------------+
torch.Tensor | |numpy.ndarray| |
-------------- .numpy() --------------| |
| | | | +------------------+
+------------+ | numpy.add |numpy.ndarray| |torch.Tensor
+------------+ | --------------| torch.from_numpy --------------
torch.Tensor | |numpy.ndarray| | | |
-------------- .numpy() --------------| | +------------------+
| | | |
+------------+ +------------+
+------------+ +----------------+
torch.Tensor | |torch.Tensor | |
-------------- .detach() --------------| |
| | | | +----------------+ +------------+
+------------+ | |torch_np.ndarray| |torch.Tensor| |torch.Tensor
| torch_np.add -----------------| util.to_tensor -------------| .detach() --------------
+------------+ | | | | | |
torch.Tensor | |torch.Tensor | | +----------------+ +------------+
-------------- .detach() --------------| |
| | | |
+------------+ | +----------------+ |
| wrapper on torch_np.add |
+--------------------------------------------------------+
```
### Approach
`torch_np` APIs can take both `torch_np.ndarray` and `torch.Tensor`. What we need to do is have a wrapper for these APIs that converts the return value back to `torch.Tensor`. This way only the wrapper shows up in the captured graph, with `torch.Tensor`s as input and `torch.Tensor`s as output.
If we have a graph break or we've traced to the end of the program, we need to inspect all the `NumpyNdarrayVariable` in the stack and convert them back to `numpy.ndarray`, to make sure the compiled version is still behaving the same as the eager version.
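An illustrative sketch of the wrapper idea (not the real utils.py code):
```python
import numpy as np
import torch

class WrapToTensor:
    """Run a numpy-level function, then hand back torch.Tensors so only
    tensors appear in the captured graph."""

    def __init__(self, fn):
        self.fn = fn

    def __call__(self, *args, **kwargs):
        out = self.fn(*args, **kwargs)
        if isinstance(out, (list, tuple)):
            return type(out)(torch.from_numpy(np.asarray(o)) for o in out)
        return torch.from_numpy(np.asarray(out))

wrapped_add = WrapToTensor(np.add)
print(wrapped_add(np.ones(3), 1))  # a torch.Tensor, not a numpy.ndarray
```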
### Examples
Here's an example of the graph generated:
```python
def fn(x: np.ndarray, y: np.ndarray):
    a = x.real
    b = y.real
    torch._dynamo.graph_break()
    return np.add(a, 1), np.add(b, 1)
```
Graph generated:
```
[2023-05-16 10:31:48,737] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH
__compiled_fn_0 <eval_with_key>.0 opcode name target args kwargs
------------- -------------- ---------------------------------------------------------- ---------------------- --------
placeholder l_x_ L_x_ () {}
placeholder l_y_ L_y_ () {}
call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_x_,) {}
call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_y_,) {}
call_function attr_wrapper <function attr_wrapper at 0x12e8693a0> (from_numpy, 'real') {}
call_function attr_wrapper_1 <function attr_wrapper at 0x12e8693a0> (from_numpy_1, 'real') {}
output output output ((),) {}
[2023-05-16 10:31:48,908] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH
__compiled_fn_2 <eval_with_key>.1 opcode name target args kwargs
------------- ------------- ---------------------------------------------------------- ------------------------------- --------
placeholder l_a_ L_a_ () {}
placeholder l_b_ L_b_ () {}
call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_a_,) {}
call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_b_,) {}
call_function wrapped_add <Wrapped function <original add>> (from_numpy, 1) {}
call_function wrapped_add_1 <Wrapped function <original add>> (from_numpy_1, 1) {}
output output output ((wrapped_add, wrapped_add_1),) {}
```
### Changes
* `codegen.py`: reconstruct `numpy.ndarray` from `NumpyNdarrayVariable` by adding bytecode to call `utils.to_numpy_helper()`.
* `output_graph.py`: getting rid of legacy code that does exactly what `codegen.py` does, which only handling return case but not graph break case.
* `utils.py`: added helpers to convert `numpy.ndarray` to `torch.Tensor` and vice versa. Also added a wrapper class that takes in a function; in `__call__` it calls the function and converts its output to `torch.Tensor` (or a list of them).
* `builder.py`: add a method to wrap `numpy.ndarray` graph inputs into `NumpyNdarrayVariable`, by calling `torch.from_numpy` in the proxy.
* `misc.py`: `numpy` API calls goes into `NumpyVariable` and we find the function with the same name in `torch_np` module, then wrap it with the wrapper defined in `utils.py`.
* `tensor.py`, `torch.py`: proxy `tensor.numpy()` to be `torch.detach()` but wrap it with `NumpyNdarrayVariable`. Similarly, `torch.from_numpy()` -> `torch.detach()` but wrap it with `TensorVariable`. In `NumpyNdarrayVariable`, do the similar `torch_np.ndarray` to `torch.Tensor` wrapping for attributes.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/100839
Approved by: https://github.com/ezyang
Failing mechanism in #95424:
In dynamo mode, when a numpy.int_ is passed to a 'shape'-like param (Sequence[Union[int, SymInt]]), it is wrapped as a list containing a FakeTensor. However, in python_arg_parser the parser expects an int in the symint_list but gets a FakeTensor.
Following #85759, this PR allows tensor elements in a symint_list when in dynamo mode.
This PR also fixes the below tests, which have a similar failing mechanism:
pytest ./generated/test_huggingface_diffusers.py -k test_016
pytest ./generated/test_ustcml_RecStudio.py -k test_036
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97508
Approved by: https://github.com/yanboliang
Summary:
https://github.com/pytorch/pytorch/pull/98488 implements CSE for dynamo guards, and it relies on astunparse to perform the optimization.
`test_guards_cse_pass_single` was broken and later fixed by introducing a check_and_skip_if_needed. This actually fixes the root cause in fbcode and should bring some perf gains internally.
Test Plan: `buck2 test @//mode/opt //caffe2/test/dynamo:test_dynamo -- --exact 'caffe2/test/dynamo:test_dynamo - test_misc.py::DynamicShapesMiscTests::test_guards_cse_pass_single' --run-disabled`
Reviewed By: malfet
Differential Revision: D46126742
Pull Request resolved: https://github.com/pytorch/pytorch/pull/102120
Approved by: https://github.com/malfet
This PR adds support for tracing autograd.Function with grad.
A few important bullet points outlining our approach:
1) Our goal is to verify soundness in order to add a call_function to the autograd.Function's `apply` to the graph.
2) We achieve (1) by either verifying soundness or rejecting soundness, by ensuring that both forward and backward of the autograd.Function are sound.
3) For the forward, if we verify soundness, we install its guards into the graph.
4) For the backward, if we verify soundness, we throw it out. However, backwards soundness verification is more onerous, and has a config driven set of banned attrs and methods for tensors.
1-4 above are achieved by turning the forward and backward into UserDefinedFunctionVariables, and inlining through them, relying on dynamo's soundness detection. If we graph break in these, we raise and treat them as unsound. As noted above, backwards is stricter yet.
For the tracing, the safety comes from dynamo's HigherOrderOperator system. That system ensures that not only do we trace soundly, but that no new variables are lifted into inputs during the tracing, and that the forward and backwards are entirely self contained.
Whenever we reject a function as unsound, we restore back, as usual.
Due to some limitations in the lifting logic, we implemented an escape hatch for tensors that are known in forward but cross into backward through save_for_backward (save) / saved_tensors (load). We escape hatch here to avoid having the known saved tensors coming from forward accidentally treated as lifted variables (and rejected). This is sound, but a little hacky feeling.
Additionally, due to some limitations in fx node removal, combined with how we produce subgraphs for the traces installed from HigherOrderOperators, we had to improve our node removal logic. In the event of a restore, we remove the old nodes from the graph, as usual in dynamo. However, because the references to these nodes may exist in subgraphs, we traverse any nodes users and remove them first if and only if they are in another graph. This is always sound, because removal should only be downstream of restoration at this point.
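A hedged sketch of the user-facing capability (a trivially sound autograd.Function that can be put in the graph):
```python
import torch

class Scale(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return x * 2

    @staticmethod
    def backward(ctx, grad):
        (x,) = ctx.saved_tensors
        return grad * 2

@torch.compile(backend="eager")
def f(x):
    # If both forward and backward are verified sound, Scale.apply becomes
    # a call_function in the graph instead of causing a graph break.
    return Scale.apply(x)

x = torch.randn(3, requires_grad=True)
f(x).sum().backward()
```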
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99483
Approved by: https://github.com/zou3519
If `astunparse` is not installed, following guard will be generated in `test_guard_function_builder_with_cse`:
```python
def ___make_guard_fn():
    def guard(L):
        if not (x[0].a < x[1].a * (3 - x[2].a)):
            return False
        if not (a.b.c[0].d.e + a.b.c[1].d.e * a.b.c[2].d.e > 0):
            return False
        if not (f(m.n[0], '0').x.y.z * f(m.n[0], '1').x.y.z * f(m.n[0], '2').x.y.z < 512):
            return False
        if not (self.g(a, b).k + (1 - self.g(a, b).k) <= m[0].a + self.g(a, b).k):
            return False
        return True
    return guard
```
Though, I have to say, hardcoding string comparison is pretty weird.
Also, skip `test_guards_cse_pass_[single|multiple]` if AST unparsing is missing.
Fixes failure in a test introduced by https://github.com/pytorch/pytorch/pull/98488
Pull Request resolved: https://github.com/pytorch/pytorch/pull/101805
Approved by: https://github.com/atalman, https://github.com/ysiraichi
Previously, anomaly detection was only enabled on the inner forward function, and not on the overall joint function that calls backward. I believe this impeded us from printing "this is the forward that triggered the backward" because that printing only happens if anomaly mode is enabled when you run backward(). This PR fixes it.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/101047
Approved by: https://github.com/albanD, https://github.com/bdhirsh
Fixes #99665
Let me explain the root cause using the unit test I added:
* This bug is triggered when:
  * `wrapped` is a nested function.
  * `wrapped` is in another module, which is different from the main function `fn`.
  * There is a graph break inside of `wrapped`.
* The root cause is that when resuming the nested function, we were actually using the outermost function's (`fn` in my example) global variables, but `wrapped` calls `inner_func`, which is not part of `fn`'s globals, so we have to set the correct globals when the nested function resumes execution.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/100426
Approved by: https://github.com/jansel
Dynamo will frequently segfault when attempting to print stack traces. We fix this by:
- Fixing stack size calculations, as we did not account for exception tables
- Creating shadow execution frames in a way that more closely resembles what CPython does to create its execution frames
Dynamo/inductor-wrapped pytorch tests are enabled up the stack - those need to be green before this PR can be merged.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99934
Approved by: https://github.com/albanD, https://github.com/malfet, https://github.com/jansel
Currently, we return `unimplemented` without a graph break on seeing an `x.unsqueeze_()` when `x` is an input. This essentially means we fall back to the original frame.
This PR actually graph breaks so that we can generate the continuation frame for the rest of the function. Instead of graph breaking at LOAD_ATTR, we delay the graph break to the actual CALL_FUNCTION, where it's cleaner to graph break.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99986
Approved by: https://github.com/jansel
All Sources must be hashable, since we are using set equality to check for
duplicate sources in AOTAutograd. We should have a more systematic way
of asserting this; for this PR we just fix the local issue.
Fixes #99145
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99379
Approved by: https://github.com/ezyang
Summary of changes:
- Add CPython exceptiontable parsing/assembling functions in torch/_dynamo/bytecode_transformation.py, based on https://github.com/python/cpython/blob/3.11/Objects/exception_handling_notes.txt.
- Add optional `exn_tab_entry` field to dynamo `Instruction`s in torch/_dynamo/bytecode_transformation.py in order to virtualize exception table entries (start, end, target instructions).
- Add checks guarding against duplicate instructions in dynamo, so that jump/exceptiontable targets are unambiguous. See `get_indexof` in torch/_dynamo/bytecode_analysis.py. Ensure that bytecode generation throughout dynamo does not generate duplicate instructions.
- Allow dynamo bytecode generation logic to generate nested exception table entries for developer convenience. CPython expects entries to not overlap, so we flatten nested entries during assembly in torch/_dynamo/bytecode_transformation.py:compute_exception_table.
- Simulate the block stack in torch/_dynamo/symbolic_convert.py. CPython removed the block stack in 3.11, but dynamo needs it in order to keep track of active contexts. So we simulate the block stack as before by looking at exceptiontable entries in order to determine the current blocks.
- Update context codegen in torch/_dynamo/resume_execution.py. The `SETUP_FINALLY` bytecode, which conveniently had a jump target to the finally block, was removed in 3.11, so we need to keep track of the jump target of the finally block using exceptiontables. Generating resume functions is more difficult since the original exceptiontable entries pointing to old cleanup code need to be modified to point to new cleanup code.
- Fix a push_null bug in torch/_dynamo/variables/functions.py introduced by https://github.com/pytorch/pytorch/pull/98699
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96511
Approved by: https://github.com/jansel, https://github.com/yanboliang, https://github.com/albanD
* Introduce a frame counter which lets us uniquely identify frames.
This makes it easier to tell if you are recompiling the same frame
* Shorten evaluate_expr to eval for more visual distinctiveness
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99159
Approved by: https://github.com/Skylion007
Summary -
`disallow_in_graph` is mostly useful for backends. Suppose your backend does not support `torch.abs()`; you can then use `disallow_in_graph` to force a graph break there.
The assumption in the above statement is that `disallow_in_graph` is called on an `allowed` callable. `allowed` in Dynamo language refers to a callable that is put as-is in the Dynamo graph.
Therefore, if one uses `disallow_in_graph` on some non-torch, non-allowed function, we want to raise an exception to tell the user that they probably want something else.
* If they want to disable Dynamo - they should use torch._dynamo.disable
* If they wanted to stop inlining - they should use torch._dynamo.graph_break. However, this is not a decorator, so we would need to provide another API. But the question is: who would want to do this?
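For illustration, a minimal sketch of the intended usage on an `allowed` callable (the backend and function choice here are arbitrary):
```python
import torch
import torch._dynamo as dynamo

# Pretend the backend cannot handle torch.abs: dynamo will graph break on it.
dynamo.disallow_in_graph(torch.abs)

@torch.compile(backend="eager")
def f(x):
    return torch.abs(x) + 1

f(torch.randn(3))
dynamo.allow_in_graph(torch.abs)  # restore the default behavior
```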
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98892
Approved by: https://github.com/jansel
Summary: Add new experimental python op (`torch.nonzero_static`) for export. There is NO cuda impl included in this PR
Example:
Say the input tensor is `x = torch.tensor([[1, 0], [3, 2]])`:
calling regular `nonzero()` on x gives `tensor([[0, 0], [1, 0], [1, 1]])`
calling `nonzero_static(x, size=4)` gives `tensor([[0, 0], [1, 0], [1, 1], [fill_value, fill_value]])` (padded)
calling `nonzero_static(x, size=2)` gives `tensor([[0, 0], [1, 0]])` (truncated)
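A small runnable sketch of the behavior above (the `fill_value` keyword and its default are assumptions based on the padding described here):
```python
import torch

x = torch.tensor([[1, 0], [3, 2]])

torch.nonzero(x)                                # tensor([[0, 0], [1, 0], [1, 1]])
torch.nonzero_static(x, size=4)                 # padded with the fill value up to 4 rows
torch.nonzero_static(x, size=2)                 # truncated to the first 2 rows
torch.nonzero_static(x, size=4, fill_value=-1)  # assumed keyword for the padding value
```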
Test Plan:
**Unit Tests**
```
buck test @mode/dev-nosan //caffe2/test:test_dynamo -- 'caffe2/test:test_dynamo - test_export.py::ExportTests::test_export_with_nonzero_static' -- 'caffe2/test:test_dynamo - test_misc.py::MiscTests::test_nonzero_static'
```
**PT2 Export with `nonzero_static()`**
Example of `GraphModule` in the exported graph
```
def forward(self, x):
    arg0, = fx_pytree.tree_flatten_spec(([x], {}), self._in_spec)
    nonzero_static_default = torch.ops.aten.nonzero_static.default(arg0, size = 4); arg0 = None
    return pytree.tree_unflatten([nonzero_static_default], self._out_spec)
```
Differential Revision: D44324808
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97417
Approved by: https://github.com/ezyang
In the terminal state, it won't matter if you have dynamic_shapes
on or not, mark_dynamic will always work.
Today, it's helpful to make this not error so I can easily swap
between static or not and run experiments.
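A minimal sketch of the experiment flow this enables (assuming the public `torch._dynamo.mark_dynamic` entry point):
```python
import torch

def f(x):
    return x * 2

x = torch.randn(8, 3)
# Mark dim 0 as dynamic; with this change the call no longer errors even if
# dynamic shapes are not globally enabled.
torch._dynamo.mark_dynamic(x, 0)
torch.compile(f)(x)
```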
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/98324
Approved by: https://github.com/voznesenskym
Repro:
From #92670, this addresses one of the bugs for TorchDynamo:
pytest ./generated/test_PeterouZh_CIPS_3D.py -k test_003
Issue:
In GuardBuilder, when parsing argnames with "getattr(a.layers[slice(2)][0]._abc, '0')" it returns "getattr(a", where it is supposed to return "a", causing a SyntaxError.
This PR fixes the regex and adds a couple of test cases.
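For illustration, a minimal sketch (not the actual GuardBuilder code) of the intended parsing behavior, with a hypothetical `base_argname` helper:
```python
import re

def base_argname(expr: str) -> str:
    # Treat a leading "getattr(" as a wrapper rather than part of the name.
    m = re.match(r"(?:getattr\()?\s*([A-Za-z_][A-Za-z0-9_]*)", expr.strip())
    return m.group(1)

assert base_argname("getattr(a.layers[slice(2)][0]._abc, '0')") == "a"
assert base_argname("x.size()[0]") == "x"
```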
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97810
Approved by: https://github.com/yanboliang
The purpose of this PR is to execute a few large components of work:
1) Refactor all the internals of plumbing dynamic dimension information after dynamo to be stateless
2) Decouple allocation controls around dynamic dimensions from verification
3) For (2), for allocation, create an enum that dictates whether we are in DUCK (default today), STATIC (aka assume_static_default in the past), or DYNAMIC (aka user constrained, do not duck shape)
4) For (2), for verification, we separate out the list of dynamic ranges entirely from allocation. This means the shape_env does not track what we verify on; instead, it is the caller's job to invoke produce_guards() with the various things they want verified, specifically, the valid ranges. We do use constrain ranges to refine value ranges when doing analysis.
5) We have decided, therefore, as an extension of (4), to double down on "late" checks versus "eager" checks, primarily because the mechanism for gathering what actually matters happens during guards, and should be the purview of the caller seeking guards, not the shape env. However, for dynamo, these structures are essentially one and the same.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96699
Approved by: https://github.com/avikchaudhuri, https://github.com/ezyang
The purpose of this PR is to remove reliance on argument positions in dedup guards, AND extend the functionality to params.
A version of this PR was stamped previously (https://github.com/pytorch/pytorch/pull/95831), but it was kind of gross because it was based on an underlying PR that did way too much with source names.
This PR leaves most of that alone, in favor of just reusing the same name standardization logic that dynamo module registration does.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96774
Approved by: https://github.com/ezyang
Previously, when starting to trace a function, we would record a frame summary recording the definition loc. This would lead to an unconventional-looking stack trace when used for debugging, e.g., shape guards.
```
File ".../scripts/avik/pt2/example.py", line 407, in forward
def forward(self, x):
...
File ".../transformers/models/bert/modeling_bert.py", line 912, in forward
@add_start_docstrings_to_model_forward(BERT_INPUTS_DOCSTRING.format("batch_size, sequence_length"))
...
File ".../transformers/models/bert/modeling_bert.py", line 562, in forward
def forward(
...
File ".../transformers/models/bert/modeling_bert.py", line 484, in forward
def forward(
...
File ".../transformers/models/bert/modeling_bert.py", line 416, in forward
def forward(
...
File ".../transformers/models/bert/modeling_bert.py", line 275, in forward
def forward(
...
File ".../transformers/models/bert/modeling_bert.py", line 351, in forward
attention_scores = attention_scores + attention_mask
```
As noted in https://github.com/pytorch/pytorch/pull/95848#discussion_r1134397096, we would like to change this to record function calls instead, like conventional stack traces do. This diff makes this change. The above stack now looks like the following, which is way more helpful at a glance to understand what's going on.
```
File ".../scripts/avik/pt2/example.py", line 408, in forward
bert_out = self.bert(**x)
...
File ".../transformers/models/bert/modeling_bert.py", line 1021, in forward
encoder_outputs = self.encoder(
...
File ".../transformers/models/bert/modeling_bert.py", line 610, in forward
layer_outputs = layer_module(
...
File ".../transformers/models/bert/modeling_bert.py", line 496, in forward
self_attention_outputs = self.attention(
...
File ".../transformers/models/bert/modeling_bert.py", line 426, in forward
self_outputs = self.self(
...
File ".../transformers/models/bert/modeling_bert.py", line 351, in forward
attention_scores = attention_scores + attention_mask
```
Differential Revision: [D44101882](https://our.internmc.facebook.com/intern/diff/D44101882/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96882
Approved by: https://github.com/ezyang
# Summary
This PR adds an optional kwarg to torch.nn.functional.scaled_dot_product_attention().
The new kwarg is a scaling factor that is applied after the q@k.T step of the computation. The efficient kernel was updated to support it, and the flash and math kernels were minimally updated to support it as well.
Will reduce the complexity of: #94729 and has been asked for by a couple of users.
# Review Highlights
- As far as I know I did this the correct way and it is both BC and FC compliant. However, I always seem to break internal workloads, so I would love it if someone could advise whether I did this right.
- I named the optional arg 'scale'. This is probably dumb and I should name it 'scale_factor'. I will make this change, but it is annoying and it will require someone deciding that we should rename.
- 'scale' is interpreted as `Q@K.T * (scale)`
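A minimal usage sketch of the new keyword (tensor shapes here are arbitrary):
```python
import torch
import torch.nn.functional as F

q, k, v = (torch.randn(2, 4, 8, 16) for _ in range(3))

# scale overrides the default 1/sqrt(E) softmax scaling: Q @ K.T * scale
out = F.scaled_dot_product_attention(q, k, v, scale=0.5)
```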
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95259
Approved by: https://github.com/cpuhrsch
Adds profiler start and end callbacks to dynamo's C eval_frame impl, which can be used to profile a region and provide a name for visualization. Currently only one usage is hooked up, to profile cache lookup (primarily covering guards and the linear search through the linked list).
Example profile taken from toy model:
`python benchmarks/dynamo/distributed.py --toy_model --profile --dynamo aot_eager`
<img width="1342" alt="image" src="https://user-images.githubusercontent.com/4984825/223225931-b2f6c5a7-505a-4c90-9a03-34982f6dc033.png">
Planning to measure overhead in CI, and probably can't afford to check this in enabled by default. Will have to evaluate UX options such as `config.profile_dynamo_cache = True` or some other way.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96119
Approved by: https://github.com/jansel
OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor.
The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors.
* I folded in the changes from https://github.com/pytorch/pytorch/pull/95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.)
* Only 0/1 integers get specialized by default now
* A hodgepodge of fixes. I'll comment on the PR about them.
Fixes https://github.com/pytorch/pytorch/issues/95469
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95621
Approved by: https://github.com/jansel, https://github.com/Chillee
This PR allows us to reuse the static per tensor decision making we make at fake tensorification time. We can use this to avoid setting up dynamic dim guards later if the tensor was never a candidate.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95566
Approved by: https://github.com/ezyang
Fixes issues with things like:
```python
x = 2
x += y.shape[0]
```
resulting in invalid `2 += y.shape[0]` code in the FX graph.
Fix: Whenever dynamic shapes are involved, insert the out-of-place op into the FX graph instead of the in-place op.
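A minimal sketch of the pattern this fixes (assuming dynamic shapes are enabled, e.g. via `dynamic=True`):
```python
import torch

@torch.compile(dynamic=True)
def f(y):
    x = 2
    x += y.shape[0]   # traced out-of-place: the graph records add(2, s0), never `2 += s0`
    return y * x

f(torch.randn(5))
```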
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95446
Approved by: https://github.com/ezyang
**Summary**: torch.nn.Module implementations previously did not support custom implementations of `__getattr__`; if a torch.nn.Module subclass implemented `__getattr__` and we tried to access an attribute that was expected to be present in `__getattr__`, dynamo would not check `__getattr__` and would error out with an AttributeError. This PR copies the functionality from UserDefinedObjectVariable into torch.nn.Module so that it also supports `__getattr__`
Example of a module which previously would fail:
```python
class MyMod(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.custom_dict = {"queue": [torch.rand((2, 2)) for _ in range(3)]}
        self.other_attr = torch.rand((2, 2))

    def __getattr__(self, name):
        custom_dict = self.custom_dict
        if name in custom_dict:
            return custom_dict[name]
        return super().__getattr__(name)

    def forward(self, x):
        return x @ self.other_attr + self.queue[-1]
```
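With this change, compiling the module above is expected to succeed rather than raise an AttributeError (a minimal usage sketch, reusing `MyMod` from the snippet above):
```python
import torch

mod = MyMod()  # MyMod as defined in the snippet above
torch.compile(mod)(torch.rand((2, 2)))  # forward reads self.queue via the custom __getattr__
```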
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94658
Approved by: https://github.com/yanboliang, https://github.com/jansel
Fixes https://github.com/pytorch/pytorch/issues/93890
We do the following:
1. fix the `__init__` constructor for `AutocastModeVariable` with an existing `mode` while copying
2. `resume_execution` is made aware of constant args (`target_values`) by storing said args in `ReenterWith`. To propagate between subgraphs (in straightline code), we also store the constant args in the downstream's `code_options["co_consts"]` if not already present.
---
Future work:
1. handle instantiating context manager in non-inlineable functions. Simultaneously fix nested grad mode bug.
2. generalize to general `ContextManager`s
3. generalize to variable arguments passed to context manager, with guards around the variable.
---
Actually, if we look at the repro: 74592a43d0/test/dynamo/test_repros.py (L1249), we can see that the method in this PR doesn't work for graph breaks in function calls, in particular, in function calls that don't get inlined.
Why inlining functions with graph breaks is hard:
- When we handle graph breaks, we create a new code object for the remainder of the code. It's hard to imagine doing this when you are inside a function, then we need a frame stack. And we just want to deal with the current frame as a sequence of straight line codes.
Why propagating context manager information is hard:
- If we do not inline the function, the frame does not contain any information about the parent `block_stack` or `co_consts`. So we cannot store it on local objects like the eval frame. It has to be a global object in the output_graph.
---
Anyway, I'm starting to see clearly that dynamo must indeed be optimized for the torch use-case. Supporting more general cases tends to run into endless corner cases and caveats.
One direction that I see as viable to handle function calls which have graph breaks and `has_tensor_in_frame` is stick with not inlining them, while installing a global `ContextManagerManager`, similar to the `CleanupManager` (which cleans up global variables). We can know which context managers are active at any given point, so that we can install their setup/teardown code on those functions and their fragments.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94137
Approved by: https://github.com/yanboliang
Supports the following with dynamic shapes:
```python
for element in tensor:
    # do stuff with element
```
Approach follows what's done when `call_range()` is invoked with dynamic shape inputs: guard on tensor size and continue tracing with a real size value from `dyn_dim0_size.evaluate_expr()`.
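A minimal runnable sketch of the pattern (assuming dynamic shapes, e.g. `dynamic=True`):
```python
import torch

@torch.compile(dynamic=True)
def row_sum(t):
    total = torch.zeros(t.shape[1])
    for row in t:  # guards on t.size(0), then unrolls the loop with the concrete value
        total = total + row
    return total

row_sum(torch.randn(4, 3))
```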
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94326
Approved by: https://github.com/ezyang
**Problem**: For a tensor `x`, you can assign `x.my_attr = 3.14` and then later access it. Dynamo does not support this right now; it errors out with an AttributeError (it was broken in #91840).
**Fix**: This fixes the problem by catching AttributeErrors in dynamo if we try to access an attr that does not exist on a standard torch.Tensor.
**Tests**: Added tests for accessing and setting attributes to make sure dynamo does not error out.
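A minimal sketch of the kind of code this unblocks:
```python
import torch

def f(x):
    x.my_attr = 3.14       # set a custom attribute on a tensor
    return x * x.my_attr   # and read it back

torch.compile(f)(torch.randn(3))  # should no longer error out with an AttributeError
```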
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94332
Approved by: https://github.com/yanboliang
By moving guard string assembly into dynamo's default behavior and letting code_parts do the work, we can have much better shape guard failures.
Before this fix, the guard failure in the test would look like:
```
'x.size()[1] == x.size()[0] and x.stride()[0] == x.[264 chars]!= 1' != 'x.size()[0] < 3'
- x.size()[1] == x.size()[0] and x.stride()[0] == x.size()[0] and x.stride()[1] == 1 and x.storage_offset() == 0 and y.size()[0] == x.size()[0] and y.size()[1] == x.size()[0] and y.stride()[0] == x.size()[0] and y.stride()[1] == 1 and y.storage_offset() == 0 and x.size()[0] < 3 and x.size()[0] != 0 and x.size()[0] != 1
+ x.size()[0] < 3
```
now it is
```
"x.size()[0] < 3"
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93894
Approved by: https://github.com/ezyang
# Summary
This PR creates _flash_attention_backward and _scaled_dot_product_flash_attention_backward native functions and registers them to the respective derivatives.yaml.
The goal is to replicate the torch.autograd.Function defined in the FlashAttention repo [here](33e0860c9c/flash_attn/flash_attn_interface.py (L126)) natively in PyTorch. One thing we don't have access to in native PyTorch is ctx.save_for_backward, so in order to save these variables I extended the objects returned from the forward functions.
### MetaFunctions
I also updated the FlashAttention meta functions to mirror the real outputs now. As well, I added a meta registration for backwards. I have an XLMR training script; while eager training now works with FlashAttention, compiling this module fails with the inductor error down below.
### Questions?
Performance issues vs mem efficient when using torch.nn.mha_forward
TorchCompile -> See proposed solution below.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92917
Approved by: https://github.com/cpuhrsch
**Background:** Before this PR, support in dynamo for tensor attributes (e.g. `x.H`, `x.T`, ...) needed to be individually implemented one-by-one. This could potentially lead to errors, e.g. if the implementation in [variables/tensor.py](21c7c7c72f/torch/_dynamo/variables/tensor.py (L160)) differed from the implementation from a direct call to the attribute. For attributes that were not special-cased in tensor.py, dynamo tracing would fail. This PR adds generic support for tensor attributes that return tensors, without needing to handle them specially. (Notably, for x.real and x.imag, which previously weren't supported.)
**In this PR:** This directly creates a proxy node for a `"call_function"` node with `target=getattr`, and feeds it into wrap_fx_proxy. This will produce a TensorVariable for the attribute returned.
This also removes the implementations for H, T, mH, mT which were broken (previously `torch.relu(x.T)` would fail). They now fall back to this default implementation (for which `torch.relu(x.T)` passes).
**Further context**:
* Ed's original suggestion in [90463](https://github.com/pytorch/pytorch/pull/90463#discussion_r1043398340) is to use `torch.Tensor.H.__get__(x)`. I wasn't able to get this to work; fx compilation fails with `getset_descriptor does not have attribute __module__`. Basically, the `__module__` attribute which is available on most python attributes, is not available on `getset_descriptor` objects. (i.e., these are implemented in C++ as attributes on torch.Tensor, so they don't obey some assumptions made by fx)
* Although both tensor attributes and methods (like `x.relu()`) go through this path, this PR should only handle attributes (e.g. see the `"getset_descriptor"` check in variables/tensor.py). Methods are already handled by GetAttrVariable.
* Prior to this PR, we already returned GetAttrVariables for unsupported attrs: the parent caller would catch the NotImplementedError and fallback to returning a GetAttrVariable. But if this GetAttrVariable was ever passed into a torch.\* function (as it could quite possibly be, since most of these attrs are tensors), it would fail because its proxy node would be missing an [example_value](https://github.com/pytorch/pytorch/blob/master/torch/_dynamo/utils.py#L1017). So: before, for some tensor x, `x.real` would work fine; but `torch.relu(x.real)` would fail.
**Testing**: added tests in test_misc.py for x.real, x.imag, x.T, x.real.T.
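A minimal sketch of the cases described above that previously failed (based on the tests mentioned; `fullgraph=True` just asserts there is no fallback):
```python
import torch

@torch.compile(fullgraph=True)
def f(x):
    # both attribute reads are traced as getattr call_function nodes
    return torch.relu(x.T), torch.relu(x.real)

f(torch.randn(3, 4))
```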
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91840
Approved by: https://github.com/ezyang
Previously, Dynamo faked support for item() when `capture_scalar_outputs` was True by representing it internally as a Tensor. With dynamic shapes, this is no longer necessary; we can represent it directly as a SymInt/SymFloat. Do so. Doing this requires you to use dynamic shapes; in principle we could support scalar outputs WITHOUT dynamic shapes but I won't do this unless someone hollers for it.
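A minimal sketch of the behavior (assuming `capture_scalar_outputs` and dynamic shapes are enabled):
```python
import torch

torch._dynamo.config.capture_scalar_outputs = True

@torch.compile(dynamic=True)
def f(x):
    n = x.sum().item()  # captured as a SymFloat instead of a 0d tensor
    return x + n

f(torch.randn(4))
```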
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Differential Revision: [D42885775](https://our.internmc.facebook.com/intern/diff/D42885775)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/93150
Approved by: https://github.com/voznesenskym
This PR adds `@comptime`, a decorator that causes a given function to be executed at compile time when Dynamo is symbolically evaluating their program. To query the Dynamo state, we offer a public ComptimeContext API which provides a limited set of APIs for querying Dynamo's internal state. We intend for users to use this API and plan to keep it stable. Here are some things you can do with it:
* You want to breakpoint Dynamo compilation when it starts processing a particular line of user code: give comptime a function that calls breakpoint
* You want to manually induce a graph break for testing purposes; give comptime a function that calls unimplemented
* You want to perform a debug print, but you don't want to induce a graph break; give comptime a function that prints.
* You can print what the symbolic locals at a given point in time are.
* You can print out the partial graph the Dynamo had traced at this point.
* (My original motivating use case.) You want to add some facts to the shape env, so that a guard evaluation on an unbacked SymInt doesn't error with data-dependent. Even if you don't know what the final user API for this should be, with comptime you can hack out something quick and dirty. (This is not in this PR, as it depends on some other in flight PRs.)
Check out the tests to see examples of comptime in action.
In short, comptime is a very powerful debugging tool that lets you drop into Dynamo from user code, without having to manually jerry-rig pdb inside Dynamo to trigger after N calls.
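A minimal sketch of what this looks like in user code (the `print_graph` method name is assumed from the capabilities listed above):
```python
import torch
from torch._dynamo.comptime import comptime

@torch.compile
def f(x):
    y = x * 2
    # Runs at compile time while Dynamo traces f; prints the partial graph.
    comptime(lambda ctx: ctx.print_graph())
    return y + 1

f(torch.randn(3))
```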
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90983
Approved by: https://github.com/jansel
Tracing `torch.backends.cudnn.is_acceptable(Tensor) -> bool:` fails with:
```
...
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/variables/functions.py", line 196, in call_function
return super(UserFunctionVariable, self).call_function(tx, args, kwargs)
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/variables/functions.py", line 67, in call_function
return tx.inline_user_function_return(
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 426, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 1698, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 1752, in inline_call_
tracer.run()
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 485, in run
and self.step()
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 455, in step
getattr(self, inst.opname)(inst)
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 281, in wrapper
return inner_fn(self, inst)
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 912, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/symbolic_convert.py", line 389, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/variables/torch.py", line 431, in call_function
tensor_variable = wrap_fx_proxy(
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/variables/builder.py", line 662, in wrap_fx_proxy
return wrap_fx_proxy_cls(
File "/scratch/dberard/dynamo38/pytorch/torch/_dynamo/variables/builder.py", line 820, in wrap_fx_proxy_cls
raise AssertionError(
AssertionError: torch.* op returned non-Tensor bool call_function <function is_acceptable at 0x7f00deefb790>
```
So instead, evaluate `is_acceptable()` and convert the result to a constant. The result of `is_acceptable(tensor) -> bool` depends on:
* dtype/device of the input tensor (this should already be guarded)
* properties of the build & whether cudnn is available
* some global state that gets initialized during the first call to `torch.backends.cudnn._init()` (this is NOT guarded in this PR)
Note: this fixes tts_angular with FSDP. This was an issue with FSDP because FSDP modules are interpreted as UnspecializedNNModules, and UnspecializedNNModules try to inline calls. In comparison, NNModules (e.g. when the tts_angular model is not wrapped in FSDP) do not inline calls and instead evaluate subsequent calls. In subsequent calls, cudnn.is_acceptable would be skipped by eval_frame.py:catch_errors because it is not in an allowlist.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90323
Approved by: https://github.com/jansel
This PR introduces a new function we can pass to torch._dynamo.optimize - guard_failure_fn. Usage is in the PR, and the one stacked on top of it, but the gist of it is that it emits failed guard reason strings alongside code. This is useful for tests and debugging, as it gives far finer grained assertions and control than the compile counter alone.
This is a resubmit of https://github.com/pytorch/pytorch/pull/90129
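A minimal sketch of the intended usage (assuming the hook is exposed as the `guard_fail_fn` keyword of `torch._dynamo.optimize`; the exact keyword spelling is an assumption):
```python
import torch
import torch._dynamo as dynamo

failures = []

@dynamo.optimize("eager", guard_fail_fn=failures.append)
def f(x):
    return x * 2

f(torch.randn(2, 2))
f(torch.randn(8, 8))  # the shape guard fails, triggering a recompile
print(failures)       # failed-guard reasons recorded alongside the code
```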
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90371
Approved by: https://github.com/ezyang
The current cond implementation is silently incorrect when
there are outstanding side effects, since the locally tracked
side effects are lost when the recursive export call is made.
At least we raise an assert now.
I'm working on a refactor of cond which should be able to sidestep
this problem. Maybe.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Differential Revision: [D41746973](https://our.internmc.facebook.com/intern/diff/D41746973)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90208
Approved by: https://github.com/voznesenskym
This is a group of bug fixes for [7k github models](https://github.com/pytorch/torchdynamo/issues/1884); it would fix 30+ model tests.
* Support ```tensor.type()```.
* Support ```tensor.get_device()```.
* Support ```torch.nn.functional._Reduction.get_enum```.
* Support ```torch._utils._get_device_index()```.
* Fallback for ```tensor.data_ptr()```.
  * ```FakeTensor``` always returns 0.
  * With no fake tensor propagation, we ```clone``` the input tensor, so it makes no sense to track the original ```data_ptr```. And I don't think this is a very popular API.
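A minimal sketch exercising two of the newly supported calls (a behavior sketch only; the fixes cover more models than this):
```python
import torch

@torch.compile
def f(x):
    if x.get_device() == -1:          # -1 means the tensor lives on CPU
        return x.type(torch.float64)  # tensor.type() with a dtype converts
    return x

f(torch.randn(3))
```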
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89486
Approved by: https://github.com/jansel
Fix bugs in [7k github models](https://github.com/pytorch/torchdynamo/issues/1884).
* Legacy code still uses ```tensor.data```; I think we can rewrite it with ```tensor.detach```, though I'm not sure if there is anything I didn't anticipate.
* Support ```tensor.layout```.
The root cause of these issues is that dynamo wraps an unimplemented ```tensor.x``` call into ```GetAttrVariable(TensorVariable, x)```, but this op was not inserted into the FX graph. Hence, during fake tensor propagation, it throws ```KeyError: 'example_value'```.
For these two popular attributes, Dynamo should support them anyway. However, whether dynamo should support ___all___ ```tensor.x``` calls rather than falling back to ```GetAttrVariable``` is debatable.
If I turn off fake tensor propagation, it works well even not including this fix. So I'm curious if we should improve the fake propagation to cover similar cases. cc @mlazos @soumith @voznesenskym @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @chunyuan-w @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire @jansel @eellison
```
Traceback (most recent call last):
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/convert_frame.py", line 404, in _compile
out_code = transform_code_object(code, transform)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/bytecode_transformation.py", line 341, in transform_code_object
transformations(instructions, code_options)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/convert_frame.py", line 392, in transform
tracer.run()
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 1523, in run
super().run()
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 389, in run
and self.step()
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 359, in step
getattr(self, inst.opname)(inst)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 193, in wrapper
return inner_fn(self, inst)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 865, in CALL_FUNCTION_KW
self.call_function(fn, args, kwargs)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/symbolic_convert.py", line 301, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/variables/torch.py", line 407, in call_function
tensor_variable = wrap_fx_proxy(
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/variables/builder.py", line 636, in wrap_fx_proxy
return wrap_fx_proxy_cls(
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/variables/builder.py", line 676, in wrap_fx_proxy_cls
example_value = get_fake_value(proxy.node, tx)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/utils.py", line 1024, in get_fake_value
args, kwargs = torch.fx.node.map_arg((node.args, node.kwargs), visit)
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 613, in map_arg
return map_aggregate(a, lambda x: fn(x) if isinstance(x, Node) else x)
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 621, in map_aggregate
t = tuple(map_aggregate(elem, fn) for elem in a)
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 621, in <genexpr>
t = tuple(map_aggregate(elem, fn) for elem in a)
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 627, in map_aggregate
return immutable_dict((k, map_aggregate(v, fn)) for k, v in a.items())
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 627, in <genexpr>
return immutable_dict((k, map_aggregate(v, fn)) for k, v in a.items())
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 631, in map_aggregate
return fn(a)
File "/scratch/ybliang/work/repos/pytorch/torch/fx/node.py", line 613, in <lambda>
return map_aggregate(a, lambda x: fn(x) if isinstance(x, Node) else x)
File "/scratch/ybliang/work/repos/pytorch/torch/_dynamo/utils.py", line 1022, in visit
return n.meta["example_value"]
KeyError: 'example_value\n\nfrom user code:\n File "./generated/test_BayesWatch_pytorch_prunes.py", line 108, in forward\n return torch.zeros([x.size()[0], self.channels, x.size()[2] // self.spatial, x.size()[3] // self.spatial], dtype=x.dtype, layout=x.layout, device=x.device)\n\nSet torch._dynamo.config.verbose=True for more information\n\n\nYou can suppress this exception and fall back to eager by setting:\n torch._dynamo.config.suppress_errors = True\n'
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89257
Approved by: https://github.com/jansel
Fixes error from 7k github models: https://github.com/jansel/pytorch-jit-paritybench/blob/master/generated/test_arashwan_matrixnet.py
Error:
```
AssertionError: torch.* op returned non-Tensor bool call_function <function is_tensor at 0x7fca94d0faf0>
from user code:
File "/scratch/ybliang/work/repos/pytorch-jit-paritybench/generated/test_arashwan_matrixnet.py", line 749, in scatter
return scatter_map(inputs)
File "/scratch/ybliang/work/repos/pytorch-jit-paritybench/generated/test_arashwan_matrixnet.py", line 741, in scatter_map
assert not torch.is_tensor(obj), 'Tensors not supported in scatter.'
```
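A minimal sketch of the failing pattern, which now traces (fullgraph here only asserts there is no fallback):
```python
import torch

@torch.compile(fullgraph=True)
def scatter_map(obj):
    # torch.is_tensor returns a plain bool; it is treated as a constant now.
    assert not torch.is_tensor(obj), 'Tensors not supported in scatter.'
    return [o * 2 for o in obj]

scatter_map([torch.randn(2), torch.randn(2)])
```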
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88704
Approved by: https://github.com/jansel