pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Evgeni Burovski	48989bc820	trace frames with np.ndarray (#110512 ) Fixes #109604 Resubmit gh-109715 + several skips and small fixes to make tests pass. The main fix here is by @ysiraichi : previously, dynamo did not resume tracing numpy ndarrays after a graph break. While at it, fix several small issues Yukio's fix uncovers: - graph break gracefully on numpy dtypes which do not map to torch.dtypes (uint16 etc) - recognize array scalars in dynamo, treat them as 0D ndarrays - make sure that iterating over torch.ndarray generates arrays not bare tensors Pull Request resolved: https://github.com/pytorch/pytorch/pull/110512 Approved by: https://github.com/lezcano	2023-10-15 00:56:10 +00:00
Peter Bell	8747e4c8c1	[dynamo] Add specialized variable tracker for sys.modules (#110990 ) `sys.modules` is currently treated as a constant dictionary and any reference to it will result in guards on the full contents of `sys.modules`. This instead adds a specialized variable tracker which tries to guard only on the modules referenced by the code. e.g. ``` sys.modules["operator"].add(x, x) ``` will generate the guard ``` ___dict_contains('operator', G['sys'].modules) ``` It does this with special support for `__contains__` `__getitem__` and `.get` which are probably the most commonly used with `sys.modules`. For anything else we just fall back to building the dict tracker as normal. While accessing `sys.modules` may seem unusual, it actually comes up when inlining the `warnings.catch_warnings` context manager which internally accesses `sys.modules["warnings"]`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110990 Approved by: https://github.com/ezyang	2023-10-13 20:08:40 +00:00
Tugsbayasgalan Manlaibaatar	5614023f5e	Move export.constrain_as_* to torch._constrain_as_* (#110757 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110757 Approved by: https://github.com/avikchaudhuri ghstack dependencies: #109859	2023-10-12 05:37:44 +00:00
PyTorch MergeBot	6ce3a38050	Revert "Move export.constrain_as_* to torch._constrain_as_* (#110757 )" This reverts commit `5aee22e0e0`. Reverted https://github.com/pytorch/pytorch/pull/110757 on behalf of https://github.com/kit1980 due to Depends on https://github.com/pytorch/pytorch/pull/109859 that needs to be reverted ([comment](https://github.com/pytorch/pytorch/pull/110757#issuecomment-1758908371))	2023-10-12 04:53:29 +00:00
Tugsbayasgalan Manlaibaatar	5aee22e0e0	Move export.constrain_as_* to torch._constrain_as_* (#110757 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110757 Approved by: https://github.com/avikchaudhuri ghstack dependencies: #109859	2023-10-11 02:37:55 +00:00
Edward Z. Yang	24bf9aeb6b	Fix arange with dynamic end argument. (#110979 ) Fixes https://github.com/pytorch/pytorch/issues/93468 There's a few extra tests that are sort of unrelated, but I ended up writing them while working on the fix and decided to keep them. The big idea here is to split the `_check` so that `expect_true` works; I could have probably also improved the symbolic reasoning but I'm lazy. One small logging fix too. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/110979 Approved by: https://github.com/Skylion007	2023-10-11 00:32:34 +00:00
Jon Chuang	6e770c0dda	[dynamo] Add `itertools.repeat` via polyfill (#110953 ) Fixes https://github.com/pytorch/pytorch/issues/110286 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110953 Approved by: https://github.com/ezyang	2023-10-10 20:40:33 +00:00
Jon Chuang	844ea6408b	feat(dynamo): handle accumulate kwargs ("func", "initial") (#110686 ) Follow up to: https://github.com/pytorch/pytorch/pull/110683 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110686 Approved by: https://github.com/ezyang	2023-10-08 07:06:52 +00:00
cdzhan	fa8e4ea212	Add support for hasattr on ListVariable (#110438 ) Fixes #109502 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110438 Approved by: https://github.com/jansel	2023-10-08 05:34:00 +00:00
Animesh Jain	58637c4b43	[dynamo] Remove SuperSource (#110475 ) The motivation for removing this is already present in the pre-PR comments. Copying it ~~~ # NB - SuperSource is a weird one. # it is our only source with 2 bases, so we use the objec # as the base, rather than the type, since an invocation # like super(Foo, foo) is represented here, the source object base is more spiritually # aligned with the instance, rather than the type. # This whole construction is questionable tho, and we should probably find a way to # avoid this exception to our otherwise nice source parentage invariant. ~~~ Instead of using super(a, b), we can use `type(b).__mro__[index]`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110475 Approved by: https://github.com/jansel	2023-10-08 04:45:06 +00:00
Jon Chuang	9b55194f81	fix(dynamo): Incorrect `accumulate` implementation, bad tests (#110683 ) Root cause of: https://github.com/pytorch/pytorch/issues/110287 Fixed many tests that didn't actually test due to unreliability of `CompileCounter.frame_count` in detecting graph breaks: https://github.com/pytorch/pytorch/issues/110730 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110683 Approved by: https://github.com/voznesenskym	2023-10-06 23:07:56 +00:00
William Wen	71beca4899	[dynamo, logging] Report name of defining class along side function name in Dynamo logs (#110190 ) Implement https://github.com/pytorch/pytorch/issues/109236 Sample code: ```python import torch class AAA: class DUMMY: class DUMMY2: pass def dummy(self): def dummy2(): pass class BBB: @staticmethod def CCC(): class DDD: if True: @staticmethod def EEE(): x = [torch.ones(3, 3) for _ in range(5)] return x return DDD def fn(): return AAA.BBB.CCC().EEE() opt_fn = torch.compile(fn, backend="eager") opt_fn() ``` Logs: ```bash $TORCH_LOGS="trace_source" python playground2.py [2023-09-27 17:38:35,641] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:21 in fn (fn) [2023-09-27 17:38:35,641] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] def fn(): [2023-09-27 17:38:35,642] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:22 in fn (fn) [2023-09-27 17:38:35,642] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] return AAA.BBB.CCC().EEE() [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:11 in CCC (AAA.BBB) (inline depth: 1) [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] @staticmethod [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:13 in CCC (AAA.BBB.CCC.DDD) (inline depth: 1) [2023-09-27 17:38:35,661] [0/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] class DDD: [2023-09-27 17:38:35,723] [1/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] TRACE starts_line /data/users/williamwen/pytorch/playground2.py:17 in <listcomp> (AAA.BBB.CCC.DDD.EEE) [2023-09-27 17:38:35,723] [1/0] torch._dynamo.symbolic_convert.__trace_source: [DEBUG] x = [torch.ones(3, 3) for _ in range(5)] ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/110190 Approved by: https://github.com/ezyang, https://github.com/mlazos	2023-10-05 20:41:38 +00:00
Yu Guo	2bf3ca1be7	[torchdynamo] preserve deterministic_algorithms_warn_only in convert_context (#110457 ) Summary: preserve deterministic_algorithms_warn_only in dynamo context Test Plan: modified unit tests to test warn_only Differential Revision: D49872622 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110457 Approved by: https://github.com/jansel	2023-10-04 07:12:32 +00:00
Peter Bell	a8a31bc165	[dynamo][BE] test_misc.py shouldn't change the default dtype globally (#110412 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110412 Approved by: https://github.com/jansel, https://github.com/lezcano, https://github.com/Fidget-Spinner ghstack dependencies: #110398	2023-10-03 19:25:37 +00:00
Peter Bell	dc794ec32c	[dynamo] Trace through builtin `abs` (#110398 ) In python `abs(x)` does nothing but delegate to `x.__abs__()` so we should do the same in dynamo. This also adds `SymNode.__abs__` so we can trace through indexing expressions involving `abs`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110398 Approved by: https://github.com/jansel, https://github.com/lezcano	2023-10-03 19:25:37 +00:00
Yukio Siraichi	6f48d872d0	Re-land: Break graph on `manual_seed`. (#109109 ) Re-landing: #108647 (old #107594) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109109 Approved by: https://github.com/lezcano	2023-09-28 15:28:40 +00:00
aashishthakur10	ee8983da70	109605 dynamo scalar ndarray pow gen (#109953 ) Fixes #109605 Generated code before: ``` def call(args): arg0_1, = args args.clear() assert_size_stride(arg0_1, (8, ), (1, )) buf0 = empty_strided((), (), device='cpu', dtype=torch.int64) cpp_fused_lift_fresh_0(c_void_p(buf0.data_ptr())) # Source Nodes: [wrapped_pow], Original ATen: [aten.lift_fresh, aten.pow] buf1 = aten.pow(arg0_1, reinterpret_tensor(buf0, (8, ), (0, ), 0)) del arg0_1 del buf0 buf2 = buf1 assert_size_stride(buf2, (8, ), (1, )) del buf1 return (buf2, ) ``` Generated code now: ``` def call(args): arg0_1, = args args.clear() assert_size_stride(arg0_1, (8, ), (1, )) buf0 = empty_strided((8, ), (1, ), device='cpu', dtype=torch.int64) cpp_fused_pow_0(c_void_p(arg0_1.data_ptr()), c_void_p(buf0.data_ptr())) del arg0_1 return (buf0, ) ``` @lezcano What would be a good way to add a test for this? Pull Request resolved: https://github.com/pytorch/pytorch/pull/109953 Approved by: https://github.com/lezcano	2023-09-28 13:11:06 +00:00
Yukio Siraichi	51a8c166a6	Add test for `ShapeEnv` recording fallback. (#109944 ) This PR adds a test for the previous PR in this stack: #109904. In summary, it calls functions decorated with `@record_shapeenv_event`, that don't have an explicit `ShapeEnv` parameter, with arguments that don't hold a `ShapeEnv` instance. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109944 Approved by: https://github.com/ezyang	2023-09-27 00:50:14 +00:00
PyTorch MergeBot	194d9aa0f2	Revert "[Dynamo] Match closures by code ID (#109427 )" This reverts commit `3de0857503`. Reverted https://github.com/pytorch/pytorch/pull/109427 on behalf of https://github.com/voznesenskym due to Fails test `PYTORCH_TEST_WITH_DYNAMO=1 python test_ops.py -k test_out_warning__refs_cat_cpu` ([comment](https://github.com/pytorch/pytorch/pull/109427#issuecomment-1736101561))	2023-09-26 18:54:36 +00:00
PyTorch MergeBot	812bf847b7	Revert "Add test for `ShapeEnv` recording fallback. (#109944 )" This reverts commit `a4dec8d306`. Reverted https://github.com/pytorch/pytorch/pull/109944 on behalf of https://github.com/atalman due to New test failing internally ([comment](https://github.com/pytorch/pytorch/pull/109944#issuecomment-1735512734))	2023-09-26 13:11:22 +00:00
Yukio Siraichi	26e8cc0465	Add test for `ShapeEnv` state when not recording. (#109945 ) This PR adds a test for checking `ShapeEnv` state when it's built with `should_record_events=False`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109945 Approved by: https://github.com/ezyang ghstack dependencies: #109904, #109944	2023-09-26 07:20:46 +00:00
Yanbo Liang	a81cb0de16	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-26 00:03:41 +00:00
Yukio Siraichi	a4dec8d306	Add test for `ShapeEnv` recording fallback. (#109944 ) This PR adds a test for the previous PR in this stack: #109904. In summary, it calls functions decorated with `@record_shapeenv_event`, that don't have an explicit `ShapeEnv` parameter, with arguments that don't hold a `ShapeEnv` instance. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109944 Approved by: https://github.com/ezyang ghstack dependencies: #109904	2023-09-25 20:59:41 +00:00
Ken Jin	3de0857503	[Dynamo] Match closures by code ID (#109427 ) Closes https://github.com/pytorch/pytorch/issues/107866 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109427 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-09-25 19:10:35 +00:00
PyTorch MergeBot	829b5c0949	Revert "[Dynamo] Support python class member_descriptor (#109956 )" This reverts commit `12cd776d90`. Reverted https://github.com/pytorch/pytorch/pull/109956 on behalf of https://github.com/jeanschmidt due to multiple slow jobs broken ([comment](https://github.com/pytorch/pytorch/pull/109956#issuecomment-1733706269))	2023-09-25 13:25:45 +00:00
Yanbo Liang	12cd776d90	[Dynamo] Support python class member_descriptor (#109956 ) Fixes Meta internal cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109956 Approved by: https://github.com/jansel	2023-09-25 03:15:39 +00:00
Evgeni Burovski	ca5f3a7436	TST: test that numpy dtypes do not graph break (#109974 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109974 Approved by: https://github.com/lezcano	2023-09-25 01:00:39 +00:00
Animesh Jain	8ed08e5a7c	[dynamo] Graph break on rng get/set state - remove GeneratorStateSource (#109410 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109410 Approved by: https://github.com/ezyang ghstack dependencies: #109411	2023-09-22 22:31:55 +00:00
Michael Voznesensky	a902150a1e	[Easy] ConstantVariable() -> .create (#109896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109896 Approved by: https://github.com/ezyang	2023-09-22 22:30:15 +00:00
Michael Lazos	24ba4b7059	[dynamo][`__torch_function__` 1/n] Add getset descriptor and `__get__` vars (#109542 ) Adds the MethodWrapperVariable and GetSetDescriptor variable types. These are used in `__torch_function__` tracing to represent attribute reads (`__get__`) and for comparing unbound methods (the func argument when `__torch_function__` is dispatched from a method call). This is a step towards tracing for https://github.com/pytorch/pytorch/issues/93723 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109542 Approved by: https://github.com/jansel	2023-09-22 10:39:15 +00:00
Edward Z. Yang	09622d8d49	Allow inferring size-nature from sizes passed to empty constructor (#109720 ) This removes the need for many constrain_as_size calls as we now infer them from error checking for sizes. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/109720 Approved by: https://github.com/aakhundov	2023-09-21 17:57:40 +00:00
lezcano	8597d37536	Implement numpy(force=True) (#109636 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109636 Approved by: https://github.com/ezyang ghstack dependencies: #109634	2023-09-20 20:06:13 +00:00
lezcano	1f6828ca99	Fix numpy(force=False) (#109634 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109634 Approved by: https://github.com/ezyang	2023-09-20 20:06:13 +00:00
Edward Z. Yang	103260a43b	Re-define check for `typing` classes. (#109201 ) This PR fixes the `is_typing` function, which checks whether a value is an instance of a class from the `typing` package. This reverts commit b09c09f7bb3adb6a5b8a107a5b96757b569daa8d. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109201 Approved by: https://github.com/ezyang	2023-09-20 00:04:56 +00:00
aashishthakur10	9e86a093e4	add torch.device to python type (#108116 ) Fixes #107856 This PR adds torch.device instance check in the python_type method for torch variables in dynamo. @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/108116 Approved by: https://github.com/msaroufim, https://github.com/ezyang	2023-09-18 02:20:30 +00:00
Ken Jin	f9e72acc8f	Guard default dtype in torchdynamo (#109459 ) Fixes https://github.com/pytorch/pytorch/issues/109458 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109459 Approved by: https://github.com/ezyang	2023-09-17 22:51:33 +00:00
Oguz Ulgen	b03ef1d969	[Dynamo] Fix numpy error in test_numpy_torch_operators (#109087 ) When you in-place matmul two one-dimensional numpy arrays, numpy=="1.24.3" gives ``` TypeError: In-place matrix multiplication is not (yet) supported. Use 'a = a @ b' instead of 'a @= b'. ``` but numpy=="1.25.2" gives ``` ValueError: inplace matrix multiplication requires the first operand to have at least one and the second at least two dimensions. ``` This diff makes the test pass on newer versions of numpy, which previously failed because we did not catch ValueError. An alternative solution would be to update the test cases to be 2-dimensional, but that would have an impact on other operators being tested. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109087 Approved by: https://github.com/jansel	2023-09-16 07:37:07 +00:00
ydwu4	94a54b89aa	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo(): # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. Note: More lines are printed in the debug log because of the newly added context manager and guard. Test Plan: Removed original tests that raise on different backend and added a new test to check whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-14 15:49:30 +00:00
Nakul Camsamudram	109ab6a0df	Support str() on user defined functions (#108973 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108973 Approved by: https://github.com/anijain2305	2023-09-14 01:32:02 +00:00
Guilherme Leobas	d046376c4f	Dispatch `numpy.take_along_axis` to `torch.take_along_dim` (#108880 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108880 Approved by: https://github.com/lezcano ghstack dependencies: #108879	2023-09-13 23:13:09 +00:00
drisspg	ad90ab31f2	Flash Attention v2 (#105602 ) # Summary ## PR Dependencies I don't use ghstack :( this is a PR where it would have been helpful. That being said I am going to peel off some PRs to make reviewing this easier: - [x] Separate build flags for Flash and MemEff: #107985 ### Description This pull request updates the version of _scaled_dot_product_flash_attention from version 1 to version 2. The changes are based on the flash attention code originally authored by @tridao ### Changes Made The majority of the changes in this pull request involve: - Copying over the flash_attention sources. - Updating header files. - Removing padding and slicing code from within the flash_attention kernel and relocating it to the composite implicit region of the SDPA. This was needed to make the kernel functional and appease autograd. - Introducing a simple kernel generator to generate different instantiations of the forward and backward flash templates. - Adding conditional compilation (ifdef) to prevent building when nvcc is invoked with gencode < sm80. - Introducing a separate dependent option for mem_eff_attention, as flash_attention v2 lacks support for Windows and cannot be built for sm50 generation codes. - Modifying build.sh to reduce parallelization on sm86 runners and to lower the maximum parallelization on the manywheel builds. This adjustment was made to address out-of-memory issues during the compilation of FlashAttentionV2 sources. - Adding/Updating tests. ### Notes for Reviewers This is not a fun review, and I apologize in advance. Most of the files-changed are in the flash_attn/ folder. The only files of interest here IMO: - aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp - aten/src/ATen/native/transformers/cuda/flash_attn/kernels/generate_kernels.py (this has been incorporated upstream to flash-attention github) There are a number of files all related to avoiding OOMs in CI/CD. These are typically shell scripts. ### Follow up items - Include the updates from `e07aa036db` and `9e5e8bc91e` | https://github.com/pytorch/pytorch/issues/108108 ### Work Items - [x] I don't think Windows will be supported for 3.1.0 - Need to update cmake - [x] Let multi_query/attention pass through and test | UPDATE: I have the fast path implemented here: https://github.com/pytorch/pytorch/pull/106730 but since this will require changes to semantics of math to call repeat_interleave, I think this should be done as a followup. - [x] Had to drop cutlass back to 3.0.0 to get it to compile. Need to figure out how to upgrade to 3.1.0 and later. Spoke with Tri and he is going to be taking a look. Note: compiling with clang currently errors for the cute headers. - [x] Update tests to exercise the above codepath - [x] Still need to disable on seq_len % 128 != 0 for backward (Tri beat me to it: `a4f148b6ab`) - [x] Add determinism warning to BWD, Tri got to this one as well: 1c41d2b - [x] Update dispatcher to universally prefer FlashV2 - [x] Update tests to exercise new head_dims - [x] Move the head_dim padding from kernel to top level composite implicit function in order to make it purely functional - [x] Create template generator script - [x] Initial cmake support for building kernels/ folder - [x] Replay CudaGraph changes ### Results #### Forward only The TFlops reported here are on an A100 that is underclocked. ![flashv2_tflops_vs_seq_len](https://github.com/pytorch/pytorch/assets/32754868/152de46d-8fa6-42f0-9a9c-ef1eb7ae29e7) #### Forward+Backward Ran a sweep and for large compute bound sizes we do see a ~2x performance increase for forw+back. <img width="1684" alt="Screenshot 2023-07-20 at 3 47 47 PM" src="https://github.com/pytorch/pytorch/assets/32754868/fdd26e07-0077-4878-a417-f3a418b6fb3b"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105602 Approved by: https://github.com/huydhn, https://github.com/cpuhrsch	2023-09-13 13:59:05 +00:00
Yukio Siraichi	12e8530b35	Record and replay for ShapeEnv. (#107989 ) This PR introduces record and replay functionality for `ShapeEnv` instances. In short, throughout the execution of a program, we record events (e.g. function calls that modify its state) so that, in the future, we are able to reproduce any intermediary state of the instance. In summary, this PR introduces the following changes (they mostly belong to _symbolic_shapes.py_ unless otherwise stated): - Create `ShapeEnvEvent` class for recording function calls + arguments - Create `record_shapeenv_event` decorator and decorate every function that changes the state of a `ShapeEnv`: it creates an appropriate event and adds it to the available `ShapeEnv` instance (sometimes it has to extract it from `SymTypes`). - Create `SymNode.with_shape_env` convenience function for replacing `ShapeEnv` references - Wrap the `ShapeEnv` initialization method, so that we also save the exact way a `ShapeEnv` was constructed, i.e. its arguments - Introduce a way to compare two `ShapeEnv` instances, defining a concept of state for that class. In short, the state of `ShapeEnv` is every variable that may change the execution flow - Create `check_shape_env_recorded_events` dynamo configuration for enabling an equality check between the state of a `ShapeEnv` and that of another one constructed by replaying all the recorded events. This check takes place inside `produce_guards` - Create `replay_shape_env_events` function for replaying given events. It assumes the first event is the `ShapeEnv` initialization function Pull Request resolved: https://github.com/pytorch/pytorch/pull/107989 Approved by: https://github.com/ezyang	2023-09-13 00:22:38 +00:00
Nakul Camsamudram	3b265e021f	Support Optional typehint without graph breaking (#108970 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108970 Approved by: https://github.com/anijain2305	2023-09-11 16:42:44 +00:00
William Wen	2c3febb273	[dynamo] disable flaky test_unhandled_exception_in_dynamo2 (#108906 ) Fix https://github.com/pytorch/pytorch/issues/106028. The test `test_unhandled_exception_in_dynamo` should cover most cases. The disabled test `test_unhandled_exception_in_dynamo2` covered some weird case that I found when implementing dynamo 3.11. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108906 Approved by: https://github.com/yanboliang	2023-09-09 01:10:09 +00:00
PyTorch MergeBot	8caaa4f4cd	Revert "Re-land: Break graph on `manual_seed`. (#108647 )" This reverts commit `c887309437`. Reverted https://github.com/pytorch/pytorch/pull/108647 on behalf of https://github.com/huydhn due to Ouch, we are hit again by another internal import error from https://github.com/pytorch/pytorch/blob/main/torch/_inductor/config.py#L205-L206 ([comment](https://github.com/pytorch/pytorch/pull/108647#issuecomment-1712230103))	2023-09-08 21:18:00 +00:00
Yanbo Liang	8990174676	[Dynamo] Should inline __new__ function rather than skipping frame (#108549 ) Fixes #107460 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108549 Approved by: https://github.com/jansel	2023-09-08 16:51:47 +00:00
Huy Do	a9c663c269	Revert "Flash Attention v2 (#105602 )" (#108827 ) This reverts commit `add45aea1c`. There are some conflicts on some benchmark csv file https://github.com/pytorch/pytorch/pull/105602#issuecomment-1710988951 so I need to revert this manually. The diff has been reverted internally. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108827 Approved by: https://github.com/kit1980	2023-09-08 07:43:04 +00:00
Jason Ansel	4965fffeda	[dynamo] Move global state guards to C++ (#108624 ) This combines a bunch of python global state guards into a single C++ guard and switches to checking them 100% of the time. It also adds a few new guards for things that change inductor's behavior. Even though we are checking more things, I expect this to be much faster. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108624 Approved by: https://github.com/anijain2305	2023-09-08 04:07:08 +00:00
PyTorch MergeBot	e45b290127	Revert "Revert "Flash Attention v2 (#105602 )" (#108827 )" This reverts commit `24e9bbe22a`. Reverted https://github.com/pytorch/pytorch/pull/108827 on behalf of https://github.com/huydhn due to I need to land this revert properly as there are new failures showing up on trunk ([comment](https://github.com/pytorch/pytorch/pull/108827#issuecomment-1711020924))	2023-09-08 03:25:45 +00:00
Huy Do	24e9bbe22a	Revert "Flash Attention v2 (#105602 )" (#108827 ) This reverts commit `add45aea1c`. There are some conflicts on some benchmark csv file https://github.com/pytorch/pytorch/pull/105602#issuecomment-1710988951 so I need to revert this manually. The diff has been reverted internally. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108827 Approved by: https://github.com/kit1980	2023-09-08 02:54:20 +00:00
PyTorch MergeBot	38fcf77a1b	Revert "[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 )" This reverts commit `1a64ec7dd4`. Reverted https://github.com/pytorch/pytorch/pull/107337 on behalf of https://github.com/huydhn due to Sorry for reverting your change but inductor perf smoke test starts to regress after this ([comment](https://github.com/pytorch/pytorch/pull/107337#issuecomment-1710974588))	2023-09-08 02:03:48 +00:00
ydwu4	1a64ec7dd4	[dynamo] Add BACKEND_MATCH guard to detect and recompile when backend changes (#107337 ) Motivation: We try to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle the closures. Before this PR, we have a warning if we don't turn on a config raise_on_backend_change (turning it on gives us an error) for the following code: ```python def foo(): # Inside torch.cond, we'd like to do something like torch.compile(foo, backend="eager", fullgraph=True)(...) ... # Users may then call torch.compile somewhere else. # Dynamo will use the cached code of foo for "eager" backend # but we expect dynamo to recompile with "inductor" backend. torch.compile(foo, backend="inductor")(...) ``` This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True). Implementation: 1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag. 2. The newly added context manager and guard add more lines to the debug log, so we change the upper limit from 50 to 55. Test Plan: Removed original tests that raise on different backend and added a new test to check whether the BACKEND_MATCH guard can guard against backend change. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337 Approved by: https://github.com/jansel	2023-09-07 22:45:54 +00:00
Evgeni Burovski	1f20531939	fall back to eager on `NotImplementedError` (#107863 ) Follow-up to https://github.com/pytorch/pytorch/pull/107710: Help dynamo fall back to eager when compiling unimplemented numpy constructs: - arrays of strings - (arg){min, max} for complex types - various arguments typed as NotImplemented (`np.ones(4, order="F")` etc) - numpy functions which torch._numpy does not implement To test, run (we do not implement arrays of strings) ``` import torch import numpy as np @torch.compile(fullgraph=False) def fn(): return np.asarray(["L", "U"]) ``` and observe it compiles with fullgraph=False and fails with fullgraph=True Fixes https://github.com/pytorch/pytorch/issues/107970 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107863 Approved by: https://github.com/ezyang, https://github.com/lezcano	2023-09-07 21:22:20 +00:00
Yukio Siraichi	c887309437	Re-land: Break graph on `manual_seed`. (#108647 ) Trying to re-land #107594. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108647 Approved by: https://github.com/eellison	2023-09-07 12:52:38 +00:00
Animesh Jain	29f1097891	[dynamo] Reduce cache size limit to 8 (#108526 ) As title Pull Request resolved: https://github.com/pytorch/pytorch/pull/108526 Approved by: https://github.com/ezyang	2023-09-05 17:56:26 +00:00
Peter Bell	a16b0aa26a	[dynamo] Fix return type of Tensor.shape (#108240 ) This should be `torch.Size` but was returning a plain tuple under dynamo. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108240 Approved by: https://github.com/ezyang ghstack dependencies: #108239	2023-09-05 14:58:39 +00:00
Peter Bell	7c931f2491	[dynamo] Add dynamic shapes support to torch.Size.numel (#108239 ) Currently numel only supports static shapes, but this expands it to support generating symbolic arithmetic into the graph. e.g. ``` # x.size().numel with x.size() = [s0, 1, s1] size = l_x_.size() getitem = size[0] getitem_2 = size[2]; size = None mul = getitem * getitem_2; getitem = getitem_2 = None ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/108239 Approved by: https://github.com/ezyang	2023-09-05 14:58:39 +00:00
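The traced graph in the commit above multiplies only the symbolic entries of the size. A minimal sketch of that idea in plain Python follows, assuming static dimensions are folded into one constant and symbolic dimensions are kept as named terms; `numel_expr` and the string-based symbols are hypothetical illustrations, not Dynamo's code.

```python
from typing import List, Union

# An int is a static dimension; a str names a symbolic dimension such as "s0".
Dim = Union[int, str]


def numel_expr(size: List[Dim]) -> str:
    """Build a textual product expression for the number of elements."""
    const = 1
    symbolic: List[str] = []
    for dim in size:
        if isinstance(dim, int):
            const *= dim          # fold static dimensions into one constant
        else:
            symbolic.append(dim)  # keep symbolic dimensions as separate terms
    if not symbolic:
        return str(const)
    # Omit the constant factor when it is 1, as in the [s0, 1, s1] example.
    terms = symbolic if const == 1 else symbolic + [str(const)]
    return " * ".join(terms)
```

For the size `[s0, 1, s1]` from the commit message, this sketch yields a product of the two symbolic dimensions only.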
PyTorch MergeBot	48286d34a4	Revert "Break graph on `manual_seed`. (#107594 )" This reverts commit `6ad5568cbc`. Reverted https://github.com/pytorch/pytorch/pull/107594 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it has an import issue that breaks internal code ([comment](https://github.com/pytorch/pytorch/pull/107594#issuecomment-1705584405))	2023-09-04 18:00:37 +00:00
David Berard	06b173780d	[dynamo] "TorchDynamo Cache Lookup" event: use C++ api (#108436 ) Background: "TorchDynamo Cache Lookup" events appear in traces to indicate a dynamo cache lookup; it's useful to check when cache lookups are taking a long time. To add a profiler event, one can use the `torch.profiler.record_function` context manager, or the C++ equivalent. Previously, the python version was used; first, when the profiler was enabled, callbacks for record_function_enter and record_function_exit were registered; then those would be called before and after every cache lookup. This PR: Instead of calling the python bindings for `torch.profiler.record_function`, directly call the C++ implementation. This simplifies a lot of the code for binding C/C++. It also improves performance; previously there was a lot of overhead in the "TorchDynamo Cache Lookup" event, making the event artificially take a long time. After this change the events now appear shorter, because there's less overhead in starting/stopping the event: in other words, the profiler no longer distorts the results as much. Performance results: I ran using the script below on a cpu-only 1.6GHz machine. I report the median time (from 100 measurements) of a "TorchDynamo Cache Lookup" event before and after this PR. I think it is reasonable to consider the difference to be due to a reduction in overhead. <details> <summary>Benchmarking script</summary> ```python def fn(x, y): return (x * y).relu() a, b = [torch.rand((4, 4), requires_grad=True) for _ in range(2)] opt_fn = torch.compile(fn) opt_fn(a, b) opt_fn(a, b) with torch.profiler.profile() as prof: opt_fn(a, b) ``` </details> Median before PR: 198-228 us (median of 100, measured 5 times) Median after PR: 27us Pull Request resolved: https://github.com/pytorch/pytorch/pull/108436 Approved by: https://github.com/anijain2305, https://github.com/jansel	2023-09-04 04:37:26 +00:00
youkaichao	b9fc6d7ded	[Dynamo] Update the implementation of _debug_get_cache_entry_list (#108335 ) In https://github.com/pytorch/pytorch/pull/106673 , I created a private API `_debug_get_cache_entry_list` to help pull out cache entries from compiled functions. Recently, I found that @anijain2305 commented in the code that this API should be revisited, so I created this PR. First, this API cannot be removed even if the cache entry becomes a first-class python class `torch._C._dynamo.eval_frame._CacheEntry`. Because `extra_index` is static and `get_extra_state` is inline static, they are not accessible elsewhere. The `_debug_get_cache_entry_list` API is the only way for users to get all the cache entries from code. Second, since the `torch._C._dynamo.eval_frame._CacheEntry` class is a python class, I simplified the C-part code and removed the need to create a namedtuple for this in the python code. Third, I also added a small improvement: if the argument is a function, we automatically pass its `__code__` to the API. The above change slightly changes the output, from a list of named tuples to a list of `torch._C._dynamo.eval_frame._CacheEntry`. I will update the corresponding docs that use this API. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108335 Approved by: https://github.com/jansel, https://github.com/anijain2305	2023-09-02 16:38:59 +00:00
drisspg	add45aea1c	Flash Attention v2 (#105602 ) # Summary ## PR Dependencies I don't use ghstack :( this is a PR where it would have been helpful. That being said, I am going to peel off some PRs to make reviewing this easier: - [x] Separate build flags for Flash and MemEff: #107985 ### Description This pull request updates the version of _scaled_dot_product_flash_attention from version 1 to version 2. The changes are based on the flash attention code originally authored by @tridao ### Changes Made The majority of the changes in this pull request involve: - Copying over the flash_attention sources. - Updating header files. - Removing padding and slicing code from within the flash_attention kernel and relocating it to the composite implicit region of the SDPA. This was needed to make the kernel functional and appease autograd. - Introducing a simple kernel generator to generate different instantiations of the forward and backward flash templates. - Adding conditional compilation (ifdef) to prevent building when nvcc is invoked with gencode < sm80. - Introducing a separate dependent option for mem_eff_attention, as flash_attention v2 lacks support for Windows and cannot be built for sm50 generation codes. - Modifying build.sh to reduce parallelization on sm86 runners and to lower the maximum parallelization on the manywheel builds. This adjustment was made to address out-of-memory issues during the compilation of FlashAttentionV2 sources. - Adding/Updating tests. ### Notes for Reviewers This is not a fun review, and I apologize in advance. Most of the changed files are in the flash_attn/ folder. The only files of interest here IMO: - aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp - aten/src/ATen/native/transformers/cuda/flash_attn/kernels/generate_kernels.py (this has been incorporated upstream to flash-attention github) There are a number of files all related to avoiding OOMs in CI/CD. These are typically shell scripts.
### Follow up items - Include the updates from `e07aa036db` and `9e5e8bc91e` | https://github.com/pytorch/pytorch/issues/108108 ### Work Items - [x] I don't think Windows will be supported for 3.1.0 - Need to update cmake - [x] Let multi_query/attention pass through and test | UPDATE: I have the fast path implemented here: https://github.com/pytorch/pytorch/pull/106730 but since this will require changes to semantics of math to call repeat_interleave, I think this should be done as a followup. - [x] Had to drop cutlass back to 3.0.0 to get it to compile. Need to figure out how to upgrade to 3.1.0 and later. Spoke with Tri and he is going to be taking a look. Note: compiling with clang currently errors for the cute headers. - [x] Update test exercise above codepath - [x] Still need to disable on seq_len % 128 != 0 for backward (Tri beat me to it `a4f148b6ab`) - [x] Add determinism warning to BWD, Tri got to this one as well: 1c41d2b - [x] Update dispatcher to universally prefer FlashV2 - [x] Update tests to exercise new head_dims - [x] Move the head_dim padding from kernel to top level composite implicit function in order to make it purely functional - [x] Create template generator script - [x] Initial cmake support for building kernels/ folder - [x] Replay CudaGraph changes ### Results #### Forward only The TFlops reported here are on an A100 that is underclocked. ![flashv2_tflops_vs_seq_len](https://github.com/pytorch/pytorch/assets/32754868/152de46d-8fa6-42f0-9a9c-ef1eb7ae29e7) #### Forward+Backward Ran a sweep and for large compute bound sizes we do see a ~2x performance increase for forw+back. <img width="1684" alt="Screenshot 2023-07-20 at 3 47 47 PM" src="https://github.com/pytorch/pytorch/assets/32754868/fdd26e07-0077-4878-a417-f3a418b6fb3b"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/105602 Approved by: https://github.com/huydhn, https://github.com/cpuhrsch	2023-09-01 22:14:44 +00:00
lezcano	2a6ef9b04d	[dynamo] Avoid recompilation when the PyTorch function accepts scalars (#108162 ) Before, it would create a 0D tensor with the input, which would incur a guard and specialisation. It's not clear whether the guard and specialisation are the right behaviour when we create 0D tensors, but that's a story for another day. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108162 Approved by: https://github.com/ev-br, https://github.com/peterbell10	2023-09-01 14:35:42 +00:00
Yanbo Liang	dabdb97087	[Dynamo] Graph break on functions using tensor out variants (#108182 ) Fixes #108021 Pull Request resolved: https://github.com/pytorch/pytorch/pull/108182 Approved by: https://github.com/eellison	2023-08-31 17:49:14 +00:00
Evgeni Burovski	01dfa7620d	MAINT: np.unique works with f16 directly (#108228 ) (follow up on gh-107768) Remove a f16->f32 workaround from np.unique, since torch.unique and np.unique seem to just work with float16 tensors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108228 Approved by: https://github.com/lezcano	2023-08-31 16:21:13 +00:00
Yukio Siraichi	6ad5568cbc	Break graph on `manual_seed`. (#107594 ) Fix: #107187 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107594 Approved by: https://github.com/eellison	2023-08-30 17:24:11 +00:00
PyTorch MergeBot	4e47ea5131	Revert "Break graph on `manual_seed`. (#107594 )" This reverts commit `6c28de2437`. Reverted https://github.com/pytorch/pytorch/pull/107594 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it seems to cause failures in trunk on inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_uniform_cuda_float, likely a landrace ([comment](https://github.com/pytorch/pytorch/pull/107594#issuecomment-1697783965))	2023-08-29 16:38:01 +00:00
Yukio Siraichi	6c28de2437	Break graph on `manual_seed`. (#107594 ) Fix: #107187 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107594 Approved by: https://github.com/eellison	2023-08-29 12:59:57 +00:00
voznesenskym	5d85d897e0	Torchrec Enablement Fixes - Re-PR 107910 (#108018 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108018 Approved by: https://github.com/wconstab	2023-08-28 19:47:53 +00:00
kobecai	356b8f6339	[dynamo]bugfix:implement numel() for SizeVariable (#107944 ) fix the issue that SizeVariable does not support numel() method Fixes #106407 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107944 Approved by: https://github.com/Skylion007	2023-08-28 17:54:57 +00:00
Jason Ansel	f877d0a4bf	[dynamo] Treat monkey patched .forward as dynamic (#107104 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107104 Approved by: https://github.com/anijain2305	2023-08-26 01:41:29 +00:00
PyTorch MergeBot	eefce56b66	Revert "[dynamo] Treat monkey patched .forward as dynamic (#107104 )" This reverts commit `79b3a9f945`. Reverted https://github.com/pytorch/pytorch/pull/107104 on behalf of https://github.com/ZainRizvi due to Breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/107104#issuecomment-1692072018))	2023-08-24 16:55:33 +00:00
Jason Ansel	79b3a9f945	[dynamo] Treat monkey patched .forward as dynamic (#107104 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107104 Approved by: https://github.com/anijain2305	2023-08-23 19:03:02 +00:00
lezcano	977aba7cfe	Revert the removal of a SampleInput for gather (#107776 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/107776 Approved by: https://github.com/peterbell10	2023-08-23 19:01:03 +00:00
lezcano	207b06d099	[dynamo] Wrap ndarray dunder methods (#107689 ) Fixes https://github.com/pytorch/pytorch/issues/107437 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107689 Approved by: https://github.com/ezyang ghstack dependencies: #107687, #107688, #107710, #107711, #107746	2023-08-23 13:55:36 +00:00
lezcano	fada0527fa	Dispatch take_along_axis to gather (#107711 ) Gather does the same thing, but it's much better supported in the `torch.compile` stack Pull Request resolved: https://github.com/pytorch/pytorch/pull/107711 Approved by: https://github.com/ezyang ghstack dependencies: #107687, #107688, #107710	2023-08-23 01:21:23 +00:00
Michael Voznesensky	02c2b750c5	Add support for GET_YIELD_FROM_ITER, YIELD_FROM, SEND (#106986 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106986 Approved by: https://github.com/jansel	2023-08-19 20:38:16 +00:00
Will Constable	eee2f57257	Raise TypeError for calling moduletype in dynamo (#107393 ) Fixes #107314 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107393 Approved by: https://github.com/williamwen42	2023-08-19 20:04:33 +00:00
Edward Z. Yang	5673c0874c	Use expect_true to make split with unbacked sizes work. (#106788 ) This pattern shows up in torchrec KeyedJaggedTensor. Most of the change in this PR is mechanical: whenever we failed an unbacked symint test due to just error checking, replace the conditional with something that calls expect_true (e.g., torch._check or TORCH_SYM_CHECK). Some of the changes are a bit more nuanced, I've commented on the PR accordingly. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106788 Approved by: https://github.com/lezcano ghstack dependencies: #106720	2023-08-15 20:31:30 +00:00
Michael Voznesensky	71a336ef75	[Dynamo x FSDP][1/x] Builder support for deque, appendleft (#106884 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106884 Approved by: https://github.com/ezyang	2023-08-11 03:26:12 +00:00
lezcano	a9dca53438	NumPy support in torch.compile (#106211 ) RFC: https://github.com/pytorch/rfcs/pull/54 First commit is the contents of https://github.com/Quansight-Labs/numpy_pytorch_interop/ We have already been using this in core for the last few months as an external dependency. This PR pulls all these into core. In the next commits, I do a number of things in this order - Fix a few small issues - Make the tests that this PR adds pass - Bend backwards until lintrunner passes - Remove the optional dependency on `torch_np` and simply rely on the upstreamed code - Fix a number of dynamo tests that were passing before (they were not testing anything I think) and are not passing now. Missing from this PR (but not blocking): - Have a flag that deactivates tracing NumPy functions and simply breaks. There used to be one but after the merge it stopped working and I removed it. @lezcano to investigate. - https://github.com/pytorch/pytorch/pull/106431#issuecomment-1667079543. @voznesenskym to submit a fix after we merge. All the tests in `tests/torch_np` take about 75s to run. This was joint work by @ev-br, @rgommers, @honno and me. I did not create this PR via ghstack (which would have been convenient) as this is a collaboration, and ghstack doesn't allow for shared contributions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106211 Approved by: https://github.com/ezyang	2023-08-11 00:39:32 +00:00
youkaichao	bd3b6f1ab4	add a debug api to extract cache entry from code (#106673 ) Per the discussion with @jansel in https://dev-discuss.pytorch.org/t/how-are-guards-installed-on-frames-that-are-transient-objects/1415/7 , guards and compiled code live in `co_extra` field in pycodeobject, which cannot be accessed in a trivial way. This PR tries to add a debug API to extract the data from that field, which can make debugging torchdynamo much easier. The API is intended to be used for debug only, and should have no compatibility issues with the current system. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106673 Approved by: https://github.com/jansel	2023-08-08 16:33:46 +00:00
Michael Voznesensky	45c03b1ad4	Better dynamo dict support via SetVariable keys (#106559 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/106559 Approved by: https://github.com/ezyang	2023-08-07 20:20:06 +00:00
Yukio Siraichi	33e70e34a3	More readable Z3 expressions printer. (#106643 ) This PR makes Z3 expressions easier to read and understand by creating a custom printer for them. Z3 expressions can be printed in 2 forms: 1. Using the builtin `str(e)` function 2. Using the `e.sexpr()` method Problem is that (1) is a bit hard to read because its line breaks are not so intuitive. (2) is a bit nicer, but the `to_int` and `to_real` functions clutter things up. The custom printer is an improved `sexpr()` function: - Leaves everything in one line - Gets rid of `to_int` and `to_real` functions - Reconstruct the floor division operations - Merge commutative operation chains Pull Request resolved: https://github.com/pytorch/pytorch/pull/106643 Approved by: https://github.com/ezyang	2023-08-07 16:52:22 +00:00
Yanbo Liang	e190afb829	[Dynamo] Allow users to patch custom builtin functions and inline them (#106595 ) Fixes Meta internal user case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106595 Approved by: https://github.com/jansel	2023-08-04 23:47:09 +00:00
ydwu4	2f281949a5	[dynamo] resolve InlinedClosureVariable in InstructionTranslator stack (#106491 ) When inlining a function which loads a closure, its direct parent may not load that closure, so we cannot find the closure name in the parent's symbolic locals. In this PR, we fix it by recursively searching the parent instruction translator stack to resolve the closure. Background: This corner case was triggered while developing https://github.com/pytorch/pytorch/pull/105679. A small repro is added in the test of this PR, where outer is loaded by deep2 but not by deep. ```python def test_inline_closure_not_loaded_by_parent(self): def outer(a): return a + 1 def indirect(x): return direct(x) def direct(x): def deep2(c): return outer(c) def deep(c): return deep2(c) return deep(x) x = torch.randn(3) eager = indirect(x) counter = CompileCounter() compiled = torch._dynamo.optimize(counter)(indirect)(x) ``` Running the test, we have the following error before the PR: ``` Traceback (most recent call last): File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6584, in test_inline_closure_not_loaded_by_parent compiled = torch._dynamo.optimize(counter)(indirect)(x) File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 321, in _fn return fn(*args, **kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 481, in catch_errors return callback(frame, cache_size, hooks, frame_state) File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 543, in _convert_frame result = inner_convert(frame, cache_size, hooks, frame_state) File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 130, in _fn return fn(*args, **kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 362, in _convert_frame_assert return _compile( File "/home/yidi/local/pytorch/torch/_dynamo/utils.py", line 194, in time_wrapper r = func(*args, **kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 531, in _compile raise 
InternalTorchDynamoError(str(e)).with_traceback(e.__traceback__) from None File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 432, in _compile out_code = transform_code_object(code, transform) File "/home/yidi/local/pytorch/torch/_dynamo/bytecode_transformation.py", line 1028, in transform_code_object transformations(instructions, code_options) File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 417, in transform tracer.run() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2067, in run super().run() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run and self.step() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step getattr(self, inst.opname)(inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper return inner_fn(self, inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION self.call_function(fn, args, {}) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function self.push(fn.call_function(self, args, kwargs)) File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 261, in call_function return super().call_function(tx, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function return tx.inline_user_function_return( File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call return cls.inline_call_(parent, func, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_ tracer.run() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run and self.step() File 
"/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step getattr(self, inst.opname)(inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper return inner_fn(self, inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION self.call_function(fn, args, {}) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function self.push(fn.call_function(self, args, kwargs)) File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function return tx.inline_user_function_return( File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call return cls.inline_call_(parent, func, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_ tracer.run() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run and self.step() File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step getattr(self, inst.opname)(inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper return inner_fn(self, inst) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION self.call_function(fn, args, {}) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function self.push(fn.call_function(self, args, kwargs)) File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function return tx.inline_user_function_return( File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs) File 
"/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call return cls.inline_call_(parent, func, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2227, in inline_call_ sub_locals, closure_cells = func.bind_args(parent, args, kwargs) File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 471, in bind_args result[name] = parent.symbolic_locals[name] torch._dynamo.exc.InternalTorchDynamoError: outer from user code: File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6570, in indirect return direct(x) File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6579, in direct return deep(x) File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6577, in deep return deep2(c) Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information You can suppress this exception and fall back to eager by setting: import torch._dynamo torch._dynamo.config.suppress_errors = True To execute this test, run the following from the base repo dir: python test/dynamo/test_misc.py -k test_inline_closure_not_loaded_by_parent This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 ---------------------------------------------------------------------------------------------------------------------------- Captured stdout call ----------------------------------------------------------------------------------------------------------------------------- frames [('total', 1)] inline_call [] ---------------------------------------------------------------------------------------------------------------------------- Captured stderr call ----------------------------------------------------------------------------------------------------------------------------- [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] 
skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping helper /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py [2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping enable_dynamic /home/yidi/local/pytorch/torch/_dynamo/eval_frame.py [2023-08-02 15:48:36,561] torch._dynamo.symbolic_convert: [INFO] Step 1: torchdynamo start tracing indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569 TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569 def indirect(x): [2023-08-02 15:48:36,591] torch._dynamo.variables.builder: [DEBUG] wrap_to_fake L['x'] (3,) [<DimDynamic.STATIC: 2>] [None] TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6570 return direct(x) [2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF direct [] [2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [UserFunctionVariable()] [2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserFunctionVariable(), TensorVariable()] [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572> TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6572 (inline depth: 1) def direct(x): TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6573 (inline depth: 1) def deep2(c): [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE 
outer [] [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [InlinedClosureVariable()] [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep2 at 0x7fbe4d3666b0, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6573> [TupleVariable()] [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep2 [TupleVariable(), ConstantVariable(code)] [2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF deep2 [NestedUserFunctionVariable()] TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 1) def deep(c): [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE deep2 [] [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576> [TupleVariable()] [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep [TupleVariable(), ConstantVariable(code)] [2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] [2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST deep [NestedUserFunctionVariable()] TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6579 (inline depth: 1) return deep(x) [2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE 
LOAD_FAST deep [] [2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [NestedUserFunctionVariable()] [2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] [2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576> TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 2) def deep(c): TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6577 (inline depth: 2) return deep2(c) [2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF deep2 [] [2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST c [NestedUserFunctionVariable()] [2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] [2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes [2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576> [2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes [2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572> [2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes ``` Test Plan: add new test Pull Request resolved: https://github.com/pytorch/pytorch/pull/106491 Approved by: https://github.com/williamwen42, https://github.com/jansel, https://github.com/zou3519	2023-08-03 16:45:42 +00:00
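The fix described in the commit above, searching up the chain of parent translators when the direct parent lacks the closure name, can be sketched with a toy model. The names `ToyTranslator` and `resolve_closure` are hypothetical and this is not the actual Dynamo code; the sketch walks the parent chain iteratively, which is equivalent to the recursive search the message mentions.

```python
from typing import Dict, Optional


class ToyTranslator:
    """Hypothetical stand-in for one frame in the instruction translator stack."""

    def __init__(
        self,
        symbolic_locals: Dict[str, object],
        parent: Optional["ToyTranslator"] = None,
    ) -> None:
        self.symbolic_locals = symbolic_locals
        self.parent = parent


def resolve_closure(tx: ToyTranslator, name: str) -> object:
    """Return the first binding of `name` found in tx or any of its ancestors."""
    current: Optional[ToyTranslator] = tx
    while current is not None:
        if name in current.symbolic_locals:
            return current.symbolic_locals[name]
        current = current.parent  # the direct parent did not load it: keep climbing
    raise KeyError(name)


# Mirrors the shape of the repro: only the outermost frame knows about `outer`.
root = ToyTranslator({"outer": "<outer fn>"})
direct = ToyTranslator({"x": "<tensor>"}, parent=root)
deep = ToyTranslator({"c": "<tensor>"}, parent=direct)  # never loads `outer`
```

Looking only at `deep.parent.symbolic_locals` would fail here, which is the same shape of failure as the `bind_args` lookup at the bottom of the traceback above.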
Edward Z. Yang	76163a56c0	Refactor stack handling to always use TracingContext to populate real stack on exception (#106277 ) The basic gist of the PR is simple, but it's accompanied with some careful modifications and unit tests to make sure I got it right. Check inline comments for more details. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106277 Approved by: https://github.com/albanD, https://github.com/voznesenskym	2023-08-02 00:09:16 +00:00
Yukio Siraichi	e514386315	Normalize builtin types to dtypes. (#106074 ) Fix: #105052 Follow-up: #105588 This PR normalizes builtin Python types (e.g. `int` and `float`) into PyTorch data types when these are passed as argument, instead of used as functions. In summary, we: - Implement `BuiltinVariable.as_proxy`, mapping Python types into PyTorch data types Pull Request resolved: https://github.com/pytorch/pytorch/pull/106074 Approved by: https://github.com/ezyang, https://github.com/lezcano	2023-08-01 13:32:19 +00:00
Edward Z. Yang	7b9d250f06	Change _dynamo.export to be export(f)(*args, **kwargs) (#106109 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/106109 Approved by: https://github.com/voznesenskym	2023-07-27 21:41:13 +00:00
Yukio Siraichi	707aadeedd	Track global Numpy variables as side-effect. (#105959 ) Fix: #105074 This PR makes dynamo handle Numpy global variables the same way as PyTorch tensor global variables by tracking them as side-effect. In summary, we add `NumpyNdarrayVariable` to the `VariableBuilder._can_lift_attrs_to_inputs` function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105959 Approved by: https://github.com/ezyang	2023-07-27 03:49:48 +00:00
Michael Voznesensky	aabdd2b7a1	Add support for tensor.tolist() for static sized int tensors (#105976 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105976 Approved by: https://github.com/ezyang	2023-07-26 08:13:22 +00:00
Mengwei Liu	cce2b7e3c9	[dynamo][numpy] Add support for builtin len() on numpy ndarray (#105691 ) Issue #105054
```
def fn(x):
    v = x.sum() / len(x)
    return v
```
This creates a graph break because we don't know how to handle the `__len__` method. The solution is to delegate it back to `TensorVariable`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105691 Approved by: https://github.com/ezyang	2023-07-21 03:50:40 +00:00
William Wen	777fc0bb58	[dynamo] fine-grained bytecode-source attribution in python 3.11 (#104676 ) Since Python 3.11 bytecode contains endline and column information, for each bytecode, we attribute the source code corresponding to the bytecode in a more accurate way. For example, we can highlight a function call in a series of nested function calls, or highlight a function call spanning multiple lines. Sample: ```python import torch import torch._dynamo from functorch.experimental.control_flow import cond def h(x): return x * 5 def true_fn(x): return x * 2 def false_fn(x): return x * 3 def f(pred, x): x = h( h(h(x)) ) x = x[1:][:2] torch._dynamo.graph_break() x = cond(pred, true_fn, false_fn, [x]) opt_f = torch.compile(f, backend="eager") opt_f(torch.tensor(True), torch.randn(3, 3, 3, 3)) ``` Output: ``` $ TORCH_LOGS="trace_call" python playground9.py TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16 h(h(x)) ~^^^ TRACE FX call mul from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1) return x * 5 ~~^~~ TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16 h(h(x)) ~^^^^^^ TRACE FX call mul_1 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1) return x * 5 ~~^~~ TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:15 x = h( ~^ h(h(x)) ^^^^^^^ ) ^ TRACE FX call mul_2 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1) return x * 5 ~~^~~ TRACE FX call getitem from f /scratch/williamwen/work/pytorch/playground9.py:18 x = x[1:][:2] ~^^^^ TRACE FX call getitem_1 from f /scratch/williamwen/work/pytorch/playground9.py:18 x = x[1:][:2] ~~~~~^^^^ TRACE inlined call true_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20 x = cond(pred, true_fn, false_fn, [x]) ~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TRACE FX call mul from true_fn /scratch/williamwen/work/pytorch/playground9.py:9 (inline depth: 1) return x * 2 ~~^~~ 
TRACE inlined call false_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20 x = cond(pred, true_fn, false_fn, [x]) ~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ TRACE FX call mul from false_fn /scratch/williamwen/work/pytorch/playground9.py:12 (inline depth: 1) return x * 3 ~~^~~ TRACE FX call cond from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20 x = cond(pred, true_fn, false_fn, [x]) ~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/104676 Approved by: https://github.com/ezyang	2023-07-20 17:18:52 +00:00
Yukio Siraichi	5ce5372d70	Create tensor from Numpy in current device. (#105546 ) Fix: #105046 This PR changes how tensors are created from Numpy arrays, when tracing with dynamo. Instead of using `from_numpy`, we use `as_tensor`. The latter takes into consideration the current device. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105546 Approved by: https://github.com/lezcano	2023-07-19 21:31:52 +00:00
Justin Chu	8a688277a2	[BE] Enable ruff's UP rules and autoformat dynamo / functorch and refs (#105432 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105432 Approved by: https://github.com/ezyang	2023-07-19 13:48:44 +00:00
Yukio Siraichi	0b6de0eb1c	Improve validator module behavior if Z3 is not installed. (#105168 ) Fixes: #105143 In summary, the changes are: - Check if Z3 is installed when the module is loaded - Naming consistently as "translation validation" (not "validator") - Skipping tests if Z3 is not installed Pull Request resolved: https://github.com/pytorch/pytorch/pull/105168 Approved by: https://github.com/ezyang	2023-07-19 13:11:22 +00:00
Michael Voznesensky	a6758cb304	Revert "Revert "SetVariable in dynamo (#103205 )"" + Fix for improved graph breaks (#105345 ) This reverts commit `94b3f9f646`. Fix Pull Request resolved: https://github.com/pytorch/pytorch/pull/105345 Approved by: https://github.com/atalman	2023-07-17 23:21:30 +00:00
Animesh Jain	95232c216b	[dynamo] Bugfix for enums (#105306 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105306 Approved by: https://github.com/yanboliang	2023-07-17 16:39:16 +00:00
PyTorch MergeBot	94b3f9f646	Revert "SetVariable in dynamo (#103205 )" This reverts commit `82fb5edfc7`. Reverted https://github.com/pytorch/pytorch/pull/103205 on behalf of https://github.com/atalman due to Failing cuda11.8-py3.10-gcc7-sm86 / test (inductor_torchbench_dynamic) with CUDA oom ([comment](https://github.com/pytorch/pytorch/pull/103205#issuecomment-1638115073))	2023-07-17 13:13:47 +00:00
Michael Voznesensky	82fb5edfc7	SetVariable in dynamo (#103205 ) Set initial Fixes https://github.com/pytorch/pytorch/issues/94738 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103205 Approved by: https://github.com/jansel	2023-07-15 02:25:31 +00:00
Mengwei Liu	fb376f80a2	[retry][dynamo][numpy] Add support for np.dtype (#105034 ) Original PR: #103546 Trying to support numpy function call in dynamo, with numpy dtype as argument. For example: ``` def fn(x: int): return np.empty_like(x, dtype=np.float64) ``` This currently doesn't work because `NumpyVariable` doesn't implement `as_proxy()`. The idea in `as_proxy()` for now is to convert `np.float64` and other np.<dtype> into `str` and then feed it into the corresponding `torch_np` method. The assumption here is that all `torch_np` methods that take a `dtype` kwarg will also accept a `str` as `dtype`. This assumption holds for `numpy`. For the previous example, we convert `np.float64` to `"float64"` in `as_proxy()` and then feed it into the `torch_np.empty_like()` method. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105034 Approved by: https://github.com/voznesenskym	2023-07-14 21:36:36 +00:00
Yukio Siraichi	8e01f75b1b	New `Mod` class for SymPy expressions. (#104968 ) This PR introduces a new `Mod` class to be used with SymPy expressions. The main reason is SymPy simplification errors (#97792). Pull Request resolved: https://github.com/pytorch/pytorch/pull/104968 Approved by: https://github.com/ezyang	2023-07-14 13:34:52 +00:00
PyTorch MergeBot	f01deb23d5	Revert "[dynamo][numpy] Add support for np.dtype (#103546 )" This reverts commit `0710791929`. Reverted https://github.com/pytorch/pytorch/pull/103546 on behalf of https://github.com/voznesenskym due to Failed on bench, unclear why bench test did not run on CI ([comment](https://github.com/pytorch/pytorch/pull/103546#issuecomment-1631203461))	2023-07-11 17:23:11 +00:00
Mengwei Liu	0710791929	[dynamo][numpy] Add support for np.dtype (#103546 ) ## Problem Trying to support numpy function call in dynamo, with numpy dtype as argument. For example: ``` def fn(x: int): return np.empty_like(x, dtype=np.float64) ``` ## Solution This currently doesn't work because `NumpyVariable` doesn't implement `as_proxy()`. The idea in `as_proxy()` for now is to convert `np.float64` and other np.<dtype> into `torch.dtype` and then feed it into the corresponding `torch_np` method. For the previous example, we convert `np.float64` to `torch.float64` in `as_proxy()` and then feed it into the `torch_np.empty_like()` method. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103546 Approved by: https://github.com/ezyang	2023-07-11 06:29:15 +00:00
Yukio Siraichi	d5dbe77629	Fix mod semantics for `Z3Ops`. (#104827 ) Python `mod` semantics are not the same as the mathematical modulus operation. According to the Python reference: `a = floor(a / b) * b + a % b`. In other words: `a % b = a - floor(a / b) * b`. This PR fixes the old implementation, which used SMT-LIB2 semantics for `mod`. In short, it only worked with integers and had the following guarantee: `0 <= a % b < b`. In summary, the changes are: - `a % b = a - floordiv(a, b) * b` - `a` and `b` can both be integer or real - The result will be real if any of the arguments is real. Otherwise, it will be integer Pull Request resolved: https://github.com/pytorch/pytorch/pull/104827 Approved by: https://github.com/lezcano	2023-07-10 23:35:04 +00:00
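The difference between the two `mod` conventions described in the commit above can be checked in plain Python. This is only an illustrative sketch, not the `Z3Ops` code: `python_mod` and `smtlib_mod` are hypothetical helper names, and `smtlib_mod` assumes the SMT-LIB2 integer convention in which the remainder is never negative.

```python
def python_mod(a, b):
    """Python-style remainder: a - floordiv(a, b) * b (the sign follows b)."""
    return a - (a // b) * b


def smtlib_mod(a, b):
    """Assumed SMT-LIB2 integer `mod`: remainder r with 0 <= r < |b|."""
    return a % abs(b)


for a, b in [(7, 3), (-7, 3), (7, -3), (7.5, 2), (-7.5, 2)]:
    # The identity from the commit message agrees with Python's own operator.
    assert python_mod(a, b) == a % b

print(python_mod(7, -3))  # -2: Python semantics, result takes the divisor's sign
print(smtlib_mod(7, -3))  # 1: non-negative remainder under the assumed SMT-LIB2 rule
```

The two functions agree whenever the divisor is positive and the operands are integers, which is why the mismatch only shows up for negative divisors or real-valued arguments.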
Michael Lazos	86680a6c0b	[dynamo] handle calls to typing.cast (#104799 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104799 Approved by: https://github.com/jansel	2023-07-10 21:05:17 +00:00
Michael Lazos	0433cb0596	[dynamo] simulate tracing tree_map_only (#104815 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/104815 Approved by: https://github.com/voznesenskym	2023-07-10 18:05:35 +00:00
Yukio Siraichi	40b8d10d5e	Re-land: Turn translation validation on for tests and accuracy runs by default. (#104467 ) Re-landing: #103611 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104467 Approved by: https://github.com/malfet	2023-07-05 19:01:50 +00:00
Edward Z. Yang	2385dad4b3	Enable automatic_dynamic_shapes by default (#103623 ) Some notes: * I now manually turn off `_generate` jobs from running with cudagraphs, as it is unrealistic to expect to cudagraph autoregressive generation up to max sequence length, this would imply compiling the entire unrolled sequence generation. Concretely, cm3leon_generate was timing out post this change, likely due to the compile time slowdown of dynamic shapes ON TOP OF accidentally unrolling all the loops * A few torch._dynamo.reset tactically inserted to force recompiles on tests that expected it * expectedFailureAutomaticDynamic flip into patching automatic_dynamic_shapes=False Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103623 Approved by: https://github.com/voznesenskym	2023-07-05 00:25:02 +00:00
PyTorch MergeBot	a2a8b4d415	Revert "Turn translation validation on for tests and accuracy runs by default. (#103611 )" This reverts commit `e311bed2a8`. Reverted https://github.com/pytorch/pytorch/pull/103611 on behalf of https://github.com/malfet due to Broke inductor tests ([comment](https://github.com/pytorch/pytorch/pull/103611#issuecomment-1614850276))	2023-06-30 15:54:18 +00:00
Yukio Siraichi	e311bed2a8	Turn translation validation on for tests and accuracy runs by default. (#103611 ) This PR turns translation validation on by default for tests and accuracy benchmark runs. It also installs Z3 on CI. The main changes are: - Add `--no-translation-validation` as an option in _test/run_tests.py_ - Set `PYTORCH_TEST_WITH_TV` environment variable - Add `TEST_WITH_TV` variable in _torch/testing/_internal/common_utils.py_ - Turn translation validation on for accuracy benchmarks in _benchmarks/dynamo/common.py_ - Add Z3 installation on CI scripts Pull Request resolved: https://github.com/pytorch/pytorch/pull/103611 Approved by: https://github.com/ezyang	2023-06-30 01:32:21 +00:00
William Wen	998c07799f	[dynamo] fix deep nested closure cell KeyError (#104222 ) Fix https://github.com/pytorch/pytorch/issues/99639 by handling the case in `InliningInstructionTranslator`'s `LOAD_CLOSURE` definition when the requested cell is not in `self.closure_cells`. My intuition is that the behavior of `LOAD_DEREF` and `STORE_DEREF` on a cell/freevar should not depend on whether or not we called `LOAD_CLOSURE` (that is, we shouldn't create a new cell var in `LOAD_CLOSURE` like in https://github.com/pytorch/pytorch/pull/101357). But we need a way to push cells created by the inlined function that were not present in the caller - `InlinedClosureVariable` is used to differentiate these cells from other cells. Adding this test causes an error though (EDIT: this test is not relevant to this PR and instead just reveals that `cond` with Python side effects is still broken): ```python def test_closure_out_of_scope_cell_with_cond(self): from functorch.experimental.control_flow import cond cell1 = torch.rand(3, 3) cell2 = torch.rand(3, 3) orig3 = torch.rand(3, 3) def test(x): cell3 = orig3.clone() def then(): nonlocal cell3 cell3 += cell1 return cell3 def els(): nonlocal cell3 cell3 += cell2 return cell3 return cond(x > 0, then, els, []) opt_fn = torch._dynamo.optimize("eager")(test) result1 = opt_fn(1) self.assertTrue(torch.allclose(result1, orig3 + cell1)) result2 = opt_fn(-1) self.assertTrue(torch.allclose(result1, orig3 + cell1 + cell2)) ``` ``` Traceback (most recent call last): File "/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1768, in test_closure_out_of_scope_cell_with_cond result1 = opt_fn(1) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/eval_frame.py", line 295, in _fn return fn(args, kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/eval_frame.py", line 448, in catch_errors return callback(frame, cache_size, hooks, frame_state) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 526, in 
_convert_frame result = inner_convert(frame, cache_size, hooks, frame_state) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 127, in _fn return fn(args, *kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 360, in _convert_frame_assert return _compile( File "/scratch/williamwen/work/pytorch2/torch/_dynamo/utils.py", line 180, in time_wrapper r = func(args, *kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 430, in _compile out_code = transform_code_object(code, transform) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/bytecode_transformation.py", line 1000, in transform_code_object transformations(instructions, code_options) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/convert_frame.py", line 415, in transform tracer.run() File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2029, in run super().run() File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 708, in run and self.step() File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 668, in step getattr(self, inst.opname)(inst) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 391, in wrapper return inner_fn(self, inst) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 1100, in CALL_FUNCTION self.call_function(fn, args, {}) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 559, in call_function self.push(fn.call_function(self, args, kwargs)) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 1061, in call_function (false_r, false_graph, false_lifted_freevars) = speculate_branch(False) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 1044, in speculate_branch ret_val, ret_graph, ret_lifted_freevars = speculate_subgraph( File 
"/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/torch.py", line 850, in speculate_subgraph output = f.call_function(tx, args, {}) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/functions.py", line 121, in call_function return tx.inline_user_function_return( File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 595, in inline_user_function_return result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2134, in inline_call return cls.inline_call_(parent, func, args, kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 2231, in inline_call_ tracer.run() File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 708, in run and self.step() File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 668, in step getattr(self, inst.opname)(inst) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/symbolic_convert.py", line 162, in impl self.push(fn_var.call_function(self, self.popn(nargs), {})) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/variables/builtin.py", line 497, in call_function proxy = tx.output.create_proxy( File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 345, in create_proxy return self.current_tracer.create_proxy(args, **kwargs) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1109, in create_proxy new_arg = self.lift_tracked_freevar_to_input(arg) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1226, in lift_tracked_freevar_to_input self.parent.lift_tracked_freevar_to_input(proxy) File "/scratch/williamwen/work/pytorch2/torch/_dynamo/output_graph.py", line 1219, in lift_tracked_freevar_to_input assert ( AssertionError: lift_tracked_freevar_to_input on root SubgraphTracer from user code: File 
"/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1766, in test return cond(x > 0, then, els, []) File "/scratch/williamwen/work/pytorch2/test/dynamo/test_misc.py", line 1764, in els cell3 += cell2 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/104222 Approved by: https://github.com/jansel	2023-06-28 17:54:13 +00:00
cdzhan	c06bb82ba1	fix specialization when you pass an unspec int into slicing on a Python list. (#104142 ) Fixes #103545 Pull Request resolved: https://github.com/pytorch/pytorch/pull/104142 Approved by: https://github.com/malfet, https://github.com/jansel	2023-06-28 13:13:07 +00:00
Michael Voznesensky	ec24f1e4cc	Simulate treespec flattening/unflattening (#101896 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/101896 Approved by: https://github.com/jansel, https://github.com/anijain2305	2023-06-23 10:53:15 +00:00
Nikita Shulga	cd05c3b98c	[BE] Use `TEST_MULTIGPU` from `common_cuda.py` (#103982 ) Comment about `TEST_CUDNN` called over and over has long been alleviated by wrapping the check with `LazyVal`, that caches the results. Also, delete unused `TEST_MAGMA`. Prep change for https://github.com/pytorch/pytorch/issues/100006 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103982 Approved by: https://github.com/atalman, https://github.com/janeyx99	2023-06-22 00:07:44 +00:00
Edward Z. Yang	7ce932a92c	Add signpost_event to dynamic_shapes (#103882 ) Added two signpost_event calls to torch.fx.experimental.symbolic_shapes, one for produce_guards (where we can give stats like how many free symbols and how many guards produced) and the other is for evaluate_expr after freeze (so we can look for cases where we're improperly discarding guards in backwards.) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103882 Approved by: https://github.com/Skylion007	2023-06-21 13:26:21 +00:00
Tugsbayasgalan Manlaibaatar	d4b85f3031	Support params/buffers inside cond and map (#102310 ) With #102022, params and buffers are always treated as a special case of free variables. In this PR, I switch the cond and map implementations to this method and deprecate the old tracing mechanism. Differential Revision: [D46746202](https://our.internmc.facebook.com/intern/diff/D46746202) Pull Request resolved: https://github.com/pytorch/pytorch/pull/102310 Approved by: https://github.com/avikchaudhuri, https://github.com/zou3519	2023-06-20 05:33:10 +00:00
Will Feng	9541053cca	[dynamo] support FakeTensor for SYM_INT/SYM_INT_LIST/INT_LIST param in python-to-cpp argument parsing (#103448 ) Before this PR, when compiling a function whose signature takes symint/symintlist/intlist, we got a runtime error like ```argument 'shifts' must be tuple of ints, not FakeTensor```. See the newly added unit test in test/dynamo/test_misc.py for a repro. After this PR, for a FakeTensor with empty size and numel()=1, we try to convert it into symint/symintlist. We will likely see the expected exception ```torch._subclasses.fake_tensor.DataDependentOutputException / aten._local_scalar_dense.default``` during conversion. Reference PRs: * we handle FakeTensor for symintlist as 1st varargs: https://github.com/pytorch/pytorch/pull/97508 * we handle FakeTensor for intlist in a similar way: https://github.com/pytorch/pytorch/pull/85759/files * call local_scalar_dense on a FakeTensor: `f7365eca90` Pull Request resolved: https://github.com/pytorch/pytorch/pull/103448 Approved by: https://github.com/yanboliang	2023-06-16 21:33:40 +00:00
Yanbo Liang	703875e364	[Reland][Dynamo] VariableTracker.recursively_contains should be updated correctly when mutation happens (#103564 ) (#103717 ) Summary: Reland of https://github.com/pytorch/pytorch/pull/103564 Test Plan: contbuild & OSS CI, see `5c3556da94` Differential Revision: D46783727 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103717 Approved by: https://github.com/angelayi	2023-06-16 04:25:27 +00:00
Edward Z. Yang	ed3a61afcc	Add automatic_dynamic_shapes test configuration (#103598 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103598 Approved by: https://github.com/Skylion007	2023-06-15 19:55:57 +00:00
PyTorch MergeBot	73be9842be	Revert "[Dynamo] VariableTracker.recursively_contains should be updated correctly when mutation happens (#103564 )" This reverts commit `5c3556da94`. Reverted https://github.com/pytorch/pytorch/pull/103564 on behalf of https://github.com/ZainRizvi due to Broke internal builds ([comment](https://github.com/pytorch/pytorch/pull/103564#issuecomment-1593552435))	2023-06-15 18:40:51 +00:00
Edward Z. Yang	bc6ec97e02	Switch dynamic_shapes to True by default (#103597 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103597 Approved by: https://github.com/voznesenskym	2023-06-15 15:16:20 +00:00
PyTorch MergeBot	2087d32811	Revert "Support params/buffers inside cond and map (#102310 )" This reverts commit `766f236bad`. Reverted https://github.com/pytorch/pytorch/pull/102310 on behalf of https://github.com/huydhn due to The test is failing in trunk `766f236bad` ([comment](https://github.com/pytorch/pytorch/pull/102310#issuecomment-1592159710))	2023-06-15 00:29:20 +00:00
Edward Z. Yang	ddf4cd69ec	Delete ifdyn and ifunspec combinators (#103596 ) Replaced with expect tests for ease of updating. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103596 Approved by: https://github.com/voznesenskym	2023-06-15 00:14:17 +00:00
Tugsbayasgalan Manlaibaatar	766f236bad	Support params/buffers inside cond and map (#102310 ) With #102022, params and buffers are always treated as a special case of free variables. In this PR, I switch the cond and map implementations to this method and deprecate the old tracing mechanism. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102310 Approved by: https://github.com/avikchaudhuri, https://github.com/zou3519	2023-06-14 22:32:33 +00:00
Michael Voznesensky	aece6705d1	Move locals/globals to output graph, make it easier to access them anywhere (#103456 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/103456 Approved by: https://github.com/jansel	2023-06-14 20:04:33 +00:00
Edward Z. Yang	9946499228	Continue simplifying dynamic shapes tests (#103592 ) Remove the static by default / no automatic dynamic configuration as this is about to become the default. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103592 Approved by: https://github.com/voznesenskym, https://github.com/Skylion007	2023-06-14 19:35:51 +00:00
Yanbo Liang	5c3556da94	[Dynamo] VariableTracker.recursively_contains should be updated correctly when mutation happens (#103564 ) Fixes #103563 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103564 Approved by: https://github.com/jansel	2023-06-14 17:08:00 +00:00
Edward Z. Yang	2f5fef5912	Refactor tests for dynamic shapes (#103542 ) First, infra improvements: new combinator `expectedFailureDynamic` which subsumes expectedFailure calls in test_dynamic_shapes.py. It's just nicer to have these right with the test. Implementation in torch/_dynamo/testing.py and it works by putting an attr on the test, which is then converted into a real expectedFailure when we actually generate the dynamic shapes test class Next, some housekeeping: * test/dynamo/test_unspec.py accidentally was running mostly statically due to the `assume_static_by_default` config flip. Don't assume static by default and xfail some tests which regressed in that time. * New test file test/dynamo/test_config.py, for testing permutations of configuration options. `test_dynamic_shapes` got moved there. Finally, grinding through tests in a way that will make them more compatible with dynamic by default: * If the test explicitly requires dynamic_shapes=False, remove that patch (and probably xfail it) * If the test checks dynamic_shapes internally, remove that test and patch the test so it ALWAYS runs with dynamic_shapes (this is not coverage loss because we're going to switch the default) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103542 Approved by: https://github.com/anijain2305	2023-06-14 02:04:54 +00:00
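The mechanism described above (tag the test with an attribute, then convert the tag into a real `expectedFailure` only when the dynamic-shapes test class is generated) can be sketched with the standard `unittest` module alone. This is a simplified illustration, not the code in torch/_dynamo/testing.py: `expected_failure_dynamic`, `make_dynamic_cls`, the `_expected_failure_dynamic` attribute name, and the `dynamic` class flag are all hypothetical stand-ins.

```python
import unittest


def expected_failure_dynamic(fn):
    # Hypothetical marker: only tag the test, do not change how it runs here.
    fn._expected_failure_dynamic = True
    return fn


def make_dynamic_cls(cls):
    # Hypothetical generator: subclass `cls`, flip a flag, and turn every
    # tagged test into a real unittest.expectedFailure on the subclass only.
    attrs = {"dynamic": True}
    for name in dir(cls):
        fn = getattr(cls, name)
        if name.startswith("test_") and getattr(fn, "_expected_failure_dynamic", False):
            # Wrap in a fresh function so the base class's test is not mutated.
            def wrapper(self, _fn=fn):
                return _fn(self)
            attrs[name] = unittest.expectedFailure(wrapper)
    return type(f"Dynamic{cls.__name__}", (cls,), attrs)


class StaticTests(unittest.TestCase):
    dynamic = False

    def test_always_passes(self):
        self.assertEqual(1 + 1, 2)

    @expected_failure_dynamic
    def test_breaks_when_dynamic(self):
        # Stand-in for a test that only regresses in the dynamic configuration.
        self.assertFalse(self.dynamic)


DynamicStaticTests = make_dynamic_cls(StaticTests)


def run(cls):
    # Run every test in `cls` and return the collected unittest.TestResult.
    result = unittest.TestResult()
    unittest.defaultTestLoader.loadTestsFromTestCase(cls).run(result)
    return result
```

Calling `run(StaticTests)` runs both tests as ordinary passing tests, while `run(DynamicStaticTests)` records the tagged test as an expected failure. Keeping the marker next to the test body is the design point the commit message highlights.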
Michael Lazos	6c6c897d6b	Add graph break logging option instead of config flag (#103202 ) Make graph break logging a logging option vs a config setting Pull Request resolved: https://github.com/pytorch/pytorch/pull/103202 Approved by: https://github.com/yanboliang, https://github.com/anijain2305	2023-06-12 19:52:31 +00:00
Edward Z. Yang	49754f44ee	Rewrite size/stride/numel TensorVariable handling (#103438 ) The main concept behind this refactor is this: if we know that a size/stride/etc is constant, do NOT trace it into the graph, EXCEPT for any preexisting special cases that applied for static shapes. The refactor unfolds like this: 1. Delete the `dynamic_shapes` branches in torch/_dynamo/variables/builder.py which accept int/float/bool outputs. This is over-aggressive and we don't want to allow this (because if the operator returns a constant, we shouldn't have called wrap_fx_proxy in the first place.) This causes a bunch of failures because we are blindly feeding the result of size() call to wrap_fx_proxy when dynamic shapes is enabled. 2. Modify TensorVariable.call_method in torch/_dynamo/variables/tensor.py to avoid sending constant ints to wrap_fx_proxy. After normal specialization (which should be deleted, see https://github.com/pytorch/pytorch/pull/103434) we consult the fake tensor to see if the values in question have free variables or not. If they don't we short circuit tracing into graph. We only trace into graph if the operation in question is truly symbolic. Note that there is a near miss here: it's OK to trace x.size() call entirely into the graph, even if it doesn't have all dynamic shapes, because operator.getitem with int output is special cased in builder.py. This is a preexisting special case and I don't try to get rid of it. 3. It turns out that the change here also breaks torch_np compatibility layer. So I completely rewrite getattr handling in torch/_dynamo/variables/tensor.py to follow the same pattern (only trace into graph if truly dynamic). There's some minor housekeeping in torch/fx/experimental/symbolic_shapes.py and some test files. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/103438 Approved by: https://github.com/larryliu0820	2023-06-12 19:36:24 +00:00
Yukio Siraichi	7550ec16a4	Add support for dictionary with torch object keys. (#103158 ) Fixes: #101979 This PR adds support for dictionaries with torch object as keys in dynamo. The main problem was that, for example, the source built for `d[torch.float]` (`d` being a dictionary) was `ODictGetItemSource(GlobalSource('d'), index=torch.float)`. When `Source.name` method was called, we got `odict_getitem(G['d'], torch.float)`. Evaluating that string raised an error, since `torch` was only available in the global dictionary `G` as `G["torch"]`. Instead, this PR builds the source: `ODictGetItemSource(GlobalSource('d'), index=AttrSource(GlobalSource('torch'), 'float'))`. The to-be-evaluated string is correctly generated as: `odict_getitem(G['d'], G['torch'].float)`. Here's a minimal example that reproduces the error, before this PR: ```python import torch d = { torch.float16: torch.float32, } @torch.compile def f(): return torch.randn(3, dtype=d[torch.float16]) f() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/103158 Approved by: https://github.com/mlazos	2023-06-09 20:18:49 +00:00
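The guard-string problem described above can be reproduced without PyTorch. The sketch below uses simplified stand-ins: the three classes only mimic the `name()` behaviour quoted in the commit message and are not dynamo's real source classes, `fake_torch` is a hypothetical placeholder for the `torch` module, and `odict_getitem` is reduced to a plain dictionary lookup.

```python
from dataclasses import dataclass
from types import SimpleNamespace
from typing import Any


@dataclass(frozen=True)
class GlobalSource:
    global_name: str

    def name(self):
        return f"G[{self.global_name!r}]"


@dataclass(frozen=True)
class AttrSource:
    base: Any
    member: str

    def name(self):
        return f"{self.base.name()}.{self.member}"


@dataclass(frozen=True)
class ODictGetItemSource:
    base: Any
    index: Any

    def name(self):
        # If the index is itself a source, render it through the globals dict G.
        idx = self.index.name() if hasattr(self.index, "name") else repr(self.index)
        return f"odict_getitem({self.base.name()}, {idx})"


# Stand-in environment: guards only see `G` and `odict_getitem`, never `torch`.
fake_torch = SimpleNamespace(float="float-dtype-stand-in")
G = {"torch": fake_torch, "d": {fake_torch.float: "value"}}
scope = {"G": G, "odict_getitem": lambda d, k: d[k]}

new_expr = ODictGetItemSource(
    GlobalSource("d"), AttrSource(GlobalSource("torch"), "float")
).name()
old_expr = "odict_getitem(G['d'], torch.float)"  # string quoted in the commit message

print(new_expr)
print(eval(new_expr, dict(scope)))
try:
    eval(old_expr, dict(scope))
except NameError as exc:
    print("old guard string fails:", exc)
```

The old string fails because `torch` is not a name in the evaluation scope, whereas the new string reaches the same object through `G['torch']`.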
gmagogsfm	b4f3a6f58f	[Dynamo Hackathon] Add support for hasattr on TorchVariable (#103177 ) Fixes #101154 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103177 Approved by: https://github.com/yanboliang	2023-06-08 19:34:44 +00:00
ydwu4	3c896a5adb	[dynamo] fix torch.distributions lazy_attribute failure (#103208 ) Fixes #93340. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103208 Approved by: https://github.com/yanboliang	2023-06-08 17:26:54 +00:00
Tugsbayasgalan Manlaibaatar	91e82ba0a6	[PT2 Dynamo Hackathon] Fix simple bug in inline dict (#103187 ) Fixes: https://github.com/pytorch/pytorch/issues/101980 Pull Request resolved: https://github.com/pytorch/pytorch/pull/103187 Approved by: https://github.com/yanboliang	2023-06-08 07:16:13 +00:00
Yanbo Liang	d92bb036a4	[Dynamo] Fix if condition on UnspecializedNNModuleVariable (#102583 ) Fixes #102315 The root cause is for ```UnspecializedNNModuleVariable``` which extends from ```UserDefinedObjectVariable```, if ```__bool__``` is missing, we should use ```__len__``` to infer a truth value. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102583 Approved by: https://github.com/jansel	2023-06-03 03:42:15 +00:00
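The rule referenced above is ordinary Python truth-value testing: when a class has no `__bool__`, `__len__` decides. The following sketch shows that fallback order in isolation. `truth_value` is a hypothetical helper written for illustration, not dynamo's implementation, and `Container` is a stand-in for a module class that defines `__len__` only.

```python
class Container:
    """Stand-in for an object that defines __len__ but not __bool__."""

    def __init__(self, items):
        self.items = list(items)

    def __len__(self):
        return len(self.items)


class Plain:
    """Defines neither __bool__ nor __len__, so it is always truthy."""


def truth_value(obj):
    # Hypothetical helper mirroring Python's lookup order on the type.
    cls = type(obj)
    if hasattr(cls, "__bool__"):
        return bool(cls.__bool__(obj))
    if hasattr(cls, "__len__"):
        return cls.__len__(obj) != 0
    return True


for obj in (Container([]), Container([1, 2]), Plain(), 0, 3):
    # The helper agrees with the interpreter's own bool() on each object.
    assert truth_value(obj) == bool(obj)
```

An `if module:` check on an empty container-like object therefore takes the false branch, which is the behaviour the fix restores under tracing.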
Mengwei Liu	c304fddf68	[dynamo][numpy] Support graph break for numpy ndarray (#100839 ) Issue: #93684 In previous PRs #95849 and #99560 we redirected `numpy.`, `<tensor>.numpy()` calls to `torch_np.` methods and attributes by creating `NumpyNdarrayVariable` for those calls. We need to handle `NumpyNdarrayVariable` when a graph break happens. This PR does 2 things: 1. In `codegen.py` we make sure we can reconstruct the value wrapped by `NumpyNdarrayVariable` as a `torch_np.ndarray` on the stack whenever we recompile the subgraph. 2. In `builder.py` we wrap the value as a `NumpyNdarrayVariable` and save it as a graph input. ----- Starting from commit 6: ## A new design for supporting numpy in dynamo In short, the core concept doesn't change: we still convert `numpy` API calls to `torch_np` API calls. However, instead of wrapping a `torch_np.ndarray` in `NumpyNdarrayVariable`, the new design wraps a `torch.Tensor`. The reason for this change is that we need to keep `torch.Tensor` everywhere in the captured graph, so that it works well with the backend of dynamo. See discussions in https://github.com/Quansight-Labs/numpy_pytorch_interop/issues/142 for details. 
### Flow This is an example showing how we think about dynamo working on a simple function: ```python def f(x: torch.Tensor, y: torch.Tensor): a, b = x.numpy(), y.numpy() c = np.add(a, b) return torch.from_numpy(c) ``` [Flow diagram. Eager: each `torch.Tensor` input -> `.numpy()` -> `numpy.ndarray` -> `numpy.add` -> `numpy.ndarray` -> `torch.from_numpy` -> `torch.Tensor`. Traced: each `torch.Tensor` input -> `.detach()` -> `torch.Tensor` -> `torch_np.add` -> `torch_np.ndarray` -> `util.to_tensor` -> `torch.Tensor` -> `.detach()` -> `torch.Tensor`, with `torch_np.add` and `util.to_tensor` boxed together as the "wrapper on torch_np.add".] ### Approach `torch_np` APIs can take both `torch_np.ndarray` as well as `torch.Tensor`. What we need to do is to have a wrapper for these APIs to convert the return value back to `torch.Tensor`. This way only the wrapper is showing up in the captured graph, with `torch.Tensor`s as input and `torch.Tensor` as output. If we have a graph break or we've traced to the end of the program, we need to inspect all the `NumpyNdarrayVariable` in the stack and convert them back to `numpy.ndarray`, to make sure the compiled version is still behaving the same as the eager version. 
### Examples Here's an example of the graph generated: ```python def fn(x: np.ndarray, y: np.ndarray): a = x.real b = y.real torch._dynamo.graph_break() return np.add(a, 1), np.add(b, 1) ``` Graph generated: ``` [2023-05-16 10:31:48,737] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH __compiled_fn_0 <eval_with_key>.0 opcode name target args kwargs ------------- -------------- ---------------------------------------------------------- ---------------------- -------- placeholder l_x_ L_x_ () {} placeholder l_y_ L_y_ () {} call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_x_,) {} call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_y_,) {} call_function attr_wrapper <function attr_wrapper at 0x12e8693a0> (from_numpy, 'real') {} call_function attr_wrapper_1 <function attr_wrapper at 0x12e8693a0> (from_numpy_1, 'real') {} output output output ((),) {} [2023-05-16 10:31:48,908] torch._dynamo.output_graph.__graph: [DEBUG] TRACED GRAPH __compiled_fn_2 <eval_with_key>.1 opcode name target args kwargs ------------- ------------- ---------------------------------------------------------- ------------------------------- -------- placeholder l_a_ L_a_ () {} placeholder l_b_ L_b_ () {} call_function from_numpy <built-in method from_numpy of type object at 0x12b1fdc80> (l_a_,) {} call_function from_numpy_1 <built-in method from_numpy of type object at 0x12b1fdc80> (l_b_,) {} call_function wrapped_add <Wrapped function <original add>> (from_numpy, 1) {} call_function wrapped_add_1 <Wrapped function <original add>> (from_numpy_1, 1) {} output output output ((wrapped_add, wrapped_add_1),) {} ``` ### Changes * `codegen.py`: reconstruct `numpy.ndarray` from `NumpyNdarrayVariable` by adding bytecode to call `utils.to_numpy_helper()`. * `output_graph.py`: getting rid of legacy code that does exactly what `codegen.py` does, but which only handled the return case and not the graph break case. 
* `utils.py`: added helpers to convert `numpy.ndarray` to `torch.Tensor` and vice versa. Also added a wrapper class that takes in a function. In `__call__` it calls the function and converts its output to `torch.Tensor` (or a list of them). * `builder.py`: add method to wrap `numpy.ndarray` graph inputs into `NumpyNdarrayVariable`, by calling `torch.numpy` in the proxy. * `misc.py`: `numpy` API calls go into `NumpyVariable` and we find the function with the same name in the `torch_np` module, then wrap it with the wrapper defined in `utils.py`. * `tensor.py`, `torch.py`: proxy `tensor.numpy()` to be `torch.detach()` but wrap it with `NumpyNdarrayVariable`. Similarly, `torch.from_numpy()` -> `torch.detach()` but wrap it with `TensorVariable`. In `NumpyNdarrayVariable`, do a similar `torch_np.ndarray` to `torch.Tensor` wrapping for attributes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100839 Approved by: https://github.com/ezyang	2023-06-03 00:54:25 +00:00
Yanbo Liang	9fa82c90f7	[Dynamo] Correct UserDefinedObjectVariable.var_getattr on function/method type (#102580 ) Fixes #102329 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102580 Approved by: https://github.com/jansel	2023-06-01 05:04:13 +00:00
lantiankaikai	17166c2511	python_arg_parser to allow fake tensor element in symint_list when in dynamo mode #95424 (#97508 ) Failing mechanism on #95424 : In dynamo mode, a numpy.int_ passed to a 'shape'-like param (Sequence[Union[int, symint]]) is wrapped as a list with FakeTensor. However, in python_arg_parser, the parser expects int in symint_list but gets FakeTensor. Following #85759, this PR allows tensor elements in symint_list when in dynamo mode. This PR also fixes the tests below, which have a similar failing mechanism: pytest ./generated/test_huggingface_diffusers.py -k test_016 pytest ./generated/test_ustcml_RecStudio.py -k test_036 Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/97508 Approved by: https://github.com/yanboliang	2023-05-31 19:19:17 +00:00
Yanbo Liang	7b6438da9e	[Dynamo] Fix if condition on NNModuleVariable (#102335 ) Fixes #102315 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102335 Approved by: https://github.com/ngimel, https://github.com/jansel	2023-05-26 17:00:43 +00:00
Bin Bao	e6af31a5a2	[dynamo] Add astunparse dependency (#102120 ) Summary: https://github.com/pytorch/pytorch/pull/98488 implements CSE for dynamo guards, and it relies on astunparse to perform the optimization. `test_guards_cse_pass_single` was broken and later was fixed by introducing a check_and_skip_if_needed. This actually fixes the root cause on fbcode and should bring some perf gain internally. Test Plan: `buck2 test @//mode/opt //caffe2/test/dynamo:test_dynamo -- --exact 'caffe2/test/dynamo:test_dynamo - test_misc.py::DynamicShapesMiscTests::test_guards_cse_pass_single' --run-disabled` Reviewed By: malfet Differential Revision: D46126742 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102120 Approved by: https://github.com/malfet	2023-05-24 21:24:24 +00:00
Michael Voznesensky	ea5eaa8692	Remove config check in specialize (#102098 ) Fixes Pull Request resolved: https://github.com/pytorch/pytorch/pull/102098 Approved by: https://github.com/ezyang	2023-05-24 01:26:22 +00:00
Yanbo Liang	e132f09e88	[Dynamo] Fix test_cuda_set_device to restore device (#102049 ) Fixes #102025 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102049 Approved by: https://github.com/ngimel	2023-05-23 07:37:12 +00:00
Michael Voznesensky	4c1bc91f42	Support autograd.Function w/ grad (#99483 ) This PR adds support for tracing autograd.Function with grad. A few important bullet points outlining our approach: 1) Our goal is to verify soundness in order to add a call_function to the autograd.Function's `apply` to the graph. 2) We achieve (1) by either verifying soundness or rejecting soundness, by ensuring that both forward and backward of the autograd.Function are sound. 3) For the forward, if we verify soundness, we install its guards into the graph. 4) For the backward, if we verify soundness, we throw it out. However, backwards soundness verification is more onerous, and has a config driven set of banned attrs and methods for tensors. 1-4 above are achieved by turning the forward and backward into UserDefinedFunctionVariables, and inlining through them, relying on dynamo's soundness detection. If we graph break in these, we raise and treat them as unsound. As noted above, backwards is stricter yet. For the tracing, the safety comes from dynamo's HigherOrderOperator system. That system ensures that not only do we trace soundly, but that no new variables are lifted into inputs during the tracing, and that the forward and backwards are entirely self contained. Whenever we reject a function as unsound, we restore back, as usual. Due to some limitations in the lifting logic, we have an escape hatch we implemented for tensors that are known in forward, but cross into backwards through save_tensors (save) /saved_tensors (load). We escape hatch here to avoid having the known saved tensors coming from forward end up being accidentally treated as lifted variables (and rejected). This is sound, but a little hacky feeling. Additionally, due to some limitations in fx node removal, combined with how we produce subgraphs for the traces installed from HigherOrderOperators, we had to improve our node removal logic. 
In the event of a restore, we remove the old nodes from the graph, as usual in dynamo. However, because references to these nodes may exist in subgraphs, we traverse each node's users and remove them first if and only if they are in another graph. This is always sound, because removal should only be downstream of restoration at this point. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99483 Approved by: https://github.com/zou3519	2023-05-19 01:26:21 +00:00
Nikita Shulga	6f46716ee2	Fix/skip CSE tests on Python-3.8 without `astunparse` (#101805 ) If `astunparse` is not installed, following guard will be generated in `test_guard_function_builder_with_cse`: ```python def ___make_guard_fn(): def guard(L): if not (x[0].a < x[1].a * (3 - x[2].a)): return False if not (a.b.c[0].d.e + a.b.c[1].d.e * a.b.c[2].d.e > 0): return False if not (f(m.n[0], '0').x.y.z * f(m.n[0], '1').x.y.z * f(m.n[0], '2').x.y.z < 512): return False if not (self.g(a, b).k + (1 - self.g(a, b).k) <= m[0].a + self.g(a, b).k): return False return True return guard ``` Though, I have to say, hardcoding string comparison is pretty weird. Also, skip `test_guards_cse_pass_[single\|multiple]` if AST unparsing is missing. Fixes failure in a test introduced by https://github.com/pytorch/pytorch/pull/98488 copilot:poem Pull Request resolved: https://github.com/pytorch/pytorch/pull/101805 Approved by: https://github.com/atalman, https://github.com/ysiraichi	2023-05-18 23:14:35 +00:00
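The guard shown above repeats subexpressions such as `self.g(a, b).k`, which is what common-subexpression elimination targets. A rough sketch of the detection step follows, using the standard-library `ast.unparse` (available from Python 3.9, which is why an unparsing backport matters on 3.8). The function name `repeated_subexpressions` is hypothetical and this is not the Dynamo implementation.

```python
import ast
from collections import Counter


def repeated_subexpressions(expr: str) -> list:
    """Return the source text of subexpressions that occur more than once."""
    tree = ast.parse(expr, mode="eval")
    counts = Counter()
    for node in ast.walk(tree):
        # Only calls, attribute accesses and subscripts are counted here;
        # bare names and constants are already as cheap as a temporary.
        if isinstance(node, (ast.Call, ast.Attribute, ast.Subscript)):
            counts[ast.unparse(node)] += 1
    return sorted(src for src, n in counts.items() if n > 1)


guard = "self.g(a, b).k + (1 - self.g(a, b).k) <= m[0].a + self.g(a, b).k"
candidates = repeated_subexpressions(guard)
```

A full CSE pass would then bind each candidate to a temporary and rewrite the expression; only the counting step is sketched here.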
Yanbo Liang	29de581764	[Dynamo] Graph break on torch.cuda.set_device() (#101668 ) Fixes #97280 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101668 Approved by: https://github.com/jansel	2023-05-17 21:35:08 +00:00
Yukio Siraichi	f72f0119ec	Implement CSE for dynamo guards. (#98488 ) This PR extracted the CSE part of the code in #89707. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98488 Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/anijain2305	2023-05-17 10:47:24 +00:00
blzheng	65412f95f0	[dynamo] Graph break on ops having inplace_view tag (#100787 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100787 Approved by: https://github.com/jgong5, https://github.com/eellison, https://github.com/jansel	2023-05-14 11:42:35 +00:00
Edward Z. Yang	2621fbda7d	Turn on anomaly detection for AOTAutograd backward tracing (#101047 ) Previously, anomaly detection was only enabled on the inner forward function, and not on the overall joint function that calls backward. I believe this impeded us from printing "this is the forward that triggered the backward" because that printing only happens if anomaly mode is enabled when you run backward(). This PR fixes it. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/101047 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2023-05-11 03:38:20 +00:00
Yanbo Liang	075d36d37f	[Dynamo] Fix nested function resume execution (#100426 ) Fixes #99665 Let me explain the root cause using the unit test I added: * This bug is triggered when: * ```wrapped``` is a nested function. * ```wrapped``` is in another module which is different from the main function ```fn```. * There is a graph break inside of ```wrapped```. * The root cause is that, when resuming a nested function, we were using the global variables of the outermost function (```fn``` in my example), but ```wrapped``` calls ```inner_func```, which is not part of ```fn```'s globals, so we have to set the correct globals when a nested function resumes execution. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100426 Approved by: https://github.com/jansel	2023-05-11 03:10:23 +00:00
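The globals problem described above can be reproduced in plain Python: a function resolves global names through its own `__globals__`, so rebuilding it against a different namespace changes what it can see. In this sketch `other_globals` stands in for the module that defines `inner_func`, and `resumed_ok` / `resumed_bad` are hypothetical stand-ins for resume functions; none of this is Dynamo code.

```python
import types

# Hypothetical "other module" namespace that defines inner_func.
other_globals = {}
exec("def inner_func(x):\n    return x + 1\n", other_globals)


def wrapped(x):
    # inner_func is looked up in the globals this function was built with.
    return inner_func(x) * 2


# Rebuild the same code object against the right and the wrong globals.
resumed_ok = types.FunctionType(wrapped.__code__, other_globals, "resumed_ok")
resumed_bad = types.FunctionType(wrapped.__code__, {}, "resumed_bad")
```

Calling `resumed_ok(3)` finds `inner_func`, while `resumed_bad(3)` raises `NameError`, which mirrors resuming with the outermost function's globals.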
William Wen	7da8705f18	[dynamo 3.11] fix segfault when printing stack trace (#99934 ) Dynamo will frequently segfault when attempting to print stack traces. We fix this by: - Fixing stack size calculations, as we did not account for exception tables - Creating shadow execution frames in a way that more closely resembles what CPython does to create its execution frames Dynamo/inductor-wrapped pytorch tests are enabled up the stack - those need to be green before this PR can be merged. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99934 Approved by: https://github.com/albanD, https://github.com/malfet, https://github.com/jansel	2023-05-09 22:12:45 +00:00
PyTorch MergeBot	4b8127b90e	Revert "[Dynamo] Fix nested function resume execution (#100426 )" This reverts commit `d719f0276d`. Reverted https://github.com/pytorch/pytorch/pull/100426 on behalf of https://github.com/jeanschmidt due to breaking internal builds ([comment](https://github.com/pytorch/pytorch/pull/100426#issuecomment-1540915913))	2023-05-09 21:32:13 +00:00
Aaron Gokaslan	8769fb854d	[BE] Fix flake8 B027 errors - missing abstractmethod decorator (#100715 ) Enables B027 and applies fixes by adding abstract method decorators. Autofix generated by ruff master. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100715 Approved by: https://github.com/ezyang	2023-05-09 17:28:48 +00:00
Yanbo Liang	d719f0276d	[Dynamo] Fix nested function resume execution (#100426 ) Fixes #99665 Let me explain the root cause using the unit test I added: * This bug is triggered when: * ```wrapped``` is a nested function. * ```wrapped``` is in another module which is different from the main function ```fn```. * There is a graph break inside of ```wrapped```. * The root cause is that, when resuming a nested function, we were using the global variables of the outermost function (```fn``` in my example), but ```wrapped``` calls ```inner_func```, which is not part of ```fn```'s globals, so we have to set the correct globals when a nested function resumes execution. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100426 Approved by: https://github.com/jansel	2023-05-06 05:04:50 +00:00
Animesh Jain	8994d9e610	[dynamo] Hide guard_fail_hook behind a flag to improve cache lookup time (+10% DebertaV2) (#100590 ) For TorchDynamo eager backend, DebertaV2 speedup improves from 0.77x to 0.87x. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100590 Approved by: https://github.com/voznesenskym, https://github.com/wconstab	2023-05-04 18:52:21 +00:00
Michael Voznesensky	ffcbd1c2de	Move tracked nn_modules from OutputGraph to TracingContext (#100457 ) Lint Pull Request resolved: https://github.com/pytorch/pytorch/pull/100457 Approved by: https://github.com/anijain2305	2023-05-03 02:00:11 +00:00
Michael Voznesensky	aafc6ce8cc	Produce constant variables in cases where a SymNode is created with a constant (#100144 ) ` AOT_DYNAMIC_SHAPES=1 TORCHDYNAMO_DYNAMIC_SHAPES=1 benchmarks/dynamo/huggingface.py --performance --training --amp --backend eager --disable-cudagraphs --device cuda --only AllenaiLongformerBase --explain` Looks promising! Goes from: Dynamo produced 173 graphs covering 2760 ops with 160 graph breaks (14 unique) To: Dynamo produced 6 graphs covering 2298 ops with 15 graph breaks (7 unique) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100144 Approved by: https://github.com/ezyang	2023-05-01 21:32:11 +00:00
PyTorch MergeBot	89c43f4108	Revert "Produce constant variables in cases where a SymNode is created with a constant (#100144 )" This reverts commit `d7bdfd3454`. Reverted https://github.com/pytorch/pytorch/pull/100144 on behalf of https://github.com/ezyang due to ci failure is real ([comment](https://github.com/pytorch/pytorch/pull/100144#issuecomment-1529587039))	2023-05-01 11:10:48 +00:00
Michael Voznesensky	d7bdfd3454	Produce constant variables in cases where a SymNode is created with a constant (#100144 ) ` AOT_DYNAMIC_SHAPES=1 TORCHDYNAMO_DYNAMIC_SHAPES=1 benchmarks/dynamo/huggingface.py --performance --training --amp --backend eager --disable-cudagraphs --device cuda --only AllenaiLongformerBase --explain` Looks promising! Goes from: Dynamo produced 173 graphs covering 2760 ops with 160 graph breaks (14 unique) To: Dynamo produced 6 graphs covering 2298 ops with 15 graph breaks (7 unique) Pull Request resolved: https://github.com/pytorch/pytorch/pull/100144 Approved by: https://github.com/ezyang	2023-04-30 17:13:57 +00:00
Animesh Jain	006785cd46	[dynamo][hf_bigbird] Actually graph break on tensor.unsqueeze_/resize_ (#99986 ) Currently, we return `unimplemented` w/o a graph break on seeing an x.unsqueeze_ when x is an input. This essentially means we fall back to the original frame. This PR actually graph breaks so that we can generate the continuation frame for the rest of the function. Instead of graph breaking at LOAD_ATTR, we delay the graph break to the actual CALL_FUNCTION, where it's cleaner to graph break. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99986 Approved by: https://github.com/jansel	2023-04-26 18:50:06 +00:00
Michael Voznesensky	e789de952f	Make sizevar addition work properly (#100015 ) Rm Pull Request resolved: https://github.com/pytorch/pytorch/pull/100015 Approved by: https://github.com/ezyang	2023-04-26 15:59:26 +00:00
Jiong Gong	e5c9a0fcf5	[dynamo] avoid graph break on repeat_interleave.self_int (#99528 ) Address convit_base failure: https://github.com/pytorch/torchdynamo/issues/1886 mentioned in https://github.com/pytorch/pytorch/issues/93777 Also for models like EleutherAI/gpt-j-6B. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99528 Approved by: https://github.com/ezyang	2023-04-25 04:47:39 +00:00
Michael Voznesensky	4c2892944f	Guard static shapes alongside tensors, instead of from shape_env, in dynamic_shapes=True (#99566 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99566 Approved by: https://github.com/ezyang	2023-04-22 16:46:52 +00:00
Jason Ansel	220712f4de	Fix torch.compile() on a skipped module (#98894 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98894 Approved by: https://github.com/xw285cornell	2023-04-22 16:10:55 +00:00
Edward Z. Yang	e47e8c9d98	Guard on default device (#99551 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99551 Approved by: https://github.com/voznesenskym, https://github.com/mlazos	2023-04-20 17:02:59 +00:00
Will Constable	98907589ee	Make GetItemSource(*, slice) hashable (#99379 ) All Sources must be hashable, since we are using set equality to check for duplicate sources in AOTAutograd. We should have a more systematic way of asserting this. For this PR just fix the local issue. Fixes #99145 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99379 Approved by: https://github.com/ezyang	2023-04-19 13:50:49 +00:00
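Why hashability matters for set-based duplicate checks can be sketched with a frozen dataclass. `ItemSource` below is a hypothetical stand-in, not the real `GetItemSource`, and storing the slice as a tuple is one possible approach rather than the change made in the PR.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class ItemSource:
    """Hypothetical source keyed by an index or a slice."""

    base: str
    index: Any

    def __post_init__(self):
        # Store slices as a plain tuple so the generated __hash__ works
        # regardless of whether slice objects themselves are hashable.
        if isinstance(self.index, slice):
            key = ("slice", self.index.start, self.index.stop, self.index.step)
            object.__setattr__(self, "index", key)

    def name(self) -> str:
        if isinstance(self.index, tuple) and self.index[:1] == ("slice",):
            return f"{self.base}[{slice(*self.index[1:])!r}]"
        return f"{self.base}[{self.index!r}]"


a = ItemSource("x", slice(0, 2))
b = ItemSource("x", slice(0, 2))
unique = {a, b}
```

Because equal sources hash equally, the set collapses `a` and `b` into one entry, which is the property a duplicate-source check depends on.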
William Wen	88c8c2b71b	[dynamo 3.11] implement 3.11 exceptiontable (#96511 ) Summary of changes: - Add CPython exceptiontable parsing/assembling functions in torch/_dynamo/bytecode_transformation.py, based on https://github.com/python/cpython/blob/3.11/Objects/exception_handling_notes.txt. - Add optional `exn_tab_entry` field to dynamo `Instruction`s in torch/_dynamo/bytecode_transformation.py in order to virtualize exception table entries (start, end, target instructions). - Add checks guarding against duplicate instructions in dynamo, so that jump/exceptiontable targets are unambiguous. See `get_indexof` in torch/_dynamo/bytecode_analysis.py. Ensure that bytecode generation throughout dynamo does not generate duplicate instructions. - Allow dynamo bytecode generation logic to generate nested exception table entries for developer convenience. CPython expects entries to not overlap, so we flatten nested entries during assembly in torch/_dynamo/bytecode_transformation.py:compute_exception_table. - Simulate the block stack in torch/_dynamo/symbolic_convert.py. CPython removed the block stack in 3.11, but dynamo needs it in order to keep track of active contexts. So we simulate the block stack as before by looking at exceptiontable entries in order to determine the current blocks. - Update context codegen in torch/_dynamo/resume_execution.py. The `SETUP_FINALLY` bytecode, which conveniently had a jump target to the finally block, was removed in 3.11, so we need to keep track of the jump target of the finally block using exceptiontables. Generating resume functions is more difficult since the original exceptiontable entries pointing to old cleanup code need to be modified to point to new cleanup code. 
- Fix a push_null bug in torch/_dynamo/variables/functions.py introduced by https://github.com/pytorch/pytorch/pull/98699 Pull Request resolved: https://github.com/pytorch/pytorch/pull/96511 Approved by: https://github.com/jansel, https://github.com/yanboliang, https://github.com/albanD	2023-04-18 07:53:24 +00:00
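The flattening of nested exception table entries mentioned above can be illustrated on plain integer ranges. This is a simplified sketch: `Entry` and `flatten` are hypothetical names, entries carry only a start, end and handler target (the real table also records stack depth and a lasti flag), and the innermost range is assumed to win.

```python
from typing import List, NamedTuple


class Entry(NamedTuple):
    start: int   # first covered instruction index (inclusive)
    end: int     # last covered instruction index (inclusive)
    target: int  # handler instruction index


def flatten(entries: List[Entry]) -> List[Entry]:
    """Turn properly nested ranges into non-overlapping ones."""
    if not entries:
        return []
    lo = min(e.start for e in entries)
    hi = max(e.end for e in entries)
    flat: List[Entry] = []
    for pos in range(lo, hi + 1):
        covering = [e for e in entries if e.start <= pos <= e.end]
        if not covering:
            continue
        # The innermost entry is the narrowest range covering this position.
        inner = min(covering, key=lambda e: e.end - e.start)
        if flat and flat[-1].end == pos - 1 and flat[-1].target == inner.target:
            # Extend the current run instead of starting a new entry.
            flat[-1] = flat[-1]._replace(end=pos)
        else:
            flat.append(Entry(pos, pos, inner.target))
    return flat


nested = [Entry(0, 9, 100), Entry(3, 5, 200)]
flat_entries = flatten(nested)
```

The outer range is split around the inner one, giving three non-overlapping entries that a table without overlap support can represent.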
Edward Z. Yang	e2923b521b	Further improve symbolic shapes logging (#99159 ) * Introduce a frame counter which lets us uniquely identify frames. This makes it easier to tell if you are recompiling the same frame * Shorten evaluate_expr to eval for more visual distinctiveness Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/99159 Approved by: https://github.com/Skylion007	2023-04-16 12:06:38 +00:00
Jason Ansel	f84078b40b	[dynamo] Remove pointless graphs from with no_grad() (#98956 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98956 Approved by: https://github.com/voznesenskym	2023-04-14 00:25:40 +00:00
Michael Voznesensky	ccc9a3d726	Automatic Dynamic Shapes (#98923 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98923 Approved by: https://github.com/ezyang	2023-04-13 02:39:23 +00:00
Animesh Jain	951df11af8	[dynamo] Raise exception on incorrect usage of disallow_in_graph (#98892 ) Summary - `disallow_in_graph` is mostly useful for backends. Suppose your backend does not support `torch.abs()`; you can then use `disallow_in_graph` to force a graph break. The assumption in the above statement is that `disallow_in_graph` is called on an `allowed` callable. `allowed` in Dynamo language refers to a callable that is put as-is in the Dynamo graph. Therefore, if one uses `disallow_in_graph` on some non-torch non-allowed function, we want to raise an exception to tell the user that they probably want something else. * If they want to disable Dynamo - they should use torch._dynamo.disable * If they wanted to stop inlining - they should use torch._dynamo.graph_break. However this is not a decorator. So, we need to provide another API. But, the question - who would want to do this? Pull Request resolved: https://github.com/pytorch/pytorch/pull/98892 Approved by: https://github.com/jansel	2023-04-12 07:50:56 +00:00
Animesh Jain	a2e0f5128c	[dynamo] Fix bug with torch._dynamo.skip (#98862 ) Summary * Fixed an issue with `skip` * Also removed some tests from test_misc.py and moved them to test_decorators.py as test_misc.py is becoming a dumping ground. ~~~ # Code - fn1 was not getting skipped earlier def fn2(x): return x.sin() @torch._dynamo.skip def fn1(x): x = x.sigmoid() return fn2(x.cos()) def fn(x): return fn1(x.tan()) # Extracted graph def forward(self, L_x_ : torch.Tensor): l_x_ = L_x_ tan = l_x_.tan(); l_x_ = None return (tan,) def forward(self, L_x_ : torch.Tensor): l_x_ = L_x_ sin = l_x_.sin(); l_x_ = None return (sin,) ~~~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/98862 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-04-11 23:20:08 +00:00
Guang Yang	c377a8590b	Add `nonzero_static()` op to pytorch to unblock export (#97417 ) Summary: Add new experimental python op (`torch.nonzero_static`) for export. There is NO cuda impl included in this PR Example: Say input tensor is `x = torch.tensor([[1, 0], [3, 2]])` call regular `nonzero()` on x will give you a tensor `tensor([[0, 0], [1, 0], [1, 1]])` call `nonzero_static(x, size=4)` on x will give you a tensor `tensor([[0, 0], [1, 0], [1, 1], [fill_value, fill_value]])` (padded) call `nonzero_static(x, size=2)` on x will give you a tensor `tensor([[0, 0], [1, 0]])` (truncated) Test Plan: Unit Tests ``` buck test @mode/dev-nosan //caffe2/test:test_dynamo -- 'caffe2/test:test_dynamo - test_export.py::ExportTests::test_export_with_nonzero_static' -- 'caffe2/test:test_dynamo - test_misc.py::MiscTests::test_nonzero_static' ``` PT2 Export with `nonzero_static()` Example of `GraphModule` in the exported graph ``` def forward(self, x): arg0, = fx_pytree.tree_flatten_spec(([x], {}), self._in_spec) nonzero_static_default = torch.ops.aten.nonzero_static.default(arg0, size = 4); arg0 = None return pytree.tree_unflatten([nonzero_static_default], self._out_spec) ``` Differential Revision: D44324808 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97417 Approved by: https://github.com/ezyang	2023-04-11 05:13:36 +00:00
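The pad-or-truncate behaviour described above can be emulated on nested lists without PyTorch. This is only an illustration of the semantics: `nonzero_static_2d` is a hypothetical helper, and the `fill_value=-1` default is an assumption made for the sketch.

```python
from typing import List


def nonzero_static_2d(
    x: List[List[int]], size: int, fill_value: int = -1
) -> List[List[int]]:
    """Return exactly `size` index rows: truncated, or padded with fill_value."""
    # Indices of non-zero elements in row-major order.
    indices = [
        [i, j] for i, row in enumerate(x) for j, v in enumerate(row) if v != 0
    ]
    # Pad with fresh rows so no two padding rows share the same list object.
    padding = [[fill_value, fill_value] for _ in range(max(0, size - len(indices)))]
    return indices[:size] + padding


x = [[1, 0], [3, 2]]
padded = nonzero_static_2d(x, size=4)
truncated = nonzero_static_2d(x, size=2)
```

The output length depends only on `size`, never on the data, which is what makes the result shape static for export.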
William Wen	117da58b65	[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 ) Enable most dynamo unittests for 3.11. There are a few tests that are skipped due to failures that will be addressed in upcoming PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98104 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym, https://github.com/albanD, https://github.com/jansel, https://github.com/jerryzh168, https://github.com/malfet	2023-04-10 20:04:10 +00:00
Yanbo Liang	a5f3468618	[Dynamo] Fix bug when dynamo generate guards for enum type (#98652 ) Fixes a Meta internal use case. I think this is actually an ```enum``` bug; we provide a workaround in dynamo. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98652 Approved by: https://github.com/jansel	2023-04-08 04:30:30 +00:00
PyTorch MergeBot	22411b6f02	Revert "[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 )" This reverts commit `0066f3405f`. Reverted https://github.com/pytorch/pytorch/pull/98104 on behalf of https://github.com/huydhn due to Sorry for reverting your PR, but it is failing on CPU 3.11 test in trunk `0066f3405f`. This is probably a landrace	2023-04-07 00:05:30 +00:00
William Wen	0066f3405f	[dynamo 3.11] enable dynamo unittests in 3.11 (#98104 ) Enable most dynamo unittests for 3.11. There are a few tests that are skipped due to failures that will be addressed in upcoming PRs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/98104 Approved by: https://github.com/yanboliang, https://github.com/voznesenskym, https://github.com/albanD, https://github.com/jansel, https://github.com/jerryzh168, https://github.com/malfet	2023-04-06 23:15:48 +00:00
chezhou	ce797795e1	Support `getattr` for ConstantVariable when compiling with Dynamo (#98153 ) This PR enables `getattr` on ConstantVariable by implementing its `call_hasattr` function. Fixes #97480 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98153 Approved by: https://github.com/ezyang	2023-04-06 16:48:24 +00:00
Edward Z. Yang	f98c1809a4	Add mark_static (#98427 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98427 Approved by: https://github.com/voznesenskym	2023-04-06 12:58:16 +00:00
Michael Voznesensky	ab95b7a05f	Support neg calls to dyn shapes (#94068 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94068 Approved by: https://github.com/jansel	2023-04-06 03:33:24 +00:00
Yanbo Liang	b1c2925493	[Dynamo] Support typing.Union and typing.Optional (#98384 ) Fixes #98265 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98384 Approved by: https://github.com/ezyang	2023-04-05 21:31:52 +00:00
Edward Z. Yang	69f9bd2323	Don't error if we mark_dynamic without dynamic_shapes on (#98324 ) In the terminal state, it won't matter if you have dynamic_shapes on or not, mark_dynamic will always work. Today, it's helpful to make this not error so I can easily swap between static or not and run experiments. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98324 Approved by: https://github.com/voznesenskym	2023-04-05 19:40:22 +00:00
Yanbo Liang	fd0be80dd1	[Dynamo] graph break when calling resize_() on graph input (#98279 ) Fixes #97921 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98279 Approved by: https://github.com/jansel, https://github.com/eellison	2023-04-04 20:39:12 +00:00
Michael Voznesensky	b1e60bfb6a	Pass f_locals as a dict rather than kwargs (#98107 ) Fixes https://github.com/pytorch/pytorch/issues/97688 One big problem is that instead of printing x < y we now print `E["x"] < E["y"]` and now all of the tests wobbled and I'm mad. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/98107 Approved by: https://github.com/ezyang	2023-04-04 00:30:08 +00:00
Michael Lazos	ee9a9b7add	Remove old logging callsites (#98095 ) Get around GH first issue, OSS only changes for https://github.com/pytorch/pytorch/pull/97182 Pull Request resolved: https://github.com/pytorch/pytorch/pull/98095 Approved by: https://github.com/anijain2305	2023-04-01 00:57:37 +00:00
Yanbo Liang	9be9592f28	[Dynamo] Code refactor: move context managers out of misc.py (#97958 ) misc.py and test_misc.py are too big, moving context managers to context.py and test_context.py. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97958 Approved by: https://github.com/ezyang, https://github.com/anijain2305, https://github.com/mlazos, https://github.com/voznesenskym	2023-03-31 23:15:39 +00:00
William Wen	762a2079c7	[dynamo 3.11] make create_instruction kwarg mandatory (#98032 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/98032 Approved by: https://github.com/albanD	2023-03-31 18:20:51 +00:00
William Wen	089134bf66	[dynamo 3.11] implement 3.11 linetable (#96509 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96509 Approved by: https://github.com/jansel	2023-03-31 18:20:28 +00:00
William Wen	14ef91cea6	[dynamo 3.11] small bug fixes (#96508 ) Bugs fixed: - CALL_FUNCTION_EX expects null pop in symbolic_convert - make_function_with_closure codegen requires a push_null - copy over the closure in eval_frame.c - add JUMP_FORWARD to terminal opcodes - enum repr fix in utils.py - fix symbolic_convert's break_graph_if_unsupported wrapper Pull Request resolved: https://github.com/pytorch/pytorch/pull/96508 Approved by: https://github.com/jansel	2023-03-31 18:18:12 +00:00
William Wen	05641b81e5	[dynamo 3.11] fix jump if (not) none (#96505 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96505 Approved by: https://github.com/jansel	2023-03-31 18:05:54 +00:00
Sam Gross	87f5e92916	[dynamo] Add guards for deterministic algos (#96695 ) Inductor now falls back to eager mode for deterministic algos. Add guards in dynamo to check if the deterministic algos mode changes. See #93537 Pull Request resolved: https://github.com/pytorch/pytorch/pull/96695 Approved by: https://github.com/ngimel, https://github.com/jansel	2023-03-31 16:28:45 +00:00
lantiankaikai	94bae36a1f	Fix strip_function_call in GuardBuilder (#97810 ) repo: from #92670. This addresses one of the bugs for TorchDynamo: pytest ./generated/test_PeterouZh_CIPS_3D.py -k test_003 Issue: In GuardBuilder, when parsing argnames, "getattr(a.layers[slice(2)][0]._abc, '0')" yields "getattr(a" where it is supposed to yield "a", causing a SyntaxError. This PR fixes the regex and adds a couple of test cases. Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/97810 Approved by: https://github.com/yanboliang	2023-03-30 17:46:10 +00:00
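A minimal sketch of the behavior described in the entry above, not the regex from the PR: it recovers the base variable name from a guard expression by peeling leading call wrappers. The helper name `base_name` and both patterns are hypothetical, and the sketch assumes the guarded object is always the first argument of each wrapper call.

```python
import re

# Hypothetical patterns for illustration; not the ones used in GuardBuilder.
_CALL_PREFIX = re.compile(r"\s*[A-Za-z_]\w*\(")  # a leading "name(" such as "getattr("
_IDENTIFIER = re.compile(r"\s*([A-Za-z_]\w*)")   # a leading plain identifier


def base_name(expr: str) -> str:
    """Return the leading variable name of a guard expression."""
    # Peel wrappers such as "getattr(" until the expression no longer
    # starts with a function call.
    while (m := _CALL_PREFIX.match(expr)) is not None:
        expr = expr[m.end():]
    # What remains should start with the variable itself.
    m = _IDENTIFIER.match(expr)
    return m.group(1) if m else expr


print(base_name("getattr(a.layers[slice(2)][0]._abc, '0')"))
```

Only the start of the string is ever matched, so nested calls that appear later in the expression (such as `slice(2)` inside an index) are not mistaken for wrappers.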
William Wen	24a5d006f2	[dynamo 3.11] Refactor create_instruction (#96499 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/96499 Approved by: https://github.com/jansel, https://github.com/albanD	2023-03-30 17:05:27 +00:00
Michael Lazos	e6909f6ccc	[Dynamo] Fix for tuple construction from tuple iterators (#97862 ) Fixes #93405 In short - when calling the builtin function `Tuple` on a list variable we added a list length guard. This paired with converting tuple iterators to a ListIteratorVariable resulted in this guard being improperly added. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97862 Approved by: https://github.com/yanboliang, https://github.com/jansel	2023-03-29 19:20:05 +00:00
Michael Lazos	e626be79a4	Add config setting to error on recompile (#97829 ) Adds a config setting `error_on_recompile`. When set, dynamo will raise an exception after compiling a function for the second time. This was requested to help debugging in pyper. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97829 Approved by: https://github.com/bertmaher	2023-03-29 19:00:43 +00:00
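To illustrate the idea behind an error-on-recompile switch, here is a toy sketch in plain Python. It does not use dynamo: `ToyCompiler`, `RecompileError`, and the type-based guard key are all hypothetical stand-ins, and the way the real setting decides that a recompile happened is not described in the entry above.

```python
class RecompileError(RuntimeError):
    """Raised when a function would be compiled a second time."""


class ToyCompiler:
    """Hypothetical stand-in for a compile cache with an error_on_recompile flag."""

    def __init__(self, error_on_recompile=False):
        self.error_on_recompile = error_on_recompile
        self._cache = {}  # fn -> {guard_key: "compiled" callable}

    def call(self, fn, *args):
        # Toy guard: a cached entry is valid only for the same argument types.
        key = tuple(type(a) for a in args)
        entries = self._cache.setdefault(fn, {})
        if key not in entries:
            # A miss on a function that already has an entry is a recompile.
            if entries and self.error_on_recompile:
                raise RecompileError(f"{fn.__name__} would be recompiled for {key}")
            entries[key] = fn  # stand-in for real compilation
        return entries[key](*args)


def add(x, y):
    return x + y


compiler = ToyCompiler(error_on_recompile=True)
print(compiler.call(add, 1, 2))  # first compile, cached for (int, int)
print(compiler.call(add, 3, 4))  # cache hit, no recompile
```

Calling `compiler.call(add, 1.0, 2.0)` afterwards would fail the toy guard and raise `RecompileError`, which is the kind of early signal such a flag is meant to give while debugging.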
Edward Z. Yang	8372c5dc68	Refactor dynamic dims api, stateless internals, higher level export API (#96699 ) The purpose of this API is to execute a few large components of work: 1) Refactor all the internals of plumbing dynamic dimension information after dynamo to be stateless 2) Decouple allocation controls around dynamic dimensions from verification 3) For (2), for allocation, create an enum that dictates whether we are in DUCK (default today), STATIC (aka assume_static_default in the past), or DYNAMIC (aka user constrained, do not duck shape) 4) For (2), for verification, we separate out the list of dynamic ranges entirely from allocation. This means shape_env does not track what we verify on; instead, it is the caller's job to invoke produce_guards() with the various things they want verified, specifically, with the valid ranges. We do use constrain ranges to refine value ranges when doing analysis. 5) We have decided, therefore, as an extension of (4), to double down on "late" checks versus "eager" checks, primarily because the mechanisms for gathering what actually matters happen during guards, and should be the purview of the caller seeking guards, not the shape env. However, for dynamo, these structures are essentially one and the same. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96699 Approved by: https://github.com/avikchaudhuri, https://github.com/ezyang	2023-03-29 16:55:49 +00:00
Yanbo Liang	f388bec985	[Dynamo] torch.Generator state should have a source and be reconstructed properly (#97403 ) Fixes #97077 partially. During FX graph propagation, we require every tensor to have a source: `a524123c91/torch/_dynamo/variables/builder.py (L929)` However, the output of ```torch.Generator.get_state()``` is a tensor without a source, since it's generated inside of the FX graph. My change follows what we did for [Python random functions](https://github.com/pytorch/pytorch/blob/master/torch/_dynamo/variables/user_defined.py#L260), adding a dedicated ```GeneratorStateSource```. We also have to update the reconstruction logic, since we reuse the ```TensorVariable``` reconstruction. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97403 Approved by: https://github.com/jansel, https://github.com/mlazos	2023-03-29 04:31:23 +00:00
Yanbo Liang	e3df6a7c8a	[Dynamo] Unspec int list if enabling dynamic_shapes (#97557 ) Fixes #97348 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97557 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-03-27 06:12:43 +00:00
nima10khodaveisi	13dcf635e0	Dynamo stride dim kwargs (#97444 ) Fixes #97441 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97444 Approved by: https://github.com/ezyang	2023-03-25 23:43:05 +00:00
Will Constable	e8a722b9cb	Fix missing dynamo cache lookup registration in profiler.profiler (#97305 ) This follows https://github.com/pytorch/pytorch/pull/96199 and supports the 'other' profiler. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97305 Approved by: https://github.com/voznesenskym	2023-03-22 21:09:16 +00:00
nima10khodaveisi	5537792307	[dynamo] handle dim in size kwargs (#96992 ) (#97098 ) Fixes #96992 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97098 Approved by: https://github.com/ezyang	2023-03-22 14:19:59 +00:00

460 Commits