Fixes #106893
There are two main changes:
- Before this PR, the function returned by once_differentiable was
included in skipfiles (because its .co_filename is
torch/autograd/function.py). This PR adds a mechanism to tell Dynamo
to inline a function even if it is included in skipfiles.
- A bugfix: when we are introspecting the backward, we need to turn
grad mode off. This accurately models the eager-mode semantics:
in eager-mode PyTorch, if second-order gradients were not requested, then
grad mode is off during the backward pass. torch.compile does not work with higher-order
gradients and just assumes we do first-order gradients, so this is OK.
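A minimal sketch (not taken from the PR; the function and class names are illustrative) of the pattern this enables: an autograd.Function whose backward is wrapped in once_differentiable, compiled with fullgraph=True.
```python
import torch
from torch.autograd.function import once_differentiable

class MyCube(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return x ** 3

    @staticmethod
    @once_differentiable  # defined in torch/autograd/function.py, so Dynamo must be told to inline it
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        return grad_out * 3 * x ** 2

@torch.compile(backend="eager", fullgraph=True)
def f(x):
    return MyCube.apply(x).sum()

f(torch.randn(3, requires_grad=True)).backward()
```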
Test Plan:
- new test
Differential Revision: [D49064185](https://our.internmc.facebook.com/intern/diff/D49064185)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108686
Approved by: https://github.com/voznesenskym
Summary:
Original commit changeset: 33650f7cb0fb
Original Phabricator Diff: D48833682
Test Plan: See T162942232 for how we figured out that this diff caused a significant numeric difference.
Reviewed By: voznesenskym
Differential Revision: D49082219
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108823
Approved by: https://github.com/xw285cornell
This combines a bunch of python global state guards into a single C++ guard and switches to checking them 100% of the time. It also adds a few new guards for things that change inductor's behavior. Even though we are checking more things, I expect this to be much faster.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108624
Approved by: https://github.com/anijain2305
**Motivation:**
We try to make torch.cond use torch.compile automatically so that we can error out when there are side effects in the branches and correctly handle closures.
Before this PR, the following code only produces a warning unless the config raise_on_backend_change is turned on (turning it on gives an error):
```python
def foo():
    ...

# Inside torch.cond, we'd like to do something like
torch.compile(foo, backend="eager", fullgraph=True)(...)

# Users may then call torch.compile somewhere else.
# Dynamo will use the cached code of foo for the "eager" backend,
# but we expect Dynamo to recompile with the "inductor" backend.
torch.compile(foo, backend="inductor")(...)
```
This PR adds a BACKEND_MATCH guard. Effectively, it implements a per-backend cache. In the above example, the cached code for "eager" won't work for "inductor" due to guard check failures and the second torch.compile will do a re-compilation. In the future, it might be useful to have something like a configuration guard that guards against dynamo configuration changes across different compiles (e.g. compile a function with fullgraph=False then compile it again with fullgraph=True).
**Implementation:**
1. We add a guarded_backend_cache and check the most_recent_backend against the backend associated with cached code. We also remove the raise_on_backend_change flag.
2. The newly added context manager and guard add more lines to the debug log, so we raise the upper limit from 50 to 55.
**Test Plan:**
Removed the original tests that raise on a backend change and added a new test checking that the BACKEND_MATCH guard can guard against backend changes.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107337
Approved by: https://github.com/jansel
Before this PR, we used get_fake_value to get the fake_sub_args and then called op(*fake_sub_args) to get the example value for out_dtype.
This causes a problem when the input proxy's op type is `get_attr`: get_fake_value for a `get_attr` node will look at the original param/buffer and **return a real tensor** instead of a fake tensor. This is OK for export, since export's fake_mode allows non_fake_inputs (see [here](https://github.com/pytorch/pytorch/blob/main/torch/_dynamo/output_graph.py#L278)). But it causes a problem when nesting cond with out_dtype, where cond uses torch.compile(fullgraph=True) to inspect out_dtype and finds that the inputs to op are a mix of FakeTensors and real tensors.
This PR changes how we get the example values from proxies by directly looking at node.meta["example_value"]. This metadata is guaranteed to exist for all proxies during dynamo tracing, so it's safe to use (it's also used by get_fake_value to get fake tensors from args for general ops, see [here](https://github.com/pytorch/pytorch/blob/main/torch/_dynamo/utils.py#L1318)).
Test Plan:
existing tests + removed the expected failure for one test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108715
Approved by: https://github.com/zou3519
Summary: Currently the node metadata "nn_module_stack" is only used by export. For some exported models, we still want to retain nn_module_stack for unspecialized modules for various purposes. This diff adds a path to also record nn_module_stack when an unspecialized module has a source available.
Test Plan: test_export_nn_module_stack_patched_module
Differential Revision: D48841193
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108281
Approved by: https://github.com/yanboliang, https://github.com/tugsbayasgalan
**This PR is a 99% copy-paste of Sam Gross's** (@colesbury) work at https://github.com/pytorch/pytorch/pull/100642. Copied from there
--------
The NN_MODULE guard now subsumes guards on Module attributes. The check_fn will fail if module attributes are changed (such as Module.training), if parameters, submodules, or buffers are added or removed, or if fields are changed on the type itself.
This gives up specificity in the guard check -- if any field is changed the check_fn fails -- for faster overall checks.
-----
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108528
Approved by: https://github.com/ezyang
From talking to @wconstab, we agreed that because of the way DDPOptimizer is written, it is (sort of) incompatible with inductor's `keep_output_stride=False` optimizations (and will cause silent correctness problems if you use them together). Added an assertion.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108235
Approved by: https://github.com/wconstab
ghstack dependencies: #108081
Currently numel only supports static shapes, but this expands it to support
generating symbolic arithmetic into the graph. e.g.
```
# x.size().numel with x.size() = [s0, 1, s1]
size = l_x_.size()
getitem = size[0]
getitem_2 = size[2]; size = None
mul = getitem * getitem_2; getitem = getitem_2 = None
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108239
Approved by: https://github.com/ezyang
In https://github.com/pytorch/pytorch/pull/106673 , I created a private API `_debug_get_cache_entry_list` to help pull out cache entries from compiled functions.
Recently, I find that @anijain2305 commented in the code that this API should be revisited, and so I created this PR.
First, this API cannot be removed even if the cache entry becomes a first-class Python class `torch._C._dynamo.eval_frame._CacheEntry`. The facts that `extra_index` is static and `get_extra_state` is inline static make them inaccessible elsewhere. This API `_debug_get_cache_entry_list` is the only way for users to get all the cache entries from code.
Second, since the `torch._C._dynamo.eval_frame._CacheEntry` class is a Python class, I simplified the C-part code and removed the need to create a namedtuple for it in the Python code.
Third, I also add a small improvement: if the argument is a function, we automatically pass its `__code__` to the API.
The above changes slightly alter the output, from a list of namedtuples to a list of `torch._C._dynamo.eval_frame._CacheEntry`. I will update the corresponding docs that use this API.
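A rough usage sketch of the API after this change (the import path and the `check_fn`/`code` attribute names are assumptions, not copied from this PR):
```python
import torch
from torch._dynamo.eval_frame import _debug_get_cache_entry_list

def f(x):
    return x + 1

opt_f = torch.compile(f, backend="eager")
opt_f(torch.randn(3))

# A function can now be passed directly; its __code__ is extracted automatically.
for entry in _debug_get_cache_entry_list(f):
    print(entry.check_fn, entry.code)
```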
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108335
Approved by: https://github.com/jansel, https://github.com/anijain2305
Summary:
During the convert step, observers are first replaced by a Q-DQ pair. In some
scenarios, like the following, the output DQ has a fan-out:
```
                 ---> OP2 -> Q -> DQ
                /
OP -> Q -> DQ -
                \
                 ---> OP3 -> Q -> DQ
```
If either OP2 or OP3 is configured to be quantized, then its input
is expected to be quantized. In this case the quantized equivalent of a
pattern that the quantizer asked to be quantized should look like
[DQ -> {pattern} -> Q]. However, in a scenario like the above, where the DQ node
is shared between multiple "quantized" patterns, the boundary of a "quantized"
pattern is not clear because the DQ now belongs to multiple quantized
patterns.
This poses a challenge for:
- Porting metadata: which "quantized" partition does this DQ node belong to?
- Quantized representation, which equivalently needs to identify a
self-contained quantized pattern that is replaced by its equivalent pattern
capturing the compute in the quantized precision.
Test Plan:
test_duplicate_dq_pass
Differential Revision: [D48663147](https://our.internmc.facebook.com/intern/diff/D48663147)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107900
Approved by: https://github.com/jerryzh168, https://github.com/andrewor14, https://github.com/leslie-fang-intel
ghstack dependencies: #107105, #107106, #107899
Fixes inference accuracy for `doctr_reco_predictor` and `pyhpc_turbulent_kinetic_energy`.
For the `same(float, float)` comparison we weren't going through the more rigorous tensor comparison path which takes into account the fp64 base results.
Also return True when the fp64 base results are not well formed (NaN).
I debugged these models and the sources of divergence were innocuous:
`doctr_reco_predictor` - can be fixed by turning off layout optimization or the decomp for batch norm
`pyhpc_turbulent_kinetic_energy` - divergence caused because the fused kernel keeps precision in fp32 instead of casting back and forth between fp32 and bf16. The fused kernel has better precision anyway.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108202
Approved by: https://github.com/jansel
Summary:
Enables dynamo eager mode tracing for the following situation:
1. we have a torch.autograd.Function
2. the input to that function is a tensor subclass which is an intermediary
This is useful for float8 training UX.
Test Plan:
```
python test/dynamo/test_autograd_function.py -k intermediary_input
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108093
Approved by: https://github.com/bdhirsh, https://github.com/wanchaol
Marks all params/optimizer state as static addresses and adds a finalizer which cleans up the graph attributes when the optimizer goes out of scope.
**Note:** this does not mark grads as static because that would increase memory usage significantly.
There are two cases:
1. The upstream graph is cudagraphed - this case will work fine OOTB
2. The upstream graph is not cudagraphed - in this case, there will be a lot of copies introduced from the upstream (to copy the grads) into cudagraphed-owned memory, unless the user explicitly marks the grads as static. If the user does this, this will also require not deallocating the grads in zero_grad() (either the mod or optimizer version) by setting them to zero vs None. There is a PR (https://github.com/pytorch/pytorch/pull/107853) in flight to throw an error if zero_grad attempts to set static grads to None.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107504
Approved by: https://github.com/eellison
Before this PR, `test_op_has_batch_rule_cholesky_solve_cpu_float32` failed:
```
PYTORCH_TEST_WITH_DYNAMO=1 pytest test/functorch/test_vmap.py -k test_op_has_batch_rule_cholesky_solve_cpu_float32
test/functorch/test_vmap.py terminate called after throwing an instance of 'pybind11::error_already_set'
what(): RuntimeError: /home/kshiteej/Pytorch/pytorch_functorch/build/aten/src/ATen/RegisterCompositeExplicitAutograd.cpp:2214: SymIntArrayRef expected to contain only concrete integers
```
After this PR, the test passes.
NOTE: We can't run 100% of the tests on CI until we figure out https://github.com/pytorch/pytorch/issues/107444
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107947
Approved by: https://github.com/zou3519
There is already some support for plumbing `__torch_dispatch__` tensor subclasses through dynamo, but this PR beefs it up a bit and adds a test. In particular:
(1) Fakeifying tensor subclasses didn't properly set autograd metadata (requires_grad, is_leaf) on the newly fakeified wrapper subclass. I don't actually have a test for this in this PR, but it's tested pretty heavily later in my aot autograd tests
(2) Fakeifying tensor subclasses didn't properly track source information for dynamic shapes on the inner tensors. I added a new `WrapperSubclassFieldSource` subclass, that represents a source coming from a tensor field on a wrapper subclass, which I use in the fakeifying logic, and again in symbolic_shapes.py to generate proper guards.
(3) `_make_wrapper_subclass()` marginally updated this code to work better with dynamic shapes. One thing that's a bit weird about `_make_wrapper_subclass`: it has two overloads, and the first explicitly does not support dynamic shapes (and the second.. does not support kwargs). I think that later we probably want to consolidate / at least make the first overload work with dynamic shapes, but I didn't want to handle that in this PR (so these smaller changes seemed like a strict improvement).
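For context, a minimal sketch of the kind of `__torch_dispatch__` wrapper subclass involved here, built with the first `_make_wrapper_subclass` overload (the class and its behavior are illustrative, not from this PR):
```python
import torch
from torch.utils._pytree import tree_map

class WrapperTensor(torch.Tensor):
    @staticmethod
    def __new__(cls, inner):
        # First overload: size plus dtype/device kwargs, no custom strides.
        return torch.Tensor._make_wrapper_subclass(
            cls, inner.shape, dtype=inner.dtype, device=inner.device,
            requires_grad=inner.requires_grad,
        )

    def __init__(self, inner):
        self.inner = inner  # inner tensor whose (possibly dynamic) sizes need a source

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        unwrap = lambda t: t.inner if isinstance(t, WrapperTensor) else t
        return func(*tree_map(unwrap, args), **tree_map(unwrap, kwargs or {}))

t = WrapperTensor(torch.randn(3))
print(t + 1)  # dispatches through __torch_dispatch__ on the inner tensor
```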
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107415
Approved by: https://github.com/ezyang
Summary:
Add a check for `guard.stack`, which was causing exceptions like:
```
torch._dynamo.exc.InternalTorchDynamoError: 'NoneType' object has no attribute 'format'
```
Test Plan: contbuild & OSS CI
Differential Revision: D48709458
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108012
Approved by: https://github.com/anijain2305
We want cond to always throw errors regardless of the user's torch.compile mode.
The current implementation is to
1. catch UserError.GRAPH_BREAK_IN_CONTROL_FLOW and, once it is seen, raise directly: first in [break_graph_if_unsupported](bad3f2db40/torch/_dynamo/symbolic_convert.py (L1250)), which catches and re-raises for call_function (the entry point of higher-order operators) and a few others.
2. The raised exception is caught and raised again in [step](bad3f2db40/torch/_dynamo/symbolic_convert.py (L691)), where all instructions' exceptions are handled.
3. At the top level, we treat it as a hard error and do not suppress it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108027
Approved by: https://github.com/zou3519
ghstack dependencies: #108025, #108026
Fixes https://github.com/pytorch/pytorch/pull/102577#issuecomment-1650905536
Serializing to JSON is more stable. Also renamed the API:
```
# Takes in a treespec and returns the serialized treespec as a string. Also optionally takes in a protocol version number.
def treespec_dumps(treespec: TreeSpec, protocol: Optional[int] = None) -> str:
# Takes in a serialized treespec and outputs a TreeSpec
def treespec_loads(data: str) -> TreeSpec:
```
If users want to register their own serialization format for a given pytree, they can go through the `_register_treespec_serializer` API which optionally takes in a `getstate` and `setstate` function.
```
_register_treespec_serializer(type_, *, getstate, setstate)
# Takes in the context, and outputs a json-dumpable context
def getstate(context: Context) -> DumpableContext:
# Takes in a json-dumpable context, and reconstructs the original context
def setstate(dumpable_context: DumpableContext) -> Context:
```
We will serialize to the following dataclass, and then json.dump it to a string.
```
class TreeSpec:
    type: Optional[str]  # a string name of the type; null for the case of a LeafSpec
    context: Optional[Any]  # optional, a JSON-dumpable format of the context
    children_specs: List[TreeSpec]
```
If no getstate/setstate function is registered, we will by default serialize the context using `json.dumps/loads`. We will also serialize the type through `f"{typ.__module__}.{typ.__name__}"`.
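A short usage sketch of the renamed API (assuming it lives in `torch.utils._pytree`; not copied verbatim from this PR):
```python
import torch.utils._pytree as pytree

_, spec = pytree.tree_flatten({"a": [1, 2], "b": (3,)})

s = pytree.treespec_dumps(spec)    # serialize the TreeSpec to a JSON string
spec2 = pytree.treespec_loads(s)   # reconstruct the TreeSpec from the string
assert spec == spec2
```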
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106116
Approved by: https://github.com/zou3519
This reworks the DORT backend factory function to support the options kwarg of torch.compile, and defines a concrete OrtBackendOptions type that can be used to influence the backend.
Caching is also implemented in order to reuse backends with equal options.
Wrapping the backend in auto_autograd also becomes an option, which allows `OrtBackend` to always be returned as the callable for torch.compile; wrapping happens internally if opted into (True by default).
Lastly, expose options for configuring preferred execution providers (attempted first), whether or not to attempt to infer an ORT EP from a torch device found in the graph or inputs, and finally the default/fallback EPs.
### Demo
The following demo runs `Gelu` through `torch.compile(backend="onnxrt")` using various backend options through a dictionary form and a strongly typed form. It additionally exports the model through both the ONNX TorchScript exporter and the new TorchDynamo exporter.
```python
import math
import onnx.inliner
import onnxruntime
import torch
import torch.onnx
torch.manual_seed(0)
class Gelu(torch.nn.Module):
def forward(self, x):
return x * (0.5 * torch.erf(math.sqrt(0.5) * x) + 1.0)
@torch.compile(
backend="onnxrt",
options={
"preferred_execution_providers": [
"NotARealEP",
"CPUExecutionProvider",
],
"export_options": torch.onnx.ExportOptions(dynamic_shapes=True),
},
)
def dort_gelu(x):
return Gelu()(x)
ort_session_options = onnxruntime.SessionOptions()
ort_session_options.log_severity_level = 0
dort_gelu2 = torch.compile(
Gelu(),
backend="onnxrt",
options=torch.onnx._OrtBackendOptions(
preferred_execution_providers=[
"NotARealEP",
"CPUExecutionProvider",
],
export_options=torch.onnx.ExportOptions(dynamic_shapes=True),
ort_session_options=ort_session_options,
),
)
x = torch.randn(10)
torch.onnx.export(Gelu(), (x,), "gelu_ts.onnx")
export_output = torch.onnx.dynamo_export(Gelu(), x)
export_output.save("gelu_dynamo.onnx")
inlined_model = onnx.inliner.inline_local_functions(export_output.model_proto)
onnx.save_model(inlined_model, "gelu_dynamo_inlined.onnx")
print("Torch Eager:")
print(Gelu()(x))
print("DORT:")
print(dort_gelu(x))
print(dort_gelu2(x))
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107973
Approved by: https://github.com/BowenBao
Instead of hardcoding a new callback creation using 'convert_frame',
add an attribute to both callbacks that implement 'self cloning with new
backend', so DDPOptimizer can invoke this in a consistent way.
Fixes #107686
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107834
Approved by: https://github.com/ezyang
Previously, when we found some input or output mismatch between the original args / traced result and the graph-captured inputs / outputs, we would produce a pretty sparse error message. (This might be partly due to the urge to reuse the same code for matching both inputs and outputs.)
With this PR we now point out which input or output is problematic, what its type is, and also present the expected types along with descriptions of what they mean. We don't suggest any fixes, but the idea is that it should be evident what went wrong looking at the error message.
Differential Revision: [D48668059](https://our.internmc.facebook.com/intern/diff/D48668059/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107907
Approved by: https://github.com/gmagogsfm
Moves the logic for casting state to match parameters into a hook so that users can choose to run their hooks before or after the casting has happened.
With this, there is a little redundancy in the id_map building and in the check that the param groups are still aligned in length.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106725
Approved by: https://github.com/albanD
This PR stops `SymNode` from mutating (i.e. simplifying) its expression. Instead, the
simplification (without mutation) is deferred to the `SymNode.maybe_as_int` method.
```python
- FakeTensor(size=(s0,), ...)
- FakeTensor(size=(s1, s2, s3), ...)
- Eq(s0, s1 + s2 + s3)
- FakeTensor(size=(s0,), ...)
- FakeTensor(size=(s1, s2, s3), ...)
```
In summary, this PR:
- Replaces `SymNode._expr` by `SymNode.expr`, removing the old property function
- This makes it so `SymNode` instances never update their expression
- Creates the `SymNode.simplified_expr()` method for actually calling `ShapeEnv.replace` on
  its expression. Note that this doesn't update `SymNode.expr`
- Changes how `tensor.size()` gets converted to its Python `torch.Size` type
- Instead of calling `SymInt::maybe_as_int()` method, we create a new
`SymInt::is_symbolic()` method for checking whether it is actually a symbolic value
  - This is needed so that when we call `tensor.size()` on the Python side, the returned
    sequence is faithful to the actual data, instead of possibly simplifying it and
    returning an integer
- 2 files need this modification:
- _torch/csrc/Size.cpp_: for handling `torch.Tensor.size` Python calls
- _torch/csrc/utils/pybind.cpp_: for handling `symint.cast()` C++ calls
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107492
Approved by: https://github.com/ezyang
ghstack dependencies: #107523
Dynamo currently runs the real graph module with real inputs as a way to match the return result of the graph module with the eager return type. This is unsafe when the graph module is side-effectful. In the long term, we will get rid of this step, but in the short term, we just fakify the graph module again and run it.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107271
Approved by: https://github.com/ezyang
This PR makes CacheEntry a PyObject. This is a prep PR for cache-size changes. Since CacheEntry is a Python object, we can now traverse the linked list in Python and write cache-size policies. It was possible to do in C, but Python is just easier to iterate on. We call convert_frame only when we (re)compile, so a small bump in latency going from C to Python is acceptable here.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107405
Approved by: https://github.com/ezyang
ghstack dependencies: #106917, #107117
If map or an autograd.Function takes a function input that returns a non-Tensor,
the code just errors out. Instead of erroring out, we should graph
break by raising Unsupported so users aren't confused. The better thing
to do is actually support non-Tensor returns, but that requires more
work.
Test Plan:
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107461
Approved by: https://github.com/ydwu4
ghstack dependencies: #107459
Sometimes the Unsupported error messages can be pretty opaque (see
https://github.com/pytorch/pytorch/issues/106390 for an example). This
PR ensures the error message says something sane by raising a new
Unsupported exception (that includes the older one in the stack trace)
with a description of what's going on.
Test Plan:
- new test utility to check that a dictionary matches a regex so we
don't need to write out this super long error message every time.
- new tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107459
Approved by: https://github.com/ydwu4, https://github.com/kshitij12345
Instead of (poorly) reconstructing the guard list from the guards on OutputGraph, we log them at the horse's mouth: when we actually codegen the guard. This only requires very modest refactoring: as we translate guards into code parts, we also have to pass the source guard along so we can use it to give stack information.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107532
Approved by: https://github.com/anijain2305
ghstack dependencies: #107505, #107516, #107530
All log messages that occur while running Dynamo compilation now have `[X/Y]` added to the beginning of their message. X represents the frame being compiled, while Y says which compilation of the frame. For example, if you are debugging a frame that is repeatedly recompiling, you can look for N/0, N/1, N/2, etc. for the same N. Here is what the logs look like as you transition from one frame to another:
<img width="1372" alt="image" src="https://github.com/pytorch/pytorch/assets/13564/4897e368-1e50-4807-b342-54e911bcf087">
To accurately get this prefix added to all messages, I had to expand the scope of the `tracing` context manager. Its scope now coincides with `log_compilation_event`. To do this, I had to populate fake mode lazily in the TracingContext, since it isn't created until later, inside the OutputGraph.
This subsumes the previous X.Y logging that was solely for dynamic shapes.
Unfortunately I had to reindent some stuff. Review the diff with whitespace off.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107530
Approved by: https://github.com/anijain2305
ghstack dependencies: #107505, #107516
The new guard printout looks like this:
```
[DEBUG] GUARDS:
[DEBUG] ___check_type_id(L['name'], 7605632) # if name == "special_attr": # test/dynamo/test_misc.py:1155 in __getattribute__
[DEBUG] L['name'] == '_backward_pre_hooks' # if name == "special_attr": # test/dynamo/test_misc.py:1155 in __getattribute__
[DEBUG] ___check_obj_id(L['self'], 139746432564960) # return super().__getattribute__(name) # test/dynamo/test_misc.py:1157 in __getattribute__
[DEBUG] ___check_obj_id(L['__class__'], 1451499216) # return super().__getattribute__(name) # test/dynamo/test_misc.py:1157 in __getattribute__
[DEBUG] ___is_grad_enabled() # _dynamo/output_graph.py:346 in init_ambient_guards
[DEBUG] not ___are_deterministic_algorithms_enabled() # _dynamo/output_graph.py:342 in init_ambient_guards
[DEBUG] ___is_torch_function_enabled() # _dynamo/output_graph.py:350 in init_ambient_guards
[DEBUG] utils_device.CURRENT_DEVICE == None # _dynamo/output_graph.py:348 in init_ambient_guards
```
Along with the guards, we also print what line of user code caused the guard to be added, or what line of Dynamo internal code added the guard (if there is no user stack trace, which is typically the case for ambient guards).
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107505
Approved by: https://github.com/mlazos, https://github.com/voznesenskym, https://github.com/anijain2305
This PR allows dynamo to fakify FunctionalTensorWrapper by unwrapping, replacing, and wrapping again, so that a FunctionalTensorWrapper can be passed in as input to dynamo.optimize and we can support something like this:
```python
ff = torch.func.functionalize(f)
torch.compile(ff)(x)
```
This PR doesn't follow the \_\_tensor_flatten\_\_ and \_\_tensor_unflatten\_\_ protocol for now because we're not sure about the plan for doing that for FunctionalTensorWrapper (it's implemented in C++).
**Test Plan:**
Add a new test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107062
Approved by: https://github.com/zou3519
ghstack dependencies: #107042
```python
def wrapper_fn(x):
with torch.autograd.graph.disable_saved_tensors_hooks("ERROR"):
y = x + 1
print("HI")
return y + 2
x = torch.randn(())
a = wrapper_fn(x)
opt = torch.compile(wrapper_fn, backend='eager', fullgraph=False)
e = opt(x)
```
Without the fix, this fails with:
```
Traceback (most recent call last):
File "/home/kshiteej/Pytorch/pytorch_functorch/test/test_trace_grad.py", line 182, in <module>
e = opt(x)
File "/home/kshiteej/Pytorch/pytorch_functorch/torch/_dynamo/eval_frame.py", line 333, in _fn
return fn(*args, **kwargs)
File "/home/kshiteej/Pytorch/pytorch_functorch/test/test_trace_grad.py", line 165, in wrapper_fn
def wrapper_fn(x):
AttributeError: module 'torch.autograd.graph' has no attribute 'disable_saved_tensors_hook'
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106875
Approved by: https://github.com/zou3519
It looks like this:
```
[DEBUG] GUARD: ___check_type_id(L['z'][L["MyEnum"].BAR], 7640416) and L['z'][L["MyEnum"].BAR] == 10
[DEBUG] Stack:
[DEBUG] File "/data/users/ezyang/b/pytorch/test/dynamo/test_misc.py", line 6657, in <module>
[DEBUG] run_tests()
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/test_case.py", line 38, in run_tests
[DEBUG] run_tests()
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/testing/_internal/common_utils.py", line 985, in run_tests
[DEBUG] unittest.main(argv=argv)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/main.py", line 101, in __init__
[DEBUG] self.runTests()
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/main.py", line 271, in runTests
[DEBUG] self.result = testRunner.run(self.test)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/runner.py", line 184, in run
[DEBUG] test(result)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/suite.py", line 84, in __call__
[DEBUG] return self.run(*args, **kwds)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/suite.py", line 122, in run
[DEBUG] test(result)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/suite.py", line 84, in __call__
[DEBUG] return self.run(*args, **kwds)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/suite.py", line 122, in run
[DEBUG] test(result)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/case.py", line 650, in __call__
[DEBUG] return self.run(*args, **kwds)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/testing/_internal/common_utils.py", line 2521, in run
[DEBUG] self._run_with_retry(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/testing/_internal/common_utils.py", line 2450, in _run_with_retry
[DEBUG] super_run(result=result)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/case.py", line 591, in run
[DEBUG] self._callTestMethod(testMethod)
[DEBUG] File "/home/ezyang/local/b/pytorch-env/lib/python3.10/unittest/case.py", line 549, in _callTestMethod
[DEBUG] method()
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/testing/_internal/common_utils.py", line 2377, in wrapper
[DEBUG] method(*args, **kwargs)
[DEBUG] File "/data/users/ezyang/b/pytorch/test/dynamo/test_misc.py", line 2529, in test_enum_as_dict_key_with_overloaded_str
[DEBUG] res = opt_fn(x)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/eval_frame.py", line 333, in _fn
[DEBUG] return fn(*args, **kwargs)
[DEBUG] File "/data/users/ezyang/b/pytorch/test/dynamo/test_misc.py", line 2519, in fn
[DEBUG] torch._dynamo.graph_break()
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/eval_frame.py", line 493, in catch_errors
[DEBUG] return callback(frame, cache_size, hooks, frame_state)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 637, in _convert_frame
[DEBUG] result = inner_convert(frame, cache_size, hooks, frame_state)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 133, in _fn
[DEBUG] return fn(*args, **kwargs)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 371, in _convert_frame_assert
[DEBUG] return _compile(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 567, in _compile
[DEBUG] guarded_code = compile_inner(code, one_graph, hooks, transform)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/utils.py", line 181, in time_wrapper
[DEBUG] r = func(*args, **kwargs)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 466, in compile_inner
[DEBUG] out_code = transform_code_object(code, transform)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/bytecode_transformation.py", line 1028, in transform_code_object
[DEBUG] transformations(instructions, code_options)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/convert_frame.py", line 416, in transform
[DEBUG] tracer = InstructionTranslator(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/symbolic_convert.py", line 2018, in __init__
[DEBUG] self.symbolic_locals = collections.OrderedDict(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/symbolic_convert.py", line 2021, in <genexpr>
[DEBUG] VariableBuilder(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 211, in __call__
[DEBUG] vt = self._wrap(value).clone(**self.options())
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 404, in _wrap
[DEBUG] result = {
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 405, in <dictcomp>
[DEBUG] k: VariableBuilder(
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 211, in __call__
[DEBUG] vt = self._wrap(value).clone(**self.options())
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 354, in _wrap
[DEBUG] return type_dispatch(self, value)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 837, in wrap_literal
[DEBUG] return self.wrap_unspecialized_primitive(value)
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 1073, in wrap_unspecialized_primitive
[DEBUG] guards=self.make_guards(GuardBuilder.CONSTANT_MATCH),
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 269, in make_guards
[DEBUG] return {source.make_guard(guard) for guard in guards}
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_dynamo/variables/builder.py", line 269, in <setcomp>
[DEBUG] return {source.make_guard(guard) for guard in guards}
[DEBUG] File "/data/users/ezyang/b/pytorch/torch/_guards.py", line 641, in make_guard
[DEBUG] return Guard(self.name(), self.guard_sou
```
One downside is I can't report *why* the guard was added. I'm not entirely sure how to do this; the problem is guards will propagate to a bunch of variables before finally getting included as part of the final set. Maybe a very very verbose version could report stack traces at every handoff point.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107388
Approved by: https://github.com/mlazos
ghstack dependencies: #107438, #107358
This adds some utilities for conveniently working with fast combined CapturedTraceback from Python. The main goal of these utilities is to make it easier for people to use CapturedTraceback as a drop-in replacement for `traceback.extract_stack`, which is 20x slower than CapturedTraceback.
I port symbolic shapes to use the new CapturedTraceback code, to validate that the APIs work and are useful.
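A rough usage sketch of the drop-in pattern described above (the `torch.utils._traceback` import path and method names are assumptions about the API, not copied from this PR):
```python
import traceback
from torch.utils._traceback import CapturedTraceback

def where_was_i_called():
    # Fast capture: grabs the raw frames now, defers symbolization.
    return CapturedTraceback.extract()

tb = where_was_i_called()
# Symbolize lazily into frame summaries, like traceback.extract_stack() would have produced eagerly.
print("".join(traceback.format_list(tb.summary())))
```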
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107358
Approved by: https://github.com/zdevito, https://github.com/albanD
ghstack dependencies: #107438
Adds an API to mark a tensor as a static input.
To make this trigger recompiles properly, I'll need to update the tensor match checks to also check for this new attribute.
An additional concern is memory - the tensors will be kept alive, but this is the current behavior for nn modules and parameters.
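A hedged usage sketch (assuming the entry point is `torch._dynamo.mark_static_address`; illustrative, not from this PR):
```python
import torch
import torch._dynamo

buf = torch.randn(8)
torch._dynamo.mark_static_address(buf)  # promise that this tensor's storage address is stable

@torch.compile(backend="eager")
def f(x):
    return x * 2

f(buf)
```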
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107154
Approved by: https://github.com/eellison
Some notable changes:
1. `constrain_as_size` allows the min value to be less than 2, as it will unconditionally assume min >= 2 for compiler purposes. Instead, we add an additional check to make sure the max value is always greater than 2.
2. Previously, we would runtime-assert on the unbacked symint's value range, which was always [2, max]. I modified this logic to assert on [0, max] unless the user explicitly specifies the min range.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106591
Approved by: https://github.com/gmagogsfm, https://github.com/ezyang
This PR wraps `InstructionTranslator` run with a try-catch block so as to run the
translation validation (TV) if it ends up raising an error.
In this context, we run TV so as to catch simplification errors. These may make
`ShapeEnv.divisible` and `ShapeEnv.replacements` incorrect.
For example: #101173 describes a SymPy simplification bug that doesn't reach TV, since
TV is normally run only at the end of tracing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106645
Approved by: https://github.com/ezyang
RFC: https://github.com/pytorch/rfcs/pull/54
First commit is the contents of https://github.com/Quansight-Labs/numpy_pytorch_interop/
We have already been using this in core for the last few months as an external dependency. This PR pulls all of it into core.
In the next commits, I do a number of things in this order
- Fix a few small issues
- Make the tests that this PR adds pass
- Bend backwards until lintrunner passes
- Remove the optional dependency on `torch_np` and simply rely on the upstreamed code
- Fix a number of dynamo tests that were passing before (they were not testing anything, I think) and are not passing now.
Missing from this PR (but not blocking):
- Have a flag that deactivates tracing NumPy functions and simply breaks. There used to be one, but it stopped working after the merge and I removed it. @lezcano to investigate.
- https://github.com/pytorch/pytorch/pull/106431#issuecomment-1667079543. @voznesenskym to submit a fix after we merge.
All the tests in `tests/torch_np` take about 75s to run.
This was work by @ev-br, @rgommers, @honno, and me. I did not create this PR via ghstack (which would have been convenient) as this is a collaboration, and ghstack doesn't allow for shared contributions.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106211
Approved by: https://github.com/ezyang
This PR adds a **same_signature** flag to dynamo.export.
**Motivation:**
In https://github.com/pytorch/pytorch/pull/105679, we experimented with **using dynamo to inspect the UDFs** for cond in eager mode (without torch.compile). This helps us normalize the inputs (e.g. lifting closures to inputs) and makes higher-order operators more robust (e.g. forbidding Python side effects) and less error-prone in general.
We decided to use dynamo.export (instead of torch.compile) to do the inspection (pointed out by @voznesenskym @zou3519):
- We'd like a **whole-graph capture** for the UDF.
- We'd like the dynamo inspection to be **stateless**. Using torch.compile would require resetting the dynamo context before and after the inspection because the compile flags may differ from the user's torch.compile, and this would clear all dynamo caches.
- We can still implement some **caching** based on the guards.
However, this requires export to be able to handle the case where it cannot rewrite the signature, e.g. a closure lifted as an input.
This PR makes the rewrite optional.
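A hedged sketch of how the flag might be used (the exact call convention of dynamo.export has changed over time; this follows the `export(f, *args)` form and is illustrative only):
```python
import torch
import torch._dynamo as dynamo

def make_f():
    y = torch.randn(3)

    def f(x):
        return x + y  # `y` is a closure variable that export would lift to an input

    return f

f = make_f()
# With same_signature=False, export may return a graph whose signature
# differs from f's (e.g. with the lifted closure as an extra input).
gm, guards = dynamo.export(f, torch.randn(3), same_signature=False)
```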
**Implementation:**
We put all the code related to signature rewriting into a function called rewrite_signature and use a same_signature flag to optionally apply the transformation.
**Test Plan:**
existing tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106569
Approved by: https://github.com/ezyang
This PR adds a new configuration that enables shapes of torch.nn.Parameter to be treated as dynamic, in order to avoid extensive recompilation when Parameters are used instead of Tensors.
This feature addresses part of issue #105279
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105855
Approved by: https://github.com/ezyang
Fixes #106555
There was a bug where the multithreading check would fire because of the
`compiled_autograd.disable()` calls in AotAutograd, even though compiled
autograd was already disabled, so that call was doing nothing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106621
Approved by: https://github.com/yanboliang
D47969512 was the original diff to revert this, but the diff train doesn't work well, so I have to split it into two parts: this OSS PR and another separate diff to revert the fbcode change.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106562
Approved by: https://github.com/angelayi
When inlining a function which loads a closure, its direct parent may not load that closure, so we cannot find the closure name in the parent's symbolic locals. In this PR, we fix it by recursively searching the parent instruction translator stack to resolve the closure.
**Background**
When developing https://github.com/pytorch/pytorch/pull/105679, this corner case was triggered. A small repro is added in the test of this PR, where `outer` is loaded by `deep2` but not by `deep`.
```python
def test_inline_closure_not_loaded_by_parent(self):
    def outer(a):
        return a + 1

    def indirect(x):
        return direct(x)

    def direct(x):
        def deep2(c):
            return outer(c)

        def deep(c):
            return deep2(c)

        return deep(x)

    x = torch.randn(3)
    eager = indirect(x)
    counter = CompileCounter()
    compiled = torch._dynamo.optimize(counter)(indirect)(x)
```
Running the test, we have the following error before the PR:
```
Traceback (most recent call last):
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6584, in test_inline_closure_not_loaded_by_parent
compiled = torch._dynamo.optimize(counter)(indirect)(x)
File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 321, in _fn
return fn(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/eval_frame.py", line 481, in catch_errors
return callback(frame, cache_size, hooks, frame_state)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 543, in _convert_frame
result = inner_convert(frame, cache_size, hooks, frame_state)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 130, in _fn
return fn(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 362, in _convert_frame_assert
return _compile(
File "/home/yidi/local/pytorch/torch/_dynamo/utils.py", line 194, in time_wrapper
r = func(*args, **kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 531, in _compile
raise InternalTorchDynamoError(str(e)).with_traceback(e.__traceback__) from None
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 432, in _compile
out_code = transform_code_object(code, transform)
File "/home/yidi/local/pytorch/torch/_dynamo/bytecode_transformation.py", line 1028, in transform_code_object
transformations(instructions, code_options)
File "/home/yidi/local/pytorch/torch/_dynamo/convert_frame.py", line 417, in transform
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2067, in run
super().run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 261, in call_function
return super().call_function(tx, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2279, in inline_call_
tracer.run()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 724, in run
and self.step()
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 688, in step
getattr(self, inst.opname)(inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 392, in wrapper
return inner_fn(self, inst)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 1116, in CALL_FUNCTION
self.call_function(fn, args, {})
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function
self.push(fn.call_function(self, args, kwargs))
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 90, in call_function
return tx.inline_user_function_return(
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 598, in inline_user_function_return
result = InliningInstructionTranslator.inline_call(self, fn, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2172, in inline_call
return cls.inline_call_(parent, func, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/symbolic_convert.py", line 2227, in inline_call_
sub_locals, closure_cells = func.bind_args(parent, args, kwargs)
File "/home/yidi/local/pytorch/torch/_dynamo/variables/functions.py", line 471, in bind_args
result[name] = parent.symbolic_locals[name]
torch._dynamo.exc.InternalTorchDynamoError: outer
from user code:
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6570, in indirect
return direct(x)
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6579, in direct
return deep(x)
File "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6577, in deep
return deep2(c)
Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information
You can suppress this exception and fall back to eager by setting:
import torch._dynamo
torch._dynamo.config.suppress_errors = True
To execute this test, run the following from the base repo dir:
python test/dynamo/test_misc.py -k test_inline_closure_not_loaded_by_parent
This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
---------------------------------------------------------------------------------------------------------------------------- Captured stdout call -----------------------------------------------------------------------------------------------------------------------------
frames [('total', 1)]
inline_call []
---------------------------------------------------------------------------------------------------------------------------- Captured stderr call -----------------------------------------------------------------------------------------------------------------------------
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping helper /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __init__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping __enter__ /home/yidi/local/miniconda3/envs/pytorch-3.10/lib/python3.10/contextlib.py
[2023-08-02 15:48:36,560] torch._dynamo.eval_frame: [DEBUG] skipping enable_dynamic /home/yidi/local/pytorch/torch/_dynamo/eval_frame.py
[2023-08-02 15:48:36,561] torch._dynamo.symbolic_convert: [INFO] Step 1: torchdynamo start tracing indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569
TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6569
def indirect(x):
[2023-08-02 15:48:36,591] torch._dynamo.variables.builder: [DEBUG] wrap_to_fake L['x'] (3,) [<DimDynamic.STATIC: 2>] [None]
TRACE starts_line indirect /home/yidi/local/pytorch/test/dynamo/test_misc.py:6570
return direct(x)
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF direct []
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [UserFunctionVariable()]
[2023-08-02 15:48:36,594] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572>
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6572 (inline depth: 1)
def direct(x):
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6573 (inline depth: 1)
def deep2(c):
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE outer []
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [InlinedClosureVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep2 at 0x7fbe4d3666b0, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6573> [TupleVariable()]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep2 [TupleVariable(), ConstantVariable(code)]
[2023-08-02 15:48:36,595] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF deep2 [NestedUserFunctionVariable()]
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 1)
def deep(c):
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE deep2 []
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576> [TupleVariable()]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST MiscTests.test_inline_closure_not_loaded_by_parent.<locals>.direct.<locals>.deep [TupleVariable(), ConstantVariable(code)]
[2023-08-02 15:48:36,597] torch._dynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST deep [NestedUserFunctionVariable()]
TRACE starts_line direct /home/yidi/local/pytorch/test/dynamo/test_misc.py:6579 (inline depth: 1)
return deep(x)
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST deep []
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [NestedUserFunctionVariable()]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,598] torch._dynamo.symbolic_convert: [DEBUG] INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576>
TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6576 (inline depth: 2)
def deep(c):
TRACE starts_line deep /home/yidi/local/pytorch/test/dynamo/test_misc.py:6577 (inline depth: 2)
return deep2(c)
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF deep2 []
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST c [NestedUserFunctionVariable()]
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()]
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object deep at 0x7fbe4d366760, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6576>
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
[2023-08-02 15:48:36,599] torch._dynamo.symbolic_convert: [DEBUG] FAILED INLINING <code object direct at 0x7fbe4d366810, file "/home/yidi/local/pytorch/test/dynamo/test_misc.py", line 6572>
[2023-08-02 15:48:36,599] torch._dynamo.output_graph: [DEBUG] restore_graphstate: removed 0 nodes
```
Test Plan:
add new test
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106491
Approved by: https://github.com/williamwen42, https://github.com/jansel, https://github.com/zou3519
Previously, you would get an error like
```
Dynamo input and output is a strict subset of traced input/output
```
now you get
```
Cannot export model which references tensors that are neither
buffers/parameters/constants nor are direct inputs. For each tensor, if you'd
like this tensor to be an explicit input, add it as a dummy argument
to the top-level model definition you are exporting; if you would
like its value to be embedded as an exported constant, wrap its access
in a function marked with @assume_constant_result.
G['bulbous_bouffant'], accessed at:
File "test_export.py", line N, in f
return bulbous_bouffant + y
```
This doesn't handle outputs; I'm going to hit that next.
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106403
Approved by: https://github.com/tugsbayasgalan
Previously, we assumed that argnums is a **ConstantVariable**. However, I accidentally triggered an error on CI where argnums could be a **TupleVariable**. In that case, we get an attribute error when accessing the .value of argnums.
This PR adds support for TupleVariable. It allows the unit test to pass without falling back to eager:
`PYTORCH_TEST_WITH_DYNAMO=1 python test/functorch/test_eager_transforms.py -k test_argnums_cpu`
Test Plan:
see modified test.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106425
Approved by: https://github.com/yanboliang, https://github.com/anijain2305, https://github.com/kshitij12345
`TensorMeta.from_irnodes` handles either a single `IRNode` or a tuple or list of them. I tried to express this with overloading, but because this file is in MYPYNOFOLLOW, the `IRNode` subclasses become `Any`, which causes the overloads to be overlapping.
This changes the type of the argument to `benchmark_in_sub_process` to the more specific `TritonTemplateCaller`, since that one has the `bmreq` member and existing docstrings indicate that only the triton template benchmark is handled.
The `rand_strided` call caused a mypy error because the default value for device was a string. This is fixed by adding type hints to `rand_strided` in `torch/_dynamo/testing.py`. Likewise, the return value of `PyCodeCache.load_by_key_path` can be inferred from the type hint on `PyCodeCache.cache`.
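A rough sketch of the kind of annotation described for `rand_strided`; the exact signature and body in `torch/_dynamo/testing.py` differ, so treat this as illustrative only:
```python
from typing import Sequence, Union

import torch

def rand_strided(
    size: Sequence[int],
    stride: Sequence[int],
    dtype: torch.dtype = torch.float32,
    device: Union[str, torch.device] = "cpu",
) -> torch.Tensor:
    # Annotating device as Union[str, torch.device] keeps the string default
    # from tripping mypy at call sites; the body here is a simplified stand-in.
    needed = sum((s - 1) * st for s, st in zip(size, stride)) + 1
    buffer = torch.randn(needed, dtype=dtype, device=device)
    return torch.as_strided(buffer, size, stride)
```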
Fixes one part of #105230
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105791
Approved by: https://github.com/jansel, https://github.com/Skylion007
Fix: #105074
This PR makes dynamo handle NumPy global variables the same way as PyTorch tensor global
variables, by tracking them as side effects.
In summary, we add `NumpyNdarrayVariable` to the
`VariableBuilder._can_lift_attrs_to_inputs` function.
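A small sketch of the kind of pattern this targets (the names are made up); the global NumPy array is now tracked the same way a global tensor would be:
```python
import numpy as np
import torch

scale = np.array([1.0, 2.0, 3.0])  # module-level NumPy array

def f(x):
    # The global ndarray is handled via NumpyNdarrayVariable and tracked as a
    # side effect, just like a global torch.Tensor would be.
    return x * scale

compiled_f = torch.compile(f, backend="eager")
print(compiled_f(torch.ones(3)))
```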
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105959
Approved by: https://github.com/ezyang
Summary: moving the quantizer to torch.ao.quantization to make it a public API, since pt2e is a folder for implementations
Test Plan:
CIs
sanity check: "buck test //executorch/backends/xnnpack/test:test_xnnpack_quantized_models -- test_resnet18"
Differential Revision: D47727838
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105885
Approved by: https://github.com/andrewor14
This branch:
1) converts the autograd tape into an FX graph
2) caches that conversion using a "shadow" graph
3) compiles and runs the generated FX graph instead of the normal autograd
What works currently:
1) Caching, capture, and initial integration
2) Backwards hooks
3) Inlining AotAutograd generated subgraphs
4) torch.compiling the generated FX graph
5) Auto-detecting dynamic shapes based on changes
Future work:
1) Larger scale testing
2) Boxed calling convention, so memory can be freed incrementally
3) Support hooks on SavedTensor
4) Additional testing by running eager autograd tests under compiled_autograd.enable() (a usage sketch follows below)
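A rough usage sketch for the last item; the module path and exact API are taken from the description above and may differ in detail:
```python
import torch
from torch._dynamo import compiled_autograd

model = torch.nn.Linear(4, 4)
x = torch.randn(8, 4)

def compiler_fn(gm):
    # gm is the FX graph built from the autograd tape; compile it like any graph.
    return torch.compile(gm, backend="eager", fullgraph=True)

with compiled_autograd.enable(compiler_fn):
    loss = model(x).sum()
    loss.backward()  # the backward pass runs through the generated FX graph
```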
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103822
Approved by: https://github.com/ezyang, https://github.com/albanD
Previously, when an exception was raised during tracing in torch.export(), Dynamo displayed this error:
“You can suppress this exception and fall back to eager by setting: import torch._dynamo torch._dynamo.config.suppress_errors = True”
This suggestion is not viable in torch.export(), so this diff suppresses it during export.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105767
Approved by: https://github.com/anijain2305
Currently, exporting a model to ONNX with fake tensor mode requires the
user to load the data and model within the `torch.onnx.enable_fake_mode` context,
but the actual call to `torch.onnx.dynamo_export` is done outside that
context.
With this PR, we enable `torch.onnx.dynamo_export` to be called either
within `torch.onnx.enable_fake_mode` or outside of it. This feature
required changes to the core PyTorch Dynamo, which were greatly
supported by @ezyang.
In future steps we will determine which scenario we are going to
support, but for now we can use either to explore different options and
scenarios and assess their pros and cons.
This PR also creates a separate suite of tests for fake mode specific
scenarios (`TestFxToOnnxFakeTensorWithOnnxRuntime`).
It was done separately to decrease the test time, but we
could merge it with the default `TestFxToOnnxWithOnnxRuntime`. The
additional parameters are `load_checkpoint_during_init` and
`export_within_fake_mode`.
With the newly added support for nested export within fake mode, the
following scenarios are now supported:
```python
import torch
with torch.onnx.enable_fake_mode() as fake_context:
    fake_args = create_args()
    fake_kwargs = create_kwargs()
    fake_model = create_model()
    fake_model.load_state_dict(torch.load(tmp_checkpoint_file.name))
    export_options = torch.onnx.ExportOptions(fake_context=fake_context)
    # `torch.onnx.dynamo_export` called WITHIN `torch.onnx.enable_fake_mode`
    export_output = torch.onnx.dynamo_export(
        fake_model,
        *fake_args,
        **fake_kwargs,
        export_options=export_options,
    )
export_output.save("/path/to/model.onnx", model_state_dict=create_model())
```
If we decide to only support scenarios in which `torch._dynamo.export` is called within `FakeTensorMode`, then we can remove the `fake_mode` argument from `torch._dynamo.export` as a follow-up task.
ps: This PR is mostly Edward's https://github.com/pytorch/pytorch/pull/105468 + unit tests after an offline discussion
ps: https://github.com/pytorch/pytorch/issues/105464 tracks pending tasks/limitations from this PR
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105477
Approved by: https://github.com/ezyang, https://github.com/BowenBao
Summary:
We are working toward full model compilation, where, when a compilation error happens, we just fall back to eager mode rather than erroring out.
But at the same time, we should fix these issues if they are bugs. To prevent us from ignoring these issues, we will:
* 1/ log warnings in OSS;
* 2/ log warnings and write them into Scuba in fbcode.
Test Plan: Manual test
Differential Revision: D47506314
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105307
Approved by: https://github.com/jansel
Since Python 3.11 bytecode contains end line and column information, we can attribute the source code corresponding to each bytecode more accurately. For example, we can highlight a function call in a series of nested function calls, or highlight a function call spanning multiple lines.
Sample:
```python
import torch
import torch._dynamo
from functorch.experimental.control_flow import cond

def h(x):
    return x * 5

def true_fn(x):
    return x * 2

def false_fn(x):
    return x * 3

def f(pred, x):
    x = h(
        h(h(x))
    )
    x = x[1:][:2]
    torch._dynamo.graph_break()
    x = cond(pred, true_fn, false_fn, [x])

opt_f = torch.compile(f, backend="eager")
opt_f(torch.tensor(True), torch.randn(3, 3, 3, 3))
```
Output:
```
$ TORCH_LOGS="trace_call" python playground9.py
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16
h(h(x))
~^^^
TRACE FX call mul from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:16
h(h(x))
~^^^^^^
TRACE FX call mul_1 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE inlined call h from f /scratch/williamwen/work/pytorch/playground9.py:15
x = h(
~^
h(h(x))
^^^^^^^
)
^
TRACE FX call mul_2 from h /scratch/williamwen/work/pytorch/playground9.py:6 (inline depth: 1)
return x * 5
~~^~~
TRACE FX call getitem from f /scratch/williamwen/work/pytorch/playground9.py:18
x = x[1:][:2]
~^^^^
TRACE FX call getitem_1 from f /scratch/williamwen/work/pytorch/playground9.py:18
x = x[1:][:2]
~~~~~^^^^
TRACE inlined call true_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TRACE FX call mul from true_fn /scratch/williamwen/work/pytorch/playground9.py:9 (inline depth: 1)
return x * 2
~~^~~
TRACE inlined call false_fn from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TRACE FX call mul from false_fn /scratch/williamwen/work/pytorch/playground9.py:12 (inline depth: 1)
return x * 3
~~^~~
TRACE FX call cond from <resume in f> /scratch/williamwen/work/pytorch/playground9.py:20
x = cond(pred, true_fn, false_fn, [x])
~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104676
Approved by: https://github.com/ezyang
This PR adds initial dynamo support for DTensor, in particular, it:
- allows DTensor to be passed into a compiled function, and allows fakifying
DTensor during dynamo tracing by turning the inner local tensor into a meta
tensor.
- We use `allow_in_graph` to include `DTensor` and `DTensor.from_local` to be represented as `TorchVariable`
- The dtensor created becomes a normal `TensorVariable` and it would insert any tensor operations to the output graph just like torch.Tensor
- note that DTensor has a new instance method `redistribute` compared to a plain tensor, and we currently special-case it in `TensorVariable`
`from_local` and `redistribute` both accept some non-trivial metadata as arguments (i.e. DeviceMesh, Placement) which fx.Graph does not support. In order to let these two APIs appear in the dynamo-captured graph, we encode the metadata into a new function (like `functools.partial`), and the new function only accepts prim args (i.e. tensors); we then insert a `call_function` node with this new function into the graph. This was suggested by @ezyang. The underlying rationale is that the metadata will not change across graph invocations, so it's safe to encode it.
Captured graph:
```
def forward(self, L_x_ : torch.Tensor):
    l_x_ = L_x_
    # File: /scratch/wanchaol/work/pytorch/test/distributed/_tensor/test_dtensor.py:685, code: dt = DTensor.from_local(x, mesh, [Shard(0)], run_check=False)
    prim_from_local = torch__dynamo_variables_torch_prim_from_local(l_x_, run_check = False); l_x_ = None
    # File: /scratch/wanchaol/work/pytorch/test/distributed/_tensor/test_dtensor.py:686, code: return dt.redistribute(mesh, [Replicate()]).to_local() + 2
    prim_redistribute = torch__dynamo_variables_tensor_prim_redistribute(prim_from_local); prim_from_local = None
    to_local = prim_redistribute.to_local(); prim_redistribute = None
    add = to_local + 2; to_local = None
    return (add,)
```
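For reference, a sketch of the eager code this graph was captured from, reconstructed from the `File:` comments above; the mesh construction and world-size handling here are assumptions, and a process group must already be initialized:
```python
import torch
import torch.distributed as dist
from torch.distributed._tensor import DTensor, DeviceMesh, Replicate, Shard

# assumes torch.distributed has been initialized elsewhere
mesh = DeviceMesh("cuda", list(range(dist.get_world_size())))

def fn(x):
    dt = DTensor.from_local(x, mesh, [Shard(0)], run_check=False)
    return dt.redistribute(mesh, [Replicate()]).to_local() + 2

compiled_fn = torch.compile(fn, backend="eager")
```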
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103146
Approved by: https://github.com/voznesenskym
Fixes: #105143
In summary, the changes are:
- Checking whether Z3 is installed when the module is loaded
- Naming things consistently as "translation validation" (not "validator")
- Skipping tests if Z3 is not installed
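A minimal sketch of the import-time check being described; the actual guard in the translation validation code may look different:
```python
try:
    import z3  # noqa: F401
    _HAS_Z3 = True
except ImportError:
    _HAS_Z3 = False

# Tests can then be skipped when Z3 is unavailable, e.g.:
#     @unittest.skipIf(not _HAS_Z3, "translation validation requires Z3")
```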
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105168
Approved by: https://github.com/ezyang
This PR adds the necessary plumbing through torchdynamo to allow tensor
subclasses with a certain contract (i.e. with `__tensor_flatten__` and
`__tensor_unflatten__`) to go through the dynamo fakification pass by
fakifying the tensor subclass's internal components.
Some of the tensor subclass contract logic is mostly borrowed from
https://github.com/pytorch/pytorch/pull/97540
Added some tests to verify that simply passing a tensor subclass
(e.g. DTensor) through dynamo eager works as expected.
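For context, a minimal sketch of the kind of subclass contract being referred to. The exact signatures have changed over time, so treat this as illustrative rather than the authoritative protocol:
```python
import torch
from torch.utils._pytree import tree_map

class TwoTensor(torch.Tensor):
    """Toy wrapper subclass holding two inner tensors."""

    @staticmethod
    def __new__(cls, a, b):
        return torch.Tensor._make_wrapper_subclass(cls, a.shape, dtype=a.dtype, device=a.device)

    def __init__(self, a, b):
        self.a = a
        self.b = b

    def __tensor_flatten__(self):
        # inner tensor attribute names, plus optional constant metadata
        return ["a", "b"], None

    @staticmethod
    def __tensor_unflatten__(inner_tensors, meta):
        return TwoTensor(inner_tensors["a"], inner_tensors["b"])

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        # Toy dispatch: run the op on each inner tensor and rewrap the results.
        kwargs = kwargs or {}
        unwrap_a = lambda t: t.a if isinstance(t, TwoTensor) else t
        unwrap_b = lambda t: t.b if isinstance(t, TwoTensor) else t
        out_a = func(*tree_map(unwrap_a, args), **tree_map(unwrap_a, kwargs))
        out_b = func(*tree_map(unwrap_b, args), **tree_map(unwrap_b, kwargs))
        return TwoTensor(out_a, out_b)
```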
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105308
Approved by: https://github.com/ezyang
issues resolved: https://github.com/pytorch/pytorch/issues/101832
**context**: get the torch.compile config for further usage. E.g., the training platform wants to know whether a model is compiled with cudagraphs enabled and trigger further action.
**how it is implemented**
* the core logic is backend.get_compiler_config() in torch/_dynamo/eval_frame.py
* for backend='inductor' / _TorchCompileInductorWrapper, we have an inductor-specific implementation of get_compiler_config in torch/_inductor/compile_fx.py and torch/__init__.py
**how to use it**: Below is an example.
```
model = DummyModule()
optimized_module = torch.compile(
    model, options={"triton.cudagraphs": True}
)
compiler_config = optimized_module.get_compiler_config()
if compiler_config["triton.cudagraphs"]:
    pass
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105026
Approved by: https://github.com/yanboliang, https://github.com/jansel
torch.profiler.record_function and torch.profiler.profile are ignored by dynamo. In the common case, users have `record_function` in the middle of their program in order to annotate a section of the profile.
The previous error message was `Profiler will be ignored`, which made users think that profiling would be skipped entirely.
Now the message will look like `Profiler function <class 'torch.autograd.profiler.record_function'> will be ignored`.
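For context, the common annotation pattern referenced above looks roughly like this (the function and label names are made up):
```python
import torch

def run_step(model, x):
    with torch.profiler.record_function("annotated_section"):
        # In eager mode this shows up as a named range in the trace; under
        # torch.compile, dynamo currently ignores the profiler context manager.
        return model(x)
```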
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105362
Approved by: https://github.com/yanboliang, https://github.com/aaronenyeshi
Add semantics for creating a buffer object analogous to creating a parameter. This is done by introducing a new `Buffer` class that can be used for type disambiguation. The underlying functionality of registering a buffer remains the same, as the `register_buffer` method has not been changed. The `persistent` parameter of the `Buffer` type indicates whether the buffer should be persistent or not. Other non-test changes have to do with getting the new `Buffer` type recognized by inductor and dynamo. The remaining changes are test changes to make sure that the `Buffer` type can be used as a drop-in replacement for `register_buffer`, since it just leads to `register_buffer` being called. The addition of this new functionality still allows normal tensors to be used as buffers, so these changes are intended to be backwards compatible.
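A small usage sketch, assuming the new class is exposed as `torch.nn.Buffer` and behaves as a drop-in alternative to `register_buffer`:
```python
import torch
import torch.nn as nn

class RunningNorm(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        # Assigning a Buffer registers it on the module, analogous to nn.Parameter;
        # persistent=False keeps it out of the state_dict.
        self.running_mean = nn.Buffer(torch.zeros(dim))
        self.step = nn.Buffer(torch.zeros(()), persistent=False)
        # Roughly equivalent to:
        #   self.register_buffer("running_mean", torch.zeros(dim))
        #   self.register_buffer("step", torch.zeros(()), persistent=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x - self.running_mean
```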
Fixes #35735
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104069
Approved by: https://github.com/mikaylagawarecki