Attempt number 2 at https://github.com/pytorch/pytorch/issues/108950.
Improves debugging for guard failures/recompilations by:
- only running guard fail reason generation during recompilation, instead of when a guard fails during dynamo cache lookup (so generating guard failure reasons is not on the critical path)
- ~~always reporting all guard failures~~ reporting the first-failing guard failure for each cache entry
We don't expect a performance hit since guard failure reasons are only generated at recompile time rather than at runtime. A perf benchmark to check this is here (https://hud.pytorch.org/benchmark/torchbench/inductor_with_cudagraphs?startTime=Fri,%2027%20Oct%202023%2017:42:43%20GMT&stopTime=Fri,%2003%20Nov%202023%2017:42:43%20GMT&granularity=hour&mode=training&dtype=amp&lBranch=gh/williamwen42/62/head&lCommit=f4724f5ffc6d17ceae513a42fc18627be7b85482&rBranch=main&rCommit=29f3d392bf230072e3bffae37b078e770cae1956). We may also need to verify this on benchmarks where guard failures are common.
Sample script:
```python
import torch
def generate_data(b):
    return (
        torch.randn(b, 3, 32, 32).to(torch.float32).cuda(),
        torch.randint(1000, (b,)).cuda(),
    )

from torchvision.models import resnet18

def init_model():
    return resnet18().to(torch.float32).cuda()

model = init_model()
model_opt = torch.compile(model, dynamic=False)

for b in range(16, 32):
    data = generate_data(b)
    model_opt(data[0])
```
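The recompile reasons shown in the logs below are enabled with the `TORCH_LOGS="recompiles"` environment variable; they can also be turned on programmatically. A minimal sketch, assuming `torch._logging.set_logs` accepts a `recompiles` flag:
```python
import torch._logging

# Assumed equivalent of running with TORCH_LOGS="recompiles":
torch._logging.set_logs(recompiles=True)
```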
Sample logs:
```bash
(/data/users/williamwen/py310-env) [williamwen@devgpu020.odn1 /data/users/williamwen/pytorch (wwen/log-all-guards)]$ python playground5.py
/data/users/williamwen/pytorch/torch/_inductor/compile_fx.py:141: UserWarning: TensorFloat32 tensor cores for float32 matrix multiplication available but not enabled. Consider setting `torch.set_float32_matmul_precision('high')` for better performance.
warnings.warn(
[2023-11-06 14:50:47,605] torch._dynamo.convert_frame: [WARNING] torch._dynamo hit config.cache_size_limit (8)
[2023-11-06 14:50:47,605] torch._dynamo.convert_frame: [WARNING] function: 'forward' (/data/users/williamwen/torchvision/torchvision/models/resnet.py:284)
[2023-11-06 14:50:47,605] torch._dynamo.convert_frame: [WARNING] last reason: tensor 'L['x']' size mismatch at index 0. expected 16, actual 24
[2023-11-06 14:50:47,605] torch._dynamo.convert_frame: [WARNING] To log all recompilation reasons, use TORCH_LOGS="recompiles".
[2023-11-06 14:50:47,605] torch._dynamo.convert_frame: [WARNING] To diagnose recompilation issues, see https://pytorch.org/docs/master/compile/troubleshooting.html.
(/data/users/williamwen/py310-env) [williamwen@devgpu020.odn1 /data/users/williamwen/pytorch (wwen/log-all-guards)]$ TORCH_LOGS="recompiles" python playground5.py
/data/users/williamwen/pytorch/torch/_inductor/compile_fx.py:141: UserWarning: TensorFloat32 tensor cores for float32 matrix multiplication available but not enabled. Consider setting `torch.set_float32_matmul_precision('high')` for better performance.
warnings.warn(
[2023-11-06 14:53:31,591] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:53:31,591] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:53:31,591] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 17
[2023-11-06 14:53:41,333] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:53:41,333] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:53:41,333] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 18
[2023-11-06 14:53:41,333] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 18
[2023-11-06 14:53:50,463] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:53:50,463] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:53:50,463] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 19
[2023-11-06 14:53:50,463] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 19
[2023-11-06 14:53:50,463] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 19
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 19, actual 20
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 20
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 20
[2023-11-06 14:53:59,848] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 20
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 20, actual 21
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 19, actual 21
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 21
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 21
[2023-11-06 14:54:08,549] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 21
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 21, actual 22
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 20, actual 22
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 19, actual 22
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 22
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 22
[2023-11-06 14:54:17,795] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 22
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 22, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 21, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 20, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 19, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 23
[2023-11-06 14:54:27,430] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 23
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function forward in /data/users/williamwen/torchvision/torchvision/models/resnet.py:284
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 23, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 22, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 21, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 20, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 19, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 18, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 17, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 16, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.convert_frame: [WARNING] torch._dynamo hit config.cache_size_limit (8)
[2023-11-06 14:54:36,744] torch._dynamo.convert_frame: [WARNING] function: 'forward' (/data/users/williamwen/torchvision/torchvision/models/resnet.py:284)
[2023-11-06 14:54:36,744] torch._dynamo.convert_frame: [WARNING] last reason: tensor 'L['x']' size mismatch at index 0. expected 16, actual 24
[2023-11-06 14:54:36,744] torch._dynamo.convert_frame: [WARNING] To log all recompilation reasons, use TORCH_LOGS="recompiles".
[2023-11-06 14:54:36,744] torch._dynamo.convert_frame: [WARNING] To diagnose recompilation issues, see https://pytorch.org/docs/master/compile/troubleshooting.html.
[2023-11-06 14:54:45,922] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:54:45,922] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:45,922] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 25
[2023-11-06 14:54:54,691] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:54:54,691] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:54:54,691] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 26
[2023-11-06 14:54:54,691] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 26
[2023-11-06 14:55:03,591] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:55:03,591] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:55:03,591] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 26, actual 27
[2023-11-06 14:55:03,591] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 27
[2023-11-06 14:55:03,591] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 27
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 27, actual 28
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 26, actual 28
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 28
[2023-11-06 14:55:12,384] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 28
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 28, actual 29
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 27, actual 29
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 26, actual 29
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 29
[2023-11-06 14:55:21,442] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 29
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 29, actual 30
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 28, actual 30
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 27, actual 30
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 26, actual 30
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 30
[2023-11-06 14:55:30,315] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 30
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] Recompiling function _forward_impl in /data/users/williamwen/torchvision/torchvision/models/resnet.py:266
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] triggered by the following guard failure(s):
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 30, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 29, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 28, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 27, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 26, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 25, actual 31
[2023-11-06 14:55:39,839] torch._dynamo.guards.__recompiles: [DEBUG] - tensor 'L['x']' size mismatch at index 0. expected 24, actual 31
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/110325
Approved by: https://github.com/ezyang, https://github.com/jon-chuang
**Motivation**: We already have a `CompiledFunction` event that comes from the autograd.Function added by aot_autograd. However, this doesn't appear during inference, or if none of the inputs to a graph require grad. It also doesn't appear if your backend doesn't use aot_autograd.
This adds a profiler event that will always appear.
<img width="615" alt="Screenshot 2023-09-01 at 4 46 28 PM" src="https://github.com/pytorch/pytorch/assets/5067123/fed90ca9-a8e7-458c-80eb-b4160de55218">
Perf: the increase in latency (with the profiler turned off) was within noise when I measured a simple CPU-only torch-compiled function that returned `x.view_as(x)`.
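For illustration, a minimal sketch of how one might check that the event appears even in a pure-inference setting (no `requires_grad`, so aot_autograd's `CompiledFunction` event would not appear); the exact event name isn't reproduced here:
```python
import torch

def fn(x):
    return x.view_as(x)

opt_fn = torch.compile(fn)
x = torch.randn(4, 4)  # no requires_grad, so no CompiledFunction event from aot_autograd
opt_fn(x)  # warm up so compilation isn't captured in the profile

with torch.profiler.profile() as prof:
    opt_fn(x)

# The compiled-region event added by this PR should show up in the trace/table.
print(prof.key_averages().table(sort_by="cpu_time_total"))
```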
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108462
Approved by: https://github.com/anijain2305
**Background**: "TorchDynamo Cache Lookup" events appear in traces to indicate a dynamo cache lookup; they are useful for checking when cache lookups are taking a long time. To add a profiler event, one can use the `torch.profiler.record_function` context manager, or the C++ equivalent. Previously, the Python version was used: when the profiler was enabled, callbacks for record_function_enter and record_function_exit were registered, and those were called before and after every cache lookup.
**This PR**: Instead of calling the Python bindings for `torch.profiler.record_function`, directly call the C++ implementation. This simplifies much of the C/C++ binding code. It also improves performance: previously there was significant overhead in the "TorchDynamo Cache Lookup" event, making it artificially long. After this change the events appear shorter because there is less overhead in starting/stopping the event; in other words, the profiler no longer distorts the results as much.
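For reference, the Python-level equivalent of what the old path wrapped around each cache lookup looks roughly like this (illustrative sketch only; the real hooks were registered from C, and `dynamo_cache_lookup` here is a hypothetical placeholder):
```python
import torch

def dynamo_cache_lookup(frame):
    # Placeholder for the real C-level cache lookup; illustrative only.
    return None

with torch.profiler.profile() as prof:
    # Roughly what the old Python-binding path did around each lookup:
    with torch.profiler.record_function("TorchDynamo Cache Lookup"):
        dynamo_cache_lookup(None)

print(prof.key_averages().table(sort_by="cpu_time_total"))
```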
**Performance results**:
I ran the script below on a CPU-only 1.6 GHz machine. I report the median time (over 100 measurements) of a "TorchDynamo Cache Lookup" event before and after this PR. I think it is reasonable to attribute the difference to a reduction in overhead.
<details>
<summary>Benchmarking script</summary>
```python
import torch

def fn(x, y):
    return (x * y).relu()

a, b = [torch.rand((4, 4), requires_grad=True) for _ in range(2)]
opt_fn = torch.compile(fn)

# warm up / compile before profiling
opt_fn(a, b)
opt_fn(a, b)

with torch.profiler.profile() as prof:
    opt_fn(a, b)
```
</details>
Median before PR: 198-228 us (median of 100 measurements, run 5 times)
Median after PR: 27 us
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108436
Approved by: https://github.com/anijain2305, https://github.com/jansel
In https://github.com/pytorch/pytorch/pull/106673, I created a private API `_debug_get_cache_entry_list` to help pull out cache entries from compiled functions.
Recently, I found that @anijain2305 commented in the code that this API should be revisited, so I created this PR.
First, this API cannot be removed even if the cache entry becomes a first-class Python class, `torch._C._dynamo.eval_frame._CacheEntry`. Because `extra_index` is static and `get_extra_state` is inline static, they are not accessible elsewhere; `_debug_get_cache_entry_list` remains the only way for users to get all the cache entries from a code object.
Second, since `torch._C._dynamo.eval_frame._CacheEntry` is a Python class, I simplified the C code and removed the need to create a namedtuple on the Python side.
Third, as a small improvement, if the argument is a function, we now automatically pass its `__code__` to the API.
This slightly changes the output, from a list of namedtuples to a list of `torch._C._dynamo.eval_frame._CacheEntry` objects. I will update the corresponding docs that use this API.
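For illustration, a rough sketch of how the API might be used after this change (the import path and usage are assumptions based on the description above; this is a private, debug-only API):
```python
import torch
from torch._dynamo.eval_frame import _debug_get_cache_entry_list  # assumed location, per #106673

def fn(x):
    return x + 1

opt_fn = torch.compile(fn, dynamic=False)
opt_fn(torch.randn(4))
opt_fn(torch.randn(8))  # a second shape may create another cache entry

# After this PR the original function can be passed directly; its __code__ is
# extracted automatically, and each entry is a
# torch._C._dynamo.eval_frame._CacheEntry rather than a namedtuple.
for entry in _debug_get_cache_entry_list(fn):
    print(entry)
```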
Pull Request resolved: https://github.com/pytorch/pytorch/pull/108335
Approved by: https://github.com/jansel, https://github.com/anijain2305
This PR makes CacheEntry a PyObject. This is a prep PR for cache size changes. Since CacheEntry is now a Python object, we can traverse the linked list in Python and write cache size policies there. This was possible to do in C, but Python is easier to iterate on. We call convert_frame only when we (re)compile, so a small latency bump from moving this from C to Python is acceptable here.
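As a hypothetical illustration of the kind of Python-side policy this enables (the `next` attribute and the overall shape of the entry object are assumptions for the sketch, not confirmed against the source):
```python
def count_cache_entries(head):
    # Walk the singly linked list of cache entries starting at `head`.
    n = 0
    entry = head
    while entry is not None:
        n += 1
        entry = entry.next  # assumed attribute pointing at the next entry
    return n

def over_limit(head, limit=8):
    # A toy cache-size policy: report whether the list exceeds `limit` entries.
    return count_cache_entries(head) > limit
```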
Pull Request resolved: https://github.com/pytorch/pytorch/pull/107405
Approved by: https://github.com/ezyang
ghstack dependencies: #106917, #107117
Fixes https://github.com/pytorch/pytorch/issues/103210
Test Plan:
Before the fix:
```
pytest test/dynamo/test_export.py -k suppress_errors
```
got result:
```
File "/data/users/zhxchen17/pytorch/torch/nn/modules/module.py", line 1502, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/nn/modules/module.py", line 1511, in _call_impl
return forward_call(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/eval_frame.py", line 295, in _fn
return fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/nn/modules/module.py", line 1502, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/nn/modules/module.py", line 1511, in _call_impl
return forward_call(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/eval_frame.py", line 448, in catch_errors
return callback(frame, cache_size, hooks, frame_state)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/convert_frame.py", line 127, in _fn
return fn(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/convert_frame.py", line 360, in _convert_frame_assert
return _compile(
^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/utils.py", line 180, in time_wrapper
r = func(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/convert_frame.py", line 511, in _compile
exception_handler(e, code, frame)
File "/data/users/zhxchen17/pytorch/torch/_dynamo/convert_frame.py", line 216, in exception_handler
log.error(format_error_msg(e, code, record_filename, frame))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/data/users/zhxchen17/pytorch/torch/_dynamo/exc.py", line 248, in format_error_msg
stack_above_dynamo = filter_stack(extract_stack(frame))
^^^^^^^^^^^^^^^^^^^^
File "/home/zhxchen17/miniconda3/envs/dev/lib/python3.11/traceback.py", line 231, in extract_stack
stack = StackSummary.extract(walk_stack(f), limit=limit)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/zhxchen17/miniconda3/envs/dev/lib/python3.11/traceback.py", line 393, in extract
return klass._extract_from_extended_frame_gen(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/zhxchen17/miniconda3/envs/dev/lib/python3.11/traceback.py", line 416, in _extract_from_extended_frame_gen
for f, (lineno, end_lineno, colno, end_colno) in frame_gen:
File "/home/zhxchen17/miniconda3/envs/dev/lib/python3.11/traceback.py", line 390, in extended_frame_gen
for f, lineno in frame_gen:
File "/home/zhxchen17/miniconda3/envs/dev/lib/python3.11/traceback.py", line 334, in walk_stack
yield f, f.f_lineno
^^^^^^^^^^
AttributeError: 'torch._C.dynamo.eval_frame._PyInterpreterFrame' object has no attribute 'f_lineno'
```
After the fix:
```
pytest test/dynamo/test_export.py -k suppress_errors -s
```
Got Result:
```
File "/data/users/zhxchen17/pytorch/torch/_dynamo/exc.py", line 135, in unimplemented
raise Unsupported(msg)
torch._dynamo.exc.Unsupported: map() operator doesn't support scalar or zero-sized tensors during
tracing.
========== The above exception occurred while processing the following code ==========
File "/data/users/zhxchen17/pytorch/test/dynamo/test_export.py", line 3043, in forward
def forward(self, xs):
File "/data/users/zhxchen17/pytorch/test/dynamo/test_export.py", line 3047, in forward
return map(body, xs)
==========
unimplemented [("map() operator doesn't support scalar or zero-sized tensors during tracing.", 1)]
.
=============================== 1 passed, 133 deselected in 4.60s ================================
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/103227
Approved by: https://github.com/williamwen42
Failing mechanism on #95424:
In dynamo mode, when a numpy.int_ is passed to a 'shape'-like param (Sequence[Union[int, SymInt]]), it is wrapped as a list containing a FakeTensor. However, in python_arg_parser, the parser expects an int in the symint_list but gets a FakeTensor.
Following #85759, this PR allows tensor elements in the symint_list when in dynamo mode; see the repro sketch below.
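A minimal repro sketch of the kind of pattern this addresses (illustrative only; not the exact reproducer from #95424):
```python
import numpy as np
import torch

def fn(x):
    b = np.int_(2)           # numpy integer scalar
    return x.reshape(b, -1)  # flows into a shape-like (symint list) argument

opt_fn = torch.compile(fn)
# Under dynamo, the numpy scalar gets wrapped (e.g. as a FakeTensor) and the
# python_arg_parser previously rejected it when parsing the symint list.
opt_fn(torch.randn(4, 4))
```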
This PR also fixes the tests below, which fail via a similar mechanism:
- `pytest ./generated/test_huggingface_diffusers.py -k test_016`
- `pytest ./generated/test_ustcml_RecStudio.py -k test_036`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/97508
Approved by: https://github.com/yanboliang
Notes:
- No segfaults were observed in any CI tests (dynamo unittests, inductor unittests, dynamo-wrapped pytorch tests), so we remove the warning that using dynamo on Python 3.11 may result in segfaults.
- Some dynamo-wrapped pytorch tests hang. They will be skipped in the dynamo-wrapped test suite and addressed in a future PR.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99180
Approved by: https://github.com/malfet
Dynamo will frequently segfault when attempting to print stack traces. We fix this by:
- Fixing stack size calculations, as we did not account for exception tables
- Creating shadow execution frames in a way that more closely resembles what CPython does to create its execution frames
Dynamo/inductor-wrapped pytorch tests are enabled up the stack - those need to be green before this PR can be merged.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/99934
Approved by: https://github.com/albanD, https://github.com/malfet, https://github.com/jansel
This fixes a few reference counting bugs in eval_frame.c, simplifies a few functions a bit, and adds a few missing error handling code paths. Probably the only important reference counting bug is that `call_callback` previously leaked `THPPyInterpreterFrame` in Python 3.11+.
Summary below:
- `eval_frame_callback_get` shouldn't incref `Py_None`
- Don't leak `THPPyInterpreterFrame` in Python 3.11+
- `set_profiler_hooks` would decref `profiler_start_hook` and `profiler_end_hook` too many times if called with `None` as an argument (but we never actually used that code path)
- Simplify some argument parsing
- Only create `guard_profiler_name_str` once
- Add a few missing error checks
Pull Request resolved: https://github.com/pytorch/pytorch/pull/100496
Approved by: https://github.com/albanD
Planning to do a full writeup later. The short story is, sometimes the following chain of events happens:
1. We turn on Dynamo's custom frame handler
2. GC triggers (and all of the finalizers run under Dynamo)
3. GC hits a GeneratorExit frame
4. You end up in the custom frame handler with throw_flag == TRUE and PyErr_Occurred() != NULL
If this happens and we blindly call into other Python functions (like the Python callback), the executed Python code will immediately raise an exception (because there's already an ambient exception set). This is very, very confusing. The fix is to defer to the regular handler when throw_flag is TRUE.
I triggered this locally with
```
PYTHONUNBUFFERED=1 pytest test/dynamo/test_dynamic_shapes.py -k 'Unspec and export and not dupes and not reorder' -v -x -s
```
But I also have some tests which trigger the problem synthetically.
Fixes https://github.com/pytorch/pytorch/issues/93781
Signed-off-by: Edward Z. Yang <ezyang@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96488
Approved by: https://github.com/albanD
Adds profiler start and end callbacks to dynamo's C eval_frame implementation, which can be used to profile a region and give it a name for visualization. Currently only one usage is hooked up: profiling the cache lookup (primarily covering guard evaluation and the linear search through the linked list).
Example profile taken from toy model:
`python benchmarks/dynamo/distributed.py --toy_model --profile --dynamo aot_eager`
<img width="1342" alt="image" src="https://user-images.githubusercontent.com/4984825/223225931-b2f6c5a7-505a-4c90-9a03-34982f6dc033.png">
Planning to measure overhead in CI; we probably can't afford to check this in enabled by default. Will have to evaluate UX options such as `config.profile_dynamo_cache = True` or some other mechanism.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96119
Approved by: https://github.com/jansel
Previously we would `abort()`, but this is annoying when you're running pytest or something. Don't hard crash.
It would be nice to apply this treatment to the other uses of the CHECK macro in this file, but it was just guards that were bothering me.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91053
Approved by: https://github.com/jansel
Modify the lookup procedure for TorchDynamo caches to keep the head of the singly linked list as the most recently used cache entry, which may improve the probability of a cache hit.
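Conceptually, this is a move-to-front heuristic over the linked list of cache entries; a sketch in Python (not the actual C implementation):
```python
class Entry:
    def __init__(self, check_fn, code, next=None):
        self.check_fn = check_fn  # guard check for this entry
        self.code = code          # compiled code to run if the guards pass
        self.next = next

def lookup(head, frame):
    """Return (new_head, code), moving the matching entry to the front (MRU)."""
    prev, entry = None, head
    while entry is not None:
        if entry.check_fn(frame):
            if prev is not None:      # unlink and splice to the front
                prev.next = entry.next
                entry.next = head
                head = entry
            return head, entry.code
        prev, entry = entry, entry.next
    return head, None  # cache miss
```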
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88076
Approved by: https://github.com/jansel