Previously, when processing `sym_and(a, b, c)`, symbolic shapes wouldn't process a, b, and c individually and store their implications. This led to data-dependent errors on individual checks: e.g., we stored `u0 >= 0 & u0 <= 10` but then couldn't figure out `u0 <= 10` on its own.
This PR handles that, and also makes `sym_and`/`sym_or` user-code friendly, for testing.
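A minimal sketch of the kind of pattern this targets (not code from the PR; the import path of `sym_and` and the need for `capture_scalar_outputs` are assumptions):
```python
import torch
import torch._dynamo
from torch.fx.experimental.symbolic_shapes import sym_and  # import path is an assumption

torch._dynamo.config.capture_scalar_outputs = True  # keep .item() in the graph

def fn(x):
    u0 = x.item()  # unbacked SymInt
    # One combined check; with this change its conjuncts are recorded individually.
    torch._check(sym_and(u0 >= 0, u0 <= 10))
    # This individual bound should now follow from the stored implications
    # instead of raising a data-dependent error.
    torch._check(u0 <= 10)
    return x.new_zeros(u0)

print(torch.compile(fn, fullgraph=True)(torch.tensor([5])))
```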
Pull Request resolved: https://github.com/pytorch/pytorch/pull/154737
Approved by: https://github.com/laithsakka
#153622 introduced a hook for getting the relevant code objects after frame tracing. The idea is to have vLLM use this instead of monkey-patching `inline_call_()` to determine the source code files to hash. Unfortunately, the hook runs too late; the vLLM backend needs access to the set of source code filenames while it's running.
This PR replaces the newly-added hook with a utility function that a backend can call to get this information. I've made the change in vLLM and can verify that this allows the information to be queried at the right time.
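Roughly, the intended usage looks like the sketch below; the helper name `get_traced_code()` and its return value (a list of code objects) are assumptions for illustration, not a statement of the final API:
```python
import torch

def my_backend(gm, example_inputs):
    # Hypothetical: query the source files of the frames traced into this
    # graph while the backend is still running, e.g. to hash their contents
    # into a cache key the way vLLM wants to.
    from torch._dynamo.utils import get_traced_code  # name is an assumption
    filenames = {code.co_filename for code in get_traced_code()}
    print(sorted(filenames))
    return gm.forward

@torch.compile(backend=my_backend)
def fn(x):
    return x + 1

fn(torch.ones(3))
```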
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155249
Approved by: https://github.com/zou3519
Precompiled bytecode looks like the following:
```
pre-graph bytecode
...
compiled graph code
...
post-graph bytecode
```
In the pre-graph bytecode we have calls into helper functions like `torch._dynamo.utils.call_size`, which invoke `@disable` from inside the bytecode.
Normally torch.compile() handles these frames fine, but for precompile we load the bytecode from a clean dynamo state and want a way to assert that a recompile never happens. The current way to ensure this is set_stance("fail_on_recompile") (open to other ideas for testing this, but IMO this is the closest thing we have today).
That approach doesn't work when util functions like call_size() are involved, so this PR fixes a number of places to make sure "fail_on_recompile" can skip through the functions that are meant to be skipped during compilation.
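A minimal sketch of the testing pattern described above, using a toy function rather than the actual precompile flow:
```python
import torch

@torch.compile(dynamic=True)
def fn(x):
    # size() access goes through helpers like torch._dynamo.utils.call_size
    return x * x.size(0)

fn(torch.randn(4))  # compile once

# Any recompile from here on raises instead of silently recompiling; frames
# that are meant to be skipped (e.g. call_size) must not trip this stance.
with torch.compiler.set_stance("fail_on_recompile"):
    fn(torch.randn(8))
```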
Differential Revision: [D76156867](https://our.internmc.facebook.com/intern/diff/D76156867/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155363
Approved by: https://github.com/jamesjwu, https://github.com/jansel
ghstack dependencies: #155329
On this line, we see that the bw_compiler that dynamo uses for AOTAutograd automatically disables the backward callable:
05dd638ee9/torch/_dynamo/backends/common.py (L76)
This disables dynamo in the bw_compiler itself, but also disables the callable the compiler returns.
On an AOTAutogradCache hit, however, we never call the bw_compiler, so we don't disable dynamo properly. This only matters in certain cases of CPU tensors' backwards, where the backward runs in Python land and dynamo unnecessarily tries to trace through the inductor-generated code. It also only matters when the backward is accessed outside of dynamo itself (say, in a graph break in eager mode), since dynamo already disables the forward function properly.
```
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] TorchDynamo attempted to trace the following frames: [
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * fn /home/jjwu/test.py:9
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * cast /data/users/jjwu/a/pytorch-env/lib/python3.10/typing.py:1737
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * call /tmp/torchinductor_jjwu/rq/crq327nhoyjzog5n3qlchauucdrunrtutwmmoh7ipoe2ngnson5s.py:35
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * fn /home/jjwu/test.py:9
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * cast /data/users/jjwu/a/pytorch-env/lib/python3.10/typing.py:1737
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] * call /tmp/torchinductor_jjwu/rq/crq327nhoyjzog5n3qlchauucdrunrtutwmmoh7ipoe2ngnson5s.py:35
I0605 09:58:40.135000 3981970 torch/_dynamo/eval_frame.py:517] ]
```
This PR fixes the issue and adds a unit test showing that, with or without a cache hit, the frames dynamo traces are identical.
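A rough repro sketch of the scenario (not the PR's unit test): a CPU graph whose backward runs in Python outside of dynamo, where the compiled backward callable must stay disabled even on a cache hit.
```python
import torch

@torch.compile
def fn(x):
    return (x * x).sum()

x = torch.randn(4, requires_grad=True)  # CPU tensor
fn(x).backward()  # backward runs in the eager autograd engine, outside dynamo
# With the fix, dynamo does not attempt to trace the inductor-generated
# backward code here, whether this was an AOTAutogradCache hit or miss.
print(x.grad)
```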
Fixes https://github.com/pytorch/pytorch/issues/154536
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155251
Approved by: https://github.com/bdhirsh, https://github.com/anijain2305
Add comprehensive module docstring explaining built-in function and type
variable tracking, including handling of Python built-ins, type constructors,
operators, and special constructs during symbolic execution.
Originally generated by Claude but reviewed and edited by me.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155402
Approved by: https://github.com/Skylion007
ghstack dependencies: #155403
Add comprehensive module docstring explaining side effect tracking and
management, including mutation tracking, context changes, aliasing,
and state preservation during symbolic execution.
Originally generated by Claude but reviewed and edited by me.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155403
Approved by: https://github.com/williamwen42
Add comprehensive module docstring explaining the tracing rules and policies
that govern TorchDynamo's compilation decisions, including skip rules,
inlining policies, and library-specific handling.
Originally generated by Claude but reviewed and edited by me.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155401
Approved by: https://github.com/williamwen42
Add a few missing return type annotations in torch._logging and use ruff to infer the obvious ones.
LazyStr now properly checks the return type of the Callable and the args and kwargs passed to it.
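A sketch of the kind of typing this describes, using ParamSpec to tie the args/kwargs to the callable; the class name and fields only loosely mirror torch._logging's lazy-string helper, so treat the exact signature as an assumption:
```python
from typing import Callable, Generic
from typing_extensions import ParamSpec

P = ParamSpec("P")

class LazyString(Generic[P]):
    def __init__(
        self, func: Callable[P, str], *args: P.args, **kwargs: P.kwargs
    ) -> None:
        self.func = func
        self.args = args
        self.kwargs = kwargs

    def __str__(self) -> str:
        # Evaluated only when the log record is actually formatted.
        return self.func(*self.args, **self.kwargs)
```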
Pull Request resolved: https://github.com/pytorch/pytorch/pull/155345
Approved by: https://github.com/ezyang
We observed that the guard overhead at runtime, measured with profiler traces, was higher than what this profiling function reported at compile time. After investigation, we found that f_locals were already in the CPU cache, which made the guard overhead look much smaller when profiling during compilation. To be more realistic, we flush the cache here.
Profiling the guard overhead during compilation (in addition to at runtime) allows faster iteration time, as well as logging in tlparse and internal databases.
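Conceptually, the fix amounts to something like the sketch below (a Python illustration, not the actual guard-profiling code): evict f_locals from the CPU caches before timing the guard check so the compile-time number better matches runtime behavior.
```python
import time
import torch

def profile_guard_check(check_fn, f_locals, flush_mb=64):
    # Hypothetical helper: touch a buffer larger than the last-level cache so
    # f_locals are no longer cache-hot when we time the guard evaluation.
    flush = torch.empty(flush_mb * 1024 * 1024, dtype=torch.uint8)
    flush.fill_(0)
    start = time.perf_counter()
    check_fn(f_locals)
    return time.perf_counter() - start
```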
Pull Request resolved: https://github.com/pytorch/pytorch/pull/154764
Approved by: https://github.com/zou3519, https://github.com/jansel, https://github.com/StrongerXi
ghstack dependencies: #154769
AOTAutogradCache uses FXGraphCache, which uses the tracing context to get the ShapeEnv. Although the TracingContext's global_context is cleared by the time we get around to reusing it, we don't actually need the global_context; we just need the ShapeEnv in the TracingContext, which isn't cleared at the end of dynamo and does persist. This PR adds the tracing context manager around the specialized compile to ensure our caching infrastructure can get access to the ShapeEnv. A test was also added to prove correctness.
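Schematically, the change wraps the specialized compile in a tracing context that carries the ShapeEnv-bearing fake mode, roughly like the sketch below (names from torch._guards; the exact call site in the PR differs):
```python
from torch._guards import TracingContext, tracing

def compile_with_shape_env(compile_fn, fake_mode, *args, **kwargs):
    # The caching layers (FXGraphCache / AOTAutogradCache) look up the
    # ShapeEnv via the ambient TracingContext; global_context being cleared
    # earlier doesn't matter because only fake_mode.shape_env is needed.
    with tracing(TracingContext(fake_mode)):
        return compile_fn(*args, **kwargs)
```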
Pull Request resolved: https://github.com/pytorch/pytorch/pull/153526
Approved by: https://github.com/jamesjwu, https://github.com/zou3519
ghstack dependencies: #153433, #153449