pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Sam Larsen	fc1105b282	[inductor] Implement Fx graph caching to improve warm compilation time. (#103453 ) Summary: Implement an on-disk cache to save and reuse compiled FX Graphs. This implementation does not handle tensors with symbolic shapes. This needs to be done in a follow-up PR. Test Plan: * New unit tests exercising saving and load from the cache. * New unit tests to exercise the cache key calculations. * Ran several benchmarks to see cache hit and resulting compilation times. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103453 Approved by: https://github.com/eellison, https://github.com/Chillee	2023-10-11 14:39:14 +00:00
Tugsbayasgalan Manlaibaatar	5aee22e0e0	Move export.constrain_as_* to torch._constrain_as_* (#110757 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110757 Approved by: https://github.com/avikchaudhuri ghstack dependencies: #109859	2023-10-11 02:37:55 +00:00
leslie-fang-intel	a11d4a8378	[Reland] [Inductor] Break the loop fusion when node2 depends on node1 mutations (#110677 ) Reland PR https://github.com/pytorch/pytorch/pull/109172 which has been reverted in https://github.com/pytorch/pytorch/pull/110622 Differential Revision: [D50097373](https://our.internmc.facebook.com/intern/diff/D50097373) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110677 Approved by: https://github.com/jgong5, https://github.com/ezyang	2023-10-11 00:26:45 +00:00
PyTorch MergeBot	3100d3e661	Revert "[inductor] Implement Fx graph caching to improve warm compilation time. (#103453 )" This reverts commit `8a8668e1ae`. Reverted https://github.com/pytorch/pytorch/pull/103453 on behalf of https://github.com/kit1980 due to The newly added test fails on internal builds ([comment](https://github.com/pytorch/pytorch/pull/103453#issuecomment-1756449919))	2023-10-10 23:21:59 +00:00
vfdev-5	d2a2a67fa4	Added new test sample to interpolate op in OpInfo (#104181 ) Description: - Added new test sample to interpolate op in OpInfo - Fixed silent issue with zero tensor test sample for uint8 dtype Pull Request resolved: https://github.com/pytorch/pytorch/pull/104181 Approved by: https://github.com/pmeier, https://github.com/lezcano	2023-10-09 10:55:56 +00:00
Sam Larsen	8a8668e1ae	[inductor] Implement Fx graph caching to improve warm compilation time. (#103453 ) Summary: Implement an on-disk cache to save and reuse compiled FX Graphs. This implementation does not handle tensors with symbolic shapes. This needs to be done in a follow-up PR. Test Plan: * New unit tests exercising saving and load from the cache. * New unit tests to exercise the cache key calculations. * Ran several benchmarks to see cache hit and resulting compilation times. Pull Request resolved: https://github.com/pytorch/pytorch/pull/103453 Approved by: https://github.com/eellison	2023-10-08 20:32:15 +00:00
Adnan Akhundov	98b79e9488	[inductor] Add AOTI ABI shim function for torch.nonzero (#110766 ) Summary: `torch.nonzero` doesn't have inductor lowering (yet). To invoke the operator in AOT Inductor's ABI compatibility mode we need a dedicated shim function. Test Plan: ``` $ python test/inductor/test_aot_inductor.py -k test_zero_grid_with_unbacked_symbols ... ---------------------------------------------------------------------- Ran 4 tests in 78.650s OK ``` Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/110766 Approved by: https://github.com/chenyang78 ghstack dependencies: #110713, #110745, #110764	2023-10-07 08:32:27 +00:00
Adnan Akhundov	abb00f66d8	[inductor] Add AOTI ABI shim function for repeat_interleave.Tensor (#110745 ) Summary: `repeat_interleave.Tensor` doesn't have inductor lowering. To invoke the operator in AOT Inductor's ABI compatibility mode we need a dedicated shim function. Test Plan: ``` $ python test/inductor/test_aot_inductor.py -k test_repeat_interleave ... ---------------------------------------------------------------------- Ran 4 tests in 70.526s OK ``` Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/110745 Approved by: https://github.com/chenyang78 ghstack dependencies: #110713	2023-10-07 08:18:01 +00:00
Yang Chen	432df71820	[inductor] added a config to always add tensor constants (#110491 ) Summary: In some scenarios, we want to update constants at runtime. In such cases, we have to keep the original constants in the generated code without applying any constant-inlining optimizations. This PR adds a config to force us to add tensor constants. Differential Revision: D49895154 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110491 Approved by: https://github.com/mikekgfb	2023-10-07 07:51:54 +00:00
Oguz Ulgen	e8ef8bfdce	[Inductor] Allow matmul to have flexiable layout when we are not autotuning (#110726 ) Fixes #102804 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110726 Approved by: https://github.com/Chillee	2023-10-07 04:08:37 +00:00
chilli	6b1007b2a7	Fix error in div lowering with integers (#102809 ) Fixes https://github.com/pytorch/pytorch/issues/101016 Pull Request resolved: https://github.com/pytorch/pytorch/pull/102809 Approved by: https://github.com/ngimel ghstack dependencies: #110501, #110504, #110591, #110668, #110687	2023-10-06 23:21:40 +00:00
Adnan Akhundov	f74937741e	Remove runtime assertions between export and AOT compilation (#110710 ) Summary: The runtime assertions inserted in the `torch._export.export` by the `_AddRuntimeAssertionsForInlineConstraintsPass` lead to errors in AOT Inductor like #109884. In `torch._export.aot_compile` export and AOT compilation are run consecutively which would lead to the above issue if any assertions are inserted. In this PR, we're adding a new parameter / flag to `torch._export.aot_compile`, `remove_runtime_assertions`, to remove the assertions inserted during export before AOT compilation. The flag is set to `False` for BC. Additionally, we remove the flag `add_runtime_assertions_for_inline_constraints` recently added to `torch._dynamo.config`, as it can lead to undesirable `torch._export` behavior and is 's no longer required for the AOT Inductor testing purposes. Test Plan: CI Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/110710 Approved by: https://github.com/zhxchen17, https://github.com/chenyang78	2023-10-06 21:09:35 +00:00
Michael Voznesensky	7d98549ca9	retain_graph=True in compiled_autograd (#110367 ) Adds support for retain_graph=True - known as keep_graph_ internally in the autograd engine. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110367 Approved by: https://github.com/jansel	2023-10-06 08:22:10 +00:00
CK Luk	ecdd1bcf03	Back out "[Inductor] Break the loop fusion when node2 depends on node1 mutations (#109172 )" (#110622 ) Summary: Original commit changeset: 03980fb054d5 Original Phabricator Diff: D49519512 Bisecting shows that this diff is the cause of S369683. Since this affects Ads production, need to back out this diff immediately. Test Plan: See S369683 Reviewed By: ezyang Differential Revision: D49958638 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110622 Approved by: https://github.com/yanboliang	2023-10-05 20:09:09 +00:00
Bin Bao	298f01d9a2	[aotinductor] Avoid generating redundant kernel loading code (#110510 ) Summary: 1) Stop forcing triton.unique_kernel_names to True for AOTInductor, because the unique kernel name can be read from metadata; 2) Only generate load_kernel once for each kernel since we don't have control flow in our generated code. This solves https://github.com/pytorch/pytorch/issues/105553. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110510 Approved by: https://github.com/chenyang78, https://github.com/jansel	2023-10-05 19:59:38 +00:00
chilli	f767a6c57a	Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110504 Approved by: https://github.com/mlazos, https://github.com/eellison ghstack dependencies: #110501	2023-10-05 15:47:30 +00:00
PyTorch MergeBot	1e4c0641ce	Revert "Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 )" This reverts commit `9648df1a6a`. Reverted https://github.com/pytorch/pytorch/pull/110504 on behalf of https://github.com/PaliC due to temporarily will revert as it's causing problems with difftrain import ([comment](https://github.com/pytorch/pytorch/pull/110504#issuecomment-1749132253))	2023-10-05 15:28:23 +00:00
Oleg Khabinov	cf1b494afd	[AOTInductor] Store loaded kernels in the model (#110554 ) Defining kernels as static vars is problematic for subsequent model loading on non-default CUDA devices. Assuming those kernels were loaded in context of the device #0, so, they are not nullptr anymore, therefore kernels won't work on devices other than the device #0. This change makes devices remembered at model level in AOT mode. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110554 Approved by: https://github.com/chenyang78, https://github.com/desertfire	2023-10-05 10:17:05 +00:00
chilli	9648df1a6a	Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110504 Approved by: https://github.com/mlazos, https://github.com/eellison ghstack dependencies: #110501	2023-10-05 01:34:57 +00:00
Bin Bao	c121f957c2	[aotinductor] Enable test_non_default_cuda_device on CI (#110509 ) Summary: test_non_default_cuda_device needs to run on a multi-gpu CI instance Differential Revision: [D49937115](https://our.internmc.facebook.com/intern/diff/D49937115) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110509 Approved by: https://github.com/angelayi, https://github.com/khabinov, https://github.com/chenyang78	2023-10-05 01:25:50 +00:00
Edward Z. Yang	6a974bec5d	Change flash attention outputs to be SymInt instead of int (#110533 ) Fixes https://github.com/pytorch/pytorch/issues/110322 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/110533 Approved by: https://github.com/albanD	2023-10-05 01:00:07 +00:00
Jon Chuang	37afa0c349	fix(inductor): Increase coverage of Inductor ATen lowering (#110473 ) Add sqrt to decomp testing path and fix missing `minimum`, `clamp_min`,`clamp_max` lowerings and/or registrations. Follow up to: https://github.com/pytorch/pytorch/pull/110468#issuecomment-1745718602 (requires upstream to merge to avoid merge conflict) CC: @janeyx99 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110473 Approved by: https://github.com/janeyx99	2023-10-04 23:40:46 +00:00
Yang Chen	46a5558cd5	[AOTInductor] Simplified AOTInductor interface and model class (#110411 ) Summary: This PR removed several APIs from the AOTInductor interface, which are not used by the client. It also simplified AOTInductor's model class by removing the dim info for input/output tensors. We included dim info before to return max output shapes, which was used by the client to allocate memory for output tensors. Now, we allocate output tensor memory from the .so so that we don't need to maintain such information any more. The deletion of dim info from the model class also simplified the codegen quite a bit. Test Plan: ci Reviewed By: khabinov Differential Revision: D49835430 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110411 Approved by: https://github.com/khabinov, https://github.com/desertfire, https://github.com/jansel	2023-10-04 18:35:24 +00:00
Jon Chuang	3fd938369f	add `foreach_abs` meta registration and inductor decomp (#110468 ) Fixes https://github.com/pytorch/pytorch/issues/110458 Somehow it is on allowlist but not on testing path. CC @janeyx99 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110468 Approved by: https://github.com/janeyx99	2023-10-04 06:09:37 +00:00
Mu-Chu Lee	836ba6430a	[AOTInductor] Initial functionality for Inf and NaN checker (#109526 ) Summary: Add initial functionality for Inf and NaN checker for AOTInductor. Test Plan: Included in commit. Skipped for CI as SIGABRT can't be captured by pytest. Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D49379751](https://our.internmc.facebook.com/intern/diff/D49379751) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109526 Approved by: https://github.com/chenyang78	2023-10-03 22:39:42 +00:00
eellison	98c8550158	Fix Triplet Margin Loss Opinfo (#110302 ) Triplet Margin Loss takes in a Callable `distance_function` parameter which is not supported as an argument on the fx graph. See previous error: > File "/scratch/eellison/work/pytorch/torch/_dynamo/symbolic_convert.py", line 562, in call_function self.push(fn.call_function(self, args, kwargs)) File "/scratch/eellison/work/pytorch/torch/_dynamo/variables/torch.py", line 723, in call_function proxy_args_kwargs(args, kwargs), File "/scratch/eellison/work/pytorch/torch/_dynamo/utils.py", line 504, in proxy_args_kwargs f"call_function args: {typestr(args)} {typestr(*list(kwargs.values()))}" File "/scratch/eellison/work/pytorch/torch/_dynamo/exc.py", line 143, in unimplemented raise Unsupported(msg) torch._dynamo.exc.Unsupported: call_function args: TensorVariable() TensorVariable() TensorVariable() ConstantVariable(float) NNModuleVariable() This is fixable by just inlining into `triplet_margin_loss` and continuing to compile it. This required support for `has_torch_function_variadic`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110302 Approved by: https://github.com/mlazos	2023-10-03 20:26:13 +00:00
Peter Bell	dc794ec32c	[dynamo] Trace through builtin `abs` (#110398 ) In python `abs(x)` does nothing but delegate to `x.__abs__()` so we should do the same in dynamo. This also adds `SymNode.__abs__` so we can trace through indexing expressions involving `abs`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110398 Approved by: https://github.com/jansel, https://github.com/lezcano	2023-10-03 19:25:37 +00:00
Pruthvi Madugundu	9ce2e02fd6	Revert "[ROCm] Remove PYTORCH_MIOPEN_SUGGEST_NHWC flag (#90725 )" (#110319 ) This reverts commit `66bfcd32fd`. NHWC is have perf regression on MIOpen, so reverting till the performance issue is fixed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110319 Approved by: https://github.com/jeffdaily, https://github.com/jithunnair-amd, https://github.com/kit1980	2023-10-03 19:14:47 +00:00
Yang Chen	da63c7f2c3	[AOTInductor] remove CUDA dependency for cpp backend (#110409 ) Summary: Previously, we link against cuda libs even for pure cpp backend. This caused issues for cases where the inference platform does not have GPUs. This diff removed cuda dependency for cpp backend. Reviewed By: bertmaher, muchulee8, mikekgfb Differential Revision: D49800712 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110409 Approved by: https://github.com/bertmaher, https://github.com/desertfire	2023-10-03 18:36:00 +00:00
PyTorch MergeBot	df3ab70dde	Revert "Added new test sample to interpolate op in OpInfo (#104181 )" This reverts commit `87f8bc65f8`. Reverted https://github.com/pytorch/pytorch/pull/104181 on behalf of https://github.com/peterbell10 due to Causing OOM in slow-gradcheck ([comment](https://github.com/pytorch/pytorch/pull/104181#issuecomment-1745472323))	2023-10-03 18:07:02 +00:00
Fuzzkatt	e55d6f923c	minor tf32 fixes for unit tests on H100 and L40 (#110201 ) fixes the following tests which were failing in the NVIDIA internal CI on H100 and L40: test/test_nn.py: * test_TransformerEncoderLayer_gelu_activation_cuda_tf32 * test_Transformer_multilayer_coder_cuda_tf32 test/inductor/test_torchinductor.py: * test_batch_norm_2d_2_cuda Pull Request resolved: https://github.com/pytorch/pytorch/pull/110201 Approved by: https://github.com/mikaylagawarecki, https://github.com/jansel, https://github.com/Skylion007	2023-10-03 00:10:37 +00:00
eellison	3812f2e40c	Preserve layout on like constructors (#110242 ) Partially fixes `test_memory_format_factory_like_functions_preserve` with PYTORCH_TEST_WITH_INDUCTOR. Inductor preserves memory layouts for user-visible outputs as annotated on the fx graph that it is passed in. That graph is generated from running aot_autograd with decompositions. If the decompositions give incorrect strides, so will inductor. This preserves the layout of `_like` operators when it corresponds to a `torch.memory_format`. It doesnt fix a) arbitrary permutations, b) striding of non-dense outputs. Both of these are lower-pri compared to preserving channels last. We would need either https://github.com/pytorch/pytorch/issues/92920 or a `to` variant that takes in a physical layout arbitrary permutations. I converted the output of rand to the correct layout instead of passing the layout in so that this would compose with the `replace_random` pass, and because the two pointwise ops will get fused anyway. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110242 Approved by: https://github.com/int3	2023-10-02 23:53:55 +00:00
vfdev-5	87f8bc65f8	Added new test sample to interpolate op in OpInfo (#104181 ) Description: - Added new test sample to interpolate op in OpInfo - Fixed silent issue with zero tensor test sample for uint8 dtype Pull Request resolved: https://github.com/pytorch/pytorch/pull/104181 Approved by: https://github.com/pmeier, https://github.com/lezcano	2023-10-02 15:35:48 +00:00
Michael Voznesensky	06464a3477	Change compiled_autograd tests to xfail instead of skip (#110348 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110348 Approved by: https://github.com/Chillee, https://github.com/jansel, https://github.com/Skylion007	2023-10-01 23:03:36 +00:00
chilli	13681382d5	Add heuristic for when `evict_first` should be set (and some other minor things) (#108841 ) Example of when the `evict_first` heuristic helps. ``` @torch.compile def f(a, b): return (a * b).sum(dim=-1) N = 512 inps = (torch.randn(N, N, N).permute(2, 1, 0), torch.randn(N, N, N).permute(1, 2, 0)) from torch._inductor.utils import do_bench print(do_bench(lambda: f(*inps))) ``` This generates code like this: http://ix.io/4HFs ``` Original: 3.8 ms This PR: 3.54 ms Always `evict_first: 5.4ms ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/108841 Approved by: https://github.com/lezcano, https://github.com/jansel	2023-10-01 17:06:12 +00:00
Oleg Khabinov	669faab0ad	[AOTInductor] Add non-default device test (#110024 ) Differential Revision: D49604597 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110024 Approved by: https://github.com/chenyang78	2023-10-01 05:08:23 +00:00
Oleg Khabinov	e8c0364f36	[AOTInductor] Add model runner to avoid using torch_extension (#110263 ) Differential Revision: D49609669 Pull Request resolved: https://github.com/pytorch/pytorch/pull/110263 Approved by: https://github.com/chenyang78	2023-10-01 00:52:17 +00:00
Adnan Akhundov	2ead6c2f6e	Skip launching kernels with zero grid in AOT Inductor (#110312 ) Summary: with the grid computed in terms of unbacked `SymInt`s, it can happen that the grid is zero size. This causes CUDA error on `cuLaunchKernel` in the AOT Inductor codegen. In this PR, when the grid contains unbacked `SymInt`s, a check is added around the `launchKernel` in the AOT Inductor's C++ wrapper codegen to make sure that the grid is not zero-size. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110312 Approved by: https://github.com/chenyang78	2023-09-30 09:12:56 +00:00
leslie-fang-intel	7eeb392eb3	[Inductor] Enable the item() and nonzero() codegen test on CPU (#110262 ) Summary Follow up https://github.com/pytorch/pytorch/pull/109893 which has issue in support of CPU as reported in https://github.com/pytorch/pytorch/issues/109897. This fix mainly includes 2 changes: - Current implementation of `rename_indexing` `10c646295d/torch/_inductor/codegen/common.py (L1023)` only add symbol name start with `s` or `ps` into `kernel.args.sizevars`. However, `Unbacked symint` will start as `i`, so we extend the implementation of `rename_indexing` to support symbol start with `i`. - Currently, the internal loop index also name start as `i`. Since `i` has has been used as `Unbacked symint`, change the name to start with `x` which should align with trition. Test Plan ``` python -u -m pytest -s -v test_torchinductor_dynamic_shapes.py -k test_bool_mask_nobreak python -u -m pytest -s -v test_torchinductor_dynamic_shapes.py -k test_nonzero_size_factory_nobreak python -u -m pytest -s -v test_torchinductor_dynamic_shapes.py -k test_item_zeros_nobreak ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/110262 Approved by: https://github.com/ezyang, https://github.com/jgong5	2023-09-30 00:13:20 +00:00
Bin Bao	993eea0edd	[aotinductor] Fix a missing schema issue for repeat_interleave (#110105 ) Differential Revision: [D49686812](https://our.internmc.facebook.com/intern/diff/D49686812) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110105 Approved by: https://github.com/zou3519, https://github.com/jansel, https://github.com/aakhundov	2023-09-29 23:01:37 +00:00
Peter Bell	bc047ec906	[inductor] Make sure unfuse_addmm and addmm patterns don't overlap (#110235 ) Inductor has two opposing patterns, ``` addmm -> add + mm add + mm -> addmm ``` This uses the `extra_check` to disable the addmm fusion pattern when the heuristic to unfuse add is met, for consistency. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110235 Approved by: https://github.com/lezcano, https://github.com/eellison ghstack dependencies: #110232	2023-09-29 19:35:29 +00:00
Peter Bell	d04b35e7e3	[inductor] Fix bug in input mutation (#107614 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107614 Approved by: https://github.com/jansel	2023-09-29 18:27:06 +00:00
Bin Bao	0ff1155d3a	[aotinductor] Refactor test_aot_inductor to take different devices (#110216 ) Summary: Replace hardcoded device to self.device, to make it easier to test both cpu and cuda Pull Request resolved: https://github.com/pytorch/pytorch/pull/110216 Approved by: https://github.com/chenyang78, https://github.com/bertmaher ghstack dependencies: #110215	2023-09-29 16:30:19 +00:00
Bin Bao	ce6d09a775	[aotinductor] Refactor test_aot_inductor (#110215 ) Summary: Remove the usage of output tensors in the test script, since AOTInductor now returns output tensors instead of taking in pre-allocated output tensors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110215 Approved by: https://github.com/angelayi, https://github.com/chenyang78	2023-09-29 16:30:19 +00:00
chunyuan	20dabea35d	Inductor cpp wrapper: support MkldnnRnnLayer (#107858 ) 1. Directly use the `codegen` function of the parent class which already supported both python and cpp wrapper. 2. The output of the `at::mkldnn_rnn_layer` OP is actually a `std::tuple` `1491bae277/aten/src/ATen/native/mkldnn/RNN.cpp (L218)` Fix the type when calling `MultiOutput`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107858 Approved by: https://github.com/jgong5, https://github.com/jansel	2023-09-29 00:22:42 +00:00
Edward Z. Yang	d1a13129bb	Add support for item() and nonzero() codegen in Inductor (#109893 ) This is another version of https://github.com/pytorch/pytorch/pull/109262 that I think is more harmonious with inductor design. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/109893 Approved by: https://github.com/jansel	2023-09-28 23:37:31 +00:00
Yukio Siraichi	6f48d872d0	Re-land: Break graph on `manual_seed`. (#109109 ) Re-landing: #108647 (old #107594) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109109 Approved by: https://github.com/lezcano	2023-09-28 15:28:40 +00:00
angelayi	c71a64ccce	[aotinductor] Rename if name is prefixed with integer (#110113 ) Fixes https://github.com/pytorch/pytorch/issues/109894. Since in c++ we cannot have variables that start with an integer, we can do some additional handling in inductor to not produce constant tensors with names starting with integers. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110113 Approved by: https://github.com/desertfire	2023-09-28 07:26:28 +00:00
Mu-Chu Lee	840bb650f8	[AOTInductor] Update regex rule for symbol (#110184 ) Summary: Update regex rule to match _ letter. Test Plan: Included in commit Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/110184 Approved by: https://github.com/desertfire	2023-09-28 01:13:18 +00:00
Mu-Chu Lee	7782108792	[AOTIndutor] Fix freeze for AOTInductor (#110055 ) Summary: Add test for freeze graph in AOTInductor. Remove unused code path. Test Plan: Included in commit. Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/110055 Approved by: https://github.com/angelayi	2023-09-27 21:21:47 +00:00

1 2 3 4 5 ...

1159 Commits