pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Aaron Gokaslan	8cad88e1f3	[BE]: Improve exception typing. Remove NOQAs (#125535 ) Improve some exception typing Pull Request resolved: https://github.com/pytorch/pytorch/pull/125535 Approved by: https://github.com/albanD	2024-05-08 14:07:13 +00:00
PyTorch MergeBot	7ffa5558ee	Revert "[FX] Update type hints in `torch.fx._compatibility.py` (#125469 )" This reverts commit `235b4d6ec2`. Reverted https://github.com/pytorch/pytorch/pull/125469 on behalf of https://github.com/izaitsevfb due to breaks pyre in dependent projects (internal: see D56986361) ([comment](https://github.com/pytorch/pytorch/pull/125469#issuecomment-2096665396))	2024-05-06 18:36:43 +00:00
Aaron Gokaslan	1dd42e42c4	[BE]: Try TCH autofixes on torch/ (#125536 ) Tries TCH autofixes and see what breaks Pull Request resolved: https://github.com/pytorch/pytorch/pull/125536 Approved by: https://github.com/ezyang	2024-05-05 23:13:59 +00:00
Xuehai Pan	235b4d6ec2	[FX] Update type hints in `torch.fx._compatibility.py` (#125469 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/125469 Approved by: https://github.com/Skylion007 ghstack dependencies: #125468	2024-05-05 19:30:22 +00:00
Aaron Gokaslan	2f3b0befed	[BE]: Apply ruff FURB 118. (#124743 ) Replaces various lambdas with operator.itemgetter which is more efficient (as it's a builtin function). Particularly useful for when lambdas are used as 'key' functions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/124743 Approved by: https://github.com/albanD, https://github.com/malfet	2024-04-26 14:34:52 +00:00
Aaron Gokaslan	c5fafe9f48	[BE]: TRY002 - Ban raising vanilla exceptions (#124570 ) Adds a ruff lint rule to ban raising raw exceptions. Most of these should at the very least be runtime exception, value errors, type errors or some other errors. There are hundreds of instance of these bad exception types already in the codebase, so I have noqa'd most of them. Hopefully this error code will get commiters to rethink what exception type they should raise when they submit a PR. I also encourage people to gradually go and fix all the existing noqas that have been added so they can be removed overtime and our exception typing can be improved. Pull Request resolved: https://github.com/pytorch/pytorch/pull/124570 Approved by: https://github.com/ezyang	2024-04-21 22:26:40 +00:00
Aaron Orenstein	37215a4fa2	Fix memory leak in pattern_matcher (#124345 ) #121313 changed precompiled patterns so they are more integrated with the pattern matching code. This resulted with a list of "known" patterns (with their example data) being stored globally. Unfortunately since small FakeTensors store a constant of the original tensor it meant that we leaked cuda tensors in the example data. Fix this by clearing out the constant storage for the example data that we keep around. Fixes #124081 Pull Request resolved: https://github.com/pytorch/pytorch/pull/124345 Approved by: https://github.com/xuzhao9	2024-04-18 17:38:12 +00:00
Xuehai Pan	93e249969b	[BE] enable `ruff` rule `RSE` and remove useless parentheses in `raise` statements (#124261 ) Remove useless parentheses in `raise` statements if the exception type is raised with no argument. Pull Request resolved: https://github.com/pytorch/pytorch/pull/124261 Approved by: https://github.com/albanD	2024-04-17 19:29:34 +00:00
Aaron Orenstein	5712c326a5	Teach pattern_matcher to use a pre-traced pattern if given (#121314 ) The check_fn portion of pattern_matcher was retracing the pattern even if a pre-traced pattern was provided. I think that as long as the patterns don't have control flow based on their inputs then this should be safe. For this benchmark ``` python benchmarks/dynamo/huggingface.py --training --amp --performance --only MobileBertForQuestionAnswering --backend=inductor ``` this improves the performance of `joint_graph_passes` from about 9s down to 3s. In the performance dashboard it seems to be a small win - most of the compilation times dropped by a couple seconds: Torchbench 126s -> 124s Huggingface 114s -> 110s TIMM models 209s -> 208s Dynamic 44s -> 43s Blueberries 84s -> 81s Pull Request resolved: https://github.com/pytorch/pytorch/pull/121314 Approved by: https://github.com/eellison ghstack dependencies: #121313	2024-04-09 19:42:19 +00:00
Aaron Orenstein	4044e93a51	Add mm_pattern and bmm_pattern to serialized_patterns (#121313 ) Make it easier to serialize patterns by adding `pattern_matcher.gen_register_replacement()` which is like `pattern_matcher.register_replacement()` but also requires the replacement to be precompiled. To precompile patterns (and save to disk) run: ``` torchgen/fuse_attention_patterns/gen_attention_patterns.py ``` - Updated the sfdp patterns to use `gen_register_replacement`. - Add serialized patterns for mm_pattern and bmm_pattern (The 'misc' patterns don't serialize cleanly so can't be added). - Updated the testing so it checked the round-trip patterns match and not just that it serialized the same way. - Checking that the patterns round-trip properly found that the `users` field wasn't being serialized properly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/121313 Approved by: https://github.com/eellison	2024-04-09 19:42:19 +00:00
Oguz Ulgen	89724843bb	Use graph.find_nodes in pattern matcher (#122331 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122331 Approved by: https://github.com/jansel ghstack dependencies: #121565, #122255, #122256, #122257, #122258	2024-04-07 18:51:22 +00:00
Oguz Ulgen	222dfc4282	[Inductor] Run pattern matcher over the original graph (#122519 ) Differential Revision: [D55429070](https://our.internmc.facebook.com/intern/diff/D55429070) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122519 Approved by: https://github.com/jansel	2024-03-27 22:09:36 +00:00
PyTorch MergeBot	b63f6f78dc	Revert "[Inductor] Run pattern matcher over the original graph (#122519 )" This reverts commit `1f5fcb4e20`. Reverted https://github.com/pytorch/pytorch/pull/122519 on behalf of https://github.com/atalman due to Breaks internal tests ([comment](https://github.com/pytorch/pytorch/pull/122519#issuecomment-2023022311))	2024-03-27 15:13:26 +00:00
Oguz Ulgen	1f5fcb4e20	[Inductor] Run pattern matcher over the original graph (#122519 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/122519 Approved by: https://github.com/jansel	2024-03-26 17:30:32 +00:00
Jason Ansel	07d037674f	[inductor] Fix issue with randint + symbolic shapes (#122428 ) Fixes #122405 Pull Request resolved: https://github.com/pytorch/pytorch/pull/122428 Approved by: https://github.com/ezyang	2024-03-24 03:41:13 +00:00
Menglu Yu	7b1f5c874f	[PT2][Optimus][Observability] Log the optimus graph transformation to the scuba (#119745 ) Summary: Current everstore upload logging may cuase excessive compilation time when the model has lots of graph breaks (post: https://fb.workplace.com/groups/257735836456307/permalink/633533465543207/), we here log the transformation only when the graph changed Test Plan: timeout flows: f528209775 f530084719 Differential Revision: D53692344 Pull Request resolved: https://github.com/pytorch/pytorch/pull/119745 Approved by: https://github.com/jackiexu1992	2024-02-16 21:32:04 +00:00
Jason Ansel	75a6d6aef7	[inductor] Support storage resizing (#119749 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/119749 Approved by: https://github.com/yf225 ghstack dependencies: #119647, #119671	2024-02-14 03:03:38 +00:00
rzou	cf474a09f5	Decompose torch.ops.higher_order.auto_functionalized in Inductor (#118673 ) We'd like to get auto_functionalized to work with AOTInductor. To get there, we decompose `output = auto_functionalized(inplace_op, ...)` into its corresponding aten ops (clones + inplace_op) before the Inductor lowering phase. This decomposition must happen at the end of the Inductor FX passes because it introduces in-place operations. The pattern matcher's "replace this single node with multiple nodes" API isn't robust enough here. The problem is that `auto_functionalized` returns a single output (this output is a List), but the decomposition ends up returning the unpacked List (e.g. it may return two tensors). Previously, there was an assertion that this was not the case; I fixed up `replace_with_graph` to handle this. Future: Not all of the clones are necessary (e.g. if the input's last usage is this operator, then we don't need to clone it). We can add this logic later. Test Plan: - existing tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/118673 Approved by: https://github.com/oulgen	2024-02-12 17:30:01 +00:00
Edward Z. Yang	3f0fd36835	Introduce size oblivious guards (#118579 ) Fixes https://github.com/pytorch/pytorch/issues/117361 The implementation here slightly diverges from what was proposed in the issue, so I will recap what this PR is doing here. Today, when doing computations involving size-like unbacked SymInts, we assume for all operations that the compile time range of the integer is `[2, inf]`, even though at runtime we also accept zero and one. This PR removes the carte blanche assumption, and instead does the analysis in a much more limited and controlled fashion: only for guards which we have designated as "size oblivious" are we willing to do the analysis under the assumption that the range of all size-like unbacked SymInts is `[2, inf]`; otherwise, we will faithfully only do analysis with `[0, inf]` (or whatever the user provided) bounds. The infra pieces of this PR are: * Remove runtime_var_to_range from torch/fx/experimental/symbolic_shapes.py; modify `_constrain_range_for_size` to refine the range without clamping min to 2, and instead add the symbol to a `size_like` set in the ShapeEnv * When evaluating an expression, if the expression is requested to be evaluated in a `size_oblivious` way, we attempt to statically compute the value of the expression with the assumption that all symbols in `size_like` are updated to assume that they are `>= 2`. * Add Python and C++ APIs for guarding on a SymBool in a size-oblivious way. In C++, I also need to add some helpers for performing symbolic comparisons, since the stock comparisons immediately specialize in the "normal" way. The rest of the changes of the PR are marking various spots in PyTorch framework code as size oblivious, based on what our current test suite exercises. As you review the places where we have marked things as size oblivious, it may become clear why I ended up not opting for the "designate a branch as the default branch when it's not statically obvious which way to go": for some of the conditions, this answer is rather non-obvious. I think potentially there is another refinement on top of this PR, which is something like "I don't care if you can't figure it out with ValueRange analysis, go down this path anyway if there are unbacked sizes involved." But even if we add this API, I think we are obligated to attempt the ValueRange analysis first, since it can lead to better outcomes sometimes (e.g., we are able to figure out that something is contiguous no matter what the unbacked size is.) When is it permissible to mark something as size oblivious? Heuristically, it is OK anywhere in framework code if it gets you past a guard on unbacked SymInt problem. It is somewhat difficult to provide a true semantic answer, however. In particular, these annotations don't have any observational equivalence guarantee; for example, if I have `torch.empty(u0, 1).squeeze()`, we will always produce a `[u0]` size tensor, even though if `u0 == 1` PyTorch will actually produce a `[]` size tensor. The argument that I gave to Lezcano is that we are in fact defining an alternate semantics for a "special" size = 0, 1, for which we have these alternate eager mode semantics. In particular, suppose that we have a constant `special1` which semantically denotes 1, but triggers alternate handling rules. We would define `torch.empty(special1, 1).squeeze()` to always produce a `[special1]` size tensor, making its semantics coincide with unbacked SymInt semantics. In this model, the decision to designate guards as size oblivious is simply a user API question: you put them where ever you need some handling for special1! As we conservatively error out whenever it is not obvious what `special1` semantics should be, it is always valid to expand these semantics to cover more cases (although you can always choose the wrong semantics!) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118579 Approved by: https://github.com/eellison, https://github.com/lezcano	2024-02-06 19:45:32 +00:00
Edward Z. Yang	68c3cb7594	s/fialure/failure/ (#118744 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118744 Approved by: https://github.com/peterbell10	2024-01-31 17:42:44 +00:00
Catherine Lee	4f5785b6b3	Enable possibly-undefined error code (#118533 ) Fixes https://github.com/pytorch/pytorch/issues/118129 Suppressions automatically added with ``` import re with open("error_file.txt", "r") as f: errors = f.readlines() error_lines = {} for error in errors: match = re.match(r"(.):(\d+):\d+: error:.\[(.*)\]", error) if match: file_path, line_number, error_type = match.groups() if file_path not in error_lines: error_lines[file_path] = {} error_lines[file_path][int(line_number)] = error_type for file_path, lines in error_lines.items(): with open(file_path, "r") as f: code = f.readlines() for line_number, error_type in sorted(lines.items(), key=lambda x: x[0], reverse=True): code[line_number - 1] = code[line_number - 1].rstrip() + f" # type: ignore[{error_type}]\n" with open(file_path, "w") as f: f.writelines(code) ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Co-authored-by: Catherine Lee <csl@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118533 Approved by: https://github.com/Skylion007, https://github.com/zou3519	2024-01-30 21:07:01 +00:00
PyTorch MergeBot	40ece2e579	Revert "Enable possibly-undefined error code (#118533 )" This reverts commit `4f13f69a45`. Reverted https://github.com/pytorch/pytorch/pull/118533 on behalf of https://github.com/clee2000 due to sorry i'm trying to figure out a codev merge conflict, if this works i'll be back to rebase and merge ([comment](https://github.com/pytorch/pytorch/pull/118533#issuecomment-1917695185))	2024-01-30 19:00:34 +00:00
Edward Z. Yang	4f13f69a45	Enable possibly-undefined error code (#118533 ) Fixes https://github.com/pytorch/pytorch/issues/118129 Suppressions automatically added with ``` import re with open("error_file.txt", "r") as f: errors = f.readlines() error_lines = {} for error in errors: match = re.match(r"(.):(\d+):\d+: error:.\[(.*)\]", error) if match: file_path, line_number, error_type = match.groups() if file_path not in error_lines: error_lines[file_path] = {} error_lines[file_path][int(line_number)] = error_type for file_path, lines in error_lines.items(): with open(file_path, "r") as f: code = f.readlines() for line_number, error_type in sorted(lines.items(), key=lambda x: x[0], reverse=True): code[line_number - 1] = code[line_number - 1].rstrip() + f" # type: ignore[{error_type}]\n" with open(file_path, "w") as f: f.writelines(code) ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118533 Approved by: https://github.com/Skylion007, https://github.com/zou3519	2024-01-30 05:08:10 +00:00
Edward Z. Yang	d03173e88c	Unify MYPYINDUCTOR and MYPY (#118432 ) The original motivation for MYPYINDUCTOR was a faster type checking configuration that only checked a subset of files. With the removal of `follow_imports = ignore`, we are now able to use dmypy to do fast incremental typechecking, eliminating the need for this. Perhaps erroneously, when I tee'ed up this PR I elected to delete the `follow_imports = skip` designations in the mypy-inductor.ini. This lead to a number of extra type error suppressions that I manually edited. You will need to review. Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118432 Approved by: https://github.com/Skylion007 ghstack dependencies: #118414, #118418	2024-01-27 17:23:20 +00:00
Animesh Jain	e056cf5507	[ac][pattern matcher] Do not percolate tags beyond the inputs of matched portion (#118034 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/118034 Approved by: https://github.com/yf225	2024-01-23 05:02:32 +00:00
Animesh Jain	f7d9047864	[inductor] Iterative percolate tags (#117306 ) Fixes https://github.com/pytorch/pytorch/issues/116581 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117306 Approved by: https://github.com/aorenste, https://github.com/eellison	2024-01-12 07:52:32 +00:00
Elias Ellison	4c7b602645	Add Support For Symbolic Shapes in Register_replacement, SDPA Pattern Matching (#115441 ) Many of our pattern matching replacements are specified as a `search_fn` and a `replacment_fn`. The search_fn's are traced out once with static shapes, converted to a pattern, and then matched on every graph compiled with inductor. The static shape patterns would not match with graphs that are traced out with dynamic shapes because SymInts would be added to the graph as `sym_size` fx nodes which added additional uses and prevented matching. The previous PR partially addresses this by deduping SymInts that are resolvable to graph inputs, as is the calling convention in aot autograd. This PR adjusts our matching of the `search_fn` by adding SymInts to the arguments we trace out the search_fn with so that their symint accesses are deduped. Later, if we have a match, we will trace out the replacement graph with the correct Tensors and corresponding symbolic shapes that will get added to the graph. Note: the replacement patterns will insert sym_size uses which could potentially be removed, but I'll leave that for follow up. Fix for https://github.com/pytorch/pytorch/issues/111190. Pull Request resolved: https://github.com/pytorch/pytorch/pull/115441 Approved by: https://github.com/jansel ghstack dependencies: #116158	2024-01-11 15:58:37 +00:00
Aaron Orenstein	71d8fe690f	Replace recursive stable_topological_sort() with iterative. (#116761 ) Summary: A graph with a deep set of nodes caused stable_topological_sort() to recurse and pop the stack. Rewrite it to be iterative and avoid recursion. Fixes #115506 Pull Request resolved: https://github.com/pytorch/pytorch/pull/116761 Approved by: https://github.com/jansel, https://github.com/oulgen, https://github.com/Skylion007	2024-01-05 20:13:49 +00:00
Jason Ansel	69a8f9b07e	[inductor] Fix shape mismatch in sdpa pattern matcher (#115038 ) Fixes #100316 Pull Request resolved: https://github.com/pytorch/pytorch/pull/115038 Approved by: https://github.com/oulgen	2023-12-03 22:32:12 +00:00
Jez Ng	47e6cc4d22	Remove yet more type-ignores in dynamo/inductor (#114684 ) Probably the last big batch for a while Pull Request resolved: https://github.com/pytorch/pytorch/pull/114684 Approved by: https://github.com/Skylion007	2023-11-28 22:09:38 +00:00
Jez Ng	b0ede09682	[inductor] Make pattern_matcher.py pass follow_imports typechecking (#113409 ) Import following reveals that a good number of hints were wrong... Pull Request resolved: https://github.com/pytorch/pytorch/pull/113409 Approved by: https://github.com/Skylion007	2023-11-10 19:58:08 +00:00
Tugsbayasgalan Manlaibaatar	84d64d72d6	Persist copy_ in training graph for inputs that don't require grad (#111046 ) In this PR, we try to keep the input mutations in the forward graph IFF input mutation is data mutation and not metadata mutation and doesn't require grad. This is for optimizing inductor training graphs. (For more details: https://github.com/pytorch/pytorch/issues/109240) We keep the input mutation in the graph by wrapping the original callable in a wrapper function where in the end we add input.copy_(updated_input) call which is then traced via make_fx. Previously, this was only enabled for forward-only path but unconditionally disabled for joint graph. Another caveat is that when we are tracing through tensor subclasses, we won't allow any input mutations to be preserved in the graph. The reason is that it makes the code logic quite ugly for no obvious performance improvement. Most of the changes in this PR are mechanical and I didn't have to make any change to the partitioner. Previously forward/backward heavily relied on metadata field `num_mutated_inps` to figure out whether something is returned as extra output or not. But now since we keep some mutations in the graph, we need to propogate something similar to `num_mutated_inps - num_graph_handled_inps`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111046 Approved by: https://github.com/ezyang, https://github.com/bdhirsh	2023-11-09 00:40:29 +00:00
Aaron Gokaslan	8219bf051b	[BE]: Apply RUF015 to torch folder (#113025 ) Removes unnecessary allocations of iterators. There is a small chance this may have side effects as the entire iterator is no longer consumed, but this is a way more efficient method for retrieving the first element. Pull Request resolved: https://github.com/pytorch/pytorch/pull/113025 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-11-07 00:48:15 +00:00
chilli	0ac748cd29	Make pattern-matcher failure diagnostics lazy (again) and added an error message if format string is too long (#112923 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112923 Approved by: https://github.com/eellison ghstack dependencies: #112476	2023-11-04 02:54:17 +00:00
chilli	3cee033b98	Reland of a bunch of pattern matcher + indexing fixes (#112476 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112476 Approved by: https://github.com/oulgen	2023-11-01 02:13:44 +00:00
Yanbo Liang	710337244d	[Inductor] Extend Pattern Matcher to Match Equivalent Function Invocation (#107832 ) Fixes #104391 Pull Request resolved: https://github.com/pytorch/pytorch/pull/107832 Approved by: https://github.com/jansel	2023-10-31 03:32:33 +00:00
PyTorch MergeBot	fc0b0820fc	Revert "Readded device_assert skipping in index and index_put (and also added (#112093 )" This reverts commit `b110d87ac2`. Reverted https://github.com/pytorch/pytorch/pull/112093 on behalf of https://github.com/ZainRizvi due to Stack breaks internal builds ([comment](https://github.com/pytorch/pytorch/pull/112093#issuecomment-1785922905))	2023-10-30 19:45:41 +00:00
PyTorch MergeBot	4439b906c4	Revert "Some cleanups in pattern matcher (#112101 )" This reverts commit `f7dc0ae16c`. Reverted https://github.com/pytorch/pytorch/pull/112101 on behalf of https://github.com/ZainRizvi due to Stack breaks internal builds ([comment](https://github.com/pytorch/pytorch/pull/112101#issuecomment-1785920248))	2023-10-30 19:43:40 +00:00
Peter Bell	bbd5b935e4	Use `pytree.tree_leaves` everywhere (#112324 ) This changes all the instances I could find of `tree_flatten(...)[0]` or `x, _ = tree_flatten` to use `tree_leaves`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/112324 Approved by: https://github.com/lezcano ghstack dependencies: #112327, #112323	2023-10-30 03:39:04 +00:00
chilli	f7dc0ae16c	Some cleanups in pattern matcher (#112101 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112101 Approved by: https://github.com/eellison ghstack dependencies: #112093	2023-10-27 21:04:39 +00:00
chilli	b110d87ac2	Readded device_assert skipping in index and index_put (and also added (#112093 ) copy to noop pass) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112093 Approved by: https://github.com/oulgen, https://github.com/lezcano	2023-10-27 18:23:49 +00:00
PyTorch MergeBot	0a3199dd7e	Revert "Readded device_assert skipping in index and index_put (and also added (#112093 )" This reverts commit `e38347f490`. Reverted https://github.com/pytorch/pytorch/pull/112093 on behalf of https://github.com/izaitsevfb due to Sorry, trying to resolve a conflict with intern, and unblock the revert of #108690 ([comment](https://github.com/pytorch/pytorch/pull/112093#issuecomment-1782154814))	2023-10-27 01:37:33 +00:00
chilli	e38347f490	Readded device_assert skipping in index and index_put (and also added (#112093 ) copy to noop pass) Pull Request resolved: https://github.com/pytorch/pytorch/pull/112093 Approved by: https://github.com/oulgen, https://github.com/lezcano ghstack dependencies: #111990	2023-10-26 07:54:44 +00:00
Brian Hirsh	7fb09b804b	Reland "AOTAutograd: Go down inference path if no outputs require grad (#111011 )" (#111347 ) Re-land of https://github.com/pytorch/pytorch/pull/111011. The original PR ended up having a bad interaction with code that tried to run `torch.compile` under `with torch.inference_mode`, which caused some internal tests to fail. The issue was that: (1) AOTInductor invokes the pattern matcher passes in inductor (2) The pattern matcher registers some code with [training_graph](https://github.com/pytorch/pytorch/blob/main/torch/_inductor/fx_passes/pad_mm.py#L461) (3) The `training_graph` function expects to be able to set the global autograd state to `requires_grad`, and always get out a join graph (assertion [here](https://github.com/pytorch/pytorch/blob/main/torch/_inductor/pattern_matcher.py#L1196)). (4) However, when inference_mode is activated, and you try to run AOTAutograd, AOTAutograd will witness that all outputs to the traced function will not require grad, and (now correctly) think that we are tracing an inference graph, which fails the above assert. After talking to Bin, it sounds like these training-only patterns aren't necessary when we know we are compiling an inference graph (which should always be the case if you're running torch.compile with inference_mode). So I updated the pattern matcher to ignore any pattern matches using `training_graph`, when inference_mode is enabled. This reverts commit `cf6b1cdf6a`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111347 Approved by: https://github.com/Chillee	2023-10-17 00:11:15 +00:00
chilli	f767a6c57a	Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110504 Approved by: https://github.com/mlazos, https://github.com/eellison ghstack dependencies: #110501	2023-10-05 15:47:30 +00:00
PyTorch MergeBot	1e4c0641ce	Revert "Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 )" This reverts commit `9648df1a6a`. Reverted https://github.com/pytorch/pytorch/pull/110504 on behalf of https://github.com/PaliC due to temporarily will revert as it's causing problems with difftrain import ([comment](https://github.com/pytorch/pytorch/pull/110504#issuecomment-1749132253))	2023-10-05 15:28:23 +00:00
Kazuaki Ishizaki	434a996c42	Fix typo under torch/_inductor directory (#110530 ) This PR fixes typo of comments and messages in files under `torch/_dynamo` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110530 Approved by: https://github.com/kit1980	2023-10-05 02:17:20 +00:00
chilli	9648df1a6a	Made pattern-matcher diagnostics lazily reported + added TORCH_COMPILE_CPROFILE (#110504 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110504 Approved by: https://github.com/mlazos, https://github.com/eellison ghstack dependencies: #110501	2023-10-05 01:34:57 +00:00
Huamin Li	85ddc985d0	Back out "[pytorch][PR] [Inductor] Extend Pattern Matcher to Match Equivalent Function Invocation" (#109931 ) Summary: Original commit changeset: 3466b85fe0a1 Original Phabricator Diff: D49433268 More context D49536556 bypass-github-pytorch-ci-checks Test Plan: revertreverthammer Differential Revision: D49565384 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109931 Approved by: https://github.com/houseroad	2023-09-23 05:58:08 +00:00
Jez Ng	2895fbd857	Enable typechecking for _inductor/pattern_matcher.py (#109613 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/109613 Approved by: https://github.com/Skylion007	2023-09-22 20:50:21 +00:00

1 2 3

111 Commits