pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
David Berard	69e82d02d3	[inductor][3/N] triton support post-#5512, tt.divisibility format (#145575 ) 1. Fix the tt.divisibility format in hints.py. Previously, it was `{((0,), (1,)): [["tt.divisibility", 16]]}`. Now it is `{(0,): [["tt.divisibility", 16]], (1,): [["tt.divisibility", 16]]}`. This was an oversight in the first PR I added. I've verified that we now get `{ tt.divisibility = 16 }` in the generated TTGIR. 2. Update the test_codegen_triton.py test to work with multiple triton versions (and test this divisibility format in the new triton version) Pull Request resolved: https://github.com/pytorch/pytorch/pull/145575 Approved by: https://github.com/SamGinzburg	2025-01-27 21:48:58 +00:00
Alex Baden	ecf2240243	[Inductor] New Triton Attrs Descriptor Fixups (#138390 ) Fixes additional areas where we need to use the new Triton AttrsDescriptor if it is available. Pull Request resolved: https://github.com/pytorch/pytorch/pull/138390 Approved by: https://github.com/jansel, https://github.com/huydhn	2024-10-23 14:13:49 +00:00
PyTorch MergeBot	9f7b987087	Revert "[Inductor] New Triton Attrs Descriptor Fixups (#138390 )" This reverts commit `215999452e`. Reverted https://github.com/pytorch/pytorch/pull/138390 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but it still has another lint error ([comment](https://github.com/pytorch/pytorch/pull/138390#issuecomment-2430566004))	2024-10-23 00:37:28 +00:00
Alex Baden	215999452e	[Inductor] New Triton Attrs Descriptor Fixups (#138390 ) Fixes additional areas where we need to use the new Triton AttrsDescriptor if it is available. Pull Request resolved: https://github.com/pytorch/pytorch/pull/138390 Approved by: https://github.com/jansel	2024-10-22 23:16:05 +00:00
Xuehai Pan	134bc4fc34	[BE][Easy][12/19] enforce style for empty lines in import segments in `test/i*/` (#129763 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129763 Approved by: https://github.com/jansel	2024-07-18 07:49:19 +00:00
PyTorch MergeBot	b732b52f1e	Revert "[BE][Easy][12/19] enforce style for empty lines in import segments in `test/i*/` (#129763 )" This reverts commit `aecc746fcc`. Reverted https://github.com/pytorch/pytorch/pull/129763 on behalf of https://github.com/XuehaiPan due to need reland after rerunning lintrunner on main ([comment](https://github.com/pytorch/pytorch/pull/129763#issuecomment-2235736732))	2024-07-18 06:39:58 +00:00
Xuehai Pan	aecc746fcc	[BE][Easy][12/19] enforce style for empty lines in import segments in `test/i*/` (#129763 ) See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by linter. You can review these PRs via: ```bash git diff --ignore-all-space --ignore-blank-lines HEAD~1 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/129763 Approved by: https://github.com/jansel	2024-07-18 05:13:41 +00:00
Sam Larsen	535bc71d03	Enable FX graph caching in another batch of inductor tests (#121697 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/121697 Approved by: https://github.com/eellison	2024-03-15 19:38:51 +00:00
xinan.lin	e60bc502b4	[Inductor Intel GPU backend Upstream] Generalize part of Inductor test case (#117513 ) Following the RFC https://github.com/pytorch/pytorch/issues/114856, before upstream Intel XPU Inductor Backend, we need to preapre corresponding Inductor test cases. This PR aims to generalize part of Inductor test case so that a new GPU backend can reuse the existing test case with minimal code change. This Pull Request preferentially generalizes the test cases that cover Inductor's base functionality as follow: - test/inductor/test_codecache.py - test/inductor/test_codegen_triton.py - test/inductor/test_kernel_benchmark.py - test/inductor/test_torchinductor.py - test/inductor/test_torchinductor_codegen_dynamic_shapes.py - test/inductor/test_torchinductor_dynamic_shapes.py - test/inductor/test_torchinductor_opinfo.py - test/inductor/test_triton_heuristics.py - test/inductor/test_triton_wrapper.py Feature request: https://github.com/pytorch/pytorch/issues/114856 Pull Request resolved: https://github.com/pytorch/pytorch/pull/117513 Approved by: https://github.com/EikanWang, https://github.com/jansel	2024-01-18 08:26:21 +00:00
PyTorch MergeBot	1e60174891	Revert "[dynamo] Add run_inductor_tests entrypoint (#113278 )" This reverts commit `b00311ce9e`. Reverted https://github.com/pytorch/pytorch/pull/113278 on behalf of https://github.com/huydhn due to Sorry for reverting your stack, but it is failing to list test internally with buck2 ([comment](https://github.com/pytorch/pytorch/pull/113278#issuecomment-1811646325))	2023-11-15 01:19:48 +00:00
Jason Ansel	b00311ce9e	[dynamo] Add run_inductor_tests entrypoint (#113278 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/113278 Approved by: https://github.com/yanboliang	2023-11-11 08:54:43 +00:00
David Berard	becb8dc91a	[inductor] triton_utils.config_of: check for divisibility by 16, even when expr is not an Integer (#105743 ) TL;DR: triton_utils.config_of determines divisibility by 16 for each of the inputs to the kernel (pointer alignment for pointers, and divisibility by 16 for sizes). For sizes, the check previously could only return true if the expr representing the size was an integer. However, it's possible for non-integral exprs to be divisible by 16, e.g. for an expr like 16*s0. Motivation: Knowledge about divisibility by 16 allows for vectorizing loads and stores, which can improve memory bandwidth. If we have, for example, kernels with shape [s0, 16] (dynamic batch size; static, divisible-by-16 other dimensions), we want to still be able to vectorize those loads and stores. Dashboard results suggest that this improves dynamic shape training performance for timm, and possibly a small improvement for torchbench as well. More details are provided in a comment below. Pull Request resolved: https://github.com/pytorch/pytorch/pull/105743 Approved by: https://github.com/ezyang, https://github.com/aakhundov	2023-07-24 22:41:50 +00:00

12 Commits