pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Benjamin Glass d6d670ab4d [AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 ) In the future, we may also want to add LTO linking to further optimize the results (while still hopefully netting compile time benefits). Differential Revision: [D70641543](https://our.internmc.facebook.com/intern/diff/D70641543) Pull Request resolved: https://github.com/pytorch/pytorch/pull/148587 Approved by: https://github.com/desertfire		2025-03-05 22:47:46 +00:00
..
aoti_runtime	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cuda	[cutlass backend] fix assertion that prevent self multiplication (#148233 )	2025-03-05 21:26:22 +00:00
rocm	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
xpu	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
__init__.py
aoti_hipify_utils.py	remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916 )	2024-12-27 23:25:37 +00:00
block_analysis.py	[Inductor] Expand Identity ops prior to block pattern matching (#146000 )	2025-02-08 18:11:53 +00:00
common.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
cpp_bmm_template.py	[inductor][cpu] Move VNNI weight packing into AMX GEMM kernel for contiguous BMM weights (#146843 )	2025-02-21 21:46:00 +00:00
cpp_flex_attention_template.py	[Inductor][CPP] Avoid transpose with cpp micro-gemm for FlexAttention (#147069 )	2025-03-03 15:22:11 +00:00
cpp_gemm_template.py	[inductor][cpu] Fix error with FlexibleLayout weights in BMM (#148188 )	2025-03-05 01:05:05 +00:00
cpp_grouped_gemm_template.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550 )	2025-02-28 13:33:19 +00:00
cpp_micro_gemm.py	[Inductor][CPP] Avoid transpose with cpp micro-gemm for FlexAttention (#147069 )	2025-03-03 15:22:11 +00:00
cpp_prefix.h	[Inductor][CPP] fix store mode atomic add (#147961 )	2025-02-26 14:04:34 +00:00
cpp_template_kernel.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
cpp_template.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550 )	2025-02-28 13:33:19 +00:00
cpp_utils.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00
cpp_wrapper_cpu_array_ref.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00
cpp_wrapper_cpu.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00
cpp_wrapper_gpu.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00
cpp.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00
cpu_device_op_overrides.py	[inductor] Add types to DeviceOpOverrides (#145913 )	2025-02-01 16:33:49 +00:00
cuda_combined_scheduling.py	PEP585: More UP006 fixes (#146392 )	2025-02-20 06:18:13 +00:00
debug_utils.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
halide.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
memory_planning.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550 )	2025-02-28 13:33:19 +00:00
mps_device_op_overrides.py	[inductor] Add types to DeviceOpOverrides (#145913 )	2025-02-01 16:33:49 +00:00
mps.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
multi_kernel.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
simd_kernel_features.py	PEP585: More UP006 fixes (#146392 )	2025-02-20 06:18:13 +00:00
simd.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
triton_combo_kernel.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
triton_split_scan.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
triton_utils.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550 )	2025-02-28 13:33:19 +00:00
triton.py	Revert "[import][inductor] Simplify grid handling (#147583 )"	2025-03-03 00:49:32 +00:00
wrapper.py	[AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587 )	2025-03-05 22:47:46 +00:00