pytorch/torch/_inductor/codegen
2025-03-20 21:33:29 +00:00
..
aoti_runtime [AOTInductor] Add function to free buffer (#149161) 2025-03-18 02:43:14 +00:00
cuda [Inductor Cutlass backend] Fix imports and compilation of Cutlass SM100 Kernels (#149515) 2025-03-20 20:35:18 +00:00
rocm Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
xpu Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
__init__.py
aoti_hipify_utils.py remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916) 2024-12-27 23:25:37 +00:00
block_analysis.py [inductor][triton] Block ptr analysis fix assert on matched index expression (#148446) 2025-03-10 05:26:55 +00:00
common.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
cpp_bmm_template.py [inductor][cpu] Move VNNI weight packing into AMX GEMM kernel for contiguous BMM weights (#146843) 2025-02-21 21:46:00 +00:00
cpp_flex_attention_template.py [Inductor][CPP] Avoid transpose with cpp micro-gemm for FlexAttention (#147069) 2025-03-03 15:22:11 +00:00
cpp_gemm_template.py Iterate over dense dim first in split reduction reindexing (#147229) 2025-03-18 17:35:21 +00:00
cpp_grouped_gemm_template.py [BE][PYFMT] migrate PYFMT for torch._inductor to ruff format (#144550) 2025-02-28 13:33:19 +00:00
cpp_micro_gemm.py [Inductor][CPP] Avoid transpose with cpp micro-gemm for FlexAttention (#147069) 2025-03-03 15:22:11 +00:00
cpp_prefix.h [Inductor] optimize welford reduction (#145061) 2025-03-18 02:05:35 +00:00
cpp_template_kernel.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
cpp_template.py [BE][PYFMT] migrate PYFMT for torch._inductor to ruff format (#144550) 2025-02-28 13:33:19 +00:00
cpp_utils.py [AOTI] build CPU CPP kernels at O3, and all other code at O1 (#148587) 2025-03-05 22:47:46 +00:00
cpp_wrapper_cpu_array_ref.py Fix too big to optimize in test, actually use O0 when aot_inductor.compile_wrapper_with_O0 is set (#148714) 2025-03-13 10:22:06 +00:00
cpp_wrapper_cpu.py Fix too big to optimize in test, actually use O0 when aot_inductor.compile_wrapper_with_O0 is set (#148714) 2025-03-13 10:22:06 +00:00
cpp_wrapper_gpu.py Fix local compilication and hipification (#149384) 2025-03-18 22:56:02 +00:00
cpp.py [pt2_provenance_tracking] add support for cpp kernel (#149185) 2025-03-18 04:43:07 +00:00
cpu_device_op_overrides.py [inductor] Add types to DeviceOpOverrides (#145913) 2025-02-01 16:33:49 +00:00
cuda_combined_scheduling.py Revert "Use the device interface for detecting Triton availability (#139171)" 2025-03-11 18:49:21 +00:00
debug_utils.py [AOTI][debug logger] small fix for intermediate value debugger for jit when arg is not tensor (#149007) 2025-03-14 19:40:41 +00:00
halide.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
memory_planning.py [BE][PYFMT] migrate PYFMT for torch._inductor to ruff format (#144550) 2025-02-28 13:33:19 +00:00
mps_device_op_overrides.py [inductor] Add types to DeviceOpOverrides (#145913) 2025-02-01 16:33:49 +00:00
mps.py [MPS/Inductor] Add support for modified_bessel_k0. (#149593) 2025-03-20 04:51:44 +00:00
multi_kernel.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
simd_kernel_features.py [BE][Ez]: Use itertools.chain.from_iterable when possible (#148190) 2025-03-06 20:37:06 +00:00
simd.py [inductor] Add a helper for convert index_dtype to torch dtype (#149531) 2025-03-20 21:33:29 +00:00
triton_combo_kernel.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
triton_split_scan.py Reland: [inductor] Simplify grid handling (#148305) 2025-03-12 15:52:16 +00:00
triton_utils.py [BE][PYFMT] migrate PYFMT for torch._inductor to ruff format (#144550) 2025-02-28 13:33:19 +00:00
triton.py [inductor] Add a helper for convert index_dtype to torch dtype (#149531) 2025-03-20 21:33:29 +00:00
wrapper.py [AOTI][refactor] Remove dead code (#149287) 2025-03-20 07:29:27 +00:00