pytorch/torch/_inductor/codegen
bglass@quansight.com 40ccb7a86d cpp_wrapper: Move #includes to per-device header files (#145932)
Summary:
This prepares us for the next PR in the stack, where we introduce pre-compiled per-device header files to save compilation time.

Reland https://github.com/pytorch/pytorch/pull/143909 after merge conflicts.

Co-authored-by: Benjamin Glass <[bglass@quansight.com](mailto:bglass@quansight.com)>

Differential Revision: D68656960

Pulled By: benjaminglass1

Pull Request resolved: https://github.com/pytorch/pytorch/pull/145932
Approved by: https://github.com/yushangdi, https://github.com/benjaminglass1

Co-authored-by: bglass@quansight.com <bglass@quansight.com>
2025-01-29 21:08:45 +00:00
..
aoti_runtime cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
cuda cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
rocm [inductor] Add some typing to common.py (#145691) 2025-01-27 06:27:13 +00:00
xpu cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
__init__.py
aoti_hipify_utils.py remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916) 2024-12-27 23:25:37 +00:00
block_analysis.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
common.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
cpp_bmm_template.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
cpp_flex_attention_template.py [Inductor-CPU] Add profiling support for codegened flex attention kernels (#145894) 2025-01-29 20:54:46 +00:00
cpp_gemm_template.py [Inductor][CPU] Add auto-tuning support for da8w8 sym act sym wgt GEMM (#143187) 2025-01-22 02:27:53 +00:00
cpp_grouped_gemm_template.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
cpp_micro_gemm.py [Inductor][CPU] Add auto-tuning support for da8w8 sym act sym wgt GEMM (#143187) 2025-01-22 02:27:53 +00:00
cpp_prefix.h Remove is_reduced_floating_point from namespace std (#144502) 2025-01-10 03:24:10 +00:00
cpp_template_kernel.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
cpp_template.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
cpp_utils.py [inductor] Remove type ignores from scheduler.py (#145712) 2025-01-28 01:44:32 +00:00
cpp_wrapper_cpu_array_ref.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
cpp_wrapper_cpu.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
cpp_wrapper_gpu.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
cpp.py [inductor] Change type of get_backend_features to OrderedSet (#145692) 2025-01-28 01:44:32 +00:00
cpu_device_op_overrides.py remove allow-untyped-defs from _inductor/codegen/cpu_device_op_overrides.py (#143881) 2024-12-27 04:10:47 +00:00
cuda_combined_scheduling.py Parallelize epilogue/prologue benchmarking (#143408) 2025-01-28 18:18:24 +00:00
debug_utils.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00
halide.py [inductor] Change type of get_backend_features to OrderedSet (#145692) 2025-01-28 01:44:32 +00:00
memory_planning.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
mps_device_op_overrides.py [Inductor] Add MPS device op overrides (#143892) 2024-12-28 02:11:45 +00:00
mps.py [inductor] Add some typing to common.py (#145691) 2025-01-27 06:27:13 +00:00
multi_kernel.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
simd_kernel_features.py [inductor] Kernel memory analysis for use in heuristics (#142026) 2025-01-25 04:58:54 +00:00
simd.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
triton_combo_kernel.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
triton_split_scan.py PEP585 update - torch/_inductor/codegen (#145106) 2025-01-18 06:56:03 +00:00
triton_utils.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
triton.py [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) 2025-01-29 05:46:05 +00:00
wrapper.py cpp_wrapper: Move #includes to per-device header files (#145932) 2025-01-29 21:08:45 +00:00