pytorch/torch/_inductor/codegen
Bin Bao d21841d077 [AOTI] Create another wrapper class to handle ArrayRef (#136318)
Summary: Create another wrapper codegen class to handle ArrayRef for CPU. The goal is to simplify the regular cpp wrapper codegen logic and the generated cpp code.

Test Plan: CI

Differential Revision: D62961885

Pull Request resolved: https://github.com/pytorch/pytorch/pull/136318
Approved by: https://github.com/frank-wei
2024-09-23 15:10:27 +00:00
..
aoti_runtime [AOTI] Fix bfloat16 in CPU (#132150) 2024-08-01 22:26:30 +00:00
cuda [Inductor] Generalize cuda cpp wrapper as common triton based GPU cpp wrapper, will be reused by xpu in next PR. (#135312) 2024-09-11 23:59:54 +00:00
rocm Revert "Add Triton CPU as an Inductor backend (#133408)" 2024-09-16 18:33:33 +00:00
xpu [Inductor] Generalize device guard codegen for cpp_wrapper mode. (#134761) 2024-09-10 10:11:52 +00:00
__init__.py
aoti_hipify_utils.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
common.py [AOTI] Create another wrapper class to handle ArrayRef (#136318) 2024-09-23 15:10:27 +00:00
cpp_gemm_template.py [inductor][cpp][gemm] fix perf regression xcit_large_24_p8_224 (#134686) (#135438) 2024-09-09 05:16:02 +00:00
cpp_micro_gemm.py [AOTI][reland] Fix assert_function call in cpu autotune template (#135920) 2024-09-13 12:21:57 +00:00
cpp_prefix.h [AOTI][reland] Fix assert_function call in cpu autotune template (#135920) 2024-09-13 12:21:57 +00:00
cpp_template_kernel.py Revert "Add Triton CPU as an Inductor backend (#133408)" 2024-09-16 18:33:33 +00:00
cpp_template.py [AOTI][reland] Fix assert_function call in cpu autotune template (#135920) 2024-09-13 12:21:57 +00:00
cpp_utils.py [inductor] Move LoopBody to its own file (#135257) 2024-09-07 16:29:15 +00:00
cpp_wrapper_cpu_array_ref.py [AOTI] Create another wrapper class to handle ArrayRef (#136318) 2024-09-23 15:10:27 +00:00
cpp_wrapper_cpu.py [inductor] Make DtypeView work with cpp_wrapper without abi_compatible (#136233) 2024-09-20 17:30:35 +00:00
cpp_wrapper_gpu.py [aoti] Fix workspace generation for triton (#135552) 2024-09-22 04:51:37 +00:00
cpp.py [inductor] Relax the conditions for loop split (#135335) 2024-09-20 05:42:52 +00:00
cuda_combined_scheduling.py [BE]: Update mypy to 1.11.2 (#133816) 2024-09-16 19:44:11 +00:00
debug_utils.py [AOTI][Tooling] Filter out kernels based off lowercase names (#135395) 2024-09-20 21:56:08 +00:00
halide.py Revert "Add Triton CPU as an Inductor backend (#133408)" 2024-09-16 18:33:33 +00:00
memory_planning.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
multi_kernel.py remove fast_flush arguments (#135387) 2024-09-13 08:13:46 +00:00
simd.py [inductor] Split reduction loops when there is no shared reads (#134307) 2024-09-12 09:45:08 +00:00
triton_combo_kernel.py Revert "[inductor] More fixes on the keys of constants and signature dictionaries (#135406)" 2024-09-16 17:58:02 +00:00
triton_split_scan.py [aoti] Fix workspace generation for triton (#135552) 2024-09-22 04:51:37 +00:00
triton_utils.py Revert "[inductor] More fixes on the keys of constants and signature dictionaries (#135406)" 2024-09-16 17:58:02 +00:00
triton.py [aoti] Fix workspace generation for triton (#135552) 2024-09-22 04:51:37 +00:00
wrapper.py [aoti] Fix workspace generation for triton (#135552) 2024-09-22 04:51:37 +00:00