pytorch/torch/_inductor/codegen
2024-02-16 08:10:51 +00:00
..
aoti_runtime Add Runtime Constant-Folding function of AOTInductor for AOTInductorModels used internally. (#119823) 2024-02-16 06:45:48 +00:00
cuda [Inductor max autotune] Multithreaded Precompilation (#119386) 2024-02-09 16:11:30 +00:00
__init__.py
common.py Check alignment of ReinterpretView args of custom Triton kernels (#119649) 2024-02-11 20:21:17 +00:00
cpp_prefix.h [Inductor] Add Int8 data type into Inductor CPP backend vectorized code generation (#119179) 2024-02-09 07:33:12 +00:00
cpp_wrapper_cuda.py [aot_inductor] move CudaWrapperCodeGen into a separate file (#119870) 2024-02-16 08:10:51 +00:00
cpp.py Add lowerings to special functions (#119187) 2024-02-11 16:35:40 +00:00
cuda_combined_scheduling.py [mypy] added type annotations to codegen_nodes methods (#119080) 2024-02-05 18:33:52 +00:00
memory_planning.py Remove follow_imports = skip from sympy (#118469) 2024-01-28 13:38:38 +00:00
multi_kernel.py [inductor] make multi-kernel work with cpp-wrapper (#117813) 2024-02-05 23:35:41 +00:00
triton_foreach.py [inductor] make inductor work with new triton compile interface (#115878) 2023-12-22 00:09:29 +00:00
triton_split_scan.py [inductor] Add split scan kernel (#117992) 2024-02-09 01:56:00 +00:00
triton_utils.py Check alignment of ReinterpretView args of custom Triton kernels (#119649) 2024-02-11 20:21:17 +00:00
triton.py Check alignment of ReinterpretView args of custom Triton kernels (#119649) 2024-02-11 20:21:17 +00:00
wrapper.py [aot_inductor] move CudaWrapperCodeGen into a separate file (#119870) 2024-02-16 08:10:51 +00:00