| Name | Last commit | Date |
|---|---|---|
| aoti_runtime | cpp_wrapper: Move #includes to per-device header files (#145932) | 2025-01-29 21:08:45 +00:00 |
| cuda | [cutlass backend] Do not change dtype of GEMM template (#146877) | 2025-02-13 18:36:16 +00:00 |
| rocm | [inductor] Add typing to common.CSE (#145993) | 2025-02-04 16:05:39 +00:00 |
| xpu | [inductor] Add types to DeviceOpOverrides (#145913) | 2025-02-01 16:33:49 +00:00 |
| __init__.py | | |
| aoti_hipify_utils.py | remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916) | 2024-12-27 23:25:37 +00:00 |
| block_analysis.py | [Inductor] Expand Identity ops prior to block pattern matching (#146000) | 2025-02-08 18:11:53 +00:00 |
| common.py | try print stacktrace for error (#147061) | 2025-02-14 18:28:03 +00:00 |
| cpp_bmm_template.py | [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) | 2025-01-29 05:46:05 +00:00 |
| cpp_flex_attention_template.py | [inductor] [cpp] Support vectorization for score and mask in FlexAttention CPU (#143638) | 2025-02-14 05:26:18 +00:00 |
| cpp_gemm_template.py | [Inductor][CPP] Fix node name for wgt delete (#147056) | 2025-02-14 06:27:41 +00:00 |
| cpp_grouped_gemm_template.py | [inductor] Finish typing common.py (#146225) | 2025-02-04 23:35:33 +00:00 |
| cpp_micro_gemm.py | [CPUInductor] Fix SVE256 detection (#146207) | 2025-02-01 18:51:34 +00:00 |
| cpp_prefix.h | Add torch._scaled_mm for CPU (#139975) | 2025-02-14 02:03:53 +00:00 |
| cpp_template_kernel.py | [inductor] [cpp] Support vectorization for score and mask in FlexAttention CPU (#143638) | 2025-02-14 05:26:18 +00:00 |
| cpp_template.py | Fix assertion failure in gemm template lowering (#146353) | 2025-02-08 01:52:20 +00:00 |
| cpp_utils.py | cpp_wrapper: enable all CPU repro tests (#145655) | 2025-02-04 22:05:59 +00:00 |
| cpp_wrapper_cpu_array_ref.py | cpp_wrapper: handle mixed-device C-shim fallbacks (#146449) | 2025-02-12 23:21:04 +00:00 |
| cpp_wrapper_cpu.py | cpp_wrapper: handle mixed-device C-shim fallbacks (#146449) | 2025-02-12 23:21:04 +00:00 |
| cpp_wrapper_gpu.py | cpp_wrapper: Move #includes to per-device header files (#145932) | 2025-01-29 21:08:45 +00:00 |
| cpp.py | [Inductor] Unifiy Low Precision FP Legalization for to_dtype_bitcast & constant (#144646) | 2025-02-11 19:45:04 +00:00 |
| cpu_device_op_overrides.py | [inductor] Add types to DeviceOpOverrides (#145913) | 2025-02-01 16:33:49 +00:00 |
| cuda_combined_scheduling.py | [BE] Type annotate wrapper_benchmark.py and cuda_combined_scheduling.py (#145542) | 2025-01-30 03:53:52 +00:00 |
| debug_utils.py | fix intermediate debug information with cpp_wrapper (#145527) | 2025-02-10 22:24:26 +00:00 |
| halide.py | [inductor] Refactor op handlers part 5 (#146257) | 2025-02-08 18:00:30 +00:00 |
| memory_planning.py | PEP585 update - torch/_inductor/codegen (#145106) | 2025-01-18 06:56:03 +00:00 |
| mps_device_op_overrides.py | [inductor] Add types to DeviceOpOverrides (#145913) | 2025-02-01 16:33:49 +00:00 |
| mps.py | [inductor] Refactor op handlers part 5 (#146257) | 2025-02-08 18:00:30 +00:00 |
| multi_kernel.py | [inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583) | 2025-01-29 05:46:05 +00:00 |
| simd_kernel_features.py | [inductor] Kernel memory analysis for use in heuristics (#142026) | 2025-01-25 04:58:54 +00:00 |
| simd.py | [inductor] Add typing to common.CSE (#145993) | 2025-02-04 16:05:39 +00:00 |
| triton_combo_kernel.py | [inductor] Add typing to common.KernelArgs (#145916) | 2025-02-04 16:05:39 +00:00 |
| triton_split_scan.py | [inductor] Remove _get_grid_fn_str (#146800) | 2025-02-10 23:14:30 +00:00 |
| triton_utils.py | [inductor][5/N] triton support post-#5512, fix 1 and None handling (#145515) | 2025-02-01 02:11:48 +00:00 |
| triton.py | Only call triton in worker process, kick off worker processes earlier, during inductor codegen (#146417) | 2025-02-11 03:46:16 +00:00 |
| wrapper.py | cpp_wrapper: handle mixed-device C-shim fallbacks (#146449) | 2025-02-12 23:21:04 +00:00 |