| aoti_runtime | [AOTI] Save data sizes to constants_info (#154534) | 2025-05-29 06:39:13 +00:00 |
| cuda | [Triton] [Inductor] Add a Blackwell specific Template for persistent matmul (#162916) | 2025-09-15 23:23:04 +00:00 |
| cutedsl | Updates to CuTe DSL template renderer (#161117) | 2025-08-27 23:01:31 +00:00 |
| mtia | [Re-land][Inductor] Support native Inductor as backend for MTIA (#159211) | 2025-07-29 17:03:24 +00:00 |
| rocm | [ROCm][Inductor][CK backend] Install rocm-composable-kernel python package on ROCm Linux CI docker images (#162288) | 2025-09-10 19:33:40 +00:00 |
| xpu | [Inductor] Update Intel Triton for PyTorch 2.9. (#161050) | 2025-08-25 17:18:19 +00:00 |
| __init__.py | | |
| aoti_hipify_utils.py | [BE][3/16] fix typos in torch/ (torch/_inductor/) (#156313) | 2025-06-23 02:57:12 +00:00 |
| block_analysis.py | [Inductor] Restrict block analysis to only match integer dims and strides (#149615) | 2025-06-24 22:43:12 +00:00 |
| common.py | [inductor][ez] add src_hash property for Templates (#161468) | 2025-09-12 21:10:45 +00:00 |
| cpp_bmm_template.py | [inductor][cpu] Move VNNI weight packing into AMX GEMM kernel for contiguous BMM weights (#146843) | 2025-02-21 21:46:00 +00:00 |
| cpp_flex_attention_template.py | [FlexAttn] Fix Paged Attention Accuracy via Upper Mask Mod and Prevent Invalid Memory Access (#160861) | 2025-08-30 04:50:23 +00:00 |
| cpp_gemm_template.py | [inductor][cpu] Fix double-offset issue in GEMM_TEMPLATE (#159233) | 2025-08-22 03:47:28 +00:00 |
| cpp_grouped_gemm_template.py | [BE][PYFMT] migrate PYFMT for torch._inductor to ruff format (#144550) | 2025-02-28 13:33:19 +00:00 |
| cpp_micro_gemm.py | [CPU][Inductor] Improve performance of A16W8 GEMM template (#161148) | 2025-08-31 09:56:29 +00:00 |
| cpp_template_kernel.py | [AOTInductor] ABI-Compatibility for RecordFunction. (#159842) | 2025-08-15 21:45:47 +00:00 |
| cpp_template.py | [AOTInductor] ABI-Compatibility for RecordFunction. (#159842) | 2025-08-15 21:45:47 +00:00 |
| cpp_utils.py | [doc]: Small typos (#162982) | 2025-09-16 17:42:19 +00:00 |
| cpp_wrapper_cpu_array_ref.py | [Inductor-FX] Support IndexPutFallback (#162863) | 2025-09-16 08:52:47 +00:00 |
| cpp_wrapper_cpu.py | [Inductor-FX] Support IndexPutFallback (#162863) | 2025-09-16 08:52:47 +00:00 |
| cpp_wrapper_gpu.py | [inductor][triton] support profile_scratch launcher arg (#159772) | 2025-08-08 14:27:38 +00:00 |
| cpp_wrapper_mps.py | [aoti][mps] Initialize mps kernels first (#159753) | 2025-08-06 07:54:29 +00:00 |
| cpp.py | Fix windows path escape characters (#162761) | 2025-09-17 23:39:39 +00:00 |
| cpu_device_op_overrides.py | [Inductor][CPP] Reuse the pre-existing kernel for the same kernels (#158404) | 2025-09-16 01:54:24 +00:00 |
| cuda_combined_scheduling.py | Add cutedsl template support to compile (#160108) | 2025-08-18 04:37:15 +00:00 |
| debug_utils.py | [Inductor] Refactor wrapper codegen to use Wrapper IR. (#150458) | 2025-04-15 17:28:36 +00:00 |
| halide.py | [Inductor] Add DeviceAssert op to enable device-side assertion in torch.compile (#160677) | 2025-08-28 18:57:34 +00:00 |
| memory_planning.py | Fix unbacked symint and memory leak in inductor memory planning (#159839) | 2025-08-11 17:16:15 +00:00 |
| mps_device_op_overrides.py | [aoti] Initial Metal support (#153959) | 2025-05-23 05:45:35 +00:00 |
| mps.py | [MPS] Add igamma/igammac ops (#161927) | 2025-09-02 20:52:02 +00:00 |
| multi_kernel.py | multi-kernel matmuls based on varying hint sizes (#156628) | 2025-07-12 15:08:21 +00:00 |
| python_wrapper_mtia.py | [Re-land][Inductor] Support native Inductor as backend for MTIA (#159211) | 2025-07-29 17:03:24 +00:00 |
| segmented_tree.py | [inductor] dont reuse buffers if it affects peak (#145883) (#159530) | 2025-08-19 19:02:56 +00:00 |
| simd_kernel_features.py | Replace runtime type parameterization (#155221) | 2025-06-05 21:43:54 +00:00 |
| simd.py | [inductor] Enable combo kernels with unbacked inputs (#162442) | 2025-09-10 20:49:38 +00:00 |
| subgraph.py | [inductor][mm] restructure decompose k (#161026) | 2025-08-28 20:14:41 +00:00 |
| triton_combo_kernel.py | [Inductor] Fix ComboKernels failing due to missing helper functions (#162759) | 2025-09-12 20:01:06 +00:00 |
| triton_split_scan.py | [inductor] propagate shapes in CSEVariable (#152198) | 2025-08-19 16:46:38 +00:00 |
| triton_utils.py | [Inductor] Fix a user-defined Triton kernel bool param codegen issue (#158845) | 2025-07-24 00:19:27 +00:00 |
| triton.py | [inductor] fix expand_shape when copy_shape is not a string (#162739) | 2025-09-15 23:22:07 +00:00 |
| wrapper_fxir.py | [AOTI-FX] Solve for undefined symbols in dynamic input shapes (#163044) | 2025-09-17 04:12:03 +00:00 |
| wrapper.py | [Graph Partition] allow sharing default device context (#162873) | 2025-09-16 19:36:42 +00:00 |