pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

ghostspiders 26eefd5ae2 Fix windows path escape characters (#162761 ) Fixes #135954 Torch Inductor Windows Path Escape Characters Pull Request resolved: https://github.com/pytorch/pytorch/pull/162761 Approved by: https://github.com/jansel, https://github.com/mlazos		2025-09-17 23:39:39 +00:00
..
aoti_runtime	[AOTI] Save data sizes to constants_info (#154534 )	2025-05-29 06:39:13 +00:00
cuda	[Triton] [Inductor] Add a Blackwell specific Template for persistent matmul (#162916 )	2025-09-15 23:23:04 +00:00
cutedsl	Updates to CuTe DSL template renderer (#161117 )	2025-08-27 23:01:31 +00:00
mtia	[Re-land][Inductor] Support native Inductor as backend for MTIA (#159211 )	2025-07-29 17:03:24 +00:00
rocm	[ROCm][Inductor][CK backend] Install rocm-composable-kernel python package on ROCm Linux CI docker images (#162288 )	2025-09-10 19:33:40 +00:00
xpu	[Inductor] Update Intel Triton for PyTorch 2.9. (#161050 )	2025-08-25 17:18:19 +00:00
__init__.py
aoti_hipify_utils.py	[BE][3/16] fix typos in torch/ (torch/_inductor/) (#156313 )	2025-06-23 02:57:12 +00:00
block_analysis.py	[Inductor] Restrict block analysis to only match integer dims and strides (#149615 )	2025-06-24 22:43:12 +00:00
common.py	[inductor][ez] add src_hash property for Templates (#161468 )	2025-09-12 21:10:45 +00:00
cpp_bmm_template.py	[inductor][cpu] Move VNNI weight packing into AMX GEMM kernel for contiguous BMM weights (#146843 )	2025-02-21 21:46:00 +00:00
cpp_flex_attention_template.py	[FlexAttn] Fix Paged Attention Accuracy via Upper Mask Mod and Prevent Invalid Memory Access (#160861 )	2025-08-30 04:50:23 +00:00
cpp_gemm_template.py	[inductor][cpu] Fix double-offset issue in `GEMM_TEMPLATE` (#159233 )	2025-08-22 03:47:28 +00:00
cpp_grouped_gemm_template.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550 )	2025-02-28 13:33:19 +00:00
cpp_micro_gemm.py	[CPU][Inductor] Improve performance of A16W8 GEMM template (#161148 )	2025-08-31 09:56:29 +00:00
cpp_template_kernel.py	[AOTInductor] ABI-Compatibility for RecordFunction. (#159842 )	2025-08-15 21:45:47 +00:00
cpp_template.py	[AOTInductor] ABI-Compatibility for RecordFunction. (#159842 )	2025-08-15 21:45:47 +00:00
cpp_utils.py	[doc]: Small typos (#162982 )	2025-09-16 17:42:19 +00:00
cpp_wrapper_cpu_array_ref.py	[Inductor-FX] Support IndexPutFallback (#162863 )	2025-09-16 08:52:47 +00:00
cpp_wrapper_cpu.py	[Inductor-FX] Support IndexPutFallback (#162863 )	2025-09-16 08:52:47 +00:00
cpp_wrapper_gpu.py	[inductor][triton] support profile_scratch launcher arg (#159772 )	2025-08-08 14:27:38 +00:00
cpp_wrapper_mps.py	[aoti][mps] Initialize mps kernels first (#159753 )	2025-08-06 07:54:29 +00:00
cpp.py	Fix windows path escape characters (#162761 )	2025-09-17 23:39:39 +00:00
cpu_device_op_overrides.py	[Inductor][CPP] Reuse the pre-existing kernel for the same kernels (#158404 )	2025-09-16 01:54:24 +00:00
cuda_combined_scheduling.py	Add cutedsl template support to compile (#160108 )	2025-08-18 04:37:15 +00:00
debug_utils.py	[Inductor] Refactor wrapper codegen to use Wrapper IR. (#150458 )	2025-04-15 17:28:36 +00:00
halide.py	[Inductor] Add DeviceAssert op to enable device-side assertion in torch.compile (#160677 )	2025-08-28 18:57:34 +00:00
memory_planning.py	Fix unbacked symint and memory leak in inductor memory planning (#159839 )	2025-08-11 17:16:15 +00:00
mps_device_op_overrides.py	[aoti] Initial Metal support (#153959 )	2025-05-23 05:45:35 +00:00
mps.py	[MPS] Add `igamma/igammac` ops (#161927 )	2025-09-02 20:52:02 +00:00
multi_kernel.py	multi-kernel matmuls based on varying hint sizes (#156628 )	2025-07-12 15:08:21 +00:00
python_wrapper_mtia.py	[Re-land][Inductor] Support native Inductor as backend for MTIA (#159211 )	2025-07-29 17:03:24 +00:00
segmented_tree.py	[inductor] dont reuse buffers if it affects peak (#145883 ) (#159530 )	2025-08-19 19:02:56 +00:00
simd_kernel_features.py	Replace runtime type parameterization (#155221 )	2025-06-05 21:43:54 +00:00
simd.py	[inductor] Enable combo kernels with unbacked inputs (#162442 )	2025-09-10 20:49:38 +00:00
subgraph.py	[inductor][mm] restructure decompose k (#161026 )	2025-08-28 20:14:41 +00:00
triton_combo_kernel.py	[Inductor] Fix ComboKernels failing due to missing helper functions (#162759 )	2025-09-12 20:01:06 +00:00
triton_split_scan.py	[inductor] propagate shapes in CSEVariable (#152198 )	2025-08-19 16:46:38 +00:00
triton_utils.py	[Inductor] Fix a user-defined Triton kernel bool param codegen issue (#158845 )	2025-07-24 00:19:27 +00:00
triton.py	[inductor] fix expand_shape when copy_shape is not a string (#162739 )	2025-09-15 23:22:07 +00:00
wrapper_fxir.py	[AOTI-FX] Solve for undefined symbols in dynamic input shapes (#163044 )	2025-09-17 04:12:03 +00:00
wrapper.py	[Graph Partition] allow sharing default device context (#162873 )	2025-09-16 19:36:42 +00:00