pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Isuru Fernando 5f62d07ec6 Fix log2, PowByNatural printing (#147592) Pull Request resolved: https://github.com/pytorch/pytorch/pull/147592 Approved by: https://github.com/eellison		2025-04-02 14:12:15 +00:00
aoti_runtime	[AOTInductor] Add function for users to extract constants in container (#150163)	2025-03-29 03:36:12 +00:00
cuda	[Inductor Cutlass backend] Fix imports and compilation of Cutlass SM100 Kernels (#149515)	2025-03-20 20:35:18 +00:00
rocm	Reland: [inductor] Simplify grid handling (#148305)	2025-03-12 15:52:16 +00:00
xpu	Reland: [inductor] Simplify grid handling (#148305)	2025-03-12 15:52:16 +00:00
__init__.py
aoti_hipify_utils.py	remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916)	2024-12-27 23:25:37 +00:00
block_analysis.py	[inductor][triton] Block ptr analysis fix assert on matched index expression (#148446)	2025-03-10 05:26:55 +00:00
common.py	[Inductor] Fix `torch.polygamma()` when n == 1 (#147453)	2025-03-31 05:27:46 +00:00
cpp_bmm_template.py	[inductor][cpu] Move VNNI weight packing into AMX GEMM kernel for contiguous BMM weights (#146843)	2025-02-21 21:46:00 +00:00
cpp_flex_attention_template.py	[Inductor][CPP] Avoid transpose with cpp micro-gemm for FlexAttention (#147069)	2025-03-03 15:22:11 +00:00
cpp_gemm_template.py	Iterate over dense dim first in split reduction reindexing (#147229)	2025-03-18 17:35:21 +00:00
cpp_grouped_gemm_template.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550)	2025-02-28 13:33:19 +00:00
cpp_micro_gemm.py	[Inductor-CPU] Fix int8 WoQ AMX micro-kernel when `block_n` is 16 or 48 (#149359)	2025-03-24 21:27:46 +00:00
cpp_prefix.h	[Inductor] Unify the data type propagation between Triton and CPP Backend (#146970)	2025-03-21 17:52:51 +00:00
cpp_template_kernel.py	Reland: [inductor] Simplify grid handling (#148305)	2025-03-12 15:52:16 +00:00
cpp_template.py	[Inductor] Unify the data type propagation between Triton and CPP Backend (#146970)	2025-03-21 17:52:51 +00:00
cpp_utils.py	[Inductor] Unify the data type propagation between Triton and CPP Backend (#146970)	2025-03-21 17:52:51 +00:00
cpp_wrapper_cpu_array_ref.py	[Inductor] Use real input to autotune user defined triton kernels (#149553)	2025-03-26 16:42:48 +00:00
cpp_wrapper_cpu.py	cpp_wrapper: precompile a few more commonly used headers, and improve RAIIPyObject interface (#149350)	2025-04-02 09:54:27 +00:00
cpp_wrapper_gpu.py	[AOTI] Emit Triton kernels as comment (#150188)	2025-04-02 12:41:54 +00:00
cpp.py	[Inductor] optimize the heuristics of parallel reduction (#149614)	2025-04-01 01:31:00 +00:00
cpu_device_op_overrides.py	[inductor] Add types to DeviceOpOverrides (#145913)	2025-02-01 16:33:49 +00:00
cuda_combined_scheduling.py	Revert "Use the device interface for detecting Triton availability (#139171)"	2025-03-11 18:49:21 +00:00
debug_utils.py	[AOTI][debug logger] small fix for intermediate value debugger for jit when arg is not tensor (#149007)	2025-03-14 19:40:41 +00:00
halide.py	Fix log2, PowByNatural printing (#147592)	2025-04-02 14:12:15 +00:00
memory_planning.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550)	2025-02-28 13:33:19 +00:00
mps_device_op_overrides.py	[inductor] Add types to DeviceOpOverrides (#145913)	2025-02-01 16:33:49 +00:00
mps.py	[MPSInductor] Add `store_reduce` method (#150457)	2025-04-02 05:12:49 +00:00
multi_kernel.py	Make sure to write to caches atomically (#149654)	2025-03-21 18:59:41 +00:00
simd_kernel_features.py	[BE][Ez]: Use itertools.chain.from_iterable when possible (#148190)	2025-03-06 20:37:06 +00:00
simd.py	[pt2_provenance_tracing] add combo kernel nodes post_grad nodes origin info (#149598)	2025-03-27 00:26:24 +00:00
triton_combo_kernel.py	Reland: [inductor] Simplify grid handling (#148305)	2025-03-12 15:52:16 +00:00
triton_split_scan.py	Reland: [inductor] Simplify grid handling (#148305)	2025-03-12 15:52:16 +00:00
triton_utils.py	[BE][PYFMT] migrate PYFMT for `torch._inductor` to `ruff format` (#144550)	2025-02-28 13:33:19 +00:00
triton.py	Fix log2, PowByNatural printing (#147592)	2025-04-02 14:12:15 +00:00
wrapper.py	[Graph Partition] Support symbol inputs (#149458)	2025-03-26 17:21:30 +00:00