pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Adam Mainz d0640b945b [inductor][nit] removing unnecessary else statements (#138789 ) Summary: while reading through inductor template code I found a few places where else statements were driving me crazy. Fixing them as I read Test Plan: CI Differential Revision: D64882385 Pull Request resolved: https://github.com/pytorch/pytorch/pull/138789 Approved by: https://github.com/aakhundov		2024-10-25 17:59:25 +00:00
..
aoti_runtime	[aoti][inplace] Support skipping model buffers (#136770 )	2024-09-30 18:28:42 +00:00
cuda	[inductor] Move V.graph.scheduler.current_device to V.graph.current_device (#138252 )	2024-10-18 23:05:54 +00:00
rocm	[Inductor][ROCm][CK] add CK grouped conv2d fwd kernels to ROCm codegen (#137947 )	2024-10-22 18:25:23 +00:00
xpu	[Inductor] Generalize device guard codegen for cpp_wrapper mode. (#134761 )	2024-09-10 10:11:52 +00:00
__init__.py
aoti_hipify_utils.py
common.py	[inductor][nit] removing unnecessary else statements (#138789 )	2024-10-25 17:59:25 +00:00
cpp_gemm_template.py	Port Inductor dataclasses to be kw_only (#137768 )	2024-10-14 10:33:43 +00:00
cpp_micro_gemm.py	[Inductor][CPP] Optimize WOQ INT8 wgt dequant in AMX GEMM template (#136630 )	2024-09-26 08:41:58 +00:00
cpp_prefix.h	[PyTorch] Fix out-of-bounds array access in atomic_add_vec (#138744 )	2024-10-24 19:37:12 +00:00
cpp_template_kernel.py	Refactored debug_extra to be on ChoiceCaller (and called description) (#137857 )	2024-10-15 00:48:14 +00:00
cpp_template.py	[AOTI] Remove the non-ABI-compatible mode (part 1) (#138009 )	2024-10-17 02:48:26 +00:00
cpp_utils.py	Moved some of Inductor IR nodes to be frozen (#137859 )	2024-10-17 18:04:45 +00:00
cpp_wrapper_cpu_array_ref.py	[AOTI][refactor] Move use_minimal_arrayref_interface logic (#138250 )	2024-10-23 01:00:34 +00:00
cpp_wrapper_cpu.py	[aoti] fix typo in codegen_dynamic_scalar (#138760 )	2024-10-24 23:16:30 +00:00
cpp_wrapper_gpu.py	[AOTI] Fix a special case compile time data type codegen for sym int variables (#138106 )	2024-10-19 02:30:53 +00:00
cpp.py	[PyTorch] Fix inductor CPU masked() body codegen when result dtype is bool and operator is where (#138486 )	2024-10-24 19:36:41 +00:00
cpu_device_op_overrides.py	Add Triton CPU as an Inductor backend (#133408 )	2024-09-30 20:24:52 +00:00
cuda_combined_scheduling.py	[BE]: Update mypy to 1.11.2 (#133816 )	2024-09-16 19:44:11 +00:00
debug_utils.py	[AOTI] Remove the non-ABI-compatible mode (part 1) (#138009 )	2024-10-17 02:48:26 +00:00
halide.py	[inductor] Move V.graph.scheduler.current_device to V.graph.current_device (#138252 )	2024-10-18 23:05:54 +00:00
memory_planning.py	[inductor] Generalize WorkspaceArg for graph-level semaphores (#138170 )	2024-10-18 23:05:54 +00:00
multi_kernel.py	remove fast_flush arguments (#135387 )	2024-09-13 08:13:46 +00:00
simd.py	Remove an unused variable in _inductor/codegen/simd.py (#138000 )	2024-10-16 13:54:21 +00:00
triton_combo_kernel.py	[inductor] Move V.graph.scheduler.current_device to V.graph.current_device (#138252 )	2024-10-18 23:05:54 +00:00
triton_split_scan.py	[inductor] Delete dead code and lints (#137753 )	2024-10-11 18:55:08 +00:00
triton_utils.py	[inductor] Move V.graph.scheduler.current_device to V.graph.current_device (#138252 )	2024-10-18 23:05:54 +00:00
triton.py	[inductor] Move V.graph.scheduler.current_device to V.graph.current_device (#138252 )	2024-10-18 23:05:54 +00:00
wrapper.py	[user triton] fix codegen for tl.constexpr globals (#138757 )	2024-10-25 03:00:42 +00:00