pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

History

bglass@quansight.com 40ccb7a86d cpp_wrapper: Move #includes to per-device header files (#145932 ) Summary: This prepares us for the next PR in the stack, where we introduce pre-compiled per-device header files to save compilation time. Reland https://github.com/pytorch/pytorch/pull/143909 after merge conflicts. Co-authored-by: Benjamin Glass <[bglass@quansight.com](mailto:bglass@quansight.com)> Differential Revision: D68656960 Pulled By: benjaminglass1 Pull Request resolved: https://github.com/pytorch/pytorch/pull/145932 Approved by: https://github.com/yushangdi, https://github.com/benjaminglass1 Co-authored-by: bglass@quansight.com <bglass@quansight.com>		2025-01-29 21:08:45 +00:00
..
aoti_runtime	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cuda	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
rocm	[inductor] Add some typing to common.py (#145691 )	2025-01-27 06:27:13 +00:00
xpu	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
__init__.py
aoti_hipify_utils.py	remove allow-untyped-defs from _inductor/codegen/aoti_hipify_utils.py (#143916 )	2024-12-27 23:25:37 +00:00
block_analysis.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
common.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cpp_bmm_template.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
cpp_flex_attention_template.py	[Inductor-CPU] Add profiling support for codegened flex attention kernels (#145894 )	2025-01-29 20:54:46 +00:00
cpp_gemm_template.py	[Inductor][CPU] Add auto-tuning support for da8w8 sym act sym wgt GEMM (#143187 )	2025-01-22 02:27:53 +00:00
cpp_grouped_gemm_template.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
cpp_micro_gemm.py	[Inductor][CPU] Add auto-tuning support for da8w8 sym act sym wgt GEMM (#143187 )	2025-01-22 02:27:53 +00:00
cpp_prefix.h	Remove is_reduced_floating_point from namespace std (#144502 )	2025-01-10 03:24:10 +00:00
cpp_template_kernel.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
cpp_template.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
cpp_utils.py	[inductor] Remove type ignores from scheduler.py (#145712 )	2025-01-28 01:44:32 +00:00
cpp_wrapper_cpu_array_ref.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cpp_wrapper_cpu.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cpp_wrapper_gpu.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
cpp.py	[inductor] Change type of get_backend_features to OrderedSet (#145692 )	2025-01-28 01:44:32 +00:00
cpu_device_op_overrides.py	remove allow-untyped-defs from _inductor/codegen/cpu_device_op_overrides.py (#143881 )	2024-12-27 04:10:47 +00:00
cuda_combined_scheduling.py	Parallelize epilogue/prologue benchmarking (#143408 )	2025-01-28 18:18:24 +00:00
debug_utils.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00
halide.py	[inductor] Change type of get_backend_features to OrderedSet (#145692 )	2025-01-28 01:44:32 +00:00
memory_planning.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
mps_device_op_overrides.py	[Inductor] Add MPS device op overrides (#143892 )	2024-12-28 02:11:45 +00:00
mps.py	[inductor] Add some typing to common.py (#145691 )	2025-01-27 06:27:13 +00:00
multi_kernel.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
simd_kernel_features.py	[inductor] Kernel memory analysis for use in heuristics (#142026 )	2025-01-25 04:58:54 +00:00
simd.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
triton_combo_kernel.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
triton_split_scan.py	PEP585 update - torch/_inductor/codegen (#145106 )	2025-01-18 06:56:03 +00:00
triton_utils.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
triton.py	[inductor][4/N] triton support post-#5512, fix constexpr signatures (#145583 )	2025-01-29 05:46:05 +00:00
wrapper.py	cpp_wrapper: Move #includes to per-device header files (#145932 )	2025-01-29 21:08:45 +00:00