pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Xu Han bafd68b4fc [inductor] fix windows python module ext and func export declaration (#129059 ) I have run the first inductor case on Windows base on the exploration code: https://github.com/pytorch/pytorch/pull/128330 Due to some fundamental PR still need pass `fb_code`: https://github.com/pytorch/pytorch/pull/128303 This PR would land some part of exploration code: 1. Fix Windows python module ext type: pyd. 2. Add function export declaration for Windows. Pull Request resolved: https://github.com/pytorch/pytorch/pull/129059 Approved by: https://github.com/jgong5, https://github.com/jansel		2024-06-19 17:51:32 +00:00
..
aoti_runtime	[Inductor] Enable ABI-compatible mode for cpp-wrapper JIT (#121309 )	2024-03-07 14:22:06 +00:00
cuda	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00
xpu	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 )	2024-06-08 18:23:08 +00:00
__init__.py
aoti_hipify_utils.py	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 )	2024-06-08 18:23:08 +00:00
codegen_device_driver.py	[5/x][AMD][Lowering Enablement] Hipifying aoti code_wrapper (#124241 )	2024-04-19 18:57:38 +00:00
common.py	Prevent expansion of cat indexing to avoid int64 intermediate (#127815 )	2024-06-14 15:42:08 +00:00
cpp_gemm_template.py	[inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545 )	2024-06-13 09:46:22 +00:00
cpp_micro_gemm.py	[RELAND][inductor][cpp] bf16/fp16 gemm template computed with fp32 (#128472 )	2024-06-12 08:37:16 +00:00
cpp_prefix.h	Revert "Set simdlen based on ATEN_CPU_CAPABILITY (#123514 )"	2024-06-10 22:46:01 +00:00
cpp_template_kernel.py	[inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545 )	2024-06-13 09:46:22 +00:00
cpp_template.py	[inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545 )	2024-06-13 09:46:22 +00:00
cpp_utils.py	[Inductor][CPP] Add Min/Max with VecMask (#126841 )	2024-06-18 00:20:32 +00:00
cpp_wrapper_cpu.py	[AOTI] Fix a minimal_arrayref_interface test failure (#128613 )	2024-06-13 18:25:04 +00:00
cpp_wrapper_cuda.py	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00
cpp.py	[inductor] fix windows python module ext and func export declaration (#129059 )	2024-06-19 17:51:32 +00:00
cuda_combined_scheduling.py	[inductor] Add BackendFeature gating (#128266 )	2024-06-13 07:31:51 +00:00
memory_planning.py	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 )	2024-06-08 18:23:08 +00:00
multi_kernel.py	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00
simd.py	[inductor] Improve superfluous mask handling in triton codegen (#128518 )	2024-06-14 17:52:55 +00:00
triton_foreach.py	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00
triton_split_scan.py	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 )	2024-06-08 18:23:08 +00:00
triton_utils.py	Flip default value for mypy disallow_untyped_defs [2/11] (#127839 )	2024-06-08 18:23:08 +00:00
triton.py	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00
wrapper.py	[inductor][refactor] Unify the use of generate_kernel_call (#128467 )	2024-06-19 07:47:25 +00:00