pytorch/torch/_inductor/codegen
Xu Han bafd68b4fc [inductor] fix windows python module ext and func export declaration (#129059)
I have run the first inductor case on Windows, based on the exploration code: https://github.com/pytorch/pytorch/pull/128330
Because a fundamental PR still needs to pass `fb_code` (https://github.com/pytorch/pytorch/pull/128303), this PR lands part of the exploration code:
1. Fix the Windows Python module extension type: `.pyd`.
2. Add a function export declaration for Windows.
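
The two changes above can be illustrated with a minimal sketch. This is not the code from `cpp.py`; the helper names `module_ext`, `export_decl`, and `kernel_signature` are hypothetical, and the non-Windows visibility attribute is an assumption chosen only for illustration.

```python
import sys


def module_ext(platform: str = sys.platform) -> str:
    """Return the file extension for a compiled Python extension module.

    Hypothetical helper: Windows loads extension modules from .pyd files,
    while this sketch assumes .so everywhere else.
    """
    return ".pyd" if platform == "win32" else ".so"


def export_decl(platform: str = sys.platform) -> str:
    """Return the C/C++ declaration that makes a function visible to loaders.

    Hypothetical helper: MSVC needs __declspec(dllexport) for a symbol to be
    exported from a DLL. The non-Windows branch is an assumption.
    """
    if platform == "win32":
        return "__declspec(dllexport)"
    return '__attribute__((visibility("default")))'


def kernel_signature(name: str, platform: str = sys.platform) -> str:
    """Build an exported, C-linkage signature for a generated kernel."""
    return f'extern "C" {export_decl(platform)} void {name}()'


if __name__ == "__main__":
    print(module_ext("win32"))
    print(kernel_signature("kernel", "win32"))
```

Taking the platform as a parameter, rather than reading `sys.platform` inside each function, keeps the Windows branch testable from any host.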

Pull Request resolved: https://github.com/pytorch/pytorch/pull/129059
Approved by: https://github.com/jgong5, https://github.com/jansel
2024-06-19 17:51:32 +00:00
aoti_runtime [Inductor] Enable ABI-compatible mode for cpp-wrapper JIT (#121309) 2024-03-07 14:22:06 +00:00
cuda [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00
xpu Flip default value for mypy disallow_untyped_defs [2/11] (#127839) 2024-06-08 18:23:08 +00:00
__init__.py
aoti_hipify_utils.py Flip default value for mypy disallow_untyped_defs [2/11] (#127839) 2024-06-08 18:23:08 +00:00
codegen_device_driver.py [5/x][AMD][Lowering Enablement] Hipifying aoti code_wrapper (#124241) 2024-04-19 18:57:38 +00:00
common.py Prevent expansion of cat indexing to avoid int64 intermediate (#127815) 2024-06-14 15:42:08 +00:00
cpp_gemm_template.py [inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545) 2024-06-13 09:46:22 +00:00
cpp_micro_gemm.py [RELAND][inductor][cpp] bf16/fp16 gemm template computed with fp32 (#128472) 2024-06-12 08:37:16 +00:00
cpp_prefix.h Revert "Set simdlen based on ATEN_CPU_CAPABILITY (#123514)" 2024-06-10 22:46:01 +00:00
cpp_template_kernel.py [inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545) 2024-06-13 09:46:22 +00:00
cpp_template.py [inductor][cpp] support bf16/fp16 gemm template epilogue fusion (#126545) 2024-06-13 09:46:22 +00:00
cpp_utils.py [Inductor][CPP] Add Min/Max with VecMask (#126841) 2024-06-18 00:20:32 +00:00
cpp_wrapper_cpu.py [AOTI] Fix a minimal_arrayref_interface test failure (#128613) 2024-06-13 18:25:04 +00:00
cpp_wrapper_cuda.py [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00
cpp.py [inductor] fix windows python module ext and func export declaration (#129059) 2024-06-19 17:51:32 +00:00
cuda_combined_scheduling.py [inductor] Add BackendFeature gating (#128266) 2024-06-13 07:31:51 +00:00
memory_planning.py Flip default value for mypy disallow_untyped_defs [2/11] (#127839) 2024-06-08 18:23:08 +00:00
multi_kernel.py [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00
simd.py [inductor] Improve superfluous mask handling in triton codegen (#128518) 2024-06-14 17:52:55 +00:00
triton_foreach.py [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00
triton_split_scan.py Flip default value for mypy disallow_untyped_defs [2/11] (#127839) 2024-06-08 18:23:08 +00:00
triton_utils.py Flip default value for mypy disallow_untyped_defs [2/11] (#127839) 2024-06-08 18:23:08 +00:00
triton.py [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00
wrapper.py [inductor][refactor] Unify the use of generate_kernel_call (#128467) 2024-06-19 07:47:25 +00:00