codegen
Express y grid > 2^16 in terms of z grid ( #121554 )
2024-03-12 02:36:19 +00:00
fx_passes
Port remove_split_ops to PT2 pre-grad passes ( #121674 )
2024-03-12 01:15:19 +00:00
kernel
Revert "Add CUTLASS kernel as choice for _int_mm() Inductor autotuning ( #119685 )"
2024-03-09 07:20:53 +00:00
__init__.py
[export][reland] Disable exported_program.__call__ ( #120019 )
2024-03-05 05:29:46 +00:00
autotune_process.py
[inductor][Gemm] Autotune with matrix_instr_nonkdim for AMDGPU ( #120742 )
2024-02-28 20:27:14 +00:00
bounds.py
Some minor type stub improvements ( #118529 )
2024-02-04 00:19:00 +00:00
codecache.py
[inductor] Make some improvements to FX graph caching ( #117888 )
2024-03-08 02:30:49 +00:00
comm_analysis.py
[reland] Fix estimate_nccl_collective_runtime ( #118986 )
2024-02-12 18:48:06 +00:00
comms.py
[BE]: Apply RUF025 dict.fromkeys preview rule ( #118637 )
2024-01-30 20:46:54 +00:00
compile_fx.py
Switch cudagraph backend to cudagraph trees ( #121019 )
2024-03-08 22:56:26 +00:00
config.py
[Inductor] Enable ABI-compatible mode for cpp-wrapper JIT ( #121309 )
2024-03-07 14:22:06 +00:00
constant_folding.py
[AOTInductor] Add Runtime Constant-folding for AOTInductor ( #118765 )
2024-02-01 04:54:25 +00:00
coordinate_descent_tuner.py
Fix typo under torch/_inductor directory ( #110530 )
2023-10-05 02:17:20 +00:00
cudagraph_trees.py
improve cuda graph symint logging msg ( #119739 )
2024-02-13 06:26:36 +00:00
cudagraph_utils.py
Switch cudagraph backend to cudagraph trees ( #121019 )
2024-03-08 22:56:26 +00:00
debug.py
[inductor max autotune] Detailed autotuning result logs ( machine-readable ) ( #119004 )
2024-02-29 18:24:13 +00:00
decomposition.py
Revert "Batch Norm Consolidation ( #116092 )"
2024-03-11 22:22:41 +00:00
dependencies.py
add decomposition for frexp ( #119217 )
2024-02-23 21:52:42 +00:00
exc.py
[AOTInductor] Improve validation for C++ wrapper codegen ( #111102 )
2023-10-13 08:46:17 +00:00
freezing.py
FF inductor failure ( #114980 )
2023-12-04 18:26:34 +00:00
fx_utils.py
Unify MYPYINDUCTOR and MYPY ( #118432 )
2024-01-27 17:23:20 +00:00
graph.py
[AOTInductor] Reuse generated kernels between constant graph and main graph ( #121564 )
2024-03-11 22:44:38 +00:00
hooks.py
Enable local_partial_types ( #118467 )
2024-01-28 13:38:22 +00:00
index_propagation.py
Correct index propagation for % ( #119864 )
2024-02-16 19:14:39 +00:00
inductor_prims.py
[BE]: Remove useless lambdas ( #113602 )
2023-11-14 20:06:48 +00:00
ir.py
[easy] Refactor MultiOutput.codegen_list_tuple_access to use subclass type checks ( #121662 )
2024-03-12 19:27:56 +00:00
lowering.py
[Quant][Inductor] Fix q/dq per channel lowering with 64-bit qparams ( #120984 )
2024-03-07 06:23:52 +00:00
metrics.py
[inductor] Make some improvements to FX graph caching ( #117888 )
2024-03-08 02:30:49 +00:00
ops_handler.py
add decomposition for frexp ( #119217 )
2024-02-23 21:52:42 +00:00
optimize_indexing.py
Some minor type stub improvements ( #118529 )
2024-02-04 00:19:00 +00:00
pattern_matcher.py
[PT2][Optimus][Observability] Log the optimus graph transformation to the scuba ( #119745 )
2024-02-16 21:32:04 +00:00
quantized_lowerings.py
[Quant][Inductor] Enable the lowering of quantized maxpool2d ( #105906 )
2023-08-26 08:36:47 +00:00
scheduler.py
Benchmark templates ( #118880 )
2024-03-11 23:55:13 +00:00
script.ld
Place .lrodata later in the binary ( #117575 )
2024-01-18 17:58:18 +00:00
select_algorithm.py
[inductor] Refactor common triton imports into one function ( #121438 )
2024-03-09 18:17:36 +00:00
sizevars.py
Remove follow_imports = skip from sympy ( #118469 )
2024-01-28 13:38:38 +00:00
test_case.py
Refactor some testing helpers for FX graph cache testing ( #121520 )
2024-03-11 21:46:27 +00:00
test_operators.py
Grandfather in some more pytorch ops to be pt2_compliant ( #113050 )
2023-11-09 02:35:33 +00:00
triton_helpers.py
[inductor] Changes to support newer triton pin ( #121267 )
2024-03-09 18:17:36 +00:00
triton_heuristics.py
Express y grid > 2^16 in terms of z grid ( #121554 )
2024-03-12 02:36:19 +00:00
utils.py
Express y grid > 2^16 in terms of z grid ( #121554 )
2024-03-12 02:36:19 +00:00
virtualized.py
[EASY] Move OpsHandler and MockHandler to their own file ( #119851 )
2024-02-15 18:54:41 +00:00
wrapper_benchmark.py
[inductor] Refactor common triton imports into one function ( #121438 )
2024-03-09 18:17:36 +00:00