pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Colin Peppler a6149eba12 [easy] Refactor MultiOutput. codegen_list_tuple_access to use subclass type checks (#121662 ) Summary: # Why? Right now I'm running into a case where `itype` is `torch.fx.immutable_collections.immutable_list` which is a subclass of `list`. However, currently we're checking the concrete types (i.e. `list`) and `immutable_list` isn't explictly supported here. Thus, we use a runtime check that looks at the subclass so we can support subclasses -- such as immutable_list -- as well. Test Plan: ci Differential Revision: D54764829 Pull Request resolved: https://github.com/pytorch/pytorch/pull/121662 Approved by: https://github.com/aakhundov		2024-03-12 19:27:56 +00:00
..
codegen	Express y grid > 2^16 in terms of z grid (#121554 )	2024-03-12 02:36:19 +00:00
fx_passes	Port remove_split_ops to PT2 pre-grad passes (#121674 )	2024-03-12 01:15:19 +00:00
kernel	Revert "Add CUTLASS kernel as choice for _int_mm() Inductor autotuning (#119685 )"	2024-03-09 07:20:53 +00:00
__init__.py	[export][reland] Disable exported_program.__call__ (#120019 )	2024-03-05 05:29:46 +00:00
autotune_process.py	[inductor][Gemm] Autotune with matrix_instr_nonkdim for AMDGPU (#120742 )	2024-02-28 20:27:14 +00:00
bounds.py	Some minor type stub improvements (#118529 )	2024-02-04 00:19:00 +00:00
codecache.py	[inductor] Make some improvements to FX graph caching (#117888 )	2024-03-08 02:30:49 +00:00
comm_analysis.py	[reland] Fix estimate_nccl_collective_runtime (#118986 )	2024-02-12 18:48:06 +00:00
comms.py	[BE]: Apply RUF025 dict.fromkeys preview rule (#118637 )	2024-01-30 20:46:54 +00:00
compile_fx.py	Switch cudagraph backend to cudagraph trees (#121019 )	2024-03-08 22:56:26 +00:00
config.py	[Inductor] Enable ABI-compatible mode for cpp-wrapper JIT (#121309 )	2024-03-07 14:22:06 +00:00
constant_folding.py	[AOTInductor] Add Runtime Constant-folding for AOTInductor (#118765 )	2024-02-01 04:54:25 +00:00
coordinate_descent_tuner.py	Fix typo under torch/_inductor directory (#110530 )	2023-10-05 02:17:20 +00:00
cudagraph_trees.py	improve cuda graph symint logging msg (#119739 )	2024-02-13 06:26:36 +00:00
cudagraph_utils.py	Switch cudagraph backend to cudagraph trees (#121019 )	2024-03-08 22:56:26 +00:00
debug.py	[inductor max autotune] Detailed autotuning result logs ( machine-readable ) (#119004 )	2024-02-29 18:24:13 +00:00
decomposition.py	Revert "Batch Norm Consolidation (#116092 )"	2024-03-11 22:22:41 +00:00
dependencies.py	add decomposition for frexp (#119217 )	2024-02-23 21:52:42 +00:00
exc.py	[AOTInductor] Improve validation for C++ wrapper codegen (#111102 )	2023-10-13 08:46:17 +00:00
freezing.py	FF inductor failure (#114980 )	2023-12-04 18:26:34 +00:00
fx_utils.py	Unify MYPYINDUCTOR and MYPY (#118432 )	2024-01-27 17:23:20 +00:00
graph.py	[AOTInductor] Reuse generated kernels between constant graph and main graph (#121564 )	2024-03-11 22:44:38 +00:00
hooks.py	Enable local_partial_types (#118467 )	2024-01-28 13:38:22 +00:00
index_propagation.py	Correct index propagation for % (#119864 )	2024-02-16 19:14:39 +00:00
inductor_prims.py	[BE]: Remove useless lambdas (#113602 )	2023-11-14 20:06:48 +00:00
ir.py	[easy] Refactor MultiOutput. codegen_list_tuple_access to use subclass type checks (#121662 )	2024-03-12 19:27:56 +00:00
lowering.py	[Quant][Inductor] Fix q/dq per channel lowering with 64-bit qparams (#120984 )	2024-03-07 06:23:52 +00:00
metrics.py	[inductor] Make some improvements to FX graph caching (#117888 )	2024-03-08 02:30:49 +00:00
ops_handler.py	add decomposition for frexp (#119217 )	2024-02-23 21:52:42 +00:00
optimize_indexing.py	Some minor type stub improvements (#118529 )	2024-02-04 00:19:00 +00:00
pattern_matcher.py	[PT2][Optimus][Observability] Log the optimus graph transformation to the scuba (#119745 )	2024-02-16 21:32:04 +00:00
quantized_lowerings.py	[Quant][Inductor] Enable the lowering of quantized maxpool2d (#105906 )	2023-08-26 08:36:47 +00:00
scheduler.py	Benchmark templates (#118880 )	2024-03-11 23:55:13 +00:00
script.ld	Place .lrodata later in the binary (#117575 )	2024-01-18 17:58:18 +00:00
select_algorithm.py	[inductor] Refactor common triton imports into one function (#121438 )	2024-03-09 18:17:36 +00:00
sizevars.py	Remove follow_imports = skip from sympy (#118469 )	2024-01-28 13:38:38 +00:00
test_case.py	Refactor some testing helpers for FX graph cache testing (#121520 )	2024-03-11 21:46:27 +00:00
test_operators.py	Grandfather in some more pytorch ops to be pt2_compliant (#113050 )	2023-11-09 02:35:33 +00:00
triton_helpers.py	[inductor] Changes to support newer triton pin (#121267 )	2024-03-09 18:17:36 +00:00
triton_heuristics.py	Express y grid > 2^16 in terms of z grid (#121554 )	2024-03-12 02:36:19 +00:00
utils.py	Express y grid > 2^16 in terms of z grid (#121554 )	2024-03-12 02:36:19 +00:00
virtualized.py	[EASY] Move OpsHandler and MockHandler to their own file (#119851 )	2024-02-15 18:54:41 +00:00
wrapper_benchmark.py	[inductor] Refactor common triton imports into one function (#121438 )	2024-03-09 18:17:36 +00:00