mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 00:21:07 +01:00
Putting this PR as an RFC since I have resorted to some horrible hacks in order to make this work. ``` (Pdb) p triton.language.float32 triton.language.fp32 (Pdb) p str(triton.language.float32) 'fp32' (Pdb) p repr(triton.language.float32) 'triton.language.fp32' ``` This means that we need to "rewrite" them for fx graph and inductor execution. This PR allows Mamba2 to work with `torch.compile`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/121690 Approved by: https://github.com/Skylion007 |
||
|---|---|---|
| .. | ||
| aoti_runtime | ||
| cuda | ||
| __init__.py | ||
| common.py | ||
| cpp_prefix.h | ||
| cpp_wrapper_cpu.py | ||
| cpp_wrapper_cuda.py | ||
| cpp.py | ||
| cuda_combined_scheduling.py | ||
| memory_planning.py | ||
| multi_kernel.py | ||
| triton_foreach.py | ||
| triton_split_scan.py | ||
| triton_utils.py | ||
| triton.py | ||
| wrapper.py | ||