| .. |
|
autoheuristic
|
AutoHeuristic: mm ranking heuristic h100 (#133608)
|
2024-08-16 16:20:38 +00:00 |
|
codegen
|
[inductor] Simplify remove_kernel_local_buffers (#139452)
|
2024-11-01 20:36:39 +00:00 |
|
compile_worker
|
typing subproc_pool.py (#138032)
|
2024-10-18 15:31:05 +00:00 |
|
fx_passes
|
[Easy] Add joint graph passes, fallback_random to bisector (#139295)
|
2024-11-01 23:21:53 +00:00 |
|
kernel
|
[Flex Attention] Paged Attention (#137164)
|
2024-10-29 17:05:22 +00:00 |
|
package
|
[aoti] Accept constant inputs (#137197)
|
2024-10-03 20:59:33 +00:00 |
|
runtime
|
[inductor] set sanitize_overflow=False for triton kernels (#139502)
|
2024-11-01 23:10:21 +00:00 |
|
__init__.py
|
[pt2] improve error logs for torch.cond and aoti package (#138647)
|
2024-10-24 23:38:07 +00:00 |
|
aoti_eager.py
|
[4/N] Non-Tensor: Support layout, device and dtype for aten operations (#125897)
|
2024-07-23 17:50:17 +00:00 |
|
async_compile.py
|
[easy] Log subproc pool creation (#138642)
|
2024-10-23 02:41:42 +00:00 |
|
autotune_process.py
|
Add workspace to TritonTemplates (#138050)
|
2024-10-29 18:17:54 +00:00 |
|
bounds.py
|
Existing mypy issues (#136236)
|
2024-09-24 01:02:07 +00:00 |
|
codecache.py
|
[AOTI] Unify how weights are stored as data section (#139471)
|
2024-11-02 00:23:24 +00:00 |
|
comm_analysis.py
|
[BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768)
|
2024-07-20 16:20:58 +00:00 |
|
comm_lowering.py
|
Preliminary registered-buffer collective support via Inductor (#138029)
|
2024-10-30 18:11:09 +00:00 |
|
comms.py
|
[Traceable FSDP2] Use .copy_ instead of .set_ for unsharded_param inplace update; Replace unsharded_param graph input usage with graph intermediate; Support FSDP2+LoRA (#133730)
|
2024-09-11 23:01:05 +00:00 |
|
compile_fx.py
|
[fx graph cache] Support freezing with FX graph caching (#136505)
|
2024-11-01 18:29:29 +00:00 |
|
compiler_bisector.py
|
Class rename (#139490)
|
2024-11-02 00:10:17 +00:00 |
|
config.py
|
[AOTI] Unify how weights are stored as data section (#139471)
|
2024-11-02 00:23:24 +00:00 |
|
constant_folding.py
|
Constant folding for lifted graph (#135060)
|
2024-10-28 06:28:31 +00:00 |
|
cpp_builder.py
|
Enable Windows Arm64 (#133088)
|
2024-10-24 16:10:44 +00:00 |
|
cpu_vec_isa.py
|
[PyTorch] Use 128-bit vectors for ARM64 (#137426)
|
2024-10-26 00:20:35 +00:00 |
|
cudagraph_trees.py
|
typing compile_fx.py (#138033)
|
2024-10-21 18:14:59 +00:00 |
|
cudagraph_utils.py
|
[CUDAGraph] Warn once if too many distinct sizes (#132832)
|
2024-08-07 19:48:06 +00:00 |
|
custom_graph_pass.py
|
[inductor] Add API to make post_grad_custom passes cache-able (#137298)
|
2024-10-07 21:11:54 +00:00 |
|
debug.py
|
Refactor FxGraphDrawer to use HTML-like labels (#137726)
|
2024-11-01 23:19:50 +00:00 |
|
decomposition.py
|
Revert "Fix unbind_copy and add its decomposition (#134319)"
|
2024-10-29 04:54:37 +00:00 |
|
dependencies.py
|
[inductor] Fix reduction_hint sum to single element (#137754)
|
2024-10-13 01:08:23 +00:00 |
|
exc.py
|
Type _inductor/exc.py (#137595)
|
2024-10-09 23:15:06 +00:00 |
|
extern_node_serializer.py
|
[BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765)
|
2024-07-31 10:42:50 +00:00 |
|
freezing.py
|
[aotd] Cleanup on subclasses in inductor freezing (#136549)
|
2024-09-27 11:20:03 +00:00 |
|
fx_utils.py
|
Revert "Dont decompose aten.baddmm in inductor (#137904)"
|
2024-10-29 04:08:11 +00:00 |
|
graph.py
|
Class rename (#139490)
|
2024-11-02 00:10:17 +00:00 |
|
hooks.py
|
[BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768)
|
2024-07-20 16:20:58 +00:00 |
|
index_propagation.py
|
Add try except for _maybe_evaluate_static call in IndexPropagation (#132128)
|
2024-08-05 01:02:51 +00:00 |
|
inductor_prims.py
|
inductor: dont use default_dtype during rng functionalization (#136041)
|
2024-09-17 03:40:54 +00:00 |
|
ir.py
|
[inductor] make requires_stride_order more unbacked-symint-aware (#137063)
|
2024-10-31 13:11:02 +00:00 |
|
jagged_lowerings.py
|
Add lowering for aten.searchsorted (#135701)
|
2024-10-04 19:26:05 +00:00 |
|
loop_body.py
|
[inductor] Add LoopBody.op_counts (#137945)
|
2024-10-16 06:35:10 +00:00 |
|
lowering.py
|
[inductor] make requires_stride_order more unbacked-symint-aware (#137063)
|
2024-10-31 13:11:02 +00:00 |
|
memory.py
|
add condition for freeable input buffer (#139480)
|
2024-11-01 21:15:40 +00:00 |
|
metrics.py
|
[inductor] Multi-kernel + cooperative reductions (#138893)
|
2024-10-29 15:45:17 +00:00 |
|
mkldnn_ir.py
|
[AOTI] add C shim for QConvPointWise (#138540)
|
2024-10-31 02:03:01 +00:00 |
|
mkldnn_lowerings.py
|
[AOTI] add C shim for QConvPointWise (#138540)
|
2024-10-31 02:03:01 +00:00 |
|
ops_handler.py
|
Add lowering for aten.searchsorted (#135701)
|
2024-10-04 19:26:05 +00:00 |
|
optimize_indexing.py
|
type _inductor/optimize_indexing.py (#137599)
|
2024-10-09 23:29:47 +00:00 |
|
pattern_matcher.py
|
[Easy] Refactor post grad application of passes (#139293)
|
2024-10-31 17:05:27 +00:00 |
|
quantized_lowerings.py
|
type _inductor/quantized_lowerings.py (#137598)
|
2024-10-09 23:29:26 +00:00 |
|
remote_cache.py
|
Profile guided optimization for automatic_dynamic (#139001)
|
2024-11-01 21:43:25 +00:00 |
|
scheduler.py
|
[inductor] Simplify remove_kernel_local_buffers (#139452)
|
2024-11-01 20:36:39 +00:00 |
|
script.ld
|
|
|
|
select_algorithm.py
|
[inductor] Refactor kernel args into SIMDKernelFeatures (#139327)
|
2024-11-01 00:30:14 +00:00 |
|
sizevars.py
|
In Inductor, be willing to generate deferred runtime asserts when unbacked (#138804)
|
2024-10-28 02:19:55 +00:00 |
|
subgraph_lowering.py
|
[BE] typing for decorators - fx/_compatibility (part 1) (#134202)
|
2024-08-22 17:07:33 +00:00 |
|
test_case.py
|
[BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768)
|
2024-07-20 16:20:58 +00:00 |
|
test_operators.py
|
[BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768)
|
2024-07-20 16:20:58 +00:00 |
|
triton_bundler.py
|
Consolidate Triton cache into Inductor cache (#138239)
|
2024-10-31 01:37:16 +00:00 |
|
utils.py
|
[inductor] Remove SIMDKernel.last_usage (#139364)
|
2024-11-01 16:28:15 +00:00 |
|
virtualized.py
|
[inductor] Move LoopBody to its own file (#135257)
|
2024-09-07 16:29:15 +00:00 |
|
wrapper_benchmark.py
|
[fx graph cache] Support freezing with FX graph caching (#136505)
|
2024-11-01 18:29:29 +00:00 |