pytorch/torch/_inductor
Bin Bao 8c17830dea [AOTI] Unify how weights are stored as data section (#139471)
Summary: https://github.com/pytorch/pytorch/pull/118076 introduced a cleaner way to link weights as a data section for macos. Unify the code by adopting that approach for Linux as well.

Test Plan: CI

Differential Revision: D65302273

Pull Request resolved: https://github.com/pytorch/pytorch/pull/139471
Approved by: https://github.com/chenyang78
2024-11-02 00:23:24 +00:00
..
autoheuristic AutoHeuristic: mm ranking heuristic h100 (#133608) 2024-08-16 16:20:38 +00:00
codegen [inductor] Simplify remove_kernel_local_buffers (#139452) 2024-11-01 20:36:39 +00:00
compile_worker typing subproc_pool.py (#138032) 2024-10-18 15:31:05 +00:00
fx_passes [Easy] Add joint graph passes, fallback_random to bisector (#139295) 2024-11-01 23:21:53 +00:00
kernel [Flex Attention] Paged Attention (#137164) 2024-10-29 17:05:22 +00:00
package [aoti] Accept constant inputs (#137197) 2024-10-03 20:59:33 +00:00
runtime [inductor] set sanitize_overflow=False for triton kernels (#139502) 2024-11-01 23:10:21 +00:00
__init__.py [pt2] improve error logs for torch.cond and aoti package (#138647) 2024-10-24 23:38:07 +00:00
aoti_eager.py [4/N] Non-Tensor: Support layout, device and dtype for aten operations (#125897) 2024-07-23 17:50:17 +00:00
async_compile.py [easy] Log subproc pool creation (#138642) 2024-10-23 02:41:42 +00:00
autotune_process.py Add workspace to TritonTemplates (#138050) 2024-10-29 18:17:54 +00:00
bounds.py Existing mypy issues (#136236) 2024-09-24 01:02:07 +00:00
codecache.py [AOTI] Unify how weights are stored as data section (#139471) 2024-11-02 00:23:24 +00:00
comm_analysis.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
comm_lowering.py Preliminary registered-buffer collective support via Inductor (#138029) 2024-10-30 18:11:09 +00:00
comms.py [Traceable FSDP2] Use .copy_ instead of .set_ for unsharded_param inplace update; Replace unsharded_param graph input usage with graph intermediate; Support FSDP2+LoRA (#133730) 2024-09-11 23:01:05 +00:00
compile_fx.py [fx graph cache] Support freezing with FX graph caching (#136505) 2024-11-01 18:29:29 +00:00
compiler_bisector.py Class rename (#139490) 2024-11-02 00:10:17 +00:00
config.py [AOTI] Unify how weights are stored as data section (#139471) 2024-11-02 00:23:24 +00:00
constant_folding.py Constant folding for lifted graph (#135060) 2024-10-28 06:28:31 +00:00
cpp_builder.py Enable Windows Arm64 (#133088) 2024-10-24 16:10:44 +00:00
cpu_vec_isa.py [PyTorch] Use 128-bit vectors for ARM64 (#137426) 2024-10-26 00:20:35 +00:00
cudagraph_trees.py typing compile_fx.py (#138033) 2024-10-21 18:14:59 +00:00
cudagraph_utils.py [CUDAGraph] Warn once if too many distinct sizes (#132832) 2024-08-07 19:48:06 +00:00
custom_graph_pass.py [inductor] Add API to make post_grad_custom passes cache-able (#137298) 2024-10-07 21:11:54 +00:00
debug.py Refactor FxGraphDrawer to use HTML-like labels (#137726) 2024-11-01 23:19:50 +00:00
decomposition.py Revert "Fix unbind_copy and add its decomposition (#134319)" 2024-10-29 04:54:37 +00:00
dependencies.py [inductor] Fix reduction_hint sum to single element (#137754) 2024-10-13 01:08:23 +00:00
exc.py Type _inductor/exc.py (#137595) 2024-10-09 23:15:06 +00:00
extern_node_serializer.py [BE][Easy][14/19] enforce style for empty lines in import segments in torch/_[a-c]*/ and torch/_[e-h]*/ and torch/_[j-z]*/ (#129765) 2024-07-31 10:42:50 +00:00
freezing.py [aotd] Cleanup on subclasses in inductor freezing (#136549) 2024-09-27 11:20:03 +00:00
fx_utils.py Revert "Dont decompose aten.baddmm in inductor (#137904)" 2024-10-29 04:08:11 +00:00
graph.py Class rename (#139490) 2024-11-02 00:10:17 +00:00
hooks.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
index_propagation.py Add try except for _maybe_evaluate_static call in IndexPropagation (#132128) 2024-08-05 01:02:51 +00:00
inductor_prims.py inductor: dont use default_dtype during rng functionalization (#136041) 2024-09-17 03:40:54 +00:00
ir.py [inductor] make requires_stride_order more unbacked-symint-aware (#137063) 2024-10-31 13:11:02 +00:00
jagged_lowerings.py Add lowering for aten.searchsorted (#135701) 2024-10-04 19:26:05 +00:00
loop_body.py [inductor] Add LoopBody.op_counts (#137945) 2024-10-16 06:35:10 +00:00
lowering.py [inductor] make requires_stride_order more unbacked-symint-aware (#137063) 2024-10-31 13:11:02 +00:00
memory.py add condition for freeable input buffer (#139480) 2024-11-01 21:15:40 +00:00
metrics.py [inductor] Multi-kernel + cooperative reductions (#138893) 2024-10-29 15:45:17 +00:00
mkldnn_ir.py [AOTI] add C shim for QConvPointWise (#138540) 2024-10-31 02:03:01 +00:00
mkldnn_lowerings.py [AOTI] add C shim for QConvPointWise (#138540) 2024-10-31 02:03:01 +00:00
ops_handler.py Add lowering for aten.searchsorted (#135701) 2024-10-04 19:26:05 +00:00
optimize_indexing.py type _inductor/optimize_indexing.py (#137599) 2024-10-09 23:29:47 +00:00
pattern_matcher.py [Easy] Refactor post grad application of passes (#139293) 2024-10-31 17:05:27 +00:00
quantized_lowerings.py type _inductor/quantized_lowerings.py (#137598) 2024-10-09 23:29:26 +00:00
remote_cache.py Profile guided optimization for automatic_dynamic (#139001) 2024-11-01 21:43:25 +00:00
scheduler.py [inductor] Simplify remove_kernel_local_buffers (#139452) 2024-11-01 20:36:39 +00:00
script.ld
select_algorithm.py [inductor] Refactor kernel args into SIMDKernelFeatures (#139327) 2024-11-01 00:30:14 +00:00
sizevars.py In Inductor, be willing to generate deferred runtime asserts when unbacked (#138804) 2024-10-28 02:19:55 +00:00
subgraph_lowering.py [BE] typing for decorators - fx/_compatibility (part 1) (#134202) 2024-08-22 17:07:33 +00:00
test_case.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
test_operators.py [BE][Easy][16/19] enforce style for empty lines in import segments in torch/_i*/ (#129768) 2024-07-20 16:20:58 +00:00
triton_bundler.py Consolidate Triton cache into Inductor cache (#138239) 2024-10-31 01:37:16 +00:00
utils.py [inductor] Remove SIMDKernel.last_usage (#139364) 2024-11-01 16:28:15 +00:00
virtualized.py [inductor] Move LoopBody to its own file (#135257) 2024-09-07 16:29:15 +00:00
wrapper_benchmark.py [fx graph cache] Support freezing with FX graph caching (#136505) 2024-11-01 18:29:29 +00:00