pytorch/torch/_library
Simon Fan 669009bcd1 [inductor] respect layout tags for ops with registered lowerings (#159134)
scaled_grouped_mm's kernel only supports column-major on the second operand. I -think- this is just for efficiency reasons. But inductor treats that buffer as flexible and may tweak the strides to be row-major instead, as seen in the issue.

~Tagging the op as "needs_fixed_stride_order"/"needs_exact_strides" does not work. Inductor only considers those tags for ops that don't have registered lowering (not sure if this is intended). scaled_grouped_mm does have a lowering, so we never check its tags.~ From discussion below, the op tags are expected to work.

FIXES https://github.com/pytorch/pytorch/issues/159097

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159134
Approved by: https://github.com/eellison
2025-07-31 21:29:40 +00:00
..
__init__.py fix typo in doc and import for torch._library.triton (#144882) 2025-01-17 17:32:12 +00:00
autograd.py [custom_ops][perf] Move expensive pytree traversals of tensors to C++ (#148555) 2025-04-01 18:45:48 +00:00
custom_ops.py [BE] remove torch deploy - conditionals (#158288) 2025-07-29 17:40:49 +00:00
fake_class_registry.py [torchbind] fix fakifying a staitc tensor returns dynamic accidentally (#158607) 2025-07-25 20:55:41 +00:00
fake_impl.py [custom ops] Override fake registration (#150806) 2025-04-12 02:43:47 +00:00
fake_profile.py [BE][2/16] fix typos in torch/ (torch/_*/) (#156312) 2025-07-12 05:47:06 +00:00
infer_schema.py [BE][PYFMT] migrate PYFMT for torch/_[a-h]*/ to ruff format (#144551) 2025-06-25 06:16:06 +00:00
simple_registry.py
triton.py Typo Errors fixed in multiple files (#148262) 2025-03-09 12:21:40 +00:00
utils.py [inductor] respect layout tags for ops with registered lowerings (#159134) 2025-07-31 21:29:40 +00:00