mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 12:21:27 +01:00
The scaled_grouped_mm kernel only supports a column-major layout for its second operand. I -think- this is just for efficiency reasons. But inductor treats that buffer as flexible and may change its strides to row-major instead, as seen in the issue.

~Tagging the op as "needs_fixed_stride_order"/"needs_exact_strides" does not work. Inductor only considers those tags for ops that don't have a registered lowering (not sure if this is intended). scaled_grouped_mm does have a lowering, so we never check its tags.~

From the discussion below, the op tags are expected to work.

Fixes https://github.com/pytorch/pytorch/issues/159097

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159134
Approved by: https://github.com/eellison

| Name |
|---|
| `__init__.py` |
| `autograd.py` |
| `custom_ops.py` |
| `fake_class_registry.py` |
| `fake_impl.py` |
| `fake_profile.py` |
| `infer_schema.py` |
| `simple_registry.py` |
| `triton.py` |
| `utils.py` |
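
The commit message above turns on the difference between row-major and column-major strides for the second operand. The following is a minimal, pure-Python sketch of that distinction, not inductor's or the kernel's actual check. The helper names (`contiguous_strides`, `column_major_strides`, `is_column_major`) and the example shape are hypothetical, and strides are counted in elements.

```python
from typing import Sequence, Tuple


def contiguous_strides(shape: Sequence[int]) -> Tuple[int, ...]:
    """Row-major (C-contiguous) strides, in elements, for `shape`."""
    strides = [1] * len(shape)
    for i in range(len(shape) - 2, -1, -1):
        strides[i] = strides[i + 1] * shape[i + 1]
    return tuple(strides)


def column_major_strides(shape: Sequence[int]) -> Tuple[int, ...]:
    """Strides that make the last two dims column-major.

    Computed by laying the data out contiguously with the last two
    dims swapped, then swapping the two strides back.
    """
    swapped = list(shape)
    swapped[-1], swapped[-2] = swapped[-2], swapped[-1]
    strides = list(contiguous_strides(swapped))
    strides[-1], strides[-2] = strides[-2], strides[-1]
    return tuple(strides)


def is_column_major(shape: Sequence[int], strides: Sequence[int]) -> bool:
    """True if stepping along the second-to-last dim moves one element
    and stepping along the last dim skips at least a full column."""
    return strides[-2] == 1 and strides[-1] >= max(1, shape[-2])


# Hypothetical second operand: 4 groups of (K=64, N=128) matrices.
shape = (4, 64, 128)
row_major = contiguous_strides(shape)    # layout a "flexible" buffer might get
col_major = column_major_strides(shape)  # layout the kernel is said to need

print(row_major, is_column_major(shape, row_major))
print(col_major, is_column_major(shape, col_major))
```

In PyTorch terms, the column-major strides here are the ones produced by creating a contiguous tensor of shape `(4, 128, 64)` and calling `.transpose(-2, -1)` on it. A buffer whose strides are not pinned can lose this property when a compiler is free to choose a layout, which is the situation the commit message describes.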