mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 12:21:27 +01:00
Retry of #75983. The change is to handle cases where attr::cache_id is not set. This can happen if compilation fails.

Original message:
1) Remember when fusions fail, and on subsequent runs always take the fallback.
2) During the first fallback, cache the Code object.

On autogen-69 from the nvfuser microbenchmarks (https://github.com/pytorch/benchmark/pull/801) this improved performance as follows:
* Original (always attempt fusion): 25ms
* Always take fallback after first failure: 0.79ms
* Always take fallback + cache Code object: 0.62ms
* Eager: 0.58ms

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76505
Approved by: https://github.com/eellison
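
The two steps in the original message (remember a failed fusion, then reuse a cached fallback) can be illustrated with a minimal Python sketch. This is not the PyTorch implementation: `FusionGroup`, `compile_fn`, `build_fallback`, and the counters are hypothetical names introduced only for illustration, and a plain callable stands in for the cached Code object.

```python
class FusionGroup:
    """Hypothetical stand-in for a fusion node; not a PyTorch class."""

    def __init__(self, compile_fn, build_fallback):
        self._compile_fn = compile_fn          # may raise if fusion is unsupported
        self._build_fallback = build_fallback  # builds the unfused fallback callable
        self._compiled = None                  # compiled kernel, once available
        self._fusion_failed = False            # step 1: remember a failed fusion
        self._fallback = None                  # step 2: cached fallback callable
        self.compile_attempts = 0              # counters for illustration only
        self.fallback_builds = 0

    def run(self, *args):
        if not self._fusion_failed:
            if self._compiled is None:
                self.compile_attempts += 1
                try:
                    self._compiled = self._compile_fn()
                except RuntimeError:
                    # Remember the failure so later runs skip compilation.
                    self._fusion_failed = True
            if self._compiled is not None:
                return self._compiled(*args)
        # Fallback path: build the fallback once, then reuse it.
        if self._fallback is None:
            self.fallback_builds += 1
            self._fallback = self._build_fallback()
        return self._fallback(*args)


def failing_compile():
    # Assumed failure mode for the sketch: compilation always fails.
    raise RuntimeError("fusion not supported")


def build_fallback():
    return lambda x, y: x + y


group = FusionGroup(failing_compile, build_fallback)
results = [group.run(1, 2) for _ in range(3)]
```

In this sketch, compilation is attempted once and the fallback is built once, no matter how many times `run` is called afterwards. That corresponds to the two savings listed in the benchmark numbers above.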
| Name |
|---|
| cuda |
| fuser |