mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 12:21:27 +01:00
Introduced RECORD_OUTPUTS() macro that goes with RECORD_FUNCTION(). It is used to capture the output tensors from a kernel launch. The tensors automatically get passed to the profiler using record_function methods. This allows the profiler to track the tensors that flow into and out of each op. Fixes #85575 cc @robieta @chaekit @aaronenyeshi @ngimel @nbcsm @guotuofeng @guyang3532 @gaoteng-git @tiffzhaofb Pull Request resolved: https://github.com/pytorch/pytorch/pull/86514 Approved by: https://github.com/robieta |
||
|---|---|---|
| .. | ||
| cuda | ||
| fuser | ||
| onednn | ||