Mirror of https://github.com/zebrajr/pytorch.git (synced 2025-12-07 12:21:27 +01:00)
Summary:
- Replace `C10_CUDA_KERNEL_LAUNCH_CHECK()` in the `KernelLauncher`, as the macro does not print `__FILE__` and `__LINE__`.

The existing `C10_CUDA_KERNEL_LAUNCH_CHECK()` implementation does not print the source file and line number when a CUDA kernel launch throws an error, leaving users with a context-less message like `CUDA error: invalid arguments`. The new check is a slimmed-down re-implementation of the macro that adds extra context to the error (beyond just the file and line number), so that we can at least locate the FBGEMM source file or template where the error first surfaces.

Test Plan:
```
buck2 run 'fbcode//mode/opt' fbcode//deeplearning/fbgemm/fbgemm_gpu/test/utils:kernel_launcher
buck2 run 'fbcode//mode/opt-amd-gpu' fbcode//deeplearning/fbgemm/fbgemm_gpu/test/utils:kernel_launcher
```

Reviewed By: sryap

Differential Revision: D74364031

Pull Request resolved: https://github.com/pytorch/pytorch/pull/153178
Approved by: https://github.com/atalman, https://github.com/huydhn

| Name |
|---|
| `__init__.py` |
| `constants.py` |
| `cuda_to_hip_mappings.py` |
| `hipify_python.py` |
| `version.py` |
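
The commit summary above describes a launch check that attaches the source file, line number, and extra context to the error message. The following is a minimal sketch of that general pattern in plain C++, not the actual FBGEMM or PyTorch implementation. All names here (`status_t`, `format_launch_error`, `check_launch`, `LAUNCH_CHECK`) are hypothetical, and an integer status stands in for the `cudaError_t` that a real check would obtain from the CUDA runtime.

```cpp
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a runtime status code (0 means success).
// In real CUDA code this role would be played by cudaError_t.
using status_t = int;

// Hypothetical helper: builds an error message that carries the call site
// (file and line) plus a free-form context string such as a kernel name.
inline std::string format_launch_error(
    status_t status,
    const char* file,
    int line,
    const std::string& context) {
  std::ostringstream oss;
  oss << "[" << file << ":" << line << "] kernel launch failed"
      << " (status " << status << ") in " << context;
  return oss.str();
}

// Hypothetical helper: throws std::runtime_error when the status is non-zero.
inline void check_launch(
    status_t status,
    const char* file,
    int line,
    const std::string& context) {
  if (status != 0) {
    throw std::runtime_error(format_launch_error(status, file, line, context));
  }
}

// The macro exists only to capture __FILE__ and __LINE__ at the call site;
// a plain function would report its own location instead of the caller's.
#define LAUNCH_CHECK(status, context) \
  check_launch((status), __FILE__, __LINE__, (context))
```

Calling `LAUNCH_CHECK(status, "my_kernel")` right after a launch would then throw an error naming the file and line of that call whenever `status` is non-zero. Keeping the formatting in a regular function and only the location capture in the macro keeps the macro small and makes the message format testable on its own.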