mirror of
https://github.com/zebrajr/pytorch.git
synced 2025-12-07 12:21:27 +01:00
- Refactored traceback code into `work.printTraceback()`. cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @shuqiangzhang - Refactored desync debug code into `class DesyncDebugger`. - Moved occurrences of `futureWorkResult_->markCompleted` into `checkAndSetException` and `checkTimeout`, respectively. cc @shuqiangzhang - Modularized dump signal broadcast code into `ProcessGroupNCCL::broadcastDumpSignal`. cc @fduwjj @c-p-i-o Pull Request resolved: https://github.com/pytorch/pytorch/pull/139834 Approved by: https://github.com/shuqiangzhang |
||
|---|---|---|
| .. | ||
| example | ||
| BackoffTest.cpp | ||
| CMakeLists.txt | ||
| CUDATest.cu | ||
| CUDATest.hpp | ||
| FileStoreTest.cpp | ||
| HashStoreTest.cpp | ||
| ProcessGroupGlooAsyncTest.cpp | ||
| ProcessGroupGlooTest.cpp | ||
| ProcessGroupMPITest.cpp | ||
| ProcessGroupNCCLErrorsTest.cpp | ||
| ProcessGroupNCCLTest.cpp | ||
| ProcessGroupUCCTest.cpp | ||
| StoreTestCommon.hpp | ||
| TCPStoreTest.cpp | ||
| TestUtils.hpp | ||