pytorch/torch/testing
Wei Wang d7e275d4b4 [CI][CUDA] Add periodic b200 distributed job (#159323)
1. Run distributed job with B200 runner, periodically.
2. discovered generic distributed test issue that certain unit test hard-coded ranks, calling for require_exact_world_size(world_size) API instead of require_world_size(world_size).

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159323
Approved by: https://github.com/eqy

Co-authored-by: Aidyn-A <aidyn.b.aitzhan@gmail.com>
2025-10-16 21:54:04 +00:00
..
_internal [CI][CUDA] Add periodic b200 distributed job (#159323) 2025-10-16 21:54:04 +00:00
__init__.py Add pyrefly suppressions 2/n (#164513) 2025-10-03 02:46:13 +00:00
_comparison.py Pyrefly suppressions 6/n (#164877) 2025-10-08 02:30:57 +00:00
_creation.py [BE][PYFMT] migrate PYFMT for torch/[p-z]*/ to ruff format (#144552) 2025-08-07 00:09:56 +00:00
_utils.py Fix code descriptions in the test package. (#148145) 2025-03-04 19:14:41 +00:00