[torch][cuda] fix race condition in cuda initialization (#143238)

The access to lazy init callbacks (`_lazy_seed_tracker` and `_queued_calls`) is not synchronized with the initialization lock. This exposes us to the following race: 1. start `_lazy_init` 2. take `_initialization_lock` 3. flush `_queued_calls` and run them all 4. another thread comes in and uses `_lazy_call` to put something on the queue (in our case, the `manual_seed`) 5. original thread finishes initializing, but never runs that call Pull Request resolved: https://github.com/pytorch/pytorch/pull/143238 Approved by: https://github.com/ngimel
2025-12-06 12:20:52 +01:00 · 2024-12-14 07:41:22 +00:00 · 2024-12-14 07:41:22 +00:00 · 9933e59c2b
commit 9933e59c2b
parent 28d8297712
1 changed files with 14 additions and 13 deletions
--- a/torch/cuda/init.py
+++ b/torch/cuda/init.py
@ -245,6 +245,7 @@ def is_initialized():


 def _lazy_call(callable, **kwargs):
+    with _initialization_lock:
        if is_initialized():
            callable()
        else: