Teja
|
dd3e7170c2
|
Add async checkpointing impl to experimental checkpointer and add a builder API (#156927)
1. Adds an AsyncCheckpointer with out-of-process checkpointing, and a state_dict_stager with shared-memory, pinned-memory, and zero-overhead support.
2. Adds two convenience functions to create sync/async checkpointers.
Differential Revision: [D77336833](https://our.internmc.facebook.com/intern/diff/D77336833/)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/156927
Approved by: https://github.com/pradeepfn
|
2025-07-03 22:49:20 +00:00 |
|