pytorch/torch/testing/_internal/distributed
Yanli Zhao 250c71121b Create a DDPLoggingData and expose it to python interface (#50622)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50622

1. Define a DDPLoggingData struct that is the placeholder for all the ddp related logging fields
2. Put the DDPLoggingData struct in the C10 directory so that it can be easily imported by c10 and torch files
3. Expose get_ddp_logging_data() method in python so that users can get the logging data and dump in their applications
4. Unit test tested the logging data can be set and got as expected
5. Follow up will add more logging fields such as perf stats, internal states, env variables and etc
ghstack-source-id: 120275870

Test Plan: unit tests

Reviewed By: SciPioneer

Differential Revision: D25930527

fbshipit-source-id: 290c200161019c58e28eed9a5a2a7a8153113f99
2021-01-25 15:23:07 -08:00
..
nn Add new backend type for Intel heterogeneous computation platform. (#49786) 2021-01-20 08:15:18 -08:00
pipeline Remove balance and devices parameter from Pipe. (#48432) 2020-12-01 11:21:59 -08:00
rpc [dist_optim] add distributed functional AdamW optimizer (#50620) 2021-01-23 01:04:45 -08:00
__init__.py remediation of S205607 2020-07-17 17:19:47 -07:00
ddp_under_dist_autograd_test.py [RPC Framework] Support remote device format "<workername>/<device>" (#46773) 2020-10-29 00:14:56 -07:00
distributed_test.py Create a DDPLoggingData and expose it to python interface (#50622) 2021-01-25 15:23:07 -08:00
pipe_with_ddp_test.py Ensure DDP + Pipe works with find_unused_parameters. (#49908) 2021-01-11 16:52:37 -08:00
rpc_utils.py Test pipeline parallelism works with DDP. (#48470) 2020-12-18 13:34:44 -08:00