pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Yi Wang ecb5ac90ed [Gradient Compression] Add get_per_parameter_tensors method to GradBucket class (#53009)	2021-03-02 14:39:03 -08:00

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53009

It can be a common operation to apply layer-wise operations over per-parameter tensors in a DDP communication hook.

Create a util method in GradBucket class before publishing GradBucket APIs.
ghstack-source-id: 122833594

Test Plan:
buck test mode/dev-nosan caffe2/test/distributed:c10d -- test_powerSGD_ddp_comm_hook_nccl

f254364097

Reviewed By: rohan-varma

Differential Revision: D26717893

fbshipit-source-id: 916db319de8b85dd22bc4e35db5671bf4e34740f
..
ddp_comm_hooks	[Gradient Compression] Add get_per_parameter_tensors method to GradBucket class (#53009)	2021-03-02 14:39:03 -08:00
__init__.py	[Gradient Compression] Add unit tests that test default Python comm hook implementations (#47158)	2020-11-06 00:28:09 -08:00

