OSSForks/pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Commit Graph

Author	SHA1	Message	Date


Ke Wen

ed838793df

[pipelining] Remove qualname mapping (#127018 )

`QualnameMapMixin` was intended to provide a mapping from the new FQNs of the piped model to the FQNs of the original model. It was there because previous tracers, and the flattening done during tracing, would modify the FQNs.

Now that we use the unflattener, the FQNs of the stage modules are the same as the original FQNs. We don't need `QualnameMapMixin` anymore.
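
To illustrate what such a mapping had to do, here is a minimal sketch of prefix-based FQN remapping. It is not the code of the removed `QualnameMapMixin`; the helper `remap_fqn` and the names `submod_0` and `layers_0` are hypothetical and only show the kind of renaming a tracer might introduce.

```python
from typing import Dict


def remap_fqn(fqn: str, prefix_map: Dict[str, str]) -> str:
    """Map a (hypothetical) post-tracing FQN back to the original FQN.

    prefix_map maps renamed module prefixes to their original prefixes.
    The longest matching prefix wins; unmapped names are returned unchanged.
    """
    parts = fqn.split(".")
    # Try the longest candidate prefix first, then shorter ones.
    for cut in range(len(parts), 0, -1):
        prefix = ".".join(parts[:cut])
        if prefix in prefix_map:
            return ".".join([prefix_map[prefix], *parts[cut:]])
    return fqn


# Assumed example: tracing renamed "layers.0" to "submod_0.layers_0".
prefix_map = {"submod_0.layers_0": "layers.0"}
original = remap_fqn("submod_0.layers_0.weight", prefix_map)
```

When the stage modules keep their original FQNs, `prefix_map` would be empty and the lookup becomes an identity function, which is why the mixin could be dropped.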

Pull Request resolved: https://github.com/pytorch/pytorch/pull/127018
Approved by: https://github.com/H-Huang

2024-05-25 02:32:40 +00:00

Will Constable

6b39146b3f

[pipelining] Validate stage input/output shape/dtype (#126732 )

Address the classes of user errors stemming from (possibly)
unintentional use of dynamic shapes or a mismatch between configuration-time
and run-time data shapes/dtypes.

The goal is to ensure a clear error is raised rather than relying on some underlying
error to bubble up when a tensor shape is not compatible, or worse,
having a silent correctness issue.

**Classes of shape/dtype errors**
* (a) error is thrown within the stage-module forward code, but may be
hard to understand/trace back to an input issue
* (b) silent correctness issue happens inside the stage-module forward,
but the expected output shape is still produced
* (c) the stage-module produces an output that is locally correct, but not
matching the expectation of the following stage, leading to a hang or
correctness issue down the line

**How validation helps**

Input shape validation
- improves debuggability of case (a)
- guards against case (b)
- only needed on first stage, since subsequent stages use pre-allocated recv
  buffers that can't change shape/size even if they wanted to

Output shape validation
- guards against case (c)

Validation of the first stage's input and all stages' outputs inductively verifies all shapes.

Shape/dtype are most critical as they literally affect the number of
bytes on the wire.  Strides and other tensor properties may also (?)
matter, and the validation function can be adjusted accordingly if needed.
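
The check described above can be sketched roughly as follows. This is an illustrative sketch, not the code added by the PR: `TensorMeta`, `StageValidationError`, and `validate_tensors` are hypothetical names, and the function only assumes each runtime value exposes `.shape` and `.dtype` (as a `torch.Tensor` does).

```python
from typing import Any, NamedTuple, Sequence, Tuple


class TensorMeta(NamedTuple):
    """Hypothetical record of the shape/dtype fixed at configuration time."""
    shape: Tuple[int, ...]
    dtype: Any


class StageValidationError(ValueError):
    """Hypothetical error type raised on a shape/dtype mismatch."""


def validate_tensors(desc: str, expected: Sequence[Any], actual: Sequence[Any]) -> None:
    """Raise a clear error if runtime tensors differ from the configured metadata."""
    if len(expected) != len(actual):
        raise StageValidationError(
            f"{desc}: expected {len(expected)} tensors, got {len(actual)}"
        )
    for i, (exp, act) in enumerate(zip(expected, actual)):
        # Compare as tuples so torch.Size and plain tuples are treated alike.
        if tuple(act.shape) != tuple(exp.shape):
            raise StageValidationError(
                f"{desc} #{i}: expected shape {tuple(exp.shape)}, got {tuple(act.shape)}"
            )
        if act.dtype != exp.dtype:
            raise StageValidationError(
                f"{desc} #{i}: expected dtype {exp.dtype}, got {act.dtype}"
            )
```

Under the scheme in the message, a stage would call such a check on its inputs only if it is the first stage, and on its outputs in every stage, for example `validate_tensors("stage 0 input", configured_inputs, runtime_inputs)` with caller-supplied sequences.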

Pull Request resolved: https://github.com/pytorch/pytorch/pull/126732
Approved by: https://github.com/kwen2501

2024-05-23 20:16:06 +00:00

Ke Wen

c1a3fcfa47

[pipelining] Add util and debug facilities (#124875 )

Pull Request resolved: https://github.com/pytorch/pytorch/pull/124875
Approved by: https://github.com/H-Huang
ghstack dependencies: #124776

2024-04-30 19:41:41 +00:00

3 Commits