This lowers the `reduce_dtype` retrieval to the `handle` instead of the `state` in preparation for `fully_shard`, and adds a guard to avoid a no-op `to()` call.
Note that this change is largely superseded by the PRs that follow.
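A minimal sketch of the guard idea (the helper name is hypothetical; retrieving `reduce_dtype` from the handle is not shown):
```python
import torch

def cast_grad_for_reduce(grad: torch.Tensor, reduce_dtype: torch.dtype) -> torch.Tensor:
    # Guard: skip the .to() call entirely when the gradient is already in
    # reduce_dtype, so a matching dtype does not trigger a no-op call.
    if grad.dtype != reduce_dtype:
        grad = grad.to(reduce_dtype)
    return grad

# A float32 gradient is cast for reduction; a bfloat16 one is returned unchanged.
g = cast_grad_for_reduce(torch.ones(4), torch.bfloat16)
```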
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90615
Approved by: https://github.com/rohan-varma
Use `register_state_dict_pre_hook` in FSDP to simplify the `state_dict` implementations and remove hacks. This removes `def state_dict` entirely and paves the way for the composable API as well.
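For reference, a hedged sketch of the `nn.Module` hook mechanism being used (not FSDP's actual hook body): a state_dict pre-hook runs before the module's `state_dict` is populated, which is where FSDP can do its preparation without overriding `def state_dict`.
```python
import torch.nn as nn

def _pre_state_dict_hook(module: nn.Module, prefix: str, keep_vars: bool) -> None:
    # Placeholder body; FSDP would prepare (e.g., unshard) parameters here.
    print(f"preparing state_dict for prefix {prefix!r}")

linear = nn.Linear(4, 4)
linear.register_state_dict_pre_hook(_pre_state_dict_hook)
sd = linear.state_dict()  # the hook fires before the keys are collected
```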
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90436
Approved by: https://github.com/fegin
This stores a data structure `_stream_to_name: Dict[torch.cuda.Stream, str]` that maps each FSDP stream to its name. This can help with debugging: for example, checking `_stream_to_name[torch.cuda.current_stream()]` in the post-backward hook shows whether the current stream is `"default"` or `"unshard"`.
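An illustrative sketch of that debugging pattern (the construction here is hypothetical and requires a CUDA device):
```python
import torch

unshard_stream = torch.cuda.Stream()
_stream_to_name = {
    torch.cuda.current_stream(): "default",
    unshard_stream: "unshard",
}

def _debug_current_stream_name() -> str:
    # e.g., call from the post-backward hook to see which stream is active
    return _stream_to_name.get(torch.cuda.current_stream(), "unknown")
```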
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90611
Approved by: https://github.com/rohan-varma
Continuation of https://github.com/pytorch/pytorch/pull/90163.
Here is the script I used to find all docstring arguments that do not exist in the corresponding function signature (the script can give false positives in the presence of *args/**kwargs or decorators):
_Edit:_
I've realized that the indentation of the last `break` in the script is wrong, so the script only reports a function if its first docstring argument is wrong. I'll create a separate PR if I find more issues with the corrected script.
``` python
import ast
import os
import docstring_parser

for root, dirs, files in os.walk('.'):
    for name in files:
        if root.startswith("./.git/") or root.startswith("./third_party/"):
            continue
        if name.endswith(".py"):
            full_name = os.path.join(root, name)
            with open(full_name, "r") as source:
                tree = ast.parse(source.read())
                for node in ast.walk(tree):
                    if isinstance(node, ast.FunctionDef):
                        # Collect every argument name the function actually accepts.
                        all_node_args = node.args.args
                        if node.args.vararg is not None:
                            all_node_args.append(node.args.vararg)
                        if node.args.kwarg is not None:
                            all_node_args.append(node.args.kwarg)
                        if node.args.posonlyargs is not None:
                            all_node_args.extend(node.args.posonlyargs)
                        if node.args.kwonlyargs is not None:
                            all_node_args.extend(node.args.kwonlyargs)
                        args = [a.arg for a in all_node_args]
                        # Parse the docstring and keep only identifier-like argument names.
                        docstring = docstring_parser.parse(ast.get_docstring(node))
                        doc_args = [a.arg_name for a in docstring.params]
                        clean_doc_args = []
                        for a in doc_args:
                            clean_a = ""
                            for c in a.split()[0]:
                                if c.isalnum() or c == '_':
                                    clean_a += c
                            if clean_a:
                                clean_doc_args.append(clean_a)
                        doc_args = clean_doc_args
                        # Report documented arguments missing from the signature.
                        for a in doc_args:
                            if a not in args:
                                print(full_name, node.lineno, args, doc_args)
                            # NOTE: per the edit above, this break is mis-indented (it
                            # should be inside the `if`), so only the first docstring
                            # argument is effectively checked.
                            break
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90505
Approved by: https://github.com/malfet, https://github.com/ZainRizvi
Adds 2 new hybrid sharding strategies to FSDP:
1. HYBRID_SHARD: applies zero-3-style sharding within a node and data parallelism across nodes
2. HYBRID_SHARD_ZERO2: applies zero-2-style sharding within a node and data parallelism across nodes
These are useful for medium-sized models and aim to decrease communication volume; tests and benchmarks will be run to understand which workloads are optimal under which sharding strategy.
Hybrid sharding in general works by sharding the model with a process group within a single node and creating inter-node process groups for replication / data parallelism, as sketched below. The user either passes in a tuple of these process groups or None, in which case we generate the process groups appropriately.
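A hedged usage sketch (it assumes a NCCL process group is already initialized across at least two nodes; with the default process-group argument, FSDP derives the intra-node sharding group and inter-node replication group itself):
```python
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

model = FSDP(
    nn.Linear(8, 8).cuda(),
    sharding_strategy=ShardingStrategy.HYBRID_SHARD,
)
```
Per the description above, a tuple of the (intra-node, inter-node) process groups can instead be passed explicitly.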
**Acknowledgements**
- @awgu 's excellent prototype: 5ad3a16d48
- @liangluofb for ideation, feedback, and the initial implementation and experimentation
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89915
Approved by: https://github.com/awgu
- This PR introduces a new concept, the _communication module_ (denoted `comm_module`), that represents the module responsible for the unshard/reshard pair for a `FlatParamHandle`. This is well-defined because the current design assumes that each `FlatParamHandle` only has _one_ unshard/reshard pair for either the forward or backward pass.
- For the wrapper code path, the `comm_module` is exactly the module already being passed to the `FlatParamHandle` constructor.
- For the composable code path, the `comm_module` is not necessarily the module already being passed to the `FlatParamHandle`. This is because the module being passed is always the local FSDP root module, so that FQNs are complete rather than local. Distinguishing the communication module from the local FSDP root module can provide more flexibility for non-recursive wrapping designs in the future.
- This PR adds a unit test `test_unshard_reshard_order` that explicitly checks that `_unshard` and `_reshard` are called in exactly the same order across the two code paths.
- This PR does not fix `test_checkpoint_fsdp_submodules_use_reentrant`. However, the error message changes, so this PR accommodates that.
- The error is now the same as if we used the equivalent wrapper FSDP:
```
test_model.u1 = FSDP(test_model.u1, use_orig_params=True)
test_model.u2 = FSDP(test_model.u2, use_orig_params=True)
```
- The error is also the same as if we used wrapper FSDP with `use_orig_params=False`, so it is not unique to `use_orig_params=True`.
---
**`comm_module` Example**
```
model = Model(
    seq1: nn.Sequential(
        nn.Linear
        nn.ReLU
        nn.Linear
        nn.ReLU
    )
    seq2: nn.Sequential(
        nn.Linear
        nn.ReLU
        nn.Linear
        nn.ReLU
    )
)
policy = ModuleWrapPolicy({nn.Sequential})
fully_shard(model, policy=policy)
FullyShardedDataParallel(model, auto_wrap_policy=policy)
```
- This policy constructs two `FlatParamHandle`s, one for `seq1` and one for `seq2`.
- `FullyShardedDataParallel` will pass `seq1` and `seq2` as the `module` argument to the two `FlatParamHandle`s, respectively.
- `fully_shard()` will pass `model` as the `module` argument to every `FlatParamHandle`.
- `FullyShardedDataParallel` will pass `seq1` and `seq2` as the `comm_module` argument to the two `FlatParamHandle`s, respectively.
- `fully_shard()` will pass `seq1` and `seq2` as the `comm_module` argument to the two `FlatParamHandle`s, respectively.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90387
Approved by: https://github.com/mrshenli
This is the last PR for integrating 2D into core distributed.
This PR does the following:
1. Add optimizer.py: this adds the ability to load a state_dict in conjunction with the FSDP sharded optimizer state.
2. Update default_planner.py to support 2D checkpoints.
3. Add test_fsdp_optim_state.py as a unit test for item 1.
4. Fix a bug in torch/testing/_internal/distributed/checkpoint_utils.py.
5. Rename the files for the APIs that should be private. Further organization and cleanup will follow in subsequent PRs. #90328
Docstrings and an integration test will be added in follow-up PRs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90212
Approved by: https://github.com/wanchaol
**Motivation:**
Add a helper to map an FQN to the corresponding flat_param. The helper gets the flat_param directly from the fsdp_state and the flat_param handle, since the flat_param is not registered to the module when `use_orig_params=True`.
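A hedged sketch of what such a mapping can look like for the wrapper path; the internal attribute names used here (`_handles`, `_fqns`) are assumptions for illustration, not the helper this PR actually adds:
```python
from typing import Dict

import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def fqn_to_flat_param(root: FSDP) -> Dict[str, nn.Parameter]:
    mapping: Dict[str, nn.Parameter] = {}
    for fsdp_module in FSDP.fsdp_modules(root):
        for handle in fsdp_module._handles:      # assumed internal handle list
            flat_param = handle.flat_param
            for fqn in flat_param._fqns:         # assumed per-original-parameter FQNs
                mapping[fqn] = flat_param
    return mapping
```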
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89899
Approved by: https://github.com/awgu
As observed by @aazzolini, some ops have `Optional[Tensor]` returns that can be `None` (e.g., `native_layer_norm_backward`). This is a mismatch between the C++ ATen op signature and Python's `None`, and we need to handle it on the Python side.
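A hedged sketch of the Python-side handling (not DTensor's actual code; `clone()` stands in for wrapping an output): any per-output processing must pass `None` entries through unchanged.
```python
from typing import Optional, Tuple

import torch

def wrap_outputs(
    outputs: Tuple[Optional[torch.Tensor], ...]
) -> Tuple[Optional[torch.Tensor], ...]:
    # Ops like native_layer_norm_backward can return None for Optional[Tensor]
    # outputs, so only wrap the non-None tensors.
    return tuple(o.clone() if o is not None else None for o in outputs)

grads = (torch.randn(3), None, torch.randn(3))
wrapped = wrap_outputs(grads)  # the None entry survives untouched
```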
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90241
Approved by: https://github.com/aazzolini
I need to rebase later after Shen's PRs land.
The idea is to only register the pre/post-forward hook on the _root modules_ among the modules that consume a `FlatParameter`. (Yes, the term _root module_ is heavily overloaded. We may want to clarify that at some point. Here, _root_ is being used in the graph sense, meaning parent-less, and the scope is only among the modules consuming a `FlatParameter`.)
This avoids unnecessary pre/post-forward hooks running, which would lead to errors because the unshard is not truly idempotent.
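A hedged sketch of the "root among consumers" computation (graph sense, parent-less within the set); this is illustrative only, not FSDP's implementation:
```python
from typing import List, Set

import torch.nn as nn

def get_comm_root_modules(consumers: Set[nn.Module]) -> List[nn.Module]:
    # Keep only the modules that have no ancestor also in `consumers`; these are
    # the modules on which the pre/post-forward hooks would be registered.
    roots = []
    for module in consumers:
        has_ancestor_in_set = any(
            other is not module and module in set(other.modules())
            for other in consumers
        )
        if not has_ancestor_in_set:
            roots.append(module)
    return roots

seq = nn.Sequential(nn.Linear(4, 4), nn.ReLU(), nn.Linear(4, 4))
print(get_comm_root_modules({seq, seq[0], seq[2]}))  # only `seq` is parent-less
```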
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90201
Approved by: https://github.com/mrshenli, https://github.com/rohan-varma
This PR gets rid of torchgen `FunctionSchema` parsing and parses the schema manually. This should resolve the torchgen package issue and also provide some perf wins when running DTensor eagerly.
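For intuition only, a hedged sketch of manual schema-string parsing; DTensor's real parser handles far more of the ATen schema grammar:
```python
from typing import Tuple

def parse_op_name_and_returns(schema: str) -> Tuple[str, str]:
    # Split off the op name before the argument list, then take the text after
    # the final "->" as the (possibly parenthesized) return spec.
    name, rest = schema.split("(", 1)
    returns = rest.rsplit("->", 1)[1].strip()
    return name, returns

print(parse_op_name_and_returns(
    "aten::add.Tensor(Tensor self, Tensor other, *, Scalar alpha=1) -> Tensor"
))  # ('aten::add.Tensor', 'Tensor')
```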
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90106
Approved by: https://github.com/awgu
In PyTorch, the optimizer `state_dict` always uses numbers to index the per-parameter optimizer state.
The composability workstream now needs an FQN-based way to index the optimizer `state_dict` for parameters.
For example, an SGD optimizer might have something like this in its `state_dict`:
```
{'state':
    {0: {'momentum_buffer': tensor(...)},
     1: {'momentum_buffer': tensor(...)},
     ...
    },
 'param_groups':
    [{'lr': 0.001, 'momentum': 0.9, 'dampening': 0, 'weight_decay': 0, 'nesterov': False, 'maximize': False, 'foreach': None, 'differentiable': False, 'params': [0, 1, 2, 3, 4, 5, 6, 7]}]
}
```
And in `NamedOptimizer` we want the `state_dict` to be:
```
{'state':
    {'net1.0.weight': {'momentum_buffer': tensor(...)},
     'net1.0.bias': {'momentum_buffer': tensor(...)},
     ...
    },
 'param_groups':
    [{'lr': 0.001, 'momentum': 0.9, 'dampening': 0, 'weight_decay': 0, 'nesterov': False, 'maximize': False, 'foreach': None, 'differentiable': False, 'params': ['net1.0.weight', 'net1.0.bias', 'net2.0.weight', 'net2.0.bias', 'net3.weight', 'net3.bias', 'net4.1.weight', 'net4.1.bias']}]
}
```
We also want to support `load_state_dict` to enable optim `state_dict` overrides for `NamedOptimizer`.
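A minimal sketch of the index-to-FQN remapping shown above (this is not `NamedOptimizer`'s implementation; it assumes the optimizer was constructed over `model.parameters()` in `named_parameters()` order):
```python
from typing import Any, Dict

import torch
import torch.nn as nn

def fqn_optim_state_dict(model: nn.Module, optim: torch.optim.Optimizer) -> Dict[str, Any]:
    fqns = [name for name, _ in model.named_parameters()]
    sd = optim.state_dict()
    # Replace integer parameter indices with FQNs in both 'state' and 'param_groups'.
    sd["state"] = {fqns[idx]: state for idx, state in sd["state"].items()}
    for group in sd["param_groups"]:
        group["params"] = [fqns[idx] for idx in group["params"]]
    return sd

model = nn.Sequential(nn.Linear(2, 2))
opt = torch.optim.SGD(model.parameters(), lr=0.001, momentum=0.9)
model(torch.randn(1, 2)).sum().backward()
opt.step()  # populates momentum_buffer
print(fqn_optim_state_dict(model, opt)["state"].keys())  # dict_keys(['0.weight', '0.bias'])
```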
For the next couple of PRs/diffs, we also need to:
1. Make `NamedOptimizer` work with FSDP (e.g., registering a hook for a model wrapped with FSDP) and other PTD/PT components.
2. Make `NamedOptimizer` work well with `apply_optim_in_backward`.
3. Also upstream `CombinedOptimizer`.
Differential Revision: [D41432088](https://our.internmc.facebook.com/intern/diff/D41432088/)
**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D41432088/)!
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89480
Approved by: https://github.com/rohan-varma
Summary:
This diff is reverting D41682843
D41682843 has been identified to be causing the following test or build failures:
Tests affected:
- https://www.internalfb.com/intern/test/281475048939643/
Here's the Multisect link:
https://www.internalfb.com/intern/testinfra/multisect/1444954
Here are the tasks that are relevant to this breakage:
T93770103: 5 tests started failing for oncall assistant_multimodal in the last 2 weeks
We're generating a revert to back out the changes in this diff; please note that the backout may land if someone accepts it.
Test Plan: NA
Reviewed By: zyan0, atuljangra, YazhiGao
Differential Revision: D41710749
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90132
Approved by: https://github.com/awgu
For PyTorch FSDP, the only way that gradients are in low precision is if `keep_low_precision_grads=True` or if the user turns on AMP. This PR adds tests for the former and improves the documentation for `clip_grad_norm_()`, especially around these non-full-precision cases.
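A hedged usage sketch of the `keep_low_precision_grads=True` case the new tests cover (it assumes an initialized process group and a CUDA device):
```python
import torch
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, MixedPrecision

# With keep_low_precision_grads=True, gradients remain in param_dtype after the
# post-backward cast, which is the non-full-precision case clip_grad_norm_ must handle.
mp = MixedPrecision(param_dtype=torch.float16, keep_low_precision_grads=True)
model = FSDP(nn.Linear(8, 8).cuda(), mixed_precision=mp)
model(torch.randn(4, 8, device="cuda")).sum().backward()
total_norm = model.clip_grad_norm_(max_norm=1.0)
```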
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90028
Approved by: https://github.com/rohan-varma
For any `flat_param.data = flat_param.to(...)` or `flat_param.grad.data = flat_param.grad.to(...)`, we must also refresh sharded parameter/gradient views, respectively, if the storage changes.
For `keep_low_precision_grads=True` and a sharded strategy, we cast the gradient back to the low precision using `.data` to bypass the PyTorch check that a parameter and its gradient have the same dtype. For `use_orig_params=True` before this PR, the gradient would incorrectly still be in full precision, not low precision, since we did not refresh views (this can actually be considered a memory leak since we have two copies of the gradient now, one in low precision and one in full precision). This PR refreshes the views.
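A minimal sketch of the `.data` trick described above (not FSDP's code): assigning through `.data` swaps the gradient's dtype while bypassing the check that a parameter and its gradient share a dtype, which is exactly why the dependent sharded views must then be refreshed explicitly.
```python
import torch

param = torch.nn.Parameter(torch.randn(4))
param.grad = torch.randn(4)
# A direct `param.grad = fp16_tensor` would fail the dtype check; `.data` bypasses it.
param.grad.data = param.grad.to(torch.float16)
print(param.dtype, param.grad.dtype)  # torch.float32 torch.float16
```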
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90027
Approved by: https://github.com/mrshenli