Commit Graph

161 Commits

Author SHA1 Message Date
Boyuan Feng
35d3adb4b0 Add ATen Op _chunk_cat and _chunk_cat.out (#121081)
# Motivation

In the backward pass of per-parameter-sharding FSDP, each rank performs a reduce-scatter to sync gradients across ranks. A rank chunks each gradient tensor into `world_size` slices along the 0-th dimension and concatenates all slices along the 1st dimension. Gradient tensors are padded before concatenation when `tensor.size(0) % world_size != 0`.

### Example 1
Consider `world_size=3` and tensors A (2x4), B (3x3), C (1x2):

Input tensors:
```
AAAA   BBB   CC
AAAA   BBB
       BBB
```

Reduce-scatter-copy-in Output:
```
AAAABBBCC
AAAABBB00
0000BBB00
```

### Example 2
Consider `world_size=2` and tensors A (2x4), B (3x3), C (1x2), D (4x2):

Input tensors:
```
AAAA   BBB   CC   DD
AAAA   BBB        DD
       BBB        DD
                  DD
```

Reduce-scatter-copy-in first pads each tensor so its dim-0 size is divisible by `world_size`:
```
AAAA   BBB   CC   DD
AAAA   BBB   00   DD
       BBB        DD
       000        DD
```

Then it chunks each padded tensor into `world_size` pieces along dim 0 and concatenates the flattened chunks along dim 1 as the output:
```
AAAABBBBBBCCDDDD
AAAABBB00000DDDD
```

The performance of reduce-scatter-copy-in is critical to per-parameter sharding FSDP. However, implementing reduce-scatter-copy-in by composing existing ATen ops involves `cat` and irregular `pad`, leading to redundant data copies and unsatisfactory performance.

# PR
We provide ATen-native support for reduce-scatter-copy-in, namely `_chunk_cat()`:

```
_chunk_cat(Tensor[] tensors, int dim, int num_chunks) -> Tensor
```

This PR includes the registration of `_chunk_cat` and `_chunk_cat.out`, OpInfo tests, and a basic implementation composing existing ATen ops.
In the next PR, we will add the CUDA implementation. Compared with baselines composed of existing ATen ops, the `_chunk_cat()` CUDA implementation improves copy bandwidth from 498 GB/s to 966 GB/s on a production benchmark.
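
For intuition, here is a minimal reference sketch of the `dim=0` case composed from existing ops; the helper name and shape handling are illustrative only, not the ATen kernel added here:

```python
import torch

def chunk_cat_reference(tensors, num_chunks):
    # Illustrative dim=0 reference: pad each tensor along dim 0 to a multiple
    # of num_chunks, flatten each chunk, and concatenate the chunks row-wise.
    rows = []
    for t in tensors:
        n = t.size(0)
        padded = (n + num_chunks - 1) // num_chunks * num_chunks
        if padded != n:
            t = torch.cat([t, t.new_zeros((padded - n, *t.shape[1:]))], dim=0)
        rows.append(t.reshape(num_chunks, -1))
    return torch.cat(rows, dim=1)

# Example 2 above: world_size=2, A (2x4), B (3x3), C (1x2), D (4x2)
A, B, C, D = torch.ones(2, 4), torch.ones(3, 3), torch.ones(1, 2), torch.ones(4, 2)
print(chunk_cat_reference([A, B, C, D], num_chunks=2).shape)  # torch.Size([2, 16])
```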

## Requirements on input

1. If input tensors have different ndims, dim should be non-negative and less than the ndim of every input tensor. If all input tensors have the same ndim, both negative and non-negative dim are supported.
2. All tensors must have the same sizes for dimensions 0, ..., wrapped_dim-1. There are no requirements on dimensions wrapped_dim and beyond.
3. num_chunks must be positive.
4. The input tensor list must be non-empty, and each input tensor must have at least one element.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/121081
Approved by: https://github.com/albanD
2024-03-08 21:48:12 +00:00
PyTorch MergeBot
b3b585af64 Revert "[codemod] markDynamoStrictTest batch 16 (#117218)"
This reverts commit 47119785ac.

Reverted https://github.com/pytorch/pytorch/pull/117218 on behalf of https://github.com/zou3519 due to just felt like reverting this ([comment](https://github.com/pytorch/pytorch/pull/117218#issuecomment-1888360366))
2024-01-12 03:06:20 +00:00
rzou
47119785ac [codemod] markDynamoStrictTest batch 16 (#117218)
[codemod] markDynamoStrictTest test_dataloader
[codemod] markDynamoStrictTest test_public_bindings
[codemod] markDynamoStrictTest test_namedtensor
[codemod] markDynamoStrictTest test_fx
[codemod] markDynamoStrictTest test_content_store
[codemod] markDynamoStrictTest test_schema_check
[codemod] markDynamoStrictTest lazy/test_ts_opinfo
[codemod] markDynamoStrictTest functorch/test_ops
Pull Request resolved: https://github.com/pytorch/pytorch/pull/117218
Approved by: https://github.com/bdhirsh
2024-01-12 00:32:36 +00:00
Yukio Siraichi
d8ad74857c Run translation validation on tracing error. (#106645)
This PR wraps the `InstructionTranslator` run in a try-catch block so that translation
validation (TV) runs if tracing ends up raising an error.

In this context, we run TV to catch simplification errors, which may render
`ShapeEnv.divisible` and `ShapeEnv.replacements` incorrect.

For example, #101173 describes a SymPy simplification bug that doesn't reach TV, since
TV runs only at the end of tracing.
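
As a rough illustration of the pattern (names here are hypothetical, not the actual dynamo internals):

```python
# Hypothetical sketch only: the real change wraps dynamo's InstructionTranslator
# and validates the ShapeEnv with Z3; these names are illustrative.
def run_translation_validation(shape_env):
    ...  # check ShapeEnv.divisible / ShapeEnv.replacements against the recorded facts

def run_with_validation_on_error(translator, shape_env):
    try:
        translator.run()
    except Exception:
        # Run TV before re-raising so simplification bugs surface even when
        # tracing fails partway through.
        run_translation_validation(shape_env)
        raise
```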

Pull Request resolved: https://github.com/pytorch/pytorch/pull/106645
Approved by: https://github.com/ezyang
2023-08-14 13:43:34 +00:00
Jason Lu
bc88028e8e Back out "Reland "Make adding buffers more like adding parameters (#104069)" (#106224)" (#106743)
Summary:
Original commit changeset: 81319beb97f3

Original Phabricator Diff: D47961182

Test Plan: revert to maintain backward compat with legacy ads_dper3 production package. Read details in: S357822

Reviewed By: atuljangra

Differential Revision: D48131623

@diff-train-skip-merge
(D48131623 landed internally)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106743
Approved by: https://github.com/malfet
2023-08-08 15:27:34 +00:00
Yukio Siraichi
33e70e34a3 More readable Z3 expressions printer. (#106643)
This PR makes Z3 expressions easier to read and understand by creating a custom printer
for them.

Z3 expressions can be printed in 2 forms:

1. Using the builtin `str(e)` function
2. Using the `e.sexpr()` method

The problem is that (1) is a bit hard to read because its line breaks are not
intuitive, while (2) is nicer but the `to_int` and `to_real` conversions clutter things up.
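
For reference, a trivial example of the two built-in forms (assuming the `z3` Python bindings are installed):

```python
import z3

a, b = z3.Ints("a b")
e = z3.And(a + b >= 0, (a + b) % 3 == 0)
print(str(e))     # form (1): built-in printer
print(e.sexpr())  # form (2): s-expression printer
```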

The custom printer is an improved `sexpr()` function:

- Keeps everything on one line
- Gets rid of the `to_int` and `to_real` functions
- Reconstructs the floor division operations
- Merges commutative operation chains

Pull Request resolved: https://github.com/pytorch/pytorch/pull/106643
Approved by: https://github.com/ezyang
2023-08-07 16:52:22 +00:00
Benjamin Ghaemmaghami
424dc238f4 Fix split module interaction with dead code (#104554)
Summary:
This change fixes split_module's interaction with dead code. Previously, if a dead region was split out, split_module would throw an error while attempting to access the outputs of the partition, even though the partition has no outputs.

This change adds a new unit test to cover the dead code case and changes the output check to allow no outputs. A split submodule with no outputs now returns None, like a normal Python function.

Unit Test Added:
test_split_module_dead_code

A module with dead code:
```
class ModWithDeadCode(torch.nn.Module):
            def forward(self, x):
                output = x * 2 # we want this
                dead_line = x + 2 # this is dead
                return output
```

Before:
```
torch/fx/passes/split_module.py, line 357, in split_module
base_mod_env[list(partition.outputs)[0]] = output_val
IndexError: list index out of range
```

After:
```
class GraphModule(torch.nn.Module):
    def forward(self, x):
        # No stacktrace found for following nodes
        submod_2 = self.submod_2(x)
        submod_1 = self.submod_1(x);  x = None
        return submod_1

    class GraphModule(torch.nn.Module):
        def forward(self, x):
            # No stacktrace found for following nodes
            add = x + 2;  x = None
            return None

    class GraphModule(torch.nn.Module):
        def forward(self, x):
            # No stacktrace found for following nodes
            mul = x * 2;  x = None
            return mul
```
Submod 2 is correctly extracted
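
For reference, a minimal repro sketch of this scenario (the partition callback and numbering are illustrative):

```python
import torch
from torch.fx import symbolic_trace
from torch.fx.passes.split_module import split_module

class ModWithDeadCode(torch.nn.Module):
    def forward(self, x):
        output = x * 2     # we want this
        dead_line = x + 2  # this is dead
        return output

mod = ModWithDeadCode()
traced = symbolic_trace(mod)

# Put the dead `add` node in its own partition so it is split out with no outputs.
def split_callback(node):
    return 0 if node.name == "add" else 1

split = split_module(traced, mod, split_callback)
print(split.code)  # with this change, the dead submodule simply returns None
```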

Test Plan: Tested with new unit test

Differential Revision: D47196732

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104554
Approved by: https://github.com/yf225
2023-08-03 21:36:35 +00:00
Mikayla Gawarecki
d8e5f2aa6d Reland "Make adding buffers more like adding parameters (#104069)" (#106224)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/106224
Approved by: https://github.com/atalman, https://github.com/albanD
2023-07-31 17:18:56 +00:00
Andrey Talman
c6653b65d8 Back out "Make adding buffers more like adding parameters (#104069)" (#105581)
Summary:
D47537831 is breaking pyper tests: https://fb.workplace.com/groups/802176577445480/posts/1018902842439518/

with `TypeError: register_buffer() takes 3 positional arguments but 4 were given`

Original commit changeset: d4b4069fbd38

Original Phabricator Diff: D47537831

Test Plan:
```
buck2 run //caffe2/torch/fb/training_toolkit/integration_tests/training_lifecycle/cogwheel_tests/pyper_release_v2:cogwheel_smallworld_inline_cvr_infer_pyper_pyper__canary_offline_training-launcher -- --run-harness-in-tupperware --build-fbpkg ads_dper3 --build-fbpkg training_platform
```

Reviewed By: atalman

Differential Revision: D47600140

Pull Request resolved: https://github.com/pytorch/pytorch/pull/105581
Approved by: https://github.com/mikaylagawarecki
2023-07-20 03:39:53 +00:00
ekamiti
32d422f335 Make adding buffers more like adding parameters (#104069)
Add semantics for creating a buffer object that mirror those for creating a parameter. This is done by introducing a new `Buffer` class that can be used for type disambiguation. The underlying functionality of registering a buffer remains the same, as the `register_buffer` method has not been changed. The `persistent` parameter in the `Buffer` type indicates whether the buffer should be persistent or not. The other non-test changes get the new `Buffer` type recognized by inductor and dynamo. The remaining test changes verify that the `Buffer` type can be used as a drop-in replacement for `register_buffer`, since it just leads to `register_buffer` being called. Normal tensors can still be used as buffers, so these changes are intended to be backwards compatible.

Fixes #35735
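
A minimal usage sketch of the new type as described above (the `persistent` keyword mirrors the `register_buffer` argument):

```python
import torch
import torch.nn as nn

class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(4, 4))
        # Assigning a Buffer registers it, just as register_buffer would.
        self.running_stat = nn.Buffer(torch.zeros(4), persistent=False)
```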

Pull Request resolved: https://github.com/pytorch/pytorch/pull/104069
Approved by: https://github.com/mikaylagawarecki
2023-07-17 17:59:05 +00:00
XiaobingSuper
9b0b31a5e3 fix conv+bn folding issue for mixed dtype (#99696)
Align the conv+bn folding behavior with the JIT path for the mixed-dtype case: always keep the conv's weight and bias dtypes after folding.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/99696
Approved by: https://github.com/jgong5, https://github.com/jansel
2023-04-23 05:13:40 +00:00
Yanan Cao (PyTorch)
039b4c8809 Add meta function for _upsample_bilinear2d_aa (#94982)
Differential Revision: D43353000

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94982
Approved by: https://github.com/ezyang
2023-02-19 07:11:20 +00:00
Xuehai Pan
046e88a291 [BE] [3/3] Rewrite super() calls in test (#94592)
Rewrite calls to the Python built-in `super()`. Only non-semantic changes are applied.

- #94587
- #94588
- #94592

Also, methods with only a `super()` call are removed:

```diff
class MyModule(nn.Module):
-   def __init__(self):
-       super().__init__()
-
    def forward(self, ...):
        ...
```

Cases where the rewrite would change semantics are kept unchanged. E.g.:

f152a79be9/caffe2/python/net_printer.py (L184-L190)

f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94592
Approved by: https://github.com/ezyang, https://github.com/seemethere
2023-02-12 22:20:53 +00:00
Aaron Gokaslan
67d9790985 [BE] Apply almost all remaining flake8-comprehension checks (#94676)
Applies the remaining flake8-comprehension fixes and checks. This change replaces all remaining unnecessary generator expressions with list/dict/set comprehensions, which are more succinct, more performant, and better supported by our torch.jit compiler. It also removes useless generators such as `set(a for a in b)`, resolving them into just the `set` call.
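
For example, the kind of rewrite involved:

```python
items = ["a", "b", "a"]

# Before: unnecessary generator expressions
unique = set(a for a in items)
pairs = dict((k, i) for i, k in enumerate(items))

# After: the direct call / comprehension the codemod produces
unique = set(items)
pairs = {k: i for i, k in enumerate(items)}
```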

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676
Approved by: https://github.com/ezyang
2023-02-12 01:01:25 +00:00
Aaron Gokaslan
9171f7d4cd [BE] Modernize PyTorch even more for 3.8 with pyupgrade (#94520)
Applies some more pyupgrade fixits to PyTorch

Pull Request resolved: https://github.com/pytorch/pytorch/pull/94520
Approved by: https://github.com/ezyang
2023-02-10 18:02:50 +00:00
Tran Le
b769005924 [fx][passes] Implement annotate getitem node FX passes (#90237)
Summary: One common cause of jit unscriptability issues is the loss of node type annotations on local names after one or more FX transforms. One way to improve type coverage is to eagerly annotate the type of `getitem` nodes from their parent sequence node. This diff introduces an FX pass to do that.
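
A rough sketch of the idea (illustrative only; the actual pass lives under torch.fx.passes and handles more container types):

```python
import operator
import typing

def annotate_getitem_nodes(graph):
    # Propagate element types from an annotated sequence node onto its
    # getitem children, e.g. List[torch.Tensor] -> torch.Tensor.
    for node in graph.nodes:
        if node.target is operator.getitem and node.type is None:
            parent_type = getattr(node.args[0], "type", None)
            if parent_type is not None and typing.get_origin(parent_type) in (list, tuple):
                node.type = typing.get_args(parent_type)[0]
```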

Test Plan:
```
buck2 test //caffe2/test:fx_experimental
```

Reviewed By: xush6528

Differential Revision: D41749744

Pull Request resolved: https://github.com/pytorch/pytorch/pull/90237
Approved by: https://github.com/xush6528
2022-12-06 23:18:55 +00:00
Philip Meier
bc73affdad prepare removal of deprecated functionality in torch.testing (#87969)
_Redo of #86586 with all BC breaking changes granularly placed into separate commits._

---

Per title. Deprecation happened on Feb 25, 2022 in c6f1bbc0ac, which made it into the 1.12 release. Since it is now 245 days later and the next release will be 1.14, the removals later in the stack comply with the [BC policy](https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#minimizing-the-disruption-of-bc-breaking-changes).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87969
Approved by: https://github.com/mruberry
2022-11-02 14:04:48 +00:00
Mike Iovine
f325c29b05 [fx] Make NormalizeArgs preserve node type (#85637)
Summary: Make `NormalizeArgs` preserve node types when transforming the graph. This bug is preventing me from scripting a graph that goes through the fx2trt `acc_tracer`.

Test Plan: New unit test

Reviewed By: ipiszy

Differential Revision: D39753021

Pull Request resolved: https://github.com/pytorch/pytorch/pull/85637
Approved by: https://github.com/Chillee
2022-09-26 21:30:16 +00:00
Edward Z. Yang
604487f239 OpInfo for Slice (#85554)
This is based on wconstab tests from #84680

Technically, slice is covered by the __getitem__ OpInfo, but it is easier to debug/test on a narrower internal function that only uses this functionality and not other advanced indexing features.

Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85554
Approved by: https://github.com/mruberry, https://github.com/wconstab
2022-09-23 22:01:32 +00:00
Edward Z. Yang
5b88a2078b Follow GitHub relabeling of oncall: fx for test owners (#81821)
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81821
Approved by: https://github.com/janeyx99
2022-07-21 01:50:06 +00:00
Drazen Borkovic
9402219a36 Move serialize_module() out of OSS graph_manipulation.py to internal (#80785)
Differential Revision: D37582495

Pull Request resolved: https://github.com/pytorch/pytorch/pull/80785
Approved by: https://github.com/jfix71
2022-07-05 23:39:13 +00:00
Horace He
4d88affb5d Ported proxy tensor tests over to core (#78890)
Will fill out later
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78890
Approved by: https://github.com/ezyang, https://github.com/zou3519
2022-06-07 00:28:53 +00:00
Horace He
50cadfae10 Add strictness check and made tensors into leaves if input tensors were leaves (#77474)
I think this makes sense to do? Otherwise, if you call `backward()` in your traced function, you can't get gradients out of any tensors that should have been leaves.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77474
Approved by: https://github.com/ezyang
2022-05-21 01:16:39 +00:00
samdow
ba0ca0f591 Add torch dispatch mode to ProxyTensor tracing (#77174)
Uses a mode for ProxyTensor tracing so that it traces factory functions as well

cc @dhruvbird
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77174
Approved by: https://github.com/ezyang
2022-05-19 19:53:57 +00:00
Jordan Fix
18e36a6295 [graph_manipulation] Set fused dtypes for all constant params/buffers (#77401)
Summary: We were handling constant attrs in a few different ways before, leading to confusion and missed handling for fused dtypes. This diff consolidates some of that code and unbreaks current breakage.

Test Plan: CI. Recently broken tests now pass.

Differential Revision: D36335238

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77401
Approved by: https://github.com/jaybean-dev, https://github.com/jamesr66a
2022-05-17 07:42:29 +00:00
James Reed
7311390d35 [WIP] Make constructor calls in experimental MetaTracer serializable
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76789

Approved by: https://github.com/pbelevich
2022-05-11 00:19:47 +00:00
Elias Ellison
023aafbcd7 Fix for normalizing signature for op overloads (#77182)
Previously, we were taking the `.op` from OpOverload/OpOverloadPacket and looking for a mapping in `_jit_builtins` for their signature. Those only exist for operators on the public API, not the overload packets, e.g. `torch.resize_as_` rather than `torch.ops.aten.resize_as_` (at least in this case, and I'm pretty sure generally). The OpOverloads/OpOverloadPackets have schemas stored on them, so we can just use those directly.
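
For instance, the schema is available directly on the overload object (a quick illustration):

```python
import torch

# OpOverload objects carry their schema, so no _jit_builtins lookup is needed.
overload = torch.ops.aten.resize_as_.default
print(overload._schema)
```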
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77182
Approved by: https://github.com/anjali411
2022-05-10 23:36:26 +00:00
Horace He
fc95eda285 Added proxy tensor
This is the `__torch_dispatch__` subclass used for tracing by AOTAutograd (https://github.com/pytorch/functorch/blob/main/functorch/_src/python_key.py).

Given that a couple of folks are now interested in using this infra, it seems like a good idea to put it in core, and focus our efforts on a single implementation.

I put this up as a WIP, just for discussion; here are some questions off the top of my head.

1. What should be the intended way of extending this tracer? Should we define extension points, or should folks simply copy paste and modify? If we do define extension points, what are the extension points we should define?
2. There are some open questions about the way we're overriding FX to resolve some lingering issues (i.e. dealing with `nn.Parameter` and `call_module` calls). @ezyang implemented an alternate version of this tensor in https://github.com/albanD/subclass_zoo/blob/main/tracer_tensor.py, but it appears he ran into some issues with it that led to me submitting this implementation. That being said, I think some of the things over there should still be ported.
3. Given that this is going to be shared infra, what other features should we put in here? One that comes to mind is to allow for meta-tensor tracing (perhaps by default?), with a more solid fallback.

Some of the other implementations (for reference on requirements).

1. FX2TRT: D34868356 (internal only)
2. Edge's? @gmagogsfm

cc: @ezyang , @jamesr66a , @zou3519 , @gmagogsfm, @842974287
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74360
Approved by: https://github.com/ezyang
2022-05-03 22:46:30 +00:00
Jordan Fix
1c5a66c2aa [FX] Fix operator_schemas normalize_function to consider OpOverloads (#76469)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76469

Broken by Original commit changeset: 450e86c4e08a

Original Phabricator Diff: D35874477

Test Plan: Added unit test coverage to test_fx_experimental

Reviewed By: albanD

Differential Revision: D35978105

fbshipit-source-id: f22670b3b00a86777a26feaf4cb911595d150a17
(cherry picked from commit 91868b1e872c19d58d96a6c80a5e78dc6ffe4c7b)
2022-04-28 01:38:16 +00:00
James Reed
15e36f03ad Experimental MetaTensorTracer
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76003

Approved by: https://github.com/jansel
2022-04-20 02:04:33 +00:00
Mike Ruberry
de949a0e59 Various OpInfo architecture improvements
This PR makes the following improvements:

- moves the custom skip list for test_normalize_operator_exhaustive in test_fx_experimental to use the typical OpInfo skip architecture. The skips were updated to xfails, and that identified some operators which were no longer failing the test
- redundant tests with OpInfo-based testing in test_jit.py were removed
- test_dtypes was improved so its error messages are clear and it makes test_nondifferentiable redundant; the latter test has been removed
- OpInfo.supports_complex_autograd() is removed in favor of a more accurate and general test for whether the particular dtype is in the backward dtypes of the operator
- gradchecks have been improved to verify that an operator doesn't support grad if it claims not to
- gradchecks have been improved to test the gradient of all input tensors that require gradient
- the concept of "default test dtypes" has been removed
- excessive and mostly redundant out testing for elementwise unary operators has been removed
- metadata for whether an op supports nuanced "safe casting" to out behavior has been removed from OpInfos
- numerous skips have been converted to xfails
- numerous OpInfos have had their metadata fixed based on the new checks
- jit-specific utilities in common_methods_invocations.py have been moved to jit_programming_utils.py
Pull Request resolved: https://github.com/pytorch/pytorch/pull/75951
Approved by: https://github.com/ngimel
2022-04-18 21:55:32 +00:00
James Reed
214951bc6b [FX] Make split_module preserve proper placeholder names (#74736)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/74736

Previously, `split_module` would incorrectly carry over the `name` of placeholders rather than their `target`:

Original GraphModule

```
def forward(self, x, **kwargs):
    _kwargs = kwargs
    getitem = _kwargs['foo'];  _kwargs = None
    add = x + getitem;  x = getitem = None
    return add
```

After splitting:

```
def forward(self, x, _kwargs):
    submod_0 = self.submod_0(_kwargs);  _kwargs = None
    submod_1 = self.submod_1(x, submod_0);  x = submod_0 = None
    return submod_1
```

Notice that `**kwargs` is turned into `_kwargs`, which is incorrect and loses the kwarg expansion behavior. This patch switches the erroneous code in `split_module` to carry over the placeholder's `target`, resulting in the correct split code being emitted:

Original GraphModule

```
def forward(self, x, **kwargs):
    _kwargs = kwargs
    getitem = _kwargs['foo'];  _kwargs = None
    add = x + getitem;  x = getitem = None
    return add
```

After splitting:

```
def forward(self, x, **kwargs):
    _kwargs = kwargs
    submod_0 = self.submod_0(_kwargs);  _kwargs = None
    submod_1 = self.submod_1(x, submod_0);  x = submod_0 = None
    return submod_1
```

Test Plan: Imported from OSS

Reviewed By: VitalyFedyunin

Differential Revision: D35137361

Pulled By: jamesr66a

fbshipit-source-id: 46d079cfe16093c293fc268404fb8bc86ffcf583
(cherry picked from commit a020066281856184621561a8672eb57f5de31e92)
2022-03-25 23:36:27 +00:00
David Berard
15c98700ed Add CPU slow test job (#73748)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73748

This adds CPU-only slow test jobs, which previously would never run.

Includes fixes/skips for slow tests which fail (they need to be skipped now because they used to never run)

Test Plan: Imported from OSS

Reviewed By: malfet

Differential Revision: D34628803

Pulled By: davidberard98

fbshipit-source-id: c090ab7bf7bda9e24ec5cdefa6fd35c6310dbac0
(cherry picked from commit 06f7a94a57cc7023e9c5442be8298d20cd011144)
2022-03-23 21:17:27 +00:00
James Reed
a8d9fbb021 [FX] Make immutable_list and immutable_dict work with pytrees (#73766)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73766

Test Plan: Imported from OSS

Reviewed By: zou3519, Chillee

Differential Revision: D34630217

Pulled By: jamesr66a

fbshipit-source-id: f23420deaeed7e54d5e6759b486ca4a02243a7b3
(cherry picked from commit 8854c60e60e79b144077f3021d305ea3d06a2a21)
2022-03-04 19:35:41 +00:00
James Reed
dae7ed179f [FX] Make module getattr wrapper proxy buffers (#73612)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73612

Test Plan: Imported from OSS

Reviewed By: zdevito

Differential Revision: D34568113

Pulled By: jamesr66a

fbshipit-source-id: 95a7106cf6ce45999c1b3c06b34965e725961771
(cherry picked from commit 54841e028478ea641fb4d7895f726553b8b48353)
2022-03-03 04:32:49 +00:00
Ke Wen
d14de3139a [PyTorch FX] Return mapping of qualified names from split_module() (#73564)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73564

While maintaining API backward compatibility, add an optional output parameter to split_module() that returns a mapping from the new qualified names in the modules after the split to the old qualified names in the original module.
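
A usage sketch (the output-parameter shape follows the description above; the module and callback are illustrative):

```python
import torch
from torch.fx import symbolic_trace
from torch.fx.passes.split_module import split_module

class TwoLinear(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.a = torch.nn.Linear(4, 4)
        self.b = torch.nn.Linear(4, 4)

    def forward(self, x):
        return self.b(self.a(x))

mod = TwoLinear()
traced = symbolic_trace(mod)

# qualname_map is an output parameter populated in place during the split.
qualname_map = {}
split = split_module(
    traced, mod, lambda node: 0 if node.target == "a" else 1, qualname_map=qualname_map
)
# Maps new qualified names after the split back to the original qualified names.
print(qualname_map)
```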

Test Plan:
1. Added a test (test_split_qualname_mapping) to test_fx_experimental.py to check the returned qualname mapping
```
$ python test_fx_experimental.py
...
Ran 1084 tests in 73.464s
OK (skipped=531, expected failures=4)
```
2. Ask test_fx.py to accept split_module's new signature
```
$ python test_fx.py --accept
```

Reviewed By: jamesr66a

Differential Revision: D34541792

fbshipit-source-id: e8ec7e77ec884e4db7cad0c0593e31861c76e42d
(cherry picked from commit d2e5a95a353ee5fb52cdba065f127489e9df47ae)
2022-03-02 23:32:54 +00:00
Peter Bell
e8d226cd9a Remove some unnecessary python functional wrappers (#61608)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/61608

See #61544 for an example of issues created by functional wrappers. In this
case, these are directly wrapping the native function with no added
functionality. One exception was `bilinear` which was just missing the default
argument in C++, but was otherwise the same.

I've kept the symbol `torch.functional.istft` because it looks like public API,
but it could just as easily be moved to `_torch_docs.py`.

Test Plan: Imported from OSS

Reviewed By: ngimel

Differential Revision: D31401361

Pulled By: albanD

fbshipit-source-id: 162b74d0b2d4f2e5c4834687a94541960cefdd52
(cherry picked from commit 700cd73ca1)
2022-02-01 16:59:26 +00:00
Jason Ansel
7d613ab1d6 Fix indentation typo in test_fx_experimental.py (#71885)
Summary:
These tests were not actually running as they were defined in the local scope of another test

Pull Request resolved: https://github.com/pytorch/pytorch/pull/71885

Reviewed By: scottxu0730

Differential Revision: D33806251

Pulled By: jansel

fbshipit-source-id: 48a2d7b472f160759ef55e6fff1f8890511e3345
(cherry picked from commit 9ae14efb25)
2022-01-28 00:41:12 +00:00
XiaobingSuper
b8679ee1fc fix conv+bn folding issue when bn hasn't running states (#71259)
Summary:
When doing conv+bn folding where the bn has no running stats, both the JIT and FX paths raise errors:

```
import torch
import torch.nn as nn
import torch.fx.experimental.optimization as optimization

class M(nn.Module):
    def __init__(self):
        super(M, self).__init__()
        self.conv = nn.Conv2d(32, 64, 3, stride=2)
        self.bn = nn.BatchNorm2d(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=False)

    def forward(self, x):
        x = self.conv(x)
        x = self.bn(x)
        return x

x = torch.randn([1, 32, 50, 50])
model = M().eval()

'''
# jit path
with torch.no_grad():
    traced = torch.jit.trace(model, x).eval()
    traced = torch.jit.freeze(traced)
'''

# FX path
fused_model = optimization.fuse(model)
```

Errors before this fix:
1. JIT path
```
Traceback (most recent call last):
  File "bn_test.py", line 27, in <module>
    traced = torch.jit.freeze(traced)
  File "/home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.8/site-packages/torch/jit/_freeze.py", line 119, in freeze
    run_frozen_optimizations(out, optimize_numerics, preserved_methods)
  File "/home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.8/site-packages/torch/jit/_freeze.py", line 167, in run_frozen_optimizations
    torch._C._jit_pass_optimize_frozen_graph(mod.graph, optimize_numerics)
RuntimeError: Expected Tensor but got None
```
2. FX path
```
Traceback (most recent call last):
  File "bn_test.py", line 31, in <module>
    model = optimization.fuse(model, inplace=True)
  File "/home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.8/site-packages/torch/fx/experimental/optimization.py", line 71, in fuse
    fused_conv = fuse_conv_bn_eval(conv, bn)
  File "/home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.8/site-packages/torch/nn/utils/fusion.py", line 11, in fuse_conv_bn_eval
    fuse_conv_bn_weights(fused_conv.weight, fused_conv.bias,
  File "/home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.8/site-packages/torch/nn/utils/fusion.py", line 23, in fuse_conv_bn_weights
    bn_var_rsqrt = torch.rsqrt(bn_rv + bn_eps)
TypeError: unsupported operand type(s) for +: 'NoneType' and 'float'
```

This PR will fix this issue.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/71259

Reviewed By: anjali411

Differential Revision: D33595049

Pulled By: davidberard98

fbshipit-source-id: 0fe56bb2bb25d6d54ebc53789d2ad22458da9012
(cherry picked from commit 5672c08378)
2022-01-18 22:12:41 +00:00
James Reed
de902b5d02 [FX] Add a default_value arg to Graph.placeholder and fix split_module (#71016)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71016

I found out that `split_module` doesn't preserve default values for arguments. In trying to fix that, I noticed that `Graph.placeholder` doesn't make it easy to add a default argument when making a placeholder. This PR addresses both of those issues.
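
A small usage sketch of the new argument (kwarg name per this PR; illustrative only):

```python
import torch
import torch.fx as fx

g = fx.Graph()
# default_value lets the generated forward() carry a Python-level default.
x = g.placeholder("x", default_value=3)
g.output(x)

gm = fx.GraphModule(torch.nn.Module(), g)
print(gm.code)  # def forward(self, x = 3): ...
```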

Test Plan: Imported from OSS

Reviewed By: ansley

Differential Revision: D33482218

Pulled By: jamesr66a

fbshipit-source-id: 57ebcdab25d267333fb1034994e08fc1bdb128ee
2022-01-12 14:03:17 -08:00
Horace He
df6eb9bbab Fixed to_folder not saving dtype (#69983)
Summary:
As above.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/69983

Reviewed By: pbelevich, ngimel

Differential Revision: D33466529

Pulled By: Chillee

fbshipit-source-id: 2d2f0ad5b8e2492aba4c19fa034c8b6c0848a568
2022-01-06 22:15:56 -08:00
anjali411
4a6a5d1630 OpInfos for torch.{flatten, column_stack} (#69237)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69237

Test Plan: Imported from OSS

Reviewed By: mruberry

Differential Revision: D32988956

Pulled By: anjali411

fbshipit-source-id: b7f5c537ff9731f56232aa5647910f03edf4582a
2021-12-16 17:50:58 -08:00
Richard Zou
620a1fcb55 OpInfos for: normal, bernoulli, multinomial (#66358)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66358

Test Plan: - run tests

Reviewed By: mruberry

Differential Revision: D31551695

Pulled By: zou3519

fbshipit-source-id: cf1b43118a0414a1af9ece9ae8c0598b2701aa0a
2021-12-14 06:59:38 -08:00
Kushashwa Ravi Shrimali
2cb385dd6e OpInfo for nn.functional.dropout2d, revise sample inputs for dropout (#67891)
Summary:
Earlier, we were only testing inputs of shape `(5,)` for `nn.functional.dropout`, but since it's used a lot, I feel it's a good idea to test a few more shapes, including scalars. This PR:

1. Revises sample inputs for `nn.functional.dropout`
2. Adds an OpInfo for `nn.functional.dropout2d`.

A note regarding the documentation:

Looks like `nn.functional.dropout2d` also supports inputs of shape `(H, W)` apart from `(N, C, H, W) / (C, H, W)` but the [documentation](https://pytorch.org/docs/stable/generated/torch.nn.Dropout2d.html#torch.nn.Dropout2d) doesn't mention that (`H, W` case). Should that be revised or am I missing anything here? (Filed an issue here: https://github.com/pytorch/pytorch/issues/67892)

```python
# A 2D tensor is a valid input for Dropout2d
In [11]: tensor = torch.randn((3, 4), device='cpu', dtype=torch.float32)
In [12]: dropout2d = torch.nn.Dropout2d(p=0.5)

In [13]: dropout2d(tensor)
Out[13]:
tensor([[-0.1026, -0.0000, -0.0000, -0.0000],
        [-1.5647,  0.0000, -0.0000, -0.5820],
        [-0.0000, -3.2080,  0.1164, -3.6780]])
```

Issue Tracker: https://github.com/pytorch/pytorch/issues/54261

cc: mruberry zou3519

Pull Request resolved: https://github.com/pytorch/pytorch/pull/67891

Reviewed By: mrshenli

Differential Revision: D32628527

Pulled By: mruberry

fbshipit-source-id: 4c9b89550f1d49526e294378ce107eba9f29cabb
2021-12-08 08:54:16 -08:00
Nikita Vedeneev
c236247826 OpInfo tests for (svd|pca)_lowrank (#69107)
Summary:
As per title.

While working on this I discovered several issues with these methods related to gradient instabilities. I will file them and link them here later. It was quite painful to get these to pass all the tests given the discovered issues, sorry for the delay, mruberry!

cc jianyuh nikitaved pearu mruberry walterddr IvanYashchuk xwang233 Lezcano

Pull Request resolved: https://github.com/pytorch/pytorch/pull/69107

Reviewed By: zou3519

Differential Revision: D32920341

Pulled By: mruberry

fbshipit-source-id: 15b33e2b46acdcbff8a37d8e43e381eb55d1a296
2021-12-07 19:50:12 -08:00
Saketh Are
6a4fa86026 Add OpInfos for misc nn.functional operators (#68922)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68922

Reviewed By: Chillee

Differential Revision: D32842301

Pulled By: saketh-are

fbshipit-source-id: b7166faefb64668fc76cca6c528501b0d360c43b
2021-12-03 17:03:02 -08:00
Saketh Are
a07ffe8d0e Add OpInfos for combinations, cartesian_prod, sum_to_size, ldexp, as_strided (#68853)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68853

Reviewed By: davidberard98

Differential Revision: D32811147

Pulled By: saketh-are

fbshipit-source-id: 941dcf949072f8d10faf4d5a0fa0ef409ac6a7db
2021-12-02 21:22:56 -08:00
Kshiteej K
e5e0c19882 OpInfo : embedding_bag (#67252)
Summary:
Adds OpInfo for `embedding_bag`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/67252

Reviewed By: VitalyFedyunin

Differential Revision: D32462157

Pulled By: zou3519

fbshipit-source-id: 70303349a718720c4fa47519fa94ae900e052939
2021-12-01 07:00:50 -08:00
Nikita Shulga
14dc9759f2 Revert D32650384: OpInfos for torch.{flatten, column_stack}
Test Plan: revert-hammer

Differential Revision:
D32650384 (aceb46e4ce)

Original commit changeset: 9ead83b378d0

fbshipit-source-id: 3ef281e536b1f21a6f13c6c51309021cf92b53b2
2021-11-24 14:55:26 -08:00
anjali411
aceb46e4ce OpInfos for torch.{flatten, column_stack} (#67555)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67555

Test Plan: Imported from OSS

Reviewed By: cpuhrsch

Differential Revision: D32650384

Pulled By: anjali411

fbshipit-source-id: 9ead83b378d0ece60569e1a0fc7d8849f89566b3
2021-11-24 10:25:37 -08:00