Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62794
This PR updates JIT serialization to support pickling sparse COO tensors, and updates message.cpp to support sparse COO tensors in RPC messages.
This addresses a bug filed a few years ago: https://github.com/pytorch/pytorch/issues/30807.
I tested the fix by adding sparse tensor tests to rpc_test.py and dist_autograd_test.py.
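For reference, a minimal sketch of the kind of tensor now supported (the RPC call is illustrative and assumes an initialized RPC agent with a worker named "worker1"):
```python
import torch

# A sparse COO tensor that can now round-trip through JIT pickling and RPC.
indices = torch.tensor([[0, 1, 1], [2, 0, 2]])
values = torch.tensor([3.0, 4.0, 5.0])
sparse = torch.sparse_coo_tensor(indices, values, size=(2, 3))

# With an initialized RPC agent, something like:
#   torch.distributed.rpc.rpc_sync("worker1", torch.add, args=(sparse, sparse))
```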
cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse agolynski SciPioneer H-Huang mrzzd cbalioglu gcramer23 gmagogsfm
Test Plan: Imported from OSS
Reviewed By: soulitzer
Differential Revision: D30608848
Pulled By: gcramer23
fbshipit-source-id: 629ba8e4a3d8365875a709c9b87447c7a71204fb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63799
Add a new module that can be used for module swap with the `nni.LinearReLU` module in the convert function.
Currently supports INT8 only (the FP16 op doesn't have ReLU fusion yet).
Fixes #55393
Test Plan:
python test/test_quantization.py test_dynamic_fusion
Imported from OSS
Reviewed By: heitorschueroff
Differential Revision: D30502812
fbshipit-source-id: 3668e4f001a0626d469e17ac323acf582ee28a51
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62737
ReductionOpInfo is a specialization of OpInfo for reduction operators. For now, it is designed to work with reductions that return a single tensor and that reduce all elements along one or more dimensions to a single value. In particular this excludes operators such as `max` and `min` that return multiple tensors and `quantile` that can return multiple values.
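To illustrate the scope with a plain example (not part of the PR):
```python
import torch

t = torch.arange(6.0).reshape(2, 3)
t.sum()              # reduces all elements -> tensor(15.)
t.sum(dim=0)         # reduces one dimension -> tensor([3., 5., 7.])
t.sum(dim=(0, 1))    # reduces multiple dimensions -> tensor(15.)
torch.max(t, dim=0)  # excluded: returns (values, indices), i.e. multiple tensors
```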
fixes https://github.com/pytorch/pytorch/issues/49746
Test Plan: Imported from OSS
Reviewed By: ejguan
Differential Revision: D30406568
Pulled By: heitorschueroff
fbshipit-source-id: 218b1da1902f67bcf4c3681e2a0f0029a25d51f1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63831
Closes https://github.com/pytorch/pytorch/issues/63812
`at::mul_out` is not supported when `grad` itself requires grad, a case that arises when computing higher-order derivatives.
In this case, fall back to a mul + copy instead of mul_out.
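The actual change is in C++; the following is a minimal Python sketch of the equivalent fallback pattern (tensor names are illustrative):
```python
import torch

x = torch.randn(3, requires_grad=True)
grad = x * 2            # a non-leaf tensor that itself requires grad
factor = torch.randn(3)

if grad.requires_grad:
    # out= variants don't support autograd, so fall back to an
    # out-of-place mul; the result stays connected to the graph.
    grad = grad.mul(factor)
else:
    torch.mul(grad, factor, out=grad)
```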
ghstack-source-id: 136614644
Test Plan: UT
Reviewed By: SciPioneer
Differential Revision: D30505573
fbshipit-source-id: 83532b6207b3d80116fcc4dff0e5520d73b3454f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63983
Test for the fixes in D30545351. It should resolve the issue of the remote execution flag being populated incorrectly.
Test Plan: CI
Reviewed By: malfet, seemethere
Differential Revision: D30549443
fbshipit-source-id: b3895909f5cd654ba163b77950872b332fbad3fe
Summary:
This PR moves some modules into `common_modules` to see what it looks like.
While migrating some no-batch modules into `common_modules`, I noticed that `desc` is not used in the test name, which means we cannot use `-k` to filter tests. This PR moves the sample generation into `_parametrize_test` and passes the already generated `module_input` into users of `modules(module_db)`.
I can see this is a little different from OpInfo and would be happy to revert to the original implementation of `modules`.
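For context, a hypothetical sketch of the decorator flow described above (the parameter and attribute names here are assumptions, not the exact `common_modules` API):
```python
from torch.testing._internal.common_modules import modules, module_db
from torch.testing._internal.common_utils import TestCase

class TestModule(TestCase):
    @modules(module_db)
    def test_forward(self, device, dtype, module_info, module_input):
        # `module_input` is generated up front in `_parametrize_test`, and
        # its desc is baked into the test name, so `-k` can filter on it.
        m = module_info.module_cls(*module_input.constructor_input.args,
                                   **module_input.constructor_input.kwargs)
```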
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62999
Reviewed By: heitorschueroff
Differential Revision: D30522737
Pulled By: jbschlosser
fbshipit-source-id: 7ed1aeb3753fc97a4ad6f1a3c789727c78e1bc73
Summary:
Realized we were missing ROCm as a platform on which one could disable a flaky test (like how this issue specifies Windows: https://github.com/pytorch/pytorch/issues/61655).
cc jeffdaily sunway513 jithunnair-amd ROCmSupport
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63813
Reviewed By: seemethere
Differential Revision: D30498478
Pulled By: janeyx99
fbshipit-source-id: f1abe8677e1ddd01de3291e1618272ad8e287dc4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63463
Now that `torch.distributed.optim` gates DistributedOptimizer on RPC availability, the local SGD optimizer can be used on Windows.
ghstack-source-id: 136437632
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D30358922
fbshipit-source-id: 9b56aebf1075f026637296d338805ad8851c9d40
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63462
Now that `torch.distributed.optim` gates DistributedOptimizer on RPC availability, these tests can be run on windows.
ghstack-source-id: 136437635
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D30358923
fbshipit-source-id: 36739bdfe7214789f17de652d30c62c2bc124c73
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63711
This removes `_fork_process` from common_distributed.py and fixes all other call sites to use `spawn_process` instead.
ghstack-source-id: 136395719
Test Plan: waitforbuildbot
Reviewed By: xush6528
Differential Revision: D30463834
fbshipit-source-id: 0c09e8a996d0e5b912c8cdd45488a39951bac4db
Summary:
Turns on batch norm (BN) in autodiff:
1. outputs an empty tensor for running stats to bypass the autodiff issue with None;
2. fixes BN inference backward in cuDNN & MIOpen, where backward now falls back to the native batch norm kernel instead.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57321
Reviewed By: albanD, ngimel
Differential Revision: D30250419
Pulled By: jansel
fbshipit-source-id: a62553789c20fb50a820003a056f40d9d642dfaa
Summary:
As proof of concept, this PR uses the new `BinaryUfuncOpInfo` in broadcasting tests for `add`, `sub`, `mul`, `div`, `floor_div`, and `true_div`.
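The broadcasting property these tests exercise, as a plain example (not part of the PR):
```python
import torch

a = torch.randn(3, 1, 4)
b = torch.randn(5, 4)
# Operating on broadcastable shapes should match operating on
# explicitly expanded operands.
assert torch.equal(torch.add(a, b),
                   torch.add(a.expand(3, 5, 4), b.expand(3, 5, 4)))
```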
Pull Request resolved: https://github.com/pytorch/pytorch/pull/61964
Reviewed By: ngimel
Differential Revision: D30407734
Pulled By: mruberry
fbshipit-source-id: ada28994f43b0635f279f45a02ecba18bc8ee033
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63443
After https://github.com/pytorch/pytorch/pull/63442, all distributed
tests can run with opt-asan. As a result, we can now remove all of our fork-based tests.
This is the first PR in the stack; it removes fork-based tests from RPC.
ghstack-source-id: 136177744
Test Plan: waitforbuildbot
Reviewed By: lw
Differential Revision: D30384905
fbshipit-source-id: 86d438aebaa6cb02ae2a966fea244849849a1889
Summary:
We currently build breakpad from [this fork](https://github.com/driazati/breakpad) to include extra logic to restore signal handlers that were previously present. With some [new additions](https://github.com/google/breakpad/compare/main...driazati:main), this fork now includes a CMake-based build, so we can add breakpad as a proper dependency rather than relying on including it in Docker images as a system library, which is error-prone (we have a bunch of images) and hard to extend to MacOS / Windows. This also includes some changes to the crash handling code to support MacOS / Windows in a similar way to Linux.
```python
import torch
# On Windows this writes crashes to C:\Users\<user>\AppData\pytorch_crashes
# On MacOS/Linux this writes crashes to /tmp/pytorch_crashes
torch.utils._crash_handler.enable_minidumps()
# Easy way to cause a segfault and trigger the handler
torch.bincount(input=torch.tensor([9223372036854775807]))
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63186
Reviewed By: malfet, seemethere
Differential Revision: D30318404
Pulled By: driazati
fbshipit-source-id: 0d7daf3701cfaba5451cc529a0730272ab1eb1dc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62915
Up to 45% and 20% perf improvement on CUDA and CPU, respectively, with consistent improvement in perf for all cases -- see perf numbers in the comments below.
Test Plan: Imported from OSS
Reviewed By: heitorschueroff
Differential Revision: D30404006
Pulled By: anjali411
fbshipit-source-id: 565940da28c7761d993cf43346932c24292e8a4d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63481
This PR changes the SkipInfo decorators to use unittest.expectedFailure so that the test reports as XFAIL as opposed to PASSED.
Note that changing the expectedFailure in `torch/testing/_internal/common_device_type.py` (L879 at commit 30e1c74dc1) to an XFAIL is not possible because the decision of whether to decorate is delayed until the wrapper function is called.
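For reference, the standard `unittest.expectedFailure` behavior this relies on:
```python
import unittest

class Example(unittest.TestCase):
    @unittest.expectedFailure
    def test_known_bug(self):
        self.assertEqual(1, 2)  # reported as an expected failure, not PASSED

if __name__ == "__main__":
    unittest.main()
```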
fixes https://github.com/pytorch/pytorch/issues/63363
Test Plan: Imported from OSS
Reviewed By: ZolotukhinM
Differential Revision: D30397154
Pulled By: heitorschueroff
fbshipit-source-id: c5e4911969ad8667763eec4203dbbc6a51178592
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63361
Python multiprocessing doesn't support LSAN and produces false positives instead. As a result, this disables LSAN for these tests so that we can still run with opt-asan.
ghstack-source-id: 135962489
Test Plan: waitforbuildbot
Reviewed By: rohan-varma
Differential Revision: D30352269
fbshipit-source-id: f6ab5abce7bdef00cd5e1f5977424d2b151174af
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63406
The `RemoteException` will be thrown on the caller side when converting
the response message to IValue. Since it is a Python error, the error message needs to be extracted explicitly and the `PyErr` cleared.
Test Plan: Imported from OSS
Reviewed By: rohan-varma, ngimel
Differential Revision: D30372741
Pulled By: mrshenli
fbshipit-source-id: 1f72a7ee0c39cc2ef070f99884c142f7b3e0543d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63383
Per title
ghstack-source-id: 135966157
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D30358921
fbshipit-source-id: 965e054e525194b1ee55980340df275bab355c9b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63382
Per title
ghstack-source-id: 135966156
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D30255446
fbshipit-source-id: e6ffbf339db0bc5b4702d02b74a462309df07c75
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62279
Before buckets are rebuilt, `kDefaultFirstBucketBytes` is actually misleading: because we reverse the parameter indices when initializing the reducer, it is actually the size of the last bucket.
Currently, rebuilding buckets sets this as the first bucket size, but this change tests whether keeping it as the last bucket size can help perf.
This is currently experimental only, and we don't plan to land it unless experiments show a clear win.
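For context, a single-process sketch of the bucketing knobs involved (the first-bucket cap via `kDefaultFirstBucketBytes` is applied internally; the values below are examples, not recommendations):
```python
import os
import torch
import torch.distributed as dist

os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)

model = torch.nn.Linear(10, 10)
# bucket_cap_mb caps bucket sizes after rebuild; the first bucket is capped
# internally by kDefaultFirstBucketBytes. Because the reducer walks the
# parameters in reverse, that "first" bucket holds the last parameters.
ddp = torch.nn.parallel.DistributedDataParallel(model, bucket_cap_mb=25)
```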
ghstack-source-id: 135966897
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D29927931
fbshipit-source-id: 55b949986fa2c3bade6fcb4bf5b513461bf0f490
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63359
Fixes #63352. The problem was that in e.g. `torch.matmul(A, B)` with A,
B having shapes [3, 2, 0] and [0, 2], the code attempts to call
`A.view(-1, 0)` which fails due to "-1 being ambiguous". The solution is
to manually compute what we want the shape of the view to be.
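A minimal repro based on the description above:
```python
import torch

A = torch.randn(3, 2, 0)
B = torch.randn(0, 2)
# Previously this failed inside matmul on a view(-1, 0) reshape;
# with the fix it returns an all-zero result of the expected shape.
out = torch.matmul(A, B)
print(out.shape)  # torch.Size([3, 2, 2])
```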
Test Plan: - new tests
Reviewed By: ngimel
Differential Revision: D30351583
Pulled By: zou3519
fbshipit-source-id: 7625691fe8b85d96a4073409596a932c303e3e8c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63277
`PostLocalSGDState` requires a subgroup. To initialize this subgroup, a global process group must first be initialized. However, this imposes the restriction that a hook state can only be provided after distributed environment initialization, which is incompatible with the Lightning DDP plugin setup, where the hook state should be provided before distributed environment initialization.
Proposal: https://github.com/pytorch/pytorch/issues/59699
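For context, a sketch of the ordering constraint (the API names are assumptions based on the `ddp_comm_hooks` module, and `ddp_model` stands in for an existing DistributedDataParallel instance):
```python
import torch.distributed as dist
from torch.distributed.algorithms.ddp_comm_hooks import post_localSGD_hook as post_localSGD

dist.init_process_group("nccl")      # the global process group must exist first...
subgroup, _ = dist.new_subgroups()   # ...before the subgroup can be created
state = post_localSGD.PostLocalSGDState(
    process_group=None, subgroup=subgroup, start_localSGD_iter=100)
# ddp_model: an existing DistributedDataParallel instance (assumed)
ddp_model.register_comm_hook(state, post_localSGD.post_localSGD_hook)
```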
ghstack-source-id: 135848575
Test Plan: buck test mode/dev-nosan caffe2/test/distributed:distributed_nccl_fork -- test_ddp_hook_parity_post_localSGD
Reviewed By: cbalioglu
Differential Revision: D30325041
fbshipit-source-id: 7b870166d096d306c3f2f7c69816a705cec0bebd