Summary:
All of the pooling modules except MaxUnpool and LPPool return either a
Tensor or a Tuple[Tensor, Tensor]. The current type annotations are
inaccurate and prevent scripting the module when return_indices is set
to True.
There's no clean way to make this agree with mypy, because the correct
overload depends on the value of return_indices, an attribute.
I tried changing the annotations from `Tensor` to
`Union[Tensor, Tuple[Tensor, Tensor]]`, but that breaks a bunch of uses
that have return_indices=False.
For example, this breaks:
https://github.com/pytorch/pytorch/blob/4e94e84f65/torch/nn/modules/container.py#L139
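To illustrate the constraint, here is a minimal, hypothetical sketch (a toy module, not the actual pooling code) of why the `Union` annotation pushes a type error onto callers:
```
from typing import Tuple, Union

import torch
from torch import Tensor

class ToyPool(torch.nn.Module):
    """Toy stand-in for a pooling module whose return type depends on an attribute."""

    def __init__(self, return_indices: bool = False) -> None:
        super().__init__()
        self.return_indices = return_indices

    def forward(self, x: Tensor) -> Union[Tensor, Tuple[Tensor, Tensor]]:
        out, idx = x.max(dim=-1)
        return (out, idx) if self.return_indices else out

pool = ToyPool()  # return_indices=False, so a plain Tensor comes back at runtime...
y = pool(torch.randn(2, 3))
# ...but mypy only sees the Union, so expressions like `y + 1` fail to
# type-check: the correct overload depends on the runtime value of
# return_indices, which mypy cannot express for an attribute.
```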
This PR also cleans up how test names are constructed in test_jit;
previously we got name collisions when there were two tests on the
same nn.Module.
Fixes https://github.com/pytorch/pytorch/issues/45904
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65847
Reviewed By: ZolotukhinM
Differential Revision: D31462517
Pulled By: eellison
fbshipit-source-id: 6f9e8df1be6c75e5e1e9bae07cf3ad3603ba59bd
Summary:
Partially fixes https://github.com/pytorch/pytorch/issues/57505
Also fixes a warning I found when compiling:
```
/home/gaoxiang/pytorch-cub/torch/csrc/distributed/c10d/quantization/quantization_gpu.cu(7): warning: inline qualifier ignored for "__global__" function
```
I also updated the bfloat16 guard to CUDA 11.5
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64498
Reviewed By: mruberry
Differential Revision: D30917077
Pulled By: ngimel
fbshipit-source-id: fb9df08fd469038478a563014b5af7452b4b28c0
Summary:
Fixes https://github.com/pytorch/pytorch/issues/11959
This is an alternative to creating a new `CrossEntropyLossWithSoftLabels` class: this PR simply adds support for "soft targets" (i.e. class probabilities) to the existing `CrossEntropyLoss` and `NLLLoss` classes.
The implementation is simple and unoptimized for now, but future work can add higher-performance kernels for this case.
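As a usage sketch of the new behavior (toy shapes, assuming the post-PR `CrossEntropyLoss`):
```
import torch
import torch.nn as nn

loss_fn = nn.CrossEntropyLoss()
logits = torch.randn(3, 5, requires_grad=True)  # N=3 samples, C=5 classes

# Hard targets (unchanged): class indices of shape (N,)
hard_targets = torch.tensor([1, 0, 4])
print(loss_fn(logits, hard_targets))

# Soft targets (new): class probabilities of shape (N, C), rows summing to 1
soft_targets = torch.softmax(torch.randn(3, 5), dim=1)
print(loss_fn(logits, soft_targets))
```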
Pull Request resolved: https://github.com/pytorch/pytorch/pull/61044
Reviewed By: zou3519
Differential Revision: D29876894
Pulled By: jbschlosser
fbshipit-source-id: 75629abd432284e10d4640173bc1b9be3c52af00
Summary:
Fixes the Python part of https://github.com/pytorch/pytorch/issues/60747
Enhances the Python versions of `Transformer`, `TransformerEncoderLayer`, and `TransformerDecoderLayer` to accept callables as their activation functions. The old way of specifying the activation function as a string still works as well.
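For example, both styles below should now work (a quick sketch; the `d_model`/`nhead` values are arbitrary):
```
import torch.nn.functional as F
from torch import nn

# Old style: activation given as a string
layer_str = nn.TransformerEncoderLayer(d_model=512, nhead=8, activation="gelu")

# New style: activation given as a callable
layer_fn = nn.TransformerEncoderLayer(d_model=512, nhead=8, activation=F.gelu)
```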
Pull Request resolved: https://github.com/pytorch/pytorch/pull/61355
Reviewed By: bdhirsh
Differential Revision: D29967302
Pulled By: jbschlosser
fbshipit-source-id: 8ee6f20083d49dcd3ab432a18e6ad64fe1e05705
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59987
As with GroupNorm, improve the numerical stability of LayerNorm using the Welford algorithm and pairwise summation.
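For reference, here is an illustrative pure-Python sketch of the Welford update (the actual kernels are C++/CUDA, so this is only a model of the algorithm):
```
def welford_mean_var(xs):
    """One-pass, numerically stable mean/variance (Welford's algorithm)."""
    mean, m2 = 0.0, 0.0  # m2 accumulates squared deviations from the running mean
    for n, x in enumerate(xs, start=1):
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)  # note: uses the freshly updated mean
    return mean, m2 / len(xs)  # population variance, as normalization layers use

# Pairwise summation (the other half of the change) likewise bounds rounding
# error by summing halves recursively instead of one long left-to-right chain.
```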
Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "LayerNorm"
Reviewed By: ngimel
Differential Revision: D29115235
fbshipit-source-id: 5183346c3c535f809ec7d98b8bdf6d8914bfe790
Summary:
Allow these tests to pass on A100 GPUs, which support TF32.
Basically a follow-up to https://github.com/pytorch/pytorch/pull/52871, which also raised some tolerances to 0.05.
For reference, these are the failures I see (the only ones in test_nn with 1.9.0):
```
FAIL: test_Conv3d_pad_same_cuda_tf32 (__main__.TestNN)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "test_nn.py", line 11296, in with_tf32_on
test.test_cuda(self, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_nn.py", line 5103, in test_cuda
test_case.assertEqualIgnoreType(cpu_d_i, gpu_d_i, atol=self.precision, rtol=0)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1254, in assertEqualIgnoreType
return self.assertEqual(*args, exact_dtype=False, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1355, in assertEqual
super().assertTrue(result, msg=self._get_assert_msg(msg, debug_msg=debug_msg))
AssertionError: False is not true : Tensors failed to compare as equal!With rtol=0 and atol=0.005, found 161 element(s) (out of 288) whose difference(s) exceeded the margin of error (including 0 nan compariso
ns). The greatest difference was 0.032408137116391345 (-33.45570601919647 vs. -33.42329788208008), which occurred at index (2, 0, 0, 1, 0).
======================================================================
FAIL: test_Conv3d_pad_same_dilated_cuda_tf32 (__main__.TestNN)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "test_nn.py", line 11296, in with_tf32_on
test.test_cuda(self, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_nn.py", line 5103, in test_cuda
test_case.assertEqualIgnoreType(cpu_d_i, gpu_d_i, atol=self.precision, rtol=0)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1254, in assertEqualIgnoreType
return self.assertEqual(*args, exact_dtype=False, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1355, in assertEqual
super().assertTrue(result, msg=self._get_assert_msg(msg, debug_msg=debug_msg))
AssertionError: False is not true : Tensors failed to compare as equal!With rtol=0 and atol=0.005, found 111 element(s) (out of 288) whose difference(s) exceeded the margin of error (including 0 nan compariso
ns). The greatest difference was 0.024654212557543076 (35.104286017977465 vs. 35.07963180541992), which occurred at index (3, 0, 0, 0, 2).
======================================================================
FAIL: test_Conv3d_pad_valid_cuda_tf32 (__main__.TestNN)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1033, in wrapper
method(*args, **kwargs)
File "test_nn.py", line 11296, in with_tf32_on
test.test_cuda(self, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_nn.py", line 5103, in test_cuda
test_case.assertEqualIgnoreType(cpu_d_i, gpu_d_i, atol=self.precision, rtol=0)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1254, in assertEqualIgnoreType
return self.assertEqual(*args, exact_dtype=False, **kwargs)
File "/tmp/easybuild-tmp/eb-ED4 (1f47a80e88)M3d/tmpqOhUjN/lib/python3.8/site-packages/torch/testing/_internal/common_utils.py", line 1355, in assertEqual
super().assertTrue(result, msg=self._get_assert_msg(msg, debug_msg=debug_msg))
AssertionError: False is not true : Tensors failed to compare as equal!With rtol=0 and atol=0.005, found 41 element(s) (out of 288) whose difference(s) exceeded the margin of error (including 0 nan comparisons). The greatest difference was 0.010903167642320355 (8.074376869119371 vs. 8.06347370147705), which occurred at index (0, 0, 1, 0, 0).
```
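For context, these differences stem from TF32 being enabled by default on Ampere; the relevant existing switches are shown below (a usage note, not part of this PR):
```
import torch

# On Ampere GPUs, TF32 is used by default for cuDNN convolutions (and, at the
# time of this PR, for matmuls), trading precision for speed. That is why these
# Conv3d comparisons against fp32 CPU references need looser tolerances.
torch.backends.cudnn.allow_tf32 = True        # convolutions
torch.backends.cuda.matmul.allow_tf32 = True  # matrix multiplications
```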
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60451
Reviewed By: albanD
Differential Revision: D29353255
Pulled By: ngimel
fbshipit-source-id: 155a02242be5a11dcbd9dd40ab63f15c6757ae1b
Summary:
Fixes https://github.com/pytorch/pytorch/issues/27655
This PR adds C++ and Python versions of ReflectionPad3d with structured kernels. The implementation uses lambdas extensively to better share code between the forward and backward passes.
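A brief usage sketch (toy shapes chosen for illustration):
```
import torch
import torch.nn as nn

pad = nn.ReflectionPad3d(1)  # reflect-pad all six sides of a 5D input by 1
x = torch.arange(8.0).reshape(1, 1, 2, 2, 2)
y = pad(x)  # shape (1, 1, 4, 4, 4); border values are mirrored, not repeated
```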
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59791
Reviewed By: gchanan
Differential Revision: D29242015
Pulled By: jbschlosser
fbshipit-source-id: 18e692d3b49b74082be09f373fc95fb7891e1b56
Summary:
Following https://github.com/pytorch/pytorch/issues/59624, I observed some straggling failing tests on Ampere due to TF32 thresholds. This PR just twiddles some more thresholds to fix the six failing tests I saw on A100.
CC Flamefire ptrblck ngimel
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60209
Reviewed By: gchanan
Differential Revision: D29220508
Pulled By: ngimel
fbshipit-source-id: 7c83187a246e1b3a24b181334117c0ccf2baf311
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56964
This PR does many things but does not update any logic:
- Prefixes all function names that are not `gradcheck`, `gradgradcheck`, `get_numerical_jacobian`, and `get_analytical_jacobian` with underscore to indicate that they aren't part of the public API (https://github.com/pytorch/pytorch/issues/55714).
- Improves naming to avoid referencing Jacobian rows or Jacobian cols when we really mean vjp and jvp, as suggested by zou3519
- Reduces comment line length so comments are more consistent and easier to read
- Makes other miscellaneous improvements to documentation
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D28096571
Pulled By: soulitzer
fbshipit-source-id: d372b5f8ee080669e525a987402ded72810baa0c
Summary:
This PR adds a `padding_idx` parameter to `nn.EmbeddingBag` and `nn.functional.embedding_bag`. As with `nn.Embedding`'s `padding_idx` argument, if an embedding's index is equal to `padding_idx`, it is ignored and therefore not included in the reduction.
This PR does not add `padding_idx` support for quantized `EmbeddingBag` or for ONNX opset 10/11 (opset 9 is supported); in these cases, an error is thrown if `padding_idx` is provided.
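A brief usage sketch (toy sizes, with index 0 chosen as the padding index):
```
import torch
import torch.nn as nn

bag = nn.EmbeddingBag(num_embeddings=10, embedding_dim=3,
                      mode="sum", padding_idx=0)
inp = torch.tensor([0, 2, 0, 5])   # the 0 entries are excluded from the sums
offsets = torch.tensor([0, 2])     # two bags: [0, 2] and [0, 5]
out = bag(inp, offsets)            # shape (2, 3); each row sums one bag
```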
Fixes https://github.com/pytorch/pytorch/issues/3194
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49237
Reviewed By: walterddr, VitalyFedyunin
Differential Revision: D26948258
Pulled By: jbschlosser
fbshipit-source-id: 3ca672f7e768941f3261ab405fc7597c97ce3dfc
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54378
### For release notes
`torch.autograd.gradcheck.get_numerical_jacobian` (not part of the public API) is being deprecated.
In the future, user code relying on this function will break because, among other changes, `get_numerical_jacobian` now returns `List[Tuple[torch.Tensor]]` instead of `List[torch.Tensor]`.
For a `fn` that takes M inputs and produces N outputs, we now return a list of M N-tuples of Jacobians, where `output[i][j]` represents the numerical Jacobian w.r.t. the i-th input and the j-th output. Previously, `get_numerical_jacobian` returned a list of tensors, one per input, each representing the Jacobian w.r.t. that input for a specific output. Finally, the function passed in as the `fn` parameter should now expect to handle individual parameters, whereas previously `fn` was required to expect its parameters wrapped in a tuple.
--- end --
This PR addresses the comment here, https://github.com/pytorch/pytorch/pull/53857#discussion_r595429639, reducing the run time of old gradcheck's `get_numerical_jacobian` by a factor of num_outputs. However, because very few ops actually return multiple outputs, there is not much real speed-up here.
The main benefit of making this change as part of the refactor is that it helps us isolate any bugs specific to switching `get_numerical_jacobian` to run per output rather than over all outputs at once. Much of the logic implemented here will be the same for the fast gradcheck case, so knowing for certain that everything passes after this stage will make the next step much simpler.
The `get_numerical_jacobian` API is also used in common_nn, so we update the call site there as well.
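To make the new `jacobians[i][j]` layout concrete, here is a toy finite-difference sketch (illustrative only, not the actual gradcheck internals; the helper name is hypothetical):
```
import torch

def toy_numerical_jacobians(fn, inputs, eps=1e-6):
    """Return a list of M N-tuples: jacs[i][j] = d(output j) / d(input i)."""
    outputs = fn(*inputs)
    jacs = []
    for inp in inputs:
        flat = inp.reshape(-1)  # a view, so the writes below perturb `inp`
        per_output = []
        for j in range(len(outputs)):
            jac = torch.zeros(inp.numel(), outputs[j].numel(), dtype=inp.dtype)
            for k in range(inp.numel()):
                orig = flat[k].item()
                flat[k] = orig + eps
                plus = fn(*inputs)[j].reshape(-1)
                flat[k] = orig - eps
                minus = fn(*inputs)[j].reshape(-1)
                flat[k] = orig
                jac[k] = (plus - minus) / (2 * eps)  # central difference
            per_output.append(jac)
        jacs.append(tuple(per_output))
    return jacs  # List[Tuple[Tensor, ...]], indexed jacs[i][j]

fn = lambda x, y: (x + y, x * y)  # M = 2 inputs, N = 2 outputs
x, y = torch.randn(3, dtype=torch.double), torch.randn(3, dtype=torch.double)
jacs = toy_numerical_jacobians(fn, (x, y))
assert torch.allclose(jacs[0][1], torch.diag(y), atol=1e-4)  # d(x*y)/dx = diag(y)
```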
Test Plan: Imported from OSS
Reviewed By: jbschlosser
Differential Revision: D27728720
Pulled By: soulitzer
fbshipit-source-id: ee0f90b4f26ddc5fdbe949c4965eaa91c9ed0bb8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55574
nn.EmbeddingBag's backward is non-deterministic on GPU when the reducing mode is Max; reducing modes Mean and Sum should be deterministic.
Test Plan: NA
Reviewed By: ngimel
Differential Revision: D27633832
fbshipit-source-id: 50786ed8522f1aae27442f5f244a65eab8000b06