Commit Graph

66 Commits

Author SHA1 Message Date
mattip
345844d9d8 test, fix deepcopy of tensor with grad (#50663)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/3307

Previously, `self.grad` was not ~~cloned~~ deepcopied to the returned tensor in `deepcopy`. This PR adds a test and a fix.
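
For illustration, a minimal sketch (not the PR's test) of the behavior being fixed:

```
import copy
import torch

x = torch.ones(3, requires_grad=True)
x.sum().backward()                    # populate x.grad

y = copy.deepcopy(x)
assert y.grad is not None             # previously the copy could miss its grad
assert torch.equal(y.grad, x.grad)    # same values...
assert y.grad is not x.grad           # ...but an independent, deepcopied tensor
```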

Pull Request resolved: https://github.com/pytorch/pytorch/pull/50663

Reviewed By: heitorschueroff

Differential Revision: D26074811

Pulled By: albanD

fbshipit-source-id: 536dad36415f1d03714b4ce57453f406ad802b8c
2021-01-26 16:19:53 -08:00
Richard Zou
83315965ab Turn on batched grad testing for CriterionTest (#50744)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50744

This PR adds a `check_batched_grad=True` option to CriterionTest and
turns it on by default for all CriterionTest-generated tests.
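
For context, a minimal sketch of what batched-grad checking means, assuming the public `torch.autograd.gradcheck(..., check_batched_grad=True)` flag (the CriterionTest plumbing itself lives in the internal common_nn machinery):

```
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

inp = torch.randn(4, 3, dtype=torch.double, requires_grad=True)
tgt = torch.randn(4, 3, dtype=torch.double)

# Besides comparing analytic and numeric gradients, also verify that
# gradients computed in a batched (vmap-style) fashion match.
assert gradcheck(lambda i: F.mse_loss(i, tgt), (inp,), check_batched_grad=True)
```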

Test Plan: - run tests

Reviewed By: ejguan

Differential Revision: D25997676

Pulled By: zou3519

fbshipit-source-id: cc730731e6fae2bddc01bc93800fd0e3de28b32d
2021-01-26 07:37:15 -08:00
Richard Zou
63838b9330 Turn on batched_grad testing for NewModuleTest (#50740)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50740

This PR adds a `check_batched_grad=True` option to
NewModuleTest-generated NN tests.

Test Plan: - run tests (`pytest test/test_nn.py -v -rf`)

Reviewed By: ejguan

Differential Revision: D25997679

Pulled By: zou3519

fbshipit-source-id: b75e73d7e86fd3af9bad6efed7127b36551587b3
2021-01-22 15:33:09 -08:00
Peter Bell
db079a9877 Padding: support complex dtypes (#50594)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50594

Fixes #50234
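
A quick usage sketch (assumed current behavior) of the operation this enables:

```
import torch
import torch.nn.functional as F

x = torch.randn(1, 2, 4, 4, dtype=torch.cfloat)
y = F.pad(x, (1, 1, 1, 1), mode="reflect")   # reflection padding on a complex input
print(y.shape, y.dtype)                      # torch.Size([1, 2, 6, 6]) torch.complex64
```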

Test Plan: Imported from OSS

Reviewed By: pbelevich

Differential Revision: D25987316

Pulled By: anjali411

fbshipit-source-id: c298b771fe52b267a86938e886ea402badecfe3e
2021-01-22 11:57:42 -08:00
Peter Bell
47f0bda3ef Improve complex support in common_nn test machinery (#50593)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50593

There is no equivalent of torch.FloatTensor or torch.cuda.FloatTensor for complex
types, so `get_gpu_type` and `get_cpu_type` are broken for complex tensors.

Also found a few places that explicitly cast inputs to floating point types,
which would drop the imaginary component before running the test.
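
A small illustration (assumed current PyTorch behavior, not code from the PR) of why the explicit float casts corrupt complex test inputs:

```
import torch

z = torch.randn(3, dtype=torch.cfloat)
r = z.to(torch.float)            # warns: casting complex to real discards the imaginary part
print(torch.equal(r, z.real))    # True: only the real component survives the cast
```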

Test Plan: Imported from OSS

Reviewed By: ngimel

Differential Revision: D25954050

Pulled By: mruberry

fbshipit-source-id: 1fa8e5af233aa095c839d5e2f860564baaf92aef
2021-01-22 09:44:45 -08:00
Natalia Gimelshein
4d169258ef Revert D25976245: [pytorch][PR] Enable Skipped ROCM Tests in common_nn.py
Test Plan: revert-hammer

Differential Revision:
D25976245 (24a0272132)

Original commit changeset: 801032534f91

fbshipit-source-id: 561e6d761cb694451d5f87557b4f96f37d19dd90
2021-01-21 13:28:37 -08:00
Arindam Roy
24a0272132 Enable Skipped ROCM Tests in common_nn.py (#50753)
Summary:
Removed `test_cuda=(not TEST_WITH_ROCM)` in common_nn.py to enable the skipped tests for ROCm.

Signed-off-by: Arindam Roy <rarindam@gmail.com>

Fixes #{issue number}

Pull Request resolved: https://github.com/pytorch/pytorch/pull/50753

Reviewed By: mrshenli

Differential Revision: D25976245

Pulled By: ngimel

fbshipit-source-id: 801032534f911d24d231bc9f0d3235a4506412c0
2021-01-21 09:48:47 -08:00
Jeffrey Wan
6e3e57095c Add complex support for torch.nn.L1Loss (#49912)
Summary:
Building on top of the work of anjali411 (https://github.com/pytorch/pytorch/issues/46640)

Things added in this PR:
1. Modify backward and double-backward formulas
2. Add complex support for `new module tests` and criterion tests (and add complex tests for L1)
3. Modify some existing tests to support complex
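
A minimal sketch of what the new complex tests exercise, assuming the complex L1Loss behavior added here (|input - target| on complex tensors, producing a real-valued loss):

```
import torch

loss_fn = torch.nn.L1Loss()
inp = torch.randn(5, dtype=torch.cdouble, requires_grad=True)
tgt = torch.randn(5, dtype=torch.cdouble)

loss = loss_fn(inp, tgt)   # mean of |inp - tgt|, a real scalar
loss.backward()            # complex gradient flows back to inp
print(loss.dtype, inp.grad.dtype)
```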

Pull Request resolved: https://github.com/pytorch/pytorch/pull/49912

Reviewed By: zhangguanheng66

Differential Revision: D25853036

Pulled By: soulitzer

fbshipit-source-id: df619f1b71c450ab2818eb17804e0c55990aa8ad
2021-01-15 15:53:15 -08:00
Hugo van Kemenade
473e78c0fa Remove redundant code for unsupported Python versions (#49486)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486

Remove code for Python 3.5 and lower.

There's more that can be removed/modernised, but sticking mainly to redundant version checks here, to keep the diff/PR smaller.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579

Reviewed By: zou3519

Differential Revision: D24453571

Pulled By: ezyang

fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e
2021-01-06 12:45:46 -08:00
Natalia Gimelshein
e35b822d7d fixes indices computation for trilinear interpolate backwards (#50084)
Summary:
https://github.com/pytorch/pytorch/issues/48675 had some typos in the index computations, so trilinear interpolation results were wrong when height was not equal to width. This PR fixes them.
cc xwang233
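
A gradcheck-style sketch (not the PR's test) of the configuration that used to go wrong; the original bug was in the CUDA backward index math:

```
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

# D, H, W deliberately all different so the index computation matters.
x = torch.randn(1, 1, 3, 4, 5, dtype=torch.double, requires_grad=True)
fn = lambda t: F.interpolate(t, scale_factor=2, mode="trilinear", align_corners=False)
assert gradcheck(fn, (x,))
```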

Pull Request resolved: https://github.com/pytorch/pytorch/pull/50084

Reviewed By: BIT-silence

Differential Revision: D25777083

Pulled By: ngimel

fbshipit-source-id: 71be545628735fe875b7ea30bf6a09df4f2fae5c
2021-01-05 09:20:59 -08:00
Joel Schlosser
68d438c9da Add PixelUnshuffle (#49334)
Summary:
Adds an implementation of `torch.nn.PixelUnshuffle` as the inverse operation of `torch.nn.PixelShuffle`. This addresses https://github.com/pytorch/pytorch/issues/2456
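
A short usage sketch of the new module and its relationship to `PixelShuffle`:

```
import torch
import torch.nn as nn

unshuffle = nn.PixelUnshuffle(downscale_factor=3)
shuffle = nn.PixelShuffle(upscale_factor=3)

x = torch.randn(1, 1, 12, 12)
y = unshuffle(x)            # -> (1, 9, 4, 4): spatial blocks folded into channels
z = shuffle(y)              # inverse rearrangement
assert torch.equal(x, z)
```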

Pull Request resolved: https://github.com/pytorch/pytorch/pull/49334

Test Plan:
```
# Unit tests.
python test/test_nn.py TestNN.test_pixel_shuffle_unshuffle

# Module test.
python test/test_nn.py TestNN.test_PixelUnshuffle

# C++ API tests.
build/bin/test_api

# C++ / python parity tests.
python test/test_cpp_api_parity.py

# JIT test.
python test/test_jit.py TestJitGeneratedFunctional.test_nn_pixel_unshuffle

# Override tests.
python test/test_overrides.py

# Type hint tests.
python test/test_type_hints.py
```

Screenshots of rendered docs: five screenshots attached to the PR (omitted here).

Reviewed By: mruberry

Differential Revision: D25401439

Pulled By: jbschlosser

fbshipit-source-id: 209d92ce7295e51699e83616d0c62170a7ce75c8
2020-12-22 20:14:55 -08:00
Richard Zou
dfb7520c47 NewModuleTest: Don't call both check_jacobian and gradcheck (#49566)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49566

Fixes #49422.

check_jacobian and gradcheck do roughly the same thing: they both
compute an analytic jacobian and a numeric jacobian and check that
they are equivalent. Furthermore, NewModuleTest will (by default) call
both check_jacobian and gradcheck, leading to some redundant checks that
waste CI resources.

However, there is one subtle difference: `check_jacobian` can handle the
special case where a Module takes in dense inputs and dense parameters
but returns sparse gradients, but that is not something gradcheck can
handle. This is only used in the tests for nn.Embedding and
nn.EmbeddingBag.

This PR does the following:
- have NewModuleTest call gradcheck instead of check_jacobian by default
- add a new "has_sparse_gradients" flag to NewModuleTest. These are True
for the nn.Embedding and nn.EmbeddingBag sparse gradient tests. If
`has_sparse_gradients` is True, then we call check_jacobian, otherwise,
we call gradcheck.
- Kills the "jacobian_input" flag. This flag was used to tell
NewModuleTest to not attempt to compute the jacobian for the inputs to
the module. This is only desirable if the input to the module isn't
differentiable and was only set in the case of nn.Embedding /
nn.EmbeddingBag that take a LongTensor input. `gradcheck` handles these
automatically by not checking gradients for non-differentiable inputs.
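
A small sketch of the last point: gradcheck simply skips non-differentiable (e.g. integer) inputs, so the old jacobian_input flag is unnecessary.

```
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

idx = torch.tensor([0, 2, 1])                                    # LongTensor input, no gradient
weight = torch.randn(4, 3, dtype=torch.double, requires_grad=True)
assert gradcheck(F.embedding, (idx, weight))                     # only `weight` is checked
```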

Test Plan:
- Code reading
- run test_nn.py

Reviewed By: albanD

Differential Revision: D25622929

Pulled By: zou3519

fbshipit-source-id: 8d831ada98b6a95d63f087ea9bce1b574c996a22
2020-12-22 06:48:31 -08:00
Guilherme Leobas
a4e13fcf3f add type annotations to common_nn.py (#48190)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/48189

Pull Request resolved: https://github.com/pytorch/pytorch/pull/48190

Reviewed By: walterddr, zhangguanheng66

Differential Revision: D25245261

Pulled By: malfet

fbshipit-source-id: 0eabaed54996be83ead0fd7668f4d2be20adfc17
2020-12-02 14:46:00 -08:00
Chester Liu
17a6bc7c1b Cleanup unused code for Python < 3.6 (#47822)
Summary:
I think these can be safely removed since the min version of supported Python is now 3.6

Pull Request resolved: https://github.com/pytorch/pytorch/pull/47822

Reviewed By: smessmer

Differential Revision: D24954936

Pulled By: ezyang

fbshipit-source-id: 5d4b2aeb78fc97d7ee4abaf5fb2aae21bf765e8b
2020-11-13 21:37:01 -08:00
Gao, Xiang
0652d755d3 Fix some flaky tests in test_torch.py and test_nn.py (#46941)
Summary:
Fixed test:
- `test_is_nonzero`: this asserted an exact match, which is flaky when `TORCH_SHOW_CPP_STACKTRACES=1`; changed it to a non-exact assert
- `test_pinverse` TF32
- `test_symeig` TF32
- `test_triangular_solve_batched_many_batches_cpu_float64` precision on CPU BLAS
- `test_qr` TF32, plus a tensor factory that was missing `dtype=dtype`
- `test_lu` TF32
- `ConvTranspose2d` TF32
- `Conv3d_1x1x1_no_bias` TF32
- `Transformer*` TF32

Pull Request resolved: https://github.com/pytorch/pytorch/pull/46941

Reviewed By: heitorschueroff

Differential Revision: D24852725

Pulled By: mruberry

fbshipit-source-id: ccd4740cc643476178d81059d1c78da34e5082ed
2020-11-12 22:35:42 -08:00
Alexander Grund
93719440b8 Replace map(lambda constructs (#46462)
Summary:
Follow-up of https://github.com/pytorch/pytorch/issues/46461 with a similar goal

Makes them more readable and possibly faster. Care has to be taken because `map` applies the function immediately while `(x for x in xs)` is a generator expression which gets evaluated later. This is a benefit in some cases where it is not required to actually create the list of values in memory (e.g. when passing to `tuple` or `extend` or `join`)
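
A generic illustration of the rewrite pattern (not code from the PR):

```
xs = [1, 2, 3]

# before: map + lambda
squares = list(map(lambda x: x * x, xs))

# after: comprehension, or a generator expression when the consumer
# (tuple, join, extend, sum, ...) doesn't need a materialized list
squares = [x * x for x in xs]
total = sum(x * x for x in xs)
```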

Pull Request resolved: https://github.com/pytorch/pytorch/pull/46462

Reviewed By: zou3519

Differential Revision: D24422343

Pulled By: ezyang

fbshipit-source-id: 252e33499c92ac0b15238f2df32681dbbda2b237
2020-10-22 09:50:22 -07:00
Xiaomeng Yang
a87a1c1103 Fix performance issue of GroupNorm on CUDA when feature map is small. (#46170)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46170

Fix performance issue of GroupNorm on CUDA when feature map is small.

Benchmark script:

```
import torch
import torch.nn.functional as F

from timeit import Timer

norm = torch.nn.GroupNorm(8, 512).cuda()

num = 5000

sizes = [(1024, 512, 14, 14), (1024, 512, 7, 7), (1024, 512)]

def forward(x):
    _ = norm(x)
    torch.cuda.synchronize()

def backward(y, grad):
    y.backward(grad, retain_graph=True)
    torch.cuda.synchronize()

if __name__ == "__main__":
    # warm up
    x = torch.rand(*(sizes[0]), dtype=torch.float,
                   device="cuda", requires_grad=True)
    for _ in range(100):
        forward(x)

    for size in sizes:
        x = torch.rand(*size, dtype=torch.float,
                       device="cuda", requires_grad=True)
        t = Timer("forward(x)", "from __main__ import forward, x")
        print(f"size = {size}:")
        t1 = t.timeit(num) / num * 1e6
        print(f"avg_forward_time =  {t1}us")

        y = norm(x)
        grad = torch.randn_like(y)
        t = Timer("backward(y, grad)", "from __main__ import backward, y, grad")
        t2 = t.timeit(num) / num * 1e6
        print(f"avg_backward_time = {t2}us")
```
Benchmark result before this Diff:
```
size = (1024, 512, 14, 14):
avg_forward_time =  1636.729855206795us
avg_backward_time = 5488.682465581223us
size = (1024, 512, 7, 7):
avg_forward_time =  465.88476160541177us
avg_backward_time = 3129.9425506033003us
size = (1024, 512):
avg_forward_time =  96.90486900508404us
avg_backward_time = 2319.4099438143894us
```

Benchmark result after this Diff:
```
size = (1024, 512, 14, 14):
avg_forward_time =  1635.6191572034732us
avg_backward_time = 4140.7730475999415us
size = (1024, 512, 7, 7):
avg_forward_time =  463.6513736099005us
avg_backward_time = 1641.7451039887965us
size = (1024, 512):
avg_forward_time =  66.59087920561433us
avg_backward_time = 128.6882139975205us

```

Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "GroupNorm"

Reviewed By: hl475, houseroad

Differential Revision: D24242738

fbshipit-source-id: b52c82d7b6e47855c48fa8ceacd0c55d03bb92d5
2020-10-14 23:34:33 -07:00
ashish
5500b62f28 Enable zero batch conv tests for ROCm (#46305)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/26669

This PR enables convolution tests for zero batch size implemented in https://github.com/pytorch/pytorch/pull/26214/.

jamesr66a jeffdaily sunway513

Pull Request resolved: https://github.com/pytorch/pytorch/pull/46305

Reviewed By: navahgar

Differential Revision: D24307981

Pulled By: heitorschueroff

fbshipit-source-id: dfc595fa855ae084b60a693e209b0fdcc714221d
2020-10-14 11:36:30 -07:00
Natalia Gimelshein
9201c37d02 Use addmm directly for 1x1 convolution (#45557)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/45274
Based on https://github.com/pytorch/pytorch/issues/44041; sets an intermediate for backward computation (otherwise backward tests fail).
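
For reference, a sketch of the equivalence being exploited (illustrative, not the kernel code): a 1x1 convolution is a matrix multiply over the channel dimension.

```
import torch
import torch.nn.functional as F

N, C_in, C_out, H, W = 2, 4, 8, 5, 5
x = torch.randn(N, C_in, H, W)
weight = torch.randn(C_out, C_in, 1, 1)
bias = torch.randn(C_out)

conv_out = F.conv2d(x, weight, bias)

# Same result via a batched matmul plus bias (roughly what addmm computes):
mm_out = (weight.view(C_out, C_in) @ x.view(N, C_in, H * W)).view(N, C_out, H, W)
mm_out = mm_out + bias.view(1, C_out, 1, 1)

print(torch.allclose(conv_out, mm_out, atol=1e-5))
```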

Pull Request resolved: https://github.com/pytorch/pytorch/pull/45557

Reviewed By: izdeby

Differential Revision: D24030655

Pulled By: ngimel

fbshipit-source-id: 368fe9440668dffc004879f8b1d2dd3787d915c9
2020-10-02 00:26:53 -07:00
lixinyu
417e3f85e5 Support tuple inputs in NN Module test (#44853)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44853

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23750441

Pulled By: glaringlee

fbshipit-source-id: 1b111a370a726b40521134b711c35f48dda99411
2020-09-28 22:05:05 -07:00
Gregory Chanan
1097fe0088 Remove CriterionTest.test_cuda code for dtype None. (#45316)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45316

It's never used.

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23919449

Pulled By: gchanan

fbshipit-source-id: f9aaeeabf3940389156bfc01bc3118d348ca4cf6
2020-09-28 15:08:09 -07:00
Gregory Chanan
5d1fee23b3 Remove convert_target from NN tests. (#45291)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45291

It's not necessary, you can just check if the dtype is integral.

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23911963

Pulled By: gchanan

fbshipit-source-id: 230139e1651eb76226f4095e31068dded30e03e8
2020-09-28 14:21:42 -07:00
Brian Hirsh
439930c81b adding a beta parameter to the smooth_l1 loss fn (#44433)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44433

Not entirely sure why, but changing the type of beta from `float` to `double` in autocast_mode.cpp and FunctionsManual.h fixes my compiler errors, failing instead at link time.

fixing some type errors, updated fn signature in a few more files

removing my usage of Scalar, making beta a double everywhere instead
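
A usage sketch of the new parameter through the functional interface (assumed signature `F.smooth_l1_loss(..., beta=...)`):

```
import torch
import torch.nn.functional as F

inp = torch.randn(10, requires_grad=True)
tgt = torch.randn(10)

# beta is the |input - target| threshold where the loss switches from the
# quadratic branch 0.5 * (x - y)**2 / beta to the linear branch |x - y| - 0.5 * beta.
loss = F.smooth_l1_loss(inp, tgt, beta=0.5)
loss.backward()
```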

Test Plan: Imported from OSS

Reviewed By: mrshenli

Differential Revision: D23636720

Pulled By: bdhirsh

fbshipit-source-id: caea2a1f8dd72b3b5fd1d72dd886b2fcd690af6d
2020-09-25 16:36:28 -07:00
Gao, Xiang
3f5eee666c Adjust TF32 tests (#44240)
Summary:
- The thresholds of some tests are bumped up. Depending on the random generator, these tests sometimes fail with things like 0.0059 is not smaller than 0.005. I ran `test_nn.py` and `test_torch.py` 10+ times to check that these are no longer flaky.
- Add `tf32_on_and_off` to new `matrix_exp` tests.
- Disable TF32 on test suites other than `test_nn.py` and `test_torch.py`

cc: ptrblck
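
For context, a sketch of the global switches involved (current public flags, assumed here; the test suite wraps them in its internal `tf32_on_and_off` decorator):

```
import torch

torch.backends.cuda.matmul.allow_tf32 = False   # disable TF32 for CUDA matmuls
torch.backends.cudnn.allow_tf32 = False         # disable TF32 for cuDNN convolutions
```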

Pull Request resolved: https://github.com/pytorch/pytorch/pull/44240

Reviewed By: mruberry

Differential Revision: D23882498

Pulled By: ngimel

fbshipit-source-id: 44a9ec08802c93a2efaf4e01d7487222478b6df8
2020-09-24 10:25:58 -07:00
Gao, Xiang
2111ec3bf3 CUDA BFloat16 losses (#45011)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45011

Reviewed By: mruberry

Differential Revision: D23805840

Pulled By: ngimel

fbshipit-source-id: 3eb60d4367c727100763879e20e9df9d58bf5ad6
2020-09-21 22:51:17 -07:00
Gregory Chanan
a6895d43b6 Turn on gradgrad check for BCELoss Criterion Tests. (#44894)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44894

Looks like we added double backwards support but only turned on the ModuleTests.

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23762544

Pulled By: gchanan

fbshipit-source-id: b5cef579608dd71f3de245c4ba92e49216ce8a5e
2020-09-21 07:14:22 -07:00
Gregory Chanan
07b7e44ed1 Stop using check_criterion_jacobian. (#44786)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44786

This predates gradcheck and gradcheck does the same and more.

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23731902

Pulled By: gchanan

fbshipit-source-id: 425fd30e943194f63a663708bada8960265b8f05
2020-09-18 07:04:57 -07:00
Gregory Chanan
6d178f6b8e Stop ignoring errors in cuda nn module tests. (#44783)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44783

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23731778

Pulled By: gchanan

fbshipit-source-id: 32df903a9e36bbf3f66645ee2d77efa5ed6ee429
2020-09-18 07:03:41 -07:00
Gregory Chanan
192c4111a3 Simplify target handling in nn gradcheck. (#44507)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44507

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23635799

Pulled By: gchanan

fbshipit-source-id: 75090d6a48771e5c92e737a0829fbfa949f7c8a7
2020-09-11 13:25:59 -07:00
Gregory Chanan
5579b53a7f Fix SmoothL1Loss when target.requires_grad is True. (#44486)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44486

SmoothL1Loss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.

This PR does the following:

1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path used when target.requires_grad was True
3) modifies the SmoothL1Loss CriterionTests to verify that the target derivative is checked.
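
A sketch of what derivative support for target enables, assuming the current functional API:

```
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

inp = torch.randn(6, dtype=torch.double, requires_grad=True)
tgt = torch.randn(6, dtype=torch.double, requires_grad=True)   # target gradient is now checked too
assert gradcheck(lambda i, t: F.smooth_l1_loss(i, t), (inp, tgt))
```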

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23630699

Pulled By: gchanan

fbshipit-source-id: 0f94d1a928002122d6b6875182867618e713a917
2020-09-11 12:13:36 -07:00
Gregory Chanan
3de2c0b42f Fix L1Loss when target.requires_grad is True. (#44471)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44471

L1Loss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.

This PR does the following:

1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path used when target.requires_grad was True
3) modifies the L1Loss CriterionTests to verify that the target derivative is checked.

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23626008

Pulled By: gchanan

fbshipit-source-id: 2828be16b56b8dabe114962223d71b0e9a85f0f5
2020-09-11 09:51:16 -07:00
Gregory Chanan
d07d25a8c5 Fix MSELoss when target.requires_grad is True. (#44437)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44437

MSELoss had a completely different (and incorrect, see https://github.com/pytorch/pytorch/issues/43228) path when target.requires_grad was True.

This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path used when target.requires_grad was True
3) modifies the MSELoss CriterionTests to verify that the target derivative is checked.

TODO:
1) do we still need check_criterion_jacobian when we run grad/gradgrad checks?
2) ensure the Module tests check when target.requires_grad
3) do we actually test when reduction='none' and reduction='mean'?

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D23612166

Pulled By: gchanan

fbshipit-source-id: 4f74d38d8a81063c74e002e07fbb7837b2172a10
2020-09-11 08:51:28 -07:00
Gregory Chanan
c8914afdfa Merge criterion_tests and new_criterion_tests. (#44398)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44398

These end up executing the same tests, so no reason to have them separate.

Test Plan: Imported from OSS

Reviewed By: mruberry

Differential Revision: D23600855

Pulled By: gchanan

fbshipit-source-id: 0952492771498bf813f1bf8e1d7c8dce574ec965
2020-09-10 08:29:59 -07:00
Gregory Chanan
fa158c4ca6 Combine criterion and new criterion tests in test_jit. (#43958)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43958

There is not any difference between these tests (I'm merging them), so let's merge them in the JIT as well.

Test Plan: Imported from OSS

Reviewed By: mruberry

Differential Revision: D23452337

Pulled By: gchanan

fbshipit-source-id: e6d13cdb164205eec3dbb7cdcd0052b02c961778
2020-09-10 08:28:14 -07:00
Gregory Chanan
af9cad761a Stop ignoring NotImplementedErrors in cuda CriterionTests. (#44381)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44381

Perhaps this was necessary when the test was originally introduced, but it's difficult to figure out what is actually tested. And I don't think we actually use NotImplementedErrors.

Test Plan: Imported from OSS

Reviewed By: mruberry

Differential Revision: D23598646

Pulled By: gchanan

fbshipit-source-id: aa18154bfc4969cca22323e61683a301198823be
2020-09-10 08:18:33 -07:00
Gregory Chanan
49215d7f26 For CriterionTests, have check_gradgrad actually only affect gradgrad checks. (#44060)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44060

Right now it skips grad checks as well.

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23484018

Pulled By: gchanan

fbshipit-source-id: 24a8f1af41f9918aaa62bc3cd78b139b2f8de1e1
2020-09-03 12:29:32 -07:00
Gregory Chanan
5973b44d9e Rename NewCriterionTest to CriterionTest. (#44056)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44056

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23482573

Pulled By: gchanan

fbshipit-source-id: dde0f1624330dc85f48e5a0b9d98fb55fdb72f68
2020-09-03 10:29:20 -07:00
Gregory Chanan
cae52b4036 Merge CriterionTest into NewCriterionTest. (#44055)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44055

There is no functional change here.  Another patch will rename NewCriterionTest to CriterionTest.

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23482572

Pulled By: gchanan

fbshipit-source-id: de364579067e2cc9de7df6767491f8fa3a685de2
2020-09-03 08:14:34 -07:00
Gregory Chanan
68a1fbe308 Allow criterion backwards test on modules requiring extra args (i.e. CTCLoss). (#44050)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44050

We don't actually turn on the CTCLoss tests since they fail, but this allows check_forward_only to be toggled and the code to actually run.

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23481091

Pulled By: gchanan

fbshipit-source-id: f2a3b0a2dee27341933c5d25f1e37a878b04b9f6
2020-09-03 07:41:21 -07:00
Gregory Chanan
5f89aa36cf Actually run backward criterion tests. (#44030)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44030

This looks to have been a mistake from https://github.com/pytorch/pytorch/pull/9287.

Test Plan: Imported from OSS

Reviewed By: zou3519

Differential Revision: D23476274

Pulled By: gchanan

fbshipit-source-id: 81ed9d0c9a40d49153fc97cd69fdcd469bec0c73
2020-09-03 07:39:13 -07:00
Gregory Chanan
c61a16b237 Kill dead code in common_nn as part of merging Criterion and NewCriterionTests. (#43956)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43956

See https://github.com/pytorch/pytorch/pull/43769 and https://github.com/pytorch/pytorch/pull/43776 for proof this code is dead.

Test Plan: Imported from OSS

Reviewed By: mruberry

Differential Revision: D23452217

Pulled By: gchanan

fbshipit-source-id: 6850aab2daaa1c321a6b7714f6f113f364f41973
2020-09-02 07:54:05 -07:00
Gao, Xiang
5e97f251a8 Enable TF32 support for cuDNN (#40737)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/40737

Reviewed By: mruberry

Differential Revision: D22801525

Pulled By: ngimel

fbshipit-source-id: ac7f7e728b4b3e01925337e8c9996f26a6433fd2
2020-09-01 15:34:24 -07:00
Xiaomeng Yang
4ae832e106 Optimize SiLU (Swish) op in PyTorch (#42976)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42976

Optimize SiLU (Swish) op in PyTorch.

Some benchmark result

input = torch.rand(1024, 32768, dtype=torch.float, device="cpu")
forward: 221ms -> 133ms
backward: 600ms -> 170ms

input = torch.rand(1024, 32768, dtype=torch.double, device="cpu")
forward: 479ms -> 297ms
backward: 1438ms -> 387ms

input = torch.rand(8192, 32768, dtype=torch.float, device="cuda")
forward: 24.34ms -> 9.83ms
backward: 97.05ms -> 29.03ms

input = torch.rand(4096, 32768, dtype=torch.double, device="cuda")
forward: 44.24ms -> 30.15ms
backward: 126.21ms -> 49.68ms
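
For reference, the op being optimized, by definition (this is the semantics, not the optimized kernel):

```
import torch
import torch.nn.functional as F

x = torch.randn(1024, 32768)
# SiLU / Swish: x * sigmoid(x)
assert torch.allclose(F.silu(x), x * torch.sigmoid(x), atol=1e-6)
```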

Test Plan: buck test mode/dev-nosan //caffe2/test:nn -- "SiLU"

Reviewed By: houseroad

Differential Revision: D23093593

fbshipit-source-id: 1ba7b95d5926c4527216ed211a5ff1cefa3d3bfd
2020-08-16 13:21:57 -07:00
Kurt Mohler
ec683299eb Reland Add non-deterministic alert to CUDA operations that use atomicAdd() (#41538)
Summary:
Reland PR https://github.com/pytorch/pytorch/issues/40056

A new overload of upsample_linear1d_backward_cuda was added in a recent commit, so I had to add the nondeterministic alert to it.
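
A rough sketch of how such an alert surfaces, assuming the current public switch `torch.use_deterministic_algorithms` (the PR era used an earlier experimental flag, and the exact set of alerting ops varies by version); requires a CUDA device:

```
import torch
import torch.nn.functional as F

torch.use_deterministic_algorithms(True)

x = torch.randn(1, 1, 8, device="cuda", requires_grad=True)
y = F.interpolate(x, scale_factor=2, mode="linear", align_corners=False)
y.sum().backward()   # may raise RuntimeError: the CUDA backward uses atomicAdd and is nondeterministic
```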

Pull Request resolved: https://github.com/pytorch/pytorch/pull/41538

Reviewed By: zou3519

Differential Revision: D22608376

Pulled By: ezyang

fbshipit-source-id: 54a2aa127e069197471f1feede6ad8f8dc6a2f82
2020-07-22 13:12:29 -07:00
Shen Li
954c260061 Revert D22480638: [pytorch][PR] Add non-deterministic alert to CUDA operations that use atomicAdd()
Test Plan: revert-hammer

Differential Revision:
D22480638 (6ff306b8b5)

Original commit changeset: 4cc913cb3ca6

fbshipit-source-id: e47fa14b5085bb2b74a479bd0830efc2d7604eea
2020-07-15 12:10:05 -07:00
Kurt Mohler
6ff306b8b5 Add non-deterministic alert to CUDA operations that use atomicAdd() (#40056)
Summary:
Issue https://github.com/pytorch/pytorch/issues/15359

Pull Request resolved: https://github.com/pytorch/pytorch/pull/40056

Differential Revision: D22480638

Pulled By: ezyang

fbshipit-source-id: 4cc913cb3ca6d4206de80f4665bbc9031aa3ca01
2020-07-15 10:57:32 -07:00
Natalia Gimelshein
155fb22e77 Run single-threaded gradgradcheck in testnn (#41147)
Summary:
Reland https://github.com/pytorch/pytorch/issues/40999

Pull Request resolved: https://github.com/pytorch/pytorch/pull/41147

Reviewed By: mruberry

Differential Revision: D22450357

Pulled By: ngimel

fbshipit-source-id: 02b6e020af5e6ef52542266bd9752b9cfbec4159
2020-07-08 22:53:27 -07:00
Brian Vaughan
a04af4dccb Revert D22396896: [pytorch][PR] run single-threaded gradgradcheck in test_nn
Test Plan: revert-hammer

Differential Revision:
D22396896 (dac63a13cb)

Original commit changeset: 3b247caceb65

fbshipit-source-id: 90bbd71ca5128a7f07fe2907c061ee0922d16edf
2020-07-07 07:43:39 -07:00
Natalia Gimelshein
dac63a13cb run single-threaded gradgradcheck in test_nn (#40999)
Summary:
The most time-consuming tests in test_nn (taking about half the total time) were gradgradchecks on Conv3d. This PR reduces their sizes and, most importantly, runs gradgradcheck single-threaded, which cuts the time of the Conv3d tests by an order of magnitude while barely affecting other tests.
These changes bring test_nn time down from 1200 s to ~550 s on my machine.
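
A rough sketch of the idea using public APIs (the actual change lives inside the test harness):

```
import torch
from torch.autograd import gradgradcheck

torch.set_num_threads(1)   # run the double-backward numerics single-threaded

conv = torch.nn.Conv3d(2, 2, 3).double()
x = torch.randn(1, 2, 5, 5, 5, dtype=torch.double, requires_grad=True)
assert gradgradcheck(lambda t: conv(t), (x,))
```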

Pull Request resolved: https://github.com/pytorch/pytorch/pull/40999

Differential Revision: D22396896

Pulled By: ngimel

fbshipit-source-id: 3b247caceb65d64be54499de1a55de377fdf9506
2020-07-06 17:21:25 -07:00
Mike Ruberry
13120bf677 Updates assertEqual to require atol and rtol, removes positional atol (#38872)
Summary:
This updates assertEqual and assertEqual-like functions to require that either both or neither of atol and rtol be specified. This should improve clarity around handling precision in the test suite, and it allows us to remove the legacy positional atol argument from assertEqual. In addition, the "message" kwarg is replaced with a kwarg-only "msg" argument whose name is consistent with unittest's assertEqual argument.

In the future we could make "msg" an optional third positional argument to be more consistent with unittest's assertEqual, but requiring it be specified should be clear, and we can easily update the signature to make "msg" an optional positional argument in the future, too.
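
A sketch of the updated call pattern for tests deriving from the common test utilities (illustrative):

```
import torch
from torch.testing._internal.common_utils import TestCase, run_tests

class TestExample(TestCase):
    def test_close(self):
        a = torch.tensor([1.00000, 2.0])
        b = torch.tensor([1.00001, 2.0])
        # atol and rtol must now be given together (or both omitted),
        # and the failure-message kwarg is spelled `msg`.
        self.assertEqual(a, b, atol=1e-4, rtol=0, msg="tensors should match to 1e-4")

if __name__ == "__main__":
    run_tests()
```
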
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38872

Differential Revision: D21740237

Pulled By: mruberry

fbshipit-source-id: acbc027aa1d7877a49664d94db9a5fff91a07042
2020-05-27 06:31:07 -07:00