Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54378
### For release notes
`torch.autograd.gradcheck.get_numerical_jacobian` (not part of the public api) is being deprecated.
In the future, user code relying on this function will break because, among other changes, `get_numerical_jacobian` now returns `List[Tuple[torch.Tensor]]` instead of `List[torch.Tensor]`.
(more details if necessary)
For a `fn` that takes M inputs and returns N outputs, we now return a list of M N-tuples of jacobians, where `output[i][j]` represents the numerical jacobian w.r.t. the ith input and the jth output. Previously, `get_numerical_jacobian` returned a list of tensors where each tensor was the jacobian w.r.t. each of the M inputs for one specific output. Finally, the function passed in as the `fn` parameter should now expect to handle individual parameters, whereas previously `fn` was required to expect its parameters wrapped in a tuple.
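A minimal sketch of the new shape convention, based on the description above (the call signature of the deprecated helper is assumed here, and deprecation warnings are omitted):

```python
import torch
from torch.autograd.gradcheck import get_numerical_jacobian

# fn takes M=2 inputs and returns N=2 outputs; per the new convention it receives
# the inputs as individual arguments rather than packed in a tuple.
def fn(x, y):
    return x * y, x + y

x = torch.randn(3, dtype=torch.double)
y = torch.randn(3, dtype=torch.double)

jacobians = get_numerical_jacobian(fn, (x, y))
# New: a list of M entries, each an N-tuple, so jacobians[i][j] is the numerical
# jacobian w.r.t. input i and output j.
# Old: a list of M tensors, each the jacobian w.r.t. one input for a single output.
```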
--- end --
This PR addresses the comment here: https://github.com/pytorch/pytorch/pull/53857#discussion_r595429639. It reduces the run-time of old gradcheck's `get_numerical_jacobian` by a factor of num_outputs. However, because very few ops actually return multiple outputs, there is not much real speed-up here.
The main benefit of doing this change as part of the refactor is that it helps us isolate the possible bugs that are specific to switching `get_numerical_jacobian` to run in a per-output way vs. all outputs at once. Much of the logic implemented here will be the same for the fast gradcheck case, so knowing for certain that everything should pass after this stage will make the next step much simpler.
The `get_numerical_jacobian` API is also used in common_nn, so we update the call site there as well.
Test Plan: Imported from OSS
Reviewed By: jbschlosser
Differential Revision: D27728720
Pulled By: soulitzer
fbshipit-source-id: ee0f90b4f26ddc5fdbe949c4965eaa91c9ed0bb8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55574
nn.EmbeddingBag's backward is non-deterministic on GPU when the reduction mode is Max; the Mean and Sum reduction modes should be deterministic.
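A minimal sketch of the path in question (hypothetical shapes; the CUDA backward of the max reduction with repeated indices is the non-deterministic step):

```python
import torch
import torch.nn as nn

# Max-mode EmbeddingBag on GPU: the backward pass below is the non-deterministic case.
emb = nn.EmbeddingBag(num_embeddings=10, embedding_dim=4, mode='max').cuda()
indices = torch.tensor([1, 2, 2, 5], device='cuda')  # repeated index 2
offsets = torch.tensor([0, 2], device='cuda')

out = emb(indices, offsets)
out.sum().backward()  # this backward is the step flagged as non-deterministic on CUDA
```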
Test Plan: NA
Reviewed By: ngimel
Differential Revision: D27633832
fbshipit-source-id: 50786ed8522f1aae27442f5f244a65eab8000b06
Summary:
* Lowering NLLLoss/CrossEntropyLoss to ATen dispatch (see the usage sketch below)
* This allows the MLC device to override these ops
* Reduce code duplication between the Python and C++ APIs.
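A usage-level view of the ops being lowered (standard functional behaviour, shown only to make the relationship between the two losses concrete):

```python
import torch
import torch.nn.functional as F

# cross_entropy is log_softmax followed by nll_loss; both now route through ATen.
logits = torch.randn(8, 5, requires_grad=True)
target = torch.randint(0, 5, (8,))

loss_a = F.cross_entropy(logits, target)
loss_b = F.nll_loss(F.log_softmax(logits, dim=1), target)
assert torch.allclose(loss_a, loss_b)
```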
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53789
Reviewed By: ailzhang
Differential Revision: D27345793
Pulled By: albanD
fbshipit-source-id: 99c0d617ed5e7ee8f27f7a495a25ab4158d9aad6
Summary:
Also modify the `tf32_on_and_off` decorator to make it support functions without a `device` argument.
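A rough sketch of how a decorator can support tests both with and without a `device` parameter by inspecting the signature (illustrative only, not the actual `tf32_on_and_off` implementation; the flag shown is the public TF32 switch):

```python
import functools
import inspect
import torch

def with_and_without_tf32(fn):
    """Run a test with TF32 allowed and disallowed, whether or not it takes `device`."""
    takes_device = 'device' in inspect.signature(fn).parameters

    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        if not takes_device:
            kwargs.pop('device', None)  # tolerate a device kwarg the test does not declare
        old = torch.backends.cuda.matmul.allow_tf32
        try:
            for allow in (True, False):
                torch.backends.cuda.matmul.allow_tf32 = allow
                fn(*args, **kwargs)
        finally:
            torch.backends.cuda.matmul.allow_tf32 = old
    return wrapper
```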
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52871
Reviewed By: ngimel
Differential Revision: D27286674
Pulled By: mruberry
fbshipit-source-id: 14f6d558271bd6a1d0bc40691c170d47e81de1ff
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45667
First part of #3867 (Pooling operators still to do)
This adds a `padding='same'` mode to the interface of `conv{n}d` and `nn.Conv{n}d`. This should match the behaviour of `tensorflow`. I couldn't find it explicitly documented, but through experimentation I found that `tensorflow` returns the shape `ceil(len/stride)` and always adds any extra asymmetric padding onto the right side of the input.
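A minimal usage sketch of the new interface (stride 1, where "same" simply means the spatial size is preserved):

```python
import torch
import torch.nn as nn

x = torch.randn(1, 3, 10, 10)
conv = nn.Conv2d(3, 8, kernel_size=3, stride=1, padding='same')
out = conv(x)
assert out.shape[-2:] == x.shape[-2:]  # output spatial size matches the input
```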
Since the `native_functions.yaml` schema doesn't seem to support strings or enums, I've moved the function interface into python and it now dispatches between the numerically padded `conv{n}d` and the `_conv{n}d_same` variant. Underscores because I couldn't see any way to avoid exporting a function into the `torch` namespace.
A note on asymmetric padding. The total padding required can be odd if the kernel length is even and the dilation is odd. mkldnn has native support for asymmetric padding, so there is no overhead there, but for other backends I resort to padding the input tensor by 1 on the right-hand side to make the remaining padding symmetrical. In these cases, I use `TORCH_WARN_ONCE` to notify the user of the performance implications.
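A small sketch of the padding arithmetic implied above (a hypothetical helper, not the PR's actual code): for stride 1, the total padding is `dilation * (kernel_size - 1)`, split between the two sides with any odd remainder going to the right.

```python
def same_pad_amounts(kernel_size, dilation=1):
    # Total padding needed to keep the input length at stride 1.
    total = dilation * (kernel_size - 1)
    left = total // 2
    right = total - left  # right side gets the extra element when total is odd
    return left, right

# An even kernel with odd dilation needs asymmetric padding:
assert same_pad_amounts(kernel_size=4, dilation=1) == (1, 2)
assert same_pad_amounts(kernel_size=3, dilation=2) == (2, 2)
```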
Test Plan: Imported from OSS
Reviewed By: ejguan
Differential Revision: D27170744
Pulled By: jbschlosser
fbshipit-source-id: b3d8a0380e0787ae781f2e5d8ee365a7bfd49f22
Summary:
Fixes #{issue number}
Resubmitting a new PR as the older one got reverted due to problems in test_optim.py.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/51227
Reviewed By: ezyang
Differential Revision: D26142505
Pulled By: ailzhang
fbshipit-source-id: a2ab5d85630aac2d2ce17652ba19c11ea668a6a9
Summary:
Fixes https://github.com/pytorch/pytorch/issues/3307
Previously, `self.grad` was not ~cloned~ deepcopied to the returned tensor in `deepcopy`. Added a test and an implementation.
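A small check of the fixed behaviour (after this change, `.grad` should be carried over by `deepcopy` rather than shared or dropped):

```python
import copy
import torch

a = torch.randn(3, requires_grad=True)
a.sum().backward()

b = copy.deepcopy(a)
assert b.grad is not None
assert torch.equal(b.grad, a.grad)
assert b.grad is not a.grad  # deepcopied, not aliased
```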
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50663
Reviewed By: heitorschueroff
Differential Revision: D26074811
Pulled By: albanD
fbshipit-source-id: 536dad36415f1d03714b4ce57453f406ad802b8c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50744
This PR adds a `check_batched_grad=True` option to CriterionTest and
turns it on by default for all CriterionTest-generated tests.
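A minimal sketch of the gradcheck option this flag maps onto (assuming CriterionTest forwards it to gradcheck/gradgradcheck; the function here is just a stand-in):

```python
import torch
from torch.autograd import gradcheck

x = torch.randn(4, 3, dtype=torch.double, requires_grad=True)
# check_batched_grad verifies that vmap-style batched gradients match per-sample gradients.
gradcheck(lambda t: t.sin(), (x,), check_batched_grad=True)
```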
Test Plan: - run tests
Reviewed By: ejguan
Differential Revision: D25997676
Pulled By: zou3519
fbshipit-source-id: cc730731e6fae2bddc01bc93800fd0e3de28b32d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50593
There is no equivalent of torch.FloatTensor / torch.cuda.FloatTensor for complex
types, so `get_gpu_type` and `get_cpu_type` are broken for complex tensors.
Also found a few places that explicitly cast inputs to floating point types,
which would drop the imaginary component before running the test.
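A short sketch of the issue (an assumed illustration, not test code from the PR): device movement for complex tensors should go through `.to()`, which preserves the complex dtype, and the real part should be taken explicitly rather than via a float cast that drops the imaginary component.

```python
import torch

x = torch.randn(3, dtype=torch.complex64)
y = x.to('cuda') if torch.cuda.is_available() else x
assert y.dtype == torch.complex64  # dtype preserved across devices

real_part = x.real  # explicit, instead of a float cast that would discard x.imag
```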
Test Plan: Imported from OSS
Reviewed By: ngimel
Differential Revision: D25954050
Pulled By: mruberry
fbshipit-source-id: 1fa8e5af233aa095c839d5e2f860564baaf92aef
Summary:
Building on top of the work of anjali411 (https://github.com/pytorch/pytorch/issues/46640)
Things added in this PR:
1. Modify backward and double-backward formulas
2. Add complex support for `new module tests` and criterion tests (and add complex tests for L1; see the gradcheck sketch below)
3. Modify some existing tests to support complex
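A hedged sketch of the kind of complex test this enables (gradcheck uses Wirtinger calculus for complex inputs; the exact tolerances and test structure in the PR may differ):

```python
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

inp = torch.randn(4, dtype=torch.complex128, requires_grad=True)
target = torch.randn(4, dtype=torch.complex128)
# With complex support in L1Loss, a plain gradcheck over the complex input should pass.
gradcheck(lambda x: F.l1_loss(x, target), (inp,))
```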
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49912
Reviewed By: zhangguanheng66
Differential Revision: D25853036
Pulled By: soulitzer
fbshipit-source-id: df619f1b71c450ab2818eb17804e0c55990aa8ad
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486
Remove code for Python 3.5 and lower.
There's more that can be removed/modernised, but sticking mainly to redundant version checks here, to keep the diff/PR smaller.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579
Reviewed By: zou3519
Differential Revision: D24453571
Pulled By: ezyang
fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e
Summary:
https://github.com/pytorch/pytorch/issues/48675 had some typos in the index computations, so results for trilinear interpolation were wrong when height is not equal to width. This PR fixes that.
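One way to exercise the affected configuration (a sketch with assumed shapes: non-cubic spatial dims, comparing the CUDA result against the CPU reference):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 1, 3, 5, 7)  # depth, height, and width all differ
ref = F.interpolate(x, scale_factor=2, mode='trilinear', align_corners=False)
if torch.cuda.is_available():
    out = F.interpolate(x.cuda(), scale_factor=2, mode='trilinear', align_corners=False)
    assert torch.allclose(out.cpu(), ref, atol=1e-5)
```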
cc xwang233
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50084
Reviewed By: BIT-silence
Differential Revision: D25777083
Pulled By: ngimel
fbshipit-source-id: 71be545628735fe875b7ea30bf6a09df4f2fae5c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49566
Fixes #49422.
check_jacobian and gradcheck do roughly the same thing: they both
compute an analytic jacobian and a numeric jacobian and check that
they are equivalent. Furthermore, NewModuleTest will (by default) call
both check_jacobian and gradcheck, leading to some redundant checks that
waste CI resources.
However, there is one subtle difference: `check_jacobian` can handle the
special case where a Module takes in dense inputs and dense parameters
but returns sparse gradients, but that is not something gradcheck can
handle. This is only used in the tests for nn.Embedding and
nn.EmbeddingBag.
This PR does the following:
- have NewModuleTest call gradcheck instead of check_jacobian by default
- add a new "has_sparse_gradients" flag to NewModuleTest. It is set to True
for the nn.Embedding and nn.EmbeddingBag sparse-gradient tests. If
`has_sparse_gradients` is True, then we call check_jacobian; otherwise,
we call gradcheck (see the sketch after this list).
- Kill the "jacobian_input" flag. This flag was used to tell
NewModuleTest not to attempt to compute the jacobian for the inputs to
the module. This is only desirable if the input to the module isn't
differentiable, and it was only set in the case of nn.Embedding /
nn.EmbeddingBag, which take a LongTensor input. `gradcheck` handles these
automatically by not checking gradients for non-differentiable inputs.
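A rough sketch of the dispatch described in the second bullet above (hypothetical helper; the real NewModuleTest wiring and check_jacobian signature differ):

```python
import torch

def check_module_gradients(test_case, module, inputs, has_sparse_gradients=False):
    if has_sparse_gradients:
        # nn.Embedding / nn.EmbeddingBag with sparse=True: dense inputs and parameters
        # but sparse gradients, which plain gradcheck cannot handle.
        test_case.check_jacobian(module, inputs)
    else:
        # gradcheck skips non-differentiable (e.g. LongTensor) inputs automatically,
        # so no jacobian_input flag is needed.
        torch.autograd.gradcheck(module, inputs)
```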
Test Plan:
- Code reading
- run test_nn.py
Reviewed By: albanD
Differential Revision: D25622929
Pulled By: zou3519
fbshipit-source-id: 8d831ada98b6a95d63f087ea9bce1b574c996a22
Summary:
I think these can be safely removed since the minimum supported Python version is now 3.6.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47822
Reviewed By: smessmer
Differential Revision: D24954936
Pulled By: ezyang
fbshipit-source-id: 5d4b2aeb78fc97d7ee4abaf5fb2aae21bf765e8b
Summary:
Fixed test:
- `test_is_nonzero`: this asserted an exact error-message match, which is flaky when `TORCH_SHOW_CPP_STACKTRACES=1`; I changed it to a non-exact assert (see the sketch after this list)
- `test_pinverse` TF32
- `test_symeig` TF32
- `test_triangular_solve_batched_many_batches_cpu_float64` precision on CPU BLAS
- `test_qr` TF32, as well as a tensor factory that was missing `dtype=dtype`
- `test_lu` TF32
- `ConvTranspose2d` TF32
- `Conv3d_1x1x1_no_bias` TF32
- `Transformer*` TF32
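A sketch of the non-exact assertion for `test_is_nonzero` (illustrative; the regex matches only a fragment of the message so an appended C++ stack trace does not break the test):

```python
import unittest
import torch

class IsNonzeroTest(unittest.TestCase):
    def test_is_nonzero(self):
        # Match part of the error message instead of asserting the full text.
        with self.assertRaisesRegex(RuntimeError, "ambiguous"):
            torch.is_nonzero(torch.tensor([]))
```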
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46941
Reviewed By: heitorschueroff
Differential Revision: D24852725
Pulled By: mruberry
fbshipit-source-id: ccd4740cc643476178d81059d1c78da34e5082ed
Summary:
Follow-up of https://github.com/pytorch/pytorch/issues/46461 with a similar goal
Makes them more readable and possibly faster. Care has to be taken because a list comprehension `[f(x) for x in xs]` builds the whole list immediately, while `map` and generator expressions like `(f(x) for x in xs)` are evaluated lazily. Staying lazy is a benefit in some cases where it is not required to actually create the list of values in memory (e.g. when passing to `tuple` or `extend` or `join`)
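A tiny illustration of the equivalences involved (plain Python, no PyTorch specifics):

```python
xs = range(5)

joined_map = ", ".join(map(str, xs))        # lazy: items produced as join consumes them
joined_gen = ", ".join(str(x) for x in xs)  # lazy generator expression, same behaviour
squares = [x * x for x in xs]               # eager: builds the full list in memory
assert joined_map == joined_gen == "0, 1, 2, 3, 4"
```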
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46462
Reviewed By: zou3519
Differential Revision: D24422343
Pulled By: ezyang
fbshipit-source-id: 252e33499c92ac0b15238f2df32681dbbda2b237
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45291
It's not necessary; you can just check whether the dtype is integral.
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23911963
Pulled By: gchanan
fbshipit-source-id: 230139e1651eb76226f4095e31068dded30e03e8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44433
Not entirely sure why, but changing the type of beta from `float` to `double` in autocast_mode.cpp and FunctionsManual.h fixes my compiler errors, failing instead at link time.
Fixed some type errors and updated the fn signature in a few more files.
Removed my usage of Scalar, making beta a double everywhere instead.
Test Plan: Imported from OSS
Reviewed By: mrshenli
Differential Revision: D23636720
Pulled By: bdhirsh
fbshipit-source-id: caea2a1f8dd72b3b5fd1d72dd886b2fcd690af6d
Summary:
- The thresholds of some tests are bumped up. Depending on the random generator, these tests sometimes fail with things like "0.0059 is not smaller than 0.005". I ran `test_nn.py` and `test_torch.py` 10+ times to check that these are no longer flaky.
- Add `tf32_on_and_off` to new `matrix_exp` tests.
- Disable TF32 on test suites other than `test_nn.py` and `test_torch.py` (see the sketch below)
cc: ptrblck
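For reference, the public knobs that control TF32 (shown as a sketch; the test harness may set equivalent internal switches instead):

```python
import torch

# Disable TF32 for matmuls (cuBLAS) and convolutions (cuDNN) so results are full FP32.
torch.backends.cuda.matmul.allow_tf32 = False
torch.backends.cudnn.allow_tf32 = False
```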
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44240
Reviewed By: mruberry
Differential Revision: D23882498
Pulled By: ngimel
fbshipit-source-id: 44a9ec08802c93a2efaf4e01d7487222478b6df8
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44894
Looks like we added double backwards support but only turned on the ModuleTests.
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23762544
Pulled By: gchanan
fbshipit-source-id: b5cef579608dd71f3de245c4ba92e49216ce8a5e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44786
This predates gradcheck and gradcheck does the same and more.
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23731902
Pulled By: gchanan
fbshipit-source-id: 425fd30e943194f63a663708bada8960265b8f05
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44486
SmoothL1Loss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.
This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path for when target.requires_grad was True
3) modifies the SmoothL1Loss CriterionTests to verify that the target derivative is checked (see the gradcheck sketch below).
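A sketch of the property this enables (assumed test shape; the actual CriterionTest wiring differs): with the target derivative defined through derivatives.yaml, a plain gradcheck over both input and target should pass.

```python
import torch
import torch.nn.functional as F
from torch.autograd import gradcheck

inp = torch.randn(5, dtype=torch.double, requires_grad=True)
tgt = torch.randn(5, dtype=torch.double, requires_grad=True)
gradcheck(lambda x, t: F.smooth_l1_loss(x, t), (inp, tgt))
```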
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23630699
Pulled By: gchanan
fbshipit-source-id: 0f94d1a928002122d6b6875182867618e713a917
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44471
L1Loss had a completely different (and incorrect, see #43228) path when target.requires_grad was True.
This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path for when target.requires_grad was True
3) modifies the L1Loss CriterionTests to verify that the target derivative is checked.
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23626008
Pulled By: gchanan
fbshipit-source-id: 2828be16b56b8dabe114962223d71b0e9a85f0f5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44437
MSELoss had a completely different (and incorrect, see https://github.com/pytorch/pytorch/issues/43228) path when target.requires_grad was True.
This PR does the following:
1) adds derivative support for target via the normal derivatives.yaml route
2) kills the different (and incorrect) path for when target.requires_grad was True
3) modifies the MSELoss CriterionTests to verify that the target derivative is checked.
TODO:
1) do we still need check_criterion_jacobian when we run grad/gradgrad checks?
2) ensure the Module tests check when target.requires_grad
3) do we actually test when reduction='none' and reduction='mean'?
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D23612166
Pulled By: gchanan
fbshipit-source-id: 4f74d38d8a81063c74e002e07fbb7837b2172a10
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44398
These end up executing the same tests, so no reason to have them separate.
Test Plan: Imported from OSS
Reviewed By: mruberry
Differential Revision: D23600855
Pulled By: gchanan
fbshipit-source-id: 0952492771498bf813f1bf8e1d7c8dce574ec965
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43958
There is no difference between these tests (I'm merging them), so let's merge them in the JIT as well.
Test Plan: Imported from OSS
Reviewed By: mruberry
Differential Revision: D23452337
Pulled By: gchanan
fbshipit-source-id: e6d13cdb164205eec3dbb7cdcd0052b02c961778
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44381
Perhaps this was necessary when the test was originally introduced, but it's difficult to figure out what is actually being tested. And I don't think we actually use NotImplementedErrors.
Test Plan: Imported from OSS
Reviewed By: mruberry
Differential Revision: D23598646
Pulled By: gchanan
fbshipit-source-id: aa18154bfc4969cca22323e61683a301198823be
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44060
Right now it skips grad checks as well.
Test Plan: Imported from OSS
Reviewed By: zou3519
Differential Revision: D23484018
Pulled By: gchanan
fbshipit-source-id: 24a8f1af41f9918aaa62bc3cd78b139b2f8de1e1
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44055
There is no functional change here. Another patch will rename NewCriterionTest to CriterionTest.
Test Plan: Imported from OSS
Reviewed By: zou3519
Differential Revision: D23482572
Pulled By: gchanan
fbshipit-source-id: de364579067e2cc9de7df6767491f8fa3a685de2