pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Edward Z. Yang	3638089755	Ported reshape to symints and added a shim for BC (#85998 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85998 Approved by: https://github.com/ezyang	2022-10-02 17:46:00 +00:00
Horace He	a4bd89b267	Revert "Revert "Symintified mmm/addmm derivative formulas (#85794 )"" (#85820 ) This reverts commit `823dc33b00`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85820 Approved by: https://github.com/huydhn	2022-09-28 17:34:11 +00:00
PyTorch MergeBot	a0b1693996	Revert "Update `amax/amin/norm/count_nonzero` signatures with `int[*]? dim` (#83300 )" This reverts commit `1c0f0b33a0`. Reverted https://github.com/pytorch/pytorch/pull/83300 on behalf of https://github.com/jeffdaily due to The commit breaks nvfuser tests	2022-09-28 17:04:53 +00:00
PyTorch MergeBot	823dc33b00	Revert "Symintified mmm/addmm derivative formulas (#85794 )" This reverts commit `230edd2515`. Reverted https://github.com/pytorch/pytorch/pull/85794 on behalf of https://github.com/janeyx99 due to Sorry, reverting as this breaks an aot_autograd mac test on functorch `230edd2515`	2022-09-28 16:02:05 +00:00
Horace He	230edd2515	Symintified mmm/addmm derivative formulas (#85794 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85794 Approved by: https://github.com/ezyang	2022-09-28 14:07:57 +00:00
Edward Z. Yang	793488cda2	Revert "Revert "Symintifying slice ops (#85196 )"" (#85746 ) This reverts commit `3a171dfb0c`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85746 Approved by: https://github.com/albanD	2022-09-28 04:37:35 +00:00
Kurt Mohler	1c0f0b33a0	Update `amax/amin/norm/count_nonzero` signatures with `int[]? dim` (#83300 ) Changes `dim` arg to use `int[]?` type for the following functions in `native_funcitons.yaml`: * `amax` * `amin` * `norm` * `frobenius_norm` * `native_norm` * `count_nonzero` Part of #29137 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83300 Approved by: https://github.com/ngimel, https://github.com/albanD, https://github.com/kulinseth	2022-09-28 01:56:37 +00:00
PyTorch MergeBot	572dd862c4	Revert "Update `amax/amin/norm/count_nonzero` signatures with `int[*]? dim` (#83300 )" This reverts commit `8c7c7ed322`. Reverted https://github.com/pytorch/pytorch/pull/83300 on behalf of https://github.com/huydhn due to The commit pin breaks XLA test somehow	2022-09-28 01:36:43 +00:00
Kurt Mohler	8c7c7ed322	Update `amax/amin/norm/count_nonzero` signatures with `int[]? dim` (#83300 ) Changes `dim` arg to use `int[]?` type for the following functions in `native_funcitons.yaml`: * `amax` * `amin` * `norm` * `frobenius_norm` * `native_norm` * `count_nonzero` Part of #29137 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83300 Approved by: https://github.com/ngimel, https://github.com/albanD, https://github.com/kulinseth	2022-09-27 23:50:04 +00:00
PyTorch MergeBot	3a171dfb0c	Revert "Symintifying slice ops (#85196 )" This reverts commit `4c01c51266`. Reverted https://github.com/pytorch/pytorch/pull/85196 on behalf of https://github.com/atalman due to Break internal build Exutorch	2022-09-27 18:01:27 +00:00
Brian Hirsh	4a2d2e5e40	Change API type `Tensor[]` for structured kernels. (#73350 ) Partially fixes: #66328 This PR: - adds support for `ITensorList` to the dispatcher for: - computing the dispatch key - boxing and unboxing `ITensorList` - modified the codegen for structured kernels: - codegen APIs use `ITensorList` instead of `ArrayRef<Tensor>` Changes summary: - Signature changes due to the different APIs: - dispatcher API (e.g. `BatchingRegistrations.cpp`) - C++ API (e.g. `TensorShape.cpp`) - Miscelaneous functions used by codegen'd functions (e.g. `FunctionalTensorWrapper.`) - Dispatcher changes for handling `ITensorList` correctly (e.g. `DispatchKeyExtractor.h`) - Signature changes of `at::cat` due to the need of `const` inside `TensorBody.h` - Forward declarations of `ITensorList` (e.g. `MethodOperators.h`) - Codegen changes, special casing structured kernels (e.g. `gen.py`) Short description of structured kernels special casing:* I introduced, mainly, 5 types of changes to the codegen for generating code depending on whether the kernel is structured or not: 1. Added a `structured_type_override` flag to the `argument_type` function definition of the affected APIs (mainly the dispatcher and C++ APIs). - `api/cpp.py`, `api/dispatcher.py`, `api/native.py` 2. Added a `structured_type_override` member to the signature classes (e.g. `CppSignature`), since `FunctionSchema` doesn't really know whether the function is structured or not - `api/types.py` 3. Added a `part_of_structured_group` to `NativeFunction` class, which is just a convenient function to forward to `structured_type_override` wherever needed - `model.py` 4. Appropriately changed the rest of the codegen, whenever it used either the signature classes or the `arguments` function directly 5. Added a check for `const ITensorList&` type wherever there was a check for `TensorList` Pull Request resolved: https://github.com/pytorch/pytorch/pull/73350 Approved by: https://github.com/bdhirsh	2022-09-26 21:46:38 +00:00
Edward Z. Yang	4c01c51266	Symintifying slice ops (#85196 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85196 Approved by: https://github.com/ezyang	2022-09-23 22:01:32 +00:00
Mikayla Gawarecki	77f1f98479	Re-introduce `torch.Tensor.to_padded_tensor` (#85293 ) Differential Revision: [D39629004](https://our.internmc.facebook.com/intern/diff/D39629004) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85293 Approved by: https://github.com/cpuhrsch	2022-09-21 18:45:56 +00:00
Edward Z. Yang	3eb27229dd	as_strided symbolic support (#85264 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39662820](https://our.internmc.facebook.com/intern/diff/D39662820) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85264 Approved by: https://github.com/wconstab	2022-09-21 13:34:55 +00:00
Benoit Steiner	86d8c61c7c	Revert D39583438: Multisect successfully blamed D39583438 for test or build failures (#85277 ) Summary: This diff is reverting D39583438 D39583438 has been identified to be causing the following test or build failures: Tests affected: - https://www.internalfb.com/intern/test/281475048999851/ Here's the Multisect link: https://www.internalfb.com/intern/testinfra/multisect/1260522 Here are the tasks that are relevant to this breakage: T124797105: 18 tests started failing for employee benoitsteiner in the last 2 weeks We're generating a revert to back out the changes in this diff, please note the backout may land if someone accepts it. Test Plan: NA Reviewed By: benoitsteiner Differential Revision: D39599694 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85277 Approved by: https://github.com/dagitses	2022-09-20 15:38:58 +00:00
kshitij12345	a4dca9822d	[composite compliance] prod (#81969 ) Ref: #69991 Also fixes #82644 (fix similar to #81617) For CompositeCompliance, we can't use `item` to choose a special fast-path when Tensor is a Subclass. Instead we always dispatch to the slower but safer implementation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81969 Approved by: https://github.com/zou3519	2022-09-20 08:03:36 +00:00
Thomas Viehmann	e41d758e26	Handle implicit real->complex casting for backward of stack (#84993 ) Fixes: #75852 P.S.: Yay for the PyTorch foundation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84993 Approved by: https://github.com/soulitzer	2022-09-19 21:20:34 +00:00
lezcano	d710c95cc0	Implement forward AD for scatter_reduce (#85000 ) I left the case `reduction="prod"` for future work as it's a bit of a pain. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85000 Approved by: https://github.com/soulitzer	2022-09-16 17:45:07 +00:00
Elias Ellison	54c9c4e73d	Flip fake tensors on in aot autograd (#84968 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84968 Approved by: https://github.com/Chillee	2022-09-16 15:27:48 +00:00
Pearu Peterson	a225f3cfce	torch.zero_ on a sparse compressed tensor resets nnz to 0 (#85030 ) Fixes #84997 and #82683 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85030 Approved by: https://github.com/cpuhrsch	2022-09-15 18:42:38 +00:00
Richard Zou	3a107bc9be	[functorch] fix vmapvjpvjp test for prelu (#84939 ) Turns out this is just a composite compliance issue. Branching on if something requires grad or not can lead to incorrect gradients if we have a BatchedTensor wrapping a tensor that requires grad. Test Plan: - tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/84939 Approved by: https://github.com/soulitzer	2022-09-15 00:36:30 +00:00
Mikayla Gawarecki	e217b30b0f	Add `torch.nested` namespace (#84102 ) First step towards #83775 - only `to_padded_tensor` is moved to the nested namespace for now - following the schema used for `special`, `fft`, `linalg` and other namespaces, nested functions are registered in native_functions.yaml as `nested_{function_name}` and are bound to the desired Python name in `torch/nested/__init__.py`, and the desired C++ name in `torch/csrc/api/include/torch/nested.h`. ~~Question: should we keep the documentation for `Tensor.to_padded_tensor` or can this deleted since it is shared by `torch.nested.to_padded_tensor`?~~ [generated nested docs](https://docs-preview.pytorch.org/84102/nested.html?highlight=nested#module-torch.nested) Differential Revision: [D39361148](https://our.internmc.facebook.com/intern/diff/D39361148) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84102 Approved by: https://github.com/drisspg	2022-09-12 16:31:05 +00:00
Ivan Yashchuk	01c54ad6de	Remove deprecated torch.eig (#70982 ) The time has come to remove deprecated linear algebra related functions. This PR removes `torch.eig`. cc @jianyuh @nikitaved @pearu @mruberry @walterddr @IvanYashchuk @xwang233 @Lezcano Pull Request resolved: https://github.com/pytorch/pytorch/pull/70982 Approved by: https://github.com/Lezcano, https://github.com/malfet	2022-09-09 21:31:57 +00:00
nikitaved	3eb16509c7	optimize householder product backward to be more memory-efficient (#84627 ) A follow-up on discussions in https://github.com/pytorch/pytorch/pull/84180. Makes backward more memory efficient with the lesser number of kernel calls. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84627 Approved by: https://github.com/kshitij12345, https://github.com/zou3519	2022-09-07 15:29:47 +00:00
kshitij12345	07d398fb26	[composite compliance] linalg_householder_product (#84180 ) Ref: #69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/84180 Approved by: https://github.com/zou3519	2022-09-07 09:33:37 +00:00
kshitij12345	65ea3d0621	[composite compliance] cov, corrcoef (#82954 ) Ref: #69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82954 Approved by: https://github.com/zou3519	2022-08-26 15:14:37 +00:00
Mario Lezcano	3e6e0a1d10	Support a stable double backward on linalg.det for real inputs (#80217 ) The complex case still fails. I do not know why. Fixes https://github.com/pytorch/pytorch/issues/62327 Fixes https://github.com/pytorch/pytorch/issues/53364 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80217 Approved by: https://github.com/nikitaved, https://github.com/albanD, https://github.com/malfet	2022-08-24 15:18:56 +00:00
Mario Lezcano	aad89bb771	Make the derivative of masked_fill more efficient (#83515 ) There's no need to add all the zeros if we extract all the non-zero elements. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83515 Approved by: https://github.com/albanD, https://github.com/soulitzer	2022-08-18 13:00:12 +00:00
Kurt Mohler	be5b3df6cc	Update `std_mean/var_mean/nanmean/nansum` signatures with `int[1]? dim` (#82912 ) ### Description Change the type of the `dim` arg for `std_mean/var_mean/nanmean/nansum` to `int[1]?` in `native_functions.yaml` ### Issue Part of #29137 ### Testing Pull Request resolved: https://github.com/pytorch/pytorch/pull/82912 Approved by: https://github.com/albanD	2022-08-10 16:58:26 +00:00
kshitij12345	10e7a25488	[composite compliance] eig_backward (#82957 ) Ref #69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82957 Approved by: https://github.com/zou3519	2022-08-08 15:18:48 +00:00
Kurt Mohler	2bfae07a79	Enable `dim=None` for `torch.mean` (#81286 ) Part of #79525 This will require coordination with XLA before merging, just like #79881 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81286 Approved by: https://github.com/albanD	2022-07-28 22:34:56 +00:00
Nikolay Korovaiko	d2c47d559c	Revert "Revert "Enabling SymInt in autograd; take 3 (#81145 )"" ; make sure is_intlist checks for symintnodes (#82189 ) ### Description <!-- What did you change and why was it needed? --> ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82189 Approved by: https://github.com/ezyang	2022-07-26 20:47:11 +00:00
lezcano	11fe277b62	[PrimTorch] Add reference for torch.norm (#81765 ) This ref does more things than `torch.norm`, and it fixes a few bugs that `torch.norm` has. This implementation and the `torch.norm` implementation come to terms in the next PR of this stack We put this PR before, as otherwise `test_decomp.py` was failing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81765 Approved by: https://github.com/ngimel	2022-07-25 19:57:21 +00:00
Kshiteej K	db0e121b46	[composite compliance] put, take (#81094 ) Reference: #69991 This PR makes `put` CompositeExplicit as it is implemented in terms of `put_` (for which we can't handle Composite Compliance at the implementation level). Ref (put implementation) `478081c698/aten/src/ATen/native/TensorAdvancedIndexing.cpp (L619-L621)` Also, we update the `take` gradient formula to handle Tensor Subclass . Pull Request resolved: https://github.com/pytorch/pytorch/pull/81094 Approved by: https://github.com/zou3519	2022-07-25 15:05:16 +00:00
kshitij12345	5880a66758	[composite compliance] matrix_exp (#81225 ) Ref: #69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81225 Approved by: https://github.com/zou3519	2022-07-25 11:11:29 +00:00
PyTorch MergeBot	c078476eb0	Revert "Enabling SymInt in autograd; take 3 (#81145 )" This reverts commit `032facd6e6`. Reverted https://github.com/pytorch/pytorch/pull/81145 on behalf of https://github.com/jeanschmidt due to breaking internal builds	2022-07-22 11:15:20 +00:00
Nikolay Korovaiko	032facd6e6	Enabling SymInt in autograd; take 3 (#81145 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/81145 Approved by: https://github.com/ezyang	2022-07-22 00:14:50 +00:00
Edward Z. Yang	84c8a9f88e	Use slow but safe formula for prod_backward (#81617 ) prod performs a sync to test for zeros as the formula is substantially simpler if there are no zeros, but this doesn't work for meta tensors. The double backwards formula works great in all cases though! Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/81617 Approved by: https://github.com/soulitzer	2022-07-18 18:45:32 +00:00
PyTorch MergeBot	4963adcc8d	Revert "[composite compliance] matrix_exp (#81225 )" This reverts commit `367c695237`. Reverted https://github.com/pytorch/pytorch/pull/81225 on behalf of https://github.com/clee2000 due to broke functorch https://github.com/pytorch/pytorch/runs/7345901504?check_suite_focus=true	2022-07-14 19:53:51 +00:00
kshitij12345	367c695237	[composite compliance] matrix_exp (#81225 ) Ref: #69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81225 Approved by: https://github.com/zou3519	2022-07-14 18:19:11 +00:00
lezcano	b5b9db9f84	Make `kl_div` a composite function. (#80334 ) Benchmarks: https://github.com/pytorch/pytorch/pull/80334#issuecomment-1167229285 Fixes https://github.com/pytorch/pytorch/issues/80158 Fixes https://github.com/pytorch/pytorch/issues/78867 Fixes https://github.com/pytorch/pytorch/issues/69230 Supersedes https://github.com/pytorch/pytorch/pull/79007 Supersedes https://github.com/pytorch/pytorch/pull/69212 Supersedes https://github.com/pytorch/pytorch/pull/19659 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80334 Approved by: https://github.com/ezyang	2022-07-13 20:07:36 +00:00
Kurt Mohler	23bdb570cf	Reland: Enable `dim=None` for `torch.sum` (#79881 ) Part of #29137 Reland of #75845 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79881 Approved by: https://github.com/albanD, https://github.com/kulinseth	2022-07-09 00:54:42 +00:00
PyTorch MergeBot	f2c8557521	Revert "Make `kl_div` a composite function. (#80334 )" This reverts commit `828c787ea9`. Reverted https://github.com/pytorch/pytorch/pull/80334 on behalf of https://github.com/ezyang due to doesn't work with xla	2022-07-06 17:51:06 +00:00
lezcano	828c787ea9	Make `kl_div` a composite function. (#80334 ) Benchmarks: https://github.com/pytorch/pytorch/pull/80334#issuecomment-1167229285 Fixes https://github.com/pytorch/pytorch/issues/80158 Fixes https://github.com/pytorch/pytorch/issues/78867 Fixes https://github.com/pytorch/pytorch/issues/69230 Supersedes https://github.com/pytorch/pytorch/pull/79007 Supersedes https://github.com/pytorch/pytorch/pull/69212 Supersedes https://github.com/pytorch/pytorch/pull/19659 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80334 Approved by: https://github.com/ezyang	2022-07-04 19:33:43 +00:00
lezcano	37a5819665	Make slogdet, linalg.sloget and logdet support metatensors (#79742 ) This PR also adds complex support for logdet, and makes all these functions support out= and be composite depending on one function. We also extend the support of `logdet` to complex numbers and improve the docs of all these functions. We also use `linalg_lu_factor_ex` in these functions, so we remove the synchronisation present before. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79742 Approved by: https://github.com/IvanYashchuk, https://github.com/albanD	2022-07-01 16:09:21 +00:00
Hao Zhuang	0ca9888000	Correct the math of repeat_backward in the function comment (#80286 ) Correct the math of repeat_backward in the function comment. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80286 Approved by: https://github.com/albanD	2022-06-28 16:22:46 +00:00
lezcano	42a2359612	Add forward AD for linalg.det and simplify its backward (#79487 ) This PR is in preparation for implementing `logdet` and `slogdet` as structured kernels + implementing them with more efficient derivatives We implement forward AD for det. We also simplify the implementation of the backward, and leave a note on how to implement it properly for singular matrices. We leave thad for future work. Note (by looking at the OpInfo) that the current implementation passes the same tests as the one before. We skip the forward-over-backward in the singular case, as that one was not working in the gradgrad case either. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79487 Approved by: https://github.com/nikitaved, https://github.com/albanD	2022-06-24 14:15:17 +00:00
lezcano	44ff6be35a	Fix backward of binary_cross_entropy_with_logits The previous PR in this stack uncovered an error in the forward over backward for this function. In this PR, we fix this error and we also fix the gradgrad implementation (and make it more stable and faster using `logsigmoid`). We also move the double backward for this function to `FunctoinsManual` as there's no reason for it to be in `native_functions` Pull Request resolved: https://github.com/pytorch/pytorch/pull/80083 Approved by: https://github.com/zou3519	2022-06-23 01:31:08 +00:00
lezcano	f54e7b4ad6	More forward AD formulas This PR: - Corrects the forward AD formula of `torch.sgn`. - The reason why we can't use `auto_element_wise` for this operations is rather subtle. I left a comment. - This, in turn, fixes a problem we had in forward-over-backward for `linalg.svd` and other spectral decompositions (and `norm`, `linalg.norm`, `linalg.matrix_norm`) that were using `torch.abs` (whose derivative is given by `torch.sgn`. - Implement the formula for a number of missing operations `nansum`, `amax`, `amin`... - Simplified a few formulas, most notably the forward AD for `div` and the derivative of `norm`, `linalg.norm` and `vector_norm` for `ord=+-inf`. - Correct the formula for `mean`, `std_mean`, `var_mean` when `dim` is provided and equal to `()` (or `None`) - A few minor improvements to `sum_backward`, `unsqueeze_multiple` and formulas depending on them - Fix the derivatives of `std_mean` and `std_var` (complex support, ASAN, forward AD...) Fixes: https://github.com/pytorch/pytorch/issues/67539 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80082 Approved by: https://github.com/zou3519	2022-06-23 01:31:08 +00:00
PyTorch MergeBot	e3d0a3ca88	Revert "More forward AD formulas" This reverts commit `6b20ef6b91`. Reverted https://github.com/pytorch/pytorch/pull/77975 on behalf of https://github.com/janeyx99 due to I think this is the real culprit of the broken tests in `28a7ee8cec` for the trunk-only slow test job	2022-06-22 19:30:02 +00:00
PyTorch MergeBot	942c371bbc	Revert "Fix backward of binary_cross_entropy_with_logits" This reverts commit `28a7ee8cec`. Reverted https://github.com/pytorch/pytorch/pull/79381 on behalf of https://github.com/janeyx99 due to Sorry, `28a7ee8cec` this PR breaks trunk-only slow test job	2022-06-22 17:41:09 +00:00
lezcano	28a7ee8cec	Fix backward of binary_cross_entropy_with_logits The previous PR in this stack uncovered an error in the forward over backward for this function. In this PR, we fix this error and we also fix the gradgrad implementation (and make it more stable and faster using `logsigmoid`). We also move the double backward for this function to `FunctoinsManual` as there's no reason for it to be in `native_functions` Pull Request resolved: https://github.com/pytorch/pytorch/pull/79381 Approved by: https://github.com/soulitzer	2022-06-22 14:28:56 +00:00
lezcano	6b20ef6b91	More forward AD formulas This PR: - Corrects the forward AD formula of `torch.sgn`. - The reason why we can't use `auto_element_wise` for this operations is rather subtle. I left a comment. - This, in turn, fixes a problem we had in forward-over-backward for `linalg.svd` and other spectral decompositions (and `norm`, `linalg.norm`, `linalg.matrix_norm`) that were using `torch.abs` (whose derivative is given by `torch.sgn`. - Implement the formula for a number of missing operations `nansum`, `amax`, `amin`... - Simplified a few formulas, most notably the forward AD for `div` and the derivative of `norm`, `linalg.norm` and `vector_norm` for `ord=+-inf`. - Correct the formula for `mean`, `std_mean`, `var_mean` when `dim` is provided and equal to `()` (or `None`) - A few minor improvements to `sum_backward`, `unsqueeze_multiple` and formulas depending on them - Fix the derivatives of `std_mean` and `std_var` (complex support, ASAN, forward AD...) Fixes: https://github.com/pytorch/pytorch/issues/67539 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77975 Approved by: https://github.com/soulitzer	2022-06-22 14:28:56 +00:00
Driss Guessous	a098937c20	Add factory function derivatives (#79872 ) Adding derivatives for factory functions, this issue is used for tracking: #79044 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79872 Approved by: https://github.com/cpuhrsch, https://github.com/soulitzer	2022-06-21 00:53:11 +00:00
lezcano	16f30b494c	Make l1_loss composite Fixing the forward AD for `sgn` in the next PR of this stack uncovered a number of issues with the derivatives of `l1_loss`. Upon inspection, `l1_loss` was just implemented as a composite function, but it was not differentiable. This PR makes it a fully differentiable function. As a side note, `l1_loss_out` was incorrect in a number of ways. Even more, it is not exposed to the public as `F.l1_loss` does not accept an `out=` parameter. As such it is not even tested. I wonder how useful is to have `out=` variants for loss functions if we don't expose them at all. Even more, I wonder how useful is to have `_out` variants for loss functions, given that their most normal use case is to return just a real number cc jbschlosser Pull Request resolved: https://github.com/pytorch/pytorch/pull/79804 Approved by: https://github.com/zou3519, https://github.com/malfet	2022-06-20 19:10:54 +00:00
PyTorch MergeBot	d4a9438786	Revert "Make l1_loss composite" This reverts commit `61a5c779bf`. Reverted https://github.com/pytorch/pytorch/pull/78257 on behalf of https://github.com/malfet due to This breaks executorch	2022-06-17 18:14:21 +00:00
Kshiteej K	04b98df87a	[fix] composite compliance: eig, eigh, symeig (#79698 ) Ref: https://github.com/pytorch/pytorch/issues/69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79698 Approved by: https://github.com/Lezcano, https://github.com/albanD	2022-06-17 14:13:04 +00:00
PyTorch MergeBot	ee6ebfc06b	Revert "Enable `dim=None` for `torch.sum` (#75845 )" This reverts commit `e79a51f7db`. Reverted https://github.com/pytorch/pytorch/pull/75845 on behalf of https://github.com/malfet due to Breaks MacOS builds, see `e79a51f7db`	2022-06-16 22:01:41 +00:00
Kurt Mohler	e79a51f7db	Enable `dim=None` for `torch.sum` (#75845 ) Part of #29137 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75845 Approved by: https://github.com/ezyang	2022-06-16 20:17:07 +00:00
lezcano	61a5c779bf	Make l1_loss composite Fixing the forward AD for `sgn` in the next PR of this stack uncovered a number of issues with the derivatives of `l1_loss`. Upon inspection, `l1_loss` was just implemented as a composite function, but it was not differentiable. This PR makes it a fully differentiable function. As a side note, `l1_loss_out` was incorrect in a number of ways. Even more, it is not exposed to the public as `F.l1_loss` does not accept an `out=` parameter. As such it is not even tested. I wonder how useful is to have `out=` variants for loss functions if we don't expose them at all. Even more, I wonder how useful is to have `_out` variants for loss functions, given that their most normal use case is to return just a real number cc jbschlosser Pull Request resolved: https://github.com/pytorch/pytorch/pull/78257 Approved by: https://github.com/jbschlosser	2022-06-16 00:03:22 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
lezcano	54949a5abc	Simplify and optimize linalg.solve This PR heavily simplifies the code of `linalg.solve`. At the same time, this implementation saves quite a few copies of the input data in some cases (e.g. A is contiguous) We also implement it in such a way that the derivative goes from computing two LU decompositions and two LU solves to no LU decompositions and one LU solves. It also avoids a number of unnecessary copies the derivative was unnecessarily performing (at least the copy of two matrices). On top of this, we add a `left` kw-only arg that allows the user to solve `XA = B` rather concisely. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74046 Approved by: https://github.com/nikitaved, https://github.com/IvanYashchuk, https://github.com/mruberry	2022-06-11 04:06:40 +00:00
PyTorch MergeBot	3556457dd2	Revert "`kl_div`: fix for grads wrt `target`, double backward, forward-over-reverse AD support. (#79007 )" This reverts commit `72ad222cff`. Reverted https://github.com/pytorch/pytorch/pull/79007 on behalf of https://github.com/janeyx99 due to Broke test_fn_fwgrad_bwgrad_nn_functional_kl_div_cpu_float64 on trunk https://hud.pytorch.org/minihud?name_filter=pull%20/%20linux-xenial-py3.7-clang7-asan%20/%20test%20(default,%202,%205,%20linux.2xlarge)	2022-06-09 13:07:03 +00:00
Nikita Vedeneev	72ad222cff	`kl_div`: fix for grads wrt `target`, double backward, forward-over-reverse AD support. (#79007 ) Fixes https://github.com/pytorch/pytorch/issues/78867, fixes https://github.com/pytorch/pytorch/issues/65466. Adds forward-over-reverse AD support. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79007 Approved by: https://github.com/soulitzer, https://github.com/jbschlosser	2022-06-09 09:06:52 +00:00
lezcano	c7d6cec078	Add linalg.lu_solve This PR adds `linalg.lu_solve`. While doing so, I found a bug in MAGMA when calling the batched MAGMA backend with trans=True. We work around that by solving the system solving two triangular systems. We also update the heuristics for this function, as they were fairly updated. We found that cuSolver is king, so luckily we do not need to rely on the buggy backend from magma for this function. We added tests testing this function left and right. We also added tests for the different backends. We also activated the tests for AMD, as those should work as well. Fixes https://github.com/pytorch/pytorch/issues/61657 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77634 Approved by: https://github.com/malfet	2022-06-07 22:28:28 +00:00
Nikita Vedeneev	a4509f5b72	More forward-over-reverse implementations. (#78740 ) Umbrella issue: https://github.com/pytorch/pytorch/issues/75432. This one implements forward-over-reverse for: * mse_loss * l1_loss * smooth_l1_loss * softplus * hardswish (also adds double backward support) * prelu Pull Request resolved: https://github.com/pytorch/pytorch/pull/78740 Approved by: https://github.com/soulitzer	2022-06-03 15:44:06 +00:00
Brian Hirsh	5cc258ec9e	make block_diag composite compliant Pull Request resolved: https://github.com/pytorch/pytorch/pull/77716 Approved by: https://github.com/zou3519	2022-05-26 16:15:42 +00:00
Nikita Vedeneev	3924d56fae	`BCE` loss: forward-over-reverse AD support (#77852 ) Umbrella issue: https://github.com/pytorch/pytorch/issues/75432 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77852 Approved by: https://github.com/soulitzer	2022-05-26 14:36:52 +00:00
Brian Hirsh	07e4533403	reland of as_strided support for functionalization; introduce as_strided_scatter This reverts commit `a95f1edd85`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78199 Approved by: https://github.com/ezyang	2022-05-24 22:40:44 +00:00
PyTorch MergeBot	a95f1edd85	Revert "as_strided support for functionalization; introduce as_strided_scatter" This reverts commit `3a921f2d26`. Reverted https://github.com/pytorch/pytorch/pull/77128 on behalf of https://github.com/suo due to This broke rocm tests on master `3a921f2d26`. rocm tests are no longer run on PRs, you should add a `ciflow/trunk` label if you want to run them	2022-05-24 20:19:12 +00:00
Brian Hirsh	3a921f2d26	as_strided support for functionalization; introduce as_strided_scatter Pull Request resolved: https://github.com/pytorch/pytorch/pull/77128 Approved by: https://github.com/ezyang	2022-05-24 18:20:31 +00:00
lezcano	0c8c39fa71	Fix derivatives of norm(p=inf) Following up on https://github.com/pytorch/pytorch/pull/51099#discussion_r583323915, we fix these derivatives, as they were incorrect until now. As described in the note, the better solution would be to use vectorised operations on the preprocessing operation when reducing on CPU. It's not clear how difficult that may be. Fixes https://github.com/pytorch/pytorch/issues/67517 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78105 Approved by: https://github.com/ngimel	2022-05-24 17:16:16 +00:00
lezcano	e0295f55b5	Fix derivatives for linalg.vector_norm(..., dtype=) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/76551 Approved by: https://github.com/albanD	2022-05-19 21:17:18 +00:00
PyTorch MergeBot	7a4e3f329f	Revert "Fix derivatives for linalg.vector_norm(..., dtype=)" This reverts commit `13d8fb93bb`. Reverted https://github.com/pytorch/pytorch/pull/76551 on behalf of https://github.com/seemethere due to Reverting the entire stack, errors originated from * https://github.com/pytorch/pytorch/pull/76547 Failed internal builds due to ([Link for Meta Employees](https://www.internalfb.com/diff/D36494019?selected_signal=c2FuZGNhc3RsZV93b3JrZmxvd19ydW46MTgwMTQzOTg1MTUzNTQ3NzQ%3D&selected_signal_verification_phase=1&dst_version_fbid=1211273672948052)): ``` aten/src/ATen/native/LinearAlgebra.cpp:2496:9: error: unused type alias 'Int' [-Werror,-Wunused-local-typedef] using Int = IntArrayRef::value_type; ^ 1 error generated. Command failed with exit code 1. ```	2022-05-19 21:04:23 +00:00
Nikita Vedeneev	7945fa6ce2	`BCE` loss: forward ad support (#77755 ) As per title + BCE with logits gets a simpler implementation. Relevant for https://github.com/pytorch/pytorch/issues/71117 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77755 Approved by: https://github.com/soulitzer	2022-05-19 13:13:58 +00:00
lezcano	13d8fb93bb	Fix derivatives for linalg.vector_norm(..., dtype=) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/76551 Approved by: https://github.com/mruberry	2022-05-18 11:46:50 +00:00
Nikita Vedeneev	a760dc2687	`binary_cross_entropy`: double backwart wrt target (#77416 ) As per title. An effort to make `binary_cross_entropy` all around differentiable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77416 Approved by: https://github.com/soulitzer	2022-05-18 10:29:27 +00:00
lezcano	369d9f4137	A few forward AD formulas It includes all-time favourites like: - `put` - `nn.functional.embedding` - `prelu` - `nn.functional.bilinear` - `nn.functional.rrelu` - `nn.functional.logsigmoid` Pull Request resolved: https://github.com/pytorch/pytorch/pull/77421 Approved by: https://github.com/soulitzer	2022-05-17 15:55:51 +00:00
Mikayla Gawarecki	7ba4e124e6	Bugfix gradient formula for index_reduce('prod') + separate out sample_inputs for index_reduce Pull Request resolved: https://github.com/pytorch/pytorch/pull/77382 Approved by: https://github.com/cpuhrsch	2022-05-16 18:43:57 +00:00
Mikayla Gawarecki	841c65f499	Unprivate _index_reduce and add documentation Pull Request resolved: https://github.com/pytorch/pytorch/pull/76997 Approved by: https://github.com/cpuhrsch	2022-05-13 19:48:38 +00:00
jiayisun	97deda4f28	add BFloat16 support for logcumsumexp on CPU (#72694 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/72694 Approved by: https://github.com/VitalyFedyunin, https://github.com/frank-wei	2022-05-12 17:10:28 +00:00
Ivan Yashchuk	545d90f032	Sparse CSR: enable autograd for torch.sparse.addmm and torch.sparse.mm This PR updates the derivative rule for `torch.sparse.addmm` to be working with CSR sparse matrix. Notably `torch.sparse.sampled_addmm` is used in the backward function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76591 Approved by: https://github.com/cpuhrsch	2022-05-11 18:57:40 +00:00
PyTorch MergeBot	f94abd59f7	Revert "Sparse CSR: enable autograd for torch.sparse.addmm and torch.sparse.mm" This reverts commit `721a8ca697`. Reverted https://github.com/pytorch/pytorch/pull/76591 on behalf of https://github.com/janeyx99	2022-05-10 13:21:46 +00:00
Ivan Yashchuk	721a8ca697	Sparse CSR: enable autograd for torch.sparse.addmm and torch.sparse.mm This PR updates the derivative rule for `torch.sparse.addmm` to be working with CSR sparse matrix. Notably `torch.sparse.sampled_addmm` is used in the backward function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76591 Approved by: https://github.com/cpuhrsch	2022-05-10 08:44:55 +00:00
PyTorch MergeBot	4ebc4890dd	Revert "Add linalg.lu_solve" This reverts commit `fc5b4a5a33`. Reverted https://github.com/pytorch/pytorch/pull/72935 on behalf of https://github.com/malfet	2022-05-09 19:12:30 +00:00
Mikayla Gawarecki	465e0ae266	Bugfix scatter_reduce backward formulas Pull Request resolved: https://github.com/pytorch/pytorch/pull/76523 Approved by: https://github.com/albanD	2022-05-05 20:22:39 +00:00
lezcano	fc5b4a5a33	Add linalg.lu_solve This PR adds `linalg.lu_solve`. While doing so, I found a bug in MAGMA when calling the batched MAGMA backend with trans=True. We work around that by solving the system solving two triangular systems. We also update the heuristics for this function, as they were fairly updated. We found that cuSolver is king, so luckily we do not need to rely on the buggy backend from magma for this function. We added tests testing this function left and right. We also added tests for the different backends. We also activated the tests for AMD, as those should work as well. Fixes https://github.com/pytorch/pytorch/issues/61657 Pull Request resolved: https://github.com/pytorch/pytorch/pull/72935 Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry	2022-05-05 19:02:13 +00:00
Nikita Vedeneev	33fabe9a2e	`functional.max_unpool`: OpInfo tests + simpler backward + forward ad + fwad over backward ad Resolves https://github.com/pytorch/pytorch/issues/67657, https://github.com/pytorch/pytorch/issues/67658, https://github.com/pytorch/pytorch/issues/67660. These are not necessarily bugs because we cannot produce arbitrary samples coming from `max_pool` to the gradcheck's eternal satisfaction. This PR also replaces low-level complicated backward kernels with much simpler high-level and well-tested counterparts. The replacement is also faster (before: parallel for loop, after: memory layout optimized TensorIterator's parallelization coming from `gather`). cc @albanD @mruberry @jbschlosser @walterddr Pull Request resolved: https://github.com/pytorch/pytorch/pull/68625 Approved by: https://github.com/albanD	2022-05-05 10:13:51 +00:00
lezcano	7cb7cd5802	Add linalg.lu This PR modifies `lu_unpack` by: - Using less memory when unpacking `L` and `U` - Fuse the subtraction by `-1` with `unpack_pivots_stub` - Define tensors of the correct types to avoid copies - Port `lu_unpack` to be a strucutred kernel so that its `_out` version does not incur on extra copies Then we implement `linalg.lu` as a structured kernel, as we want to compute its derivative manually. We do so because composing the derivatives of `torch.lu_factor` and `torch.lu_unpack` would be less efficient. This new function and `lu_unpack` comes with all the things it can come: forward and backward ad, decent docs, correctness tests, OpInfo, complex support, support for metatensors and support for vmap and vmap over the gradients. I really hope we don't continue adding more features. This PR also avoids saving some of the tensors that were previously saved unnecessarily for the backward in `lu_factor_ex_backward` and `lu_backward` and does some other general improvements here and there to the forward and backward AD formulae of other related functions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/67833 Approved by: https://github.com/IvanYashchuk, https://github.com/nikitaved, https://github.com/mruberry	2022-05-05 09:17:05 +00:00
lezcano	1a4eea57be	Improve derivative of QR decomposition We derive and implement a more concise rule for the forward and backward derivatives of the QR decomposition. While doing this we: - Fix the composite compliance of `linalg.qr` and we make it support batches - Improve the performance and simplify the implementation of both foward and backward - Avoid saving the input matrix for the backward computation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76115 Approved by: https://github.com/nikitaved, https://github.com/albanD	2022-05-05 09:14:57 +00:00
Richard Zou	71ae190b87	[composite compliance] Fix a bunch of fft backwards Replaced `at::zeros(..., grad.options()).slice().copy_(grad))` with `grad.new_zeros(..., grad.options()).slice().copy_(grad))` Test Plan: - run tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/76573 Approved by: https://github.com/ngimel, https://github.com/albanD	2022-05-03 00:07:30 +00:00
Mikayla Gawarecki	676a4a3969	Prototype _index_reduce (CPU-only) Pull Request resolved: https://github.com/pytorch/pytorch/pull/75981 Approved by: https://github.com/cpuhrsch	2022-04-27 23:01:00 +00:00
Richard Zou	9cb2871f31	Fix forward-mode AD formula for binary_cross_entropy_with_logits The problem was that `grad_input` and `grad_target` may be ZeroTensors, which are immutable. This PR changes it so that operations on grad_input and grad_target in `binary_cross_entropy_with_logits_jvp` are no longer in-place. Test Plan: - run existing tests Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/76322 Approved by: https://github.com/soulitzer	2022-04-25 22:30:57 +00:00
lezcano	441aea4127	Update Choesky's forward and backward derivative This PR: - Derives formally a new rule for Cholesky (write-up to come) - Implements it without using in-place operations in the forward or backward. - Does not instantiate inverses explicitly, but rather it solves two triangular systems of equations (2 triang vs 1 triang and 2 matmuls should be comparable, but the first one should be more stable). Pull Request resolved: https://github.com/pytorch/pytorch/pull/76032 Approved by: https://github.com/nikitaved, https://github.com/albanD	2022-04-22 00:45:38 +00:00
Nikita Shulga	f6c275f55d	Remove `-Wno-unused-variable` from `utils.cmake` (take 2) (#75538 ) Summary: [Comment](https://github.com/pytorch/pytorch/pull/62445/files#r680132022) claims, it got added for consistency with top level CMakeLists.txt, but `-Wno-unused-variable` is not mentioned there. Modify violations in 50+ files that were added in the interim by either removing unused variables, or decorating the code with `C10_UNUSED` if local variable is likely used to extend object lifetime until the end of the block. Caused preventable revert in https://github.com/pytorch/pytorch/pull/72633#issuecomment-1092300787 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75538 Reviewed By: anjali411 Differential Revision: D35747333 Pulled By: malfet fbshipit-source-id: 3fc5828e44a4c05ba0e89e92613e6ebbdb260626 (cherry picked from commit c179fba21cfa2a0093fad50ccad5a22dd7cff52c)	2022-04-20 17:41:59 +00:00
Ivan Yashchuk	bba4780232	Enable autograd wrt sparse CSR tensors This pull request enables accumulating gradients for the CSR tensor. Functions that work and are tested: - tensor.abs() - tensor.neg() - tensor.conj_physical() - torch.addmm `torch.mm` also works, but tests will be added later. In addition, this PR adds throwing an error when trying to access strides, storage, and contiguity info on a CSR tensor. `tensor.to_sparse_csr().to_sparse_csr()` was failing and now fixed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75435 Approved by: https://github.com/cpuhrsch	2022-04-19 18:42:45 +00:00
PyTorch MergeBot	5c56b2286b	Revert "Remove `-Wno-unused-variable` from utils.cmake" This reverts commit `018cbe1f5c`. Reverted https://github.com/pytorch/pytorch/pull/75538 on behalf of https://github.com/seemethere	2022-04-19 17:19:09 +00:00
Nikita Shulga	018cbe1f5c	Remove `-Wno-unused-variable` from utils.cmake [Comment](https://github.com/pytorch/pytorch/pull/62445/files#r680132022) claims, it got added for consistency with top level CMakeLists.txt, but `-Wno-unused-variable` is not mentioned there. Modify violations in 50+ files that were added in the interim by either removing unused variables, or decorating the code with `C10_UNUSED` if local variable is likely used to extend object lifetime until the end of the block. Caused preventable revert in https://github.com/pytorch/pytorch/pull/72633#issuecomment-1092300787 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75538 Approved by: https://github.com/cpuhrsch	2022-04-19 15:26:55 +00:00
Peter Bell	cc56fac213	Fix complex to real casting warning in _to_copy backward Fixes #75781 A Real->Complex cast should result in a gradient with no imaginary component, so discarding the imaginary component is expected. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75805 Approved by: https://github.com/albanD	2022-04-19 14:04:13 +00:00
soulitzer	8721abc429	Add forward AD support for norm, dist, F.pairwise_dist, F.normalize Pull Request resolved: https://github.com/pytorch/pytorch/pull/74205 Approved by: https://github.com/albanD	2022-04-13 15:03:20 +00:00

1 2 3 4 5 ...

348 Commits