pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
cyy	4d51c8532c	Some simple fixes (#93221 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93221 Approved by: https://github.com/Skylion007	2023-01-30 05:14:03 +00:00
Aaron Gokaslan	0247ed27cc	Apply Clang-Tidy readability-container-size-empty (#93236 ) Not only is this change usually shorter and more readable, it also can yield better performance. size() is not always a constant time operation (such as on LinkedLists), but empty() always is. Pull Request resolved: https://github.com/pytorch/pytorch/pull/93236 Approved by: https://github.com/malfet	2023-01-29 23:28:19 +00:00
mfkasim1	75cfc0be21	Logcumsumexp for CPU (#93153 ) Partial work from #90847, in the direction of solving #89205. Most of the content is from #90847, but this is only for CPU, so hopefully it does not increase the build time by a lot. tag: @albanD, @malfet Pull Request resolved: https://github.com/pytorch/pytorch/pull/93153 Approved by: https://github.com/malfet, https://github.com/Skylion007	2023-01-27 22:29:33 +00:00
cyy	e292ddff4e	More clang-tidy fixes (#92944 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/92944 Approved by: https://github.com/Skylion007	2023-01-25 19:11:51 +00:00
PyTorch MergeBot	9b23fd378f	Revert "Logcumsumexp for complex in CPU and CUDA (#90847 )" This reverts commit `64985123e4`. Reverted https://github.com/pytorch/pytorch/pull/90847 on behalf of https://github.com/malfet due to Reverting to decrease build time, let's discuss the alternatives here	2023-01-24 20:49:08 +00:00
Aaron Gokaslan	8c8cd9539d	Add missing moves to torch autograd (#92772 ) Applies some additional std::move functions to torch/csrc/autograd to opportunities that were found via static analysis. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92772 Approved by: https://github.com/ezyang	2023-01-24 02:01:52 +00:00
Nikita Vedeneev	9f381c9b7f	sparse_sparse_matmul: simplify backward (#91712 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91712 Approved by: https://github.com/albanD	2023-01-23 19:24:28 +00:00
mfkasim1	64985123e4	Logcumsumexp for complex in CPU and CUDA (#90847 ) Another PR towards solving #89205. What's in this PR: * The implementation of forward `logcumsumexp` for complex numbers in CPU & CUDA * The tests on forward call of `logcumsumexp` for complex numbers * The implementation of backward `logcumsumexp` for complex numbers What's missing: * The test on backward gradient of `logcumsumexp` (it complaints `RuntimeError: logcumsumexp does not support automatic differentiation for outputs with complex dtype.` and I don't know how to solve the error and I don't know where to put the test for the backward computation). If possible, I'd like this to be done in this PR. It's really tricky to handle the edge cases here (i.e. the ones involving `inf`), but I've tried my best to put some comments explaining the reasonings of my decisions in this PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90847 Approved by: https://github.com/albanD	2023-01-20 15:10:50 +00:00
Peter Bell	4058dedf21	Replace log(1 + x) with log1p(x) (#92114 ) `log1p` offers better precision near zero since `(1 + x) - 1` truncates any values less than the float epsilon to zero. For `soft_margin_loss` this also requires one fewer kernel invocation which for numel=1e7 gives me a 1.2x speedup on CUDA and a 1.1x speedup on CPU. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92114 Approved by: https://github.com/ngimel, https://github.com/lezcano	2023-01-18 10:43:56 +00:00
Peter Bell	fb1427ea8f	squeeze: allow squeezing multiple dimensions at once (#89017 ) Ref #70924 This addresses part 1 of the issue, allowing `torch.squeeze` to be passed a tuple of dimensions. e.g. ```python x.squeeze(0).squeeze(0) ``` can now be written ```python x.squeeze((0, 1)) ``` (assuming x has at least 2 dimensions) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89017 Approved by: https://github.com/albanD	2023-01-17 14:20:15 +00:00
David Berard	d7dc1c2fd5	Support zero dimensions in softmax decompositions (#91322 ) The eager implementation of softmax supports computation along zero dimensions, but many of the other implementations did not, including: * decompositions & refs (this was causing dynamo failures) * forward AD for logsumexp * MPS log_softmax_backward This PR handles the `input.numel() == 0` cases separately to avoid running `amax()`, which fails for zero dimensions, and updates opinfos. example of "computation along zero dimensions": ```python # example of where import torch t = torch.rand((4, 0, 0)) print("~") print(torch.nn.functional.softmax(t, dim=-1)) # this passes print("~") torch._refs.softmax(t, dim=-1) # this fails print("~") ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/91322 Approved by: https://github.com/lezcano	2023-01-11 09:35:43 +00:00
PyTorch MergeBot	df4b3b13bc	Revert "squeeze: allow squeezing multiple dimensions at once (#89017 )" This reverts commit `e26cb06681`. Reverted https://github.com/pytorch/pytorch/pull/89017 on behalf of https://github.com/mehtanirav due to Internal breakages	2023-01-05 19:25:08 +00:00
Peter Bell	e26cb06681	squeeze: allow squeezing multiple dimensions at once (#89017 ) Ref #70924 This addresses part 1 of the issue, allowing `torch.squeeze` to be passed a tuple of dimensions. e.g. ```python x.squeeze(0).squeeze(0) ``` can now be written ```python x.squeeze((0, 1)) ``` (assuming x has at least 2 dimensions) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89017 Approved by: https://github.com/albanD	2023-01-04 14:40:56 +00:00
lezcano	d5163f5206	Fix NumPy broadcasting in lstsq_backward (#91460 ) Fixes https://github.com/pytorch/pytorch/issues/77225 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91460 Approved by: https://github.com/albanD	2022-12-30 10:49:20 +00:00
lezcano	051d16a2f7	Fix NumPy-compat broadcasting in the derivative of linalg.solve (#91456 ) Fixes https://github.com/pytorch/pytorch/issues/89761 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91456 Approved by: https://github.com/albanD	2022-12-30 10:49:20 +00:00
lezcano	484dd40022	Implement PReLU in a compositional way (#91238 ) The PReLU implementation was all over the place. This lead to a number of bugs like https://github.com/pytorch/pytorch/issues/68760. We fix it by: - Keeping the weird broadcasting logic it has as a CompositeImplicit kernel that calls into a second kernel - This second kernel is just a good-ol' pointwise kernel. - We implement the derivative for the pointwise kernel via TI as well for speed. - We implement the second derivative for the pointwise kernel and the forward AD derivatives compositionally This fixes a number of issues: - We don't perform copies any more when the inputs are not contiguous - The derivatives are now correct - We fix vmap and many other functorch-related issues. - CPU and CUDA now share the relevant broadcasting logic - The implementation is about 1/3 the length. Fixes https://github.com/pytorch/pytorch/issues/68760 Fixes https://github.com/pytorch/pytorch/issues/89895 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91238 Approved by: https://github.com/kshitij12345, https://github.com/jbschlosser, https://github.com/albanD	2022-12-30 10:42:30 +00:00
lezcano	5b223c43ec	Avoid calling allclose in the backward if there are tensor subclasses (#91444 ) `allclose` it's data-dependent (returns a bool) so it does not play well with functorch. We are skipping that check in the context of subclasses to avoid hard errors. Partially fixes https://github.com/pytorch/pytorch/issues/90499 Pull Request resolved: https://github.com/pytorch/pytorch/pull/91444 Approved by: https://github.com/albanD	2022-12-28 19:12:50 +00:00
Nikita Karetnikov	cc11edb084	[aot_autograd] symintify `logsumexp` (#91442 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/91442 Approved by: https://github.com/albanD	2022-12-28 18:06:26 +00:00
Nikita Vedeneev	3870a9e28d	to_sparse_XXX: backward support (#90281 ) As per title. Fixes https://github.com/pytorch/pytorch/issues/85226 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90281 Approved by: https://github.com/cpuhrsch, https://github.com/soulitzer	2022-12-14 09:05:17 +00:00
soulitzer	98a9235dce	Fix prelu ref when a.ndim < 2 (#89809 ) Fixes https://github.com/pytorch/pytorch/issues/89560 Previously the test case for "input is 1-D or scalar + weight is not scalar" did not exist; adding it introduced some failures: - forward AD (fixed in this PR) - vmap (filed https://github.com/pytorch/pytorch/issues/89895) - ref/meta (fixed this PR, though this also regresses nvFuser support) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89809 Approved by: https://github.com/ngimel	2022-12-12 23:55:31 +00:00
Aaron Gokaslan	7541c9f8be	[Fix]: remove unnecessary copies in aten, c10, and torch bindings (#90629 ) Applies various automated fixes that reduces the number of spurious copies in torch, aten, and c10. I also inlined any default dtors that would have made the type trivially destructible. Follow up to #89000 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90629 Approved by: https://github.com/ezyang	2022-12-12 17:05:52 +00:00
Richard Zou	4b1053497c	[vmap] Prepend "legacy" to files for old vmap implementation (#90324 ) We have an older torch.vmap implementation. It is no longer supported. It still needs to exist somewhere for the sake of BC with torch.autograd.functional. This PR makes it clear what files are meant for implementing the old vmap implementation. I've seen a couple of PRs recently adding support for the old vmap implementation, so this will lessen the confusion. Test Plan: - CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/90324 Approved by: https://github.com/samdow	2022-12-07 18:46:15 +00:00
Nikita Karetnikov	4cb6bbbe27	Symintify `embedding` (#89327 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89327 Approved by: https://github.com/ezyang	2022-11-24 03:25:00 +00:00
Andrew M. James	a41f70603a	Round out rad2deg sparse support (#88442 ) - Add sparse coo dispatch - Modify backward to work with sparse compressed layouts - Enable sparse_compressed autograd testing - Correct layout support attributes on OpInfo Pull Request resolved: https://github.com/pytorch/pytorch/pull/88442 Approved by: https://github.com/cpuhrsch	2022-11-17 06:00:23 +00:00
Kazuaki Ishizaki	e0c194f10b	Fix typos in messages under torch (#88961 ) This PR fixes typos of messages and parms in c++ source and head files under `torch` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88961 Approved by: https://github.com/albanD	2022-11-14 19:06:41 +00:00
Brian Hirsh	a16ced03c9	reland "fix as_strided_scatter_backward (#87646 )" (#88342 ) This reverts commit `71fb763e54`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88342 Approved by: https://github.com/zou3519	2022-11-07 15:00:58 +00:00
Andrew M. James	ff6770a9a1	enable backward for log1p (sparse layouts) (#88155 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88155 Approved by: https://github.com/cpuhrsch	2022-11-04 20:59:26 +00:00
Andrew M. James	6938dd0b2c	Support sparse inputs to deg2rad (#88156 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88156 Approved by: https://github.com/cpuhrsch	2022-11-04 20:59:26 +00:00
PyTorch MergeBot	71fb763e54	Revert "fix as_strided_scatter_backward (#87646 )" This reverts commit `f9d7985851`. Reverted https://github.com/pytorch/pytorch/pull/87646 on behalf of https://github.com/huydhn due to Sorry for reverting your PR but I think this one or one of the PR in the stack break bionic-cuda11.7 on trunk `70782981f0`	2022-11-02 16:54:36 +00:00
Brian Hirsh	f9d7985851	fix as_strided_scatter_backward (#87646 ) as_strided_scatter's derivative formula was broken - instead of making a "mask" of 1's and 0's, it would effectively make a mask of 1's and uninitialized memory. Fixes https://github.com/pytorch/pytorch/issues/88105 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87646 Approved by: https://github.com/albanD	2022-11-02 14:36:49 +00:00
albanD	8a9aca7b8d	Reland 2 Many symintifications (#87604 ) (#87980 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/87980 Approved by: https://github.com/ezyang	2022-10-28 13:40:11 +00:00
PyTorch MergeBot	8b4d95759c	Revert "Many symintifications (#87604 )" This reverts commit `777e6a2c51`. Reverted https://github.com/pytorch/pytorch/pull/87604 on behalf of https://github.com/weiwangmeta due to breaking internal builds	2022-10-28 03:00:11 +00:00
albanD	777e6a2c51	Many symintifications (#87604 ) Adds expand_inplace conv conv_double_backward convolution adaptive_avg_pool2d_symint _embedding_bag_backward_symint cudnn_grid_sampler cuda 32 bit indexing nll_loss / nll_loss_2d tensor split pooling same mode cudnn_is_acceptable storage nbytes Pull Request resolved: https://github.com/pytorch/pytorch/pull/87604 Approved by: https://github.com/ezyang	2022-10-26 17:33:53 +00:00
albanD	12b2f70a89	Symintify pad ops (#87046 ) Following comments below, we need to add support for `std::negate`/`std::min`/`std::max`/`operator-` for SymInt Pull Request resolved: https://github.com/pytorch/pytorch/pull/87046 Approved by: https://github.com/ezyang	2022-10-19 21:43:08 +00:00
Nikita Vedeneev	f2ec9fbd03	`torch.ormqr`: backward support (#86800 ) Seems good to have, especially when neither `a` nor `tau` requires grads and/or they are pretty small in number. Fixes https://github.com/pytorch/pytorch/issues/86267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/86800 Approved by: https://github.com/lezcano	2022-10-18 09:07:35 +00:00
albanD	3a4c0900c7	Reland 3 of Merge more symbolic meta kernels and symint changes from branch (#86795 ) Take 3 Contains: - symintification of split* - floor support on SymFloat - pad_backward, gather, scatter meta Pull Request resolved: https://github.com/pytorch/pytorch/pull/86795 Approved by: https://github.com/z-a-f	2022-10-17 02:09:40 +00:00
Brian Hirsh	34c86adec4	symintify all of derivatives.yaml (#86610 ) Big-bang PR to symintify all .sizes() calls in derivatives.yaml, which will be needed for symbolic tracing. * with the exception of `split()`, which is tougher to land because it requires internal changes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86610 Approved by: https://github.com/albanD	2022-10-14 20:15:48 +00:00
albanD	66cab5245f	Reland 2 min/max support for SymInt/Floats, finish as_strided/scatter/squeeze() backward symint support (#86797 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86797 Approved by: https://github.com/bdhirsh	2022-10-13 00:31:19 +00:00
PyTorch MergeBot	2aa981ab74	Revert "Reland 2 of Merge more symbolic meta kernels and symint changes from branch (#86334 ) (#86488 )" This reverts commit `978b46d7c9`. Reverted https://github.com/pytorch/pytorch/pull/86488 on behalf of https://github.com/osalpekar due to Broke executorch builds internally with the following message: RuntimeError: Missing out variant for functional op: aten::split.Tensor(Tensor(a -> *) self, SymInt split_size, int dim=0) -> Tensor(a)[] . Make sure you have loaded your custom_ops_generated_lib	2022-10-11 23:39:50 +00:00
PyTorch MergeBot	811b8e012b	Revert "min/max support for SymInt/Floats, finish as_strided/scatter/squeeze() backward symint support (#86643 )" This reverts commit `86f914e996`. Reverted https://github.com/pytorch/pytorch/pull/86643 on behalf of https://github.com/osalpekar due to Need to revert this to cleanly revert https://github.com/pytorch/pytorch/pull/86488. This should be safe to re-land later	2022-10-11 23:12:40 +00:00
albanD	86f914e996	min/max support for SymInt/Floats, finish as_strided/scatter/squeeze() backward symint support (#86643 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86643 Approved by: https://github.com/anjali411	2022-10-11 17:37:30 +00:00
albanD	978b46d7c9	Reland 2 of Merge more symbolic meta kernels and symint changes from branch (#86334 ) (#86488 ) symintify split_with_sizes, dropout, fused_fake_obs_quant. meta for padding_2d ops add meta_bernoulli_ meta kernel for at::gather get pytorch_struct to pass: meta for scatter_add, fix backward symintify split ops Pull Request resolved: https://github.com/pytorch/pytorch/pull/86488 Approved by: https://github.com/ezyang	2022-10-10 15:54:28 +00:00
anjali411	c89d286af6	symintify unbind_backward and tensor_split (#86357 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86357 Approved by: https://github.com/albanD	2022-10-09 16:25:55 +00:00
Edward Z. Yang	33f0e98a49	Re-land*4 "SymIntify cat and narrow" (#86468 ) This re-lands https://github.com/pytorch/pytorch/pull/86289 but with more wrappers. Contains implicit inclusion of <ATen/native/NonSymbolicBC.h> in internal usage. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86468 Approved by: https://github.com/albanD	2022-10-08 07:17:37 +00:00
PyTorch MergeBot	65b408074f	Revert "Relandx3 "SymIntify cat and narrow" (#86289 )" This reverts commit `a00f8489df`. Reverted https://github.com/pytorch/pytorch/pull/86289 on behalf of https://github.com/malfet due to @seemether unlanded the rest of the stack and it will fail intern import anyway	2022-10-07 16:29:27 +00:00
PyTorch MergeBot	75df4b5e3d	Revert "Merge more symbolic meta kernels and symint changes from branch (#86334 )" This reverts commit `08e3999fa4`. Reverted https://github.com/pytorch/pytorch/pull/86334 on behalf of https://github.com/seemethere due to Trying to revert https://github.com/pytorch/pytorch/pull/86207, this PR causes merge conflicts with the initial revert so will have to revert this as well	2022-10-07 16:03:30 +00:00
Edward Z. Yang	a00f8489df	Relandx3 "SymIntify cat and narrow" (#86289 ) This reverts commit `fc94a2115b`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/86289 Approved by: https://github.com/wconstab	2022-10-07 14:04:10 +00:00
PyTorch MergeBot	2110c89443	Revert "Revert "Revert "SymIntify cat and narrow (#86191 )"" (#86289 )" This reverts commit `e778fbf519`. Reverted https://github.com/pytorch/pytorch/pull/86289 on behalf of https://github.com/seemethere due to Fails internal tests see: https://www.internalfb.com/intern/sandcastle/job/27021598552487548/	2022-10-07 05:20:36 +00:00
Brian Hirsh	08e3999fa4	Merge more symbolic meta kernels and symint changes from branch (#86334 ) symintify split_with_sizes, dropout, fused_fake_obs_quant. meta for padding_2d ops add meta_bernoulli_ meta kernel for at::gather get pytorch_struct to pass: meta for scatter_add, fix backward symintify split ops Pull Request resolved: https://github.com/pytorch/pytorch/pull/86334 Approved by: https://github.com/ezyang	2022-10-06 23:29:04 +00:00
Pearu Peterson	8f2c2167d4	Support autograd on sparse_mm in full. (#86301 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86301 Approved by: https://github.com/cpuhrsch	2022-10-06 18:39:31 +00:00

1 2 3 4 5 ...

354 Commits