Commit Graph

147 Commits

Author SHA1 Message Date
Zhu, Haozhe
bd3c6e8e91 avoid large vector copy when query per_channel q_params (#31040)
Summary:
The quantizer uses std::vector to store per-channel scales and zero_points, but querying the scales (or zero_points) requires returning a tensor. This means a tensor is initialized from the std::vector on every query, which costs a lot of time. So I changed the quantizer to store per-channel scales and zero_points as tensors directly.
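A minimal sketch of the query path this commit speeds up; the per-channel quantization call and accessors below are used purely for illustration:

```python
import torch

# Per-channel quantization keeps one scale/zero_point per slice along `axis`;
# the accessors hand them back as tensors, so no std::vector needs to be copied.
x = torch.randn(4, 2)
scales = torch.tensor([0.1, 0.2, 0.3, 0.4])
zero_points = torch.zeros(4, dtype=torch.long)
q = torch.quantize_per_channel(x, scales, zero_points, axis=0, dtype=torch.qint8)

print(q.q_per_channel_scales())       # tensor of scales
print(q.q_per_channel_zero_points())  # tensor of zero_points
```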
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31040

Differential Revision: D19701070

Pulled By: jerryzh168

fbshipit-source-id: 9043f16c44b74dd8289b8474e540171765a7f92a
2020-02-19 16:24:24 -08:00
Brian Stark
17d4ef9e9e Support using scalar tensor for split (#32493)
Summary:
split requires an int input; however, when tracing, operators such as
size(axis) return a tensor, which is different behavior than when not
tracing. split therefore needs to be modified to handle these cases.

Fixes https://github.com/pytorch/pytorch/issues/27551
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32493

Reviewed By: hl475

Differential Revision: D19538254

Pulled By: houseroad

fbshipit-source-id: c8623009de5926aa38685e08121f4b48604bd8c0
2020-02-07 17:16:43 -08:00
Alban Desmaison
717274c001 Add useful warnings for t.grad when it won't be populated for known reasons (#30531)
Summary:
Fix https://github.com/pytorch/pytorch/issues/2362 and https://github.com/pytorch/pytorch/issues/19778

To avoid issues with frozen models, we only consider warning for Tensors that require gradients and are neither leaves nor retain gradients.
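A small sketch of the situation the warning targets (a non-leaf tensor whose .grad stays None):

```python
import torch

x = torch.randn(3, requires_grad=True)
y = x * 2                 # non-leaf that does not call retain_grad()
y.sum().backward()

print(x.grad)             # populated: x is a leaf
print(y.grad)             # None; accessing it now also emits a warning explaining why
# calling y.retain_grad() before backward() would make y.grad available
```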
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30531

Differential Revision: D18832767

Pulled By: albanD

fbshipit-source-id: 743e863dc14ab57713e66da78b2e4d759dfba0ff
2019-12-11 09:47:18 -08:00
Elias Ellison
f48a8901c5 Add floor_divide function (#30493)
Summary:
Adds `torch.floor_divide`, following NumPy's `floor_divide` API. I only implemented the out-of-place version; I can add the in-place version if requested.

Also fixes  https://github.com/pytorch/pytorch/issues/27512
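A quick sketch of the new out-of-place op:

```python
import torch

a = torch.tensor([4, 7, 9])
b = torch.tensor([2, 3, 4])
print(torch.floor_divide(a, b))   # tensor([2, 2, 2])
```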
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30493

Differential Revision: D18896211

Pulled By: eellison

fbshipit-source-id: ee401c96ab23a62fc114ed3bb9791b8ec150ecbd
2019-12-10 07:51:39 -08:00
Michael Suo
62b10721fb Actually make flake8 do something (#30892)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30892

Fixes all outstanding lints and actually installs a properly configured
flake8

Test Plan: Imported from OSS

Differential Revision: D18862825

Pulled By: suo

fbshipit-source-id: 08e9083338a7309272e17bb803feaa42e348aa85
2019-12-06 17:50:50 -08:00
Seiya Tokui
1d7b40f1c4 Fix reading __cuda_array_interface__ without strides (#24947)
Summary:
When converting a contiguous CuPy ndarray to a Tensor via `__cuda_array_interface__`, an error occurs due to incorrect handling of default strides. This PR fixes the problem and makes `torch.tensor(cupy_ndarray)` work for contiguous inputs.
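A minimal sketch of the fixed path, assuming CuPy and a CUDA device are available:

```python
import cupy
import torch

# Contiguous CuPy arrays may omit 'strides' from __cuda_array_interface__;
# constructing a tensor from them no longer errors.
a = cupy.arange(6, dtype=cupy.float32).reshape(2, 3)
t = torch.tensor(a, device='cuda')          # copies the data
t_view = torch.as_tensor(a, device='cuda')  # shares the memory when possible
```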
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24947

Differential Revision: D18838986

Pulled By: ezyang

fbshipit-source-id: 2d827578f54ea22836037fe9ea8735b99f2efb42
2019-12-06 07:36:27 -08:00
Igor Fedan
75309b45f3 explicitly provide memory format when calling to clone() at Indexing.cpp
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28660

Test Plan: Imported from OSS

Differential Revision: D18333346

Pulled By: ifedan

fbshipit-source-id: 06590205d883a5096388a4ae318389244130972d
2019-11-07 05:38:32 -08:00
Jerry Zhang
23193c155f Quantized Tensor support copy (#28612)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/28612

att

Test Plan:
python test/test_quantized_tensor.py

Imported from OSS

Differential Revision: D18255247

fbshipit-source-id: 814b12640fdf9d79b27482ee642ce430dbaeea68
2019-11-01 17:40:17 -07:00
Peter Bell
f33813d589 Return NotImplemented from all binary math ops (#27423)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/26333

Fixes the operators missed in https://github.com/pytorch/pytorch/issues/26507 and includes a test for all operators.
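A sketch of why returning NotImplemented matters: Python can then fall back to the other operand's reflected method (the Duck class below is purely illustrative):

```python
import torch

class Duck:
    def __radd__(self, other):
        return "duck-added"

# Tensor.__add__ returns NotImplemented for the unknown operand,
# so Python dispatches to Duck.__radd__ instead of raising a TypeError.
print(torch.tensor(1.0) + Duck())   # "duck-added"
```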
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27423

Differential Revision: D17835390

Pulled By: ezyang

fbshipit-source-id: 7a1351c7ccc8ad11454dbaa00d3701dcee4f06a8
2019-10-28 14:28:33 -07:00
Richard Zou
0fbbc7acb4 Allow align_to to take in partially named tensors (#27308)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27308

Currently, `tensor.align_to(*names)` has the restriction that the
`tensor` must be fully named. This doesn't need to be the case: when
using Ellipsis, we "expand the ellipsis to all unmentioned dimensions,
in the order in which they appear in the original tensor".

For example, consider `tensor: Tensor[None, None, C]`.

`tensor.align_to(C, None, None)` is ambiguous because the user might
have wanted to switch the order of the None dimensions and there is no
way to specify that using this API. However, `tensor.align_to('C', ...)`
isn't ambiguous: we can select the two unnamed dimensions in the order
in which they appear.

To actually implement this, we write a brand-new `align_to(names,
ellipsis_idx)` function in c++ that is separate from the regular
`align_to(names)` implementation. Ideally we would support "..." as a
special name in c++ and combine the two implementations; we'll need to
support "..." in c++ in the future but that requires a bit of extra work.
In this PR, Python processes the ellipsis and then calls the correct
overload.
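
A sketch of the new behavior on a partially named tensor:

```python
import torch

t = torch.randn(2, 3, 4, names=(None, None, 'C'))
# 'C' moves to the front; the unnamed dims follow in their original order.
print(t.align_to('C', ...).names)   # ('C', None, None)
```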

Test Plan: - run tests

Differential Revision: D17745179

Pulled By: zou3519

fbshipit-source-id: 9fed06d224215cfb7efecd8c002604baab3c45e6
2019-10-09 16:28:45 -07:00
zou3519
59b14a7620 Documentation for named tensors (#27173)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27173

`docs/source/named_tensor.rst` is the entry point; most users will land
either here or the named tensor tutorial when looking to use named
tensors. We should strive to make this as readable, concise, and understandable
as possible.

`docs/source/name_inference.rst` lists all of the name inference rules.
It should be clear but it's hard to make it concise.

Please let me know if anything doesn't make sense and please propose
alternative wordings and/or restructuring to improve the documentation.
This should ultimately get cherry-picked into the 1.3 branch as one
monolithic commit so it would be good to get all necessary changes made
in this PR and not have any follow ups.

Test Plan: - built and reviewed locally with `cd docs/ && make html`.

Differential Revision: D17763046

Pulled By: zou3519

fbshipit-source-id: c7872184fc4b189d405b18dad77cad6899ae1522
2019-10-08 22:22:30 -07:00
Pearu Peterson
b7fb2b8862 Implement pickle support for sparse tensors and torch.layout instances (#27062)
Summary:
Resolves issue https://github.com/pytorch/pytorch/issues/16667 and https://github.com/OpenMined/PySyft/issues/2326
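A minimal sketch of the new pickling support:

```python
import pickle
import torch

indices = torch.tensor([[0, 1], [1, 0]])
values = torch.tensor([3.0, 4.0])
s = torch.sparse_coo_tensor(indices, values, (2, 2))

roundtripped = pickle.loads(pickle.dumps(s))           # sparse tensors pickle now
print(roundtripped.to_dense())
print(pickle.loads(pickle.dumps(torch.sparse_coo)))    # so do torch.layout instances
```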
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27062

Differential Revision: D17762932

Pulled By: ezyang

fbshipit-source-id: dd99c1f4ac8eb2286eb55aa20ce973f60ce7b7e1
2019-10-04 08:09:32 -07:00
Zafar Takhirov
ef97841147 Show a warning that not all dir members of quantized work. (#27339)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27339

This PR just shows a warning message.
Eventually we will show a correct __dir__

Test Plan: Imported from OSS

Differential Revision: D17751333

Pulled By: zafartahirov

fbshipit-source-id: e9bc62fd8dd0147979291d0aac3f1afe5b8c7a9f
2019-10-03 20:48:04 -07:00
Ailing Zhang
0b79f77a4d Serialize XLA Tensor (#27041)
Summary:
https://github.com/pytorch/pytorch/issues/25882
Reopening both PRs to master/v1.3.0 to save myself from rebase hell...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27041

Differential Revision: D17676345

Pulled By: ailzhang

fbshipit-source-id: 4784c9f8f0723cd3ff081af3336e672812aaad14
2019-10-01 15:05:30 -07:00
Dmytro Dzhulgakov
b93823cb65 Per-channel quantized tensor to have only a single axis (#26675)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26675

Based on an offline poll, we're very unlikely to have multi-axis quantized tensors in the foreseeable future. Let's simplify the API and just return an int instead of a list. It also matches the singular `axis` name.
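A sketch of the simplified accessor:

```python
import torch

x = torch.randn(4, 2)
q = torch.quantize_per_channel(x, torch.tensor([0.1, 0.2, 0.3, 0.4]),
                               torch.zeros(4, dtype=torch.long),
                               axis=0, dtype=torch.qint8)
print(q.q_per_channel_axis())   # 0 -- a single int, not a list of axes
```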

Test Plan: Imported from OSS

Differential Revision: D17537052

Pulled By: dzhulgakov

fbshipit-source-id: 676abc3b251d288468aaed467b5e5ca4063b98b0
2019-09-23 22:29:01 -07:00
Dmytro Dzhulgakov
ebc2365fd3 Serialization for per channel qtensor (#26339)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26339

Serializes per-channel tensors in both torch.serialization and JIT. Since we haven't bound Quantizer properly yet, I chose to save a tuple representing the quantizer settings. To avoid recursive tensor serialization calls, I'm using a tuple instead of a tensor to store scales and zero points.

driazati - please check the serialization logic. Is there a good test that checks that JIT serialization and Python serialization are equivalent? (I haven't tested it yet)
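A sketch of the Python round trip this enables:

```python
import io
import torch

x = torch.randn(3, 2)
q = torch.quantize_per_channel(x, torch.tensor([0.1, 0.2, 0.3]),
                               torch.zeros(3, dtype=torch.long),
                               axis=0, dtype=torch.qint8)

buf = io.BytesIO()
torch.save(q, buf)                      # quantizer settings are saved as a tuple
buf.seek(0)
print(torch.load(buf).q_per_channel_scales())
```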

Test Plan: Imported from OSS

Differential Revision: D17443222

Pulled By: dzhulgakov

fbshipit-source-id: a34758de1ffd2ec1cdc5355f5baf95284a4ccf4b
2019-09-23 13:28:11 -07:00
Richard Zou
567a1981a7 Fix ellipsis behavior for Tensor.align_to to glob all missing dims (#26648)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26648

Previously:
- `Tensor.align_to(*names)` only works on fully named tensors. In addition, the
desired ordering `names` must not have any None-names.
- `Tensor.align_to(*names)` accepted `...`, but expanded it based on
position, i.e., in `tensor.align_to('N', ..., 'C', 'H')`, `...` expands
to `*tensor.names[1:-2]`. This is wildly incorrect: see the following
concrete example.

```
tensor = tensor.refine_names('N', 'C', 'H', 'W')
tensor.align_to('W', ...)  # ... expands to 'C', 'H', 'W'
```

This PR changes it so that `...` in `tensor.align_to` grabs all
unmentioned dimensions from `tensor`, in the order that they appear.
`align_to` is the only ellipsis-taking function that requires this
change. This is because all other functions (e.g. `refine_names`) require their
list of names to work in a positional manner, but `align_to` lets the
user reorder dimensions.

This does not add very much overhead to `align_to`, as shown in the
following benchmark. However, in the future, we should resolve to make
these operations faster; align_to should be as fast as view but isn't,
most likely due to Python overhead.

```
[ins] In [2]: import torch
         ...: named = torch.randn(3, 3, 3, 3, names=('N', 'C', 'H', 'W'))
         ...: unnamed = torch.randn(3, 3, 3, 3)
         ...: %timeit unnamed[:]
         ...: %timeit unnamed.view(-1)
         ...: %timeit named.align_to(...)
         ...: %timeit named.align_to('N', 'C', 'H', 'W')

31 µs ± 126 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
43.8 µs ± 146 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
69.6 µs ± 142 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
66.1 µs ± 1.13 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
```

Test Plan:
- new tests [namedtensor ci]


Differential Revision: D17528207

Pulled By: zou3519

fbshipit-source-id: 4efc70329f84058c245202d0b267d0bc5ce42069
2019-09-23 12:16:46 -07:00
vishwakftw
15b506068b Remove deprecated torch.gels (#26480)
Summary:
Changelog:
- Remove `torch.gels` which was deprecated in v1.2.0
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26480

Test Plan: - No tests were changed, and all callsites for `torch.gels` were modified to `torch.lstsq` when `torch.lstsq` was introduced

Differential Revision: D17527207

Pulled By: zou3519

fbshipit-source-id: 28e2fa3a3bf30eb6b9029bb5aab198c4d570a950
2019-09-23 07:15:39 -07:00
Richard Zou
4fada96218 Renames tensor.renamed -> rename, tensor.names_ -> rename_ (#26548)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26548

This makes the naming more consistent with PyTorch's API. The original
concern was that `tensor.rename` might make the operation seem like it
is in-place. However, we have many "verb" APIs: `tensor.add(other)`, for
example, doesn't add other to tensor in-place, but `tensor.add_(other)`
does.

`tensor.rename_` does exactly the same thing as `tensor.rename`, but
in-place.
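
A sketch of the two spellings:

```python
import torch

t = torch.randn(2, 3, names=('N', 'C'))
print(t.rename('batch', 'channel').names)   # out-of-place: ('batch', 'channel')
t.rename_('batch', 'channel')               # same rename, applied in place
print(t.names)                              # ('batch', 'channel')
```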

Test Plan: - [namedtensor ci]

Differential Revision: D17502021

Pulled By: zou3519

fbshipit-source-id: 6a5b93136a820075013cd1e30fb8fc6b9d77d7d9
2019-09-22 15:38:26 -07:00
Richard Zou
bae7528479 Change '*' to '...' and ... for named tensor API functions. (#26350)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26350

Python 3 lets us use `...` to perform indexing. Semantically, `...`
means "the rest of the unspecified dimensions". For example, while
indexing, one can do (for 5D `tensor`) `tensor[0, 0, ..., 0]` and
the `...` is expanded into `tensor[0, 0, :, :, 0]`.

Previously, we were using '*' to represent a similar behavior in names.
For example, `tensor.refine_names` supports things like the following:

```
x = torch.randn(2, 3, 4, 5, 6)
x_out = x.refine_names('*', 'H', 'W')  # refine only the last two dimensions
```

This PR changes it so that named tensor API functions recognize `'...'`
(in Python 2 and Python 3) and `...` (in Python 3 exclusively) instead
of `'*'`.
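
A sketch of the new spellings (the string form works on Python 2 as described above; the bare Ellipsis is Python 3 only):

```python
import torch

x = torch.randn(2, 3, 4, 5, 6)
print(x.refine_names(..., 'H', 'W').names)     # (None, None, None, 'H', 'W')
print(x.refine_names('...', 'H', 'W').names)   # same, using the string form
```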

Test Plan: - [namedtensor ci]

Differential Revision: D17424666

Pulled By: zou3519

fbshipit-source-id: 003182879fd38ced3fea051217572a457cdaf7cf
2019-09-18 05:47:13 -07:00
Richard Zou
277d442d18 Rename torch.namedtensor -> torch._namedtensor_internals (#26349)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/26349

The directory holds a lot of private helper functions that help
implement named tensor functionality. Instead of naming each helper
function with a leading underscore, I change the name of the import to
`_namedtensor_internals` to signal it should not be used directly.

Test Plan: - [namedtensor ci]

Differential Revision: D17424178

Pulled By: zou3519

fbshipit-source-id: 8f7b74346765759303480e581038a661021acf53
2019-09-18 05:47:09 -07:00
Richard Zou
0038111019 Implement named tensor unflatten(dim, namedshape). (#25658)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25658

This unflattens `dim` according to the shape specified in `namedshape`.
`namedshape` may be either an OrderedDict or an iterable of (name, size)
tuples.

Future:
- It is possible to make it take a dict in Python >= 3.6 because those are
ordered by default, but I'll leave that task for the future.
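
A sketch of the API described above, with illustrative names and sizes:

```python
import torch

t = torch.randn(2, 12, names=('N', 'features'))
out = t.unflatten('features', (('C', 3), ('H', 2), ('W', 2)))
print(out.names)   # ('N', 'C', 'H', 'W')
print(out.shape)   # torch.Size([2, 3, 2, 2])
```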

Test Plan: - new tests [namedtensor ci]

Differential Revision: D17192655

Pulled By: zou3519

fbshipit-source-id: fd9bd2f462c23a4df1c23d66f2aa95076ff1b160
2019-09-17 21:24:25 -07:00
Geovanni Zhang
e293c4ea73 Fix 'in' return true incorrectly (#24156)
Summary:
Because of 'return NotImplemented', __contains__ returns True when the element is not a number, since bool(NotImplemented) == True.
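A sketch of the behavior before and after the fix:

```python
import torch

t = torch.tensor([1, 2, 3])
print(2 in t)     # True: numeric membership works as expected
print(5 in t)     # False
# Before the fix, a non-numeric element such as "foo" also came back True,
# because eq() returned NotImplemented and bool(NotImplemented) is True;
# it is now rejected instead of silently succeeding.
```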
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24156

Differential Revision: D16829895

Pulled By: zou3519

fbshipit-source-id: 9d3d58025b2b78b33a26fdfcfa6029d0d049f11f
2019-09-13 09:27:58 -07:00
Richard Zou
5e2d25af34 Implement tensor.align_as(other), change tensor.align_to(names) (#25843)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25843

`tensor.align_to(*names)` permutes the dimensions of `tensor` and adds
additional 1-sized dimensions such that the output tensor has dimensions
in the same order as `names`. All dimensions of `tensor` must be
present in `names`; in addition, this function requires that all dims of
`tensor` be named.

`tensor.align_as(other)` is equivalent to
`tensor.align_to(*other.names)`.

I'm planning on changing `torch.align_tensors(*tensors)` to align closer
to these semantics because there didn't seem to be a clear use case for the old
semantics that preserve unnamed dimensions. That will come in a future
change.
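
A sketch of the two calls:

```python
import torch

t = torch.randn(3, 5, names=('C', 'N'))
other = torch.randn(2, 3, 4, 5, names=('N', 'C', 'H', 'W'))

aligned = t.align_as(other)         # same as t.align_to(*other.names)
print(aligned.names)                # ('N', 'C', 'H', 'W')
print(aligned.shape)                # torch.Size([5, 3, 1, 1]) -- 1-sized H, W added
```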

Test Plan: - new tests [namedtensor ci]

Differential Revision: D17255549

Pulled By: zou3519

fbshipit-source-id: 1e437ad81e9359b4d5bd0e7e64c3a1be441fc3e3
2019-09-12 22:53:44 -07:00
Richard Zou
e544f88590 Implement tensor.refine_names (#25842)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25842

`tensor.refine_names(*names)` takes `tensor` and attempts to name its
dimensions `names` out-of-place. If a dimension `i` already had a name,
then it cannot be changed (so tensor.names[i] must equal names[i]);
if the original dimension did not have a name, then the new name
(names[i]) can be anything.

`tensor.refine_names(*names)` also accepts a glob '*' that greedily selects
names from `tensor`. Here are some examples:

- `Tensor[None].refine_names('N') -> Tensor[N]`
- `Tensor[N].refine_names('N') -> Tensor[N]`
- `Tensor[N].refine_names('D') -> Error!`
- `Tensor[N].refine_names(None) -> Error!`
- `Tensor[None, None].refine_names('*', 'D') -> Tensor[None, D]`
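
A runnable sketch of the rules above (without the glob, which later became '...'):

```python
import torch

t = torch.randn(2, 3)                       # both dims unnamed
named = t.refine_names('N', 'C')            # out-of-place: t itself stays unnamed
print(named.names)                          # ('N', 'C')
print(named.refine_names('N', 'C').names)   # OK: existing names match
# named.refine_names('H', 'W')              # error: 'N' cannot be refined to 'H'
```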

Test Plan: - new tests [namedtensor ci]

Differential Revision: D17255548

Pulled By: zou3519

fbshipit-source-id: fdbdb3a12f24fbe37ce1e53ed09dc8a42589d928
2019-09-12 22:53:40 -07:00
Richard Zou
7970e5720b Rename tensor.view_names -> tensor.renamed (#25711)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25711

This function renames the dimensions of a tensor out-of-place. Because
of that, I think `tensor.renamed(...)` is a clearer name: `view_names`
has the connotation that we can use names to `view` our tensors with a
"different shape", but what this function really does is let us rename a
tensor no matter the previous names.

`tensor.names_`, the in-place version of this, is unchanged for now.
However, we might delete this or not advertise it if it has no use case
and also because its naming is a little inconsistent with `tensor.renamed`.

Test Plan: - [namedtensor ci]

Differential Revision: D17206515

Pulled By: zou3519

fbshipit-source-id: 67053951fcc8130c84566b5ebbdce35ef619c90d
2019-09-06 11:28:04 -07:00
Richard Zou
2513ca66ca Add guards for using named tensor with serialization and multiprocessing (#25345)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25345

Test Plan
- New tests [namedtensor ci]

Test Plan: Imported from OSS

Differential Revision: D17101486

Pulled By: zou3519

fbshipit-source-id: 58e803b042056ee6abab8551517f74078f2b81d5
2019-08-29 14:10:33 -07:00
Mads R. B. Kristensen
f583f2e657 Fixed test_numba_integration (#25017)
Summary:
The semantics of the _auto-convert GPU arrays that support the __cuda_array_interface__ protocol_ behavior have changed a bit.

It used to throw an exception when using `torch.as_tensor(..., device=D)` where `D` is a CUDA device not used in `__cuda_array_interface__`. Now, this is supported and results in an implicit copy.

I do not know what has changed, but `from_blob()` now supports the input and output devices being different.
I have updated the tests to reflect this, which fixes https://github.com/pytorch/pytorch/issues/24968
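A sketch of the new behavior, assuming CuPy and at least two CUDA devices:

```python
import cupy
import torch

with cupy.cuda.Device(0):
    a = cupy.arange(4, dtype=cupy.float32)   # lives on cuda:0

# Previously this raised because the requested device differed from the one
# advertised by __cuda_array_interface__; now it performs an implicit copy.
t = torch.as_tensor(a, device='cuda:1')
```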
Pull Request resolved: https://github.com/pytorch/pytorch/pull/25017

Differential Revision: D16986240

Pulled By: soumith

fbshipit-source-id: e6f7e2472365f924ca155ce006c8a9213f0743a7
2019-08-23 08:58:08 -07:00
Richard Zou
f996f8d61d Update tensor.view_names / tensor.names_ API (#23973)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23973

Without loss of generality, I describe the API for `tensor.view_names`.
`tensor.names_` has an analogous API.

`tensor.view_names(*names)` returns a view on tensor with named dims `names`.
`names` must be of length `tensor.dim()`; otherwise, if '*' is in `names`,
then it (known as the "glob") is expanded greedily to be equal to the
corresponding names from `tensor.names`.

For example,
```
>>> x = torch.empty(2, 3, 5, 7, names=('N', 'C', 'H', 'W'))
>>> x.view_names('*', 'height', 'width').names
('N', 'C', 'height', 'width')

>>> x.view_names('batch', '*', 'width').names
('batch', 'C', 'H', 'width')
```

tensor.view_names(**rename_map) returns a view on tensor that has
renamed dims as specified in the mapping `rename_map`.

For example,
```
>>> x = torch.empty(2, 3, 5, 7, names=('N', 'C', 'H', 'W'))
>>> x.view_names(W='width', H='height').names
('N', 'C', 'height', 'width')
```

These are different(!!!) from the C++ API, which only allows the
following:
- tensor.view_names(optional<DimnameList>)

C++ API parity for named tensors is not important right now; I am
punting that to the future.

Test Plan: - [namedtensor ci]

Differential Revision: D16710916

Pulled By: zou3519

fbshipit-source-id: 7cb8056c0fb4c97b04c3a2d1dd0f737e0a67ce34
2019-08-14 09:40:35 -07:00
Tongzhou Wang
af638ad5d7 pin_memory should not copy on already pinned tensors (#23484)
Summary:
fixes https://github.com/pytorch/pytorch/issues/21076
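A sketch of the fixed behavior, assuming a CUDA build (pinning requires one):

```python
import torch

t = torch.randn(3).pin_memory()
t2 = t.pin_memory()      # already pinned: no copy is made
print(t2 is t)           # True after this fix
```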
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23484

Differential Revision: D16546264

Pulled By: ezyang

fbshipit-source-id: 8058e0bbc6336751f36b884d71234feef498a982
2019-07-30 21:16:23 -07:00
vishwakftw
b3a9a7a9b9 Rename gels to lstsq (#23460)
Summary:
Changelog:
- Rename `gels` to `lstsq`
- Fix all callsites
- Rename all tests
- Create a tentative alias for `lstsq` under the name `gels` and add a deprecation warning to not promote usage.
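
A brief sketch using the API as of this rename (later releases supersede it with torch.linalg.lstsq):

```python
import torch

A = torch.randn(5, 3)
B = torch.randn(5, 2)

X, qr = torch.lstsq(B, A)    # new name
X_old, _ = torch.gels(B, A)  # tentative alias; emits a deprecation warning
```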
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23460

Test Plan: - All tests should pass to confirm that the patch is correct

Differential Revision: D16547834

Pulled By: colesbury

fbshipit-source-id: b3bdb8f4c5d14c7716c3d9528e40324cc544e496
2019-07-30 09:56:04 -07:00
vishwakftw
6dfecc7e01 Remove deprecated linear algebra functions (and methods) (#22841)
Summary:
Changelog:
- Removed the following linear algebra functions in PyTorch in favor of the renamed operations
  - `btrifact` (use `lu` instead)
  - `btrifact_with_info` (use `lu` with `get_infos=True` instead)
  - `btrisolve` (use `lu_solve` instead)
  - `btriunpack` (use `lu_unpack` instead)
  - `gesv` (use `solve` instead)
  - `pstrf` (use `cholesky` instead)
  - `potrf` (use `cholesky` instead)
  - `potri` (use `cholesky_inverse` instead)
  - `potrs` (use `cholesky_solve` instead)
  - `trtrs` (use `triangular_solve` instead)

- Removed dead code after the removal of `pstrf`
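
A sketch of migrating two of the removed names listed above to their replacements:

```python
import torch

A = torch.randn(3, 3)
b = torch.randn(3, 1)

LU, pivots = torch.lu(A)            # formerly btrifact
x = torch.lu_solve(b, LU, pivots)   # formerly btrisolve
```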
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22841

Test Plan:
- All existing tests should pass to verify that the removal is clean

Closes https://github.com/pytorch/pytorch/issues/22832

Differential Revision: D16346184

Pulled By: zou3519

fbshipit-source-id: f748d16ed7609c028de6adcbc28684d5a1af0678
2019-07-19 11:43:06 -07:00
Jerry Zhang
88921feafd change return type for q_scale and q_zero_point (#21709)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21709

Change the return type from Scalar to double/int64_t so we don't need to do a conversion when we call other quantization-related ATen functions.
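A sketch of the accessors after the change:

```python
import torch

q = torch.quantize_per_tensor(torch.randn(3), scale=0.5, zero_point=1,
                              dtype=torch.qint8)
print(q.q_scale())        # 0.5 -- a plain double, no Scalar conversion needed
print(q.q_zero_point())   # 1   -- a plain int64
```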

Differential Revision: D15793003

fbshipit-source-id: 510936c69fa17a4d67340a31ebb03415647feb04
2019-06-20 20:30:39 -07:00
Igor Fedan
abd6cffe55 Added some extra tests for std_mean and var_mean for multiple dims. (#20650)
Summary:
Added some extra tests for std_mean and var_mean for multiple dims.
Some refactoring of previously created tests based on PR comments: https://github.com/pytorch/pytorch/pull/18731
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20650

Differential Revision: D15396101

Pulled By: ifedan

fbshipit-source-id: d15c3c2c7084a24d6cfea4018173552fcc9c03a9
2019-06-18 20:36:32 -07:00
Jerry Zhang
277bf69fa0 Add torch.load/torch.save for QTensor (#20830)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20830

att

Reviewed By: dzhulgakov

Differential Revision: D15340701

fbshipit-source-id: 677038c8101f66dec4856c2eccf9f9e394012226
2019-05-30 20:52:19 -07:00
Igor Fedan
4c23c34e79 Computing var/stddev and mean at the same time (#18731)
Summary:
The current variance kernels compute mean at the same time. Many times we want both statistics together, so it seems reasonable to have a kwarg/function that allows us to get both values without launching an extra kernel.
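A sketch of the combined statistics:

```python
import torch

x = torch.randn(2, 3, 4)
std, mean = torch.std_mean(x, dim=(0, 2))   # both values from a single pass
var, mean = torch.var_mean(x, dim=(0, 2))
```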
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18731

Differential Revision: D14726082

Pulled By: ifedan

fbshipit-source-id: 473cba0227b69eb2240dca5e61a8f4366df0e029
2019-05-15 16:42:38 -07:00
vishwakftw
c30224ad21 Rename potri to cholesky_inverse (#19498)
Summary:
Changelog:
- Rename `potri` to `cholesky_inverse` to remain consistent with names of `cholesky` methods (`cholesky`, `cholesky_solve`)
- Fix all callsites
- Rename all tests
- Create a tentative alias for `cholesky_inverse` under the name `potri` and add a deprecation warning to not promote usage
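
A sketch of the renamed op alongside its `cholesky` siblings:

```python
import torch

A = torch.randn(3, 3)
A = A @ A.t() + 3 * torch.eye(3)      # make it positive definite
L = torch.cholesky(A)
Ainv = torch.cholesky_inverse(L)      # formerly potri; the old name now warns
print(torch.allclose(Ainv, A.inverse(), atol=1e-5))
```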
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19498

Differential Revision: D15029901

Pulled By: ezyang

fbshipit-source-id: 2074286dc93d8744cdc9a45d54644fe57df3a57a
2019-04-22 08:18:39 -07:00
Xiang Gao
df67969e6b Step 3: Add support for return_counts to torch.unique for dim not None (#18650)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18650
ghimport-source-id: 75759c95e6c48e27c172b919097dbc40c6bfb5e6

Differential Revision: D14892319

Pulled By: VitalyFedyunin

fbshipit-source-id: ec5d1b80fc879d273ac5a534434fd648468dda1e
2019-04-16 14:06:45 -07:00
Xiang Gao
ea2405c7dc Add torch.unique_consecutive (#19060)
Summary:
Fixes: https://github.com/pytorch/pytorch/issues/19045

Please review: VitalyFedyunin ngimel

This is independent of the #18649 series. It will cause merge conflicts in the #18649 series, but please merge this first, and I will resolve the merge conflicts there.

The new feature is exposed in `_unique2_temporary_will_remove_soon` and `_unique_dim2_temporary_will_remove_soon`, but not through `torch.unique` yet. I will take care of the API after the #18649 series gets merged completely.
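
The feature later surfaced as `torch.unique_consecutive`; a sketch of that public API:

```python
import torch

x = torch.tensor([1, 1, 2, 2, 3, 1, 1, 2])
values, counts = torch.unique_consecutive(x, return_counts=True)
print(values)   # tensor([1, 2, 3, 1, 2])
print(counts)   # tensor([2, 2, 1, 2, 1])
```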

Benchmark on a tensor of shape `torch.Size([15320, 2])`:

```python
print(torch.__version__)
print()
a = tensor.sort().values.to('cpu')
print('cpu, sorted_input=False:')
%timeit torch._unique2_temporary_will_remove_soon(a)
%timeit torch._unique2_temporary_will_remove_soon(a, return_inverse=True)
%timeit torch._unique2_temporary_will_remove_soon(a, return_counts=True)
%timeit torch._unique2_temporary_will_remove_soon(a, return_inverse=True, return_counts=True)
print()
print('cpu, sorted_input=True:')
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True)
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_inverse=True)
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_counts=True)
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_inverse=True, return_counts=True)
print()
a = a.to('cuda')
print('cuda, sorted_input=False:')
%timeit torch._unique2_temporary_will_remove_soon(a); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, return_inverse=True); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, return_counts=True); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, return_inverse=True, return_counts=True); torch.cuda.synchronize()
print()
print('cuda, sorted_input=True:')
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_inverse=True); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_counts=True); torch.cuda.synchronize()
%timeit torch._unique2_temporary_will_remove_soon(a, sorted_input=True, return_inverse=True, return_counts=True); torch.cuda.synchronize()
```

```
1.1.0a0+2addccc

cpu, sorted_input=False:
340 µs ± 5.88 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
717 µs ± 14.9 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
52.3 ms ± 2.75 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
52.3 ms ± 1.79 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

cpu, sorted_input=True:
32.8 µs ± 285 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
49.9 µs ± 557 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
51.6 µs ± 1.08 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
78 µs ± 782 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

cuda, sorted_input=False:
213 µs ± 1.52 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
291 µs ± 3.81 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
250 µs ± 1.05 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
321 µs ± 1.59 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

cuda, sorted_input=True:
45.6 µs ± 2.13 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
110 µs ± 2.47 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
82 µs ± 857 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
143 µs ± 409 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
```

```python
print(torch.__version__)
print()
a1, a2 = tensor.unbind(1)
indices = (a1 * tensor.max() + a2).sort().indices
a = tensor.index_select(0, indices).to('cpu')
print('cpu, sorted_input=False:')
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_inverse=True)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_counts=True)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_inverse=True, return_counts=True)
print()
print('cpu, sorted_input=True:')
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_inverse=True)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_counts=True)
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_inverse=True, return_counts=True)
print()
a = a.to('cuda')
print('cuda, sorted_input=False:')
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_inverse=True); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_counts=True); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, return_inverse=True, return_counts=True); torch.cuda.synchronize()
print()
print('cuda, sorted_input=True:')
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_inverse=True); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_counts=True); torch.cuda.synchronize()
%timeit torch._unique_dim2_temporary_will_remove_soon(a, dim=0, sorted_input=True, return_inverse=True, return_counts=True); torch.cuda.synchronize()
```

```
cpu, sorted_input=False:
55.4 ms ± 1.12 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
55.8 ms ± 616 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
55.2 ms ± 402 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
55.1 ms ± 725 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

cpu, sorted_input=True:
54.7 ms ± 585 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
55.2 ms ± 1.23 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
54.5 ms ± 865 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
54.9 ms ± 577 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

cuda, sorted_input=False:
171 µs ± 783 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
220 µs ± 1.65 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
203 µs ± 2.95 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
251 µs ± 2.83 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

cuda, sorted_input=True:
59.6 µs ± 757 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
113 µs ± 431 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)
93.2 µs ± 2.13 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
147 µs ± 2.81 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
```
The CPU implementation of `unique_dim` is super slow, see https://github.com/pytorch/pytorch/issues/18987, but this PR will not worry about this issue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/19060

Differential Revision: D14866909

Pulled By: ezyang

fbshipit-source-id: d20012cec68c37b05cf770a6f4d6524f910b950f
2019-04-10 07:36:08 -07:00
Vishwak Srinivasan
487388d8ad Rename btrisolve to lu_solve (#18726)
Summary:
Changelog:
- Rename `btrisolve` to `lu_solve` to remain consistent with names of solve methods (`cholesky_solve`, `triangular_solve`, `solve`)
- Fix all callsites
- Rename all tests
- Create a tentative alias for `lu_solve` under the name `btrisolve` and add a deprecation warning to not promote usage
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18726

Differential Revision: D14726237

Pulled By: zou3519

fbshipit-source-id: bf25f6c79062183a4153015e0ec7ebab2c8b986b
2019-04-09 15:21:24 -07:00
Vishwak Srinivasan
d859031ebf Rename btrifact* to lu (#18435)
Summary:
Changelog:

- Renames `btrifact` and `btrifact_with_info` to `lu` to remain consistent with other factorization methods (`qr` and `svd`).
- Now, we will only have one function and method named `lu`, which performs the LU decomposition. This function takes a get_infos kwarg which, when set to True, includes an infos tensor in the tuple.
- Rename all tests, fix callsites
- Create a tentative alias for `lu` under the name `btrifact` and `btrifact_with_info`, and add a deprecation warning to not promote usage.
- Add the single batch version for `lu` so that users don't have to unsqueeze and squeeze for a single square matrix (see changes in determinant computation in `LinearAlgebra.cpp`)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18435

Differential Revision: D14680352

Pulled By: soumith

fbshipit-source-id: af58dfc11fa53d9e8e0318c720beaf5502978cd8
2019-03-29 00:34:30 -07:00
Soumith Chintala
66628f78b7 Revert D14605905: [pytorch][PR] Add return_counts to torch.unique
Differential Revision:
D14605905

Original commit changeset: 555f5a12a8e2

fbshipit-source-id: c7874f5987893e956c022180a37763d88bba38db
2019-03-26 17:18:01 -07:00
Xiang Gao
e2730ddb21 Add return_counts to torch.unique (#18391)
Summary:
Fixes: https://github.com/pytorch/pytorch/issues/12598

This PR was originally authored by ptrblck at https://github.com/pytorch/pytorch/pull/15495, but since there was no update for months after the requested changes, I cloned that branch and resolved the code reviews here. Hope everything is good now. In particular, the implementation of count is changed from ptrblck's original algorithm to the one ngimel suggested, i.e. using `unique_by_key` and `adjacent_difference`.

The current implementation of `_unique_dim` is VERY slow for computing the inverse index and counts, see https://github.com/pytorch/pytorch/issues/18405. I will refactor `_unique_dim` in a later PR. For this PR, please allow me to keep the implementation as is.
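
A sketch of the new kwarg:

```python
import torch

x = torch.tensor([3, 1, 2, 3, 1, 1])
values, counts = torch.unique(x, return_counts=True)
print(values)   # tensor([1, 2, 3])
print(counts)   # tensor([3, 1, 2])
```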

cc: ptrblck ezyang ngimel colesbury
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18391

Reviewed By: soumith

Differential Revision: D14605905

Pulled By: VitalyFedyunin

fbshipit-source-id: 555f5a12a8e28c38b10dfccf1b6bb16c030bfdce
2019-03-25 20:38:17 -07:00
vishwakftw
291746f110 Rename trtrs to triangular_solve (#18213)
Summary:
Changelog:
- Renames `trtrs` to `triangular_solve` to remain consistent with `cholesky_solve` and `solve`.
- Rename all tests, fix callsites
- Create a tentative alias for `triangular_solve` under the name `trtrs`, and add a deprecation warning to not promote usage.
- Move `isnan` to _torch_docs.py
- Remove unnecessary imports
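
A sketch of the renamed op:

```python
import torch

A = torch.randn(3, 3).triu()          # upper-triangular coefficients
b = torch.randn(3, 2)
x, _ = torch.triangular_solve(b, A)   # formerly trtrs; the old name now warns
```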
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18213

Differential Revision: D14566902

Pulled By: ezyang

fbshipit-source-id: 544f57c29477df391bacd5de700bed1add456d3f
2019-03-21 14:27:21 -07:00
Gao, Xiang
7e6220393f Cleanup arg{min, max} (#17103)
Summary:
Why do we need this workaround? `PythonArgParser` handles these two cases well.

The discussion started at https://github.com/pytorch/pytorch/pull/6201#issuecomment-378724406. The conclusion at that time by goldsborough was:

> Because we wanted to allow `dim=None` in Python and route to a different function. Essentially the problem was wanting to wrap the C++ function in Python. AFAIK there is no way of translating `dim=None` behavior into C++? So Richard and I came up with this strategy

Maybe at that time `PythonArgParser` was not powerful enough to handle the routing of two functions with the same name but different C++ signatures.

Will keep an eye on the CI.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17103

Differential Revision: D14523503

Pulled By: VitalyFedyunin

fbshipit-source-id: cae3e2678062da2eccd93b51d4050578c7a9ab80
2019-03-20 16:28:27 -07:00
Vishwak Srinivasan
421b508d55 Rename gesv to solve (#18060)
Summary:
Changelog:

- Renames `gesv` to `solve` to remain consistent with `cholesky_solve`.
- Rename all tests, fix callsites
- Create a tentative alias for `solve` under the name `gesv`, and add a deprecation warning to not promote usage.
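
A brief sketch using the API as of this rename (later releases supersede it with torch.linalg.solve):

```python
import torch

A = torch.randn(3, 3)
b = torch.randn(3, 1)
x, LU = torch.solve(b, A)   # formerly gesv; the old name is a deprecated alias
```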
Pull Request resolved: https://github.com/pytorch/pytorch/pull/18060

Differential Revision: D14503117

Pulled By: zou3519

fbshipit-source-id: 99c16d94e5970a19d7584b5915f051c030d49ff5
2019-03-18 16:04:24 -07:00
Vishwak Srinivasan
3f1d0ee5d5 Deprecate torch.pstrf (#17866)
Summary:
Changelog:
- Add deprecation warning to torch.pstrf
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17866

Differential Revision: D14405527

Pulled By: soumith

fbshipit-source-id: 73f3b7d61c60eb57e4bffd08112e552ae3e6dfdc
2019-03-11 12:27:52 -07:00
bhushan
b57fe3cc66 Introducing array-like sequence methods __contains__ (#17733)
Summary:
for tensor

Fixes: #17000
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17733

Differential Revision: D14401952

Pulled By: soumith

fbshipit-source-id: c841b128c5a1fceda1094323ed4ef1d0cf494909
2019-03-11 09:00:16 -07:00
Gao, Xiang
722cbe3064 Move argsort to C++
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17099

Differential Revision: D14165671

Pulled By: ezyang

fbshipit-source-id: 3871de6874fe09871ebd9b8943c13c9af325bf33
2019-02-21 07:59:27 -08:00
Xiang Gao
4fcab92d6c Move outplace ops to ATen (#16788)
Summary:
Based on https://github.com/pytorch/pytorch/pull/12413, with the following additional changes:

-  Inside `native_functions.yml`, move those outplace operators right next to their corresponding inplace operators, for convenience of checking whether they match when reviewing
- `matches_jit_signature: True` for them
- Add missing `scatter` with Scalar source
- Add missing `masked_fill` and `index_fill` with Tensor source.
- Add missing test for `scatter` with Scalar source
- Add missing test for `masked_fill` and `index_fill` with Tensor source by checking the gradient w.r.t source
- Add missing docs to `tensor.rst`
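
A sketch of the out-of-place variants added here (scalar and tensor sources as listed above):

```python
import torch

x = torch.zeros(3, 4)
mask = torch.zeros(3, 4, dtype=torch.bool)
mask[:, 0] = True

y = x.masked_fill(mask, 1.0)                          # out-of-place masked_fill
z = x.index_fill(1, torch.tensor([0]), 2.0)           # out-of-place index_fill
s = x.scatter(1, torch.tensor([[0], [1], [2]]), 7.0)  # scatter with a Scalar source
```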

Differential Revision: D14069925

Pulled By: ezyang

fbshipit-source-id: bb3f0cb51cf6b756788dc4955667fead6e8796e5
2019-02-15 15:58:10 -08:00