pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Alban Desmaison	056b147e10	clean torch_function handling in serialization (#62744 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62744 The `Tensor._reduce_ex_internal` function can only be called via the `Tensor.__reduce_ex__` function. And that second function already properly handles the `__torch_function__` overwrites. So no need to handle them again in `Tensor._reduce_ex_internal`. This PR also updates `Tensor.__reduce_ex__` to use the specialized unary API for `__torch_function__` that makes it nicer to read. Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D30113113 Pulled By: albanD fbshipit-source-id: c94f5d2597ee3afe799d9de991f75615c3c172d6	2021-08-05 06:48:26 -07:00
Edward Yang	cf1f59452b	Hacky support for meta tensor serialization. (#62192 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62192 This support is hacky because it doesn't preserve meta tensor storage sharing (e.g., if you serialize a model with shared storage, e.g., a tensor and a view on a tensor, when I deserialize the viewing relationship will be broken and these are just different tensors.) The hack is also durable, in the sense that we will be on the hook for supporting `_rebuild_meta_tensor_no_storage` in perpetuity in the future, even if we change our mind about the serialization format. This unblocks an FB production use case. I didn't add C++ support to minimize blast area of this patch. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D29910535 Pulled By: ezyang fbshipit-source-id: d98dcdd0108dfc3ae730a071d3c583b6d0281d21	2021-07-26 14:33:45 -07:00
rusty1s	457a0b63bf	use `torch.bucketize` in`to_sparse_csr` implementation (+ additional tests) (#61340 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/57381 Pull Request resolved: https://github.com/pytorch/pytorch/pull/61340 Reviewed By: bhosmer Differential Revision: D29601393 Pulled By: cpuhrsch fbshipit-source-id: 4ca1f013d96e8716f0e658e0cd685d9aa0d98a5c	2021-07-20 15:44:25 -07:00
Anjali Chourdia	30e48bbeae	Add neg bit (#56058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56058 User facing changes: 1. Adds a negative bit and corresponding new API (`is_neg()`,`resolve_neg()`) 2. `tensor.conj().imag` now returns a floating point tensor with neg bit set to 1 instead of a tensor with no notion of negative bit. Note that imag is still a view and all the view properties still hold for imag. Non user facing changes: 1. Added a new Negative dispatch key and a backend fallback to handle it 2. Updated copy kernel to handle negative bit 3. Merged conjugate and negative bit fallback kernel 4. fixed https://github.com/pytorch/pytorch/issues/60478 (caused due to https://github.com/pytorch/pytorch/pull/54987) Testing: 1. Added a new OpInfo based test `test_neg_view` (verifies that out-of-place and in-place operations work correctly for all operations when the input is a neg view tensor by checking the result against an actually negated tensor, verifies that autograd returns the same output for both neg view and actually negated tensors as well as it works fine when grad_out is a neg view). 2. Added a new test class containing `test_conj_view`, `test_neg_view`. Test Plan: Imported from OSS Reviewed By: soulitzer Differential Revision: D29636403 fbshipit-source-id: 12214c9dc4806c51850f4a72a109db9527c0ca63	2021-07-13 13:50:42 -07:00
Xiong Wei	7e3a694b23	supports non-leaf inputs for autograd.backward() function (#60521 ) Summary: Close https://github.com/pytorch/pytorch/issues/60268 Pull Request resolved: https://github.com/pytorch/pytorch/pull/60521 Reviewed By: ngimel Differential Revision: D29393586 Pulled By: albanD fbshipit-source-id: 2dd2de427ecfecca8d544237bacf690e0b7c918c	2021-06-25 18:57:26 -07:00
Edward Yang	aacc722aec	Dispatch to Python via __torch_dispatch__ (#59760 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59760 See https://github.com/pytorch/pytorch/issues/59049 There are some moving parts to this PR, I'll structure this explanation so the straightforward parts go first, and then the less straightforward parts. The actual dispatch to Python. The core logic of dispatch to Python lives in `concrete_dispatch_fn` in `torch/csrc/autograd/python_variable.cpp`. It takes the input IValue stack, scans all the arguments for Tensor arguments, and defers most of the heavy lifting to `handle_torch_function_no_python_arg_parser` which actually does all of the logic for calling out to torch dispatch (in particular, this function handles multiple dispatch situations for you). Because we have a different function name than regular `__torch_function__` handling, `handle_torch_function_no_python_arg_parser` is generalized to accept a magic method name to look for when testing if Tensors have custom handling or not. Unlike `__torch_function__`, by default there is no `__torch_dispatch__` on Tensor classes. Maintaining the Python dispatch key. In order to get to the dispatch to Python logic, we must tag Tensors with the `__torch_dispatch__` magic method with the newly added Python dispatch key (separated from PythonFuncTorch to allow for a transitional period while they migrate to this mechanism). We expose a new private property `_is_python_dispatch` that assists in debugging if a Tensor is participating in Python dispatch or not. We apply the Python dispatch key the first time a PyObject for a Tensor is constructed (THPVariable_NewWithVar), testing if `__torch_dispatch__` exists with then newly added `check_has_torch_dispatch`. Shallow copy and detach. For the simple examples tested in this PR, most creations of Tensor route through the dispatcher. The exception to this is `shallow_copy_and_detach`, which bypasses the dispatcher and is used when saving tensors for backwards. When a Tensor is Python dispatch, we override the behavior of `shallow_copy_and_detach` to instead directly call into `__torch_dispatch__` to perform a `detach` operation (in the same way it would be invoked if you called `detach` directly). Because this Python call is triggered directly from c10::TensorImpl, it must be indirected through `PyInterpreter::detach`, which is the general mechanism for dynamic dispatching to the Python interpreter associated with a TensorImpl. torchdeploy compatibility. The dispatch to Python logic cannot be directly registered to the dispatcher as it is compiled in the Python library, which will get loaded multiple times per torchdeploy interpreter. Thus, we must employ a two phase process. First, we register a fallback inside a non-Python library (aten/src/ATen/core/PythonFallbackKernel.cpp). Its job is to determine the appropriate PyInterpreter to handle the Python dispatch by going through all of the arguments and finding the first argument that has a PyObject/PyInterpreter. With this PyInterpreter, it makes another dynamic dispatch via "dispatch" which will go to the correct torchdeploy interpreter to handle dispatching to actual Python. Testing. We provide a simple example of a LoggingTensor for testing, which can be used to generate TorchScript-like traces to observe what operations are being called when a Tensor is invoked. Although a LoggingTensor would be better implemented via an is-a relationship rather than a has-a relationship (as is done in the test), we've done it this way to show that arbitrarily complex compositions of tensors inside a tensor work properly. Known limitations. * We haven't adjusted any operator code, so some patterns may not work (as they lose the Python subclass in an unrecoverable way) * `__torch_function__` must be explicitly disabled with `_disabled_torch_function_impl` otherwise things don't work quite correctly (in particular, what is being disabled is default subclass preservation behavior.) * We don't ever populate kwargs, even when an argument is kwarg-only Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D29017912 D29017912 Test Plan: Imported from OSS Reviewed By: bdhirsh Pulled By: ezyang fbshipit-source-id: a67714d9e541d09203a8cfc85345b8967db86238	2021-06-25 11:50:32 -07:00
Akifumi Imanishi	26cdec6ce4	Support `torch.bitwise_{left/right}_shift` and `__rlshift__`, `__rrshift__` (#59544 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/58121 This PR implements `torch.bitwise_left_shift` and `torch.bitwise_right_shift` and `torch.Tensor.{__rlshift__/__rrshift__}`for compatibility with Python array API standard. (cc: mruberry, rgommers, emcastillo, kmaehashi) Pull Request resolved: https://github.com/pytorch/pytorch/pull/59544 Reviewed By: ngimel Differential Revision: D29348869 Pulled By: mruberry fbshipit-source-id: 329aee296cf890735e8a9f858bccfe87c03d06ca	2021-06-23 23:57:16 -07:00
Edward Yang	82c52fd417	Do not wrap Tensor.{grad,_base} by default (#60464 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60464 Fixes https://github.com/szagoruyko/pytorchviz/issues/65 An alternate implementation of this PR would be to remove the __torch_function__ interposition points for these accessors entirely. In the end, I decided to opt for extra expressivity. See torch.overrides for the criterion on how I decided which accessors should get the nowrap treatment. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D29302835 Pulled By: ezyang fbshipit-source-id: fbe0ac4530a6cc9d6759a3fdf5514d4d7b1f7690	2021-06-22 12:49:23 -07:00
Jeffrey Wan	f52e202840	Add warning when accessing Tensor::grad() in the C++ API (#59362 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/35379 - Adds `retains_grad` attribute backed by cpp as a native function. The python bindings for the function are skipped to be consistent with `is_leaf`. - Tried writing it without native function, but the jit test `test_tensor_properties` seems to require that it be a native function (or alternatively maybe it could also work if we manually add a prim implementation?). - Python API now uses `retain_grad` implementation from cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/59362 Reviewed By: jbschlosser Differential Revision: D28969298 Pulled By: soulitzer fbshipit-source-id: 335f2be50b9fb870cd35dc72f7dadd6c8666cc02	2021-06-08 19:43:21 -07:00
Akifumi Imanishi	0a5bfa9919	Support `__rmod__` (#58476 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/58035. This PR implements `torch.Tensor.__rmod__` and `torch.remainder(scalar, tensor)` for the compatibility with NumPy’s interface. (cc: mruberry, rgommers, emcastillo, kmaehashi) TODO: - [x] Update `tensor_binary_op` in test/test_binary_ufuncs.py after https://github.com/pytorch/pytorch/issues/58216 is merged. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58476 Reviewed By: ngimel Differential Revision: D28776810 Pulled By: mruberry fbshipit-source-id: 74f8aea80f439ef2cc370333524e39971eeb7bf4	2021-06-05 16:19:24 -07:00
anjali411	3607478ecd	Conjugate View (#54987 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54987 Based off of ezyang (https://github.com/pytorch/pytorch/pull/44799) and bdhirsh (https://github.com/pytorch/pytorch/pull/43702) 's prototype: Here's a summary of the changes in this PR: This PR adds a new dispatch key called Conjugate. This enables us to make conjugate operation a view and leverage the specialized library functions that fast path with the hermitian operation (conj + transpose). 1. Conjugate operation will now return a view with conj bit (1) for complex tensors and returns self for non-complex tensors as before. This also means `torch.view_as_real` will no longer be a view on conjugated complex tensors and is hence disabled. To fill the gap, we have added `torch.view_as_real_physical` which would return the real tensor agnostic of the conjugate bit on the input complex tensor. The information about conjugation on the old tensor can be obtained by calling `.is_conj()` on the new tensor. 2. NEW API: a) `.conj()` -- now returning a view. b) `.conj_physical()` -- does the physical conjugate operation. If the conj bit for input was set, you'd get `self.clone()`, else you'll get a new tensor with conjugated value in its memory. c) `.conj_physical_()`, and `out=` variant d) `.resolve_conj()` -- materializes the conjugation. returns self if the conj bit is unset, else returns a new tensor with conjugated values and conj bit set to 0. e) `.resolve_conj_()` in-place version of (d) f) `view_as_real_physical` -- as described in (1), it's functionally same as `view_as_real`, just that it doesn't error out on conjugated tensors. g) `view_as_real` -- existing function, but now errors out on conjugated tensors. 3. Conjugate Fallback a) Vast majority of PyTorch functions would currently use this fallback when they are called on a conjugated tensor. b) This fallback is well equipped to handle the following cases: - functional operation e.g., `torch.sin(input)` - Mutable inputs and in-place operations e.g., `tensor.add_(2)` - out-of-place operation e.g., `torch.sin(input, out=out)` - Tensorlist input args - NOTE: Meta tensors don't work with conjugate fallback. 4. Autograd a) `resolve_conj()` is an identity function w.r.t. autograd b) Everything else works as expected. 5. Testing: a) All method_tests run with conjugate view tensors. b) OpInfo tests that run with conjugate views - test_variant_consistency_eager/jit - gradcheck, gradgradcheck - test_conj_views (that only run for `torch.cfloat` dtype) NOTE: functions like `empty_like`, `zero_like`, `randn_like`, `clone` don't propagate the conjugate bit. Follow up work: 1. conjugate view RFC 2. Add neg bit to re-enable view operation on conjugated tensors 3. Update linalg functions to call into specialized functions that fast path with the hermitian operation. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D28227315 Pulled By: anjali411 fbshipit-source-id: acab9402b9d6a970c6d512809b627a290c8def5f	2021-06-04 14:12:41 -07:00
Alexander	b435a27fb7	CUDA support in the CSR layout: constructors (#59010 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59010 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D28719287 Pulled By: bhosmer fbshipit-source-id: fbb5784ccb5ce19dcca1f2f95c4ee16f9b7680c4	2021-05-26 16:39:43 -07:00
Alban Desmaison	032d6b0643	Revert D28112689: CUDA support in the CSR layout: constructors Test Plan: revert-hammer Differential Revision: D28112689 (`1416e57465`) Original commit changeset: f825cd4bce40 fbshipit-source-id: 421fc590797ac5fab6a55ac6f213361fbba7cd5b	2021-05-26 06:15:05 -07:00
Alexander	1416e57465	CUDA support in the CSR layout: constructors (#57274 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57274 Test Plan: Imported from OSS Reviewed By: astaff Differential Revision: D28112689 Pulled By: bhosmer fbshipit-source-id: f825cd4bce402dd4c3f71db88854f77830b687b8	2021-05-26 01:36:20 -07:00
Akifumi Imanishi	3113a1de4a	Fix some tensor operators to return `NotImplemented` for invalid inputs (#58216 ) Summary: Same as https://github.com/pytorch/pytorch/issues/57934. (cc/ albanD) Pull Request resolved: https://github.com/pytorch/pytorch/pull/58216 Reviewed By: ailzhang Differential Revision: D28494886 Pulled By: albanD fbshipit-source-id: 380205867ee1cde90e1c6fcfe2a31749e1243530	2021-05-19 13:09:57 -07:00
BowenBao	6d7fe76317	[ONNX] Warning when using __len__ to calculate tensor shape (#55151 ) (#57595 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57595 Difference in traced graph and outputs, when using len(tensor) as compared to tensor.shape[0] An example model is (with tensor.shape): ``` # Test len fix with variable inputs import torch import onnxruntime class Model(torch.nn.Module): def forward(self, x): return x.size(1) + x.shape[0] # Call export dummy_x = torch.randn(3, 5) model = Model() import io onnx_io = io.BytesIO() torch.onnx.export(model, (dummy_x,), onnx_io, input_names=['input'], dynamic_axes={'input': {0:'h'}}, verbose=True) # Call onnxruntime runtime and compare outputs on dynamic inputs ort_session = onnxruntime.InferenceSession(onnx_io.getvalue()) x = torch.randn(2, 5) print(model(x)) print(ort_session.run(None, {ort_session.get_inputs()[0].name: x.numpy()})) ``` The output graph is as follows: ``` graph(%input : Float(, 5, strides=[5, 1], requires_grad=0, device=cpu)): %1 : Long(2, strides=[1], device=cpu) = onnx::Shape(%input) %2 : Long(device=cpu) = onnx::Constant[value={1}]() %3 : Long(device=cpu) = onnx::Gather[axis=0](%1, %2) # test/onnx/test_m.py:9:0 %4 : Long(2, strides=[1], device=cpu) = onnx::Shape(%input) %5 : Long(device=cpu) = onnx::Constant[value={0}]() %6 : Long(device=cpu) = onnx::Gather[axis=0](%4, %5) # test/onnx/test_m.py:9:0 %7 : Long(requires_grad=0, device=cpu) = onnx::Add(%3, %6) # test/onnx/test_m.py:9:0 return (%7) ``` Torch output: 7 ORT output: 7 Now replacing tensor.shape[0] with len(tensor), the graph looks like: ``` graph(%input : Float(, 5, strides=[5, 1], requires_grad=0, device=cpu)): %1 : Long(2, strides=[1], device=cpu) = onnx::Shape(%input) %2 : Long(device=cpu) = onnx::Constant[value={1}]() %3 : Long(device=cpu) = onnx::Gather[axis=0](%1, %2) # test/onnx/test_m.py:9:0 %4 : Long(requires_grad=0, device=cpu) = onnx::Constant[value={3}]() %5 : Long(requires_grad=0, device=cpu) = onnx::Add(%3, %4) return (%5) ``` Torch output: 7 ORT output: 8 In the case with __len__, %4 is traced as a constant Workaround to avoid the mismatch when using len to get tensor.shape Add the following pattern around _export call ``` import builtins len_backup = builtins.len builtins.len = lambda x : x.__len__() # Call export _export(model, args, ..... builtins.len = len_backup ``` Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D28393526 Pulled By: SplitInfinity fbshipit-source-id: a7d50740442c7e913119f9f92deab48aa8c02843 Co-authored-by: shubhambhokare1 <shubhambhokare1@gmail.com>	2021-05-13 13:42:41 -07:00
Alban Desmaison	5e83c62a9e	Revert D28351931: [pytorch][PR] Fix some tensor operators to return `NotImplemented` for invalid inputs Test Plan: revert-hammer Differential Revision: D28351931 (`35521a2629`) Original commit changeset: 985457a44dba fbshipit-source-id: 10724c219e53648f10a70719e25bcf774c6c7852	2021-05-12 13:58:03 -07:00
Akifumi Imanishi	35521a2629	Fix some tensor operators to return `NotImplemented` for invalid inputs (#57934 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/57719. This PR fixes `torch.Tensor{__rsub__, __rdiv__, __rtruediv__, __pow__, __rmatmul__}` to return `NotImplemented` instead of raising a `TypeError`. cc/ mruberry: The first commit of this PR is the same as `1d209db1cc` excepts the commit message. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57934 Reviewed By: mruberry Differential Revision: D28351931 Pulled By: albanD fbshipit-source-id: 985457a44dba24d2496794dfb8c1661cbcd4ff8f	2021-05-12 11:03:23 -07:00
albanD	4fad8d1a2c	Update the default detach semantic for forward mode AD (#57820 ) Summary: This makes detach both forward and backward non-differentiable by default. You can pass the `only_backward_mode=True` argument to make it forward differentiable but backward non-differentiable. The important side effect of this change is that, by default, detach is not tracking any view information. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57820 Reviewed By: ezyang Differential Revision: D28287633 Pulled By: albanD fbshipit-source-id: bdc4726fcd05889f6ac84e5a3a3ef71b2ec41015	2021-05-07 15:51:18 -07:00
Akifumi Imanishi	9da0f2e95e	Support `__pos__` and `positive` (#55891 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/55604. This PR implements `torch.Tensor.__pos__` and `torch.positive` for the compatibility with NumPy’s interface. (cc: mruberry, rgommers, emcastillo and kmaehashi) Pull Request resolved: https://github.com/pytorch/pytorch/pull/55891 Reviewed By: H-Huang Differential Revision: D28025928 Pulled By: mruberry fbshipit-source-id: e43e329a802f31bf8805f6efab5c2c7ef34c88b9	2021-04-27 13:23:59 -07:00
Sameer Deshmukh	5fb1142702	Add CSR (compressed sparse row) layout for sparse tensors (#50937 ) Summary: Implement compressed sparse row format. Derived from the GCS implementation at https://github.com/pytorch/pytorch/pull/44190 Pull Request resolved: https://github.com/pytorch/pytorch/pull/50937 Reviewed By: mrshenli Differential Revision: D27439865 Pulled By: ezyang fbshipit-source-id: 3ba3dcb9679505b980ff6a5f513e913bbae2fb1d	2021-04-12 10:09:12 -07:00
Tugsbayasgalan Manlaibaatar	10abbb812a	Support tensor subclasses in Torchscript (#54817 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54817 Test Plan: python test case Imported from OSS Reviewed By: gmagogsfm Differential Revision: D27407723 fbshipit-source-id: 459b9067f07908026f94620c1cfa3e00e8b50a4e	2021-04-07 12:10:27 -07:00
Hameer Abbasi	c690ed0ae8	Fix override for __iter__ (#54702 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54702 This fixes subclassing for __iter__ so that it returns an iterator over subclasses properly instead of Tensor. Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D27352563 Pulled By: ezyang fbshipit-source-id: 4c195a86c8f2931a6276dc07b1e74ee72002107c	2021-03-30 08:30:50 -07:00
Edward Yang	1f36ce6e4d	Restore storage on meta tensors; increase meta coverage (#53973 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53973 Two parts to this PR; I had to put them together because adding support for X causes more test code to be exercised, which in turn may require a fix for Y. The first part is restoring the concept of storage to meta tensors. Previously, meta tensors had a nullptr storage (e.g., `meta_tensor.storage()` is an error.) As I was increasing the coverage of meta tensors, I started running into test cases (specifically memory overlap tests) that were failing because not having storage meant I couldn't check for memory overlap. After some discussion, we decided that it would make sense for meta tensors to model this as well (we already model strides, so getting accurate view information also seems useful). This PR does that by: * Rewrite all of the factory functions in MetaTensor.cpp to use the generic versions (which are very carefully written to not actually poke at the data pointer, so everything works out). The key idea here is we give meta tensors a special allocator, MetaAllocator, which always returns a nullptr even if you ask for a nonzero number of bytes. resize_ is also made generic; the normal variant can be used directly rather than having to instruct it to avoid resizing storage * Turn on memory overlap checking in TensorIterator even for meta tensors * Although meta tensors now have storage, the concept of meta storage is NOT exposed to Python land (as it would imply I would have to codegen MetaFloatStorage, MetaDoubleStorage, etc. classes). So `x.storage()` still raises an error and I have a cludge in `__deepcopy__` to break storage sharing upon deep copy (this is wrong, but no tests exercise this at the moment). The second part is adding more support for the most used functions in the test suite. * Inplace operations have very simple meta functions. I added `fill_`, `zero_`, `random_`, `uniform_` and `normal_`. In the case of random, I take advantage of pbelevich's templates for defining random kernels, so that I can reuse the common scaffolding, and then just register a noop stub that actually does the RNG. (Look, another structured kernels tiny variant!) * `copy_` is now implemented. Copying into a meta tensor is always OK, but copying out of a meta tensor raises an error (as we don't know what the "correct" data to copy out is in this case) * `empty_strided` usage from structured kernels now is implemented (TBH, this could have been done as soon as `empty_strided` was added) * Meta was missing in a few places in TensorOptions/DispatchKey utility functions, so I added them * Autograd engine now correctly homes meta tensors with CPU tensors (they have -1 device index so CUDA queues wouldn't work anyway) * `apply_`, `map_` and `map2_` are special cased to no-op on meta tensor self. These count as inplace operations too but they are implemented a little differently. Getting more meta function support triggers a number of bugs in the test suite, which I then fix: - Linear algebra functions sometimes don't report NotImplementedError because they get swallowed by catch all try blocks. This is tracked in https://github.com/pytorch/pytorch/issues/53739 - dlpack obviously doesn't work with meta tensors, I just disabled the test Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D27036572 Test Plan: Imported from OSS Reviewed By: agolynski, bdhirsh Pulled By: ezyang fbshipit-source-id: 7005ecf4feb92a643c37389fdfbd852dbf00ac78	2021-03-29 08:37:46 -07:00
Nikita Vedeneev	dfc7fa03e5	lu_backward: more numerically stable and with complex support. (#53994 ) Summary: As per title. Numerical stability increased by replacing inverses with solutions to systems of linear triangular equations. Unblocks computing `torch.det` for FULL-rank inputs of complex dtypes via the LU decomposition once https://github.com/pytorch/pytorch/pull/48125/files is merged: ``` LU, pivots = input.lu() P, L, U = torch.lu_unpack(LU, pivots) det_input = P.det() * torch.prod(U.diagonal(0, -1, -2), dim=-1) # P is not differentiable, so we are fine even if it is complex. ``` Unfortunately, since `lu_backward` is implemented as `autograd.Function`, we cannot support both autograd and scripting at the moment. The solution would be to move all the lu-related methods to ATen, see https://github.com/pytorch/pytorch/issues/53364. Resolves https://github.com/pytorch/pytorch/issues/52891 TODOs: * extend lu_backward for tall/wide matrices of full rank. * move lu-related functionality to ATen and make it differentiable. * handle rank-deficient inputs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/53994 Reviewed By: pbelevich Differential Revision: D27188529 Pulled By: anjali411 fbshipit-source-id: 8e053b240413dbf074904dce01cd564583d1f064	2021-03-25 13:33:58 -07:00
Ilya Persky	d4c877b59b	Fix typo "informations" -> "information" (#53746 ) Summary: Hey, fixing the [uncountable](https://www.oxfordlearnersdictionaries.com/definition/american_english/information) noun to the proper form. Pull Request resolved: https://github.com/pytorch/pytorch/pull/53746 Reviewed By: ngimel Differential Revision: D27012035 Pulled By: albanD fbshipit-source-id: dc653e739b5f6abed99b74bd2fd514b795d61b2e	2021-03-12 12:07:38 -08:00
Philip Meier	b0afe945a7	Fix pylint error torch.tensor is not callable (#53424 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53424 Fixes https://github.com/pytorch/pytorch/issues/24807 and supersedes the stale https://github.com/pytorch/pytorch/issues/25093 (Cc Microsheep). If you now run the reproduction ```python import torch if __name__ == "__main__": t = torch.tensor([1, 2, 3], dtype=torch.float64) ``` with `pylint==2.6.0`, you get the following output ``` test_pylint.py:1:0: C0114: Missing module docstring (missing-module-docstring) test_pylint.py:4:8: E1101: Module 'torch' has no 'tensor' member; maybe 'Tensor'? (no- member) test_pylint.py:4:38: E1101: Module 'torch' has no 'float64' member (no-member) ``` Now `pylint` doesn't recognize `torch.tensor` at all, but it is promoted in the stub. Given that it also doesn't recognize `torch.float64`, I think fixing this is out of scope of this PR. --- ## TL;DR This BC-breaking only for users that rely on unintended behavior. Since `torch/__init__.py` loaded `torch/tensor.py` it was populated in `sys.modules`. `torch/__init__.py` then overwrote `torch.tensor` with the actual function. With this `import torch.tensor as tensor` does not fail, but returns the function rather than the module. Users that rely on this import need to change it to `from torch import tensor`. Reviewed By: zou3519 Differential Revision: D26223815 Pulled By: bdhirsh fbshipit-source-id: 125b9ff3d276e84a645cd7521e8d6160b1ca1c21	2021-03-09 11:32:53 -08:00

27 Commits