pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Peter Bell	eb3f975c6e	Fix segfault in has_torch_function (#88559 ) Fixes #83908 `PySequence_Fast` may return `NULL` to indicate an error was raised, in which case `sequence_has_torch_function` will dereference a null pointer. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88559 Approved by: https://github.com/ezyang, https://github.com/Skylion007, https://github.com/hameerabbasi	2022-11-07 23:48:39 +00:00
Edward Z. Yang	f884e817d4	Make Python op registration work with torchdeploy/multipy (#87162 ) See strategy at PythonOpRegistrationTrampoline.cpp for the big picture. Along the way, I made OperatorHandle support == and hashing, and slightly changed the low level python_dispatch impl API to disallow empty strings for dispatch key, which had the knock on effect of requiring us to explicitly make sure we pass in CompositeImplicitAutograd if we would have passed in "" (I didn't apply this to the rest of the file because I'm lazy.) Test strategy is we delete the logic for preventing Python op registrations in torch from being skipped in a torchdeploy context and show CI still works. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/87162 Approved by: https://github.com/anjali411, https://github.com/bdhirsh	2022-11-03 12:56:44 +00:00
Edward Z. Yang	d3c01c722d	Fix pybind11 problems with c10::SymInt unregistered (#88011 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/88011 Approved by: https://github.com/weiwangmeta, https://github.com/albanD	2022-10-29 07:55:45 +00:00
Daniil Kutz	1eba3f220e	Fix bugs found by static analysis (#85705 ) These PR fixes a number of bugs found by Svace static analyzer: 1. DEREF_AFTER_FREE at qnnpack_utils.h: Pointer '&convolution->zero_buffer' is dereferenced at qnnpack_utils.h:258 after the referenced memory was deallocated at operator-delete.c:25 by passing as 1st parameter to function 'pytorch_qnnp_delete_operator' at qnnpack_utils.h:251. 2. DEREF_AFTER_NULL at impl.cpp: After having been compared to NULL value at impl.cpp:1892, pointer 'schema' is passed as 2nd parameter in call to function 'c10::operator<<' at impl.cpp:1921, where it is dereferenced at function_schema_inl.h:13. 3. DEREF_OF_NULL at stmt.h: After having been compared to NULL value at stmt.h:744, pointer 'body->_M_ptr' is passed in call to function 'torch::jit::tensorexpr::malformed_input::malformed_input' at stmt.h:745, where it is dereferenced at exceptions.h:67. 4. DEREF_OF_NULL at loopnest.h: Pointer 'f->ptr' that can have only NULL value (checked at loopnest.cpp:1482), is passed in call to function 'torch::jit::tensorexpr::malformed_input::malformed_input' at loopnest.cpp:1483, where it is dereferenced at exceptions.h:67. This is the same error as 3: forwarding a nullptr to malformed_input(). 4. TAINTED_INT.LOOP in python_arg_parser: Integer value 'this->size' obtained from untrusted source at python_arg_parser.cpp:118 without checking its bounds is used as a loop bound at python_arg_parser.cpp:698 by calling function 'torch::FunctionParameter::set_default_str' at python_arg_parser.cpp:133. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85705 Approved by: https://github.com/kit1980	2022-10-28 23:51:55 +00:00
Edward Z. Yang	1ff52225f1	Unify SymIntNode and SymFloatNode into SymNode (#87817 ) This refactor was prompted by challenges handling mixed int/float operations in C++. A previous version of this patch added overloads for each permutation of int/float and was unwieldy https://github.com/pytorch/pytorch/pull/87722/ This PR takes a different approach. The general outline of the patch is to combine the C++ types SymIntNode and SymFloatNode into a single type, SymNode. This is type erased; we no longer know statically at C++ if we have an int/float and have to test it with the is_int()/is_float() virtual methods. This has a number of knock on effects. - We no longer have C++ classes to bind to Python. Instead, we take an entirely new approach to our Python API, where we have a SymInt/SymFloat class defined entirely in Python, which hold a SymNode (which corresponds to the C++ SymNode). However, SymNode is not pybind11-bound; instead, it lives as-is in Python, and is wrapped into C++ SymNode using PythonSymNode when it goes into C++. This implies a userland rename. In principle, it is also possible for the canonical implementation of SymNode to be written in C++, and then bound to Python with pybind11 (we have this code, although it is commented out.) However, I did not implement this as we currently have no C++ implementations of SymNode. Because we do return SymInt/SymFloat from C++ bindings, the C++ binding code needs to know how to find these classes. Currently, this is done just by manually importing torch and getting the attributes. - Because SymInt/SymFloat are easy Python wrappers, __sym_dispatch__ now takes SymInt/SymFloat, rather than SymNode, bringing it in line with how __torch_dispatch__ works. Some miscellaneous improvements: - SymInt now has a constructor that takes SymNode. Note that this constructor is ambiguous if you pass in a subclass of SymNode, so an explicit downcast is necessary. This means toSymFloat/toSymInt are no more. This is a mild optimization as it means rvalue reference works automatically. - We uniformly use the caster for c10::SymInt/SymFloat, rather than going the long way via the SymIntNode/SymFloatNode. - Removed some unnecessary toSymInt/toSymFloat calls in normalize_* functions, pretty sure this doesn't do anything. - guard_int is now a free function, since to guard on an int you cannot assume the method exists. A function can handle both int and SymInt inputs. - We clean up the magic method definition code for SymInt/SymFloat/SymNode. ONLY the user classes (SymInt/SymFloat) get magic methods; SymNode gets plain methods; this is to help avoid confusion between the two types. Signed-off-by: Edward Z. Yang <ezyang@fb.com> cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87817 Approved by: https://github.com/albanD, https://github.com/anjali411	2022-10-27 20:56:02 +00:00
Nikita Shulga	82c8365c16	[BE] Delete `TH_DISALLOW_COPY_AND_ASSIGN` (#87743 ) Replace it with `AT_DISALLOW_COPY_AND_ASSIGN` and delete the header that contained this define Pull Request resolved: https://github.com/pytorch/pytorch/pull/87743 Approved by: https://github.com/atalman, https://github.com/ngimel	2022-10-26 03:31:56 +00:00
albanD	5ce9993dce	Fix a PyObject leak (#87608 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87608 Approved by: https://github.com/ezyang	2022-10-24 23:55:13 +00:00
albanD	3263bd24be	Improve argument printing (#87601 ) No more "expected tuple but got tuple". We appropriately grovel in the list/tuple for the element that mismatched and report what exactly twinged the failure. invalid_arguments.cpp is a shitshow so I did something slapdash to get it not completely horrible. See https://github.com/pytorch/pytorch/issues/87514 for more context. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/87601 Approved by: https://github.com/Chillee	2022-10-24 23:55:10 +00:00
samdow	169ec120ef	[Modes] refactor modes to only use a stack in cpp (#86458 ) Refactors the mode code to only have the C++ mode stack and not the "C++ mode" like we originally had. This also simplifies the mode logic in a number of places Pull Request resolved: https://github.com/pytorch/pytorch/pull/86458 Approved by: https://github.com/zou3519	2022-10-21 19:18:23 +00:00
Sherlock Huang	ab901b4817	Python binding for dispatcher getAllOpNames (#87422 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/87422 Approved by: https://github.com/bdhirsh	2022-10-21 06:55:10 +00:00
anjali411	182ee87996	symintify nll loss fns (#86915 ) (#87095 ) This reverts commit `bbd7b38d55`. Reland https://github.com/pytorch/pytorch/pull/86915 with a fix for python arg parser handing for SymInt and SymIntList. This was uncovered because we are calling directly into python bindings code through test_autocast.py (`torch._C._nn.nll_loss`) without providing a value for the optional symint arg (`ignore_index`). The arg parser constructs the SymInt and SymIntList using the recorded "default_int" or "default_int_list" (schema string parsing) in case a value is not received for an optional argument. Since we weren't handling the symint case properly, the default_int just had a garbage value which was later being used to construct SymInt. Follow up issue for other unhandled parameter types: https://github.com/pytorch/pytorch/issues/87283 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87095 Approved by: https://github.com/ezyang, https://github.com/albanD	2022-10-19 14:50:51 +00:00
holimion	33343def0b	add XLA backend into tensor type strings (#86881 ) add XLA backend into tensor type strings Pull Request resolved: https://github.com/pytorch/pytorch/pull/86881 Approved by: https://github.com/bdhirsh	2022-10-17 18:27:49 +00:00
Edward Z. Yang	954660a308	Correctly error if you pass in tensors where size arguments expected (#86126 ) This also makes symintlist track intlist exception handling, which eellison fixed. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/86126 Approved by: https://github.com/eellison	2022-10-03 20:18:41 +00:00
Edward Z. Yang	07800c9c81	Miscellaneous fixes from symbolic-shapes branch (#86042 ) - Make toIValue accept SymIntNode and SymFloatNode where number (aka Scalar) is expected - Binding for symintlistOptional in python arg parser - Teach translate to convert from IntArrayRef to ArrayRef<int64_t> - Don't query _symint function for meta info in LTC unless LTC is code generating a symint function Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/86042 Approved by: https://github.com/Chillee	2022-10-01 13:57:58 +00:00
Edward Z. Yang	3b6588ab74	Consistent compute numel/contiguous strategy with SymInts (#85858 ) Previously, our handling for contiguity was inconsistent in the following ways: - is_strides_like 2d/3d and is_non_overlapping_and_dense always were computed based on sizes_and_strides_, even if you had symbolic ints - Furthermore, even if you set custom policy for strides, these quantities were not overridable by subclasses - Furthermore, we didn't even store these fields on ExtraMeta - We duplicate implementations of compute_contiguous (plain, channels last, channels last 3d) - We inconsistently called refresh_numel()/refresh_contiguous(), versus recomputing it ourselves This factor makes a consistent strategy for all of the boolean fields, and for numel computation. After this refactor: - All layout boolean fields are interposable via strides policy and can be overridden from Python; you will never access a garbage field - All layout boolean fields are on ExtraMeta - You can always call refresh_numel/contiguous, no matter if your Tensor is contiguous or not - The numel/layout boolean fields are always populated consistently with the sizes strides fields (either on Tensor or ExtraMeta), even if you have custom policy - There is only one implementation of the actual computation logic Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39907696](https://our.internmc.facebook.com/intern/diff/D39907696) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85858 Approved by: https://github.com/albanD	2022-09-30 21:26:34 +00:00
Elias Ellison	75db0225ad	Handle fake tensor in intlist (#85759 ) Previously, we were swallowing up the Fake Tensor Exception and throwing `TypeError`, which led to https://github.com/pytorch/torchdynamo/issues/1066. Now, we are propagating back the `DataDependentOutputException`. If this approach is accepted, I can go ahead and do doublelist, symintlist, afterward. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85759 Approved by: https://github.com/ezyang	2022-09-28 21:58:54 +00:00
samdow	18d8c548f4	[Modes] remove enable and rewrite mode stack (squashed) (#84774 ) Based on @ezyang's suggestion, mode stack now has "one true mode" which is the _only_ mode that can ever be active at the C++ level. That mode's torch dispatch is just to take the top mode in the stack, reenable itself (if we aren't at the end of the mode stack), and run the top mode's torch_{dispatch\|function} This maintains that in the middle of a mode's torch dispatch, the mode itself will not be active. It changes the function the user has to call to see what the current mode is (no longer queries the C++, it's python only) but allows the user to also see the entire mode stack easily Removes `enable_torch_dispatch_mode` and `.restore()` since neither makes sense in this new setup ### Background Why do we want this? Well, a pretty common pattern that was coming up was that users had to do something like ```python ## PRE-PR UX def f(mode): with mode.restore(): # user needs to understand this restore thing? ... with Mode() as m: pass f(m) ``` Many users were getting error from forgetting to call `.restore` or from forgetting to add the (tbh weird) "mode instantiation" step where they use the mode as a context manager with an empty body. Really, they wanted to treat modes like context managers and just write ```python ## FROM FEEDBACK, USER DESIRED CODE. POSSIBLE POST-PR def f(mode): with mode: ... f(Mode()) ``` Technical Details With the old mode stack, we basically had a linked list so the mode itself could only be used once and had a fixed parent. In this new design, the mode stack is just a python list that we're pushing to and popping from. There's only one mode that's ever active at the C++ level and it runs the next mode in the Python list. The modes don't have state on them anymore Pull Request resolved: https://github.com/pytorch/pytorch/pull/84774 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-09-27 01:04:35 +00:00
Edward Z. Yang	9c036aa112	Add SymInt to Scalar (#84958 ) This is by no means comprehensive, but adds initial support for SymInt as a Scalar. Things that don't work yet but need to: - for some reason `torch.add(tensor, sym_int)` got matched to the `add.Tensor(Tensor self, Tensor other, *, Scalar alpha=1) -> Tensor` schema - `x + sym_int` failed bc we tried to turn `x` into a sym int: ``` "__radd__", [](c10::SymIntNode a, py::object b) -> c10::SymIntNode { auto snb = toSymIntNode(a, b); return a->add(snb); }) ``` - Many more things I'm sure Pull Request resolved: https://github.com/pytorch/pytorch/pull/84958 Approved by: https://github.com/ezyang	2022-09-25 23:51:06 +00:00
Nikita Shulga	d5cabf7946	Make functorch compilable with Py-3.11 (#85054 ) By using compatibility wrappers from [python_compat.h](https://github.com/pytorch/pytorch/blob/master/torch/csrc/utils/python_compat.h) and skipping part of `getname` switch Fixes https://github.com/pytorch/pytorch/issues/85006 Please note that `import torch` right now fails by default on 3.11 with some jit issue, so I think this shouldn't be a really issue for a bit Pull Request resolved: https://github.com/pytorch/pytorch/pull/85054 Approved by: https://github.com/kit1980, https://github.com/zdevito	2022-09-23 04:48:18 +00:00
Richard Zou	cd32a86bf2	Stop monkeypatching Tensor.backward() on `import functorch` (#85152 ) Monkeypatching is bad, we should never be doing it. This PR removes functorch's monkeypatching on Tensor.backward() by adding it directly to the implementation of Tensor.backward(). As an alternative, we could have done an `import functorch` and used `functorch._C.are_transforms_active` directly in `torch/autograd/__init__.py`. The problem with that is that it runs into a bunch of circular imports. NB: https://github.com/pytorch/pytorch/issues/72179 is still on my mind. I didn't choose to do it right now because: - This PR doesn't make the situation worse than it already is (no monkeypatching is better than having the monkeypatch) - We don't have a design for #72179 yet. Test Plan: - tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/85152 Approved by: https://github.com/soulitzer	2022-09-19 17:06:15 +00:00
Edward Z. Yang	e5fac7f5dc	Optimize torch.ops.ns.opname.overload accessor in torch dispatch (#85132 ) This doesn't actually seem to help all that much. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85132 Approved by: https://github.com/wconstab	2022-09-16 20:21:03 +00:00
Michael Voznesensky	8ca1839d32	Python Dispatcher integration with C++ dispatcher (#85050 ) #84826 but without ghstack Pull Request resolved: https://github.com/pytorch/pytorch/pull/85050 Approved by: https://github.com/malfet	2022-09-15 00:43:36 +00:00
PyTorch MergeBot	706b990306	Revert "Python Dispatcher integration with C++ dispatcher (#84826 )" This reverts commit `35f6a69191`. Reverted https://github.com/pytorch/pytorch/pull/84826 on behalf of https://github.com/malfet due to Broke dynamo, see `35f6a69191`	2022-09-14 14:07:58 +00:00
Michael Voznesensky	35f6a69191	Python Dispatcher integration with C++ dispatcher (#84826 ) Signed-off-by: Edward Z. Yang <ezyangfb.com> From @ezyang's original PR: There are a number of situations where we have non-backend kernels (e.g., CompositeImplicitAutograd, batching rules) which we would like to port to Python, but we have no way to integrate these ports with the overall system while using preexisting C++ registrations otherwise. This PR changes that by introducing a Python dispatcher (which can have its own kernels directly in Python), which can be interpose over ordinary C++ dispatch. The ingredients: We introduce a new PythonDispatcher dispatch key, that has the same tenor as FuncTorchDynamicLayerFrontMode: it works by getting triggered before every other dispatch key in the dispatch key, and shunting to a Python implementation The Python dispatcher is a per-interpreter global object that is enabled/disabled via the guard EnablePythonDispatcher/DisablePythonDispatcher. We don't make it compositional as I have no idea what a compositional version of this feature would look like. Because it is global, we don't need to memory manage it and so I use a simpler SafePyHandle (newly added) to control access to this pointer from non-Python C++. Like __torch_dispatch__, we use PyInterpreter to get to the Python interpreter to handle the dispatch. I need to reimplement dispatch table computation logic in Python. To do this, I expose a lot more helper functions for doing computations on alias dispatch keys and similar. I also improve the pybind11 handling for DispatchKey so that you can either accept the pybind11 bound enum or a string; this simplifies our binding code. See https://github.com/pybind/pybind11/issues/483#issuecomment-1237418106 for how this works; the technique is generally useful. I need to be able to call backend fallbacks. I do this by permitting you to call at a dispatch key which doesn't have a kernel for the operator; if the kernel doesn't exist, we check the backend fallback table instead. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84826 Approved by: https://github.com/ezyang	2022-09-14 06:57:19 +00:00
Nikolay Korovaiko	f725009a48	as_strided supports SymInt; codegen supports optional SymInt (#84393 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84393 Approved by: https://github.com/ezyang	2022-09-06 16:39:24 +00:00
Edward Z. Yang	2a332afbf4	Add SymFloat, support SymInt to SymFloat conversion (#84284 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84284 Approved by: https://github.com/albanD	2022-09-03 01:30:32 +00:00
Elias Ellison	97b2dff600	Add Initial Support For Fake Tensor Constant Tracking (#84387 ) Adds support for constant tensor tracking within FakeTensors. Copy-pasta'ing from `proxy_tensor.py` why this is useful: ``` # In some circumstances, we will be tracing in a situation where a tensor # is statically known to be a constant (currently, this only happens if # you run torch.tensor; deterministic factory functions like torch.arange # don't get this treatment). When the tensor in question is small, it's # helpful to due constant propagation in case we call item() (in which # case we can return the constant value that is known, rather than give # an error.) ``` This PR only attempts to add support for the tracing scenarios where we run each operation linearly - aot autograd, torchdynamo. It does not yet handle how constant tensors should be handled as part of the persistent fx graph. Additionally, it does not yet attempt to de-duplicate or interact with ProxyMode's only constant tensor handling. Edit: plan is to rely on functionalization for fx graph Pull Request resolved: https://github.com/pytorch/pytorch/pull/84387 Approved by: https://github.com/ezyang	2022-09-02 02:43:04 +00:00
Antonio Kim	7371761d9c	Add Lazy backend type string (#84228 ) As the title suggest, the `Lazy` case was missing the in the `backend_to_string` switch case causing ``` RuntimeError: Unimplemented backend Lazy ``` when called with a lazy backend. CC: @wconstab @Krovatkin @desertfire Pull Request resolved: https://github.com/pytorch/pytorch/pull/84228 Approved by: https://github.com/wconstab	2022-08-30 00:31:35 +00:00
Michael Voznesensky	ced2ca8f86	Torch cond operator, python dispatch, pyoperator (#83154 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/83154 Approved by: https://github.com/ezyang	2022-08-25 20:11:53 +00:00
Nikolay Korovaiko	63cbdc92a7	switching the exact check to isinstance check (#84023 ) Simplifying a type check if an object is a SymIntNode in `is_symint_node` Pull Request resolved: https://github.com/pytorch/pytorch/pull/84023 Approved by: https://github.com/ezyang	2022-08-25 08:28:40 +00:00
Edward Z. Yang	2d8f091f6a	Move TorchDispatchModeTLS to c10/core (#83370 ) I need to access it directly from TensorImpl to route directly TensorImpl induced operations to modes (upcoming PR). Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83370 Approved by: https://github.com/zou3519	2022-08-15 17:59:57 +00:00
Nikolay Korovaiko	5b621205f4	Revert "Revert "adding a custom caster for c10::SymInt (#82692 )"" (#83223 ) This should fix the MacOS build errors and reland #82692 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83223 Approved by: https://github.com/albanD	2022-08-12 00:46:50 +00:00
Brian Hirsh	1a51efd8bb	dispatch API for checking computed table, use it in prim decomps (#82358 ) Fixes https://github.com/pytorch/pytorch/issues/82331 Expose a `torch._C._dispatch_has_computed_kernel_for_dispatch_key` to check if an operator has a kernel registered to the given dispatch key in the computed table. Use it in the prim registration logic, making it more accurate and robust (so that it e.g. picks up `CompositeExplicitAutograd` kernels. It looks like before this change we'd register 134 prim ops to the meta key, and after we only register 62. So that's 72 ops that now use an existing C++ decomp to get meta working, instead of going directly through the prim decomp. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82358 Approved by: https://github.com/ezyang	2022-08-10 23:42:02 +00:00
Elias Ellison	8a6b076196	lift numpy tensor, add randperm support (#83191 ) Couple changes needed to trace huggingface w fake tensors. Similar to https://github.com/pytorch/pytorch/pull/81927, need to call liftfresh for tensors created from numpy tensors. Also adds randperm for meta. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83191 Approved by: https://github.com/bdhirsh	2022-08-10 22:27:51 +00:00
PyTorch MergeBot	daeea7d2c3	Revert "adding a custom caster for c10::SymInt (#82692 )" This reverts commit `dee63f4f7b`. Reverted https://github.com/pytorch/pytorch/pull/82692 on behalf of https://github.com/seemethere due to Broke internal builds, see [logs](https://www.internalfb.com/intern/sandcastle/job/4503600373141339/insights)	2022-08-09 22:17:41 +00:00
Nikita Shulga	10be80de40	[BE] Fix unused but set variable (#83046 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83046 Approved by: https://github.com/kit1980, https://github.com/janeyx99	2022-08-09 18:21:59 +00:00
Nikolay Korovaiko	dee63f4f7b	adding a custom caster for c10::SymInt (#82692 ) ### Description Adding a custom caster for `c10::SymInt`. This simplifies handling of c10::SymInt on C++/Pytorch boundary. Namely, removing if statements to handle the union nature (e.g. SymIntNode, int) of c10::SymInt. ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82692 Approved by: https://github.com/ezyang	2022-08-08 21:40:53 +00:00
Peter Bell	2c2278a960	Make python TensorOption signatures consistent with JIT schemas (#82241 ) Fixes #81774 `TensorOptions` arguments in the JIT schema are optional, but in the Python API these were being translated to non-optional but with a default value. This change makes the arguments accept `None` for consistency with the JIT schema. However, it also means that `dtype=c10::nullopt` was previously completely untested so this also fixes several related bugs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82241 Approved by: https://github.com/ngimel	2022-08-07 00:10:27 +00:00
David Berard	149f0eb51e	[JIT] refactor SchemaInfo to lazily initialize (#82596 ) This should reduce the prevalence of #82324 Differential Revision: [D38325919](https://our.internmc.facebook.com/intern/diff/D38325919) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82596 Approved by: https://github.com/goldenxuett	2022-08-02 19:49:44 +00:00
Edward Z. Yang	df69660832	Revert "Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )"" (#82599 ) This reverts commit `532b8a9e00`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82599 Approved by: https://github.com/albanD	2022-08-02 19:37:02 +00:00
goldenxuett	d326a55bff	[JIT] Refactor SchemaInfo training ops to account for rrelu_with_noise case (#82441 ) - Refactor SchemaInfo to be able to handle cases where other variables besides running_mean and running_var mutate due to training = true - Add special case rrelu_with_noise to fix https://github.com/pytorch/pytorch/issues/82434 - Tested by running SchemaInfo tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/82441 Approved by: https://github.com/davidberard98	2022-08-02 18:51:55 +00:00
Edward Z. Yang	bf387e894f	Fix a NotImplemented mode bug and improve Parameter handling for fake tensor (#82574 ) Partially addresses https://github.com/pytorch/pytorch/issues/82547 The repro script still doesn't work with fake tensor, but it is now expected as fake tensor does not work unless all inputs are explicitly wrapped into fake tensor. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82574 Approved by: https://github.com/eellison	2022-08-01 20:40:01 +00:00
PyTorch MergeBot	532b8a9e00	Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )" This reverts commit `9465c0e0b5`. Reverted https://github.com/pytorch/pytorch/pull/82552 on behalf of https://github.com/zengk95 due to This seems to be breaking windows binary wheels	2022-08-01 20:25:35 +00:00
Edward Z. Yang	9465c0e0b5	Add a lint rule for torch/csrc/util/pybind.h include (#82552 ) We define specializations for pybind11 defined templates (in particular, PYBIND11_DECLARE_HOLDER_TYPE) and consequently it is important that these specializations always be #include'd when making use of pybind11 templates whose behavior depends on these specializations, otherwise we can cause an ODR violation. The easiest way to ensure that all the specializations are always loaded is to designate a header (in this case, torch/csrc/util/pybind.h) that ensures the specializations are defined, and then add a lint to ensure this header is included whenever pybind11 headers are included. The existing grep linter didn't have enough knobs to do this conveniently, so I added some features. I'm open to suggestions for how to structure the features better. The main changes: - Added an --allowlist-pattern flag, which turns off the grep lint if some other line exists. This is used to stop the grep lint from complaining about pybind11 includes if the util include already exists. - Added --match-first-only flag, which lets grep only match against the first matching line. This is because, even if there are multiple includes that are problematic, I only need to fix one of them. We don't /really/ need this, but when I was running lintrunner -a to fixup the preexisting codebase it was annoying without this, as the lintrunner overall driver fails if there are multiple edits on the same file. I excluded any files that didn't otherwise have a dependency on torch/ATen, this was mostly caffe2 and the valgrind wrapper compat bindings. Note the grep replacement is kind of crappy, but clang-tidy lint cleaned it up in most cases. See also https://github.com/pybind/pybind11/issues/4099 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82552 Approved by: https://github.com/albanD	2022-08-01 17:16:58 +00:00
Edward Z. Yang	a9320e6d96	Delete SymInt::data() in favor of as_int_unchecked() (#82477 ) I audited all the sites while I was at it, and marked a few suspicious ones. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82477 Approved by: https://github.com/Chillee	2022-08-01 15:07:22 +00:00
Edward Z. Yang	50e8abbcad	Change SymIntNode into an intrusive pointer (#82548 ) This will make the pointer type a single word, which is important for packing it into an int64_t This time, this diff doesn't segfault when you build with DEBUG mode; more details at https://github.com/pybind/pybind11/issues/4099 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82548 Approved by: https://github.com/albanD	2022-08-01 15:07:21 +00:00
Kurt Mohler	14d0296e5c	Rename `_Typed/_UntypedStorage` to `Typed/UntypedStorage` and update docs (#82438 ) ### Description Since the major changes for `_TypedStorage` and `_UntypedStorage` are now complete, they can be renamed to be public. `TypedStorage._untyped()` is renamed to `TypedStorage.untyped()`. Documentation for storages is improved as well. ### Issue Fixes #82436 ### Testing N/A Pull Request resolved: https://github.com/pytorch/pytorch/pull/82438 Approved by: https://github.com/ezyang	2022-07-30 19:37:08 +00:00
Edward Z. Yang	fd5ac1e6b5	Rename SymbolicIntNode to SymIntNodeImpl (#82350 ) Done via ``` git grep -l 'SymbolicIntNode' \| xargs sed -i 's/SymbolicIntNode/SymIntNodeImpl/g' ``` Reasoning for the change: * Sym is shorter than Symbolic, and consistent with SymInt * You usually will deal in shared_ptr<...>, so we're going to reserve the shorter name (SymIntNode) for the shared pointer. But I don't want to update the Python name, so afterwards I ran ``` git grep -l _C.SymIntNodeImpl \| xargs sed -i 's/_C.SymIntNodeImpl/_C.SymIntNode/' ``` and manually fixed up the binding code Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82350 Approved by: https://github.com/Krovatkin	2022-07-28 18:27:45 +00:00
Jing Xu	0e95746580	[RFC] enable oneMKL&oneDNN on-demands verbose functinality (#63212 ) RFC: Problem statement  Intel oneMKL and oneDNN are used to accelerate performance on Intel platforms. Both these 2 libraries provide verbose functionality to dump detailed operator execution information as well as execution time. These verbose messages are very helpful to performance profiling. However, the verbose functionality works for the entire execution. In many scenarios, though, we only would like to profile partial of the execution process. This feature is to expose PyTorch API functions to control oneDNN and oneMKL verbose functionality in runtime. Additional context   The most used performance profiling steps are shown as the following code snippet: ``` def inference(model, inputs): # step0 (optional): jit model = torch.jit.trace(model, inputs) # step1: warmup for _ in range(100): model(inputs) # step2: performance profiling. We only care the profiling result, as well as oneDNN and oneMKL verbose messages, of this step model(inputs) # step3 (optional): benchmarking t0 = time.time() for _ in range(100): model(inputs) t1 = time.time() print(‘dur: {}’.format((t1-t0)/100)) return model(inputs) ``` Since environment variables MKL_VERBOSE and DNNL_VERBOSE will be effect to the entire progress, we will get a great number of verbose messages for all of 101 iterations (if step3 is not involved). However, we only care about the verbose messages dumped in step2. It is very difficult to filter unnecessary verbose messages out if we are running into a complicated usages scenario. Also, jit trace will also bring more undesired verbose messages. Furthermore, there are more complicated topologies or usages like cascaded topologies as below: ``` model1 = Model1() model2 = Model2() model3 = Model3() x1 = inference(model1, x) x2 = inference(model2, x1) y = inference(model3, x2) ``` There are many cases that it is very hard to split these child topologies out. In this scenario, it is not possible to investigate performance of each individual topology with `DNNL_VERBOSE` and `MKL_VERBOSE`. To solve this issue, oneDNN and oneMKL provide API functions to make it possible to control verbose functionality in runtime. ``` int mkl_verbose (int enable) status dnnl::set_verbose(int level) ``` oneDNN and oneMKL print verbose messages to stdout when oneMKL or oneDNN ops are executed. Sample verbose messages: ``` MKL_VERBOSE SGEMM(t,n,768,2048,3072,0x7fff64115800,0x7fa1aca58040,3072,0x1041f5c0,3072,0x7fff64115820,0x981f0c0,768) 8.52ms CNR:OFF Dyn:1 FastMM:1 TID:0 NThr:44 dnnl_verbose,exec,cpu,inner_product,brgemm:avx512_core,forward_training,src_f32::blocked:ab:f0 wei_f32::blocked:AB16b64a:f0 bia_f32::blocked:a:f0 dst_f32::blocked:ab:f0,,,mb16ic768oc768,0.0839844 ``` Design and implementation  The design is to make python-interfaced wrap functions to invoke mkl_verbose and dnnl::set_verbose functions. Design concern   - Need to add wrapper C++ functions for mkl_verbose and dnnl::set_verbose functions in torch/csrc and aten/csrc. - Python API functions will be added to device-specific backends - with torch.backends.mkl.verbose(1): - with torch.backends.mkldnn.verbose(1): Use cases   ``` def inference(model, inputs): # step0 (optional): jit model = torch.jit.trace(model, inputs) # step1: warmup for _ in range(100): model(inputs) # step2: performance profiling with torch.backends.mkl.verbose(1), torch.backends.mkldnn.verbose(1): model(inputs) # step3 (optional): benchmarking t0 = time.time() for _ in range(100): model(inputs) t1 = time.time() print(‘dur: {}’.format((t1-t0)/100)) return model(inputs) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/63212 Approved by: https://github.com/VitalyFedyunin, https://github.com/malfet	2022-07-27 23:29:35 +00:00
Nikita Shulga	d80fe49de0	[Reland] Add py-3.10 config (#82329 ) This is a re-land of #81372 and #81233 with the exception that it does not force the range-checks on older Python runtime versions and as such should not affect the internal workloads, which were the reason for revert, see https://github.com/pytorch/pytorch/pull/81372#issuecomment-1187516464 - [Py3.10] Allow floats to be imported as Long (#81372) - [CI] Move CUDA-11.6 to Python-3.10 configuration (#81233) - Don't do anything about range checks for pre-py3.10 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82329 Approved by: https://github.com/kit1980	2022-07-27 20:22:47 +00:00
goldenxuett	67c22b6c07	[JIT] Modify is_nondeterministic to utilize tags in SchemaInfo for non-mobile contexts and integrate with ir.cpp (#82253 ) - Modified is_nondeterministic method in SchemaInfo class to utilize tags. - Modified isNonDeterministic method in ir.cpp to utilize SchemaInfo when a Node is an aten op. - Added an assert to ensure that if a node is an aten op kind, it has a schema. - Tested through verifying that all IR.cpp tests run, and through adding 2 custom determinism checks to test for the special dropout edge case and a general bernoulli case. Differential Revision: [D38179499](https://our.internmc.facebook.com/intern/diff/D38179499) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82253 Approved by: https://github.com/davidberard98	2022-07-27 20:19:19 +00:00
Nikolay Korovaiko	e0faa02fe7	simplify a check in python_arg_parser for varargs. (#82271 ) ### Description <!-- What did you change and why was it needed? --> ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82271 Approved by: https://github.com/ezyang	2022-07-27 17:05:48 +00:00
albanD	4b7de26556	Fix C API to be compatible with latest 3.11 beta (#81242 ) Based off https://github.com/pytorch/pytorch/pull/80511 with extra changes: - Update pybind to the latest release as it contains some needed fixes - Extend the compat header to do reduce changes in code Pull Request resolved: https://github.com/pytorch/pytorch/pull/81242 Approved by: https://github.com/malfet, https://github.com/mattip	2022-07-27 08:37:10 +00:00
Nikolay Korovaiko	d2c47d559c	Revert "Revert "Enabling SymInt in autograd; take 3 (#81145 )"" ; make sure is_intlist checks for symintnodes (#82189 ) ### Description <!-- What did you change and why was it needed? --> ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82189 Approved by: https://github.com/ezyang	2022-07-26 20:47:11 +00:00
PyTorch MergeBot	e1bd244a14	Revert "[JIT] Modify is_nondeterministic to utilize tags in schemaInfo and integrate with ir.cpp (#81836 )" This reverts commit `fc3555ce4d`. Reverted https://github.com/pytorch/pytorch/pull/81836 on behalf of https://github.com/osalpekar due to Internal Mobile NNPACK custom_ops tests failing with Error: tags are not saved for Mobile	2022-07-26 19:11:49 +00:00
goldenxuett	fc3555ce4d	[JIT] Modify is_nondeterministic to utilize tags in schemaInfo and integrate with ir.cpp (#81836 ) - Modified is_nondeterministic method in SchemaInfo class to utilize tags. - Modified isNonDeterministic method in ir.cpp to utilize SchemaInfo when a Node is an aten op. - Added an assert to ensure that if a node is an aten op kind, it has a schema. - Tested through verifying that all IR.cpp tests run, and through adding 2 custom determinism checks to test for the special dropout edge case and a general bernoulli case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81836 Approved by: https://github.com/davidberard98	2022-07-25 15:44:31 +00:00
goldenxuett	9a5fa15ea8	[JIT] Remove BatchNorm and InstanceNorm special cases from AliasDB and replace with SchemaInfo is_mutable checks (#81785 ) - Generalized AnalyzeImpl cases for batchNorm and InstanceNorm in alias_analysis.cpp using schema_info. - Tested by ensuring all aliasDB special case checks for batchNorm and instanceNorm pass as expected. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81785 Approved by: https://github.com/davidberard98	2022-07-23 05:50:39 +00:00
Lu, Chengjun	008bff1e03	Fix missing symbol issue when USE_NUMPY=False (#81967 ) Fixes missing symbol issue if torch is built with USE_NUMPY=False. - `tensor_to_numpy` signature changed. - `warn_numpy_not_writeable` missed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81967 Approved by: https://github.com/ezyang	2022-07-22 13:57:50 +00:00
goldenxuett	c9497886fd	[JIT] Modify is_mutable in FunctionSchema and SchemaInfo to have SchemaArgument parameter instead of index (#81784 ) - Modify the is_mutable(size_t index) overload to become is_mutable(const SchemaArgument& argument) due to cases where one might want to check the mutability of either input or output arguments. - Refactored all calls to the function to use this new overload - Tested through is_mutable() tests in test_schema_info.cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/81784 Approved by: https://github.com/davidberard98	2022-07-20 22:09:56 +00:00
goldenxuett	21a4be34cd	[JIT] Enchance training ops check to be more inclusive and account for possible pybind exceptions (#81782 ) - Modified is_mutable python binding to accept a string instead of a string_view for better python compatibility. - Modified argument value adding python bindings to deal with input/self edge case due to inconsistencies in how the first variable is named. - Modified _is_alias_of and created _contains_alias_of python bindings to accurately find out if values are aliasing, or contain an alias. - Fixed is_mutable implementation to cover all ops that have mutable optional arguments. (These are all the ops that have the optional arguments 'running_mean' and 'running_var' along with either 'train', 'training' or 'use_input_stats.' Pull Request resolved: https://github.com/pytorch/pytorch/pull/81782 Approved by: https://github.com/davidberard98	2022-07-20 22:09:54 +00:00
goldenxuett	1ddbc5a7dc	[JIT] Remove has_side_effects functionality from SchemaInfo (#81575 ) - This removes all functionality from https://github.com/pytorch/pytorch/pull/81002 due to a realization that the side effects check doesn't affect any ops outside of JIT. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81575 Approved by: https://github.com/davidberard98	2022-07-19 22:33:19 +00:00
goldenxuett	8e454cc702	[JIT] Add SchemaInfo python bindings to init.cpp (#81518 ) - Added python bindings for SchemaInfo class, SchemaArgument struct, and SchemaArgType enum. - Tested that argument values are added correctly to SchemaInfo binding in test_schema_check.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/81518 Approved by: https://github.com/davidberard98	2022-07-19 22:33:19 +00:00
goldenxuett	a6e716cfed	[JIT] Add may_contains_alias function in SchemaInfo class (#81444 ) - Created may_contain_alias method in SchemaInfo which is a wrapper around FunctionSchema may_contain_alias that also accounts for argument values. This is done using similar logic to AliasDB using an internal understanding of wildcard sets and container object - Added a multitude of tests for various graph edge cases (inputs aliasing, outputs aliasing, multiple input wildcards, multiple container objects, etc...). Pull Request resolved: https://github.com/pytorch/pytorch/pull/81444 Approved by: https://github.com/davidberard98	2022-07-19 04:29:22 +00:00
goldenxuett	47cdab6601	[JIT] Fix double wildcard edge case for may_alias in SchemaInfo and improve formatting (#81439 ) - Create c10::AliasTypeSet type def of vector<TypePtr> to match alias_analysis.cpp formatting and improve readability. - Move canAliasTypeSetsAlias, mapTypeToAliasTypeSet, getAliasTypeSetContainedTypes, and getCorrectList to public in function_schema.h for use in SchemaInfo class. In the future it might be better to find a different home for most of these functions since they don't depend on functionSchema. - Created hash function for SchemaArgument - Add assert to ensure that there is only 1 input and 1 output with each alias set (excluding wildcard) - Fixed double wildcard input edge case for may_alias. (This is the case where if there is a schema with the form (Tensor(a) a, Tensor() b, Tensor() c) -> Tensor, and the argument values for 'a' and 'b' cause them to alias, then 'a' may also alias 'c'. - Added tests for double wildcard case in may_alias, mismatching types in may_alias, and the uniqueness internal assert. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81439 Approved by: https://github.com/davidberard98	2022-07-19 04:29:21 +00:00
PyTorch MergeBot	ec1b3a45ad	Revert "[Py3.10] Allow floats to be imported as Long (#81372 )" This reverts commit `69d73345a2`. Reverted https://github.com/pytorch/pytorch/pull/81372 on behalf of https://github.com/DanilBaibak due to Break internal build	2022-07-18 14:55:13 +00:00
Nikita Shulga	69d73345a2	[Py3.10] Allow floats to be imported as Long (#81372 ) Thus avoiding `TypeError: 'float' object cannot be interpreted as an integer` when trying to create integer tensor from floating point values Use `c10::checked_convert` to detect overflows during tensor construction from scalars. Modify sparse_csr test that violated this rule Fixes #69319 Tested in #81233 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81372 Approved by: https://github.com/ezyang, https://github.com/ngimel	2022-07-15 22:57:58 +00:00
goldenxuett	42ee1608d3	[JIT] Add special cases batch_norm, instance_norm and dropout for SchemaInfo (#81007 ) - Added special cases for detach in is_non_deterministic() check and batch_norm and instance_norm in is_mutable() check in SchemaInfo(). - Added tests for the above special cases for detach, batch_norm and instance_norm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81007 Approved by: https://github.com/davidberard98	2022-07-15 04:52:02 +00:00
Edward Z. Yang	fca03eeec1	Make proxy tensor support item() calls on torch.tensor constants (#81192 ) This PR is doing a few interrelated things, all of which are necessary to get correctness. Read the comment in torch/fx/experimental/proxy_tensor.py for the high level overview. Let's break down the parts of this PR: * Bug fix where `enable_torch_dispatch_mode` with `None` doesn't work. This make `enable_torch_dispatch_mode(current_mode.inner)` work which is the basis for how we temporarily disable fake tensor mode. * Bug fix for when fake tensor mode is combined with a non-mode tensor subclass. This actually could be ablated from this PR but it affects where the logic for allowing non fake tensor inputs with lift goes, so it's all in here in one go. There are some relevant tests for the fix in fake tensor, but it turns out I didn't need this because I'm always using proxy tensors as a mode (which ensures the ordering is right.) * New `lift_fresh` view operator. Note that like lift, we have to manually write the functionalize kernel for these functions. * The actual change, which is to save constants when we see them in the proxy tensor mode, and then propagate them as we go (because otherwise you'll handle mutations on constants incorrectly--see test.) This is mildly BC-breaking if anyone was previously interposing on at::lift, but this operator was relatively new and I checked functorch which has no explicit reference to lift. So I think it should not be too disruptive. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/81192 Approved by: https://github.com/samdow, https://github.com/bdhirsh	2022-07-15 03:53:40 +00:00
goldenxuett	3b4964230e	[JIT] Add side effects checks for ops in SchemaInfo subclass (#81002 ) - Added has_side_effects method which returns whether a given op has side effects. Currently this is implemented with a hard-coded list of functions copied from ir.cpp in AliasDB, but this will eventually be implemented by returning with a given schema has the has_side_effects tag. - Tested in test_schema_info.cpp with both an op with side effects and an op without side effects. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81002 Approved by: https://github.com/davidberard98	2022-07-13 00:39:30 +00:00
goldenxuett	14c28caed9	[JIT] Add determinism checks for ops in SchemaInfo subclass (#81000 ) - Added is_non_deterministic which returns whether a given op is non-deterministic. Currently this is implemented with a hard-coded list of non-deterministic functions copied from ir.cpp in AliasDB, but this will eventually be implemented by returning with a given schema has the non_deterministic tag. - Tested is_non_deterministic method with a deterministic op and a non deterministic op in test_schema_info.cpp Note that the case for op "aten::dropout(Tensor input, float p, bool train) -> Tensor" which is deterministic whenever "train=false" is not accounted for in this pr and will be fixed in a later pr. Currently "aten::dropout(Tensor input, float p, bool train) -> Tensor" is always considered nondeterministic. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81000 Approved by: https://github.com/davidberard98	2022-07-13 00:35:42 +00:00
goldenxuett	50ba94f5cc	[JIT] Add aliasing checks in SchemaInfo with associated tests (#80984 ) - Created may_alias method in SchemaInfo to update the implementation of FunctionSchema::may_alias for aliasing cases due to inputs aliasing. - Created output_alias_map_ internal variable to check cases where outputs might alias due to inputs aliasing. This variable is updated in generateAliasMap(). - Added tests for various may_alias special cases (input - input, input - output, output - output) due to inputs aliasing causing other arguments to also alias. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80984 Approved by: https://github.com/davidberard98	2022-07-13 00:18:43 +00:00
goldenxuett	aa61fdb667	[JIT] Add argumentValue functions and is_mutable checks to SchemaInfo (#80972 ) - Created addArgumentValue/s methods in SchemaInfo to pass argument values into the subclass. These are used for more accurate mutation, aliasing and determinism checks which include special cases. - Added input_alias_map_ to keep track of which inputs alias each other. This is updated with the method generateAliasMap. - Implemented is_mutable methods in SchemaInfo which also give information based on argument values. For instance, if two inputs alias and one is mutable by the schema, then the other will also be mutable. - Tested Schema Info is_mutable implementation where inputs alias as mentioned above. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80972 Approved by: https://github.com/davidberard98	2022-07-13 00:16:41 +00:00
goldenxuett	e3a870986e	[JIT] Add may_alias in FunctionSchema with associated tests (#80918 ) - Created may_alias method in FunctionSchema to publicize aliasing information about inputs and outputs of a schema. - Tested may_alias methods for basic functionality, exceptions, and wildcard functionality. Cases where elements of a container alias another argument will be handled with a new may_contain_alias method which will be created in a later pr Pull Request resolved: https://github.com/pytorch/pytorch/pull/80918 Approved by: https://github.com/davidberard98	2022-07-12 18:07:23 +00:00
goldenxuett	b4e342928b	[JIT] Add mutability checks in FunctionSchema and create SchemaInfo subclass (#80734 ) - Added overloads to is_mutable method in FunctionSchema to tell whether an argument at index is mutable or an argument with name is mutable. - Created SchemaInfo subclass of FunctionSchema with constructors from FunctionSchema and from const char* signature. - Tested is_mutable method overloads in new test_schema_info.cpp file. Note that this pr is used to set up SchemaInfo. Implementation for SchemaInfo will be addressed in later commits Differential Revision: [D37651384](https://our.internmc.facebook.com/intern/diff/D37651384) Pull Request resolved: https://github.com/pytorch/pytorch/pull/80734 Approved by: https://github.com/davidberard98	2022-07-11 19:13:06 +00:00
Can Balioglu	56dea92d97	Fix set_requires_cuda_init (#81183 ) Fixes the buggy `set_requires_cuda_init` introduced in #80788. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81183 Approved by: https://github.com/ezyang	2022-07-11 15:36:42 +00:00
Richard Zou	26582056fa	Disable functorch modes in testing's freeze_rng_state() (#81006 ) freeze_rng_state() is this thing we use to test random operations in OpInfos: it ensures that everytime the op is called the rng state is the same. Unfortunately this doesn't work with functorch, because - torch.cuda.set_rng_state() clones a Tensor and then grabs its data_ptr - functorch's modes cause functorch wrappers to get emitted on the .clone() call (even if the thing being cloned a regular Tensor). Tensor subclasses also had this problem. This PR applies the same solution as torch_dispatch did before: we're just going to disable functorch dispatch when setting the rng state. In the long run, torch_dispatch should probably have an option to interpose on torch.cuda.set_rng_state or generator.set_state... but I didn't want to think very hard right now. Test Plan: - tested with functorch tests (those tests were previously being skipped, now I can unskip some of them). Pull Request resolved: https://github.com/pytorch/pytorch/pull/81006 Approved by: https://github.com/samdow	2022-07-08 03:28:49 +00:00
George Qi	393f7f6ad7	add layout to slow path (#80429 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/80429 Approved by: https://github.com/ezyang	2022-07-06 18:01:31 +00:00
Can Balioglu	081b56fd41	Improve readability of cuda_lazy_init (#80788 ) This PR cleans up the implementation of `cuda_lazy_init.cpp` and improves its readability. No behavioral changes are introduced. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80788 Approved by: https://github.com/ezyang	2022-07-04 16:47:11 +00:00
Can Balioglu	c54aabf3eb	Exclude Fake dispatch key during tensor construction (#80782 ) This PR excludes Fake dispatch key during tensor construction in order to have consistent behavior with the DeferredInit key in torchdistX. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80782 Approved by: https://github.com/ezyang	2022-07-04 16:46:04 +00:00
Edward Z. Yang	421f04dd02	Only allow numbers as tensors if operator was explicitly allowlisted so (#80587 ) Fixes https://github.com/pytorch/pytorch/issues/80508 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/80587 Approved by: https://github.com/ngimel	2022-06-30 18:59:38 +00:00
jjsjann123	9e86796fe3	simple c10 implementation for std::call_once (#78051 ) A long standing bug on std::call_once: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66146 It could hang during re-entry after an exception handling. Added a c10 implementation yielding a bulky mutex. Not the most efficient thing but at least it shouldn't hang. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78051 Approved by: https://github.com/albanD	2022-06-28 15:47:03 +00:00
Edward Z. Yang	f7ee061638	Wconstab/reland pysymint (#79795 ) rebased https://github.com/pytorch/pytorch/pull/79617/ to see if issues are reproducible. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79795 Approved by: https://github.com/malfet	2022-06-20 22:55:06 +00:00
Elias Ellison	9705fb03b3	Add support for a couple ops Pull Request resolved: https://github.com/pytorch/pytorch/pull/79581 Approved by: https://github.com/Chillee	2022-06-20 22:25:39 +00:00
Taylor Robie	332e43ed1a	[Profiler] Expose extra fields to Python Pull Request resolved: https://github.com/pytorch/pytorch/pull/79623 Pybind11 has a really awesome feature where you can tell it how to move a type from C++ to Python just by specializing one template and it has out of the box support for variant types. (You do have to make one change to variant to avoid a bunch of chatty compiler warnings.) This will make it easy to both: A) Write principled type driven analysis in Python similar to `c10::visit` B) Expose fields that only make sense for certain events without cluttering up the API of the top level events. For now I haven't added any fields; this PR is just to handle the foundation. Differential Revision: [D36988611](https://our.internmc.facebook.com/intern/diff/D36988611/) Approved by: https://github.com/aaronenyeshi	2022-06-19 15:41:27 +00:00
PyTorch MergeBot	44436947bc	Revert "Reland PySymInt (#79617 )" This reverts commit `8ef6356f26`. Reverted https://github.com/pytorch/pytorch/pull/79617 on behalf of https://github.com/zengk95 due to this is breaking periodic jobs (and maybe pull) on trunk	2022-06-16 19:40:27 +00:00
Michael Andreas Dagitses	acd072967a	canonicalize includes of form <aten/src/ATen/...> Pull Request resolved: https://github.com/pytorch/pytorch/pull/78033 This was never intended to be supported. @override-unit-failures (Note: this ignores all push blocking failures!) Differential Revision: [D36567054](https://our.internmc.facebook.com/intern/diff/D36567054/) Approved by: https://github.com/kit1980	2022-06-16 17:46:45 +00:00
Nikolay Korovaiko	8ef6356f26	Reland PySymInt (#79617 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/79617 Approved by: https://github.com/Chillee	2022-06-16 04:18:06 +00:00
PyTorch MergeBot	b8db0a0475	Revert "Python Bindings for SymInts (#78135 )" This reverts commit `d332724071`. Reverted https://github.com/pytorch/pytorch/pull/78135 on behalf of https://github.com/ezyang due to broke torchvision tests	2022-06-15 13:52:14 +00:00
Nikolay Korovaiko	d332724071	Python Bindings for SymInts (#78135 ) This PR adds support for `SymInt`s in python. Namely, * `THPVariable_size` now returns `sym_sizes()` * python arg parser is modified to parse PyObjects into ints and `SymbolicIntNode`s * pybind11 bindings for `SymbolicIntNode` are added, so size expressions can be traced * a large number of tests added to demonstrate how to implement python symints. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78135 Approved by: https://github.com/ezyang	2022-06-14 02:17:59 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
Michael Andreas Dagitses	606b234336	turn on -Werror=unused-function in our Bazel CPU build Summary: We also fix any existing issues. Note that we only do this for the CPU build because nvcc is considered a C++ toolchain but it does not have the same flag support. Adding flags to the GPU build will cause nvcc errors. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79154 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-10 22:11:54 +00:00
PyTorch MergeBot	bcd7a20953	Revert "turn on -Werror=unused-function in our Bazel CPU build" This reverts commit `67d313a032`. Reverted https://github.com/pytorch/pytorch/pull/79154 on behalf of https://github.com/malfet due to Breaks bazel build: `67d313a032`	2022-06-10 20:43:03 +00:00
Michael Andreas Dagitses	67d313a032	turn on -Werror=unused-function in our Bazel CPU build Summary: We also fix any existing issues. Note that we only do this for the CPU build because nvcc is considered a C++ toolchain but it does not have the same flag support. Adding flags to the GPU build will cause nvcc errors. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79154 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-10 18:30:08 +00:00
Edward Z. Yang	eb856daf0f	Do not treat all dense tensors as isTensorSubclassLike Fixes https://github.com/pytorch/pytorch/issues/79079 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/79098 Approved by: https://github.com/soulitzer, https://github.com/albanD	2022-06-09 03:00:57 +00:00
Horace He	bbbfbbeddc	Added "dump ops" API to return ops instead of print (#78995 ) Useful to use for grabbing info instead of the hacky "redirect C++ output" stuff I currently do. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78995 Approved by: https://github.com/ezyang	2022-06-07 05:19:07 +00:00
Michael Suo	f551c22a20	[lint] preparatory changes for mass clang-format These were all the manual changes that were needed to preserve behavior across autoformatting. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78969 Approved by: https://github.com/ezyang	2022-06-06 23:49:45 +00:00
Edward Z. Yang	80f2c175be	Follow up on CR for "Replace TensorMeta with FakeTensor" See https://github.com/pytorch/pytorch/pull/78836 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/78895 Approved by: https://github.com/albanD	2022-06-06 22:20:40 +00:00
Rohit Goswami	3f58dd18dc	ENH: Add a force argument to `numpy()` (#78564 ) Reopened to help with merge issues. See #59790 for full context. Fixes #20778. Helps #71688. Finalizes @martinPasen's force argument for `Tensor.numpy()`. It is set to False by default. If it's set to True then we: 1. detatch the Tensor, if requires_grad == True 2. move to cpu, if not on cpu already 3. Uses .resolve_conj() if .is_conj() == True 4. Uses .resolve_neg() if .is_neg() == True cc @albanD Pull Request resolved: https://github.com/pytorch/pytorch/pull/78564 Approved by: https://github.com/albanD	2022-06-06 14:14:17 +00:00
Edward Z. Yang	587efdb5fa	Replace TensorMeta with FakeTensor Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/78836 Approved by: https://github.com/albanD, https://github.com/mruberry	2022-06-05 11:51:27 +00:00
Brian Hirsh	7ff091fc4e	move Functionalize dispatch key closer to backends Pull Request resolved: https://github.com/pytorch/pytorch/pull/77132 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-05-26 16:15:43 +00:00
Elias Ellison	2d93e1fada	Add slow path for device Pull Request resolved: https://github.com/pytorch/pytorch/pull/77684 Approved by: https://github.com/ezyang	2022-05-24 21:56:01 +00:00
Alban Desmaison	04ac80c73a	Fix a few issues on assert/double error/legacy constructor (#77966 ) Fixes https://github.com/pytorch/pytorch/issues/77960, https://github.com/pytorch/pytorch/issues/77957, https://github.com/pytorch/pytorch/issues/77781 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77966 Approved by: https://github.com/soulitzer, https://github.com/kulinseth	2022-05-20 20:25:12 +00:00
anjali411	5984bc8233	Allow specifying alias analysis while registering new ops Pull Request resolved: https://github.com/pytorch/pytorch/pull/77690 Approved by: https://github.com/ezyang	2022-05-19 21:11:40 +00:00
Edward Z. Yang	4941e72e40	Revert "Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )"" This reverts commit `c35bd8d423`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77719 Approved by: https://github.com/Chillee, https://github.com/malfet	2022-05-18 18:40:57 +00:00
PyTorch MergeBot	48581d74ad	Revert "Add dispatch mode testing for meta tensors and other stuff" This reverts commit `c1cdb1216b`. Reverted https://github.com/pytorch/pytorch/pull/77477 on behalf of https://github.com/malfet	2022-05-18 02:56:48 +00:00
Edward Z. Yang	c1cdb1216b	Add dispatch mode testing for meta tensors and other stuff We don't have any coverage for meta tensor correctness for backwards because torch function mode can only allow us to interpose on Python torch API calls, but backwards invocations happen from C++. To make this possible, I add torch_dispatch_meta test which runs the tests with __torch_dispatch__ While doing this, I needed to generate fresh expected failure / skip lists for the new test suite, and I discovered that my original scaffolding for this purpose was woefully insufficient. So I rewrote how the test framework worked, and at the same time rewrote the __torch_function__ code to also use the new logic. Here's whats new: - Expected failure / skip is now done on a per function call basis, rather than the entire test. This means that separate OpInfo samples for a function don't affect each other. - There are now only two lists: expect failure list (where the test consistently fails on all runs) and skip list (where the test sometimes passes and fails. - We explicitly notate the dtype that failed. I considered detecting when something failed on all dtypes, but this was complicated and listing everything out seemed to be nice and simple. To keep the dtypes short, I introduce a shorthand notation for dtypes. - Conversion to meta tensors is factored into its own class MetaConverter - To regenerate the expected failure / skip lists, just run with PYTORCH_COLLECT_EXPECT and filter on a specific test type (test_meta or test_dispatch_meta) for whichever you want to update. Other misc fixes: - Fix max_pool1d to work with BFloat16 in all circumstances, by making it dispatch and then fixing a minor compile error (constexpr doesn't work with BFloat16) - Add resolve_name for turning random torch API functions into string names - Add push classmethod to the Mode classes, so that you can more easily push a mode onto the mode stack - Add some more skips for missing LAPACK - Added an API to let you query if there's already a registration for a function, added a test to check that we register_meta for all decompositions (except detach, that decomp is wrong lol), and then update all the necessary sites to make the test pass. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/77477 Approved by: https://github.com/zou3519	2022-05-18 00:18:34 +00:00
Michael Suo	7f1e331b34	Make SymInt constructor explicit Since we plan to have a bunch of code that is sensitive to whether or not a SymInt contains a symbolic shape or not, it seems like a bad idea to have an implicit constructor. For example, code like: ``` sizes_and_strides_.stride_at_unchecked(dim) = 0; ``` would sail through, and the `0` would get implicitly promoted to a SymInt. This is a tradeoff though: it makes code that handles `SymInt`s more clunky as `int64_t`s and integer literals need to be explicitly wrapped in `SymInt` before being used. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77666 Approved by: https://github.com/ezyang	2022-05-17 22:28:35 +00:00
Brian Hirsh	cfc87cad02	fix grad(torch.tensor()) using lift() operator Pull Request resolved: https://github.com/pytorch/pytorch/pull/77650 Approved by: https://github.com/zou3519	2022-05-17 16:55:37 +00:00
Brian Hirsh	f9f4896a07	fix torch.jit.tracing for at::lift (#77588 ) After adding the `at::lift` op, it started getting traced during `torch.jit.trace`. We don't want that to happen for BC reasons Pull Request resolved: https://github.com/pytorch/pytorch/pull/77588 Approved by: https://github.com/ezyang	2022-05-17 14:13:46 +00:00
Edward Z. Yang	b5bc954a71	Fix optional dtype/layout/memory_format pycall; fix memory format Double-header bug fix: - As reported by jansel, dtypes are still showing up as integers when the schema is an optional dtype. This is simple enough to fix and I added a test for it. But while I was at it... - I noticed that the THPMemoryFormat_new idiom with "unused" name doesn't actually work, the repr of the returned memory format object is wrong and this shows up when we try to log the args/kwargs. So I fixed memory format to do it properly along with everything else. Fixes https://github.com/pytorch/pytorch/issues/77135 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/77543 Approved by: https://github.com/albanD, https://github.com/jansel	2022-05-16 16:46:08 +00:00
anjali411	17653a53d5	Forward fix failing TestDispatch tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/77442 Approved by: https://github.com/janeyx99	2022-05-13 20:08:10 +00:00
Anjali Chourdia	2c1de3aa47	Back out Dispatcher change that makes Messenger Desktop crash on M1 devices (#77414 ) Summary: This change causes Messenger Dekstop to crash on M1 devices when the user enables background during the call. The change apparently causes the compiler to emit AVX instructions that are not supported by Rosetta. This is a surgical backout that only backs out the changes in C++ side, and not Python bindings which I believe are not shipped with Workplace Chat. Test Plan: Run the application and make sure that it doesn't crash when the background is enabled https://pxl.cl/23VSH Reviewed By: ezyang Differential Revision: D36358832 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77414 Approved by: https://github.com/bigfootjon	2022-05-13 17:33:53 +00:00
Hangchen Yu	abb6fab0f4	Add new PrivateUse1 DeviceType for non-public devices (#77208 ) Summary: The new PrivateUse1 DeviceType is associated with the PrivateUse1 DispatchKey, which can be used for non-public devices without introducing a new device type. Note that the stringified name of the PrivateUse1 device is "privateuseone". Test Plan: All CI should pass. Differential Revision: D35859437 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77208 Approved by: https://github.com/bdhirsh	2022-05-13 16:03:27 +00:00
Brian Hirsh	47dd092bae	add a new at::lift operator, fix torch.tensor for functionalization This reverts commit `85bd65a880`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77285 Approved by: https://github.com/albanD, https://github.com/ezyang	2022-05-12 13:31:19 +00:00
PyTorch MergeBot	85bd65a880	Revert "[test] try to fix torch.tensor for functionalization" This reverts commit `9edee09ed6`. Reverted https://github.com/pytorch/pytorch/pull/76319 on behalf of https://github.com/janeyx99	2022-05-11 18:48:42 +00:00
Brian Hirsh	9edee09ed6	[test] try to fix torch.tensor for functionalization Pull Request resolved: https://github.com/pytorch/pytorch/pull/76319 Approved by: https://github.com/ezyang	2022-05-11 17:27:34 +00:00
Edward Z. Yang	0a14a4c280	Register prims as operators. This makes prims look as if they were defined in native_functions.yaml but they're still all written in Python. You now need to give a full schema string for your prims. The returned prim object is now torch.ops.prim overload (prims are not allowed to be overloaded, so we return the overload, not the overload packet, for speed.) Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/77117 Approved by: https://github.com/mruberry, https://github.com/albanD	2022-05-11 16:38:14 +00:00
samdow	f23b629196	Change no_torch_function_mode to StashTorchFunctionModeGuard As discussed [here](https://github.com/pytorch/pytorch/pull/75965#discussion_r863097966), changes the torch dispatch and torch function RAII guards to have unified names (StashXModeGuard). This also match the [TLS guard](https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/core/PythonFallbackKernel.cpp#L30-L44), called StashTLSOnEntryGuard, that has a similar RAII pattern Pull Request resolved: https://github.com/pytorch/pytorch/pull/76922 Approved by: https://github.com/ezyang	2022-05-06 14:12:18 +00:00
anjali411	3d28ab0709	Minor follow up fixes for python registration Pull Request resolved: https://github.com/pytorch/pytorch/pull/76892 Approved by: https://github.com/albanD	2022-05-05 13:46:48 +00:00
anjali411	07f766df54	Allow creating new libraries and defining new operators from Python Pull Request resolved: https://github.com/pytorch/pytorch/pull/76250 Approved by: https://github.com/ezyang	2022-05-05 03:33:08 +00:00
anjali411	55f55a4cf6	Allow users to override kernels for existing C++ ops through Python Pull Request resolved: https://github.com/pytorch/pytorch/pull/75905 Approved by: https://github.com/ezyang	2022-05-05 03:31:39 +00:00
samdow	6779366f27	add nested mode to python mode Pull Request resolved: https://github.com/pytorch/pytorch/pull/75965 Approved by: https://github.com/albanD, https://github.com/ezyang, https://github.com/zou3519	2022-05-04 13:01:06 +00:00
Pearu Peterson	436a7be059	Factory functions for sparse CSC, BSR, and BSC tensors Pull Request resolved: https://github.com/pytorch/pytorch/pull/76634 Tests for Sparse Compressed factory functions Pull Request resolved: https://github.com/pytorch/pytorch/pull/76746 Approved by: https://github.com/cpuhrsch	2022-05-04 03:30:41 +00:00
Nikita Shulga	8473173c36	Remove breakpad dependency This functionality does not seem to be used and there are some requests to update dependency. Add `third_party` to torch_cpu include directories if compiling with Caffe2 support, as `caffe2/quantization/server/conv_dnnlowp_op.cc` depends on `third_party/fbgemm/src/RefImplementations.h` Pull Request resolved: https://github.com/pytorch/pytorch/pull/75394 Approved by: https://github.com/janeyx99, https://github.com/seemethere	2022-05-03 20:21:55 +00:00
samdow	598e7e5f19	[Reland] Change 'python mode' to 'torch dispatch mode' Changes Python Mode name to Torch Dispatch Mode because there is now a Torch Function Mode, so Torch Dispatch Mode and Torch Function Mode are consistent with each other Pull Request resolved: https://github.com/pytorch/pytorch/pull/76562 Approved by: https://github.com/zou3519, https://github.com/albanD	2022-05-02 20:06:43 +00:00
Pearu Peterson	e6b4d77c3e	Sparse Compressed tensor factory function 2 Pull Request resolved: https://github.com/pytorch/pytorch/pull/76623 Approved by: https://github.com/cpuhrsch	2022-05-02 17:38:30 +00:00
PyTorch MergeBot	395a620a4f	Revert "Change 'python mode' to 'torch dispatch mode'" This reverts commit `7203a73986`. Reverted https://github.com/pytorch/pytorch/pull/76562 on behalf of https://github.com/janeyx99	2022-05-02 14:42:11 +00:00
samdow	7203a73986	Change 'python mode' to 'torch dispatch mode' Changes Python Mode name to Torch Dispatch Mode because there is now a Torch Function Mode, so Torch Dispatch Mode and Torch Function Mode are consistent with each other Pull Request resolved: https://github.com/pytorch/pytorch/pull/76562 Approved by: https://github.com/zou3519	2022-05-02 13:33:58 +00:00
Pearu Peterson	ff10e45993	Unsafe Sparse Compressed tensor factory function Pull Request resolved: https://github.com/pytorch/pytorch/pull/75961 Approved by: https://github.com/cpuhrsch	2022-04-28 23:32:36 +00:00
Kulin Seth	54c75e1e8f	Add "mps" device to PyTorch framework. Remove the "mlc" device for Mac platforms. This commit will be followed up with: * adding MPS runtime components * PyTorch ops for MPS device Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/76291 Approved by: https://github.com/albanD	2022-04-27 19:21:57 +00:00
Can Balioglu	a0bf0f5611	Add new dispatch keys for Fake Tensor and Deferred Module Initialization Thanks to @bdhirsh's work, we now have room for new dispatch keys in `DispatchKey` enum. This PR adds two new keys for out-of-core [Fake Tensor](https://pytorch.org/torchdistx/latest/fake_tensor.html) and [Deferred Module Initialization](https://pytorch.org/torchdistx/latest/deferred_init.html) features. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76139 Approved by: https://github.com/bdhirsh	2022-04-27 18:48:44 +00:00
anjali411	3d438f7189	Make tolist correctly work for 0 element tensors Pull Request resolved: https://github.com/pytorch/pytorch/pull/76335 Approved by: https://github.com/ngimel	2022-04-26 18:57:24 +00:00
anjali411	79891abf40	fix UndefinedBehaviorSanitizer Pull Request resolved: https://github.com/pytorch/pytorch/pull/76235 Approved by: https://github.com/albanD	2022-04-22 20:02:31 +00:00
David Berard	272890998e	[JIT] pass more exception info through the JIT interpreter If TORCH_SHOW_CPP_STACKTRACES=1, then dump e.what() into the RuntimeError, which should make it easier to debug exceptions that happen within interpreted sections. Test: ```patch diff --git a/test/cpp/jit/test_dce.cpp b/test/cpp/jit/test_dce.cpp index 6f9161d0d9..7c574787cf 100644 --- a/test/cpp/jit/test_dce.cpp +++ b/test/cpp/jit/test_dce.cpp @@ -3,6 +3,10 @@ #include <torch/csrc/jit/ir/irparser.h> #include <torch/csrc/jit/passes/dead_code_elimination.h> #include <torch/csrc/jit/testing/file_check.h> +#include <torch/csrc/jit/runtime/interpreter.h> +#include <test/cpp/jit/test_utils.h> + +#include <ATen/ATen.h> namespace torch { namespace jit { @@ -48,5 +52,30 @@ graph(): // Check that dead code elimin testing::FileCheck().run(input, *graph); } + +TEST(EliminateDeadCodeTest, interpreterfailure) { + const std::string input = R"IR( +graph(%x.1 : Tensor): + %2 : int = prim::Constant[value=128]() # /data/users/dberard/scripts/DGB/sz.py:4:38 + %3 : int = prim::Constant[value=256]() # /data/users/dberard/scripts/DGB/sz.py:4:43 + %5 : int = prim::Constant[value=1]() # /data/users/dberard/scripts/DGB/sz.py:4:53 + %4 : int[] = prim::ListConstruct(%2, %3) + %6 : Tensor[] = aten::split_with_sizes(%x.1, %4, %5) # /data/users/dberard/scripts/DGB/sz.py:4:11 + return (%6) +)IR"; + auto graph = std::make_shared<Graph>(); + parseIR(input, graph.get()); + + //auto stack = createStack({at::randn({2, 383}, at::kCPU)}); + auto stack = createStack({at::Tensor{}}); + + Code code(graph, ""); + InterpreterState interpreter{code}; + interpreter.run(stack); + ASSERT_EQ(2, stack.size()); + ASSERT_FALSE(stack[0].toTensor().defined()); + ASSERT_FALSE(stack[1].toTensor().defined()); +} + } // namespace jit } // namespace torch ``` ^ use this to repro the interpreter issue: `TORCH_SHOW_CPP_STACKTRACES=1 ./bin/test_jit --gtest_filter="EliminateDeadCodeTest.interpreterfailure"` and the stack trace is shown. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75682 Approved by: https://github.com/eellison	2022-04-21 18:26:49 +00:00
kshitij12345	aa51704ce5	[complex32] add chalf alias for complex32 and chalf method Reference: https://github.com/pytorch/pytorch/issues/74537 Adds chalf alias for complex32 and also adds method `chalf` similar to `cfloat, cdouble` TODO: * [x] Add docs * [x] Add override Pull Request resolved: https://github.com/pytorch/pytorch/pull/75320 Approved by: https://github.com/anjali411	2022-04-20 23:44:47 +00:00
Nikolay Korovaiko	69e048b090	List of SymInt rebase on master Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/75115 Approved by: https://github.com/ezyang	2022-04-20 02:09:55 +00:00
anjali411	ce7feeadc0	Disallow calling tolist on tensors with nullptr storage Pull Request resolved: https://github.com/pytorch/pytorch/pull/75990 Approved by: https://github.com/ezyang	2022-04-18 20:54:58 +00:00
Pearu Peterson	e9791cd8c9	Validate Sparse Compressed tensor arguments Pull Request resolved: https://github.com/pytorch/pytorch/pull/75946 Approved by: https://github.com/cpuhrsch	2022-04-18 02:21:22 +00:00
PyTorch MergeBot	d79d9fa283	Revert "Remove breakpad dependency" This reverts commit `9aa3c7fd83`. Reverted https://github.com/pytorch/pytorch/pull/75394 on behalf of https://github.com/malfet	2022-04-17 17:58:51 +00:00
Nikita Shulga	9aa3c7fd83	Remove breakpad dependency This functionality does not seem to be used and there are some requests to update dependency Pull Request resolved: https://github.com/pytorch/pytorch/pull/75394 Approved by: https://github.com/janeyx99, https://github.com/seemethere	2022-04-17 17:43:45 +00:00
Alban Desmaison	3467f3fa80	Remove spurious warning when using disabled torch function Pull Request resolved: https://github.com/pytorch/pytorch/pull/75826 Approved by: https://github.com/ezyang	2022-04-15 17:08:45 +00:00
Pearu Peterson	1cd46b309b	Introduce sparse compressed layouts: SparseCsr, SparseBsr, SparseBsc Pull Request resolved: https://github.com/pytorch/pytorch/pull/75831 Approved by: https://github.com/cpuhrsch	2022-04-15 03:55:32 +00:00
Peter Bell	39717d3034	Remove histogramdd functional wrapper Merge once the forward compatibility period is expired for the histogramdd operator. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74201 Approved by: https://github.com/ezyang, https://github.com/albanD	2022-04-14 20:56:24 +00:00
PyTorch MergeBot	715e07b97f	Revert "Remove histogramdd functional wrapper" This reverts commit `8cc338e5c2`. Reverted https://github.com/pytorch/pytorch/pull/74201 on behalf of https://github.com/suo	2022-04-14 03:56:48 +00:00
Peter Bell	8cc338e5c2	Remove histogramdd functional wrapper Merge once the forward compatibility period is expired for the histogramdd operator. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74201 Approved by: https://github.com/ezyang	2022-04-14 02:47:39 +00:00
johnlu	ac8d220188	Add `__torch_function__` override protocol supporting to some factory functions ## Motivation Add `__torch_function__` override protocol supporting to the factory functions in defined in pytorch_torch_funcions_manual.cpp. ## Solution By moving the PythonArg parser from the tensor_new.cpp and add the torch function handle dispatching for these API in `torch` name space. as_tensor sparse_coo_tensor _sparse_coo_tensor_unsafe sparce_csr_tensor _sparce_csr_tensor_unsafe. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75639 Approved by: https://github.com/ezyang	2022-04-13 03:18:55 +00:00
Edward Z. Yang	0a1bc5f501	Miscellaneous __torch_function__ fixes I figured these out by unconditionally turning on a no-op torch function mode on the test suite and then fixing errors as they showed up. Here's what I found: - _parse_to failed internal assert when __torch_function__'ed because it claims its name is "to" to the argument parser; added a name override so we know how to find the correct name - Infix operator magic methods on Tensor did not uniformly handle __torch_function__ and TypeError to NotImplemented. Now, we always do the __torch_function__ handling in _wrap_type_error_to_not_implemented and your implementation of __torch_function__ gets its TypeErrors converted to NotImplemented (for better or for worse; see https://github.com/pytorch/pytorch/issues/75462 ) - A few cases where code was incorrectly testing if a Tensor was Tensor-like in the wrong way, now use is_tensor_like (in grad and in distributions). Also update docs for has_torch_function to push people to use is_tensor_like. - is_grads_batched was dropped from grad in handle_torch_function, now fixed - Report that you have a torch function even if torch function is disabled if a mode is enabled. This makes it possible for a mode to return NotImplemented, pass to a subclass which does some processing and then pass back to the mode even after the subclass disables __torch_function__ (so the tensors are treated "as if" they are regular Tensors). This brings the C++ handling behavior in line with the Python behavior. - Make the Python implementation of overloaded types computation match the C++ version: when torch function is disabled, there are no overloaded types (because they all report they are not overloaded). Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/75484 Approved by: https://github.com/zou3519	2022-04-11 16:52:16 +00:00
Anthony Barbier	ce9e27a0fc	Add new keys for Graphcore IPU (DispatchKey / Backend / DeviceType) We need a key to register our out of tree backend: https://github.com/graphcore/poptorch Pull Request resolved: https://github.com/pytorch/pytorch/pull/74763 Approved by: https://github.com/bdhirsh	2022-04-07 17:18:45 +00:00
Edward Z. Yang	31c86625cc	__torch_function__ mode Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/75154 Approved by: https://github.com/albanD, https://github.com/zou3519	2022-04-07 02:23:29 +00:00
Peter Bell	1ab03a0f6f	Deprecate `__torch_function__` as instance method in C++ Ref #63767 This has already been deprecated in the python code for a long time, but was never deprecated in the C++ api so it's possible users might not have had sufficient warning yet. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74829 Approved by: https://github.com/ezyang	2022-04-06 02:28:00 +00:00
Nikita Shulga	c593c220ff	Fix sign-compare violations in torch_python Prerequisite change for enabling `-Werror=sign-compare` across PyTorch repo Pull Request resolved: https://github.com/pytorch/pytorch/pull/75079 Approved by: https://github.com/albanD	2022-04-05 00:08:05 +00:00
Edward Z. Yang	e3848d75df	Dedupe no parsing __torch_function__ handler Now there is truly only one way to call __torch_function__ and that is via handle_torch_function_no_python_arg_parser Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/75159 Approved by: https://github.com/zou3519	2022-04-04 14:35:02 +00:00
Edward Z. Yang	3f108a5cc1	Save disable_torch_function in ThreadLocalState If __torch_function__ was disabled, this TLS should propagate to other threads. Although I was thinking about https://github.com/pytorch/pytorch/pull/73942 when I did this, this doesn't actually help solve the problem, because when I disable __torch_function__ as part of the disabled __torch_function__ implementation, this is prior to when snapshotting happens (also snapshotting only happens for Python tensors anyway). I intend to add some more TLS to this struct soon, which is why it's a struct and not just a bool. Testing is not so easy to do because on CPU there isn't an easy way to get Python code running in another thread. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/75110 Approved by: https://github.com/albanD	2022-04-02 02:18:01 +00:00
Nikolay Korovaiko	5177f95d21	Introducing SymInt to Pytorch (for tracing size arithmetic) (master rebase) (#74861 ) Summary: This PR introduces `SymInt` type to Pytorch which will be used by LTC and AOTAutograd for tracing size arithmetic and tests. `SymInt` is a C++ union structure [int64_t, SymbolicIntNode*] that wraps around an int64_t field where the value of the field could be an index into a list of `shared_ptr<SymbolicIntNode>` or a real int. This PR doesn't add any support for actually tracing symbolic ints. i.e. data_ for now can only contain real ints. ``` Goal 1: just to show we can add a type to PyTorch core. (wraps int) LANDEABLE Finalize the naming - symint Want the name to be short Does invoke “size” - NO SInt/SymInt/SymbolicInt SInt could mean signed int sym_int or symint or SymInt (originally it was “int”; capitalized implies object semantics, whereas lowercase implies value semantics) JIT schema - symint C++ - symint ``` See more details here: https://docs.google.com/document/d/1iiLNwR5ohAsw_ymfnOpDsyF6L9RTUaHMpD8 (`d843f63f2a`)YLw-jxEw Pull Request resolved: https://github.com/pytorch/pytorch/pull/74861 Reviewed By: qihqi, ngimel Differential Revision: D35226230 Pulled By: Krovatkin fbshipit-source-id: 34acf342bd50fcaa4d8d5dd49c2fd6a98823a5b3 (cherry picked from commit 218643f63ef181cabb92d13a6e837eb64f2dda3c)	2022-03-31 21:59:59 +00:00
Peter Bell	c7f9da5752	Add C++ implementation of histogramdd This creates a `histogramdd` operator with overloads matching the `Union` behaviour used in the functional variant. Moving into C++ is preferred because it can handle torch function automatically instead of needing to differentiate between the overloads manually. This also adds a new return type: `std::tuple<Tensor, std::vector<Tensor>>`. For which I've updated `wrap` to be completely generic for tuples and removed the old manual definitions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74200 Approved by: https://github.com/ezyang	2022-03-29 02:17:21 +00:00
Sherlockk Huang	1c5a812579	Better type checking in disable_torch_function/dispatch A follow up fix for PR #74509 Original issue: #73933 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74720 Approved by: https://github.com/gchanan	2022-03-26 14:49:11 +00:00
Sherlock Huang	f4a0da8695	Supports super().__torch_dispatch__ with arguments list Summary: For THPModule_disable_torch_(dispatch\|function), converts list arguments to tuple before invoking PyObject_Call. Fixes #73933 Test Plan: Reviewers: Subscribers: Tasks: 114830027 Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/74509 Fix PyObject leak issue Pull Request resolved: https://github.com/pytorch/pytorch/pull/74567 Approved by: https://github.com/ezyang	2022-03-23 23:33:44 +00:00
kshitij12345	f7ee308dfb	[complex-half] support casting (by updating copy_) Reference https://github.com/pytorch/pytorch/issues/71680 Pull Request resolved: https://github.com/pytorch/pytorch/pull/73847 Approved by: https://github.com/anjali411	2022-03-23 21:42:59 +00:00
Edward Z. Yang	4917f8ce0a	Throw python_error if the call returns nullptr. It's necessary to throw an exception so that PyWarningHandler knows that there is already an exception and it properly propagates it. I need to think about how to lint for this situation in the future. I also need to work out how to test this fix (my local repro is fixed after this change). Fixes https://github.com/pytorch/pytorch/issues/74334 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/74357 Approved by: https://github.com/anjali411	2022-03-22 01:25:25 +00:00
Edward Yang	0239284313	Relax dtype restrictions on torch.Tensor (#73850 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73850 Previously, torch.Tensor was treated as if it were torch.FloatTensor (where Float is whatever the default dtype was). This is not good behavior for tensor subclasses, which inherit from torch.Tensor and will want to super() call into it and will only notice later that only float works as a dtype. So in this PR I relax the behavior for this case to make the torch.Tensor constructor more useful for subclasses. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D34707396 Pulled By: ezyang fbshipit-source-id: a995d601007b6fcd0317d89f66ca7e08c4d6053e (cherry picked from commit e8d0d7b3e8b17681b931cbe4f5729de2e80cf3de)	2022-03-09 15:45:24 +00:00
Edward Yang	d6c29b1d30	Deduplicate legacy _ctor and _new Python bindings (#73822 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73822 I guess hypothetically the logic duplication here is a faux amis because we could say that the constructor and new method should evolve APIs independently... but nah, it's not worth it. There is only very slight differences between the two functions: different error messages, and the new method does extra checks to make sure the requested types are consistent with the base Tensor. But I need to refactor this code and I really don't want to do the refactor twice. So dedupe first. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: anjali411 Differential Revision: D34665171 Pulled By: ezyang fbshipit-source-id: bd40ec7f6e694bfeff4e4aaab2f4e95cea250b65 (cherry picked from commit 10a03926d8d8f36506c9a3d62cf2c380f559b00b)	2022-03-08 00:56:55 +00:00
Edward Z. Yang	35cfa74f97	Add a default implementation of __torch_dispatch__ I was working on an explanation of how to call into the "super" implementation of some given ATen operation inside of __torch_dispatch__ (https://github.com/albanD/subclass_zoo/blob/main/trivial_tensors.py) and I kept thinking to myself "Why doesn't just calling super() on __torch_dispatch__ work"? Well, after this patch, it does! The idea is if you don't actually unwrap the input tensors, you can call super().__torch_dispatch__ to get at the original behavior. Internally, this is implemented by disabling PythonKey and then redispatching. This implementation of disabled_torch_dispatch is not /quite/ right, and some reasons why are commented in the code. There is then some extra work I have to do to make sure we recognize disabled_torch_dispatch as the "default" implementation (so we don't start slapping PythonKey on all tensors, including base Tensors), which is modeled the same way as how disabled_torch_function is done. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/73684 Approved by: albanD	2022-03-03 20:19:33 +00:00
Peter Bell	f437ca6e8e	Remove legacy tensor constructors for complex dtypes PR #72405 added four new types to the public python API: `torch.ComplexFloatTensor`, `torch.ComplexDoubleTensor`, `torch.cuda.ComplexFloatTensor` and `torch.cuda.ComplexDoubleTensor`. I believe this was unintentional and a clarifying comment as to the purpose of `all_declared_types` is needed to avoid this in future. Pull Request resolved: https://github.com/pytorch/pytorch/pull/73370	2022-02-28 15:13:44 +00:00
Shuitao Fan	05c86c2be1	T112685841: Use irange in PyTorch (#73378 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73378 1) ran check_for_c10_loops.py to automatically update all files (.h, .hpp, *.cpp) under fbcode/caffe2/torch (this is the path in the check_for_c10_loops.py, slightly different from the task description where the path mentioned was fbcode/caffe2. since current commit already contains 27 files, will use a separate commit for additional files). 2) manually reviewed each change, and reverted a few files: (a) select_keys.cpp, bucketize_calibration.cpp, index_mmh and TCPStore.cpp: iterator modified in loop (b) qlinear_4bit_ops.cpp and id_list_feature_merge_conversion.cpp: condition containing multiple expressions. Test Plan: Doing the following (still in progress, will address issues as they appear): buck build ... buck test ... Reviewed By: r-barnes Differential Revision: D34435473 fbshipit-source-id: b8d3c94768b02cf71ecb24bb58d29ee952f672c2 (cherry picked from commit fa9b0864f3761a501868fe0373204b12fdfc2b32)	2022-02-26 06:34:22 +00:00
Nikita Karetnikov	75db05c3fd	Check if the iterator is valid before dereferencing it (#72405 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72405 Fixes #71674. This shouldn't segfault now: ``` import torch d = torch.complex64 torch.set_default_dtype(d) ``` Test Plan: Imported from OSS Reviewed By: jbschlosser Differential Revision: D34423660 Pulled By: anjali411 fbshipit-source-id: cac92a6f56846f2c0727a120b5f568aa75baa21e (cherry picked from commit eaab813a0fddced24303b3bd50e4fcdba1516e46)	2022-02-23 18:33:46 +00:00
Kurt Mohler	8e7fe87630	Rename `Typed/UntypedStorage` to `_Typed/_UntypedStorage` (#72540 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72540 Reviewed By: jbschlosser Differential Revision: D34216823 Pulled By: bdhirsh fbshipit-source-id: 1bc9930ab582771ebf02308e035576cd1a0dbe47 (cherry picked from commit `329238f612`)	2022-02-15 23:53:01 +00:00
Alban Desmaison	a7cac05ca6	Add new tls snapshot feature (#72832 ) Summary: Reland of https://github.com/pytorch/pytorch/pull/72623 that was reverted for the tls cleanup was removed. From close inspection on the counting of the number of available keys, I think there is one more since the guard is actually one after the last usable key. With this update assert, the last updated key will still be <=63 which will fit just fine. Pull Request resolved: https://github.com/pytorch/pytorch/pull/72832 Reviewed By: H-Huang Differential Revision: D34228571 Pulled By: albanD fbshipit-source-id: ce5e10a841ea87386727346cfc8d9327252574c4 (cherry picked from commit `59d3b86353`)	2022-02-15 19:02:05 +00:00
Can Balioglu	6942fccf60	Skip superfluous storage allocations while constructing meta tensors (#65331 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65331 ghstack-source-id: 148862595 This is a performance optimization for the use case: ``` tensor = torch.tensor(<large_data>, device='meta') ``` where the current implementation requires a superfluous memory allocation on CPU even though the target device is a meta. Test Plan: Run existing tests since no behavioral change is introduced. Reviewed By: ezyang Differential Revision: D31055036 fbshipit-source-id: 04d6c13594a71fc65bf2fbd567ee71833a879851 (cherry picked from commit `489d0a151a`)	2022-02-11 12:55:11 +00:00
Brian Muse	8bf3179f6e	#71946 Remove Python 3.6 references (#72211 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/71946 This commit removes some bits of code that were hard coded for Python 3.6 support from the `.circleci` and `torch` folders. It should only be merged if https://github.com/pytorch/pytorch/issues/66462 is complete. Pull Request resolved: https://github.com/pytorch/pytorch/pull/72211 Reviewed By: dagitses, seemethere Differential Revision: D33982604 Pulled By: musebc fbshipit-source-id: 8f453bf9909df615addd59538adb369c65484044 (cherry picked from commit `944a9970fe`)	2022-02-08 03:46:20 +00:00
Yukio Siraichi	1fdbe9aa76	Make `asarray` behavior consistent with Python Array API. (#71757 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/70591 This PR makes `torch.asarray` consistent with [the Python Array API](https://data-apis.org/array-api/latest/API_specification/generated/signatures.creation_functions.asarray.html#signatures.creation_functions.asarray) (which also happens to be the same as `torch.as_tensor` behavior). Specifically, it makes `asarray` casting conditional to the presence of the `dtype` argument. This solves the issue when Python scalars (and lists) were passed as input without specifying the `dtype`. Before: ```python >>> torch.asarray([True, False]) tensor([1., 0.]) ``` After: ```python >>> torch.asarray([True, False]) tensor([True, False]) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/71757 Reviewed By: mrshenli Differential Revision: D33774995 Pulled By: anjali411 fbshipit-source-id: 9f293401f993dca4046ceb61f714773ed4cf7c46 (cherry picked from commit `0c6f98ebe7`)	2022-02-02 15:57:31 +00:00
Peter Bell	40d1f77384	Codegen: python_torch_functions only include relevant operators (#68693 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68693 Generation of python bindings for native functions is split over 8 different files. One for each namespace, with the torch namespace split into 3 shards, and methods in their own file as well. This change ensures that editing any single (non-method) operator only causes one of these files to be rebuilt. Test Plan: Imported from OSS Reviewed By: jbschlosser Differential Revision: D32596270 Pulled By: albanD fbshipit-source-id: 0570ec69e7476b8f1bc21138ba18fe8f95ebbe3f (cherry picked from commit `ba0fc71a3a`)	2022-01-21 15:37:06 +00:00
Can Balioglu	80b19c4c8c	Enable Python bindings for UntypedStorage (#68945 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68945 This PR enables the Python conversion functions for `Storage` (specifically `UntypedStorage`) and also cleans up some remnants of the deprecated typed storages from `DynamicTypes.cpp`. ghstack-source-id: 147245110 Test Plan: Run the existing unit and integration tests. Reviewed By: albanD Differential Revision: D32676505 fbshipit-source-id: 3a3f6db4fb0da5c78dd406c96ab70bdc37015521 (cherry picked from commit `d6427b94cf`)	2022-01-20 02:11:34 +00:00
Andrey Talman	efd274bbcb	Fix for windows builds with python 3.10 , getting rid of ssize_t (ssize_t is not a C++ defined type) (#71390 ) Summary: Fix for windows builds with python 3.10 , getting rid of ssize_t Here is the completed bin build : https://app.circleci.com/pipelines/github/pytorch/pytorch/441527/workflows/144edb79-b398-4d70-92fe-b63158c1b439/jobs/16954881 Pull Request resolved: https://github.com/pytorch/pytorch/pull/71390 Reviewed By: samdow Differential Revision: D33637686 Pulled By: atalman fbshipit-source-id: fcdfca672dc20385a3d2339c20e69bd2d1717e88 (cherry picked from commit `2ac58b0dc1`)	2022-01-18 22:12:41 +00:00
Nikita Shulga	356af8f857	Do not use `ssize_t` in `python_arg_parser.[cpp\|h]` (#71250 ) Summary: Use `Py_ssize_t` when calling Python API Use `c10::irange` to automatically infer loop type Use `size_t` or `unsigned` for unsigned type Partially addresses https://github.com/pytorch/pytorch/issues/69948 Pull Request resolved: https://github.com/pytorch/pytorch/pull/71250 Reviewed By: atalman Differential Revision: D33569724 Pulled By: malfet fbshipit-source-id: c9eb75be9859d586c00db2f824c68840488a2822	2022-01-13 19:10:30 -08:00
Scott Wolchok	ddea6980fe	[PyTorch][JIT] Don't refcount Type singletons (#69579 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69579 This should help us avoid reference counting overhead on singleton Type subclasses without a major rewrite of the Type subsystem. ghstack-source-id: 146643993 Test Plan: Ran //caffe2/caffe2/fb/high_perf_models/pytorch/benchmark_framework_overheads:cpp_benchmark with arguments `--op empty -niter 40 --stressTestRecordFunction --captureRecordFunctionInputs` on devbig with turbo off. Before: ``` I1206 13:47:15.037441 1201670 bench.cpp:144] Mean 0.737675 I1206 13:47:15.037463 1201670 bench.cpp:145] Median 0.736725 I1206 13:47:15.037468 1201670 bench.cpp:146] Min 0.722897 I1206 13:47:15.037473 1201670 bench.cpp:147] stddev 0.00508187 I1206 13:47:15.037482 1201670 bench.cpp:148] stddev / mean 0.00688903 ``` After: ``` I1206 13:48:16.830123 1205612 bench.cpp:144] Mean 0.66988 I1206 13:48:16.830150 1205612 bench.cpp:145] Median 0.663956 I1206 13:48:16.830157 1205612 bench.cpp:146] Min 0.65986 I1206 13:48:16.830164 1205612 bench.cpp:147] stddev 0.0335928 I1206 13:48:16.830171 1205612 bench.cpp:148] stddev / mean 0.0501475 ``` Static runtime startup is also improved; for CMF local_ro, time to initialize a predictor went from 10.01s to 9.59s. (Note: I wish I had a production workload to demonstrate the advantage of this on. I tried ctr_mobile_feed local_ro net but it was neutral. Anything that manipulates types or List/Dict a lot might be promising.) Reviewed By: suo Differential Revision: D32923880 fbshipit-source-id: c82ed6689b3598e61047fbcb2149982173127ff0	2022-01-06 17:39:16 -08:00
Amir Khojaste	748790588c	Upgrading the loop to use irange (#70326 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/70326 See D24145988 for context: it allows loops such as for(int i=0;i<10;i++) to be expressed as for(const auto i : c10::irange(10)). This is nice because it auto-types the loops and adds const-safety to the iteration variable. Test Plan: buck run //caffe2/torch/fb/sparsenn:test Reviewed By: r-barnes Differential Revision: D33243400 fbshipit-source-id: b1f1b4163f4bf662031baea9e5268459b40c69a3	2022-01-06 07:06:53 -08:00
Peter Bell	fa09099ba3	Codegen: TraceType only includes operators being registered (#68691 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68691 TraceType is a sharded file, so by only including specific operator headers, we ensure that changing one (non-method) operator only needs one shard to be re-compiled. This also changes all the included autograd and jit headers from including `ATen/ATen.h` to just including `ATen/core/Tensor.h`. Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D33336948 Pulled By: albanD fbshipit-source-id: 4e40371592b9a5a7e7fcd1d8cecae11ffb873113	2022-01-02 13:09:19 -08:00
Nikita Shulga	26e32988bd	Revert D32596264: Codegen: TraceType only includes operators being registered Test Plan: revert-hammer Differential Revision: D32596264 (`e66a8ab4f5`) Original commit changeset: 2f28b62d7b99 Original Phabricator Diff: D32596264 (`e66a8ab4f5`) fbshipit-source-id: 7d18c4e77ce30dd7817a95f9c39b565cb246cd12	2021-12-17 11:20:12 -08:00
Peter Bell	e66a8ab4f5	Codegen: TraceType only includes operators being registered (#68691 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68691 TraceType is a sharded file, so by only including specific operator headers, we ensure that changing one (non-method) operator only needs one shard to be re-compiled. This also changes all the included autograd and jit headers from including `ATen/ATen.h` to just including `ATen/core/Tensor.h`. Test Plan: Imported from OSS Reviewed By: jbschlosser, malfet Differential Revision: D32596264 Pulled By: albanD fbshipit-source-id: 2f28b62d7b9932f30fad7daacd8ac5bb7f63c621	2021-12-17 10:35:05 -08:00
Peter Bell	b08d64202a	Remove THGeneral (#69041 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69041 `TH_CONCAT_{N}` is still being used by THP so I've moved that into it's own header but all the compiled code is gone. Test Plan: Imported from OSS Reviewed By: anjali411 Differential Revision: D32872477 Pulled By: ngimel fbshipit-source-id: 06c82d8f96dbcee0715be407c61dfc7d7e8be47a	2021-12-13 16:14:28 -08:00
Peter Bell	b2e79ed5ec	Remove WindowsTorchApiMacro.h in favor of Export.h (#69585 ) Summary: Follow up to https://github.com/pytorch/pytorch/issues/68095 This also changes the files from the ATen folder to include c10's `Export.h` instead since they can't ever be exporting `TORCH_PYTHON_API`. cc pietern mrshenli pritamdamania87 zhaojuanmao satgera rohan-varma gqchen aazzolini osalpekar jiayisuse SciPioneer H-Huang Pull Request resolved: https://github.com/pytorch/pytorch/pull/69585 Reviewed By: mrshenli Differential Revision: D32958594 Pulled By: albanD fbshipit-source-id: 1ec7ef63764573fa2b486928955e3a1172150061	2021-12-09 17:30:09 -08:00
Nadav Elyahu	a20b9f8d5c	add HPU case for backend_to_string function (#69225 ) Summary: Change-Id: If8ed7f1161343a2e494d8b964576e1ee193007f7 Fixes https://github.com/pytorch/pytorch/issues/65609 Pull Request resolved: https://github.com/pytorch/pytorch/pull/69225 Reviewed By: gchanan Differential Revision: D32804545 Pulled By: wconstab fbshipit-source-id: bdf359bd779113153ebdecc515edba94e47e0ae4	2021-12-03 12:54:15 -08:00
Nick Anderson	f9ea41f257	Fixes spelling error writeable to writable, improves warning, and documentation (#67664 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/46741 pytorchbot contributors: nickleus27, yanivsagy, and khanhthien123 SmrutiSikha this is mostly your work. We just did very minor clean up. cc mruberry Pull Request resolved: https://github.com/pytorch/pytorch/pull/67664 Reviewed By: gchanan Differential Revision: D32311838 Pulled By: mruberry fbshipit-source-id: 0e5d4d888caeccb0fd7c80e6ff11b1b1fa8e00d6	2021-11-11 13:05:00 -08:00
Eddie Yan	af1bd88fc4	Allow scalars for aliased binary ops {`multiply`, `subtract`, `divide`} (#65937 ) Summary: https://github.com/pytorch/pytorch/issues/65868 pointed out that the "long-form" versions of some binary ops like `mul`, `sub`, and `div` don't match their alias's behavior when it comes to handling scalar inputs. This PR adds the missing registration in `python_arg_parser.cpp` to resolve this. CC ptrblck ngimel Pull Request resolved: https://github.com/pytorch/pytorch/pull/65937 Reviewed By: malfet Differential Revision: D32156580 Pulled By: ngimel fbshipit-source-id: b143cf7119a8bb51609e1b8734204edb750f0210	2021-11-04 09:36:45 -07:00
Yukio Siraichi	8854817f44	Implement Python Array API `asarray` function. (#60627 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60627 In this PR, the core of `frombuffer` and `fromDLPack` onto _tensor_new.cpp_. `asarray` uses such refactored functions for interpreting the object as a tensor. We follow the Python Array API standard found: https://data-apis.org/array-api/latest/API_specification/creation_functions.html?highlight=asarray Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D31640510 Pulled By: mruberry fbshipit-source-id: d0869e0d73cb50023d5866b001dac5d34ca30dfd	2021-10-16 21:11:31 -07:00
anjali411	a82fcd3560	Disable .numpy() and .tolist() for tensor subclasses subclasses and fix .tolist() for conjugated and negated tensors (#66082 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66082 Fixes https://github.com/pytorch/pytorch/issues/66024 #65779 cc ezyang anjali411 dylanbespalko mruberry Lezcano nikitaved albanD Test Plan: Imported from OSS Reviewed By: Gamrix, albanD Differential Revision: D31615588 Pulled By: anjali411 fbshipit-source-id: c3e65ef0fe301630eb76732ccd7819683c09aa19	2021-10-13 13:57:51 -07:00
Nikita Shulga	e7b5712c21	Call `PyArray_Check` only if NumPy is available (#66433 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/66353 Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/66433 Reviewed By: seemethere, janeyx99 Differential Revision: D31548290 Pulled By: malfet fbshipit-source-id: 3b094bc8195d0392338e0bdc6df2f39587b85bb3	2021-10-11 19:25:31 -07:00
Shijun Kong	e2be087207	[oss][pytorch] Add quint2x4 dtype (#65545 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65545 Introduce 2bit qtensor. The new dtype added for this is c10::quint2x4 The underlying storage for this is still uint8_t, so we pack 4 2-bit values in a byte while quantizing it. Kernels that use this dtype should be aware of the packing format. (4 2-bit values in one byte) Test Plan: `buck test mode/dev-asan caffe2/test/:quantization -- test_qtensor` Reviewed By: supriyar Differential Revision: D31148141 fbshipit-source-id: 1dc1de719e097adaf93fee47c6d1b8010a3eae6c	2021-10-06 14:22:00 -07:00
Bert Maher	931352c68d	Make handle_torch_function_no_python_arg_parser public (#66054 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66054 I need this function in functorch to support the ability of custom jitted kernels to invoke torch_function when applicable. Test Plan: functorch unit tests Reviewed By: qihqi, ngimel Differential Revision: D31416599 Pulled By: bertmaher fbshipit-source-id: 90b57badd6a6b9d505ebfc436869b962b55c66d7	2021-10-06 00:27:10 -07:00
Kurt Mohler	5883523c1d	Remove dtype from torch.Storage and use only torch.ByteStorage (#62030 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62030 Remove dtype tracking from Python Storage interface, remove all the different `<type>Storage` classes except for `ByteStorage`, and update serialization accordingly, while maintaining as much FC/BC as possible Fixes https://github.com/pytorch/pytorch/issues/47442 * THE SERIALIZATION FORMAT IS FULLY FC/BC. We worked very hard to make sure this is the case. We will probably want to break FC at some point to make the serialization structure of tensors make more sense, but not today. * There is now only a single torch.ByteStorage class. Methods like `Tensor.set_` no longer check that the dtype of storage is appropriate. * As we no longer know what dtype of a storage is, we've removed the size method from Storage, replacing it with nbytes. This is to help catch otherwise silent errors where you confuse number of elements with number of bytes. * `Storage._new_shared` takes a `nbytes` kwarg and will reject previous positional only calls. `Storage._new_with_file` and `_set_from_file` require explicit element size arguments. * It's no longer possible to convert storages to different types using the float/double/etc methods. Instead, do the conversion using a tensor. * It's no longer possible to allocate a typed storage directly using FloatStorage/DoubleStorage/etc constructors. Instead, construct a tensor and extract its storage. The classes still exist but they are used purely for unpickling. * The preexisting serialization format stores dtype with storage, and in fact this dtype is used to determine the dtype of the tensor overall. To accommodate this case, we introduce a new TypedStorage concept that exists only during unpickling time which is used to temporarily store the dtype so we can construct a tensor. If you overrode the handling of pickling/unpickling, you MUST add handling for TypedStorage or your serialization code will degrade to standard file-based serialization. Original pull request: https://github.com/pytorch/pytorch/pull/59671 Reviewed By: soulitzer, ngimel Differential Revision: D29466819 Pulled By: ezyang fbshipit-source-id: 4a14e5d3c2b08e06e558683d97f7378a3180b00e	2021-10-05 13:50:34 -07:00
Nikita Shulga	4c4525fa5c	Compile without -Wno-unused-variable (take 2) (#66041 ) Summary: Delete `-Wno-unused-variable` from top level `CMakeLists.txt` Still suppress those warnings for tests and `torch_python` Delete number of unused variables from caffe2 code Use `(void)var;` to suppress unused variable in range loops Use `C10_UNUSED` for global constructors and use `constexpr` instead of `static` for global constants Do not delete `caffe2::OperatorBase::Output` calls as they have side effects Pull Request resolved: https://github.com/pytorch/pytorch/pull/66041 Reviewed By: ngimel Differential Revision: D31360142 Pulled By: malfet fbshipit-source-id: 6fdfb9f91efdc49ca984a2f2a17ee377d28210c8	2021-10-04 20:39:39 -07:00
Nikita Shulga	e4ee5ca698	Revert D31326599: [pytorch][PR] Compile without -Wno-unused-variable Test Plan: revert-hammer Differential Revision: D31326599 (`a6280ab653`) Original commit changeset: 924155f1257a fbshipit-source-id: b8ee5bc0298637443232f5ee9ec79e51ed256faf	2021-10-01 20:40:47 -07:00
Nikita Shulga	a6280ab653	Compile without -Wno-unused-variable (#65954 ) Summary: Delete `-Wno-unused-variable` from top level `CMakeLists.txt` Still suppress those warnings for tests and `torch_python` Delete number of unused variables from caffe2 code Use `(void)var;` to suppress unused variable in range loops Use `C10_UNUSED` for global constructors and use `constexpr` instead of `static` for global constants Pull Request resolved: https://github.com/pytorch/pytorch/pull/65954 Reviewed By: ngimel Differential Revision: D31326599 Pulled By: malfet fbshipit-source-id: 924155f1257a2ba1896c50512f615e45ca1f61f3	2021-10-01 17:40:47 -07:00
Richard Barnes	2670cacfc2	LLVM-12 fix for tensor_new.cpp (#65785 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/65785 Fixes offset to nullptr at fbcode/caffe2/torch/csrc/utils/tensor_new.cpp:206 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D31250995 fbshipit-source-id: 56c7761787e732180a2537a8aa4346a39e7399a8	2021-09-29 09:35:18 -07:00
Sujoy Saraswati	fea32be964	Add HPU type for check_base_legacy_new (#65410 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/65410 Reviewed By: H-Huang Differential Revision: D31143754 Pulled By: malfet fbshipit-source-id: 32abfbae4f7c09924c7dfa16758d64a2215ec636	2021-09-27 13:13:34 -07:00
Nikita Shulga	c731be8066	[BE] Use `DispatchKeySet` in `check_base_legacy_new` (#65535 ) Summary: Refactor: ``` TORCH_CHECK ( key == a \|\| key == b \|\| key == c, "expected key to be in ", a, " or ", b , " or ", c, " but got ", key); ``` into ``` TORCH_CHECK( key_set.has(key), "expected key to be in ", key_set, " but got ", key ); ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/65535 Reviewed By: wconstab Differential Revision: D31144239 Pulled By: malfet fbshipit-source-id: 68a053041a38f043e688e491889dd7ee258f3db3	2021-09-23 11:01:23 -07:00
Brian Hirsh	bcc6e3ab5e	add python API to print all operators that have kernels registered to a particular DispatchKey (#63575 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63575 Test Plan: Imported from OSS Reviewed By: ezyang, Chillee Differential Revision: D30426919 Pulled By: bdhirsh fbshipit-source-id: b0e487e48dfe02f7b9d678403f0a2b5bfe146f4e	2021-09-22 09:15:55 -07:00
Richard Zou	67bd2a31b5	[Reland] Add python mode (#64360 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64360 This PR adds a (private) enable_python_mode context manager. (see torch/utils/_python_dispatch.py). enable_python_mode accepts the type of a __torch_dispatch__ object as its argument. Whenever an operator gets called inside of the context manager, it dispatches to the __torch_dispatch__ of the passed-in type. Example usage: ``` with enable_python_mode(LoggingTensor): z = torch.empty([]) assert isinstance(z, LoggingTensor) ``` There are quite a few changes that were made to support this. First, we added TorchDispatchTypeObject, a C++ struct that represents the type of a `__torch_dispatch__` object (e.g. LoggingTensor). It holds both the PyObject* representing the class and a PyInterpreter* so we know which Python interpreter it came from. Next, we updated the concrete_dispatch_fn in python_variable.cpp to accept a `const std::shared_ptr<TorchDispatchTypeObject>&` argument. When this is null, dispatching happens as usual. When it is non-null, we prepend the TorchDispatchTypeObject's PyObject* to the overloaded args list so that it is considered first for dispatch. To get that to work, we changed how `handle_torch_dispatch_no_python_arg_parser` works. The "overloaded args list" previously only consisted of Tensor PyObjects, but now it can have types in addition to Tensors! - We renamed `append_overloaded_arg` to `append_overloaded_arg` - We added a new `append_overloaded_type` that appends a type to overloaded_args - We added special handling in `handle_torch_dispatch_no_python_arg_parser` and `append_overloaded_arg` to handle types in addition to Tensors. Then, there is PythonMode and PythonModeTLS. - We reuse the DispatchKey::Python dispatch key as a mode key - We use PythonMode::enter and PythonMode::exit to enable/disable DispatchKey::Python and set the PythonModeTLS. - PythonModeTLS stores a TorchDispatchTypeObject as metadata. - PythonMode is in libtorch_python, and PythonModeTLS is in ATen. This split is due to the libtorch_python library boundary (because we need to save TLS in ATen/ThreadLocalState) - We modify the PythonFallbackKernel to look up the relevant TorchDispatchTypeObject (if Python Mode is active) and dispatch using it. There are two more miscellaneous changes: - internal_new_from_data (torch/csrc/utils/tensor_new.cpp) gets an exclude guard. enable_python_mode currently does not handle torch.tensor and the exclude guard is to prevent a bug. Future: - This PR does not allow for the nesting of Python modes. In the future we should be able to enable this with a more sane no_dispatch API and by changing the TLS to a stack. For now I did not need this for CompositeImplicitAutograd testing. Test Plan: - new tests Reviewed By: ezyang Differential Revision: D30698082 Pulled By: zou3519 fbshipit-source-id: 7094a90eee6aa51f8b71bc4d91cfb6f49e9691f8	2021-09-16 09:02:30 -07:00
Richard Zou	0457a85d45	Revert D30543236: Add python mode Test Plan: revert-hammer Differential Revision: D30543236 (`4bd03b0242`) Original commit changeset: ef5444d96a5a fbshipit-source-id: b0042ac2c22765fa11d6d00bf751f6a4489eb6d8	2021-08-31 15:28:33 -07:00
Richard Zou	4bd03b0242	Add python mode (#63496 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63496 This PR adds a (private) enable_python_mode context manager. (see torch/utils/_python_dispatch.py). enable_python_mode accepts the type of a __torch_dispatch__ object as its argument. Whenever an operator gets called inside of the context manager, it dispatches to the __torch_dispatch__ of the passed-in type. Example usage: ``` with enable_python_mode(LoggingTensor): z = torch.empty([]) assert isinstance(z, LoggingTensor) ``` There are quite a few changes that were made to support this. First, we added TorchDispatchTypeObject, a C++ struct that represents the type of a `__torch_dispatch__` object (e.g. LoggingTensor). It holds both the PyObject* representing the class and a PyInterpreter* so we know which Python interpreter it came from. Next, we updated the concrete_dispatch_fn in python_variable.cpp to accept a `const std::shared_ptr<TorchDispatchTypeObject>&` argument. When this is null, dispatching happens as usual. When it is non-null, we prepend the TorchDispatchTypeObject's PyObject* to the overloaded args list so that it is considered first for dispatch. To get that to work, we changed how `handle_torch_dispatch_no_python_arg_parser` works. The "overloaded args list" previously only consisted of Tensor PyObjects, but now it can have types in addition to Tensors! - We renamed `append_overloaded_arg` to `append_overloaded_arg` - We added a new `append_overloaded_type` that appends a type to overloaded_args - We added special handling in `handle_torch_dispatch_no_python_arg_parser` and `append_overloaded_arg` to handle types in addition to Tensors. Then, there is PythonMode and PythonModeTLS. - We reuse the DispatchKey::Python dispatch key as a mode key - We use PythonMode::enter and PythonMode::exit to enable/disable DispatchKey::Python and set the PythonModeTLS. - PythonModeTLS stores a TorchDispatchTypeObject as metadata. - PythonMode is in libtorch_python, and PythonModeTLS is in ATen. This split is due to the libtorch_python library boundary (because we need to save TLS in ATen/ThreadLocalState) - We modify the PythonFallbackKernel to look up the relevant TorchDispatchTypeObject (if Python Mode is active) and dispatch using it. There are two more miscellaneous changes: - internal_new_from_data (torch/csrc/utils/tensor_new.cpp) gets an exclude guard. enable_python_mode currently does not handle torch.tensor and the exclude guard is to prevent a bug. Future: - This PR does not allow for the nesting of Python modes. In the future we should be able to enable this with a more sane no_dispatch API and by changing the TLS to a stack. For now I did not need this for CompositeImplicitAutograd testing. Test Plan: - new tests Reviewed By: malfet, albanD Differential Revision: D30543236 Pulled By: zou3519 fbshipit-source-id: ef5444d96a5a957d1657b7e37dce80f9a497d452	2021-08-30 18:44:35 -07:00
driazati	bd8608cd5c	Use CMake for breakpad (#63186 ) Summary: We currently build breakpad from [this fork](https://github.com/driazati/breakpad) to include extra logic to restore signal handlers that were previously present. With some [new additions](https://github.com/google/breakpad/compare/main...driazati:main) this fork now includes a CMake based build, so we can add breakpad as a proper dependency rather than rely on including it in Docker images as a system library which is error prone (we have a bunch of images) and hard to extend to MacOS / Windows. This also includes some changes to the crash handling code to support MacOS / Windows in a similar way to Linux. ```python import torch # On Windows this writes crashes to C:\Users\<user>\AppData\pytorch_crashes # On MacOS/Linux this writes crashes to /tmp/pytorch_crashes torch.utils._crash_handler.enable_minidumps() # Easy way to cause a segfault and trigger the handler torch.bincount(input=torch.tensor([9223372036854775807])) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/63186 Reviewed By: malfet, seemethere Differential Revision: D30318404 Pulled By: driazati fbshipit-source-id: 0d7daf3701cfaba5451cc529a0730272ab1eb1dc	2021-08-19 10:42:01 -07:00
Edward Yang	c508433617	Implement subclass priority for __torch_dispatch__ (#63411 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63411 In order to get this behavior, you have to use append_overloaded, which I forgot to use in the previous implementation. I exposed an internal helper function which is more appropriate for dispatch to Python where we know that an argument is definitely a Tensor (and this test no longer needs to be done). Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D30374489 Pulled By: ezyang fbshipit-source-id: 43b08c00d1958c9b26d82a025d19f0b67bb85590	2021-08-18 07:49:03 -07:00
Richard Zou	26d2f4acb2	Quick fix to make torch.tensor work with functorch (#62423 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62423 Fixes https://github.com/facebookresearch/functorch/issues/7. functorch uses FuncTorchDynamicLayerBackMode as a mode key to wrap all tensors returned from operators in special TensorWrapper tensor extension. The problem with this is that TensorWrapper does not have storage so accessing the data_ptr (for recursive_store) internal asserts. As a quick hack, the guard added prevents functorch from wrapping the empty tensor in a TensorWrapper and instead when `tensor.to` is called later, the tensor gets wrapped. This is effectively what Ed proposed in https://github.com/facebookresearch/functorch/issues/7#issuecomment-847501020 In the long term we probably want some better way of extending `internal_new_from_data` for cases like this (where there is a mode-based dispatch key for a C++ tensor extension -- the Python case may be different). Test Plan: - Verified that this fixes functorch's problem Reviewed By: malfet Differential Revision: D29992607 Pulled By: zou3519 fbshipit-source-id: 82b713156a37d7470f8fc46e3803ee7353689a33	2021-07-30 10:15:23 -07:00
Alex Suhan	b176feec1e	Add device and key for lazy tensors (#61621 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61621 Test Plan: CI Reviewed By: mruberry Differential Revision: D29912934 Pulled By: asuhan fbshipit-source-id: 493c32063a3e756d93cbf1d876563a35eaafb537	2021-07-26 23:00:22 -07:00
anjali411	143ef016ee	Throw RuntimeError when numpy() is called on a tensor with conjugate or negative bit set (#61925 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61925 Resolves https://github.com/pytorch/pytorch/issues/59945 and https://github.com/pytorch/pytorch/issues/59946 bc breaking note: Unlike before, complex_tensor.conj().numpy(), complex_float_tensor.conj().view(torch.float64), complex_float_tensor.conj().imag.view(torch.int32) now doesn't return a view but instead errors out Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D29819288 Pulled By: anjali411 fbshipit-source-id: 4bebec721eb535f44ef4b728bdc75fa444e05d16	2021-07-23 11:28:36 -07:00
Nikita Shulga	a9b0a921d5	Disable `avoid-non-const-global-variables` lint check (#62008 ) Summary: As GoogleTest `TEST` macro is non-compliant with it as well as `DEFINE_DISPATCH` All changes but the ones to `.clang-tidy` are generated using following script: ``` for i in `find . -type f -iname ".c" -or -iname "*.h"\|xargs grep cppcoreguidelines-avoid-non-const-global-variables\|cut -f1 -d:\|sort\|uniq`; do sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" $i; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008 Reviewed By: driazati, r-barnes Differential Revision: D29838584 Pulled By: malfet fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13	2021-07-22 18:04:40 -07:00
Kurt Mohler	5a00152a3d	Warn about poor performance creating Tensor from list of numpy.array's (#51680 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/13918 Pull Request resolved: https://github.com/pytorch/pytorch/pull/51680 Reviewed By: saketh-are Differential Revision: D29847229 Pulled By: ezyang fbshipit-source-id: 0519aad27f9ca1d8c06be5b9e6de382374d8b72b	2021-07-22 12:02:50 -07:00
Richard Barnes	b145889192	Modernize use make_unique (#61739 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61739 Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D29717133 fbshipit-source-id: 70e3d81a48f7ae90cca3ef3c9587174ca15d81f4	2021-07-21 15:28:26 -07:00
Richard Barnes	349f2f767c	Modernize to default constructor and nullptr in torch (#61735 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61735 Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D29716659 fbshipit-source-id: ec2a0a0b7e55d2e50b1d35f0b651bd40675ae7e8	2021-07-16 10:51:13 -07:00
Mike Guo	6ecc1a4c4f	Make pytorch clang-tidy clean (#60649 ) Summary: This PR suppresses clang-tidy warnings in the codebase (for now) so that we can re-enable clang-tidy checks on master. I ran this script to add the `NOLINTNEXTLINE` comments (on a devserver): ```bash python3 setup.py develop # Uses same script that's run on CI and adds the -j (parallel), -s (add comments), -k (continue if diagnostic errors are found) options python3 tools/clang_tidy.py \ -j \ -s \ -k \ -v \ --paths torch/csrc/ \ -g"-torch/csrc/jit/passes/onnx/helper.cpp" \ -g"-torch/csrc/jit/passes/onnx/shape_type_inference.cpp" \ -g"-torch/csrc/jit/serialization/onnx.cpp" \ -g"-torch/csrc/jit/serialization/export.cpp" \ -g"-torch/csrc/jit/serialization/import.cpp" \ -g"-torch/csrc/jit/serialization/import_legacy.cpp" \ -g"-torch/csrc/onnx/init.cpp" \ -g"-torch/csrc/cuda/nccl." \ -g"-torch/csrc/cuda/python_nccl.cpp" \ -g"-torch/csrc/autograd/FunctionsManual.cpp" \ -g"-torch/csrc/generic/.cpp" \ -g"-torch/csrc/jit/codegen/cuda/runtime/*" \ -g"-torch/csrc/deploy/interpreter/interpreter.cpp" \ -g"-torch/csrc/deploy/interpreter/interpreter.h" \ -g"-torch/csrc/deploy/interpreter/interpreter_impl.h" \ -g"-torch/csrc/deploy/interpreter/test_main.cpp" ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/60649 Test Plan: Verified changes by re-running the script (without the `-s` option) and seeing no warnings/errors. Reviewed By: walterddr, janeyx99 Differential Revision: D29504258 Pulled By: 1ntEgr8 fbshipit-source-id: 78310b30ee8213b73ddb4771ad874665323e7a4e	2021-07-01 12:21:07 -07:00
Jiewen Tan	d5be67a338	Expose findDanglingImpls to Python (#60827 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60827 This diff exposed Dispatcher.findDanglingImpls to Python as _C._dispatch_find_dangling_impls. ghstack-source-id: 132799970 Test Plan: buck test mode/dev //caffe2/test:others -- test_find_dangling_impls Reviewed By: ezyang Differential Revision: D29416330 fbshipit-source-id: d2f26054b6e247be1bb9e818eaa7cb9e68a4a913	2021-06-30 12:31:19 -07:00
Lily Johnson	eb2f535689	c10::Storage python to cpp converter and typecast (#59734 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59734 Adds typecast logic to allow for c10::Storages to cross the Python/C++ barrier with pyBind Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D29075279 Pulled By: Lilyjjo fbshipit-source-id: 3e67b8525d308c5bccc64438ebac82b4d17ba462	2021-06-29 14:16:52 -07:00
Edward Yang	aacc722aec	Dispatch to Python via __torch_dispatch__ (#59760 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59760 See https://github.com/pytorch/pytorch/issues/59049 There are some moving parts to this PR, I'll structure this explanation so the straightforward parts go first, and then the less straightforward parts. The actual dispatch to Python. The core logic of dispatch to Python lives in `concrete_dispatch_fn` in `torch/csrc/autograd/python_variable.cpp`. It takes the input IValue stack, scans all the arguments for Tensor arguments, and defers most of the heavy lifting to `handle_torch_function_no_python_arg_parser` which actually does all of the logic for calling out to torch dispatch (in particular, this function handles multiple dispatch situations for you). Because we have a different function name than regular `__torch_function__` handling, `handle_torch_function_no_python_arg_parser` is generalized to accept a magic method name to look for when testing if Tensors have custom handling or not. Unlike `__torch_function__`, by default there is no `__torch_dispatch__` on Tensor classes. Maintaining the Python dispatch key. In order to get to the dispatch to Python logic, we must tag Tensors with the `__torch_dispatch__` magic method with the newly added Python dispatch key (separated from PythonFuncTorch to allow for a transitional period while they migrate to this mechanism). We expose a new private property `_is_python_dispatch` that assists in debugging if a Tensor is participating in Python dispatch or not. We apply the Python dispatch key the first time a PyObject for a Tensor is constructed (THPVariable_NewWithVar), testing if `__torch_dispatch__` exists with then newly added `check_has_torch_dispatch`. Shallow copy and detach. For the simple examples tested in this PR, most creations of Tensor route through the dispatcher. The exception to this is `shallow_copy_and_detach`, which bypasses the dispatcher and is used when saving tensors for backwards. When a Tensor is Python dispatch, we override the behavior of `shallow_copy_and_detach` to instead directly call into `__torch_dispatch__` to perform a `detach` operation (in the same way it would be invoked if you called `detach` directly). Because this Python call is triggered directly from c10::TensorImpl, it must be indirected through `PyInterpreter::detach`, which is the general mechanism for dynamic dispatching to the Python interpreter associated with a TensorImpl. torchdeploy compatibility. The dispatch to Python logic cannot be directly registered to the dispatcher as it is compiled in the Python library, which will get loaded multiple times per torchdeploy interpreter. Thus, we must employ a two phase process. First, we register a fallback inside a non-Python library (aten/src/ATen/core/PythonFallbackKernel.cpp). Its job is to determine the appropriate PyInterpreter to handle the Python dispatch by going through all of the arguments and finding the first argument that has a PyObject/PyInterpreter. With this PyInterpreter, it makes another dynamic dispatch via "dispatch" which will go to the correct torchdeploy interpreter to handle dispatching to actual Python. Testing. We provide a simple example of a LoggingTensor for testing, which can be used to generate TorchScript-like traces to observe what operations are being called when a Tensor is invoked. Although a LoggingTensor would be better implemented via an is-a relationship rather than a has-a relationship (as is done in the test), we've done it this way to show that arbitrarily complex compositions of tensors inside a tensor work properly. Known limitations. * We haven't adjusted any operator code, so some patterns may not work (as they lose the Python subclass in an unrecoverable way) * `__torch_function__` must be explicitly disabled with `_disabled_torch_function_impl` otherwise things don't work quite correctly (in particular, what is being disabled is default subclass preservation behavior.) * We don't ever populate kwargs, even when an argument is kwarg-only Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D29017912 D29017912 Test Plan: Imported from OSS Reviewed By: bdhirsh Pulled By: ezyang fbshipit-source-id: a67714d9e541d09203a8cfc85345b8967db86238	2021-06-25 11:50:32 -07:00
Richard Barnes	b162d95e46	Fix a number of lint perf and safety issues in torch (#59897 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59897 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D29037012 fbshipit-source-id: 7c16286d5fc2b67964fb65f8374dfff4d1a7aefb	2021-06-15 13:14:51 -07:00
Edward Yang	d60d81b5a7	Make PyObject_FastGetAttrString accept const char* (#59758 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59758 The underlying call to tp_getattr is const safe but CPython has not fixed it due to BC problems. No reason not to advertise the better type here though! Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D29017911 Pulled By: ezyang fbshipit-source-id: 8d55983fe6416c03eb69c6367bcc431c30000133	2021-06-14 07:24:16 -07:00
Richard Barnes	e3d75b8475	irange for PyTorch sans jit (#59481 ) Summary: Switches most of the simple for loops outside of `jit` directories to use `c10::irange`. Generated with D28874212. Pull Request resolved: https://github.com/pytorch/pytorch/pull/59481 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D28909681 fbshipit-source-id: ec9ab1bd602933238d9d0f73d4d8d027b75d9d85	2021-06-09 14:46:11 -07:00
Richard Barnes	3979cb0656	irange for size_t (#55320 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55320 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D27572577 fbshipit-source-id: 97710fd2bb1303006b05828a0d1343b0b59ccb03	2021-06-03 01:04:13 -07:00
Richard Barnes	2ce23136d0	Use irange in torch/csrc utils (#55556 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55556 Test Plan: Sandcastle Reviewed By: ezyang Differential Revision: D27625936 fbshipit-source-id: 79065438f582a6f5fe6f1f796b6984767605197e	2021-06-02 15:47:00 -07:00
driazati	059a717c9e	Fix breakpad build and add to more images (#59236 ) Summary: This PR * adds the breakpad build to most of the remaining docker images (except the mobile + slim ones) * pins to a [fork of breakpad](https://github.com/google/breakpad/compare/master...driazati:master?expand=1) to enable dasiy chaining on signal handlers * renames the API to be nicer Pull Request resolved: https://github.com/pytorch/pytorch/pull/59236 Reviewed By: malfet Differential Revision: D28792511 Pulled By: driazati fbshipit-source-id: 83723e74b7f0a00e1695210ac2620a0c91ab4bf2	2021-06-01 22:47:14 -07:00
Joel Schlosser	ef32a29c97	Back out "[pytorch][PR] ENH Adds dtype to nn.functional.one_hot" (#59080 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59080 Original commit changeset: 3686579517cc Test Plan: None; reverting diff Reviewed By: albanD Differential Revision: D28746799 fbshipit-source-id: 75a7885ab0bf3abadde9a42b56d479f71f57c89c	2021-05-27 15:40:52 -07:00
Alexander	b435a27fb7	CUDA support in the CSR layout: constructors (#59010 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/59010 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D28719287 Pulled By: bhosmer fbshipit-source-id: fbb5784ccb5ce19dcca1f2f95c4ee16f9b7680c4	2021-05-26 16:39:43 -07:00
Edward Yang	17fb651a3b	Make torch.Tensor(torch.tensor(1.0)) work (#58885 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58885 Fixes #58884 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D28687510 Pulled By: ezyang fbshipit-source-id: 81325f501cc3e83cbac02f7c44ded9d396356bb8	2021-05-26 11:33:05 -07:00
Alban Desmaison	032d6b0643	Revert D28112689: CUDA support in the CSR layout: constructors Test Plan: revert-hammer Differential Revision: D28112689 (`1416e57465`) Original commit changeset: f825cd4bce40 fbshipit-source-id: 421fc590797ac5fab6a55ac6f213361fbba7cd5b	2021-05-26 06:15:05 -07:00
Alexander	1416e57465	CUDA support in the CSR layout: constructors (#57274 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57274 Test Plan: Imported from OSS Reviewed By: astaff Differential Revision: D28112689 Pulled By: bhosmer fbshipit-source-id: f825cd4bce402dd4c3f71db88854f77830b687b8	2021-05-26 01:36:20 -07:00
Thomas J. Fan	a7f4f80903	ENH Adds dtype to nn.functional.one_hot (#58090 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/33046 Related to https://github.com/pytorch/pytorch/issues/53785 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58090 Reviewed By: zou3519 Differential Revision: D28640893 Pulled By: jbschlosser fbshipit-source-id: 3686579517ccc75beaa74f0f6d167f5e40a83fd2	2021-05-24 13:48:25 -07:00
Kurt Mohler	fe8e5eb260	Change native functions to take `c10::string_view` args instead of `std::string` (#57680 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/53546 Pull Request resolved: https://github.com/pytorch/pytorch/pull/57680 Reviewed By: malfet Differential Revision: D28511799 Pulled By: ezyang fbshipit-source-id: 43142f994d048b28b3279ccdb7a28cbaa3190973	2021-05-20 18:15:45 -07:00
Nikita Shulga	abb215e229	Fix dtype inference in sparse_csr_tensor_ctor (#58631 ) Summary: `NULL` return from `PyObject_GetAttrString` should never get ignored without handling the exception, as behavior of subsequent Python C API calls are undefined until `PyErr_Fetch` or `PyErr_Clear` is called. This accidentally leads to `list` type being incorrectly identified as `Tensor` Fixes https://github.com/pytorch/pytorch/issues/58520 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58631 Reviewed By: albanD Differential Revision: D28559454 Pulled By: malfet fbshipit-source-id: 46f044b5f0f94264779a6108474d04a8ba851c53	2021-05-20 08:02:05 -07:00
Horace He	5fa4541c65	Make new_ones an operator (#58405 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/58394 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58405 Reviewed By: HDCharles Differential Revision: D28480075 Pulled By: Chillee fbshipit-source-id: bd29399867e2a002a2f395554621761d3c701f68	2021-05-17 19:24:34 -07:00
Gregory Chanan	e8574b84bf	Fix legacy tensor constructor/new matching incorrect signature with d… (#58108 ) Summary: …evice. Previously, it was possible for torch.Tensor(tensor, device) or Tensor.new(tensor, device) to map to IntArrayRef or PyObject. PyObject was not a problem because that would error out later. But IntArrayRef would create an uninitialized tensor, which is confusing. Fixes https://github.com/pytorch/pytorch/issues/47112 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58108 Reviewed By: agolynski, mruberry Differential Revision: D28372426 Pulled By: gchanan fbshipit-source-id: 795ab4f0561939d002a661c5cc14c6cdb579f31a	2021-05-13 08:11:08 -07:00
Eddie Yan	645a5f706a	move `flatten_dense_tensors` and `unflatten_dense_tensors` to Native (#58006 ) Summary: https://github.com/pytorch/pytorch/issues/55240 CC ngimel Pull Request resolved: https://github.com/pytorch/pytorch/pull/58006 Reviewed By: agolynski Differential Revision: D28386749 Pulled By: ngimel fbshipit-source-id: 4860c35d5ff95bcc38a243d7001180e7bd536314	2021-05-12 18:18:34 -07:00
Nikita Shulga	3a66a1cb99	[clang-tidy] Exclude cppcoreguidelines-avoid-magic-numbers (#57841 ) Summary: Add cppcoreguidelines-avoid-magic-numbers exclusion to clang-tidy Remove existing nolint warnings using following script: ``` for file in `git ls-files \| grep -v \.py`; do gsed '/^ *\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-magic-numbers)/d' -i $file; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/57841 Reviewed By: samestep Differential Revision: D28295045 Pulled By: malfet fbshipit-source-id: 7c6e8d1213c9593f169ed3df6a916498f1a97163	2021-05-07 20:02:33 -07:00
Ailing Zhang	0ecdbfebff	s/InplaceOrView/ADInplaceOrView/g (#57372 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57372 Pull Request resolved: https://github.com/pytorch/pytorch/pull/57324 Test Plan: Imported from OSS Reviewed By: ZolotukhinM Differential Revision: D28121821 Pulled By: ailzhang fbshipit-source-id: f568dd2505f6279da9ffb93ce1d22e0f98c606bb	2021-05-01 22:56:18 -07:00
Luca Wehrstedt	58bc003487	Add pybind type caster for c10::Device (#57292 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57292 In Future (and soon in other places too) we need to receive a list of devices from Python-land. We don't want to just take their indices because we need full devices in order to infer the type from them. torch.device is not defined through pybind, it's defined through a plain `PyModule_AddObject` call with CPython, thus pybind isn't naturally able to understand and convert it. However we can provide a custom type caster which fixes that. We have this already for at::Tensor, at::Generator, ... ghstack-source-id: 127916268 Test Plan: CI Reviewed By: mrshenli Differential Revision: D28092732 fbshipit-source-id: 1c31d0b85a4d5c9e7bde8161efbb7574d505157c	2021-05-01 16:11:10 -07:00
Nikita Shulga	eac02f85cf	Fix more clang-tidy errors (#57235 ) Summary: In my last PR I've missed CUDA and distributed folders, fixing this now This change is autogenerated by `python tool/clang_tidy.py -s` Pull Request resolved: https://github.com/pytorch/pytorch/pull/57235 Reviewed By: janeyx99 Differential Revision: D28084444 Pulled By: malfet fbshipit-source-id: bf222f69ee90c7872c3cb0931e8cdb84f0cb3cda	2021-04-28 23:29:10 -07:00
Nikita Shulga	4cb534f92e	Make PyTorch code-base clang-tidy compliant (#56892 ) Summary: This is an automatic change generated by the following script: ``` #!/usr/bin/env python3 from subprocess import check_output, check_call import os def get_compiled_files_list(): import json with open("build/compile_commands.json") as f: data = json.load(f) files = [os.path.relpath(node['file']) for node in data] for idx, fname in enumerate(files): if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'): files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')] return files def run_clang_tidy(fname): check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"]) changes = check_output(["git", "ls-files", "-m"]) if len(changes) == 0: return check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"]) def main(): git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n") compiled_files = get_compiled_files_list() for idx, fname in enumerate(git_files): if fname not in compiled_files: continue if fname.startswith("caffe2/contrib/aten/"): continue print(f"[{idx}/{len(git_files)}] Processing {fname}") run_clang_tidy(fname) if __name__ == "__main__": main() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892 Reviewed By: H-Huang Differential Revision: D27991944 Pulled By: malfet fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179	2021-04-28 14:10:25 -07:00
Edward Yang	ec0fa40f0f	Release GIL before destructing RPCAgent subclasses. (#57029 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57029 Partially addresses https://github.com/pytorch/pytorch/issues/56297 This fixes deadlocks when the threads the RPCAgent are blocking on try to take the GIL. This also adds a general utility for making shared_ptr run destructors without GIL. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D28030294 Pulled By: ezyang fbshipit-source-id: 628c066eebbb70bda5b914645a109dce35d73c8d	2021-04-28 10:25:03 -07:00
Alexander	ecaa208fd6	Fix: sparse_csr_tensor segfaults when crow_indices or col_indices are non-tensors (#56723 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56723 WIP gh-56687 Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D27999919 Pulled By: ezyang fbshipit-source-id: 7eb23c8f45f3c459efe65793caecaa6b67a187c9	2021-04-27 14:47:12 -07:00
Ailing Zhang	be7a943bb8	s/AutoDispatchBelowAutograd/AutoDispatchBelowInplaceOrView. (#56657 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56657 Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D27931526 Pulled By: ailzhang fbshipit-source-id: 3af718df3435e2b0b30bc62070dbdc5aeeecdfb4	2021-04-23 15:50:00 -07:00
davidriazati@fb.com	7fff71eb9a	Fix warnings in tensor_flatten.cpp (#55956 ) Summary: Switch to use `TensorOptions` instead of deprecated `.type()` to fix compiler warnings as part of #55952 ](https://our.intern.facebook.com/intern/diff/27830504/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/55956 Pulled By: driazati Reviewed By: pritamdamania87 Differential Revision: D27830504 fbshipit-source-id: f705818ddb7d8b17c0f5383f22dc431203a194d9	2021-04-20 17:22:05 -07:00
Ailing Zhang	3d904b56ec	s/AutoNonVariableTypeMode/AutoDispatchBelowAutograd/ (#56423 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56423 Test Plan: Imported from OSS Reviewed By: bertmaher Differential Revision: D27866606 Pulled By: ailzhang fbshipit-source-id: e3942356dc3133d1c5722de40ec0d45e6a60f2f1	2021-04-20 17:17:46 -07:00
davidriazati@fb.com	638617f9f8	Write mini dump on pybind exceptions (#55652 ) Summary: We register an [error handler](https://pybind11.readthedocs.io/en/stable/advanced/exceptions.html#registering-custom-translators) with pybind so that C++ exceptions are passed to Python and raised as runtime errors that can be `try...except`ed etc. Since these don't terminate the program (until Python does), they never fire the signal handler to write a minidump out with the crash information. This PR adds some logic in the exception translator to write out a minidump if enabled. ](https://our.intern.facebook.com/intern/diff/27830952/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/55652 Pulled By: driazati Reviewed By: bertmaher Differential Revision: D27830952 fbshipit-source-id: 26e8f913e99dff971a4eb09eb87221c66f759763	2021-04-19 14:53:43 -07:00
David Riazati	1ec12fd491	Add minidump collection via breakpad (#55647 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55647 This adds [breakpad](https://github.com/google/breakpad) which comes with out-of-the-box utilities to register a signal handler that writes out a minidump on an unhandled exception. Right now this is gated behind a flag in `torch.utils`, but in the future it could be on by default. Sizewise this adds aboute 500k to `libtorch_cpu.so` (187275968 B to 187810016 B). ```bash $ cat <<EOF > test.py import torch torch.utils.enable_minidump_collection() # temporary util that just segfaults torch._C._crash() EOF $ python test.py Wrote minidump to /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp fish: “python test.py” terminated by signal SIGSEGV (Address boundary error) $ minidump-2-core /tmp/pytorch_crashes/6a829041-50e9-4247-ea992f99-a74cf47a.dmp -o core.dmp $ gdb python core.dmp ... commence debugging ... ``` Right now all exceptions that get passed up to Python don't trigger the signal handler (which by default only handles [these](https://github.com/google/breakpad/blob/main/src/client/linux/handler/exception_handler.cc#L115)). It would be possible for PyTorch exceptions to explicitly write a minidump when passed up to Python (maybe only when the exception is unhandled or something). Test Plan: Imported from OSS Reviewed By: ailzhang Differential Revision: D27679767 Pulled By: driazati fbshipit-source-id: 1ab3b5160b6dc405f5097eb25acc644d533358d7	2021-04-16 13:05:01 -07:00
Heitor Schueroff	33159b68a3	Revert "Deprecate legacy constructor `torch.Tensor()` (#54414 )" (#55831 ) Summary: This PR reverts https://github.com/pytorch/pytorch/pull/54414 because of https://github.com/pytorch/pytorch/issues/55780 cc ysiraichi Pull Request resolved: https://github.com/pytorch/pytorch/pull/55831 Reviewed By: agolynski Differential Revision: D27762264 Pulled By: heitorschueroff fbshipit-source-id: 8079a660cc440cafb9d22aa031d36dde121e13b3	2021-04-15 14:06:10 -07:00
Edward Yang	6ec71ed4f9	Replace all direct cdata access with THPVariable_Unpack (#55799 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55799 I'm going to change the implementation of cdata soon so I need to abstract over cdata access with a function. Additionally, many users are casting manually casting to THPVariable to access the member so I can remove these unsafe casts in the client code (the implementation, of course, is still doing an unsafe cast.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D27712130 Pulled By: ezyang fbshipit-source-id: 95fcc013bf3913d67f2c634068eb5b3aab144cb3	2021-04-15 08:57:04 -07:00
Edward Yang	82a7fff3cd	Modify a few APIs to take/return const Tensor& instead of Tensor& (#55797 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55797 In all of these cases, the inside of the function didn't make use of the fact that the tensor was a mutable reference Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: bdhirsh Differential Revision: D27712132 Pulled By: ezyang fbshipit-source-id: 99e0bb1d783f63d2d42ab53d3d406b2064405ef4	2021-04-15 08:57:00 -07:00
Sameer Deshmukh	5fb1142702	Add CSR (compressed sparse row) layout for sparse tensors (#50937 ) Summary: Implement compressed sparse row format. Derived from the GCS implementation at https://github.com/pytorch/pytorch/pull/44190 Pull Request resolved: https://github.com/pytorch/pytorch/pull/50937 Reviewed By: mrshenli Differential Revision: D27439865 Pulled By: ezyang fbshipit-source-id: 3ba3dcb9679505b980ff6a5f513e913bbae2fb1d	2021-04-12 10:09:12 -07:00
Scott Wolchok	fa19b6dd4d	[PyTorch] New expand_inplace API with MaybeOwned<Tensor> and no unary tuples (#55065 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55065 expand_inplace may give you the same Tensor(s) back, and it unnecessarily wrapped single-Tensor results in a tuple. Further diffs will deprecate and replace the rest of the similar APIs in ExpandUtils. ghstack-source-id: 126170049 Test Plan: beyonce_test Reviewed By: ezyang Differential Revision: D27469297 fbshipit-source-id: 56cf14bc5603355f399fef2e5b02b97afa504428	2021-04-09 22:13:21 -07:00
Nikita Shulga	6a39613f35	[BE] Make torch/csrc/jit/tensorexpr/ clang-tidy clean (#55628 ) Summary: Mostly auto-generated changes using ``` python3 tools/clang_tidy.py -c build -x torch/csrc/jit/tensorexpr/eval.cpp -s ``` With following common patterns manually fixed - Use ` = default` instead of `{}` - deleted methods should be public - Use pass-by-value + std::move instead of pass-by-reference+copy Pull Request resolved: https://github.com/pytorch/pytorch/pull/55628 Reviewed By: walterddr Differential Revision: D27655378 Pulled By: malfet fbshipit-source-id: 92be87a08113435d820711103ea9b0364182c71a	2021-04-08 19:44:14 -07:00
Richard Barnes	d690973295	irange on int64_t (#55148 ) Summary: Converts loops of the form: ``` for(int64_t VAR=0;VAR<LIMIT;VAR++) ``` to the form ``` for(const auto VAR : c10::irange(LIMIT)) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/55148 Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D27447811 fbshipit-source-id: 6311a094ec4a81a0b57383aaee0ba1b1dc2445c4	2021-04-05 16:14:00 -07:00
Mike Ruberry	c0ac0fef4e	Revert D27448156: irange for size_t Test Plan: revert-hammer Differential Revision: D27448156 (`041b4431b2`) Original commit changeset: 585da57d4de9 fbshipit-source-id: 8e047c29f391c0166e0a1a87c3fb2a0854377365	2021-04-03 19:14:00 -07:00

... 3 4 5 6 7 ...

987 Commits