pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Horace He	b8bf604459	Ported linear to symints (#86021 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/86021 Approved by: https://github.com/ezyang	2022-10-02 20:46:20 +00:00
Edward Z. Yang	3638089755	Ported reshape to symints and added a shim for BC (#85998 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85998 Approved by: https://github.com/ezyang	2022-10-02 17:46:00 +00:00
Edward Z. Yang	ada6e5b53a	Implement duck shaping on SymInts (#85808 ) Duck shaping says that when two input tensors have the same size, we assume they are symbolically related. This follows the same optimization done by inductor. This optimization is not done completely because we don't currently install guards corresponding to the duck shape relationships we created, but overall the guard propagation for dynamic shape tracing is incomplete at the moment. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85808 Approved by: https://github.com/albanD	2022-09-29 00:45:53 +00:00
Horace He	39130ccf73	Registered _like metas (#85793 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85793 Approved by: https://github.com/ezyang	2022-09-28 17:23:07 +00:00
PyTorch MergeBot	b44a4a8b51	Revert "Registered _like metas (#85793 )" This reverts commit `a4e75ccf85`. Reverted https://github.com/pytorch/pytorch/pull/85793 on behalf of https://github.com/huydhn due to Sorry, reverting as this breaks an aot_autograd mac test on functorch. https://github.com/pytorch/pytorch/pull/85794 was reverted before but it was at the top of the stack so the revert still fail `823dc33b00`	2022-09-28 17:18:29 +00:00
Horace He	a4e75ccf85	Registered _like metas (#85793 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85793 Approved by: https://github.com/ezyang	2022-09-28 14:07:57 +00:00
Edward Z. Yang	793488cda2	Revert "Revert "Symintifying slice ops (#85196 )"" (#85746 ) This reverts commit `3a171dfb0c`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85746 Approved by: https://github.com/albanD	2022-09-28 04:37:35 +00:00
PyTorch MergeBot	3a171dfb0c	Revert "Symintifying slice ops (#85196 )" This reverts commit `4c01c51266`. Reverted https://github.com/pytorch/pytorch/pull/85196 on behalf of https://github.com/atalman due to Break internal build Exutorch	2022-09-27 18:01:27 +00:00
George Qi	686555b663	[maskedtensor] port torch/_masked into torch/masked (#85515 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85515 Approved by: https://github.com/cpuhrsch	2022-09-26 23:41:13 +00:00
Edward Z. Yang	4c01c51266	Symintifying slice ops (#85196 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85196 Approved by: https://github.com/ezyang	2022-09-23 22:01:32 +00:00
Edward Z. Yang	604487f239	OpInfo for Slice (#85554 ) This is based on wconstab tests from #84680 Technically, slice is covered by the __getitem__ opinfo, but it is easier to debug/test on a more narrow internal function that only uses this functionality and not other advanced indexing stuff. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85554 Approved by: https://github.com/mruberry, https://github.com/wconstab	2022-09-23 22:01:32 +00:00
Edward Z. Yang	9baf6770bc	Apply new symbolic shape strategy to make_fx symbolic mode (#85260 ) This results in some test wobbling, which looks legit. I also added some debug helpers for stuff that I found useful while working on this. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85260 Approved by: https://github.com/albanD	2022-09-22 14:12:28 +00:00
Edward Z. Yang	61b4e8a7bf	More SymFloat support (#85411 ) - Support storing SymFloat in IValue - Add SymFloat to JIT type system (erases to float) - Printing support for SymFloat - add/sub/mul/truediv operator support for SymFloat - Support truediv on integers, it returns a SymFloat - Support parsing SymFloat from Python object Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85411 Approved by: https://github.com/albanD	2022-09-22 08:07:22 +00:00
kshitij12345	56a41b5998	[composite compliance] ctc_loss (#84752 ) #Ref #69991 I have mixed feelings about adding new (private) operators. Backends writers will have to override them as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84752 Approved by: https://github.com/zou3519	2022-09-22 00:21:11 +00:00
Thomas Viehmann	764cba6848	add Python ref for isreal (#85361 ) Dipping my toes into prims waters Pull Request resolved: https://github.com/pytorch/pytorch/pull/85361 Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry	2022-09-21 18:53:34 +00:00
Edward Z. Yang	3eb27229dd	as_strided symbolic support (#85264 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39662820](https://our.internmc.facebook.com/intern/diff/D39662820) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85264 Approved by: https://github.com/wconstab	2022-09-21 13:34:55 +00:00
Horace He	2f4a517d67	Ported matmul compositeimplicitautograd impl into core (#85239 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85239 Approved by: https://github.com/ezyang, https://github.com/lezcano	2022-09-21 09:25:24 +00:00
Will Constable	52fd7e491b	Update torch.ops.aten.all ref to be symbolic-trace friendly (#85352 ) - previous impl compared the summed bool values of the tensor to its nelem, which in a symbolic world is a symint and can't be coerced back into a bool for the purpose of shoving into a result tensor - new impl adds one extra negation op but avoids the need to compare to the symbolic nelem Pull Request resolved: https://github.com/pytorch/pytorch/pull/85352 Approved by: https://github.com/ezyang, https://github.com/mruberry	2022-09-21 00:06:54 +00:00
lezcano	d17b144e65	Adding multigammaln ref and fix arange (#85153 ) Partially based on https://github.com/pytorch/pytorch/pull/83662. I'll help land this one, as Rob does not work in the PyTorch project anymore I removed the data-dependent check for the args, as data dependencies are bad for many reasons (and it was failing when the input has NaNs). It also registers arange as a decomposition, and fixes the naming of its args. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85153 Approved by: https://github.com/mruberry, https://github.com/ngimel	2022-09-20 17:52:56 +00:00
Horace He	12a19a4846	Made tracing of proxy symints lazy (#85185 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85185 Approved by: https://github.com/ezyang	2022-09-17 20:36:35 +00:00
lezcano	98b8ef99e1	Add refs for sinc and sgn (#85142 ) This PR superseded https://github.com/pytorch/pytorch/pull/80171 This does not add the ref for `special.sinc` as I was getting some errors. This should be added to https://github.com/pytorch/pytorch/pull/84957 (cc @nkaretnikov) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85142 Approved by: https://github.com/ngimel, https://github.com/mruberry	2022-09-17 06:09:13 +00:00
Horace He	377b5d6f8b	Added additional simplifications/caching for replacements and divisibility (#84918 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84918 Approved by: https://github.com/ezyang	2022-09-17 01:33:48 +00:00
Natalia Gimelshein	6162a04364	fix half_to_float arg in *softmax decomp (#85120 ) Fixes https://github.com/pytorch/torchdynamo/issues/1239 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85120 Approved by: https://github.com/Chillee	2022-09-16 15:54:50 +00:00
Horace He	4bdc0af53d	Added support for symbolic is_contiguous (#84829 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84829 Approved by: https://github.com/ezyang	2022-09-16 04:54:01 +00:00
Edward Z. Yang	00ce302c07	Performance optimizations to proxy tensor (#85049 ) - Lazily allocate FX nodes for size/stride accessors on proxy tensor - Properly track derived computations on strides/numel/etc - Remove unnecessary tree_map at end of proxy tensor trace checking invariants; we will just have to be smart (it's too expensive) - Avoid tree_map in sym proxy tracing Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/85049 Approved by: https://github.com/wconstab	2022-09-16 00:28:50 +00:00
Michael Voznesensky	8ca1839d32	Python Dispatcher integration with C++ dispatcher (#85050 ) #84826 but without ghstack Pull Request resolved: https://github.com/pytorch/pytorch/pull/85050 Approved by: https://github.com/malfet	2022-09-15 00:43:36 +00:00
Edward Z. Yang	ccade9410f	Don't detach when making views; force caller to detach (#84893 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84893 Approved by: https://github.com/soulitzer, https://github.com/SherlockNoMad	2022-09-14 22:32:45 +00:00
PyTorch MergeBot	706b990306	Revert "Python Dispatcher integration with C++ dispatcher (#84826 )" This reverts commit `35f6a69191`. Reverted https://github.com/pytorch/pytorch/pull/84826 on behalf of https://github.com/malfet due to Broke dynamo, see `35f6a69191`	2022-09-14 14:07:58 +00:00
Michael Voznesensky	35f6a69191	Python Dispatcher integration with C++ dispatcher (#84826 ) Signed-off-by: Edward Z. Yang <ezyangfb.com> From @ezyang's original PR: There are a number of situations where we have non-backend kernels (e.g., CompositeImplicitAutograd, batching rules) which we would like to port to Python, but we have no way to integrate these ports with the overall system while using preexisting C++ registrations otherwise. This PR changes that by introducing a Python dispatcher (which can have its own kernels directly in Python), which can be interpose over ordinary C++ dispatch. The ingredients: We introduce a new PythonDispatcher dispatch key, that has the same tenor as FuncTorchDynamicLayerFrontMode: it works by getting triggered before every other dispatch key in the dispatch key, and shunting to a Python implementation The Python dispatcher is a per-interpreter global object that is enabled/disabled via the guard EnablePythonDispatcher/DisablePythonDispatcher. We don't make it compositional as I have no idea what a compositional version of this feature would look like. Because it is global, we don't need to memory manage it and so I use a simpler SafePyHandle (newly added) to control access to this pointer from non-Python C++. Like __torch_dispatch__, we use PyInterpreter to get to the Python interpreter to handle the dispatch. I need to reimplement dispatch table computation logic in Python. To do this, I expose a lot more helper functions for doing computations on alias dispatch keys and similar. I also improve the pybind11 handling for DispatchKey so that you can either accept the pybind11 bound enum or a string; this simplifies our binding code. See https://github.com/pybind/pybind11/issues/483#issuecomment-1237418106 for how this works; the technique is generally useful. I need to be able to call backend fallbacks. I do this by permitting you to call at a dispatch key which doesn't have a kernel for the operator; if the kernel doesn't exist, we check the backend fallback table instead. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84826 Approved by: https://github.com/ezyang	2022-09-14 06:57:19 +00:00
PyTorch MergeBot	8ca057eb71	Revert "Don't detach when making views; force caller to detach (#84893 )" This reverts commit `3bb8d6a93c`. Reverted https://github.com/pytorch/pytorch/pull/84893 on behalf of https://github.com/malfet due to Broke MPS, see `3bb8d6a93c`	2022-09-14 01:09:04 +00:00
Edward Z. Yang	3bb8d6a93c	Don't detach when making views; force caller to detach (#84893 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84893 Approved by: https://github.com/soulitzer, https://github.com/SherlockNoMad	2022-09-13 23:31:21 +00:00
kshitij12345	4f6027b78a	[opinfo] narrow: add new sample for Tensor overload (#84785 ) `narrow` accepts `start` argument to be a Tensor. We add a sample to test this overload. NOTE: This leads to a bunch of failed tests and hence the skips and xfails Pull Request resolved: https://github.com/pytorch/pytorch/pull/84785 Approved by: https://github.com/zou3519	2022-09-12 16:59:08 +00:00
Edward Z. Yang	c5a8946e40	Revert "Revert "Redo how custom/python_custom methods on TensorImpl work (#84796 )" (#84806 ) This reverts commit `ca3b2bfbe3`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84806 Approved by: https://github.com/Chillee	2022-09-10 06:17:35 +00:00
Eli Uriegas	ca3b2bfbe3	Revert "Redo how custom/python_custom methods on TensorImpl work (#84796 ) This reverts commit `591b75bf98`. Manual revert of https://github.com/pytorch/pytorch/pull/84641 Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/84796 Approved by: https://github.com/izaitsevfb	2022-09-10 00:18:13 +00:00
Ivan Yashchuk	01c54ad6de	Remove deprecated torch.eig (#70982 ) The time has come to remove deprecated linear algebra related functions. This PR removes `torch.eig`. cc @jianyuh @nikitaved @pearu @mruberry @walterddr @IvanYashchuk @xwang233 @Lezcano Pull Request resolved: https://github.com/pytorch/pytorch/pull/70982 Approved by: https://github.com/Lezcano, https://github.com/malfet	2022-09-09 21:31:57 +00:00
Edward Z. Yang	591b75bf98	Redo how custom/python_custom methods on TensorImpl work (#84641 ) A longstanding confusion in the implementation of fake tensor and proxy tensor is what to do about torch.ops.aten.sym_sizes and related calls. In particular, when you have a tensor that (1) has symbolic shapes and (2) has a `__torch_dispatch__` call, previously, you would always get `__torch_dispatch__` calls for sizes/strides query, even if you didn't request it via the dispatch kwargs in `make_wrapper_subclass`. The reason for this is because we were previously mixing several concepts: "I want to dispatch to Python", "I want to call a virtual method" and "I have dynamic shapes". A single boolean variable controlled all of these things, and so it was not possible to understand inside TensorImpl what the user had actually originally requested. In this PR, we track each of these concepts individually so that we can preserve user intent. Then, we combine these into a single "policy" variable that controls whether or not we can use the fastpath or not. For the policy to trigger, we only need one of the exceptional cases to be true. Billing of changes: * Rename `set_sizes_strides_policy` to `set_custom_sizes_strides`; in general, you cannot DIRECTLY set policy; you have to indirectly set it by the public functions. * Some helpers for sizes and strides, since it's more complicated (as it is an enum, rather than just bools as is the case for device and layout). `matches_python_custom` is used to test the Python dispatch user ask. `matches_policy` does the policy test (only used in the user facing functions.) * I reorged the accessor methods so that they are more logical. This makes the diff bad, so I recommend reading the final code directly. * The default custom implementations now more reliably call their default() implementations * As bonus refactor, I devirtualized some functions that don't need to be virtual * `set_sym_sizes_and_strides` is renamed to `set_sizes_and_strides` to make it easier to use in template contexts; it optionally takes a storage offset now so you can set all three values at the same time. If you use the SymInt overload but there are no symbolic integers, we give you a normal resize. * This adds `sym_storage_offset` since we had that in the symbolic shapes branch and there's no reason not to put it in (and it reduces merge conflicts) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84641 Approved by: https://github.com/wconstab	2022-09-09 13:41:13 +00:00
Horace He	1459a909b4	Added mv, mm, and binary_cross_entropy_with_logits decomps (#84451 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84451 Approved by: https://github.com/ngimel	2022-09-08 17:56:18 +00:00
Richard Zou	a47bc96fb7	[composite compliance] fix linalg.eigvals (#84137 ) linalg.eigvals fails in some cases with functorch and the root of the problem is that it is not composite compliant. In particular, checks that branch on whether or not a Tensor requires grad do not work with functorch. In order to support functorch with them, we have to include an additional "if the tensor is a Tensor Subclass, then assume that it MAY require grad, so we must always go through the differentiable path". This PR also changes the batching rule for linalg.eigvals to be a decomposition instead of what it was previously. What it was previously was masking the error in functorch's test suite. Unfortunately we don't comprehensive tests for this on the functorch side which is why this was not caught before. I'll look into why that is in the future; it's a bit complicated. Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/84137 Approved by: https://github.com/Lezcano, https://github.com/IvanYashchuk, https://github.com/samdow	2022-09-07 14:29:21 +00:00
Richard Zou	c771d73461	[composite compliance] fix max_pool1d (#84127 ) max_pool1d has a fast path for CPU tensors that do not require grad that directly accesses the data_ptr. This PR makes the change that if the input Tensor is a Tensor Subclass, then we want to walk through the "slow path" of calling max_pool1d_with_indices. Test Plan: - wait for tests Pull Request resolved: https://github.com/pytorch/pytorch/pull/84127 Approved by: https://github.com/kshitij12345, https://github.com/samdow, https://github.com/malfet	2022-09-06 17:13:09 +00:00
Edward Z. Yang	2a332afbf4	Add SymFloat, support SymInt to SymFloat conversion (#84284 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84284 Approved by: https://github.com/albanD	2022-09-03 01:30:32 +00:00
Edward Z. Yang	d39490a711	Add meta function for repeat (#84349 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84349 Approved by: https://github.com/Krovatkin	2022-09-01 20:44:52 +00:00
Edward Z. Yang	cd96f3f676	Use register_meta for everything in meta_registrations (#84297 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84297 Approved by: https://github.com/Chillee	2022-08-31 23:58:24 +00:00
PyTorch MergeBot	65f98eb47d	Revert "Add meta function for repeat (#84349 )" This reverts commit `44bc6db8f8`. Reverted https://github.com/pytorch/pytorch/pull/84349 on behalf of https://github.com/janeyx99 due to Land race with the revert causing test_fx failures `44bc6db8f8`	2022-08-31 18:27:59 +00:00
Horace He	85931eaa6b	Rename fake_result to val (#84331 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84331 Approved by: https://github.com/ezyang	2022-08-31 17:44:18 +00:00
Edward Z. Yang	44bc6db8f8	Add meta function for repeat (#84349 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84349 Approved by: https://github.com/Krovatkin	2022-08-31 17:20:21 +00:00
PyTorch MergeBot	14093b5979	Revert "Use register_meta for everything in meta_registrations (#84297 )" This reverts commit `8cd296f680`. Reverted https://github.com/pytorch/pytorch/pull/84297 on behalf of https://github.com/suo due to broke test_proxy_tensor on master	2022-08-31 16:32:24 +00:00
Horace He	a27a4a02fe	Refactored proxytensor to clean up separate branches (#84325 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84325 Approved by: https://github.com/ezyang	2022-08-31 09:37:55 +00:00
Horace He	6a3ecda5a2	Started storing faketensor/symbolic shape metadata on FX nodes in make_fx (#84114 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84114 Approved by: https://github.com/SherlockNoMad	2022-08-31 04:39:48 +00:00
Edward Z. Yang	8cd296f680	Use register_meta for everything in meta_registrations (#84297 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84297 Approved by: https://github.com/Chillee	2022-08-31 02:13:21 +00:00
Nikolay Korovaiko	eda217ab67	Reland symint_numel (#84281 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/84281 Approved by: https://github.com/ezyang	2022-08-30 21:53:34 +00:00
Nikolay Korovaiko	44a975335e	Revert "Re-land sym_numel (#82374 ) (#82726 ) (#82731 ) (#82855 )" (#84207 ) This reverts commit `bfebf254dd`. Differential Revision: [D39104562](https://our.internmc.facebook.com/intern/diff/D39104562) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84207 Approved by: https://github.com/robieta	2022-08-30 13:22:58 +00:00
Edward Z. Yang	ad44670fa1	Back out "Revert D38984222: Don't introduce new overload for SymInt (#83628 )" (#84173 ) Also Back out "Revert D39075159: [acc_tensor] Use SymIntArrayRef for overloaded empty.memory_format's signature" Original commit changeset: dab4a9dba4fa Original commit changeset: dcaf16c037a9 Original Phabricator Diff: D38984222 Original Phabricator Diff: D39075159 Also update Metal registrations for C++ registration changes. Also update NNPI registration to account for tightened schema checking Differential Revision: [D39084762](https://our.internmc.facebook.com/intern/diff/D39084762/) NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D39084762/)! Pull Request resolved: https://github.com/pytorch/pytorch/pull/84173 Approved by: https://github.com/Krovatkin	2022-08-29 18:01:07 +00:00
PyTorch MergeBot	c7edcd6968	Revert "Don't introduce new overload for SymInt (#83628 )" This reverts commit `9790d90e4b`. Reverted https://github.com/pytorch/pytorch/pull/83628 on behalf of https://github.com/malfet due to Breaks internal builds, see D39076487	2022-08-27 01:23:17 +00:00
Edward Z. Yang	9790d90e4b	Don't introduce new overload for SymInt (#83628 ) Previously, we introduced new SymInt overloads for every function we wanted. This led to a lot of boilerplate, and also a lot of confusion about how the overloads needed to be implemented. This PR takes a simpler but more risky approach: just take the original function and changes its ints to SymInts. This is BC-breaking in the following ways: * The C++ API for registering implementations for aten operators will change from int64_t to SymInt whenever you make this change. Code generated registrations in PyTorch do not change as codegen handles the translation automatically, but manual registrations will need to follow the change. Typically, if you now accept a SymInt where you previously only took int64_t, you have to convert it back manually. This will definitely break XLA, see companion PR https://github.com/pytorch/xla/pull/3914 Note that not all dispatch keys get the automatic translation; all the composite keys and Meta keys are modified to take SymInt directly (because they should handle them directly), and so there are adjustments for this. This is not BC-breaking in the following ways: * The user facing C++ API remains compatible. Even if a function changes from int to SymInt, the default C++ binding still takes only ints. (e.g., at::empty(IntArrayRef, ...). To call with SymInts, you must call at::empty_symint instead. This involved adding two more signatures to CppSignatureGroup; in many cases I refactored code to iterate over all signatures in the group instead of hard-coding the two that previously existed. * This is TorchScript compatible; internally we treat SymInts as ints so there is no change to what happens at runtime in TorchScript. In particular, it's OK to reference an empty schema by its old type (using int types), as long as you're not doing string equality (which you shouldn't be), these parse to the same underyling type. Structure of the PR: * The general strategy of this PR is that, even when you write `SymInt` inside `native_functions.yaml`, sometimes, we will treat it as if it were an `int`. This idea pervades the codegen changes, where we have a translation from SymInt to c10::SymInt or int64_t, and this is controlled by a symint kwarg which I added and then audited all call sites to decide which I wanted. Here are some of the major places where we pick one or the other: * The C++ FunctionSchema representation represents `SymInt` as `int`. There are a few places we do need to know that we actually have a SymInt and we consult `real_type()` to get the real type in this case. In particular: * When we do schema validation of C++ operator registration, we must compare against true schema (as the C++ API will provide `c10::SymInt`, and this will only be accepted if the schema is `SymInt`. This is handled with cloneWithRealTypes before we check for schema differences. * In `toIValue` argument parsing, we parse against the true schema value. For backwards compatibility reasons, I do still accept ints in many places where Layout/SymInt/etc were expected. (Well, accepting int where SymInt is expected is not BC, it's just the right logic!) * In particular, because SymInt never shows up as type() in FunctionSchema, this means that we no longer need a dedicated Tag::SymInt. This is good, because SymInts never show up in mobile anyway. * Changes to functorch/aten are mostly about tracking changes to the C++ API registration convention. Additionally, since SymInt overloads no longer exist, registrations for SymInt implementations are deleted. In many cases, the old implementations did not properly support SymInts; I did not add any new functionality with this PR, but I did try to annotate with TODOs where this is work to do. Finally, because the signature of `native::` API changed from int to SymInt, I need to find alternative APIs for people who were directly calling these functions to call. Typically, I insert a new dispatch call when perf doesn't matter, or use `at::compositeexplicitautograd` namespace to handle other caes. * The change to `make_boxed_from_unboxed_functor.h` is so that we accept a plain IntList IValue anywhere a SymIntList is expected; these are read-only arguments so covariant typing is OK. * I change how unboxing logic works slightly. Previously, we interpret the C++ type for Layout/etc directly as IntType JIT type, which works well because the incoming IValue is tagged as an integer. Now, we interpret the C++ type for Layout as its true type, e.g., LayoutType (change to `jit_type.h`), but then we accept an int IValue for it anyway. This makes it symmetric with SymInt, where we interpret the C++ type as SymIntType, and then accept SymInt and int IValues for it. * I renamed the `empty.names` overload to `empty_names` to make it less confusing (I kept mixing it up with the real empty overload) * I deleted the `empty.SymInt` overload, which ended up killing a pile of functions. (This was originally a separate PR but the profiler expect test was giving me grief so I folded it in.) * I deleted the LazyDynamicOpsTest tests. These were failing after these changes, and I couldn't figure out why they used to be passing: they make use of `narrow_copy` which didn't actually support SymInts; they were immediately converted to ints. * I bashed LTC into working. The patches made here are not the end of the story. The big problem is that SymInt translates into Value, but what if you have a list of SymInt? This cannot be conveniently represented in the IR today, since variadic Values are not supported. To work around this, I translate SymInt[] into plain int[] (this is fine for tests because LTC dynamic shapes never actually worked); but this will need to be fixed for proper LTC SymInt support. The LTC codegen also looked somewhat questionable; I added comments based on my code reading. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83628 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2022-08-26 01:35:40 +00:00
Mario Lezcano	f5a3515083	Make linalg.inv composite of linalg.solve (#80074 ) The `getri` kernel calls inside `getrs` so we can do so explicitly ourselves and save ourselves from having to maintain an extra kernel. This way we just need to optimise `lu_factor` and `lu_solve` and `inv` will be as efficient as it can be, as it'll be choosing the best backend to perform the factorisation and the best backend (not necessarily the same) to perform the solve. Fixes https://github.com/pytorch/pytorch/issues/77498 The benchmarks: https://github.com/pytorch/pytorch/pull/80074#issuecomment-1164309071 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80074 Approved by: https://github.com/IvanYashchuk, https://github.com/albanD, https://github.com/malfet	2022-08-25 09:28:55 +00:00
Horace He	e3c89d0778	Disable autocast cache during aotdispatch (#84035 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84035 Approved by: https://github.com/jansel	2022-08-25 08:58:50 +00:00
Horace He	8b8942b114	Fix dumb make_fx issue (#84011 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84011 Approved by: https://github.com/ezyang	2022-08-25 06:52:01 +00:00
PyTorch MergeBot	a7edf71360	Revert "Don't introduce new overload for SymInt (#83628 )" This reverts commit `8fae7027b3`. Reverted https://github.com/pytorch/pytorch/pull/83628 on behalf of https://github.com/malfet due to breaking internal builds, see https://www.internalfb.com/diff/D38984222	2022-08-25 00:49:40 +00:00
PyTorch MergeBot	5321bf52f2	Revert "Make linalg.inv composite of linalg.solve (#80074 )" This reverts commit `4737b33614`. Reverted https://github.com/pytorch/pytorch/pull/80074 on behalf of https://github.com/malfet due to Depends on the changes from https://github.com/pytorch/pytorch/pull/83628	2022-08-25 00:43:00 +00:00
Mario Lezcano	3e6e0a1d10	Support a stable double backward on linalg.det for real inputs (#80217 ) The complex case still fails. I do not know why. Fixes https://github.com/pytorch/pytorch/issues/62327 Fixes https://github.com/pytorch/pytorch/issues/53364 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80217 Approved by: https://github.com/nikitaved, https://github.com/albanD, https://github.com/malfet	2022-08-24 15:18:56 +00:00
Mario Lezcano	4737b33614	Make linalg.inv composite of linalg.solve (#80074 ) The `getri` kernel calls inside `getrs` so we can do so explicitly ourselves and save ourselves from having to maintain an extra kernel. This way we just need to optimise `lu_factor` and `lu_solve` and `inv` will be as efficient as it can be, as it'll be choosing the best backend to perform the factorisation and the best backend (not necessarily the same) to perform the solve. Fixes https://github.com/pytorch/pytorch/issues/77498 The benchmarks: https://github.com/pytorch/pytorch/pull/80074#issuecomment-1164309071 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80074 Approved by: https://github.com/IvanYashchuk, https://github.com/albanD, https://github.com/malfet	2022-08-24 15:18:56 +00:00
Edward Z. Yang	0491e1a13a	Support returning symbolic strides from t.stride() in Python (#83842 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83842 Approved by: https://github.com/albanD, https://github.com/Chillee, https://github.com/bdhirsh	2022-08-24 04:32:51 +00:00
Edward Z. Yang	8fae7027b3	Don't introduce new overload for SymInt (#83628 ) Previously, we introduced new SymInt overloads for every function we wanted. This led to a lot of boilerplate, and also a lot of confusion about how the overloads needed to be implemented. This PR takes a simpler but more risky approach: just take the original function and changes its ints to SymInts. This is BC-breaking in the following ways: * The C++ API for registering implementations for aten operators will change from int64_t to SymInt whenever you make this change. Code generated registrations in PyTorch do not change as codegen handles the translation automatically, but manual registrations will need to follow the change. Typically, if you now accept a SymInt where you previously only took int64_t, you have to convert it back manually. This will definitely break XLA, see companion PR https://github.com/pytorch/xla/pull/3914 Note that not all dispatch keys get the automatic translation; all the composite keys and Meta keys are modified to take SymInt directly (because they should handle them directly), and so there are adjustments for this. This is not BC-breaking in the following ways: * The user facing C++ API remains compatible. Even if a function changes from int to SymInt, the default C++ binding still takes only ints. (e.g., at::empty(IntArrayRef, ...). To call with SymInts, you must call at::empty_symint instead. This involved adding two more signatures to CppSignatureGroup; in many cases I refactored code to iterate over all signatures in the group instead of hard-coding the two that previously existed. * This is TorchScript compatible; internally we treat SymInts as ints so there is no change to what happens at runtime in TorchScript. In particular, it's OK to reference an empty schema by its old type (using int types), as long as you're not doing string equality (which you shouldn't be), these parse to the same underyling type. Structure of the PR: * The general strategy of this PR is that, even when you write `SymInt` inside `native_functions.yaml`, sometimes, we will treat it as if it were an `int`. This idea pervades the codegen changes, where we have a translation from SymInt to c10::SymInt or int64_t, and this is controlled by a symint kwarg which I added and then audited all call sites to decide which I wanted. Here are some of the major places where we pick one or the other: * The C++ FunctionSchema representation represents `SymInt` as `int`. There are a few places we do need to know that we actually have a SymInt and we consult `real_type()` to get the real type in this case. In particular: * When we do schema validation of C++ operator registration, we must compare against true schema (as the C++ API will provide `c10::SymInt`, and this will only be accepted if the schema is `SymInt`. This is handled with cloneWithRealTypes before we check for schema differences. * In `toIValue` argument parsing, we parse against the true schema value. For backwards compatibility reasons, I do still accept ints in many places where Layout/SymInt/etc were expected. (Well, accepting int where SymInt is expected is not BC, it's just the right logic!) * In particular, because SymInt never shows up as type() in FunctionSchema, this means that we no longer need a dedicated Tag::SymInt. This is good, because SymInts never show up in mobile anyway. * Changes to functorch/aten are mostly about tracking changes to the C++ API registration convention. Additionally, since SymInt overloads no longer exist, registrations for SymInt implementations are deleted. In many cases, the old implementations did not properly support SymInts; I did not add any new functionality with this PR, but I did try to annotate with TODOs where this is work to do. Finally, because the signature of `native::` API changed from int to SymInt, I need to find alternative APIs for people who were directly calling these functions to call. Typically, I insert a new dispatch call when perf doesn't matter, or use `at::compositeexplicitautograd` namespace to handle other caes. * The change to `make_boxed_from_unboxed_functor.h` is so that we accept a plain IntList IValue anywhere a SymIntList is expected; these are read-only arguments so covariant typing is OK. * I change how unboxing logic works slightly. Previously, we interpret the C++ type for Layout/etc directly as IntType JIT type, which works well because the incoming IValue is tagged as an integer. Now, we interpret the C++ type for Layout as its true type, e.g., LayoutType (change to `jit_type.h`), but then we accept an int IValue for it anyway. This makes it symmetric with SymInt, where we interpret the C++ type as SymIntType, and then accept SymInt and int IValues for it. * I renamed the `empty.names` overload to `empty_names` to make it less confusing (I kept mixing it up with the real empty overload) * I deleted the `empty.SymInt` overload, which ended up killing a pile of functions. (This was originally a separate PR but the profiler expect test was giving me grief so I folded it in.) * I deleted the LazyDynamicOpsTest tests. These were failing after these changes, and I couldn't figure out why they used to be passing: they make use of `narrow_copy` which didn't actually support SymInts; they were immediately converted to ints. * I bashed LTC into working. The patches made here are not the end of the story. The big problem is that SymInt translates into Value, but what if you have a list of SymInt? This cannot be conveniently represented in the IR today, since variadic Values are not supported. To work around this, I translate SymInt[] into plain int[] (this is fine for tests because LTC dynamic shapes never actually worked); but this will need to be fixed for proper LTC SymInt support. The LTC codegen also looked somewhat questionable; I added comments based on my code reading. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83628 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2022-08-23 22:04:07 +00:00
Ivan Yashchuk	cb488e6d2f	Allow None arguments for elementwise type promotion wrapper and fix clamp with None arguments (#83586 ) Fixes https://github.com/pytorch/torchdynamo/issues/759 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83586 Approved by: https://github.com/ezyang, https://github.com/ngimel	2022-08-23 17:47:10 +00:00
Horace He	7ebdb4c72f	Refactored ops on size to be dispatcher ops (#83719 ) An example of how the graph looks now. ``` def forward(self, x_1): size = torch.ops.math.size(x_1, 0) size_1 = torch.ops.math.size(x_1, 1); x_1 = None ones = torch.ops.aten.ones.default([1], device = device(type='cpu'), pin_memory = False) expand_sym_int = torch.ops.aten.expand.SymInt(ones, [size, size_1]); ones = size = size_1 = None cos_default = torch.ops.aten.cos.default(expand_sym_int); expand_sym_int = None return (cos_default,) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/83719 Approved by: https://github.com/ezyang	2022-08-23 15:48:00 +00:00
Horace He	0e0af73ba2	Add support for partial decompositions in make_fx (#83770 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83770 Approved by: https://github.com/ngimel	2022-08-20 01:03:39 +00:00
Edward Z. Yang	9152144944	Coverage for nondeterministic_seeded, respect it in constant prop (#83650 ) - nondeterministic_seeded was not applied to enough functions. I added some heuristics to codegen for identifying functions that are likely to be random and added a bunch of these tags to functions. Not sure I got all of them. - Don't constant propagate through nondeterministic functions in FX tracing. It would be better to do some testing for the tag but this would be quite an effort. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83650 Approved by: https://github.com/bdhirsh, https://github.com/eellison	2022-08-18 22:18:10 +00:00
Edward Z. Yang	24acc3155f	Be more conservative about propagating constants. (#83648 ) If a constant would turn into something large, don't keep it as a constant, just drop it. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83648 Approved by: https://github.com/eellison	2022-08-18 22:18:10 +00:00
Edward Z. Yang	817a82704f	Delete ProxyTensor wrapper subclass (#83330 ) I was working on https://github.com/pytorch/torchdynamo/issues/80 and my working hypothesis for what was causing the error was that proxy tensor was not advertising correct dispatch keys, causing AMP to operate differently when you traced. I could have fixed this directly by replicating fake tensor's fix for setting dispatch keys to also apply to proxy tensor, but I was like, "Why must I repeat myself." This PR is the result. It completely deletes the ProxyTensor wrapper subclass, so that when we are tracing, the tensors flowing through the program are the original real or fake tensors, depending on what the user requested in the top-level API. There is no more wrapping. To store the Proxy objects necessary for actually doing tracing, I store the property directly on the tensors. (Note: I never clean up old entries from the map at the moment, this is easily fixed by using a weak map) Benefits of doing this: * No more tip-toeing around no_dispatch() creation of new ProxyTensors; we never create new tensors (except when we call the underlying func), so you don't have to worry about accidentally tracing them. * No more syncing up metadata from in place operators. In particular https://github.com/pytorch/pytorch/issues/81526 is mooted * This fixes https://github.com/pytorch/torchdynamo/issues/519 as we no longer need to teach proxy tensor to support sparse tensor. * No more schlepping symbolic integers from the inner fake tensor to the outer proxy tensor. If you can make a fake tensor with symbolic ints, you're done, nothing else to do. To avoid having to rewrite all of the guts, when I get to the actual proxy tensor handler, I first "fetch" the stored ProxyTensor data from the weakmap via a tree_map, and then operate on the consequent data as before. A more optimized implementation is possible. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83330 Approved by: https://github.com/Chillee	2022-08-18 01:56:07 +00:00
Edward Z. Yang	e09821f784	Avoid using true division in split_dim (#83527 ) This makes it more amenable to tracing with dynamic shapes, where we don't support SymFloats yet. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83527 Approved by: https://github.com/ngimel	2022-08-17 04:19:29 +00:00
Edward Z. Yang	4c8cfb57aa	Convert SymInt tracing to mode based tracing (#83380 ) We're on our way to deleting ProxyTensor entirely (see https://github.com/pytorch/pytorch/pull/83330 ), but before we can do that, we have to delete ProxySymInt first. Here's the plan. Changes in torch.fx.experimental.symbolic_shapes * The general idea is to do mode based tracing. This means we need a mode that can interpose on all SymInt operations. There are a few ways to do this, but I've done it the easy way: (1) I have a separate mode for SymInt operations specifically called SymDispatchMode, and (2) this mode operates on PySymInt (and not the basic SymInt which is user visible). I elided Int from the name because if we add SymFloats I want to use the same mode to handle those as well, and I used Dispatch rather than Function because this is the "inner" dispatch operating PySymInt and not SymInt (this is not a perfect analogy, but SymFunctionMode definitely seemed wrong as you still must go through the C++ binding.) The mode is entirely implemented in Python for ease of implementation. We could have implemented this more symmetrically to TorchFunctionMode in C++, but I leave that as later work; this API is unlikely to get used by others (unlike TorchFunctionMode). One downside to not doing the mode in C++ is that we still have to do the hop via a preexisting PySymInt to wrap; this is currently not a big deal as conversion to SymInts only really happens when there is already another SymInt floating around. SymDispatchMode is pared down from TorchDispatchMode; there is no ancestor tracking since I don't expect people to be mixing up SymDispatchModes. * I made some improvements for tracing. When I invoke the SymDispatchMode handler, I would like constants to show up as constants, so they can be directly inlined into the FX graph (rather than going through a wrapping process first, and then the wrapped SymInt being used in the operation). To do this, I directly track if a PySymInt is a constant at construction time. Only wrapped PySymInts are constants. * For convenience, PySymInts now support all magic methods that regular SymInts do. This is so that redispatch inside the SymDispatchMode can be written the idiomatic way `func(args, kwargs)` where func is an operator. The original names are retained for direct C++ calls. Changes in torch.fx.experimental.proxy_tensor OK, so we got a new SymDispatchMode, so we define a ProxySymDispatchMode and activate it when we start tracing. This mode is currently unconditionally activated although technically we only need to activate it when doing symbolic tracing (it doesn't matter either way as there are no SymInts if you are not doing symbolic tracing). * We delete ProxySymInt. To do this, we must now record the proxy for the SymInt some other way. Based on discussion with Chillee, it is more intuitive to him if the proxies are still recorded on the SymInt in some way. So we store them in the `__dict__` of the PySymInt, indexed by Tracer. An improvement is to make this a weak map, so that we remove all of these entries when the tracer dies. In an original version of this PR, I keyed on the mode itself, but tracer is better as it is accessible from both modes (and as you will see, we will need to fetch the map from both the ProxySymDispatchMode as well as the ProxyTorchDispatchMode.) The implementation of SymDispatchMode now simply retrieves the proxies, performs the underlying operation as well as the FX graph recording, and then records the output proxy to the PySymInt. Note that FX tracing does not work with proxies and SymInts, so we manually call `call_function` to ensure that the correct operations get recorded to the graph. This means conventional FX retracing with proxies only will not work with these graphs, but there wasn't really any reason to do this (as opposed to `make_fx` retracing) anyway. Constants are detected and converted directly into Python integers. * SymInts can show up as arguments to tensor operations, so they must be accounted for in ProxyTorchDispatchMode as well. This is done by searching for SymInt arguments and converting them into proxies before the proxy call. This can be done more efficiently in a single `tree_map` but I'm lazy. The helper `unwrap_symint_proxy` conveniently implements the unwrapping in one place given a tracer; unfortunately it cannot be shared with SymDispatchMode as SymDispatchMode gets PySymInts, but ProxyTensorMode gets SymInts. Similarly, tensors that are returned from tensor operations can have SymInts in their shapes, which need fresh proxies allocated. To avoid leaking internal details of SymInt shape computation to the tensor operation graph, these SymInts are always given proxies derived from `x.size(dim)` call on their return tensor. We also need to do this for strides and numel but have not done so yet. Furthermore, we must avoid tracing internal SymInt calls while we run meta operations on the true operation; this is achieved by also disabling SymInt tracing on the inside of tensor tracing. This is analogous to how tensor tracing is disabled inside the implementation of tracing mode, but unfortunately we are unable to use the same mechanism (this would have been easier if the two modes could be combined somehow, and I am amenable to suggestions to try harder to achieve this.) * Because there are no more ProxySymInts, we no longer need to do anything to unwrap SymInt. Furthermore, we do not need to reallocate ProxySymInts on class creation. * If a bare SymInt without a Proxy is encountered, it is assumed that this must be a constant. `create_arg` handles this case. Non-constant free SymInts result in an assert error. * The initial input handling in `dispatch_trace` involves traversing all of the input tensors, traversing over their shapes, and assigning proxies for the SymInts in shapes in the same way we handle proxies for the output tensors. The preexisting testing is inadequate but will be better after I rebase past https://github.com/pytorch/pytorch/pull/82209 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83380 Approved by: https://github.com/samdow	2022-08-16 14:32:27 +00:00
Horace He	86de9e7291	Added some additional symbolic tracing tests (#82209 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82209 Approved by: https://github.com/ezyang	2022-08-14 00:47:57 +00:00
Horace He	c2808571bf	Removed trace_factory_functions=False option (#83215 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83215 Approved by: https://github.com/ezyang	2022-08-13 03:06:45 +00:00
Edward Z. Yang	d423722607	Add data_dependent_output tag; generalize proxy tensor to test it (#83312 ) Fixes https://github.com/pytorch/pytorch/issues/83251 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83312 Approved by: https://github.com/albanD	2022-08-12 17:31:55 +00:00
Brian Hirsh	ba90c9f229	fix functionalization <> resnet18, make ProxyTensor work with tensor-less decomps (#83207 ) This should fix a few of the errors I was seeing when I turned on functionalization in torchbench. It also fixes this AOTAutograd repro with resnet18: ``` import torch from torchvision.models import resnet18 from functorch._src.compilers import nop from functorch._src.aot_autograd import aot_module from functorch.compile import config config.use_functionalize = True model = resnet18().cuda().half().to(memory_format=torch.channels_last) input = torch.randn(256, 3, 224, 224, device='cuda', dtype=torch.float16) \ .to(memory_format=torch.channels_last).detach().requires_grad_(True) input_expected = input.clone().detach().requires_grad_(True) fn = aot_module(model, nop) out = fn(input) out_expected = model(input_expected) print(torch.allclose(out, out_expected)) out.sum().backward() out_expected.sum().backward() print(torch.allclose(input.grad, input_expected.grad)) ``` The problem was that functorch adds a decomp to the decomp table for `new_zeros`: ``` @register_decomposition(aten.new_zeros, aot_autograd_decompositions) def new_zeros(inp, size, dtype=None, layout=None, device=None, pin_memory=None): return torch.zeros(size, dtype=inp.dtype, device=inp.device) ``` When calling that decomp from inside of `ProxyTensorDispatchMode`, the ProxyTensorMode is already disabled, and `torch.zeros` doesn't take in any tensor-like arguments, so we never end up dispatching back into python again. The way that manifests is that the output of `new_zeros()` gets baked as a constant into the AOTAutograd FX graph. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83207 Approved by: https://github.com/ezyang	2022-08-12 01:07:31 +00:00
Edward Z. Yang	63f35f1a0b	Hack up make_fx to natively support varargs (#83210 ) This is kind of nasty but it works. I attempted to fix FX first but the inspect logic is impenetrable. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83210 Approved by: https://github.com/Chillee, https://github.com/albanD	2022-08-11 14:52:26 +00:00
Horace He	663967777b	Handle redispatch correctly with tensor subclasses in ProxyTensor mode (#83122 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83122 Approved by: https://github.com/ezyang	2022-08-11 01:31:16 +00:00
Edward Z. Yang	988bd0173c	Add OpOverload.decompose API (#83075 ) This allows you to directly call into the CompositeImplicitAutograd implementation of an operator, without changing any aspects of the dispatcher state. In particular, you can use this to recursively call into a decomposition, dispatching back to your tensor subclass/mode as desired. Hypothetically, we should also make these available in the decompositions dictionary, but I'm leaving this as future work as enumerating these decompositions is annoying (as operators are lazily registered.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83075 Approved by: https://github.com/albanD	2022-08-09 18:53:19 +00:00
kshitij12345	a3d37f1114	[composite compliance] quantile and nanquantile (#81767 ) Reference: #69991 This adds a new CompositeExplicit Operator `at::assert_all_true` (also exposed in Python) to check the truthiness of a tensor and throw an error based on that. This helps us mitigate `TORCH_CHECK(t.all().item<bool>(), "err_msg")` pattern which is not composite compliant. Using the mentioned operator, we fix `quantile` and `nanquantile` to be Composite Compliant. Question: Should it be documented? Pull Request resolved: https://github.com/pytorch/pytorch/pull/81767 Approved by: https://github.com/zou3519	2022-08-08 14:42:51 +00:00
Edward Z. Yang	d24724499b	Parameterize TestGenericProxyTensor on tracing_mode (#82746 ) This gives us tests for fake (all passing) and symbolic (not all passing). I needed to add a gadget for xfail'ing tests in symbolic. It might be generally useful, let me know what you think. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82746 Approved by: https://github.com/eellison	2022-08-06 12:41:54 +00:00
Edward Z. Yang	b361f70347	Reorganize test_proxy_tensor.py per tracing mode (#82739 ) I intend to run all of the proxy tensor tests on each of our tracing modes, but to do this I have to make the tests parametrized on tracing mode first. This does that refactor, without adding any new tests. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82739 Approved by: https://github.com/eellison	2022-08-06 12:41:54 +00:00
Nikolay Korovaiko	bfebf254dd	Re-land sym_numel (#82374 ) (#82726 ) (#82731 ) (#82855 ) ### Description This is a reland of (#82374) (#82726) (#82731) This PR has no extra fixes, it simply updates the correct pin to point to the XLA side that has the corresponding changes. ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82855 Approved by: https://github.com/ezyang, https://github.com/qihqi	2022-08-05 03:36:09 +00:00
PyTorch MergeBot	78bd95b13a	Revert "Re-land sym_numel (#82374 ) (#82726 ) (#82731 )" This reverts commit `c90e00cf85`. Reverted https://github.com/pytorch/pytorch/pull/82731 on behalf of https://github.com/zengk95 due to This is breaking XLA tests on trunk. It seems to have passed on PR and was able to checkout that commit `c90e00cf85`.	2022-08-04 22:45:26 +00:00
Nikolay Korovaiko	c90e00cf85	Re-land sym_numel (#82374 ) (#82726 ) (#82731 ) This PR relands sym_numel #82374 and fixes the ios build break in this commit : `8cbd0031c5` which was a type mismatch in an equality. ### Description <!-- What did you change and why was it needed? --> ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82731 Approved by: https://github.com/malfet	2022-08-04 21:05:24 +00:00
Fabio Rocha	ff753cbc12	[primTorch] Added unbind OpInfo and ref (#81776 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/81776 Approved by: https://github.com/Lezcano, https://github.com/ngimel	2022-08-04 17:03:24 +00:00
Horace He	1164c83c3c	Revert "Revert "Added zero.symint and modified aten::trapz to use symbolic ints (#82054 )"" (#82779 ) This reverts commit `0f52794ce7`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82779 Approved by: https://github.com/eellison	2022-08-04 02:26:50 +00:00
PyTorch MergeBot	0f52794ce7	Revert "Added zero.symint and modified aten::trapz to use symbolic ints (#82054 )" This reverts commit `cd73fc9456`. Reverted https://github.com/pytorch/pytorch/pull/82054 on behalf of https://github.com/Chillee due to Land didn't capture the additional github UI created commit	2022-08-03 23:45:21 +00:00
Horace He	cd73fc9456	Added zero.symint and modified aten::trapz to use symbolic ints (#82054 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82054 Approved by: https://github.com/ezyang	2022-08-03 20:32:18 +00:00
zengk95	d0e6e5a5bb	Revert "sym_numel (#82374 )" (#82726 ) TSIA It looks like this PR #82374 is breaking mac builds on trunk but I can't revert it normally since there's a merge conflict in the XLA hash. <img width="1753" alt="image" src="https://user-images.githubusercontent.com/34172846/182644661-b7fdda4b-e5ce-45c3-96a2-ad6737d169ae.png"> I reverted it and resolved the conflict using the old XLA hash that this commit was based upon Pull Request resolved: https://github.com/pytorch/pytorch/pull/82726 Approved by: https://github.com/albanD, https://github.com/janeyx99	2022-08-03 15:23:47 +00:00
Fabio Rocha	d6303cd860	Added OpInfo for unflatten and ref for flatten OpInfo (#81230 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/81230 Approved by: https://github.com/Lezcano, https://github.com/ngimel	2022-08-03 15:19:56 +00:00
Edward Z. Yang	42fefd4403	Sparse fake tensor support (#82172 ) Add support for sparse fake tensors. - The testing strategy is to run a fake tensor cross ref test on `test_sparse.py`. This is necessary because OpInfo sparse coverage is completely nonexistent. We could have tried to turn on cross ref testing globally for all files, but that would be very time consuming and the tests I'm interested in are mostly in this file. There are some exclusions in testing for things that don't work. - I make fake tensor converter raise a UnsupportedFakeTensorException if the meta converter fails to do a conversion (which can happen in a relatively large number of situations). - I relax fake tensor invariants so that you can make a fake tensor from a meta tensor. This is useful because in the cross ref test sometimes we operate on meta tensors. - Fake tensor wrapping is improved to handle the case when a function doesn't return any tensors - Meta converter is taught how to convert sparse tensors to meta There's still a little more cleanup that needs to be done, but this is good for review. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82172 Approved by: https://github.com/eellison	2022-08-03 14:29:36 +00:00
Nikolay Korovaiko	fd68b0931f	sym_numel (#82374 ) ### Description This PR makes `numel` symint-aware similar to `sym_sizes()` and `sym_strides()`. Similar to https://github.com/pytorch/pytorch/pull/81300 . This PR is the part of a bigger project to support dynamic_shapes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82374 Approved by: https://github.com/ezyang	2022-08-03 06:33:45 +00:00
Ivan Yashchuk	900e93d351	Add context manager for conditional rewrites of torch.* to torch._refs.* calls (#81764 ) Adds a new context manager `TorchRefsNvfuserCapabilityMode` for conditional rewrite of `torch.` calls to `torch._refs.` based on whether the decomposition consisting of prims supports nvFuser execution or not. A new optional argument for `TorchRefsMode` is added - `should_fallback_fn`, a callable that returns whether the original `torch.foo` or the replacement `torch._refs.foo` should be used. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81764 Approved by: https://github.com/ezyang	2022-08-02 11:02:10 +00:00
Edward Z. Yang	bf387e894f	Fix a NotImplemented mode bug and improve Parameter handling for fake tensor (#82574 ) Partially addresses https://github.com/pytorch/pytorch/issues/82547 The repro script still doesn't work with fake tensor, but it is now expected as fake tensor does not work unless all inputs are explicitly wrapped into fake tensor. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82574 Approved by: https://github.com/eellison	2022-08-01 20:40:01 +00:00
Edward Z. Yang	98215923ad	Correctly unpack constants when used in multi-return output (#82568 ) Partial fix for https://github.com/pytorch/pytorch/issues/82547 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82568 Approved by: https://github.com/IvanYashchuk, https://github.com/davidberard98	2022-08-01 20:40:01 +00:00
Edward Z. Yang	98b9dfa129	Add decompositions for zero_, fill_, new_full, new_zeros, new_ones (#82332 ) This makes symbolic tracing tests for logsigmoid and xlogy start working again. While I'm at it, add pin_memory and layout kwargs to empty; but they don't actually do anything and raise an error if they are non standard. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82332 Approved by: https://github.com/eellison	2022-07-28 04:02:02 +00:00
Elias Ellison	1c0f7bd6d2	Enable complex for meta tensors (#79975 ) There weren't really any fundamental blockers - add support for `aten::complex` - update `angle` for complex - remove the error in the fallback kernel Pull Request resolved: https://github.com/pytorch/pytorch/pull/79975 Approved by: https://github.com/ezyang	2022-07-27 22:19:14 +00:00
Edward Z. Yang	617e90db22	Add meta support for eye (#82309 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82309 Approved by: https://github.com/bdhirsh	2022-07-27 18:42:47 +00:00
Edward Z. Yang	d38ffa6a4c	Make all of new_/_like factory functions composite explicit autograd (#82238 ) Once CompositeImplicitAutograd gets registered to Python key, this will ensure that tensor subclasses can interpose on these functions directly rather than getting decomposed. We prefer not decomposing as these functions are functional, but their implementations use inplace operations (and are thus more difficult to deal with, unless you use functionalization.) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82238 Approved by: https://github.com/zou3519, https://github.com/bdhirsh	2022-07-27 18:33:46 +00:00
Horace He	a42616e0bf	Revert "Revert "Ported aten::cross to work with symints (#82052 )"" (#82287 ) This reverts commit `e519dd37e1`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82287 Approved by: https://github.com/ezyang	2022-07-27 04:51:06 +00:00
PyTorch MergeBot	e519dd37e1	Revert "Ported aten::cross to work with symints (#82052 )" This reverts commit `30ed427d2e`. Reverted https://github.com/pytorch/pytorch/pull/82052 on behalf of https://github.com/Chillee due to broke build on master	2022-07-27 01:04:42 +00:00
Horace He	30ed427d2e	Ported aten::cross to work with symints (#82052 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82052 Approved by: https://github.com/ezyang	2022-07-27 00:45:26 +00:00
Horace He	91b4648633	Did some cleanup of symbolic shapes (#82051 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82051 Approved by: https://github.com/eellison, https://github.com/ezyang	2022-07-27 00:45:26 +00:00
Horace He	fc389cc0a0	Added new_empty.symint overload and a new_empty ref (#82049 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82049 Approved by: https://github.com/ezyang	2022-07-27 00:31:57 +00:00
lezcano	11fe277b62	[PrimTorch] Add reference for torch.norm (#81765 ) This ref does more things than `torch.norm`, and it fixes a few bugs that `torch.norm` has. This implementation and the `torch.norm` implementation come to terms in the next PR of this stack We put this PR before, as otherwise `test_decomp.py` was failing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81765 Approved by: https://github.com/ngimel	2022-07-25 19:57:21 +00:00
Mostafa Elhoushi	0894c4967d	Add test_make_fx_model_train example (#980 ) (#82011 ) Summary: Pull Request resolved: https://github.com/pytorch/functorch/pull/980 Test Plan: CI should pass Differential Revision: D38078694 Pulled By: mostafaelhoushi Pull Request resolved: https://github.com/pytorch/pytorch/pull/82011 Approved by: https://github.com/Chillee	2022-07-25 12:43:17 +00:00
Horace He	1a18ff3247	Revert "Revert "Added dynamic shape POC (#81093 )"" (#82063 ) This reverts commit `0888a4844c`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82063 Approved by: https://github.com/ezyang	2022-07-23 22:35:50 +00:00
PyTorch MergeBot	0888a4844c	Revert "Added dynamic shape POC (#81093 )" This reverts commit `8169a85dc6`. Reverted https://github.com/pytorch/pytorch/pull/81093 on behalf of https://github.com/janeyx99 due to Broke slow tests on trunk `8169a85dc6`.	2022-07-23 11:30:37 +00:00
Horace He	8169a85dc6	Added dynamic shape POC (#81093 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/81093 Approved by: https://github.com/ezyang, https://github.com/eellison	2022-07-23 04:46:32 +00:00
PyTorch MergeBot	521d5ae1ce	Revert "Enable reentrant dispatch for decompositions (#81598 )" This reverts commit `08b9544e1c`. Reverted https://github.com/pytorch/pytorch/pull/81598 on behalf of https://github.com/ezyang due to out of tree failures	2022-07-22 00:21:18 +00:00
Edward Z. Yang	5b88a2078b	Follow GitHub relabeling of oncall: fx for test owners (#81821 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/81821 Approved by: https://github.com/janeyx99	2022-07-21 01:50:06 +00:00
David Berard	08b9544e1c	Enable reentrant dispatch for decompositions (#81598 ) This allows us to avoid tracing through CompositeImplicitAutograd ops when decomposing via make_fx or other decomposition methods that use tracing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81598 Approved by: https://github.com/ezyang	2022-07-20 21:26:16 +00:00
Edward Z. Yang	fca03eeec1	Make proxy tensor support item() calls on torch.tensor constants (#81192 ) This PR is doing a few interrelated things, all of which are necessary to get correctness. Read the comment in torch/fx/experimental/proxy_tensor.py for the high level overview. Let's break down the parts of this PR: * Bug fix where `enable_torch_dispatch_mode` with `None` doesn't work. This make `enable_torch_dispatch_mode(current_mode.inner)` work which is the basis for how we temporarily disable fake tensor mode. * Bug fix for when fake tensor mode is combined with a non-mode tensor subclass. This actually could be ablated from this PR but it affects where the logic for allowing non fake tensor inputs with lift goes, so it's all in here in one go. There are some relevant tests for the fix in fake tensor, but it turns out I didn't need this because I'm always using proxy tensors as a mode (which ensures the ordering is right.) * New `lift_fresh` view operator. Note that like lift, we have to manually write the functionalize kernel for these functions. * The actual change, which is to save constants when we see them in the proxy tensor mode, and then propagate them as we go (because otherwise you'll handle mutations on constants incorrectly--see test.) This is mildly BC-breaking if anyone was previously interposing on at::lift, but this operator was relatively new and I checked functorch which has no explicit reference to lift. So I think it should not be too disruptive. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/81192 Approved by: https://github.com/samdow, https://github.com/bdhirsh	2022-07-15 03:53:40 +00:00
Horace He	b7046e9b7f	Stopped ProxyTensor from turning aten::lift tensors into proxy objects (#81024 ) ``` def f(): val = torch.tensor(float('inf')) return torch.full((100, 100), val) ``` today we turn `val` into a ProxyTensor, and then complain when we try to turn `val` into a scalar. We call `aten::lift` when we call `torch.tensor(5)`, so this just prevents those from being turned into ProxyTensors unnecessarily. cc: @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/81024 Approved by: https://github.com/ezyang	2022-07-07 04:54:31 +00:00
David Berard	00f651811a	Interpreter for decomposing aten -> prims (#79989 ) If an aten -> prim decomposition is needed after the initial trace with make_fx, this interpreter can be used to perform the decomposition. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79989 Approved by: https://github.com/SherlockNoMad	2022-06-29 21:16:28 +00:00
Horace He	615dd25088	Made Proxy Tensor Mode also trace overloads (#80403 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/80403 Approved by: https://github.com/zou3519	2022-06-28 04:31:43 +00:00
PyTorch MergeBot	4e33c8c6bb	switched over to using faketensor in proxytensor (#79634 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/79634 Approved by: https://github.com/albanD	2022-06-27 19:55:47 +00:00
Horace He	159d459c50	Switched to tracing overloads by default (#80013 ) There are many cases where it's more convenient to use overloads, but we've hesitated in doing so since we can't torchscript it directly. Luckily, it's pretty easy to strip overloads. See https://github.com/pytorch/functorch/pull/899 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80013 Approved by: https://github.com/zou3519	2022-06-22 18:55:06 +00:00
Peter Bell	9bf52f4be8	Add OpInfo for torch.equal and fix support for non-standard bools Pull Request resolved: https://github.com/pytorch/pytorch/pull/79389 Approved by: https://github.com/mruberry	2022-06-20 23:48:39 +00:00
Horace He	f5d7e5a192	started using mode-based tracing Pull Request resolved: https://github.com/pytorch/pytorch/pull/79638 Approved by: https://github.com/samdow	2022-06-17 20:24:49 +00:00
Horace He	4d88affb5d	Ported proxy tensor tests over to core (#78890 ) Will fill out later Pull Request resolved: https://github.com/pytorch/pytorch/pull/78890 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-06-07 00:28:53 +00:00

... 5 6 7 8 9

421 Commits