pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Jiewen Tan	93aa6e3e36	[LTC] Make some LazyGraphExecutor private data structures protected (#90457 ) Summary: This pull request makes some LazyGraphExecutor private data structures protected such that XLAGraphExecutor can reuse them. Here is the list: 1. DeviceLocker. 2. DeviceLockerArena. 3. DataCacheArena. In addition, it also introduces LazyGraphExecutor::ResetTrimCounter() such that XLAGraphExecutor can reuse the trim counter. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90457 Approved by: https://github.com/JackCaoG	2022-12-09 18:28:13 +00:00
Jiewen Tan	b738da8c8e	[LTC] Tweak LazyTensor Class for XLATensor (#90363 ) Summary: This pull request makes some tweaks on LazyTensor class such that it's easier for XLATensor to inherit. 1. It replaces data_ptr() with data() which now returns a const shared_ptr& type. 2. It adds a temporary ctor to LazyTensor::Data such that XLATensor::Data can easily inherits it. 3. It moves LazyTensor(std::shared_ptr<Data>) and SetTensorData(at::Tensor) to protected for XLATensor to access. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90363 Approved by: https://github.com/JackCaoG	2022-12-08 18:23:17 +00:00
Jiewen Tan	1978773399	[LTC] Overlap data creation and ir_value setting (#90438 ) Summary: Upstreaming changes from torch_xla to lazy tensor core: https://github.com/pytorch/xla/pull/4011. It overlaps data creation and ir_value setting with previous executions. To be noted, this is a clone of https://github.com/pytorch/pytorch/pull/87119, and the author is @aws-rhsoln. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90438 Approved by: https://github.com/JackCaoG	2022-12-08 08:11:01 +00:00
Jiewen Tan	c20d41253f	[LTC] Tweak LazyGraphExecutor for XLA (#90420 ) Summary: This patch moves some of the data structures from private to protected such that XLAGraphExecutor can reuse them. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90420 Approved by: https://github.com/JackCaoG	2022-12-08 06:56:23 +00:00
Jiewen Tan	7e034193bb	[LTC] Restore default ctor for LazyTensor (#90086 ) Summary: This pull request introduced a temporarily change that make XLA's LTC migration easier. One step among is to make XLATensor naively inherits LazyTensor and that requires LazyTensor to have a default constructor. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90086 Approved by: https://github.com/JackCaoG, https://github.com/kit1980	2022-12-05 18:26:37 +00:00
Jiewen Tan	69d7afc799	[LTC] Remove noop_execution_mode_ (#89989 ) Summary: noop_execution_mode_ doesn't seem to be useful anymore. Let's remove it. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89989 Approved by: https://github.com/desertfire, https://github.com/JackCaoG	2022-12-02 01:51:30 +00:00
Jiewen Tan	13d2af2a9b	[LTC] Metrics can be reset too (#89606 ) Summary: This change allow MetricsArena to ResetMetrics too. And then rename Reset to ResetCounters given that's what it does for real. This matches pytorch/xla#4109, and is paired with pytorch/xla#4245. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89606 Approved by: https://github.com/JackCaoG	2022-11-28 21:44:12 +00:00
Sahan Paliskara	2a2c07ae37	[multipy] Address GetPythonFramesFunction() and multipy incompatibility. (#267 ) (#89315 ) Summary: https://github.com/pytorch/pytorch/pull/89122 introduces internal compatibility issues with torchdeploy. However, GetPythonFramesFunction() never worked with torchdeploy, so this PR simply reverts to the original behavior of skipping the function if torchdeploy is used as a forward fix. Test Plan: Running failed tests in T128123281 ``` buck2 test @//mode/opt //multipy/runtime:test_deploy -- --exact 'multipy/runtime:test_deploy - TorchpyTest.TaggingRace' --run-disabled buck2 test mode/dev //multipy/runtime/testdev:test_deploy_from_python -- --exact 'multipy/runtime/testdev:test_deploy_from_python - multipy.runtime.testdev.test_deploy_from_python.TestDeployFromPython: test_deploy_from_python' ``` Differential Revision: D41414263 Pull Request resolved: https://github.com/pytorch/pytorch/pull/89315 Approved by: https://github.com/kurman	2022-11-28 19:36:45 +00:00
Jiewen Tan	4f5c4c022a	[LTC] Refine MetricsArena::Reset (#89608 ) Summary: After counters are reset, getters' behaviors are inconsistent. To improve that, here I 1) move the validation of CounterData into CounterData::IsValid such that it's better encapsulated, 2) divide getters into two groups: a) MetricsArena::GetCounter() and b) MetricsArena::ForEachCounter(), and route MetricsArena::GetCounterNames() and CreateMetricReport() to use b. This is paired with pytorch/xla#4217. Test Plan: PJRT_DEVICE=CPU python xla/test/test_metrics.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/89608 Approved by: https://github.com/JackCaoG	2022-11-24 10:57:03 +00:00
Nikita Karetnikov	07dd2fe6c3	Symintify `select` (#89326 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/89326 Approved by: https://github.com/ezyang	2022-11-23 05:00:33 +00:00
Jiewen Tan	2dcacc6b99	[LTC] Upstream short_metrics (#89186 ) Summary: This pull request upstreams pytorch/xla#4148. Test Plan: xla CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89186 Approved by: https://github.com/JackCaoG	2022-11-18 09:28:48 +00:00
Jiewen Tan	3c2676de3d	[LTC] Restore GetPythonFrames (#89122 ) Summary: pytorch/pytorch@936e930 delete the registration of GetPythonFramesFunction. Restore that and add a test case to prevent regression. Test Plan: python test/lazy/test_debug_util.py Fixes pytorch/xla#4206. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89122 Approved by: https://github.com/JackCaoG	2022-11-18 03:37:14 +00:00
Kazuaki Ishizaki	a5f04e9a91	Fix typos in .md and .rst files (#88962 ) This PR fixes typos `Github` in `.md` and `.rst` files. `Github` -> `GitHub` Pull Request resolved: https://github.com/pytorch/pytorch/pull/88962 Approved by: https://github.com/kit1980	2022-11-17 03:37:02 +00:00
Kazuaki Ishizaki	e0c194f10b	Fix typos in messages under torch (#88961 ) This PR fixes typos of messages and parms in c++ source and head files under `torch` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88961 Approved by: https://github.com/albanD	2022-11-14 19:06:41 +00:00
Jiewen Tan	3b8245ab12	[LTC] Make ComputePostOrder accept const T pointers (#88773 ) Summary: Since `c10::ArrayRef` now support `c10::ArrayRef<const T>`, let's restore `ComputePostOrder` to accept `const Node*` again, which is more suitable for the context of the given helpers. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88773 Approved by: https://github.com/JackCaoG	2022-11-10 18:34:19 +00:00
Jiewen Tan	c29502dd2f	[LTC] Remove view (#88445 ) Summary: This pull request removes the last view ops, the original view. Test Plan: ./build/bin/test_lazy --gtest_filter=LazyOpsTest.TestView* Pull Request resolved: https://github.com/pytorch/pytorch/pull/88445 Approved by: https://github.com/JackCaoG, https://github.com/antoniojkim, https://github.com/Krovatkin	2022-11-08 02:22:02 +00:00
Jiewen Tan	7354368fd5	[LTC] Remove non-native view ops (#88031 ) Summary: LTC somehow implements a bunch of non-native view ops during the transition to functionalization. Let's remove them now that functionalization is final. Test Plan: CI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/88031 Approved by: https://github.com/JackCaoG, https://github.com/antoniojkim	2022-11-02 23:31:26 +00:00
Edward Z. Yang	1ff52225f1	Unify SymIntNode and SymFloatNode into SymNode (#87817 ) This refactor was prompted by challenges handling mixed int/float operations in C++. A previous version of this patch added overloads for each permutation of int/float and was unwieldy https://github.com/pytorch/pytorch/pull/87722/ This PR takes a different approach. The general outline of the patch is to combine the C++ types SymIntNode and SymFloatNode into a single type, SymNode. This is type erased; we no longer know statically at C++ if we have an int/float and have to test it with the is_int()/is_float() virtual methods. This has a number of knock on effects. - We no longer have C++ classes to bind to Python. Instead, we take an entirely new approach to our Python API, where we have a SymInt/SymFloat class defined entirely in Python, which hold a SymNode (which corresponds to the C++ SymNode). However, SymNode is not pybind11-bound; instead, it lives as-is in Python, and is wrapped into C++ SymNode using PythonSymNode when it goes into C++. This implies a userland rename. In principle, it is also possible for the canonical implementation of SymNode to be written in C++, and then bound to Python with pybind11 (we have this code, although it is commented out.) However, I did not implement this as we currently have no C++ implementations of SymNode. Because we do return SymInt/SymFloat from C++ bindings, the C++ binding code needs to know how to find these classes. Currently, this is done just by manually importing torch and getting the attributes. - Because SymInt/SymFloat are easy Python wrappers, __sym_dispatch__ now takes SymInt/SymFloat, rather than SymNode, bringing it in line with how __torch_dispatch__ works. Some miscellaneous improvements: - SymInt now has a constructor that takes SymNode. Note that this constructor is ambiguous if you pass in a subclass of SymNode, so an explicit downcast is necessary. This means toSymFloat/toSymInt are no more. This is a mild optimization as it means rvalue reference works automatically. - We uniformly use the caster for c10::SymInt/SymFloat, rather than going the long way via the SymIntNode/SymFloatNode. - Removed some unnecessary toSymInt/toSymFloat calls in normalize_* functions, pretty sure this doesn't do anything. - guard_int is now a free function, since to guard on an int you cannot assume the method exists. A function can handle both int and SymInt inputs. - We clean up the magic method definition code for SymInt/SymFloat/SymNode. ONLY the user classes (SymInt/SymFloat) get magic methods; SymNode gets plain methods; this is to help avoid confusion between the two types. Signed-off-by: Edward Z. Yang <ezyang@fb.com> cc @jansel @mlazos @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 Pull Request resolved: https://github.com/pytorch/pytorch/pull/87817 Approved by: https://github.com/albanD, https://github.com/anjali411	2022-10-27 20:56:02 +00:00
Jiewen Tan	2205f56f46	[LTC] Remove lazy::View (#87822 ) Summary: This is the first part to remove the whole view and aliasing infrastructure in LTC, which is deprecated in favor of functionalization. It mainly removes things that use lazy::View. Test Plan: CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/87822 Approved by: https://github.com/JackCaoG, https://github.com/antoniojkim, https://github.com/wconstab	2022-10-27 20:39:30 +00:00
Jiewen Tan	536474e823	[LTC] Remove tensor.storage_ (#87645 ) Summary: Since LTC now supports functionalization, we don't need to fake a storage to support is_alias_of anymore. Let's remove it. Test Plan: ./build/bin/test_lazy --gtest_filter=LazyOpsTest.IsAliasOf Pull Request resolved: https://github.com/pytorch/pytorch/pull/87645 Approved by: https://github.com/JackCaoG, https://github.com/bdhirsh	2022-10-26 22:41:19 +00:00
lezcano	faf9c47abb	Simplify a few diagonal-related functions (#87180 ) `diag` was unnecessarily implemented as a kernel rather than as a composite function, which made it unnecessarily difficult (explicit backward + all it entails). We also change a few uses of `diag` on 2D tensors for `diagonal()`. The latter returns a view rather than creating a new tensor. We also upgrade its meta implementation to a fully-fledged decomposition I tried implementing the backwards of `diagonal()` via `diag_scatter` (or better `diag_scatter_` to keep the perf) but functionalisation was failing and I was not sure how to fix this, so I moved on. It may be possible to simplify that one as well if @soulitzer or someone knows how to do this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87180 Approved by: https://github.com/ngimel, https://github.com/albanD, https://github.com/mruberry	2022-10-24 06:11:53 +00:00
Antonio Kim	d37dc6f698	Make LazyGraphExecutor extensible (#87218 ) Add `LazyGraphExecutor` to backend interface so that its is extensible by a vendor backend. I've made some preliminary methods virtual. Not sure if we want to make all methods in `LazyGraphExecutor` virtual. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87218 Approved by: https://github.com/wconstab, https://github.com/alanwaketan	2022-10-21 14:28:14 +00:00
JackCaoG	8f85831fdf	Give more clear error message when gscope is non-empty (#87005 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/87005 Approved by: https://github.com/alanwaketan, https://github.com/Krovatkin	2022-10-17 18:17:01 +00:00
Sahan Paliskara	936e93058b	Delete torch::deploy from pytorch core (#85953 ) As we have migrated torch::deploy over to https://github.com/pytorch/multipy, we can now delete it from pytorch core as ongoing development will happen there. This PR was created due to syncing issues with https://github.com/pytorch/pytorch/pull/85443 which is where the review history can be found. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85953 Approved by: https://github.com/seemethere, https://github.com/malfet	2022-10-06 07:20:16 +00:00
Edward Z. Yang	e8b0bea677	Rename fromIntArrayRef to fromIntArrayRefSlow, audit call sites (#86235 ) Some of them are known non-negative, I've revised them accordingly. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/86235 Approved by: https://github.com/albanD	2022-10-05 23:11:01 +00:00
Edward Z. Yang	79dd621f76	Symbolic shapes mega merge PR (Oct 3) (#86160 ) - TensorGeometry supports symint - check_size supports symint - functorch batch rule improved symint - Some operator support for symint in LTC - More supported operations on SymInt and SymFloat - More symint support in backwards formulas This merge includes code contributions from bdhirsh and anjali411. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/86160 Approved by: https://github.com/Chillee	2022-10-04 04:12:09 +00:00
Edward Z. Yang	aabf3e234b	Allow functionalize_aten_op to work with non-SymInt signature. (#86080 ) This is done similarly to how we did CPU fallback template. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/86080 Approved by: https://github.com/wconstab	2022-10-03 02:57:44 +00:00
Edward Z. Yang	3b6588ab74	Consistent compute numel/contiguous strategy with SymInts (#85858 ) Previously, our handling for contiguity was inconsistent in the following ways: - is_strides_like 2d/3d and is_non_overlapping_and_dense always were computed based on sizes_and_strides_, even if you had symbolic ints - Furthermore, even if you set custom policy for strides, these quantities were not overridable by subclasses - Furthermore, we didn't even store these fields on ExtraMeta - We duplicate implementations of compute_contiguous (plain, channels last, channels last 3d) - We inconsistently called refresh_numel()/refresh_contiguous(), versus recomputing it ourselves This factor makes a consistent strategy for all of the boolean fields, and for numel computation. After this refactor: - All layout boolean fields are interposable via strides policy and can be overridden from Python; you will never access a garbage field - All layout boolean fields are on ExtraMeta - You can always call refresh_numel/contiguous, no matter if your Tensor is contiguous or not - The numel/layout boolean fields are always populated consistently with the sizes strides fields (either on Tensor or ExtraMeta), even if you have custom policy - There is only one implementation of the actual computation logic Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39907696](https://our.internmc.facebook.com/intern/diff/D39907696) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85858 Approved by: https://github.com/albanD	2022-09-30 21:26:34 +00:00
Antonio Kim	8d99d6127e	Add torch_lazy_all_numbers_special_scalars flag (#85902 ) This is to allow even non zero and one scalars to appear as constants in the graph. The assumption being that none of them will change. The flag is set to `false` by default to preserve the original behaviour. CC: @wconstab @JackCaoG @ke1337 @vaibhavc-cerebras @glebk-cerebras Pull Request resolved: https://github.com/pytorch/pytorch/pull/85902 Approved by: https://github.com/wconstab	2022-09-30 19:25:41 +00:00
Edward Z. Yang	793488cda2	Revert "Revert "Symintifying slice ops (#85196 )"" (#85746 ) This reverts commit `3a171dfb0c`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/85746 Approved by: https://github.com/albanD	2022-09-28 04:37:35 +00:00
PyTorch MergeBot	3a171dfb0c	Revert "Symintifying slice ops (#85196 )" This reverts commit `4c01c51266`. Reverted https://github.com/pytorch/pytorch/pull/85196 on behalf of https://github.com/atalman due to Break internal build Exutorch	2022-09-27 18:01:27 +00:00
Brian Hirsh	4a2d2e5e40	Change API type `Tensor[]` for structured kernels. (#73350 ) Partially fixes: #66328 This PR: - adds support for `ITensorList` to the dispatcher for: - computing the dispatch key - boxing and unboxing `ITensorList` - modified the codegen for structured kernels: - codegen APIs use `ITensorList` instead of `ArrayRef<Tensor>` Changes summary: - Signature changes due to the different APIs: - dispatcher API (e.g. `BatchingRegistrations.cpp`) - C++ API (e.g. `TensorShape.cpp`) - Miscelaneous functions used by codegen'd functions (e.g. `FunctionalTensorWrapper.`) - Dispatcher changes for handling `ITensorList` correctly (e.g. `DispatchKeyExtractor.h`) - Signature changes of `at::cat` due to the need of `const` inside `TensorBody.h` - Forward declarations of `ITensorList` (e.g. `MethodOperators.h`) - Codegen changes, special casing structured kernels (e.g. `gen.py`) Short description of structured kernels special casing:* I introduced, mainly, 5 types of changes to the codegen for generating code depending on whether the kernel is structured or not: 1. Added a `structured_type_override` flag to the `argument_type` function definition of the affected APIs (mainly the dispatcher and C++ APIs). - `api/cpp.py`, `api/dispatcher.py`, `api/native.py` 2. Added a `structured_type_override` member to the signature classes (e.g. `CppSignature`), since `FunctionSchema` doesn't really know whether the function is structured or not - `api/types.py` 3. Added a `part_of_structured_group` to `NativeFunction` class, which is just a convenient function to forward to `structured_type_override` wherever needed - `model.py` 4. Appropriately changed the rest of the codegen, whenever it used either the signature classes or the `arguments` function directly 5. Added a check for `const ITensorList&` type wherever there was a check for `TensorList` Pull Request resolved: https://github.com/pytorch/pytorch/pull/73350 Approved by: https://github.com/bdhirsh	2022-09-26 21:46:38 +00:00
Edward Z. Yang	4c01c51266	Symintifying slice ops (#85196 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85196 Approved by: https://github.com/ezyang	2022-09-23 22:01:32 +00:00
Edward Z. Yang	3eb27229dd	as_strided symbolic support (#85264 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39662820](https://our.internmc.facebook.com/intern/diff/D39662820) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85264 Approved by: https://github.com/wconstab	2022-09-21 13:34:55 +00:00
Edward Z. Yang	e24e17916f	Remove errant semicolon (#85356 ) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39662630](https://our.internmc.facebook.com/intern/diff/D39662630) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85356 Approved by: https://github.com/wconstab, https://github.com/Krovatkin, https://github.com/voznesenskym	2022-09-20 18:01:15 +00:00
Nikolay Korovaiko	2dbd2673b6	remove symintnode bits in LTC (#85171 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85171 Approved by: https://github.com/ezyang	2022-09-17 03:14:59 +00:00
Edward Z. Yang	65158b8876	empty strided symint (#84830 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84830 Approved by: https://github.com/ezyang	2022-09-15 04:09:43 +00:00
Edward Z. Yang	9e5563dbb1	Delete SymIntArrayRef wrapper struct (#84837 ) Since we separated at::foo and at::foo_symint there is no benefit to trying to make initializer lists work in both cases. So we can get rid of the special different struct. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84837 Approved by: https://github.com/kit1980	2022-09-12 20:04:01 +00:00
PyTorch MergeBot	034f2db1fd	Revert "Delete SymIntArrayRef wrapper struct (#84837 )" This reverts commit `9c78f599e4`. Reverted https://github.com/pytorch/pytorch/pull/84837 on behalf of https://github.com/ZainRizvi due to The test test_post_localSGD_optimizer_step_reload in the X linux-bionic-cuda11.6-py3.10-gcc7 workflow has started consistently failing since this PR was submitted	2022-09-12 19:04:07 +00:00
Edward Z. Yang	9c78f599e4	Delete SymIntArrayRef wrapper struct (#84837 ) Since we separated at::foo and at::foo_symint there is no benefit to trying to make initializer lists work in both cases. So we can get rid of the special different struct. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84837 Approved by: https://github.com/kit1980	2022-09-12 16:28:20 +00:00
Edward Z. Yang	c5a8946e40	Revert "Revert "Redo how custom/python_custom methods on TensorImpl work (#84796 )" (#84806 ) This reverts commit `ca3b2bfbe3`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/84806 Approved by: https://github.com/Chillee	2022-09-10 06:17:35 +00:00
Eli Uriegas	ca3b2bfbe3	Revert "Redo how custom/python_custom methods on TensorImpl work (#84796 ) This reverts commit `591b75bf98`. Manual revert of https://github.com/pytorch/pytorch/pull/84641 Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/84796 Approved by: https://github.com/izaitsevfb	2022-09-10 00:18:13 +00:00
Eli Uriegas	93aef3a010	Use presence of _symint in kernel name to generate symint sig or not (#84579 ) Something people found confusing was that whether or not a native:: signature would get SymInt or not in its type was based on the dispatch key. This changes it so that SymInt or not in type is based on whether or not you have _symint in the name of the kernel or not. This means that even when we make operators support SymInt, you no longer have to go and update all the preexisting definitions; instead, you now selectively write _symint to opt individual kernels into SymInt support. I then go and update a bunch of kernels that don't have proper SymInt support to make use of this convention. There is some hacking around for view generation code. I also add support for external backends to specify 'symint' operators, for which we generate SymInt signatures instead of regular signatures. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: [D39310060](https://our.internmc.facebook.com/intern/diff/D39310060) Pull Request resolved: https://github.com/pytorch/pytorch/pull/84579 Approved by: https://github.com/wconstab	2022-09-09 18:31:56 +00:00
Edward Z. Yang	591b75bf98	Redo how custom/python_custom methods on TensorImpl work (#84641 ) A longstanding confusion in the implementation of fake tensor and proxy tensor is what to do about torch.ops.aten.sym_sizes and related calls. In particular, when you have a tensor that (1) has symbolic shapes and (2) has a `__torch_dispatch__` call, previously, you would always get `__torch_dispatch__` calls for sizes/strides query, even if you didn't request it via the dispatch kwargs in `make_wrapper_subclass`. The reason for this is because we were previously mixing several concepts: "I want to dispatch to Python", "I want to call a virtual method" and "I have dynamic shapes". A single boolean variable controlled all of these things, and so it was not possible to understand inside TensorImpl what the user had actually originally requested. In this PR, we track each of these concepts individually so that we can preserve user intent. Then, we combine these into a single "policy" variable that controls whether or not we can use the fastpath or not. For the policy to trigger, we only need one of the exceptional cases to be true. Billing of changes: * Rename `set_sizes_strides_policy` to `set_custom_sizes_strides`; in general, you cannot DIRECTLY set policy; you have to indirectly set it by the public functions. * Some helpers for sizes and strides, since it's more complicated (as it is an enum, rather than just bools as is the case for device and layout). `matches_python_custom` is used to test the Python dispatch user ask. `matches_policy` does the policy test (only used in the user facing functions.) * I reorged the accessor methods so that they are more logical. This makes the diff bad, so I recommend reading the final code directly. * The default custom implementations now more reliably call their default() implementations * As bonus refactor, I devirtualized some functions that don't need to be virtual * `set_sym_sizes_and_strides` is renamed to `set_sizes_and_strides` to make it easier to use in template contexts; it optionally takes a storage offset now so you can set all three values at the same time. If you use the SymInt overload but there are no symbolic integers, we give you a normal resize. * This adds `sym_storage_offset` since we had that in the symbolic shapes branch and there's no reason not to put it in (and it reduces merge conflicts) Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/84641 Approved by: https://github.com/wconstab	2022-09-09 13:41:13 +00:00
Antonio Kim	7c3102f3f0	Add ShouldSyncTensor interface (#84418 ) Adding an `ShouldSyncTensor` interface to allow for the case of output pruning should a vendor not support retrieving the value of a certain output. CC: @wconstab @JackCaoG @Krovatkin Pull Request resolved: https://github.com/pytorch/pytorch/pull/84418 Approved by: https://github.com/wconstab	2022-09-07 05:03:02 +00:00
Antonio Kim	bab1304f59	Add step closures (#84300 ) Ports over the step closure functionality from PyTorch/XLA to Lazy Tensor Core: References: `205ae574c0/torch_xla/core/xla_model.py (L852-L900)` `205ae574c0/torch_xla/utils/closures.py (L7-L83)` CC: @wconstab @JackCaoG @Krovatkin Pull Request resolved: https://github.com/pytorch/pytorch/pull/84300 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-09-06 20:55:34 +00:00
Edward Z. Yang	ad44670fa1	Back out "Revert D38984222: Don't introduce new overload for SymInt (#83628 )" (#84173 ) Also Back out "Revert D39075159: [acc_tensor] Use SymIntArrayRef for overloaded empty.memory_format's signature" Original commit changeset: dab4a9dba4fa Original commit changeset: dcaf16c037a9 Original Phabricator Diff: D38984222 Original Phabricator Diff: D39075159 Also update Metal registrations for C++ registration changes. Also update NNPI registration to account for tightened schema checking Differential Revision: [D39084762](https://our.internmc.facebook.com/intern/diff/D39084762/) NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D39084762/)! Pull Request resolved: https://github.com/pytorch/pytorch/pull/84173 Approved by: https://github.com/Krovatkin	2022-08-29 18:01:07 +00:00
PyTorch MergeBot	c7edcd6968	Revert "Don't introduce new overload for SymInt (#83628 )" This reverts commit `9790d90e4b`. Reverted https://github.com/pytorch/pytorch/pull/83628 on behalf of https://github.com/malfet due to Breaks internal builds, see D39076487	2022-08-27 01:23:17 +00:00
Edward Z. Yang	9790d90e4b	Don't introduce new overload for SymInt (#83628 ) Previously, we introduced new SymInt overloads for every function we wanted. This led to a lot of boilerplate, and also a lot of confusion about how the overloads needed to be implemented. This PR takes a simpler but more risky approach: just take the original function and changes its ints to SymInts. This is BC-breaking in the following ways: * The C++ API for registering implementations for aten operators will change from int64_t to SymInt whenever you make this change. Code generated registrations in PyTorch do not change as codegen handles the translation automatically, but manual registrations will need to follow the change. Typically, if you now accept a SymInt where you previously only took int64_t, you have to convert it back manually. This will definitely break XLA, see companion PR https://github.com/pytorch/xla/pull/3914 Note that not all dispatch keys get the automatic translation; all the composite keys and Meta keys are modified to take SymInt directly (because they should handle them directly), and so there are adjustments for this. This is not BC-breaking in the following ways: * The user facing C++ API remains compatible. Even if a function changes from int to SymInt, the default C++ binding still takes only ints. (e.g., at::empty(IntArrayRef, ...). To call with SymInts, you must call at::empty_symint instead. This involved adding two more signatures to CppSignatureGroup; in many cases I refactored code to iterate over all signatures in the group instead of hard-coding the two that previously existed. * This is TorchScript compatible; internally we treat SymInts as ints so there is no change to what happens at runtime in TorchScript. In particular, it's OK to reference an empty schema by its old type (using int types), as long as you're not doing string equality (which you shouldn't be), these parse to the same underyling type. Structure of the PR: * The general strategy of this PR is that, even when you write `SymInt` inside `native_functions.yaml`, sometimes, we will treat it as if it were an `int`. This idea pervades the codegen changes, where we have a translation from SymInt to c10::SymInt or int64_t, and this is controlled by a symint kwarg which I added and then audited all call sites to decide which I wanted. Here are some of the major places where we pick one or the other: * The C++ FunctionSchema representation represents `SymInt` as `int`. There are a few places we do need to know that we actually have a SymInt and we consult `real_type()` to get the real type in this case. In particular: * When we do schema validation of C++ operator registration, we must compare against true schema (as the C++ API will provide `c10::SymInt`, and this will only be accepted if the schema is `SymInt`. This is handled with cloneWithRealTypes before we check for schema differences. * In `toIValue` argument parsing, we parse against the true schema value. For backwards compatibility reasons, I do still accept ints in many places where Layout/SymInt/etc were expected. (Well, accepting int where SymInt is expected is not BC, it's just the right logic!) * In particular, because SymInt never shows up as type() in FunctionSchema, this means that we no longer need a dedicated Tag::SymInt. This is good, because SymInts never show up in mobile anyway. * Changes to functorch/aten are mostly about tracking changes to the C++ API registration convention. Additionally, since SymInt overloads no longer exist, registrations for SymInt implementations are deleted. In many cases, the old implementations did not properly support SymInts; I did not add any new functionality with this PR, but I did try to annotate with TODOs where this is work to do. Finally, because the signature of `native::` API changed from int to SymInt, I need to find alternative APIs for people who were directly calling these functions to call. Typically, I insert a new dispatch call when perf doesn't matter, or use `at::compositeexplicitautograd` namespace to handle other caes. * The change to `make_boxed_from_unboxed_functor.h` is so that we accept a plain IntList IValue anywhere a SymIntList is expected; these are read-only arguments so covariant typing is OK. * I change how unboxing logic works slightly. Previously, we interpret the C++ type for Layout/etc directly as IntType JIT type, which works well because the incoming IValue is tagged as an integer. Now, we interpret the C++ type for Layout as its true type, e.g., LayoutType (change to `jit_type.h`), but then we accept an int IValue for it anyway. This makes it symmetric with SymInt, where we interpret the C++ type as SymIntType, and then accept SymInt and int IValues for it. * I renamed the `empty.names` overload to `empty_names` to make it less confusing (I kept mixing it up with the real empty overload) * I deleted the `empty.SymInt` overload, which ended up killing a pile of functions. (This was originally a separate PR but the profiler expect test was giving me grief so I folded it in.) * I deleted the LazyDynamicOpsTest tests. These were failing after these changes, and I couldn't figure out why they used to be passing: they make use of `narrow_copy` which didn't actually support SymInts; they were immediately converted to ints. * I bashed LTC into working. The patches made here are not the end of the story. The big problem is that SymInt translates into Value, but what if you have a list of SymInt? This cannot be conveniently represented in the IR today, since variadic Values are not supported. To work around this, I translate SymInt[] into plain int[] (this is fine for tests because LTC dynamic shapes never actually worked); but this will need to be fixed for proper LTC SymInt support. The LTC codegen also looked somewhat questionable; I added comments based on my code reading. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83628 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2022-08-26 01:35:40 +00:00
Mario Lezcano	f5a3515083	Make linalg.inv composite of linalg.solve (#80074 ) The `getri` kernel calls inside `getrs` so we can do so explicitly ourselves and save ourselves from having to maintain an extra kernel. This way we just need to optimise `lu_factor` and `lu_solve` and `inv` will be as efficient as it can be, as it'll be choosing the best backend to perform the factorisation and the best backend (not necessarily the same) to perform the solve. Fixes https://github.com/pytorch/pytorch/issues/77498 The benchmarks: https://github.com/pytorch/pytorch/pull/80074#issuecomment-1164309071 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80074 Approved by: https://github.com/IvanYashchuk, https://github.com/albanD, https://github.com/malfet	2022-08-25 09:28:55 +00:00
PyTorch MergeBot	a7edf71360	Revert "Don't introduce new overload for SymInt (#83628 )" This reverts commit `8fae7027b3`. Reverted https://github.com/pytorch/pytorch/pull/83628 on behalf of https://github.com/malfet due to breaking internal builds, see https://www.internalfb.com/diff/D38984222	2022-08-25 00:49:40 +00:00
PyTorch MergeBot	5321bf52f2	Revert "Make linalg.inv composite of linalg.solve (#80074 )" This reverts commit `4737b33614`. Reverted https://github.com/pytorch/pytorch/pull/80074 on behalf of https://github.com/malfet due to Depends on the changes from https://github.com/pytorch/pytorch/pull/83628	2022-08-25 00:43:00 +00:00
Mario Lezcano	4737b33614	Make linalg.inv composite of linalg.solve (#80074 ) The `getri` kernel calls inside `getrs` so we can do so explicitly ourselves and save ourselves from having to maintain an extra kernel. This way we just need to optimise `lu_factor` and `lu_solve` and `inv` will be as efficient as it can be, as it'll be choosing the best backend to perform the factorisation and the best backend (not necessarily the same) to perform the solve. Fixes https://github.com/pytorch/pytorch/issues/77498 The benchmarks: https://github.com/pytorch/pytorch/pull/80074#issuecomment-1164309071 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80074 Approved by: https://github.com/IvanYashchuk, https://github.com/albanD, https://github.com/malfet	2022-08-24 15:18:56 +00:00
Edward Z. Yang	8fae7027b3	Don't introduce new overload for SymInt (#83628 ) Previously, we introduced new SymInt overloads for every function we wanted. This led to a lot of boilerplate, and also a lot of confusion about how the overloads needed to be implemented. This PR takes a simpler but more risky approach: just take the original function and changes its ints to SymInts. This is BC-breaking in the following ways: * The C++ API for registering implementations for aten operators will change from int64_t to SymInt whenever you make this change. Code generated registrations in PyTorch do not change as codegen handles the translation automatically, but manual registrations will need to follow the change. Typically, if you now accept a SymInt where you previously only took int64_t, you have to convert it back manually. This will definitely break XLA, see companion PR https://github.com/pytorch/xla/pull/3914 Note that not all dispatch keys get the automatic translation; all the composite keys and Meta keys are modified to take SymInt directly (because they should handle them directly), and so there are adjustments for this. This is not BC-breaking in the following ways: * The user facing C++ API remains compatible. Even if a function changes from int to SymInt, the default C++ binding still takes only ints. (e.g., at::empty(IntArrayRef, ...). To call with SymInts, you must call at::empty_symint instead. This involved adding two more signatures to CppSignatureGroup; in many cases I refactored code to iterate over all signatures in the group instead of hard-coding the two that previously existed. * This is TorchScript compatible; internally we treat SymInts as ints so there is no change to what happens at runtime in TorchScript. In particular, it's OK to reference an empty schema by its old type (using int types), as long as you're not doing string equality (which you shouldn't be), these parse to the same underyling type. Structure of the PR: * The general strategy of this PR is that, even when you write `SymInt` inside `native_functions.yaml`, sometimes, we will treat it as if it were an `int`. This idea pervades the codegen changes, where we have a translation from SymInt to c10::SymInt or int64_t, and this is controlled by a symint kwarg which I added and then audited all call sites to decide which I wanted. Here are some of the major places where we pick one or the other: * The C++ FunctionSchema representation represents `SymInt` as `int`. There are a few places we do need to know that we actually have a SymInt and we consult `real_type()` to get the real type in this case. In particular: * When we do schema validation of C++ operator registration, we must compare against true schema (as the C++ API will provide `c10::SymInt`, and this will only be accepted if the schema is `SymInt`. This is handled with cloneWithRealTypes before we check for schema differences. * In `toIValue` argument parsing, we parse against the true schema value. For backwards compatibility reasons, I do still accept ints in many places where Layout/SymInt/etc were expected. (Well, accepting int where SymInt is expected is not BC, it's just the right logic!) * In particular, because SymInt never shows up as type() in FunctionSchema, this means that we no longer need a dedicated Tag::SymInt. This is good, because SymInts never show up in mobile anyway. * Changes to functorch/aten are mostly about tracking changes to the C++ API registration convention. Additionally, since SymInt overloads no longer exist, registrations for SymInt implementations are deleted. In many cases, the old implementations did not properly support SymInts; I did not add any new functionality with this PR, but I did try to annotate with TODOs where this is work to do. Finally, because the signature of `native::` API changed from int to SymInt, I need to find alternative APIs for people who were directly calling these functions to call. Typically, I insert a new dispatch call when perf doesn't matter, or use `at::compositeexplicitautograd` namespace to handle other caes. * The change to `make_boxed_from_unboxed_functor.h` is so that we accept a plain IntList IValue anywhere a SymIntList is expected; these are read-only arguments so covariant typing is OK. * I change how unboxing logic works slightly. Previously, we interpret the C++ type for Layout/etc directly as IntType JIT type, which works well because the incoming IValue is tagged as an integer. Now, we interpret the C++ type for Layout as its true type, e.g., LayoutType (change to `jit_type.h`), but then we accept an int IValue for it anyway. This makes it symmetric with SymInt, where we interpret the C++ type as SymIntType, and then accept SymInt and int IValues for it. * I renamed the `empty.names` overload to `empty_names` to make it less confusing (I kept mixing it up with the real empty overload) * I deleted the `empty.SymInt` overload, which ended up killing a pile of functions. (This was originally a separate PR but the profiler expect test was giving me grief so I folded it in.) * I deleted the LazyDynamicOpsTest tests. These were failing after these changes, and I couldn't figure out why they used to be passing: they make use of `narrow_copy` which didn't actually support SymInts; they were immediately converted to ints. * I bashed LTC into working. The patches made here are not the end of the story. The big problem is that SymInt translates into Value, but what if you have a list of SymInt? This cannot be conveniently represented in the IR today, since variadic Values are not supported. To work around this, I translate SymInt[] into plain int[] (this is fine for tests because LTC dynamic shapes never actually worked); but this will need to be fixed for proper LTC SymInt support. The LTC codegen also looked somewhat questionable; I added comments based on my code reading. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/83628 Approved by: https://github.com/albanD, https://github.com/bdhirsh	2022-08-23 22:04:07 +00:00
Wonjoo Lee	1e4383f756	Add lazy shape inference for cholesky op (#83720 ) PyTorch/XLA companion PR: https://github.com/pytorch/xla/pull/3907 --- Add lazy shape inference for cholesky op Pull Request resolved: https://github.com/pytorch/pytorch/pull/83720 Approved by: https://github.com/JackCaoG	2022-08-22 22:52:10 +00:00
Nikolay Korovaiko	07d0c9ec75	make sym sizes be computed lazily (#82233 ) ### Description Creating size nodes proactively for each tensor is leading to increased memory pressure as hold strong pointers to tensor data. ### Issue [<!-- Link to Issue ticket or RFP -->](https://github.com/pytorch/pytorch/issues/80942) Creating ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82233 Approved by: https://github.com/wconstab	2022-08-22 16:41:49 +00:00
Wonjoo Lee	0ff929f487	Add lazy shape inference for take op (#82679 ) Add lazy shape inference for take op --- Companion PR on XLA's side: https://github.com/pytorch/xla/pull/3818 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82679 Approved by: https://github.com/JackCaoG	2022-08-19 03:51:15 +00:00
Milad Mohammadi	72963bbae9	Update isDynamic api to align with is_symbolic API (#83415 ) Downstream #https://github.com/pytorch/xla/pull/3888 Pull Request resolved: https://github.com/pytorch/pytorch/pull/83415 Approved by: https://github.com/Krovatkin	2022-08-18 22:53:19 +00:00
JackCaoG	cd0ab154b5	Handle python frame is empty in GetPythonFrames (#83643 ) Fixes https://github.com/pytorch/xla/issues/3900 and https://github.com/pytorch/xla/issues/3795 for pytorch/xla when `XLA_IR_DEBUG=1` Pull Request resolved: https://github.com/pytorch/pytorch/pull/83643 Approved by: https://github.com/Krovatkin	2022-08-18 16:36:54 +00:00
Brian Hirsh	1665715cb0	add sym_strides() function, use in fake/proxy tensors (#81300 ) Add `TensorImpl::sym_strides`, bind it to python with `torch.ops.aten.sym_strides`, and use it in `ProxyTensor` and `FakeTensor`. Before, `ProxyTensor` was generating `ProxySymInt`'s for the sizes, but not for the strides. Internally we still represent strides with a `SymIntArrayRef` though, so I ran into some weird issues where sizes were showing up as `ProxySymInt`, but strides were `PySymInt`'s. Differential Revision: [D38594558](https://our.internmc.facebook.com/intern/diff/D38594558) Pull Request resolved: https://github.com/pytorch/pytorch/pull/81300 Approved by: https://github.com/ezyang	2022-08-16 14:31:27 +00:00
Milad Mohammadi	13e2a0a048	Add `getDynamicValue` to `dynamic_ir` (#82188 ) Add `getDynamicValue` to `dynamic_ir`. This is a precondition to support https://github.com/pytorch/xla/issues/3759 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82188 Approved by: https://github.com/Krovatkin	2022-08-15 19:48:26 +00:00
Bin Bao	7b39406526	[LTC] Pass a BackendDevice parameter into GetIrValueForScalarFromCodegen (#82970 ) Summary: Currently GetIrValueForScalarFromCodegen uses CPU as the default backend device for scalars, but we should make it a backend-dependent decision. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82970 Approved by: https://github.com/Krovatkin, https://github.com/JackCaoG	2022-08-10 03:59:25 +00:00
Kurt Mohler	eb0e30e0bc	Enable `dim=None` for `torch.std` (#81845 ) Part of #29137 BC Breaking Note This PR breaks C++ API backward compatibility for `at::std`. A call that has argument types `at::std(Tensor, OptionalIntArrayRef, int64_t, bool)` used to resolve to the `std.correction` overload, but now it resolves to the `std.dim` overload. In order to call the `std.correction` overload, the `int64_t` argument can be wrapped in a `c10::optional`, so that the call has the form `at::std(Tensor, OptionalIntArrayRef, optional<int64_t>, bool)`. The same is true for the corresponding arguments of the `std.out` and `std.correction_out` overloads of `at::std_out`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81845 Approved by: https://github.com/albanD	2022-08-04 01:49:13 +00:00
Edward Z. Yang	df69660832	Revert "Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )"" (#82599 ) This reverts commit `532b8a9e00`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/82599 Approved by: https://github.com/albanD	2022-08-02 19:37:02 +00:00
PyTorch MergeBot	532b8a9e00	Revert "Add a lint rule for torch/csrc/util/pybind.h include (#82552 )" This reverts commit `9465c0e0b5`. Reverted https://github.com/pytorch/pytorch/pull/82552 on behalf of https://github.com/zengk95 due to This seems to be breaking windows binary wheels	2022-08-01 20:25:35 +00:00
Edward Z. Yang	9465c0e0b5	Add a lint rule for torch/csrc/util/pybind.h include (#82552 ) We define specializations for pybind11 defined templates (in particular, PYBIND11_DECLARE_HOLDER_TYPE) and consequently it is important that these specializations always be #include'd when making use of pybind11 templates whose behavior depends on these specializations, otherwise we can cause an ODR violation. The easiest way to ensure that all the specializations are always loaded is to designate a header (in this case, torch/csrc/util/pybind.h) that ensures the specializations are defined, and then add a lint to ensure this header is included whenever pybind11 headers are included. The existing grep linter didn't have enough knobs to do this conveniently, so I added some features. I'm open to suggestions for how to structure the features better. The main changes: - Added an --allowlist-pattern flag, which turns off the grep lint if some other line exists. This is used to stop the grep lint from complaining about pybind11 includes if the util include already exists. - Added --match-first-only flag, which lets grep only match against the first matching line. This is because, even if there are multiple includes that are problematic, I only need to fix one of them. We don't /really/ need this, but when I was running lintrunner -a to fixup the preexisting codebase it was annoying without this, as the lintrunner overall driver fails if there are multiple edits on the same file. I excluded any files that didn't otherwise have a dependency on torch/ATen, this was mostly caffe2 and the valgrind wrapper compat bindings. Note the grep replacement is kind of crappy, but clang-tidy lint cleaned it up in most cases. See also https://github.com/pybind/pybind11/issues/4099 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82552 Approved by: https://github.com/albanD	2022-08-01 17:16:58 +00:00
Edward Z. Yang	a9320e6d96	Delete SymInt::data() in favor of as_int_unchecked() (#82477 ) I audited all the sites while I was at it, and marked a few suspicious ones. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82477 Approved by: https://github.com/Chillee	2022-08-01 15:07:22 +00:00
Edward Z. Yang	50e8abbcad	Change SymIntNode into an intrusive pointer (#82548 ) This will make the pointer type a single word, which is important for packing it into an int64_t This time, this diff doesn't segfault when you build with DEBUG mode; more details at https://github.com/pybind/pybind11/issues/4099 Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82548 Approved by: https://github.com/albanD	2022-08-01 15:07:21 +00:00
PyTorch MergeBot	3b9cbb1738	Revert "Change SymIntNode into an intrusive pointer (#82432 )" This reverts commit `7be44f8158`. Reverted https://github.com/pytorch/pytorch/pull/82432 on behalf of https://github.com/ezyang due to segfaults on test but not caught in CI	2022-07-29 20:08:59 +00:00
Edward Z. Yang	7be44f8158	Change SymIntNode into an intrusive pointer (#82432 ) This will make the pointer type a single word, which is important for packing it into an int64_t Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82432 Approved by: https://github.com/albanD, https://github.com/Krovatkin	2022-07-29 17:32:54 +00:00
JackCaoG	0bb467de69	Add shapefn for selu and adaptive_avgpool3d (#82297 ) Add shapefn for selu Pull Request resolved: https://github.com/pytorch/pytorch/pull/82297 Approved by: https://github.com/Krovatkin	2022-07-28 23:50:27 +00:00
Edward Z. Yang	34bdd46e6e	Rename shared_ptr<SymIntNodeImpl> to SymIntNode (#82355 ) Makes code a lot more compact! It also makes it possible to swap out the shared ptr implementation, which I am about to do next. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82355 Approved by: https://github.com/Krovatkin	2022-07-28 18:27:45 +00:00
Edward Z. Yang	fd5ac1e6b5	Rename SymbolicIntNode to SymIntNodeImpl (#82350 ) Done via ``` git grep -l 'SymbolicIntNode' \| xargs sed -i 's/SymbolicIntNode/SymIntNodeImpl/g' ``` Reasoning for the change: * Sym is shorter than Symbolic, and consistent with SymInt * You usually will deal in shared_ptr<...>, so we're going to reserve the shorter name (SymIntNode) for the shared pointer. But I don't want to update the Python name, so afterwards I ran ``` git grep -l _C.SymIntNodeImpl \| xargs sed -i 's/_C.SymIntNodeImpl/_C.SymIntNode/' ``` and manually fixed up the binding code Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82350 Approved by: https://github.com/Krovatkin	2022-07-28 18:27:45 +00:00
albanD	4b7de26556	Fix C API to be compatible with latest 3.11 beta (#81242 ) Based off https://github.com/pytorch/pytorch/pull/80511 with extra changes: - Update pybind to the latest release as it contains some needed fixes - Extend the compat header to do reduce changes in code Pull Request resolved: https://github.com/pytorch/pytorch/pull/81242 Approved by: https://github.com/malfet, https://github.com/mattip	2022-07-27 08:37:10 +00:00
Nikolay Korovaiko	d2c47d559c	Revert "Revert "Enabling SymInt in autograd; take 3 (#81145 )"" ; make sure is_intlist checks for symintnodes (#82189 ) ### Description <!-- What did you change and why was it needed? --> ### Issue <!-- Link to Issue ticket or RFP --> ### Testing <!-- How did you test your change? --> Pull Request resolved: https://github.com/pytorch/pytorch/pull/82189 Approved by: https://github.com/ezyang	2022-07-26 20:47:11 +00:00
Steven Krawczyk	c15924969e	Add shape fn for isnan (#82162 ) ### Description Add shape fn for isnan ### Issue This is for PyTorch/XLA Lazy code gen ### Testing Ran `BUILD_CPP_TESTS=0 python setup.py install` without error In python shell, ran: ``` >>> import torch >>> torch.isnan(torch.tensor([1, float('nan'), 2])) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/82162 Approved by: https://github.com/Krovatkin	2022-07-26 18:30:23 +00:00
Will Constable	4f34cd6d1e	Replace all CHECK_ and DCHECK_ with TORCH_* macros (#82032 ) Avoid exposing defines that conflict with google logging, since this blocks external usage of libtorch in certain cases. All the 'interesting' changes should be in these two files, and the rest should just be mechanical changes via sed. c10/util/logging_is_not_google_glog.h c10/util/logging_is_google_glog.h Fixes https://github.com/pytorch/pytorch/issues/81415 cc @miladm @malfet Pull Request resolved: https://github.com/pytorch/pytorch/pull/82032 Approved by: https://github.com/soumith, https://github.com/miladm	2022-07-26 01:20:44 +00:00
Milad Mohammadi	1a6329209c	Lazy shape symbolic const (#81718 ) Fixes #81710 Pull Request resolved: https://github.com/pytorch/pytorch/pull/81718 Approved by: https://github.com/JackCaoG, https://github.com/Krovatkin	2022-07-22 17:35:36 +00:00
Brian Hirsh	38023ff371	fix LTC usage of torch.tensor ctr, add test (#81928 ) switching the tensor constructor to use `lift_fresh` instead of `lift` silently broken LTC, because we didn't have a test for it. I added one here. The fix is that LTC now also needs a kernel for `lift_fresh`. The way this error manifests is that if you call `torch.tensor(..., device='lazy')`, we expect the result to be wrapped up a functional wrapper, and it wasn't - so calling any view ops on the tensor will now break. That wrapping is supposed to happen in `lift_fresh` now. cc @antoniojkim Pull Request resolved: https://github.com/pytorch/pytorch/pull/81928 Approved by: https://github.com/antoniojkim, https://github.com/wconstab	2022-07-22 13:57:53 +00:00
PyTorch MergeBot	c078476eb0	Revert "Enabling SymInt in autograd; take 3 (#81145 )" This reverts commit `032facd6e6`. Reverted https://github.com/pytorch/pytorch/pull/81145 on behalf of https://github.com/jeanschmidt due to breaking internal builds	2022-07-22 11:15:20 +00:00
Nikolay Korovaiko	032facd6e6	Enabling SymInt in autograd; take 3 (#81145 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/81145 Approved by: https://github.com/ezyang	2022-07-22 00:14:50 +00:00
JackCaoG	5fa64ed9dc	Add shape fn for hardswish and backward (#81827 ) This is for PyTorch/XLA Lazy code gen Pull Request resolved: https://github.com/pytorch/pytorch/pull/81827 Approved by: https://github.com/Krovatkin	2022-07-21 22:35:43 +00:00
Nikolay Korovaiko	4aac42cc98	[LT] Add a new backend interface [DUP of the original] (#81662 ) This is a dup of https://github.com/pytorch/pytorch/pull/76517 which is failing because Jiewen needs to resign the CLA. Summary: This commit introduces a new set of BackendImplInterface: GetDefaultDeviceOrdinal and SetDefaultDeviceOrdinal. It allows backend to specify their own default device, e.g, 1 for XLA and 0 for CUDA/CPU. Test Plan: ./build/bin/test_lazy --gtest_filter=BackendDeviceTest.* ghstack-source-id: b4adfef49253e51bffbbf40d356188a92c98994d Pull Request resolved: https://github.com/pytorch/pytorch/pull/76517 Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/81662 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-07-19 01:15:22 +00:00
lezcano	b5b9db9f84	Make `kl_div` a composite function. (#80334 ) Benchmarks: https://github.com/pytorch/pytorch/pull/80334#issuecomment-1167229285 Fixes https://github.com/pytorch/pytorch/issues/80158 Fixes https://github.com/pytorch/pytorch/issues/78867 Fixes https://github.com/pytorch/pytorch/issues/69230 Supersedes https://github.com/pytorch/pytorch/pull/79007 Supersedes https://github.com/pytorch/pytorch/pull/69212 Supersedes https://github.com/pytorch/pytorch/pull/19659 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80334 Approved by: https://github.com/ezyang	2022-07-13 20:07:36 +00:00
Nikolay Korovaiko	8389ccbcd8	reinstate size and shape returning symints (#79560 ) This PR redirects `size` and `.shape` to call `sym_sizes` Pull Request resolved: https://github.com/pytorch/pytorch/pull/79560 Approved by: https://github.com/Chillee	2022-07-08 01:17:33 +00:00
Nikita Shulga	2beb57a823	Add `-Werror=non-virtual-dtor` (reland) (#81012 ) This PR relands #80584, but instead of adding suppression in CMakeLists.txt suppresses it directly in `llvm_codegen.cpp` and just for a single header. In general, it's better to avoid `set_target_properties` pattern for suppressing warnings, as it makes build brittle and hard to debug/understand Test plan: wait for `ciflow/binaries_wheel` to finish Pull Request resolved: https://github.com/pytorch/pytorch/pull/81012 Approved by: https://github.com/huydhn, https://github.com/kit1980	2022-07-07 05:33:55 +00:00
Wonjoo Lee	f98bfce6a1	Add const modifier for lazy shape inference for boolean logic ops (#79999 ) Follow up to https://github.com/pytorch/pytorch/pull/78004. --- Trying to fully codegen these boolean logic ops on pt/xla's side, saw errors: ``` Exception: Missing shape inference function. Please add declare this function in /workspace/pytorch/torch/csrc/lazy/core/shape_inference.h: and implement it in the the corresponding shape_inference.cpp file. TORCH_API std::vector<torch::lazy::Shape> compute_shape_logical_and(const at::Tensor & self, const at::Tensor & other); TORCH_API std::vector<torch::lazy::Shape> compute_shape_logical_not(const at::Tensor & self); TORCH_API std::vector<torch::lazy::Shape> compute_shape_logical_or(const at::Tensor & self, const at::Tensor & other); TORCH_API std::vector<torch::lazy::Shape> compute_shape_logical_xor(const at::Tensor & self, const at::Tensor & other); Failed to generate lazy files: ['python', '/workspace/pytorch/xla/scripts/gen_lazy_tensor.py'] ``` Add const modifier for lazy shape inference for boolean logic ops Pull Request resolved: https://github.com/pytorch/pytorch/pull/79999 Approved by: https://github.com/wconstab	2022-07-07 01:47:08 +00:00
PyTorch MergeBot	0491c10a63	Revert "Add -Werror=non-virtual-dtor (#80584 )" This reverts commit `7670035862`. Reverted https://github.com/pytorch/pytorch/pull/80584 on behalf of https://github.com/malfet due to Broke nighly builds, see https://github.com/pytorch/pytorch/runs/7209779559?check_suite_focus=true	2022-07-06 22:26:59 +00:00
Nikolay Korovaiko	d9ff56ccc0	python bindings for create_metric_report (#79679 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/79679 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-07-06 20:06:17 +00:00
Nikolay Korovaiko	4b54a946a8	add MetricReport API (#79678 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/79678 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-07-06 20:06:17 +00:00
PyTorch MergeBot	f2c8557521	Revert "Make `kl_div` a composite function. (#80334 )" This reverts commit `828c787ea9`. Reverted https://github.com/pytorch/pytorch/pull/80334 on behalf of https://github.com/ezyang due to doesn't work with xla	2022-07-06 17:51:06 +00:00
Brian Hirsh	960758b0b7	fix overload ambiguity with functional ops; fix _foreach op grouping (#80556 ) This should fix the last issue that @anijain2305 hit when running ResNet with TorchDynamo <> functionalization. Today if you try to call an `OpOverloadPacket` from python with some arguments, we will use the types of those arguments to perform overload resolution. With some functional variants of ops, this can be ambiguous. Today this affects just one op: `_fused_moving_avg_obs_fq_helper`, although it would potentially affect e.g. `native_batch_norm` in the future. Example: ``` # There are technically two overloads: # torch.ops.aten._fused_moving_avg_obs_fq_helper.default (returns 2 argument, mutates 4 of its inputs inplace) # torch.ops.aten._fused_moving_avg_obs_fq_helper.functional (returns 6 argument, mutates none of its inputs) # We pick the wrong one - no way to know that we should pick the functional one, just from the call site. outs = torch.ops.aten._fused_moving_avg_obs_fq_helper(a, a, a, a, a, a, a, 1.0, 0, 1, 0) # raises an error - tries to call the overload with only 2 returns return _fused_moving_avg_obs_fq_helper_functional[5] ``` Specifically, functionalization will bake `_fused_moving_avg_obs_fq_helper.functional` into the graph, but when AOTAutograd tries to compile with TorchScript, it needs to remove the overload name (TS doesn't know how to parse overload names directly, so we need to remove the overload name and let it infer the right overload at runtime later- so it picks the wrong one). The situation is pretty similar to inplace; `ops.aten.add` and `ops.aten.add_` represent two different `OverloadPacket` objects; they can't be overloads of the same op, because their schemas would be ambiguous - the alias annotations are different, but that isn't enough to disambiguate). In this PR, I try to fix the situation in a pretty similar way to how we handle `inplace` in the data model: `inplace` ops get their own base operator name, but they are represented as a flag inside of `BaseOperatorName` in the data model. Two other important changes that I made as part of this PR: (1) Originally, there were ~100 different `_functional` operators: e.g. we had operators named `resize.functional` and `zero.functional`. The `_functional` bit isn't actually necessary in most cases: it's only necessary for operators that also* have a `SchemaKind.mutable` variant, where `_fused_moving_avg_obs_fq_helper` is the only op that fits that description today. So I removed the unnecessary notion of "functional" from those other ops. I also added a bunch of assertions to force this restriction. I think that makes more sense in the long run, because it eliminates an unnecessary difference in the model. E.g. we don't have `add_.Tensor` and `add.Tensor_functional`. We just have `add_.Tensor` and `add.Tensor`. (2) I noticed that we actually still weren't pairing up a bunch of `_foreach` operators correctly, because their input arguments were different (`self` vs. `tensors`). Since they're private API's, I went ahead and changed the argument names directly so they get matched up. Before this PR, we were generating a separate `_foreach_add` and `_foreach_add.functional` variant in a bunch of cases, that really did the same thing (but happened to have a different name for the first argument). Pull Request resolved: https://github.com/pytorch/pytorch/pull/80556 Approved by: https://github.com/ezyang, https://github.com/albanD	2022-07-06 12:45:11 +00:00
Nikolay Korovaiko	0a5123a752	Revert "Revert "Add support for directly passing symint to empty"" (#79954 ) Relanding https://github.com/Krovatkin/pytorch/pull/new/krovatkin/symint_empty Pull Request resolved: https://github.com/pytorch/pytorch/pull/79954 Approved by: https://github.com/Chillee, https://github.com/kulinseth	2022-07-04 20:08:55 +00:00
lezcano	828c787ea9	Make `kl_div` a composite function. (#80334 ) Benchmarks: https://github.com/pytorch/pytorch/pull/80334#issuecomment-1167229285 Fixes https://github.com/pytorch/pytorch/issues/80158 Fixes https://github.com/pytorch/pytorch/issues/78867 Fixes https://github.com/pytorch/pytorch/issues/69230 Supersedes https://github.com/pytorch/pytorch/pull/79007 Supersedes https://github.com/pytorch/pytorch/pull/69212 Supersedes https://github.com/pytorch/pytorch/pull/19659 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80334 Approved by: https://github.com/ezyang	2022-07-04 19:33:43 +00:00
Huy Do	7670035862	Add -Werror=non-virtual-dtor (#80584 ) This also resolves https://github.com/pytorch/pytorch/pull/77323 Pull Request resolved: https://github.com/pytorch/pytorch/pull/80584 Approved by: https://github.com/seemethere	2022-07-04 16:54:47 +00:00
Wenzhe Xue	e40d95416f	[LT] fix lazy tensor tutorial and test for cpu only run (#79744 ) `cuda_kwargs` is undefined when LTC_TS_CUDA is not used. This fixed the issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79744 Approved by: https://github.com/wconstab	2022-06-30 21:19:40 +00:00
Milad Mohammadi	0922cc024e	Added support for `expand` in LazyTensor shape inference (#77830 ) Added support for `expand` in LazyTensor shape inference Fixes #77831 --- Blockers: - [x] https://github.com/pytorch/pytorch/issues/77880 - [x] https://github.com/pytorch/pytorch/issues/77882 Pull Request resolved: https://github.com/pytorch/pytorch/pull/77830 Approved by: https://github.com/Krovatkin	2022-06-29 05:27:06 +00:00
jjsjann123	9e86796fe3	simple c10 implementation for std::call_once (#78051 ) A long standing bug on std::call_once: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66146 It could hang during re-entry after an exception handling. Added a c10 implementation yielding a bulky mutex. Not the most efficient thing but at least it shouldn't hang. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78051 Approved by: https://github.com/albanD	2022-06-28 15:47:03 +00:00
Brian Hirsh	c2d395cf8e	functionalization <> LTC integration (take 3) (#80251 ) new PR for https://github.com/pytorch/pytorch/pull/75527. It looks like there's a bug in the windows CI scripts that was causing flaky failures, that disappear when I create a new PR. example failure: https://github.com/pytorch/pytorch/runs/6999272635?check_suite_focus=true Pull Request resolved: https://github.com/pytorch/pytorch/pull/80251 Approved by: https://github.com/wconstab	2022-06-26 23:10:21 +00:00
Henry Tu	02093da36c	Autogen native_batch_norm and native_batch_norm_backward (#79637 ) This PR makes the `native_batch_norm` and `native_batch_norm_backward` ops autogen, and implements their respective shape inference functions. Previously, these two ops were manually implemented. cc: @ke1337 @antoniojkim @wconstab @desertfire Pull Request resolved: https://github.com/pytorch/pytorch/pull/79637 Approved by: https://github.com/Gamrix, https://github.com/desertfire	2022-06-24 23:29:33 +00:00
Henry Tu	fc6b645fe2	Prevent out of bounds access to null LTC operands (#80060 ) When constructing a lazy::Node, [null operands (optional values that aren't included) are dropped](`30fb2c4aba/torch/csrc/lazy/core/ir.cpp (L82-L84)`), so it’s possible for the stored operand list to be a different length than the one that was passed into the constructor. This can become a problem during the call to `CanBeReused` in the autogen `LazyIr.h` code. For example: ``` bool CanBeReused(const torch::lazy::Value& input, const c10::optional<torch::lazy::Value>& weight, const c10::optional<torch::lazy::Value>& bias, const c10::optional<torch::lazy::Value>& running_mean, const c10::optional<torch::lazy::Value>& running_var, const bool& training, const double& momentum, const double& eps) const { size_t i = 0; std::cout << "Num operands: " << operands().size() << std::endl; return (operand(i++) == input && operand(i++) == weight.value_or(kNullValue) && operand(i++) == bias.value_or(kNullValue) && operand(i++) == running_mean.value_or(kNullValue) && operand(i++) == running_var.value_or(kNullValue) && this->training == training && this->momentum == momentum && this->eps == eps); } ``` Here we operate under the assumption that the number of operands stored in the `lazy::Node` is equal to the number of operands originally passed into the constructor. Recall that we drop any null operands though, so it’s possible to inadvertently access an invalid index at this point. This PR addresses this issue by adding a new nullable_operand method which falls back to a null value instead of producing an index error when going out of bounds. This should solve the issue found at https://github.com/pytorch/pytorch/pull/79637#issuecomment-1162044545 cc: @antoniojkim @ke1337 @wconstab @desertfire Pull Request resolved: https://github.com/pytorch/pytorch/pull/80060 Approved by: https://github.com/desertfire	2022-06-24 20:39:37 +00:00
Nikolay Korovaiko	7bf1b1dc31	switch computation to computationPtr (#79491 ) Make `ExecuteComputation` take a pointer to `Computation` so it can be overridden and extended. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79491 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-06-23 23:48:16 +00:00
Nikolay Korovaiko	efc7343743	Revert "Revert "Put symint overloads on a different name"" (#79680 ) This relands https://github.com/pytorch/pytorch/pull/79281 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79680 Approved by: https://github.com/malfet	2022-06-21 07:06:33 +00:00
Edward Z. Yang	f7ee061638	Wconstab/reland pysymint (#79795 ) rebased https://github.com/pytorch/pytorch/pull/79617/ to see if issues are reproducible. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79795 Approved by: https://github.com/malfet	2022-06-20 22:55:06 +00:00
lezcano	16f30b494c	Make l1_loss composite Fixing the forward AD for `sgn` in the next PR of this stack uncovered a number of issues with the derivatives of `l1_loss`. Upon inspection, `l1_loss` was just implemented as a composite function, but it was not differentiable. This PR makes it a fully differentiable function. As a side note, `l1_loss_out` was incorrect in a number of ways. Even more, it is not exposed to the public as `F.l1_loss` does not accept an `out=` parameter. As such it is not even tested. I wonder how useful is to have `out=` variants for loss functions if we don't expose them at all. Even more, I wonder how useful is to have `_out` variants for loss functions, given that their most normal use case is to return just a real number cc jbschlosser Pull Request resolved: https://github.com/pytorch/pytorch/pull/79804 Approved by: https://github.com/zou3519, https://github.com/malfet	2022-06-20 19:10:54 +00:00
PyTorch MergeBot	d4a9438786	Revert "Make l1_loss composite" This reverts commit `61a5c779bf`. Reverted https://github.com/pytorch/pytorch/pull/78257 on behalf of https://github.com/malfet due to This breaks executorch	2022-06-17 18:14:21 +00:00
PyTorch MergeBot	44436947bc	Revert "Reland PySymInt (#79617 )" This reverts commit `8ef6356f26`. Reverted https://github.com/pytorch/pytorch/pull/79617 on behalf of https://github.com/zengk95 due to this is breaking periodic jobs (and maybe pull) on trunk	2022-06-16 19:40:27 +00:00
Michael Andreas Dagitses	acd072967a	canonicalize includes of form <aten/src/ATen/...> Pull Request resolved: https://github.com/pytorch/pytorch/pull/78033 This was never intended to be supported. @override-unit-failures (Note: this ignores all push blocking failures!) Differential Revision: [D36567054](https://our.internmc.facebook.com/intern/diff/D36567054/) Approved by: https://github.com/kit1980	2022-06-16 17:46:45 +00:00
Nikolay Korovaiko	8ef6356f26	Reland PySymInt (#79617 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/79617 Approved by: https://github.com/Chillee	2022-06-16 04:18:06 +00:00
lezcano	61a5c779bf	Make l1_loss composite Fixing the forward AD for `sgn` in the next PR of this stack uncovered a number of issues with the derivatives of `l1_loss`. Upon inspection, `l1_loss` was just implemented as a composite function, but it was not differentiable. This PR makes it a fully differentiable function. As a side note, `l1_loss_out` was incorrect in a number of ways. Even more, it is not exposed to the public as `F.l1_loss` does not accept an `out=` parameter. As such it is not even tested. I wonder how useful is to have `out=` variants for loss functions if we don't expose them at all. Even more, I wonder how useful is to have `_out` variants for loss functions, given that their most normal use case is to return just a real number cc jbschlosser Pull Request resolved: https://github.com/pytorch/pytorch/pull/78257 Approved by: https://github.com/jbschlosser	2022-06-16 00:03:22 +00:00
PyTorch MergeBot	b9bb52d97b	Revert "Put symint overloads on a different name" This reverts commit `213a8fc992`. Reverted https://github.com/pytorch/pytorch/pull/79281 on behalf of https://github.com/bigfootjon due to Diff reverted internally	2022-06-15 17:15:21 +00:00
PyTorch MergeBot	b8db0a0475	Revert "Python Bindings for SymInts (#78135 )" This reverts commit `d332724071`. Reverted https://github.com/pytorch/pytorch/pull/78135 on behalf of https://github.com/ezyang due to broke torchvision tests	2022-06-15 13:52:14 +00:00
Nikolay Korovaiko	83e575c510	have a common interface to extract metadata from SizeNodes (#78088 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/78088 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-06-15 04:59:08 +00:00
John Clow	07a528cac7	Adding isDynamic Support to SizeNodes Pull Request resolved: https://github.com/pytorch/pytorch/pull/77917 Approved by: https://github.com/Krovatkin	2022-06-14 03:27:57 +00:00
John Clow	498a34224b	Adding helper function for getting DimensionNodes Pull Request resolved: https://github.com/pytorch/pytorch/pull/77916 Approved by: https://github.com/Krovatkin	2022-06-14 03:27:57 +00:00
Nikolay Korovaiko	d332724071	Python Bindings for SymInts (#78135 ) This PR adds support for `SymInt`s in python. Namely, * `THPVariable_size` now returns `sym_sizes()` * python arg parser is modified to parse PyObjects into ints and `SymbolicIntNode`s * pybind11 bindings for `SymbolicIntNode` are added, so size expressions can be traced * a large number of tests added to demonstrate how to implement python symints. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78135 Approved by: https://github.com/ezyang	2022-06-14 02:17:59 +00:00
Edward Z. Yang	213a8fc992	Put symint overloads on a different name Due to implicit conversion shenanigans, having both IntArrayRef and SymIntArrayRef overloads makes {} ambiguous. While we could fix this by making a single unified type that accepts all the overloads we want, an easier fix was to just push the SymIntArrayRef overload to its own name. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/79281 Approved by: https://github.com/suo	2022-06-12 14:36:39 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
Michael Suo	f551c22a20	[lint] preparatory changes for mass clang-format These were all the manual changes that were needed to preserve behavior across autoformatting. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78969 Approved by: https://github.com/ezyang	2022-06-06 23:49:45 +00:00
Antonio Kim	fe67dff82a	Deprecate `TSNodeLoweringInterface` (#78273 ) Fixes #78206 Deprecate `TSNodeLoweringInterface` and refactor lower functions into IR nodes. CC: @wconstab @desertfire Pull Request resolved: https://github.com/pytorch/pytorch/pull/78273 Approved by: https://github.com/wconstab	2022-05-31 18:09:12 +00:00
Brian Hirsh	5cc258ec9e	make block_diag composite compliant Pull Request resolved: https://github.com/pytorch/pytorch/pull/77716 Approved by: https://github.com/zou3519	2022-05-26 16:15:42 +00:00
Bin Bao	29189d2ba8	[LT] Add IR resuing support for manually-implemented ops Summary: Add CanBeReused methods for manually-implemented ops and replace MakeNode with ReuseOrMakeNode. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77616 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-05-26 16:04:47 +00:00
Wonjoo Lee	593d66e1b3	Add lazy shape inference for logical boolean ops (#78004 ) Add lazy shape inference for logical boolean ops - logical_and - logical_not - logical_or - logical_xor Uses helper functions defined at https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/ExpandUtils.h#L21 to infer the shape. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78004 Approved by: https://github.com/wconstab	2022-05-25 17:48:40 +00:00
Antonio Kim	02c4d877b4	Codegen Non-Native IR Nodes (#76535 ) Add codegen infrastructure to generate IR nodes for non-native ops. The proposed change is to add a `non_native` key to the `{backend}_native_functions.yaml` file that contains schema definitions similar to what is found in `native_functions.yaml`. e.g. ``` non_native: ... - func: expand(Tensor input, int[] size, bool is_scalar_expand) -> Tensor ... ``` these definitions are parsed into a `LazyIrSchema` that can be used for generating IR nodes using `GenLazyIR`. Fixes #74628 CC: @wconstab @desertfire @henrytwo Pull Request resolved: https://github.com/pytorch/pytorch/pull/76535 Approved by: https://github.com/wconstab	2022-05-24 19:29:23 +00:00
Nikolay Korovaiko	df1f9b9840	Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#77756 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/77756 Approved by: https://github.com/desertfire	2022-05-20 05:39:03 +00:00
Milad Mohammadi	e67284d9ee	Added support for `slogdet` in LazyTensor shape inference (#77904 ) Fixes https://github.com/pytorch/xla/pull/3576 Added support for `slogdet` in LazyTensor shape inference Pull Request resolved: https://github.com/pytorch/pytorch/pull/77904 Approved by: https://github.com/wconstab, https://github.com/JackCaoG	2022-05-20 01:34:56 +00:00
Milad Mohammadi	d6ae650738	Added support for `inverse` in LazyTensor shape inference (#77888 ) Fixes https://github.com/pytorch/xla/pull/3575 Added support for `inverse` in LazyTensor shape inference Pull Request resolved: https://github.com/pytorch/pytorch/pull/77888 Approved by: https://github.com/wconstab	2022-05-20 01:31:13 +00:00
Brian Hirsh	0161e9eb00	[test] attempt to functionalize ops with mutable positional-only args Pull Request resolved: https://github.com/pytorch/pytorch/pull/76320 Approved by: https://github.com/ezyang	2022-05-19 18:50:34 +00:00
Antonio Kim	55be35ae39	Fix 'Code below assumes there is at least one tensor arg' assumption (#76917 ) Previously when codegening ops like `zeros_` or `ones_` we'd hit a `Code below assumes there is at least one tensor arg error`. This check is not entirely correct which is what is causing the error to be thrown. There are ops like the ones mentioned that pass in a `device` parameter that can be used in place of the "first tensor". CC: @wconstab @desertfire @henrytwo @ke1337 Pull Request resolved: https://github.com/pytorch/pytorch/pull/76917 Approved by: https://github.com/desertfire	2022-05-18 17:58:47 +00:00
John Clow	73480bcbe0	Adding support for nonzero in LazyTensor shape functions Pull Request resolved: https://github.com/pytorch/pytorch/pull/77572 Approved by: https://github.com/Krovatkin	2022-05-18 17:32:48 +00:00
PyTorch MergeBot	e9d660c331	Revert "Revert "Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )""" This reverts commit `acf7136a52`. Reverted https://github.com/pytorch/pytorch/pull/77719 on behalf of https://github.com/suo	2022-05-18 05:06:50 +00:00
Edward Z. Yang	acf7136a52	Revert "Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )"" This reverts commit `c35bd8d423`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77719 Approved by: https://github.com/Chillee, https://github.com/malfet	2022-05-18 03:25:43 +00:00
PyTorch MergeBot	c35bd8d423	Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )" This reverts commit `fc4c3c9bc7`. Reverted https://github.com/pytorch/pytorch/pull/76836 on behalf of https://github.com/suo	2022-05-18 02:45:25 +00:00
Nikolay Korovaiko	fc4c3c9bc7	Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 ) LTC Tensors now create real IR (SizeNode) for sym_sizes() in LTCTensorImpl.cpp. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76836 Approved by: https://github.com/ezyang	2022-05-18 00:40:42 +00:00
Bin Bao	25c6ebd12c	Revert "Revert "[LT] Codegen ReuseNode for supported ops"" Summary: Fixed a XLC build failure by generating an always-return-false default CanBeReused method. This reverts commit `3cade9d454`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77513 Approved by: https://github.com/alanwaketan	2022-05-16 20:14:42 +00:00
Antonio Kim	c218263207	[LTC] Mark Step Indicator (#76840 ) Proposed solution for #76826 Basically adds a context which is only "active" when called in mark step. Any backend can then use this to check if within a mark step context. I've also added an example warning in the TS backend so that we now see the following: ```python >>> import torch >>> import torch._lazy >>> import torch._lazy.ts_backend >>> torch._lazy.ts_backend.init() >>> a = torch.tensor([1, 2, 3, 4], device="lazy") >>> b = torch.tensor([5, 6, 7, 8], device="lazy") >>> c = a + b >>> c [W ts_backend_impl.cpp:187] Compile outside of mark step tensor([ 6, 8, 10, 12], device='lazy:0') >>> d = a * b >>> torch._lazy.mark_step() >>> d tensor([ 5, 12, 21, 32], device='lazy:0') ``` Though it was mainly for example and will be happy to remove if this warning is not desired. Fixes #76826 CC: @wconstab @desertfire @henrytwo @ke1337 Pull Request resolved: https://github.com/pytorch/pytorch/pull/76840 Approved by: https://github.com/desertfire	2022-05-16 15:47:29 +00:00
PyTorch MergeBot	3cade9d454	Revert "[LT] Codegen ReuseNode for supported ops" This reverts commit `6066e5929f`. Reverted https://github.com/pytorch/pytorch/pull/76738 on behalf of https://github.com/malfet	2022-05-14 00:33:10 +00:00
Bin Bao	6066e5929f	[LT] Codegen ReuseNode for supported ops Summary: 1. Update the codegen script to add a TrieCache lookup (ReuseNode) before creating a new IR node. The following is an example generated code, ``` at::Tensor LazyNativeFunctions::add(const at::Tensor & self, const at::Tensor & other, const at::Scalar & alpha) { ... torch::lazy::NodePtr node = torch::lazy::ReuseNode<AddTensor>(lazy_self->GetIrValue(), lazy_other->GetIrValue(), node_alpha); if (!node) { auto out_meta = at::meta::add(self, other, alpha); std::vector<Shape> shapes{Shape(out_meta.scalar_type(), out_meta.sizes().vec())}; TORCH_INTERNAL_ASSERT(shapes.size() == 1); if(symbolicShapeEnabled()){ std::vector<jit::IValue> inputs = { self, other, alpha }; char* schema_str = "aten::add.Tensor(Tensor self, Tensor other, *, Scalar alpha=1) -> Tensor"; applySymbolicShapesOnLT(schema_str, inputs, shapes); } node = torch::lazy::MakeNode<AddTensor>(lazy_self->GetIrValue(), lazy_other->GetIrValue(), node_alpha, std::move(shapes)); CacheNode(node); } ... } ``` 2. TrieCache lookup depends on each IR node subclass to provide its own comparison function. The following is an example generated code, ``` bool CanBeReused(const torch::lazy::Value& self, const torch::lazy::Value& other, const torch::lazy::Value& alpha) const { size_t i = 0; return (operand(i++) == self && operand(i++) == other && operand(i++) == alpha); } ``` 3. DeviceData is specially handled. 4. Non-codegen op changes are coming a separate PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76738 Approved by: https://github.com/JackCaoG, https://github.com/wconstab	2022-05-13 19:13:58 +00:00
Edward Z. Yang	4ef6407906	Remove unnecessary ifdef, fixes fbcode build Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/77376 Approved by: https://github.com/atalman, https://github.com/malfet	2022-05-12 15:30:01 -07:00
JackCaoG	e36a8c1f13	Lazy codegen change for xla (#76717 ) Codegen change to enable PyTorch/XLA to generate the first op in https://github.com/pytorch/xla/pull/3544. @bdhirsh @wconstab Pull Request resolved: https://github.com/pytorch/pytorch/pull/76717 Approved by: https://github.com/Krovatkin	2022-05-12 17:04:04 +00:00
Nikolay Korovaiko	99339fddd9	move SymInt and SymIntArrayRef to c10/core (#77009 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/77009 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-05-11 16:21:31 +00:00
Bin Bao	8f5cdc6d5d	Revert "Revert "[LT] Store OpKind for each IR subclass in a static field"" Summary: Re-land https://github.com/pytorch/pytorch/pull/76711 by fixing internal build errors. Generate class-level opkind as a static method instead of a static member. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77102 Approved by: https://github.com/wconstab, https://github.com/JackCaoG, https://github.com/antoniojkim	2022-05-11 12:27:05 +00:00
Edward Z. Yang	2896f81dd4	Consolidate customization contiguous/sizes policy into unified policy Prior to this PR, we had a mish-mash of ways of getting unconventional sizes/strides behavior: - In OSS (but not in fbcode), some methods are virtual and you can override them directly - There is a is_contiguous policy which is a bitfield tag that lets you toggle is_contiguous to error or hit a virtual method is_contiguous_custom if it is set. Ordinarily is_contiguous() is virtual and you can just override it, but this works EVEN IF is_contiguous() is non-virtual (e.g., in fbcode) - There is also a sizes policy which is the same idea but for sizes This PR unifies these mechanisms, and in doing so, eliminates the maybe virtual/not-virtualness of the methods in question. The primary downside of this change is that it is BC-breaking (but the BC break is very easy to fix!) The new scheme works like this: we have three levels of policy for sizes/strides (order matters). - The Default policy is a conventional dense tensor, where we use all of the built-in fields to directly represent the sizes/strides/numel/contiguity of the tensor, and it is possible to bypass virtual call entirely. - The CustomStrides policy represent tensors which have a custom notion of strides (most typically, that they don't support them), shunting strides() and is_contiguous() to virtual methods strides_custom() and is_contiguous_custom(). This INCLUDES handling for contiguity, since they typically go hand-in-hand (although the situation is murky with batched tensors). The default implementations of these functions raise errors saying the tensor doesn't support them. - The CustomSizes policy represent tensors which have a custom notion of sizes (the two notable examples are nested tensor, which doesn't have a representation of sizes in the conventional form, and XLA/LTC tensor, which synchronizes its sizes with an underlying compiler backend). This shunts sizes(), numel() and dim() (along with everything from strides) to _custom() variants. There is no special policy for erroring; instead, we just do a vcall and expect the virtual method to raise an exception (the performance hit from the vcall doesn't matter because you're about to raise a C++ exception anyway). The default implementations of all overridable functions are available at _default() which is helpful in some situations when you just want to do a "sync" and then run the conventional semantics. This PR could be extended further in two ways but I did not do them due to time constraints: - Ideally, all TENSORIMPL_MAYBE_VIRTUAL would be eliminated from TensorImpl, by using the same policy trick. - set_size and set_stride are still virtual; it's not entirely clear the same trick should be used here though as these methods are deprecated. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/77036 Approved by: https://github.com/bdhirsh	2022-05-11 00:23:07 +00:00
Nikita Vedeneev	afd8bd772c	`nn.functional.glu`: forward AD support (#77186 ) To knock out functions in https://github.com/pytorch/pytorch/issues/71117. Pull Request resolved: https://github.com/pytorch/pytorch/pull/77186 Approved by: https://github.com/soulitzer	2022-05-10 23:58:35 +00:00
PyTorch MergeBot	7eaf4780ba	Revert "[LT] Store OpKind for each IR subclass in a static field" This reverts commit `ac37ddc795`. Reverted https://github.com/pytorch/pytorch/pull/76711 on behalf of https://github.com/malfet	2022-05-09 20:50:09 +00:00
Nikolay Korovaiko	a6341d2ce5	LT tutorial [WIP] (#76392 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/76392 Approved by: https://github.com/wconstab	2022-05-09 17:36:16 +00:00
Bin Bao	36150c63a7	[LT] Move device lock in LazyGraphExecutor to a later place Summary: cherry-picking https://github.com/pytorch/xla/pull/3457 Pull Request resolved: https://github.com/pytorch/pytorch/pull/76974 Approved by: https://github.com/wconstab, https://github.com/JackCaoG	2022-05-09 17:31:33 +00:00
Bin Bao	ac37ddc795	[LT] Store OpKind for each IR subclass in a static field Summary: Currently OpKind is stored as an object field called op_ for each IR node, and one usage of op_ is to avoid dynamic_cast in NodeCast when we need to downcast a base-node pointer into a concrete sub-node pointer. As a result, we need to construct and pass in an op when downcasting nodes, and this becomes quite anonnying when we start to implement the trie-based IR node reusing. More importantly, the op for each subclass should be unique for that subclass and thus making it a const static field is a more logical design. In this PR, we still keep the object-level op_ for easier XLA adoption. As furture work, we can come back to remove op_, make the op() method virtual, and get rid of OpKind in all the node constructors. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76711 Approved by: https://github.com/wconstab, https://github.com/JackCaoG	2022-05-06 19:14:46 +00:00
Bin Bao	f05710dd40	[LT] Add a trie data structure for caching IR nodes Summary: TrieCache provides a way to look up an IR node before we actually create it. If the lookup hits in TrieCache, we reuse the existing node and move the current pointer in TrieCache to point to that node; if the lookup misses, we create a new node and insert it into TrieCache. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76542 Approved by: https://github.com/wconstab, https://github.com/JackCaoG	2022-05-04 23:48:03 +00:00
Bin Bao	f8a4780eb2	[LT] Move MakeNode into ir_builder.h Summary: Move MakeNode into ir_builder.h to avoid circular header reference later when introducing a trie cache for IR node lookup. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76482 Approved by: https://github.com/wconstab	2022-05-03 14:53:19 +00:00
Bin Bao	65b9778d30	[LT] Add a flag to control IR reusing Pull Request resolved: https://github.com/pytorch/pytorch/pull/76488 Approved by: https://github.com/wconstab, https://github.com/JackCaoG	2022-05-03 14:48:15 +00:00
Wonjoo Lee	28dfed962a	Remove deprecated string torch::lazy::BackendDevice constructor (#76506 ) Summary: Remove deprecated string torch::lazy::BackendDevice constructor, re-landing part of https://github.com/pytorch/pytorch/pull/76264. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76506 Reviewed By: dagitses Differential Revision: D35993059 Pulled By: alanwaketan fbshipit-source-id: c859331919447ecfa56a9a57a3324305b7904fc2 (cherry picked from commit 2e99b1533425693a6575405f8806551efb2002e4)	2022-05-03 06:10:01 +00:00
Antonio Kim	f3f327e103	Decouple LTC from TS Backend using Lazy IR Builder Next stage of breaking up https://github.com/pytorch/pytorch/pull/74710 IR builder class introduced to decouple the explicit usage of `TsNode` in core lazy tensors. Requires https://github.com/pytorch/pytorch/pull/75324 to be merged in first. Background - there are ~ 5 special ops used in lazy core but defined as :public {Backend}Node. (DeviceData, Expand, Scalar...) - we currently require all nodes derive from {Backend}Node, so that backends can make this assumption safely - it is hard to have shared 'IR classes' in core/ because they depend on 'Node' Motivation 1. avoid copy-paste of "special" node classes for each backend 2. in general decouple and remove all dependencies that LTC has on the TS backend Summary of changes - new 'IRBuilder' interface that knows how to make 5 special ops - move 'special' node classes to `ts_backend/` - implement TSIRBuilder that makes the special TS Nodes - new backend interface API to get the IRBuilder - update core code to call the builder CC: @wconstab @JackCaoG @henrytwo Partially Fixes #74628 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75433 Approved by: https://github.com/wconstab	2022-04-28 02:07:02 +00:00
Jiewen Tan	a28b132bc2	Revert D35860266: [pytorch][PR] Update torch::lazy::BackendDevice to have a new default ordinal Test Plan: revert-hammer Differential Revision: D35860266 (`f9d07ae644`) Original commit changeset: 554ebe16a068 Original Phabricator Diff: D35860266 (`f9d07ae644`) fbshipit-source-id: 325c54aa2e87e51134115213352b3d33a81b7edf (cherry picked from commit bbd74bf34a534d1b87aadff9790038e3dbbfa9c8)	2022-04-27 18:11:24 +00:00
Henry Tu	a39a2c3969	Enable LTC Input/Output Mapping (#75828 ) Summary: This PR enables Input/Output aliasing for Lazy Tensor Core. `SetUpAlias` is a virtual function that can be overridden in a vendor's custom `LoweringContext` implementation. The return type of `LoweringContext::GetResultShape` has also been updated to return a `c10::optional` value, since `GetResultShape` isn't currently implemented for the TorchScript backend. The changes here mirror the interface used by `torch_xla`: https://github.com/pytorch/xla/blob/master/torch_xla/csrc/tensor.cpp#L1548-L1549 cc: antoniojkim ke1337 wconstab silvasean Pull Request resolved: https://github.com/pytorch/pytorch/pull/75828 Reviewed By: Krovatkin Differential Revision: D35952593 Pulled By: wconstab fbshipit-source-id: e20b11e44e0e1beda1b1c47aa3a8b611afd97b7f (cherry picked from commit bcbc9ef01ef8eb84667e5c42edc10d38d5d78395)	2022-04-27 17:06:01 +00:00
Nikolay Korovaiko	bb60cac25a	E2E SymInt example narrow_copy This roughly corresponds to Goal 3.2 in https://docs.google.com/document/d/1iiLNwR5ohAsw_ymfnOpDsyF6L9RTUaHMpD8YLw-jxEw/edit# Namely: It adds the following: * SymbolicIntNode interface * LazySymbolicIntNode implementation * Lazy `narrow_copy` implementation * Need add support for SymInt in codegen * Test (below) ```cpp TEST(LazyDynamicOpsTest, NarrowCopy) { auto x = torch::rand({5, 10, 10}).to(kLazy); const size_t Y_DIM = 3; const size_t X_DIM_INDEX = 2; auto y = torch::rand({Y_DIM}).to(kLazy); auto ly = torch::lazy::TryGetLtcTensor(y); auto dim_node = MakeNode<SizeNode>(ly->GetIrValue(), 0); auto lmn = new torch::lazy::SymbolicIntNode(dim_node); auto z = x.narrow_copy(X_DIM_INDEX, 0, lmn->toSymInt()); AllClose(z.cpu(), x.cpu().narrow_copy(X_DIM_INDEX, 0, Y_DIM)); } ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/75759 Approved by: https://github.com/wconstab	2022-04-26 02:40:27 +00:00
Wonjoo Lee	f9d07ae644	Update torch::lazy::BackendDevice to have a new default ordinal (#76264 ) Summary: Fixes https://github.com/pytorch/xla/issues/3490. Updates `torch::lazy::BackendDevice` with changes below: 1. Remove the no-op string constructor. 2. Update default ordinal to `-1`. 3. Add a `is_valid` function to check if `ordinal` is valid/non-default (`ordinal >= 0`). Pull Request resolved: https://github.com/pytorch/pytorch/pull/76264 Reviewed By: mrshenli Differential Revision: D35860266 Pulled By: alanwaketan fbshipit-source-id: 554ebe16a0683d37b00270c4f35163bf690bfe28 (cherry picked from commit b941d10e8545dfecfb34e4d5c24a29a1cc49bc4b)	2022-04-25 23:57:18 +00:00
Will Constable	a4ce113649	Avoid compiling ts backend binidngs in fbcode build Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/76337 Approved by: https://github.com/shunting314	2022-04-25 21:45:54 +00:00
Edward Yang	36420b5e8c	Rename tools/codegen to torchgen (#76275 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/76275 In preparation for addressing https://github.com/pytorch/pytorch/issues/73212 Diff was generated with: ``` git mv tools/codegen torchgen git grep -l 'tools.codegen' \| xargs sed -i 's/tools.codegen/torchgen/g' sed -i "s/\${TOOLS_PATH}\/codegen/\${TORCH_ROOT}\/torchgen/g" caffe2/CMakeLists.txt ``` and a manual edits to: * tools/test/test_gen_backend_stubs.py * torchgen/build.bzl * torchgen/gen_backend_stubs.py aka this diff: ``` diff --git a/tools/test/test_gen_backend_stubs.py b/tools/test/test_gen_backend_stubs.py index 3dc26c6d2d..104054575e 100644 --- a/tools/test/test_gen_backend_stubs.py +++ b/tools/test/test_gen_backend_stubs.py @@ -9,7 +9,7 @@ from torchgen.gen_backend_stubs import run from torchgen.gen import _GLOBAL_PARSE_NATIVE_YAML_CACHE # noqa: F401 path = os.path.dirname(os.path.realpath(__file__)) -gen_backend_stubs_path = os.path.join(path, '../torchgen/gen_backend_stubs.py') +gen_backend_stubs_path = os.path.join(path, '../../torchgen/gen_backend_stubs.py') # gen_backend_stubs.py is an integration point that is called directly by external backends. # The tests here are to confirm that badly formed inputs result in reasonable error messages. diff --git a/torchgen/build.bzl b/torchgen/build.bzl index ed04e35a43..d00078a3cf 100644 --- a/torchgen/build.bzl +++ b/torchgen/build.bzl @@ -1,6 +1,6 @@ def define_targets(rules): rules.py_library( - name = "codegen", + name = "torchgen", srcs = rules.glob(["*/.py"]), deps = [ rules.requirement("PyYAML"), @@ -11,6 +11,6 @@ def define_targets(rules): rules.py_binary( name = "gen", - srcs = [":codegen"], + srcs = [":torchgen"], visibility = ["//visibility:public"], ) diff --git a/torchgen/gen_backend_stubs.py b/torchgen/gen_backend_stubs.py index c1a672a655..beee7a15e0 100644 --- a/torchgen/gen_backend_stubs.py +++ b/torchgen/gen_backend_stubs.py @@ -474,7 +474,7 @@ def run( ) -> None: # Assumes that this file lives at PYTORCH_ROOT/torchgen/gen_backend_stubs.py - pytorch_root = pathlib.Path(__file__).parent.parent.parent.absolute() + pytorch_root = pathlib.Path(__file__).parent.parent.absolute() template_dir = os.path.join(pytorch_root, "aten/src/ATen/templates") def make_file_manager(install_dir: str) -> FileManager: ``` run_all_fbandroid_tests Test Plan: sandcastle Reviewed By: albanD, ngimel Differential Revision: D35770317 fbshipit-source-id: 153ac4a7fef15b1e750812a90bfafdbc8f1ebcdf (cherry picked from commit c6d485d1d4648fa1c8a4c14c5bf3d8e899b9b4dd)	2022-04-25 01:38:06 +00:00
Antonio Kim	2c2c13d21b	Decouple Lazy Node Shape Cache (#75324 ) Summary: Next stage of breaking up https://github.com/pytorch/pytorch/pull/74710 Move shape cache implementation to the backend interface. Also, clean up some of the hashing logic in the base node class. CC: wconstab JackCaoG henrytwo Partially Fixes https://github.com/pytorch/pytorch/issues/74628 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75324 Reviewed By: anjali411 Differential Revision: D35730823 Pulled By: wconstab fbshipit-source-id: cf6fa326319b9324e5f422a78817b6fb5bf7e9b8 (cherry picked from commit faec5043df56639e2fd23de2d91ae796e4f3df70)	2022-04-21 17:27:05 -07:00
Nikita Shulga	f6c275f55d	Remove `-Wno-unused-variable` from `utils.cmake` (take 2) (#75538 ) Summary: [Comment](https://github.com/pytorch/pytorch/pull/62445/files#r680132022) claims, it got added for consistency with top level CMakeLists.txt, but `-Wno-unused-variable` is not mentioned there. Modify violations in 50+ files that were added in the interim by either removing unused variables, or decorating the code with `C10_UNUSED` if local variable is likely used to extend object lifetime until the end of the block. Caused preventable revert in https://github.com/pytorch/pytorch/pull/72633#issuecomment-1092300787 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75538 Reviewed By: anjali411 Differential Revision: D35747333 Pulled By: malfet fbshipit-source-id: 3fc5828e44a4c05ba0e89e92613e6ebbdb260626 (cherry picked from commit c179fba21cfa2a0093fad50ccad5a22dd7cff52c)	2022-04-20 17:41:59 +00:00
PyTorch MergeBot	5c56b2286b	Revert "Remove `-Wno-unused-variable` from utils.cmake" This reverts commit `018cbe1f5c`. Reverted https://github.com/pytorch/pytorch/pull/75538 on behalf of https://github.com/seemethere	2022-04-19 17:19:09 +00:00
Nikita Shulga	018cbe1f5c	Remove `-Wno-unused-variable` from utils.cmake [Comment](https://github.com/pytorch/pytorch/pull/62445/files#r680132022) claims, it got added for consistency with top level CMakeLists.txt, but `-Wno-unused-variable` is not mentioned there. Modify violations in 50+ files that were added in the interim by either removing unused variables, or decorating the code with `C10_UNUSED` if local variable is likely used to extend object lifetime until the end of the block. Caused preventable revert in https://github.com/pytorch/pytorch/pull/72633#issuecomment-1092300787 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75538 Approved by: https://github.com/cpuhrsch	2022-04-19 15:26:55 +00:00
Brian Hirsh	204df13d42	teach ivalue about List[Optional[Tensor]], fix fallbacks Pull Request resolved: https://github.com/pytorch/pytorch/pull/75716 Approved by: https://github.com/ezyang	2022-04-18 20:05:26 +00:00
Nikolay Korovaiko	977a66fe88	switch DimensionNode's base from TsNode to Node Switching the base of DimensionNode to TsNode so it can be reused by XLA and other backends. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75916 Approved by: https://github.com/alanwaketan	2022-04-16 02:57:29 +00:00
Jiewen Tan	b0081e7642	[LT] Support narrow Summary: Support op narrow. Narrow is a view op which we cannot code-gen so far. So did it in the hand-written old fashion way. Test Plan: ./build/bin/test_lazy --gtest_filter=LazyOpsTest.TestNarrow* Pull Request resolved: https://github.com/pytorch/pytorch/pull/75344 Approved by: https://github.com/wconstab, https://github.com/desertfire	2022-04-15 23:57:10 +00:00
Jiewen Tan	ab0d9b18e9	[LT] Support Tensor.is_alias_of Summary: Tensor.is_alias_of relies on Storage to perform. However, LTCTensorImpl was not implemented with that in mind. This commit adds a fake storage to LazyTensor as a marker to mark LazyTensors that point to the same storage. The reason why it's not done at LTCTensorImpl is that LazyTensor maintains the view ops/alias logic in LazyTensor class instead of relying on TensorImpl to do the check. Test Plan: ./build/bin/test_lazy --gtest_filter=LazyOpsTest.IsAliasOf Pull Request resolved: https://github.com/pytorch/pytorch/pull/75246 Approved by: https://github.com/bdhirsh	2022-04-14 07:28:03 +00:00
Nikita Shulga	6ba29d715e	[BE] Fix deprecated usages of `isIntegral` By passing `includeBool=` parameter Pull Request resolved: https://github.com/pytorch/pytorch/pull/75524 Approved by: https://github.com/seemethere, https://github.com/janeyx99	2022-04-08 20:43:41 +00:00
Jiewen Tan	dc37090ec5	[LT] Support diagonal op (#75230 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75230 Op diagonal is a view op which we can't code-gen yet. Therefore, support it by making hand-written IR construction and lowering. Test Plan: ./build/bin/test_lazy --gtest_filter=LazyOpsTest.TestDiagonal* Reviewed By: wconstab Differential Revision: D35378316 Pulled By: alanwaketan fbshipit-source-id: 7958d00107aef20ac37aabcf2868346240977530 (cherry picked from commit 84155528fce484627c9688cfd92fd4aeb68219e5)	2022-04-08 19:49:42 +00:00
Nikolay Korovaiko	4a85145bbd	Ansley's rebase of DimensionNode onto master (#75352 ) Summary: Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/75352 Reviewed By: wconstab Differential Revision: D35455859 Pulled By: Krovatkin fbshipit-source-id: e24c81d63dc66d03b752cc8de5cb551d84b003ac (cherry picked from commit 4ad371cb4cc88860ce8ec398d82083f6759e3fcf)	2022-04-08 17:22:56 +00:00
John Clow	f1db3e465a	Adding integration of SSA into LazyTensor Pull Request resolved: https://github.com/pytorch/pytorch/pull/75050 Approved by: https://github.com/Krovatkin	2022-04-07 19:49:41 +00:00
Shunting Zhang	a9d43d6f6e	Dynamo+LTC: add pybind to set force fallback config and use that in test_extract_compiled_graph.py (#75292 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75292 - Follow the convention in [this doc](https://docs.google.com/document/d/1Vi96ITGoK7BW01ZEccexs4pvCQKF4_LdV8w7TfIWPvM/edit) to setup config for ltc force fallback ops. - Pybinds are added to read/set the config. - Use the added pybinds in the unit test which needs to force fallbacks. Test Plan: ``` pytest test/lazy/test_extract_compiled_graph.py ``` Reviewed By: malfet Differential Revision: D35417678 Pulled By: shunting314 fbshipit-source-id: 1e05b8c831174872d70257a0ddd958863d6ca80d (cherry picked from commit 9366bde7ef20837dcf03b7d8e18f9017a58c80fa)	2022-04-07 02:39:20 +00:00
Will Constable	722e9e3403	Wconstab/doc codegen (#74850 ) Summary: Add basic docs to the lazy codegen utility cc JackCaoG Pull Request resolved: https://github.com/pytorch/pytorch/pull/74850 Reviewed By: shunting314 Differential Revision: D35418039 Pulled By: wconstab fbshipit-source-id: bfc16e42b2afcff8301f2bcc053dc911baad2241 (cherry picked from commit 479b3f5cf18d12d8e8dca661e31a1fab763c98dc)	2022-04-06 06:16:07 +00:00
Antonio Kim	e1b4117e30	Move shape and operand definitions to base node (#75223 ) Summary: First stage of breaking up https://github.com/pytorch/pytorch/pull/74710 Moves the shape and operand definitions from `TsNode` to the base `Node` CC: wconstab JackCaoG henrytwo Partially Fixes https://github.com/pytorch/pytorch/issues/74628 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75223 Reviewed By: zou3519 Differential Revision: D35410285 Pulled By: wconstab fbshipit-source-id: bb84d3fb636882cbe7e18af4b35ff2c0e22aaa58 (cherry picked from commit a4144c9a48379d8a9007cff845796608b597cce1)	2022-04-06 01:43:46 +00:00
Will Constable	22d227fd29	Fix lazy ts backend build flags (#75237 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75237 applies 'OVRSOURCE' logic to one more place missed in D35331263 (`8b7e2bf7a6`) so that lazy TS backend is not compiled in internal builds Test Plan: CI Reviewed By: malfet, shunting314 Differential Revision: D35377758 fbshipit-source-id: 5dcd3d36e50a8917470a917f2120353972dc31ba (cherry picked from commit 8b8ed7bdaa553eec2ef8b5461d1bd867979049dd)	2022-04-05 00:46:00 +00:00
Nikita Shulga	8b7e2bf7a6	Skip TorchScript backend for OVRSource as well (#75138 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75138 (Note: this ignores all push blocking failures!) Test Plan: CI Reviewed By: wconstab Differential Revision: D35331263 fbshipit-source-id: e426c4017359c9f98188c0df5226775be7b1f700 (cherry picked from commit bf1768f3be06e9de643978d70591d9cd15ef11b3)	2022-04-02 04:01:57 +00:00
Shunting Zhang	19747cbbe6	Dynamo+LTC: merging related code from staging branch to master (#75046 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/75046 Merge the code needed for dynamic+ltc integration from the staging branch to the master branch. Test Plan: Unit test ``` pytest test_extract_compiled_graph ``` test thru dynamo ``` LTC_TS_CUDA=1 time python torchbench.py --speedup-ltc -dcuda --nvfuser --randomize-input --only <model name> ``` Reviewed By: alanwaketan Differential Revision: D35300646 Pulled By: shunting314 fbshipit-source-id: 09ed20d3bb8ef80e4b93ba87ea3356a07d2dccdb (cherry picked from commit 2b56771cdfd2cfa825c65ee9fd42338fb372fb32)	2022-04-02 00:23:15 +00:00
Wei-Sheng Chin	ea2e0e773b	Fix LTC tests on Windows (#74960 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/74519 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74960 Reviewed By: gmagogsfm, cpuhrsch Differential Revision: D35297620 Pulled By: wconstab fbshipit-source-id: ef0ee27d29cd5a08afbaabeac9b254801555f70a (cherry picked from commit 7910b625558bf75e62a0190ef835bf3ebca21d05)	2022-04-01 19:18:34 +00:00
Will Constable	b9e535a64a	Add non-eager registration to dispatch autogen (#74557 ) Summary: Previously, the torchscript backend would be (partially) initialized at startup. - the dispatcher registrations would be registered, - but other backend components would not be initialized until explicitly calling the backend init function With this change, the torchscript backend is not initialized until its explicit initialization function is called. This enables external backends to register their own backend instead of the torchscript backend to the same (Lazy) key. Lands a change contributed by antoniojkim via lazy_tensor_staging branch (https://github.com/pytorch/pytorch/issues/73973) Pull Request resolved: https://github.com/pytorch/pytorch/pull/74557 Reviewed By: bdhirsh Differential Revision: D35051464 Pulled By: wconstab fbshipit-source-id: 5a8b0851293e394f49427d1416ee571a8881fe9f (cherry picked from commit ef745a4a2c8d1d7f9510541a20f1f40625ce29de)	2022-04-01 03:42:53 +00:00
Will Constable	14affba799	Fix ir_metadata Python frames func and remove dead code (#74979 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/74979 Reviewed By: alanwaketan Differential Revision: D35261641 Pulled By: wconstab fbshipit-source-id: e82b5f17d0043c4a3de72c16fb42fd02a85414fe (cherry picked from commit fc6c0a1654256871361a5ad08926bc39d74cd0c5)	2022-03-31 23:23:36 +00:00
Nikolay Korovaiko	5177f95d21	Introducing SymInt to Pytorch (for tracing size arithmetic) (master rebase) (#74861 ) Summary: This PR introduces `SymInt` type to Pytorch which will be used by LTC and AOTAutograd for tracing size arithmetic and tests. `SymInt` is a C++ union structure [int64_t, SymbolicIntNode*] that wraps around an int64_t field where the value of the field could be an index into a list of `shared_ptr<SymbolicIntNode>` or a real int. This PR doesn't add any support for actually tracing symbolic ints. i.e. data_ for now can only contain real ints. ``` Goal 1: just to show we can add a type to PyTorch core. (wraps int) LANDEABLE Finalize the naming - symint Want the name to be short Does invoke “size” - NO SInt/SymInt/SymbolicInt SInt could mean signed int sym_int or symint or SymInt (originally it was “int”; capitalized implies object semantics, whereas lowercase implies value semantics) JIT schema - symint C++ - symint ``` See more details here: https://docs.google.com/document/d/1iiLNwR5ohAsw_ymfnOpDsyF6L9RTUaHMpD8 (`d843f63f2a`)YLw-jxEw Pull Request resolved: https://github.com/pytorch/pytorch/pull/74861 Reviewed By: qihqi, ngimel Differential Revision: D35226230 Pulled By: Krovatkin fbshipit-source-id: 34acf342bd50fcaa4d8d5dd49c2fd6a98823a5b3 (cherry picked from commit 218643f63ef181cabb92d13a6e837eb64f2dda3c)	2022-03-31 21:59:59 +00:00
Will Constable	ff206ed09e	Add lazy tensor python bindings (#74508 ) Summary: This adds a minimal set of python bindings for lazy tensor and the torchscript backend. It targets the APIs that are used by the `test_ts_opinfo.py` test (which it also lands, from lazy_tensor_staging, where it is [lazy_tensor_core/test/test_lazy.py](https://github.com/pytorch/pytorch/blob/lazy_tensor_staging/lazy_tensor_core/test/test_lazy.py)). We should land more python bindings obviously. I just wanted to focus on a minimal set that can also be tested, and use it to agree on how we organize the bindings, then others could easily contribute bindings on top of this infrastructure. cc JackCaoG Pull Request resolved: https://github.com/pytorch/pytorch/pull/74508 Reviewed By: pbelevich Differential Revision: D35032152 Pulled By: wconstab fbshipit-source-id: 526505ab355b7ad27037ece0ff814b2a4b69f1e2 (cherry picked from commit b4f73dd147472cb38003204aff228087c0230fda)	2022-03-29 13:40:11 +00:00
Will Constable	6707d67131	Expose GetMetaDataIfDebugging API (#74784 ) Summary: This should allow torch/XLA to access this API cc wonjoolee95 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74784 Reviewed By: shunting314 Differential Revision: D35159977 Pulled By: wconstab fbshipit-source-id: 1ad3ebb691acf6968314c1f37558d684d9bc1cdf (cherry picked from commit 7971ad0cd9ce3d40c56eb871b610c387faa30cac)	2022-03-28 16:36:55 +00:00
Kurt Mohler	5375b2e994	Resolve `int[]?` arguments to new OptionalIntArrayRef class This PR uses the `OptionalArrayRef` template class that was drafted in #64084. Fixes #44409 Pull Request resolved: https://github.com/pytorch/pytorch/pull/70864 Approved by: https://github.com/ezyang	2022-03-26 01:45:50 +00:00
Will Constable	3547f20872	Land remaining parts of Torchscript Lazy Tensor backend (#74111 ) Summary: Also enables bazel build to run lazy codegen. Bazel (oss) build feeds off the same filelists as cmake/buck (build_variables.bzl), so enabling it is easier than keeping it disabled. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74111 Test Plan: Run CI and verify test_lazy_ops is running via OSS cmake builds Reviewed By: bdhirsh Differential Revision: D34772403 fbshipit-source-id: 8a63f58b9536e6ac1be530667932176ef2549496 (cherry picked from commit e807ffb1918853d10b924fdc24f85ee5b1a39021)	2022-03-22 23:14:03 +00:00
Will Constable	d67a265881	Sync lazy_tensor_staging to master (#74311 ) Summary: This merges changes that have already been reviewed/landed onto lazy_tensor_staging branch. It combines changes from multiple PRs into one diff. updated from lazy_tensor_staging on 3/16 Pull Request resolved: https://github.com/pytorch/pytorch/pull/74311 Test Plan: Run CI to ensure compilation on various platforms Run unit tests on lazy_tensor_staging branch with source version of all these diffs Reviewed By: desertfire Differential Revision: D34929235 fbshipit-source-id: babbc3bbeabc5b8107ee9284ed7765887a148622 (cherry picked from commit d91577a6557343ec536f6859e4808ec1a8a9b685)	2022-03-17 16:08:57 +00:00
Will Constable	72b1194464	Run lazy tensor codegen in generate_code.py (#73996 ) Summary: Hooks into existing autograd codegen script (generate_code.py) to take advantage of its integrations into buck/cmake/bazel. Adds a new option (--gen_lazy_ts_backend) to. generate_code.py, calling this from CMake OSS build and fbcode build, but not from other internal xplat/ovrsource builds (these could be opted in later) Bazel support is added in a later diff. Includes one generated file (torch/csrc/lazy/generated/LazyIr.h) in a unit test (test/cpp/lazy/test_ir.cpp) to partially verify the generator is working, but does not compile the remaining output sources from the generator yet as they depend on other files not yet landed from lazy_tensor_staging branch. Pull Request resolved: https://github.com/pytorch/pytorch/pull/73996 Test Plan: OSS/internal CI - verify all builds are working and test_ir.cpp compiles LazyIr.h Reviewed By: ezyang Differential Revision: D34408536 fbshipit-source-id: 8af0aea3b95d81eccafc17d64390d70ddd176515 (cherry picked from commit f930612f2bad61c76eb02d85cfbec9f33a1459dc)	2022-03-17 15:31:26 +00:00
Peter Bell	5816a95ca9	CPU Kernel: Use per-operator headers (#71137 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71137 ATen has a header dependency problem. Whenever an operator is added or modified, it changes `ATen/Functions.h` and `ATen/NativeFunctions.h` which in turn requires essentially every single file to be rebuilt. Per-operator headers allow files to only include the specific operators they use and so minimizes unnecessary rebuilds during incremental builds and improves cache hits in CI builds. See this note for more details: `3a03af2f50/aten/src/ATen/templates/Functions.h (L20)` Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D33949900 Pulled By: malfet fbshipit-source-id: d76a53bfab9ea75391de060a2d85923339f93a7c (cherry picked from commit d3be6e4283bbc8d5c967c4634ea1a6b3386861ed)	2022-03-05 01:32:17 +00:00
Will Constable	b4173b80b7	Use intrusive_ptr for LazyTensor (#73445 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73445 Refactors the whole codebase to use LazyTensorPtr (defined as c10::intrusive_ptr) to enable XLA to use a derived class XlaLazyTensor and override functionality. this PR is just the first step, and we will need to add a factory class that XLA can override in their backend to actually hook up their derived tensor class. Parallel PR on lazy_tensor_staging: #73429 Test Plan: tested via lazy_tensor_staging test_ptltc and torchbench and CI Reviewed By: ezyang Differential Revision: D34481918 fbshipit-source-id: 01176b127df6b79039aa1bc57bc6da5505161f87 (cherry picked from commit 52b9ae4e22d2703d44c6436311d79d40bd62c6aa)	2022-03-03 06:27:35 +00:00
Shuitao Fan	05c86c2be1	T112685841: Use irange in PyTorch (#73378 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73378 1) ran check_for_c10_loops.py to automatically update all files (.h, .hpp, *.cpp) under fbcode/caffe2/torch (this is the path in the check_for_c10_loops.py, slightly different from the task description where the path mentioned was fbcode/caffe2. since current commit already contains 27 files, will use a separate commit for additional files). 2) manually reviewed each change, and reverted a few files: (a) select_keys.cpp, bucketize_calibration.cpp, index_mmh and TCPStore.cpp: iterator modified in loop (b) qlinear_4bit_ops.cpp and id_list_feature_merge_conversion.cpp: condition containing multiple expressions. Test Plan: Doing the following (still in progress, will address issues as they appear): buck build ... buck test ... Reviewed By: r-barnes Differential Revision: D34435473 fbshipit-source-id: b8d3c94768b02cf71ecb24bb58d29ee952f672c2 (cherry picked from commit fa9b0864f3761a501868fe0373204b12fdfc2b32)	2022-02-26 06:34:22 +00:00
Will Constable	d56d530e5c	Move Lazy Shape Inference functions to pytorch core (#72756 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72756 Test Plan: tested via lazy_tensor_staging branch (test_ptltc operator tests as well as torchbench inference) Reviewed By: bdhirsh Differential Revision: D34187999 fbshipit-source-id: f40c5869d573de4a58233ee1e8710f8a0ea7f284 (cherry picked from commit c4b2977c5a9afa7bf89cb7c8914738ee6100dd7b)	2022-02-24 07:35:05 +00:00
Nikolay Korovaiko	5a7778c9a6	Squeeze p2: hook up Squeeze to LazyView (#73067 ) Summary: This PR hooks up Squeeze op to LazyView. The end goal to reduce the instances where we need to rely on explicit shapes. In the final PR we will make `aten_ops` to use the right ViewInfo and update the lowering in ts_lowering. Pull Request resolved: https://github.com/pytorch/pytorch/pull/73067 Reviewed By: wconstab, mikaylagawarecki Differential Revision: D34345163 Pulled By: Krovatkin fbshipit-source-id: 6bfadedbded7521312019ead0dfc7c6a334ff0f5 (cherry picked from commit 4b3b10fa97911c2302840b44b63619d081e0d9d4)	2022-02-24 04:30:48 +00:00
Nikolay Korovaiko	237574db19	add assert to make sure expected number of LTC roots matches what TS … (#73112 ) Summary: …computes Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/73112 Reviewed By: mikaylagawarecki Differential Revision: D34351338 Pulled By: Krovatkin fbshipit-source-id: 1b3d0f3c801bd095b68d2eff3184ecbefadf7f34 (cherry picked from commit `53b7fc4ad6`)	2022-02-19 06:33:08 +00:00
Stephen Oakley	1646a0033d	Use irange in PyTorch (#72836 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72836 Replacing increment iterator loops with ranged loops. It allows loops such as for(int i=0;i<10;i++) to be expressed as for(const auto i : c10::irange(10)). This auto-types the loops and adds const-safety to the iteration variable. Reviewed By: albanD Differential Revision: D34136539 fbshipit-source-id: 760a70ad43ce6f05630ba8fea261d4dbb699e62e (cherry picked from commit `0428408d88`)	2022-02-18 19:29:07 +00:00
Alban Desmaison	0951cb513a	Revert D34342689: Revert D34250357: Sync lazy_tensor_staging back to master Test Plan: revert-hammer Differential Revision: D34342689 Original commit changeset: 43f6da6986f7 Original Phabricator Diff: D34250357 (`69389fb542`) fbshipit-source-id: 8a3fb74877e719e9b9577b58027b4e7061a04ef0 (cherry picked from commit `c749f08e7a`)	2022-02-18 17:31:21 +00:00
Alban Desmaison	86a961af87	Revert D34250357: Sync lazy_tensor_staging back to master Test Plan: revert-hammer Differential Revision: D34250357 (`69389fb542`) Original commit changeset: aa7d589f6050 Original Phabricator Diff: D34250357 (`69389fb542`) fbshipit-source-id: 43f6da6986f7fc5189d641b7803adc5ada27194c (cherry picked from commit `3c930a5e4e`)	2022-02-18 15:47:37 +00:00
Will Constable	69389fb542	Sync lazy_tensor_staging back to master (#72875 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72875 This diff contains changes from several PRs landed to lazy_tensor_staging branch. * generating 'fallback' overrides for each codegenned op, useful for debugging * supports operators which are missing aten:: symbols for op names, instead using their string counterpart * makes the IR class a base class instead of hardcoding the assumption of TS It also resolves lint issues and in particular cleans up the following: * {Type}s shouldn't be passed into isValueType, and using the catch-all base class of CType is nicer than specifying a list of types. Fixes #72852 Test Plan: test manually on lazy_tensor_staging branch Reviewed By: shunting314 Differential Revision: D34250357 fbshipit-source-id: aa7d589f605055d5d02bc77c77fa6f1182ff7497 (cherry picked from commit `2f8f5e4971`)	2022-02-18 03:49:46 +00:00
Will Constable	889f3f48b2	Revert D34178476: Update lazy_ir.py from lazy_tensor_staging Test Plan: revert-hammer Differential Revision: D34178476 (`3842140fd5`) Original commit changeset: 7190b2e0d82b Original Phabricator Diff: D34178476 (`3842140fd5`) fbshipit-source-id: 4c969a355f01244c6f5acc52bc31679f2182aa55 (cherry picked from commit `17082075dd`)	2022-02-16 19:34:41 +00:00
Will Constable	3842140fd5	Update lazy_ir.py from lazy_tensor_staging (#72730 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72730 This diff contains changes from several PRs landed to lazy_tensor_staging branch. - generating 'fallback' overrides for each codegenned op, useful for debugging - supports operators which are missing aten:: symbols for op names, instead using their string counterpart - makes the IR class a base class instead of hardcoding the assumption of TS Test Plan: tested on lazy_tensor_staging branch Reviewed By: desertfire Differential Revision: D34178476 fbshipit-source-id: 7190b2e0d82b4eb1f4510c858c24446c6df3f9d0 (cherry picked from commit `6713d3f0ef`)	2022-02-16 18:33:31 +00:00
Will Constable	328cfd50e7	Move debug_util and python_util to torch/csrc/lazy (#72607 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72607 since python isn't available from libtorch, most of lazy tensor code can't depend on python. separate python_util into libtorch_python library make debug_util and IR dump work with or without python by providing a default function for 'maybe getting python stacktrace' that returns an empty stacktrace use a registration mechanism on libtorch_python library load to update the 'maybe' function to use the real python stacktrace getter Test Plan: OSS build tests: - test_ptltc by itself works - LTC_SAVE_TENSORS_FILE=log test_ptltc works, and log contains empty stacktrces - python examply.py by itself works - LTC_SAVE_TENSORS_FILE=log test_ptltc works, and log contains real stacktraces fbcode build: rely on CI to run test/lazy Reviewed By: desertfire Differential Revision: D34115046 fbshipit-source-id: 8d6222963c146da36b3c1b5ff8a638bbc3f1442e (cherry picked from commit `3717688ade`)	2022-02-11 18:00:40 +00:00

... 2 3 4 5 6 ...

395 Commits