Commit Graph

294 Commits

Author SHA1 Message Date
Xiang Gao
dad071d8fe [nvFuser] Add real and imag to nvfuser and its python frontend (#79824)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79824
Approved by: https://github.com/mruberry, https://github.com/jjsjann123, https://github.com/kevinstephano
2022-07-07 17:25:42 +00:00
lezcano
beb98676ba Correct cbrt implementation (#80443)
Following
https://github.com/pytorch/pytorch/pull/80219#discussion_r907680368
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80443
Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry
2022-07-07 16:15:42 +00:00
Xiang Gao
9ca561cbd4 Add prims::{real, imag}, refs::{real, imag}, remove prims::is_infinite (#80148)
This is a subset of https://github.com/pytorch/pytorch/pull/78655. I want to land this separately because the world is changing so fast, and https://github.com/pytorch/pytorch/pull/78655 is having lots of conflicts with other parts of the world.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80148
Approved by: https://github.com/mruberry
2022-07-05 23:06:39 +00:00
Nikita Shulga
b62209f047 [Prims] Unbreak CUDA lazy init (#80899)
CUDA calls should not be made in the default codepath

Fixes https://github.com/pytorch/pytorch/issues/80876
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80899
Approved by: https://github.com/ngimel
2022-07-05 21:16:58 +00:00
Ivan Yashchuk
9a12aa6cad Add cached nvFuser's fusion creation for torch._prims.executor (#80525)
In the current setup, a `Fusion` object was constructed from the `GraphModule` and args on every call of the `execute` function, which is expensive.

This PR makes use of `functools.lru_cache` to pay the `Fusion` creation cost once per `GraphModule` and set of args. Currently the shape, strides, and dtype of the tensors are part of the static cache key; this can be changed later to make better use of nvFuser's internal caching mechanism (by specifying only ndim, contiguity, and dtype).
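A minimal, self-contained sketch of the caching pattern (illustrative names only, not the actual `torch._prims.executor` internals): the expensive construction runs once per graph plus static tensor metadata, and subsequent calls hit the cache.
```py
from functools import lru_cache

@lru_cache(maxsize=None)
def build_fusion(gm_key, shapes, strides, dtypes):
    # Expensive construction; runs only on a cache miss.
    return ("fusion stand-in", gm_key, shapes, strides, dtypes)

def cache_key(gm, args):
    # Every component must be hashable for lru_cache to work.
    return (id(gm),
            tuple(tuple(t.shape) for t in args),
            tuple(t.stride() for t in args),
            tuple(str(t.dtype) for t in args))
```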

On master:
```py
In [2]: a = torch.randn(3, 3, device='cuda')

In [3]: with TorchRefsMode.push():
   ...:     gm = make_fx(lambda x: torch.sigmoid(x))(a)
   ...:

In [4]: %%timeit
   ...: execute(gm, a, executor="nvfuser")
   ...: torch.cuda.synchronize()
175 ms ± 1.18 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
```
This PR:
```py
In [2]: a = torch.randn(3, 3, device='cuda')

In [3]: with TorchRefsMode.push():
   ...:     gm = make_fx(lambda x: torch.sigmoid(x))(a)
   ...:

In [4]: %%timeit
   ...: execute(gm, a, executor="nvfuser")
   ...: torch.cuda.synchronize()
62.6 µs ± 9.99 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
```

In addition, this PR adds support for pytree inputs and extends the test for this.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80525
Approved by: https://github.com/kevinstephano, https://github.com/jjsjann123, https://github.com/SherlockNoMad
2022-07-05 17:00:45 +00:00
lezcano
eb0889cf7d Add support for multiple inputs to out_wrapper and strict dtype checking (#80601)
Reland of https://github.com/pytorch/pytorch/pull/79941
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80601
Approved by: https://github.com/albanD
2022-07-05 12:31:21 +00:00
PyTorch MergeBot
184a065ba7 Revert "Add support for multiple inputs to out_wrapper and strict dtype checking (#79941)"
This reverts commit dc7066a8f0.

Reverted https://github.com/pytorch/pytorch/pull/79941 on behalf of https://github.com/suo due to broke master dc7066a8f0
2022-06-30 03:29:30 +00:00
lezcano
dc7066a8f0 Add support for multiple inputs to out_wrapper and strict dtype checking (#79941)
When a function returns multiple outputs in PyTorch, the `out`
parameter takes a tuple of tensors (see `linalg.svd` for example).
The current implementation in `out_wrapper_multi` modelled this
incorrectly, as it assumed that it would take a number of different
named parameters.

This PR implements the correct behaviour in `out_wrapper`. As a small
side-effect, we now need to call `@out_wrapper()` when the output is
just one tensor.

This PR also implements an additional optional parameter that checks
whether the dtype of the given `out` is exactly the dtype that the meta
function requires. This is the behaviour that we currently have in
PyTorch, and this check is necessary in eager mode when we pass these
tensors into external libraries.

We also make the functions with several outputs return a namedtuple,
similar to what we do in PyTorch.
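A hedged sketch of the semantics being implemented (hypothetical function, not the actual `out_wrapper` code): for a multi-output operation, `out=` takes a single tuple of tensors, and each tensor's dtype can optionally be checked strictly.
```py
import torch

def svd_like(a, *, out=None, exact_dtype=True):
    u, s, vh = torch.linalg.svd(a)
    if out is None:
        return u, s, vh
    # ``out`` is one tuple of tensors, not separate named parameters.
    for dst, src in zip(out, (u, s, vh)):
        if exact_dtype and dst.dtype != src.dtype:
            raise RuntimeError(f"expected {src.dtype}, got out tensor of {dst.dtype}")
        dst.resize_(src.shape).copy_(src)
    # Returning the tuple directly; the real change also wraps
    # multi-output results in a namedtuple.
    return out
```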
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79941
Approved by: https://github.com/mruberry, https://github.com/ezyang
2022-06-30 02:47:16 +00:00
lezcano
2d100eaa40 Correct meta behaviour of prims.resize (#80516)
The previous behaviour did not modify the tensor in-place when it should have.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80516
Approved by: https://github.com/ezyang
2022-06-30 02:47:16 +00:00
Aidyn-A
74fb6ee4c5 [primTorch] support one tensor and two scalars in _prims.where (#80146)
Fixes an issue with supporting two scalar arguments for `where` and other functions with a similar set of arguments:

```
refs.where(a, 1, 0)
```

I had to skip `test_python_ref_executor` because the test causes a `Segmentation fault` when running with two scalars.
The issue https://github.com/csarofeen/pytorch/issues/1770 has been fixed by https://github.com/csarofeen/pytorch/pull/1774, so we can lift the skip once it's merged into upstream.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80146
Approved by: https://github.com/ngimel
2022-06-29 19:58:31 +00:00
Ryan Spring
1d0d506e97 Add Div reference (#77936)
Add Prims:
- trunc
- Replace _wrap_scalar with scalar_tensor

Add References (sketched below):
- copysign
- div
- floor_divide
- trunc_divide

Other:
- Add support for `variant_test_name` in _find_referenced_opinfo
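A hedged sketch of how the division references listed above can be expressed on top of simpler ops (illustrative, not the literal `torch._refs` code; integer-dtype handling is elided):
```py
import torch

def trunc_divide(a, b):
    # Divide, then round toward zero (uses the new trunc prim).
    return torch.trunc(torch.div(a, b))

def floor_divide(a, b):
    # Divide, then round toward negative infinity.
    return torch.floor(torch.div(a, b))
```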
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77936
Approved by: https://github.com/mruberry
2022-06-27 14:46:17 +00:00
Ivan Yashchuk
0d5bc54114 Fix interpretation torch -> torch._refs in case of nested torch calls under TorchRefsMode (#80135)
Under `TorchRefsMode`, torch calls made inside the `TorchRefsMode.__torch_function__` dispatch should themselves be interpreted as refs calls. Fixes https://github.com/pytorch/pytorch/issues/80079.

In addition, this PR enables two more tests for the nvFuser executor.
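A minimal sketch of the dispatch idea (a hypothetical mode, not the actual `TorchRefsMode` implementation): the mode maps each intercepted torch function to its ref, and because the ref is called while the mode is still active, the torch calls the ref makes internally are translated too.
```py
import torch
import torch._refs
from torch.overrides import TorchFunctionMode

class RefsMode(TorchFunctionMode):
    def __torch_function__(self, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        ref = getattr(torch._refs, func.__name__, None)
        if ref is not None:
            # Still inside the mode here, so nested torch.* calls made
            # by the ref are themselves re-dispatched to refs.
            return ref(*args, **kwargs)
        return func(*args, **kwargs)
```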

For example here's the FX trace of `torch._refs.nn.functional.layer_norm` before the proposed change (note the mix of `aten` and `prims`):
```py
opcode         name                    target                      args                              kwargs
-------------  ----------------------  --------------------------  --------------------------------  -----------------
placeholder    a_1                     a_1                         ()                                {}
call_function  convert_element_type    prims.convert_element_type  (a_1, torch.float32)              {}
call_function  var                     prims.var                   (convert_element_type, [0, 1])    {'correction': 0}
call_function  broadcast_in_dim        prims.broadcast_in_dim      (var, [1, 1], [])                 {}
call_function  convert_element_type_1  prims.convert_element_type  (a_1, torch.float32)              {}
call_function  sum_1                   prims.sum                   (convert_element_type_1, [0, 1])  {}
call_function  broadcast_in_dim_1      prims.broadcast_in_dim      (sum_1, [1, 1], [])               {}
call_function  div                     prims.div                   (broadcast_in_dim_1, 9.0)         {}
call_function  add                     aten.add                    (broadcast_in_dim, 1e-05)         {}
call_function  rsqrt                   aten.rsqrt                  (add,)                            {}
call_function  sub                     aten.sub                    (a_1, div)                        {}
call_function  mul                     aten.mul                    (sub, rsqrt)                      {}
call_function  convert_element_type_2  prims.convert_element_type  (mul, torch.float32)              {}
output         output                  output                      (convert_element_type_2,)         {}
```
And with this PR:
```py
opcode         name                    target                      args                              kwargs
-------------  ----------------------  --------------------------  --------------------------------  -----------------
placeholder    a_1                     a_1                         ()                                {}
call_function  convert_element_type    prims.convert_element_type  (a_1, torch.float32)              {}
call_function  var                     prims.var                   (convert_element_type, [0, 1])    {'correction': 0}
call_function  broadcast_in_dim        prims.broadcast_in_dim      (var, [1, 1], [])                 {}
call_function  convert_element_type_1  prims.convert_element_type  (a_1, torch.float32)              {}
call_function  sum_1                   prims.sum                   (convert_element_type_1, [0, 1])  {}
call_function  broadcast_in_dim_1      prims.broadcast_in_dim      (sum_1, [1, 1], [])               {}
call_function  div                     prims.div                   (broadcast_in_dim_1, 9.0)         {}
call_function  add                     prims.add                   (broadcast_in_dim, 1e-05)         {}
call_function  rsqrt                   prims.rsqrt                 (add,)                            {}
call_function  broadcast_in_dim_2      prims.broadcast_in_dim      (div, [3, 3], [0, 1])             {}
call_function  sub                     prims.sub                   (a_1, broadcast_in_dim_2)         {}
call_function  broadcast_in_dim_3      prims.broadcast_in_dim      (rsqrt, [3, 3], [0, 1])           {}
call_function  mul                     prims.mul                   (sub, broadcast_in_dim_3)         {}
call_function  convert_element_type_2  prims.convert_element_type  (mul, torch.float32)              {}
output         output                  output                      (convert_element_type_2,)         {}
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80135
Approved by: https://github.com/ngimel
2022-06-25 03:55:04 +00:00
Khushi Agrawal
b00448df6b [primTorch] asinh, atanh (#80210)
Adds asinh and atanh refs and prims. These are prims because both correspond directly to C++ standard library calls.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/80210
Approved by: https://github.com/mruberry
2022-06-24 16:43:35 +00:00
Ivan Yashchuk
072311bb28 Enable torch._prims.amax/amin for nvFuser executor (#80070)
This PR adds nvFuser implementations for the `torch._prims.amax` and `torch._prims.amin` reduction functions. Currently, nvFuser refuses to reduce 0-d tensors, so these inputs are skipped in tests for now.

An accompanying fix replaces `collections.Sequence` -> `collections.abc.Sequence` in refs, because `collections.Sequence` was deprecated and is removed in Python 3.10.
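For reference, the deprecation in question: on Python 3.10 the aliases in `collections` are gone, so the `abc` path must be used.
```py
from collections.abc import Sequence  # works on every Python 3 version

# `from collections import Sequence` raises ImportError on Python 3.10+.
assert isinstance((1, 2, 3), Sequence)
```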

Many ops that were skipped for the nvFuser executor test are now enabled.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80070
Approved by: https://github.com/ngimel
2022-06-23 10:19:57 +00:00
Natalia Gimelshein
9244547a1b small cleanup of executor (#79973)
per title
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79973
Approved by: https://github.com/mruberry
2022-06-22 00:35:51 +00:00
Mike Ruberry
ca845462a0 Fixes maybe_broadcast to actually broadcast only when needed (#79298)
Adds a `same_shape` util and updates maybe_broadcast to use it; previously maybe_broadcast was always broadcasting because its equality check was always failing.
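A plausible sketch of the util (illustrative; this `maybe_broadcast` is simplified): compare shapes elementwise so the check works regardless of the sequence types involved.
```py
def same_shape(a, b):
    # Works for lists, tuples, and torch.Size alike.
    return len(a) == len(b) and all(x == y for x, y in zip(a, b))

def maybe_broadcast(shape, tensor):
    # Broadcast only when the shapes actually differ.
    return tensor if same_shape(shape, tensor.shape) else tensor.expand(shape)
```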
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79298
Approved by: https://github.com/ezyang
2022-06-21 19:26:40 +00:00
Natalia Gimelshein
c0ce4b0de9 make refs executor handle kwargs (#79858)
Mostly fixes #78923
I had to disable function patching in fx for functions with kwonly args, see https://github.com/pytorch/pytorch/compare/ngimel/make_fx_fix?expand=1#diff-090b22122be0779cd14afd2ebaf20d1e7c0bfe837e9eefa1d84e7521bb1defc6R446, cc @jamesr66a
But it looks like it was doing weird things anyway - it was patching the signature of the wrapped function with arbitrary local vars from the wrapper; that can't be right, but I don't know what the intent there was.
A lot of functions now fail with the nvfuser executor, and some still fail with aten, although with different errors than before.
Edit: undid the change to _symbolic_script.py; it turns out inspect-unwrapping the function is not needed, and fx never sees kwargs.
cc @IvanYashchuk, @Chillee

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79858
Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry
2022-06-21 18:53:15 +00:00
Horace He
e89676f76c fix logical_not reland issues
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79900

Approved by: https://github.com/ngimel
2022-06-21 03:41:18 +00:00
Nikita Shulga
f5eb05f107 Revert "Reland #2 of "Added {logical_not, trace} refs, moved logical ops to use method overloads""
This reverts commit f3665dd237.

Reverted https://github.com/pytorch/pytorch/pull/79819 on behalf of https://github.com/malfet due to land raced with softshrink refs
2022-06-20 14:22:15 -07:00
Horace He
f3665dd237 Reland #2 of "Added {logical_not, trace} refs, moved logical ops to use method overloads"
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79819

Approved by: https://github.com/mruberry
2022-06-20 19:50:43 +00:00
Syed Tousif Ahmed
802efc90c6 [primTorch] Implements refs for gcd, lcm and remainder (#78747)
This PR implements the references for gcd, lcm and remainder. Additionally, `gcd` is added as a prim, since we currently don't have a while loop construct.
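A hedged sketch of why a `gcd` prim is enough for the rest (illustrative, not the literal refs; integer tensors assumed): `lcm` can then be written as a composition.
```py
import torch

def lcm(a, b):
    g = torch.gcd(a, b)
    # Avoid division by zero when both inputs are zero: lcm(0, 0) == 0.
    g_safe = torch.where(g == 0, torch.ones_like(g), g)
    return torch.where(g == 0, g, torch.abs(a // g_safe * b))
```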
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78747
Approved by: https://github.com/mruberry
2022-06-20 01:51:32 +00:00
Ivan Yashchuk
e10b762537 Enable torch._refs.var for nvFuser executor (#79517)
This PR adds a variance function with a correction argument to nvFuser.

Now it's possible to run
```py
import torch
import torch._refs
from torch._prims.executor import make_traced

def foo1(a):
    return torch._refs.var(a, keepdim=False, unbiased=False)

def foo2(a):
    return torch._refs.var(a, keepdim=False, correction=2)

a = torch.randn(3, 3, device='cuda')
make_traced(foo1)(a, executor="nvfuser")
make_traced(foo2)(a, executor="nvfuser")
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79517
Approved by: https://github.com/mruberry, https://github.com/jjsjann123
2022-06-14 23:08:53 +00:00
Ivan Yashchuk
8895862744 Enable torch._refs.mean for nvFuser executor (#79444)
This PR fixes a bug with `broadcast_in_dim` that prevented reduction ops from being used before `broadcast_in_dim`.

With this PR it's possible to run
```py
import torch
import torch._refs
from torch._prims.executor import make_traced

def foo(a):
    return torch._refs.mean(a, keepdim=False)

a = torch.randn(3, 3, device='cuda')
make_traced(foo)(a, executor="nvfuser")
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79444
Approved by: https://github.com/mruberry, https://github.com/jjsjann123
2022-06-14 19:42:07 +00:00
PyTorch MergeBot
66460c4a6a Revert "Fixes maybe_broadcast to actually broadcast only when needed (#79298)"
This reverts commit 1cb1c2c08c.

Reverted https://github.com/pytorch/pytorch/pull/79298 on behalf of https://github.com/suo due to Broke FakeTensor tests on master, see: 1cb1c2c08c
2022-06-11 23:36:18 +00:00
Mike Ruberry
1cb1c2c08c Fixes maybe_broadcast to actually broadcast only when needed (#79298)
Adds a `same_shape` util and updates maybe_broadcast to use it; previously maybe_broadcast was always broadcasting because its equality check was always failing.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79298
Approved by: https://github.com/ezyang
2022-06-11 22:04:47 +00:00
PyTorch MergeBot
fefff54cad Revert "Revert "Revert "Added {logical_not, trace} refs, moved logical ops to use method overloads"""
This reverts commit a2d2981e8e.

Reverted https://github.com/pytorch/pytorch/pull/79224 on behalf of https://github.com/suo due to broke lots of things a2d2981e8e
2022-06-10 04:40:43 +00:00
Horace He
a2d2981e8e Revert "Revert "Added {logical_not, trace} refs, moved logical ops to use method overloads""
This reverts commit d67309aefb.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79224

Approved by: https://github.com/mruberry
2022-06-10 03:07:14 +00:00
PyTorch MergeBot
d67309aefb Revert "Added {logical_not, trace} refs, moved logical ops to use method overloads"
This reverts commit 64b6bd8c1e.

Reverted https://github.com/pytorch/pytorch/pull/79000 on behalf of https://github.com/malfet due to Introduces test failure, see https://hud.pytorch.org/pr/79000
2022-06-09 13:11:23 +00:00
Horace He
64b6bd8c1e Added {logical_not, trace} refs, moved logical ops to use method overloads
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79000

Approved by: https://github.com/ezyang
2022-06-09 07:16:36 +00:00
Elias Ellison
3c5a3ca9e8 Make FakeTensors return meta within kernel invocation, add FakeTensor op tests
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78972

Approved by: https://github.com/ezyang
2022-06-09 01:39:27 +00:00
Elias Ellison
290d0979f1 Migrate FakeTensors to always call into FakeTensorMode and have them hold a reference
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78677

Approved by: https://github.com/ezyang
2022-06-08 22:30:34 +00:00
Horace He
e675dbadc4 Ported gelu decomp to ref (#78697)
Ugh... these are actually so painful to write without operator overloading lol.

Decided to just utilize operator overloading, and xfail the ref tests for now.
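For context, the exact-gelu decomposition being ported is the standard formula (a sketch with operator overloading, not the literal ref):
```py
import math
import torch

def gelu(x):
    # gelu(x) = 0.5 * x * (1 + erf(x / sqrt(2)))
    return 0.5 * x * (1 + torch.erf(x * (1 / math.sqrt(2))))
```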
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78697
Approved by: https://github.com/mruberry
2022-06-06 22:30:20 +00:00
Edward Z. Yang
80f2c175be Follow up on CR for "Replace TensorMeta with FakeTensor"
See https://github.com/pytorch/pytorch/pull/78836

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78895

Approved by: https://github.com/albanD
2022-06-06 22:20:40 +00:00
Kshiteej K
c461d8a977 [primTorch] refs: hsplit, vsplit (#78418)
As per title

TODO:
* [x] Add error inputs (already exist)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78418
Approved by: https://github.com/mruberry
2022-06-06 19:54:05 +00:00
Edward Z. Yang
99882fc492 Make check() strongly typed, fix erroneous call sites
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78896

Approved by: https://github.com/Lezcano, https://github.com/anjali411
2022-06-05 23:10:55 +00:00
Edward Z. Yang
587efdb5fa Replace TensorMeta with FakeTensor
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78836

Approved by: https://github.com/albanD, https://github.com/mruberry
2022-06-05 11:51:27 +00:00
Ivan Yashchuk
df748b60f7 Allow pytrees as output for make_traced and nvfuser executor (#78802)
This PR lifts the restriction that the output of a function traced with `make_traced` and executed with nvFuser must be a single tensor. Now it's possible to return a "pytree", a nested data structure of tensors (see https://github.com/pytorch/pytorch/blob/master/torch/utils/_pytree.py).

I added a test with a function that returns a tuple of two objects where one of the objects is a dictionary with a tensor value.

```py
def fn(a, b):
    d = {}
    d["c"] = torch.add(a, b)
    return (d, torch.add(a, d["c"]))
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78802
Approved by: https://github.com/mruberry
2022-06-04 08:41:18 +00:00
Edward Z. Yang
83d40a4dba linalg_cholesky_ex meta function
Taken from https://github.com/albanD/subclass_zoo/blob/main/python_meta_tensor.py

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78604

Approved by: https://github.com/bdhirsh, https://github.com/ngimel, https://github.com/Lezcano
2022-06-03 23:11:02 +00:00
Edward Z. Yang
6120a8e05d Implement meta function for aten::index.Tensor
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78527

Approved by: https://github.com/bdhirsh, https://github.com/ngimel, https://github.com/Lezcano
2022-06-03 23:11:02 +00:00
Gao, Xiang
eb88ea01b5 Cleanup impl_nvfuser for unary ops (#78670)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78670
Approved by: https://github.com/mruberry, https://github.com/IvanYashchuk
2022-06-02 16:17:47 +00:00
Ivan Yashchuk
0be4672a9d [primTorch] Use the same error message as in ATen for canonicalize_dim (#78541)
Fixes https://github.com/pytorch/pytorch/issues/78252.

Locally nothing seems to break when changing the error type and the error message, which means there were no tests covering this.
At least one xfailed test from https://github.com/pytorch/pytorch/pull/78080 wouldn't pass with this PR.
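A sketch of the canonicalization with ATen's wording (illustrative; the real helper lives in the prims utils):
```py
def canonicalize_dim(rank, dim):
    # Match ATen's error type and message for out-of-range dims
    # (the rank-0 special case is elided).
    if dim < -rank or dim >= rank:
        raise IndexError(
            f"Dimension out of range (expected to be in range of "
            f"[{-rank}, {rank - 1}], but got {dim})"
        )
    return dim + rank if dim < 0 else dim
```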

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78541
Approved by: https://github.com/ngimel, https://github.com/mruberry
2022-06-02 12:10:41 +00:00
jjsjann123
fea909b43e [primTorch] Adds broadcast_shapes reference (#78612)
1. Added references `_refs.broadcast_shapes`
2. Added OpInfo test for `torch.broadcast_shapes`

A few minor changes:
- `test_python_ref_meta` and `_ref_test_helper` updated to avoid non-tensor outputs
- type annotation update for `_resize_meta`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78612
Approved by: https://github.com/mruberry
2022-06-02 08:56:37 +00:00
Xiang Gao
b651148fc3 remove prims::square (#78627)
because it is just `x * x`
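The replacement is a one-line reference (a sketch of the idea):
```py
def square(x):
    return x * x
```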
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78627
Approved by: https://github.com/mruberry
2022-06-02 02:18:17 +00:00
PyTorch MergeBot
6dafefe3d4 Revert "[primTorch] Use the same error message as in ATen for canonicalize_dim (#78541)"
This reverts commit c054993b53.

Reverted https://github.com/pytorch/pytorch/pull/78541 on behalf of https://github.com/malfet due to as it depends on https://github.com/pytorch/pytorch/pull/78080 that caused XLA failures and is getting reverted
2022-06-01 16:48:00 +00:00
Ivan Yashchuk
c054993b53 [primTorch] Use the same error message as in ATen for canonicalize_dim (#78541)
Fixes https://github.com/pytorch/pytorch/issues/78252.

Locally nothing seems to break when changing the error type and the error message, which means there were no tests covering this.
At least one xfailed test from https://github.com/pytorch/pytorch/pull/78080 wouldn't pass with this PR.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78541
Approved by: https://github.com/ngimel, https://github.com/mruberry
2022-06-01 16:09:35 +00:00
Ivan Yashchuk
20c32503eb Add more impl_nvfuser for prims (#78493)
This PR adds `test_nvfuser_impl_is_used` that checks that the corresponding nvfuser op (if available) is used in the prim definition.

Adds `impl_nvfuser=` for atan2, bitwise_and, bitwise_or, bitwise_xor, eq, ne, pow, sub, sum, where, rsqrt, lgamma.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78493
Approved by: https://github.com/mruberry
2022-06-01 14:10:17 +00:00
Edward Z. Yang
b7215de32f prod ref
It turns out the prim was implemented incorrectly, as torch.prod does not
accept a dim list, so I added a little stub for this.
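A sketch of the kind of stub this needs (illustrative): since `torch.prod` only accepts a single dim, a dim-list prod reduces one dimension at a time.
```py
import torch

def prod(a, dims):
    # Reduce the highest dims first so remaining indices stay valid.
    for d in sorted(dims, reverse=True):
        a = torch.prod(a, dim=d)
    return a

# prod(torch.ones(2, 3, 4), [0, 2]) -> tensor of shape (3,)
```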

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78461

Approved by: https://github.com/ngimel
2022-05-31 14:18:49 +00:00
Ryan Spring
2df1da09e1 Add Elementwise unary ops 4 references (#78216)
Add reference implementations for `nan_to_num, positive, sigmoid, signbit, tanhshrink`
Add prims for `minimum_value(dtype)` and `maximum_value(dtype)`
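A plausible sketch of the new dtype-extremum prims (illustrative; bool and complex handling elided):
```py
import torch

def maximum_value(dtype):
    info = torch.finfo(dtype) if dtype.is_floating_point else torch.iinfo(dtype)
    return info.max

def minimum_value(dtype):
    info = torch.finfo(dtype) if dtype.is_floating_point else torch.iinfo(dtype)
    return info.min
```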
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78216
Approved by: https://github.com/mruberry
2022-05-27 21:55:34 +00:00
jjsjann123
1a9a1b8b5e fixing typo (#78417)
primtorch prod is mistakenly using `_sum_doc`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78417
Approved by: https://github.com/malfet
2022-05-27 17:10:15 +00:00
Aidyn-A
31016eb81e [primTorch] Elementwise Binary Ops I (#78023)
This PR is a result of collaboration with @rdspring1 and @mruberry on primTorch.

It adds the following prims:
- `fmax`
- `fmin`
- `fmod`

And adds the following refs:
- `fmax`
- `fmin`
- `fmod`
- `logical_xor`

The work is in progress as there are some tests that fail.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78023
Approved by: https://github.com/mruberry
2022-05-26 20:22:27 +00:00
Gao, Xiang
5ecd30e857 [primTorch] Rename is_finite->isfinite (#78211)
`isfinite` sounds like a better name, because PyTorch, C++, and NumPy all use this name instead of `is_finite`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78211
Approved by: https://github.com/ngimel, https://github.com/mruberry
2022-05-26 16:17:51 +00:00
Xiang Gao
3b70a7c294 [primTorch] impl_nvfuser for unary ops - 1 (#78220)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78220
Approved by: https://github.com/mruberry
2022-05-25 22:59:48 +00:00
Edward Z. Yang
a1765f0176 addr ref
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78014

Approved by: https://github.com/ngimel
2022-05-25 01:40:11 +00:00
Ryan Spring
bb4653e736 Add i0, i1, zeta refs (#78111)
Add reference implementations for i0, i1, zeta
Add prim operations for i0, i1, zeta
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78111
Approved by: https://github.com/mruberry
2022-05-23 21:33:56 +00:00
Mike Ruberry
2738405a76 [primTorch] Adds any, all, equal, item references (#78072)
This PR adds the item, equal, any, and all references.

While doing this I found the following issues:
- https://github.com/pytorch/pytorch/issues/78070
- https://github.com/pytorch/pytorch/issues/78071

And I fixed a bug where the `convert_element_type` prim could not convert tensors requiring grad to datatypes that don't require grad.

Creating the item reference required adding item as a prim, but per @ngimel's suggestion I removed the prims for any and all and implemented them as references, so this is net negative one prim.
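A plausible way to write `any` and `all` as references instead of prims (a sketch, not the literal `torch._refs` code):
```py
import torch

def any_ref(a):
    # True iff at least one element is truthy.
    return a.to(torch.bool).sum().ne(0)

def all_ref(a):
    # all(a) == not any(not a)
    return torch.logical_not(any_ref(torch.logical_not(a)))
```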

Reference OpInfos are added for any and all, but item and equal don't even have regular OpInfos.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78072
Approved by: https://github.com/ngimel
2022-05-23 12:49:04 +00:00
Mike Ruberry
d4345ed0a6 [primTorch] Adds random operations (#78026)
This PR...

**Issues Found**
- https://github.com/pytorch/pytorch/issues/78058
- https://github.com/pytorch/pytorch/issues/78054
- https://github.com/pytorch/pytorch/issues/78053
- https://github.com/pytorch/pytorch/issues/78050
- https://github.com/pytorch/pytorch/issues/77932

**Testing**
- disables stride consistency checks in test_ops and test_meta pending resolution of https://github.com/pytorch/pytorch/issues/78050
- skips chalf in reference tests (addressing https://github.com/pytorch/pytorch/issues/78054)
- splits the test test_python_reference_consistency into one test for the ctx where torch.foo is torch.foo, and another for when torch.foo is refs.foo
- updates test names to be more natural and consistent:
  - test_python_reference_errors -> test_python_ref_errors
  - test_python_reference_consistency -> test_python_ref and test_python_ref_torch_fallback
  - test_python_reference_meta_functions -> test_python_ref_meta
  - test_reference_testing -> test_numpy_ref
- updates test_python_ref and test_python_ref_torch_fallback to check that the reference is more accurate than the torch op when the reference and torch op results are not close; a warning is raised when this occurs (addressing https://github.com/pytorch/pytorch/issues/77687)
- adds reference inputs for broadcast_tensors
- Updates the "fill_" OpInfo to "fill", adding a NumPy reference and making it an elementwise unary operator
- Adds 1D no element sample inputs to the cat OpInfo and updates the NumPy reference to handle them and type promotion correctly
- Adds reference inputs for elementwise ternary operations, like clamp
- Adds a NumPy reference for clamp
- Adds reference inputs to where's OpInfo
- Makes softplus an elementwise unary OpInfo
- Removes the great majority of Python reference OpInfo skips and xfails due to the above test changes
- Adds Python reference OpInfos for fill, dropout, clamp, broadcast_tensors, and where

**Prims**
- adds the fill, empty_strided, and uniform prims
- removes the empty, empty_like, full, and full_like prims -- these are now references that use empty_strided and fill
- renames the "concatenate" and "select" prims to "cat" and "where", respectively, to be consistent with PyTorch
- extends the `_elementwise_meta` operation to accept tensors that don't participate in type promotion, like the `cond` tensor in `where`
- fixes a bug in the stride propagation of broadcast_in_dim
- moves some error checks from prims.cat and prims.where to refs.cat and refs.where, respectively, consistent with our new policy of doing as much error checking in the ref as possible

**Utils**
- adds the canonicalize_device, extract_shape, and extract_shape_from_varargs helpers
- adds the elementwise_unary_scalar_wrapper -- this allows elementwise unary operators to take and return scalar values (ex. refs.sin(1) will return .84...); sketched below

**Refs**
- adds the fill, broadcast_tensors, clamp, empty_strided, ones, zeros, and uniform references
- adds the nn.functional.dropout reference
- fixes refs.cat to handle 1D tensors with no inputs consistent with eager mode
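A hedged sketch of the elementwise_unary_scalar_wrapper mentioned under **Utils** above (illustrative, not the literal helper): wrap scalar inputs into 0-d tensors, apply the op, and hand back a Python scalar.
```py
import torch

def elementwise_unary_scalar_wrapper(fn):
    def wrapper(a):
        if isinstance(a, (bool, int, float, complex)):
            return fn(torch.scalar_tensor(a)).item()
        return fn(a)
    return wrapper

sin = elementwise_unary_scalar_wrapper(torch.sin)
print(sin(1))  # ~0.8414709848078965
```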
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78026
Approved by: https://github.com/ngimel
2022-05-23 01:56:28 +00:00
PyTorch MergeBot
acfbc16b1c Revert "[primTorch] Adds random operations (#78026)"
This reverts commit 043cf1f9c7.

Reverted https://github.com/pytorch/pytorch/pull/78026 on behalf of https://github.com/suo due to This broke trunk: 043cf1f9c7
2022-05-22 18:11:14 +00:00
Mike Ruberry
043cf1f9c7 [primTorch] Adds random operations (#78026)
This PR...

**Issues Found**
- https://github.com/pytorch/pytorch/issues/78058
- https://github.com/pytorch/pytorch/issues/78054
- https://github.com/pytorch/pytorch/issues/78053
- https://github.com/pytorch/pytorch/issues/78050
- https://github.com/pytorch/pytorch/issues/77932

**Testing**
- disables stride consistency checks in test_ops and test_meta pending resolution of https://github.com/pytorch/pytorch/issues/78050
- skips chalf in reference tests (addressing https://github.com/pytorch/pytorch/issues/78054)
- splits the test test_python_reference_consistency into one test for the ctx where torch.foo is torch.foo, and another for when torch.foo is refs.foo
- updates test names to be more natural and consistent:
  - test_python_reference_errors -> test_python_ref_errors
  - test_python_reference_consistency -> test_python_ref and test_python_ref_torch_fallback
  - test_python_reference_meta_functions -> test_python_ref_meta
  - test_reference_testing -> test_numpy_ref
- updates test_python_ref and test_python_ref_torch_fallback to check that the reference is more accurate than the torch op when the reference and torch op results are not close; a warning is raised when this occurs (addressing https://github.com/pytorch/pytorch/issues/77687)
- adds reference inputs for broadcast_tensors
- Updates the "fill_" OpInfo to "fill", adding a NumPy reference and making it an elementwise unary operator
- Adds 1D no element sample inputs to the cat OpInfo and updates the NumPy reference to handle them and type promotion correctly
- Adds reference inputs for elementwise ternary operations, like clamp
- Adds a NumPy reference for clamp
- Adds reference inputs to where's OpInfo
- Makes softplus an elementwise unary OpInfo
- Removes the great majority of Python reference OpInfo skips and xfails due to the above test changes
- Adds Python reference OpInfos for fill, dropout, clamp, broadcast_tensors, and where

**Prims**
- adds the fill, empty_strided, and uniform prims
- removes the empty, empty_like, full, and full_like prims -- these are now references that use empty_strided and fill
- renames the "concatenate" and "select" prims to "cat" and "where", respectively, to be consistent with PyTorch
- extends the `_elementwise_meta` operation to accept tensors that don't participate in type promotion, like the `cond` tensor in `where`
- fixes a bug in the stride propagation of broadcast_in_dim
- moves some error checks from prims.cat and prims.where to refs.cat and refs.where, respectively, consistent with our new policy of doing as much error checking in the ref as possible

**Utils**
- adds the canonicalize_device, extract_shape, and extract_shape_from_varargs helpers
- adds the elementwise_unary_scalar_wrapper -- this allows elementwise unary operators to take and return scalar values (ex. refs.sin(1) will return .84...)

**Refs**
- adds the fill, broadcast_tensors, clamp, empty_strided, ones, zeros, and uniform references
- adds the nn.functional.dropout reference
- fixes refs.cat to handle 1D tensors with no inputs consistent with eager mode
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78026
Approved by: https://github.com/ngimel
2022-05-22 10:06:24 +00:00
Natalia Gimelshein
192aa3ad5f adds std and var refs and var prim (#77948)
Per title

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77948
Approved by: https://github.com/mruberry
2022-05-22 04:01:21 +00:00
kshitij12345
5f1b0a4f48 [primTorch] add exp2 (prim and ref), log10 (prim and ref), frac (ref) (#78046)
Adds `exp2` and `log10` to the prims (both also exist in the C++ standard library, and Intel SIMD intrinsics include `exp2`)

Adds `exp2`, `log10`, `frac` to refs with corresponding entries to OpInfo.

Tried to decompose `exp2` (before adding it as prim) as
* `exp(log(2) * x)` but it wasn't stable at large numbers.
* `pow(2, x)`, in which case there was a stride mismatch. At a cursory look, `pow` tries to preserve the stride of its first argument when possible.

Tried to decompose `log10` (before adding it as prim) as
* `log(x) / log(10)`, which passed for real dtypes but failed for complex at extremals. Probably related to https://github.com/pytorch/pytorch/issues/52332 (not 100% sure)
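The `exp2` instability is easy to reproduce (a sketch; exact ulp differences vary by backend): the rounding error in `log(2)` is amplified by `x`, so the decomposition drifts from `exp2` at large inputs.
```py
import math
import torch

x = torch.tensor([10.0, 100.0])
approx = torch.exp(math.log(2.0) * x)
exact = torch.exp2(x)
print(exact - approx)  # nonzero at large x in float32
```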
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78046
Approved by: https://github.com/mruberry
2022-05-22 03:43:54 +00:00
Edward Z. Yang
6b273444c4 Add logit ref; allow non-refs to be called in refs.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77816

Approved by: https://github.com/mruberry
2022-05-21 02:35:14 +00:00
Horace He
64b4bb4b01 Fix meta tests on norm (and relanding norm fixes) (#77930)
Had a land race with meta tests.

Will also be relanding https://github.com/pytorch/pytorch/pull/77407
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77930
Approved by: https://github.com/malfet, https://github.com/ezyang
2022-05-20 23:15:53 +00:00
kshitij12345
efb2c093fc [fix] complex type promotion (#77524)
Fixes https://github.com/pytorch/pytorch/issues/76803

Before Fix:
```python
>> a = torch.randn((2, 2), dtype=torch.float)
>> b = torch.tensor(1, dtype=torch.cdouble)
>> (a + b).dtype
torch.complex128
```

After Fix:
```python
>> a = torch.randn((2, 2), dtype=torch.float)
>> b = torch.tensor(1, dtype=torch.cdouble)
>> (a + b).dtype
torch.complex64
```

**Note**: This is a BC Breaking change.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77524
Approved by: https://github.com/anjali411, https://github.com/mruberry
2022-05-20 10:23:56 +00:00
PyTorch MergeBot
03546e9c07 Revert "Fixed type promotion semantics for native_batch_norm and native_layer_norm (#77407)"
This reverts commit 70d80fb424.

Reverted https://github.com/pytorch/pytorch/pull/77407 on behalf of https://github.com/malfet due to as it broke meta tests ( I guess due to landrace), see 70d80fb424
2022-05-20 02:31:57 +00:00
Kevin Stephano
11daf200e8 Adding activation references for celu, mish, selu, softplus, and tanh (#77473)
Adding activation references for celu, softplus, mish, selu.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77473
Approved by: https://github.com/mruberry
2022-05-20 00:47:31 +00:00
Edward Z. Yang
9e0e949484 Fix bugs, increase meta coverage
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77499

Approved by: https://github.com/mruberry
2022-05-19 21:04:57 +00:00
Horace He
70d80fb424 Fixed type promotion semantics for native_batch_norm and native_layer_norm (#77407)
Originally, when these were written, they simply used the naive strategy of "upcast all inputs to floats, and downcast all outputs back". In addition to being... not quite what the kernels did, they also didn't capture some additional semantics. Namely, that the norms (except for layer norm on CPU! cc: @ngimel) return fp32 for the mean and rstd values.

Also, folks didn't like that I wrote `native_layer_norm` in terms of `native_batch_norm`. Which is fair - so I refactored the common logic into a `normalize` function.

cc: @jansel / @bertmaher , who've been looking at lowering layer norm/batch norm.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77407
Approved by: https://github.com/bertmaher
2022-05-19 17:11:47 +00:00
Ryan Spring
3c4af1c496 [WIP] Add support for elementwise unary ops (#77807)
* Add support for `log2`, `isinf`, `zeros_like`
* Add primitives for `log2` and `is_infinite`

I left a TODO to remove the `is_infinite` prim and to implement `isinf` reference using `isfinite` and `isnan`.
We're missing `real` and `imag` ops to handle complex tensors.
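The TODO's decomposition (a sketch): a value is infinite exactly when it is neither finite nor NaN.
```py
import torch

def isinf(a):
    return torch.logical_not(torch.logical_or(torch.isfinite(a), torch.isnan(a)))

# isinf(torch.tensor([1.0, float("inf"), float("nan")]))
# -> tensor([False, True, False])
```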
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77807
Approved by: https://github.com/mruberry
2022-05-19 16:26:01 +00:00
Edward Z. Yang
88c89c9eb9 log_sigmoid_forward out support; out_wrapper_multi
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77739

Approved by: https://github.com/mruberry
2022-05-19 14:43:35 +00:00
Mike Ruberry
580a053832 [primTorch] Enforces stride metadata (#77542)
This PR...

**Filed the Following Issues**
- https://github.com/pytorch/pytorch/issues/77553
- https://github.com/pytorch/pytorch/issues/77526
- https://github.com/pytorch/pytorch/issues/77600

**Testing**
- Updates test_dtypes to no longer attempt to test the backward of sample inputs where no inputs require grad
- Adds a new test_python_reference_errors; it ensures the meta operations for references throw errors as expected
- Updates compare_tensor_meta to better handle CUDA devices, and (temporarily) restricts stride checking to the CUDA device type
- Elementwise unary and elementwise binary operators now have arbitrarily strided reference inputs
- Reference inputs for _like functions are added
- An OpInfo for torch.empty is added
- Reference inputs for torch.clone are added
- A NumPy reference for clone is added
- Adds OpInfos for refs.empty and refs.empty_like

**Prims**
- Renames the "max" and "min" prims to "maximum" and "minimum," respectively, to better conform to their ATen names
- Adds the empty, empty_like, full, and full_like prims
- Fixes the elementwise meta function's stride propagation
- Fixes clone's meta function's stride propagation
- Fixes convert_element_type's meta's stride propagation
- Adds a (temporary) _to_dtype private prim that casts a tensor while preserving its stride permutation
- Removes the _set prim comment
- Adds utils.compute_elementwise_output_strides, which computes the correct output strides for elementwise operations
- Corrects an issue where utils.make_contiguous_strides_for was creating incorrect strides for tensors with no elements (see the sketch after this list)

**References**
- Adds the empty, empty_like, full, full_like, and ones_like refs
- Extends make_elementwise_unary_reference to accept an additional callable to perform extra input validation
- Adds an extra validation function to handle refs.neg(BoolTensor)
- Updates the isfinite ref to call ones_like when appropriate
- Models Python scalar handling for elementwise binary operations
- Added a 64 dim check for the amin and amax references
- opmath is now a flag that can be set separately for cpu and CUDA
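A plausible sketch of the corrected utils.make_contiguous_strides_for mentioned under **Prims** above (illustrative): zero-length dims must not zero out the running multiplier.
```py
def make_contiguous_strides_for(shape):
    multiplier = 1
    strides = []
    for length in reversed(shape):
        strides.append(multiplier)
        # Treat zero-length dims as length 1 so strides for tensors
        # with no elements stay well-formed.
        multiplier *= length if length > 0 else 1
    return tuple(reversed(strides))

# make_contiguous_strides_for((2, 0, 3)) -> (3, 3, 1)
```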
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77542
Approved by: https://github.com/ezyang
2022-05-18 13:57:26 +00:00
Edward Z. Yang
4d3930fe8a Fix mypy lint failure
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77652

Approved by: https://github.com/SherlockNoMad
2022-05-17 15:48:48 +00:00
Natalia Gimelshein
375e21b2c6 check that flip doesn't accept repeating dimensions (#77500)
Per title.
Before this PR `flip` threw errors on invalid inputs from the ATen implementation itself, not from error checks happening in prims/refs.
We should make sure that prims/refs do all the necessary error checking (@mruberry is going to test that by moving reference error inputs testing to call meta implementations instead of real ones).
In general, most error checking should live in refs, prims meta functions should propagate the necessary properties, but they should assume that they are getting valid inputs. The checks on the inputs should happen in refs, where they can be traced to the necessary guards, or lead to RuntimeErrors during tracing.
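A sketch of a ref-level check in that spirit (illustrative, not the literal `torch._refs.flip`): validate the dims before calling into the prim.
```py
import torch

def flip_ref(a, dims):
    dims = tuple(d + a.ndim if d < 0 else d for d in dims)  # canonicalize
    if len(set(dims)) != len(dims):
        raise RuntimeError(f"dims has duplicates, got {sorted(dims)}")
    return torch.flip(a, dims)
```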
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77500
Approved by: https://github.com/mruberry
2022-05-16 19:33:14 +00:00
Mike Ruberry
64c6a89bd6 [primTorch] reshape and view (#77220)
This PR makes the following changes...

Prims
- adds as_strided
- fixes errors in flatten meta

Testing
- enables view consistency checking (which can be opted out of, see issues below)
- adds reference inputs for view, reshape, and flatten
- adds error inputs for reshape

Refs
- adds as_strided, reshape, and view
- fixes an error in the flatten ref where it was not returning self on no-op
- fixes a bug in transpose where it was not returning a view when the transposed tensor has 1 or fewer dims

Issues
- https://github.com/pytorch/pytorch/issues/77218
- https://github.com/pytorch/pytorch/issues/77216
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77220
Approved by: https://github.com/ngimel
2022-05-13 13:12:04 +00:00
Edward Z. Yang
d5ed73badd Make it possible to register decompositions to Meta key
Decompositions can be used to fill in meta support where necessary,
assuming the operations they decompose to support meta key.
This PR adds register_meta kwarg to register_decomposition that
optionally lets you register the meta to the C++ dispatch table
for meta tensors.  I use this to then get the meta function for
where and huber_loss for free.
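A hedged illustration of why this gives meta support "for free" (a sketch, not the registration API itself): a decomposition written in terms of ops that already support the meta key can simply be run on meta tensors to infer output metadata.
```py
import torch

def huber_loss_decomp(input, target, delta=1.0):
    diff = torch.abs(input - target)
    loss = torch.where(diff < delta,
                       0.5 * diff * diff,
                       delta * (diff - 0.5 * delta))
    return loss.mean()

# No real compute happens, but shapes and dtypes come out right:
out = huber_loss_decomp(torch.empty(8, device="meta"),
                        torch.empty(8, device="meta"))
print(out.shape, out.device)  # torch.Size([]) meta
```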

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77353

Approved by: https://github.com/mruberry
2022-05-12 23:20:16 +00:00
Edward Z. Yang
feaa64f7e0 Apply black lint CI to PrimTorch
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77226

Approved by: https://github.com/mruberry
2022-05-11 18:23:12 +00:00
PyTorch MergeBot
bbb1a55d9f Revert "Apply black lint CI to PrimTorch"
This reverts commit 188854eeaf.

Reverted https://github.com/pytorch/pytorch/pull/77226 on behalf of https://github.com/ezyang
2022-05-11 17:55:29 +00:00
Jane Xu
b0bd5926c9 Fix prims lint broken on trunk due to land race (#77271)
Fixes upstream lint errors https://hud.pytorch.org/minihud?name_filter=Lint%20/%20lintrunner
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77271
Approved by: https://github.com/seemethere, https://github.com/malfet
2022-05-11 17:45:56 +00:00
Edward Z. Yang
0a14a4c280 Register prims as operators.
This makes prims look as if they were defined in native_functions.yaml,
but they're still all written in Python. You now need to give a full
schema string for your prims. The returned prim object is now a
torch.ops.prim overload (prims are not allowed to be overloaded,
so we return the overload, not the overload packet, for speed).
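A roughly analogous flow using the public `torch.library` API (a sketch; the prims use their own registration path): declare the operator with a full schema string, then implement it in Python.
```py
import torch

lib = torch.library.Library("myprims", "DEF")
lib.define("add_one(Tensor self) -> Tensor")

def add_one(self):
    return self + 1

lib.impl("add_one", add_one, "CompositeExplicitAutograd")

print(torch.ops.myprims.add_one(torch.tensor([1.0, 2.0])))  # tensor([2., 3.])
```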

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77117

Approved by: https://github.com/mruberry, https://github.com/albanD
2022-05-11 16:38:14 +00:00
Edward Z. Yang
188854eeaf Apply black lint CI to PrimTorch
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77226

Approved by: https://github.com/mruberry
2022-05-11 16:37:19 +00:00
Kevin Stephano
752d496c91 Fix broadcast_in_dim support in NVFuser Frontend (#76790)
This PR primarily addresses augmenting the frontend to properly support `broadcast_in_dim`. This required making a new version of `define_tensor()` that takes in the `sizes` and `strides` of input tensors in order to properly determine broadcasts.

This PR also has a fix for `python_example.py`, which broke when a new argument was added to reductions to allow the user to specify an output data type.

`define_tensor()` Interface Example:

```
fusion2 = Fusion()

input1 = torch.ones(1, 1, 4, device='cuda')
input2 = torch.ones(2, 3, 4, device='cuda')

with FusionDefinition(fusion2) as fd :
    t0 = fd.define_tensor(sizes=input1.size(), strides=input1.stride())
    t1 = fd.define_tensor(sizes=input2.size(), strides=input2.stride())

    fd.add_input(t0)
    fd.add_input(t1)

    t0_b = fd.Ops.broadcast_in_dim(t0, [2, 3, 4], [0, 1, 2])
    print("Broadcast TensorView", t0_b)
    t2 = fd.Ops.add(t0_b, t1)

    fd.add_output(t2)
```
Output of the print statement for the defined broadcast tensor:

```
Broadcast TensorView T2_l[ sbS6{1}, sbS7{1}, iS8{i2} ] DataType: float Contiguity: ttt
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76790
Approved by: https://github.com/mruberry, https://github.com/jjsjann123
2022-05-10 18:13:22 +00:00
Mike Ruberry
bb8baea932 [primTorch] flatten, squeeze, unsqueeze... (#77043)
This PR ...

Makes the following testing changes:

- Updates stride testing in test_python_reference_consistency to only check strides of dimensions with length > 1
- Creates reference inputs for reshape
- Creates reference inputs for chunk
- Extends the sample inputs for unsqueeze
- Extends the sample inputs for stack -- test_conj_view and test_neg_view are now xfailed
  - https://github.com/pytorch/pytorch/issues/77046

Makes the following architecture changes:
- Adds the refs.special (sub)module
- Adds the refs.nn.functional (sub)module

Adds the following prims:
- expand_dims
- view_of
- rev
- clone

Adds the following references:
  -  flatten
  - squeeze
  - unsqueeze
  - special.i0e
  - special.i1e
  - logical_or
  - logical_and
  - isclose
  - flip
  - stack
  - nn.functional.elu
  - chunk
  - clone
  - narrow

Identifies the following bugs in PyTorch today:
- https://github.com/pytorch/pytorch/issues/77054
- https://github.com/pytorch/pytorch/issues/77055

Pull Request resolved: https://github.com/pytorch/pytorch/pull/77043
Approved by: https://github.com/ngimel
2022-05-09 11:24:55 +00:00
Mike Ruberry
c031643e39 Adds decorators for Python References and extends Python Reference testing (#76945)
This PR does the following...

Tests:
- fixes test_type_promotion in test_binary_ufuncs to correctly generate scalar cpu tensors
- fixes test_python_reference_consistency to use the Python Reference's reference inputs
- extends Python reference testing to test_conj_view, test_neg_view, and test_neg_conj_view
- adds a NaN propagation sample input for elementwise unary and binary operations
- fixes the UnaryUfuncInfo class to properly register its reference inputs
- Updates the Python Reference OpInfos to skip error inputs when their behavior on scalar inputs is inconsistent with their reference operators

Code organization:
- moves elementwise type promotion functionality to prims.utils

Prims & Refs:
- fixes scalar cpu tensor handling by having them pass through broadcasting and device and shape checks
- adds two decorators, `elementwise_type_promotion_wrapper` and `out_wrapper`, the former allows for elementwise type promotion to be automated and the latter automatically adds the out kwarg and handles it properly

cc @ezyang who also had some thoughts on cpu scalar tensor handling
cc @chillee -- might want to use this new decorator as we converge decompositions and references
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76945
Approved by: https://github.com/ngimel
2022-05-07 03:42:24 +00:00
Horace He
e9f34931ef Add some shape decomps (t, transpose, rot90, stack)
Also fixes xlogy (turns out the only thing it was missing was a type cast annotation! nice!)

I also renamed `canonicalize_idx` => `canonicalize_dim` (to align with `canonicalize_dims`) and fixed a bug in it (cc: @mruberry)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76873
Approved by: https://github.com/mruberry
2022-05-06 02:40:57 +00:00
Natalia Gimelshein
1c776d209c Adds amax and amin references
Also extends reference testing to error inputs.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76855
Approved by: https://github.com/mruberry
2022-05-05 15:53:09 +00:00
Edward Z. Yang
4a11678368 Change TensorMeta to use tensor subclass.
This means it can be fed through traditional PyTorch C++ code
(although currently it does not work, as the __torch_dispatch__
implementation is stubbed to always throw an error.)
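A minimal sketch of that structure (illustrative, not the actual `TensorMeta`): a wrapper tensor subclass whose `__torch_dispatch__` is stubbed to throw.
```py
import torch

class TensorMeta(torch.Tensor):
    @staticmethod
    def __new__(cls, shape, dtype=torch.float32):
        # Carries metadata only; allocates no real storage.
        return torch.Tensor._make_wrapper_subclass(cls, shape, dtype=dtype)

    @classmethod
    def __torch_dispatch__(cls, func, types, args=(), kwargs=None):
        raise RuntimeError(f"TensorMeta does not yet support {func}")
```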

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76759

Approved by: https://github.com/mruberry
2022-05-04 23:49:47 +00:00
Edward Z. Yang
48eb8d6aad Use TorchFunctionMode to implement PrimTorch tracing context
Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76735

Approved by: https://github.com/mruberry
2022-05-04 23:49:46 +00:00
Mike Ruberry
b557e102d8 Fixes prim type promotion and updates type promotion testing
This PR fixes prim elementwise type promotion, tests elementwise binary references using `test_type_promotion` in the elementwise binary test suite, and updates that test with additional cases for float x complex and scalar type promotion.

The following issues were discovered while working on this PR:

- https://github.com/pytorch/pytorch/issues/76806
- https://github.com/pytorch/pytorch/issues/76805
- https://github.com/pytorch/pytorch/issues/76804
- https://github.com/pytorch/pytorch/issues/76803
- https://github.com/pytorch/pytorch/issues/76801

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76809
Approved by: https://github.com/ngimel
2022-05-04 17:58:10 +00:00
Mike Ruberry
ef9f56eb0b [primTorch] slice and transpose & etc.
This PR...

Adds the following prims:
- slice
- slice_in_dim
- transpose

Adds the following refs:
- cat
- permute
- transpose
- swap_axes (alias for transpose)
- tensor_split

Makes the following test improvements:
- adds reference inputs for torch.permute
- adds a NumPy reference for torch.permute
- adds reference inputs for torch.cat

Fixes the following bugs:
- adds support for scalars to the min and max prims

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76727
Approved by: https://github.com/ngimel
2022-05-04 05:38:33 +00:00
Natalia Gimelshein
c51b53d4ef [WIP] sum reference
Per title

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76714
Approved by: https://github.com/mruberry
2022-05-04 02:50:00 +00:00
Mike Ruberry
c9bd73878a adds elementwise opinfos and unary references, extends to out testing
This PR makes the following changes:

Prims:
- igamma and igammac are now correctly listed as elementwise binary operations, not elementwise unary operations
- elementwise prims now must specify their type promotion kind (this is currently unused)

Refs:
- complexhalf is now handled by opmath-style type promotion
- adds references for: abs, acos, acosh, asin, atan, ceil, cos, cosh, digamma, erf, erfinv, erfc, exp, expm1, isfinite, isnan, lgamma, log, log1p, neg, reciprocal, sign, sin, sinh, sqrt, square, tan, igamma, igammac
- adds "complex to float" and "bool to long" type promotion kinds
- updates out behavior to warn when resizing a non-empty tensor, consistent with current ops
- updates the elementwise unary reference template with type promotion

Tests:
- fixes torch.pow's OpInfo to correctly specify it only supports one scalar input, not two
- fixes elementwise binary reference inputs to not attempt generating certain tensors in complex half (for now, cc @kshitij12345)
- adds OpInfos for the following Python references: abs, acos, acosh, asin, atan, ceil, cos, cosh, digamma, erf, erfinv, erfc, exp, expm1, isfinite, isnan, lgamma, log, log1p, neg, reciprocal, round, sign, sin, sinh, sqrt, square, tan, atan2, bitwise_and, bitwise_left_shift, bitwise_or, bitwise_xor, eq, float_power, ge, gt, igamma, igammac, le, lt, maximum, minimum, mul, ne, nextafter, pow, sub, true_divide

Pull Request resolved: https://github.com/pytorch/pytorch/pull/76647
Approved by: https://github.com/ngimel
2022-05-02 14:23:05 +00:00
Mike Ruberry
f6bbecf8b5 Adds python ref consistency test, elementwise unary reference inputs, and formats test files
Per title.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76626
Approved by: https://github.com/ngimel
2022-05-01 22:42:46 +00:00
Mike Ruberry
fe1968dea0 [primTorch] Prototype nvFuser integration and test_prims.py
This adds prototype nvFuser integration for the following prims:

- broadcast_in_dim
- convert_element_type
- add
- div
- ge
- gt
- le
- lt
- mul

Adding it for additional prims supported by nvFuser's prototype Python frontend should be easy.

This also adds a new sugar to run operations using the ATen or nvFuser trace executors. For example:

```
def foo(a, b):
  return torch.add(a, b)

traced_foo = make_traced(foo)

a = torch.randn((1, 2, 3, 4, 5), device='cuda')
b = torch.randn((1, 2, 3, 4, 5), device='cuda')
result = traced_foo(a, b, executor='nvfuser')
```

Currently only operations with tensor inputs and one tensor output are supported, and the operation must be composed exclusively of reference or prim operations.

Finally, this adds a new test, test_prims.py, that just tests the broadcast_in_dim prim for now. In the future we'll likely have OpInfos for each prim, but we'll need a reference implementation of broadcast_in_dim to make that interesting.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76560
Approved by: https://github.com/ngimel
2022-04-29 02:02:25 +00:00
Mike Ruberry
4048d4cdd2 [primTorch] Prototype tracer and elementwise unary reference opinfo class
Adds a prototype tracer with no caching support and the `ElementwiseUnaryPythonRefInfo` class. A reference for `floor` is added to test the latter, and the elementwise binary reference inputs are extended to also return noncontiguous inputs. The SampleInput transform operation has been updated to return an actual SampleInput instead of a tuple to facilitate uniform handling of (transformed) SampleInputs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76388
Approved by: https://github.com/ngimel
2022-04-27 14:40:21 +00:00
Mike Ruberry
28c3e0f77c Initial prims, references, and test architecture for them (#75095)
Summary:
This PR adds an initial set of experimental primitive operations and Python references that reimplement existing PyTorch operations using them. See https://dev-discuss.pytorch.org/t/tracing-with-primitives-update-0/577 for additional context.

The following experimental primitives are added:

- Elementwise unary prims -- abs, acos, acosh, asin, atan, cos, cosh, bessel_i0e, bessel_i1e, cbrt, ceil, digamma, erf, erf_inv, erfc, exp, expm1, floor, igamma, igammac, is_finite, lgamma, log, log1p, neg, reciprocal, round, sign, sinh, sqrt, square, tan.
- Elementwise binary prims -- add, atan2, bitwise_and, bitwise_not, bitwise_or, bitwise_xor, div, eq, ge, gt, le, lt, max, min, mul, ne, nextafter, pow, rsqrt, shift_left, shift_right_arithmetic
- View prims -- broadcast_in_dim, collapse_view, split_dim, squeeze
- Shape prims -- collapse, concatenate, reshape
- Conditional prims -- select
- Data conversion & movement prims -- convert_element_type, device_put
- Inplace prims -- copy_to, resize

These primitives do not add any new functionality to PyTorch, but are intended to be the semantic building blocks for reference operators. We have tried to make them consistent with the operations in [jax.lax](https://jax.readthedocs.io/en/latest/jax.lax.html) where possible (because PyTorch prefers being consistent with other frameworks), although there are key differences between these prims and operations in jax.lax. Most notable is that these prims model view semantics and in-place operations.

In addition to these primitives the following elementwise binary Python references are added:

- Elementwise binary Python references -- add, atan2, bitwise_and, bitwise_left_shift, bitwise_or, bitwise_right_shift, bitwise_xor, eq, float_power, ge, gt, le, lt, maximum, minimum, mul, ne, nextafter, pow, sub, true_divide
- Conditional Python references - where
- Data conversion & movement references - copy_to

A Python reference implements the same behavior as its corresponding PyTorch operator (excepting slight numerical differences, bug fixes, and in some cases additional features).

The start of an OpInfo-based test architecture for these references is also included in this PR. A new list, `python_ref_db`, is added to `common_methods_invocations.py`. This list introduces the new `ElementwiseBinaryPythonRefInfo`, which inherits input arguments from the original operators' OpInfo, allows them to be overridden, and then constructs the OpInfo for the Python reference using the (potentially modified) arguments. OpInfo-based tests can opt-into testing references by including this new list in the Sequence passed to the `ops` decorator.

cc ngimel csarofeen kevinstephano Lezcano

Pull Request resolved: https://github.com/pytorch/pytorch/pull/75095

Reviewed By: ngimel

Differential Revision: D35888004

Pulled By: mruberry

fbshipit-source-id: 21e77c4456c2a02113367d4bdae168a3a2f33f25
(cherry picked from commit 1d5bcfa99d4e8cf36f60642803a0bfca50e2ea4e)
2022-04-25 09:57:20 +00:00