Add an `mps_ops_modifier` function that adds `unittest.expectedFailure` decorators to the operators that are supposed to fail on MPS.
This lets one know whether or not an operation will fail, rather than skipping it.
For example:
```
% python test_mps.py -v -k test_output_match_dot
test_output_match_dot_cpu_float32 (__main__.TestConsistencyCPU) ... ok
test_output_match_dot_cpu_int16 (__main__.TestConsistencyCPU) ... ok
test_output_match_dot_cpu_int32 (__main__.TestConsistencyCPU) ... ok
test_output_match_dot_cpu_int64 (__main__.TestConsistencyCPU) ... expected failure
test_output_match_dot_cpu_uint8 (__main__.TestConsistencyCPU) ... ok
----------------------------------------------------------------------
Ran 5 tests in 0.175s
OK (expected failures=1)
```
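A minimal sketch of what such a modifier can look like (the `XFAILLIST` entry shown is illustrative; `DecorateInfo` is the existing OpInfo decoration helper):
```python
import unittest
import torch
from torch.testing._internal.common_methods_invocations import DecorateInfo

# Ops expected to fail on MPS, mapped to the failing dtypes
# (None would mean "all dtypes", matching the DecorateInfo semantics).
XFAILLIST = {
    'dot': [torch.int64],
}

def mps_ops_modifier(ops):
    # Attach expectedFailure so the test still runs and is reported as
    # "expected failure" instead of being skipped.
    for op in ops:
        if op.name in XFAILLIST:
            op.decorators = list(op.decorators) + [
                DecorateInfo(unittest.expectedFailure, dtypes=XFAILLIST[op.name]),
            ]
    return ops
```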
Moved a few functions from the blocklist to xfail, and found out that some of the functions in the list actually work, for example `torch.long`.
Also, allow `None` to be used in `ALLOWLIST` instead of specifying all types explicitly (which aligns with the `DecorateInfo` semantics).
Eventually, we should get rid of `ALLOWLIST` (i.e. all ops are allowed), keep a small `BLOCKLIST`, and move the rest to `XFAILLIST`.
Add a step to print HW/SW info before running MPS tests.
Fix type promotion in `trace_mps_out`
Introduce `MACOS_12_X_XFAILLIST` and skip almost every function for `torch.uint8`, although some of those don't make much sense and feel like a regression from PyTorch-1.13.
Re-enabled MPS testing on macOS 12, as runners seem to be available again.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95045
Approved by: https://github.com/albanD
Fixes #91694, fixes #92615
Several transpositions were missing in the backward graph in the case of `batch_first=True`. Issue #91694 does not reproduce with `batch_first=False`.
After fixing the transpose issue, I finally thought that now I can use LSTM freely in my project. And then I got horrific results during training. Seems related to #92615.
After that I decided to fix LSTM's backward step completely. I collected all my findings in this thread; it seems I succeeded.
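For reference, a minimal configuration that exercises the previously broken path (shapes here are arbitrary):
```python
import torch

# LSTM with batch_first=True: its backward graph was missing transpositions.
lstm = torch.nn.LSTM(8, 16, num_layers=2, batch_first=True).to("mps")
x = torch.randn(4, 5, 8, device="mps", requires_grad=True)  # (batch, seq, feature)
out, _ = lstm(x)
out.sum().backward()  # gradients should now match the CPU implementation
```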
Funny enough, backward tests were completely disabled before and were not passing:
```python
@unittest.skipIf(True, "Backward of lstm returns wrong result")
def test_lstm_2(self, device="mps", dtype=torch.float32):
```
UPD: the forward pass of the multi-layer version was also wrong due to incorrect `initState, initCell` slices. Tests were passing because the states were initialized with zeros. *Accidentally* fixed this too.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95137
Approved by: https://github.com/jhavukainen, https://github.com/kulinseth, https://github.com/soulitzer
Fixes #94390
Apart from fixing the issue above, this PR also fixes a bug where, when an input tensor can be sliced, a sliced array view is created. This array view appears to be non-writable or to have different storage from the original tensor, causing incorrect results with the in-place `fill`.
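A hypothetical illustration of the failure mode (the exact shapes from the issue may differ):
```python
import torch

# An in-place fill on a slice must write through to the original storage;
# previously the sliced array view could end up detached from it.
x = torch.zeros(4, 4, device="mps")
x[1:3].fill_(1.0)
print(x)  # rows 1 and 2 should be all ones
```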
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95113
Approved by: https://github.com/kulinseth
Previously, the "can slice" flag in Placeholder constructor in `OperationUtils.mm` is conditioned on whether the numbers of dimensions of base shape and view shape are the same. This doesn't consider the situation that a view tensor could be the base tensor's sliced and then unsqueezed version, resulting in different num of dims.
For example, if we want to stack `y_mps` and `x_mps` on the last dim:
```
t_mps = torch.tensor([1, 2, 3, 4], device="mps")
x_mps = t_mps[2:] # [3, 4]
y_mps = t_mps[:2] # [1, 2]
res_mps = torch.stack((y_mps, x_mps), dim=-1)
```
the kernel will unsqueeze both of them on the last dim and then concatenate them, which is equivalent to:
```
res_mps = torch.cat((y_mps.unsqueeze(-1), x_mps.unsqueeze(-1)), dim=-1)
```
`x_mps.unsqueeze(-1)` is an unsqueezed and contiguous tensor with a storage offset; tensors of this kind should be sliceable without cloning their storage.
Fixes #87856, fixes #91065
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91071
Approved by: https://github.com/kulinseth
Fixes backward pass for bilinear.
Summary of changes:
- The bilinear op is able to produce **contiguous, non-view** tensors with a storage offset, such as shape=`[1, 1, 1, 1]`, `storage_offset=12` (see the sketch after this list). This seems like a weird case, but it is valid, and for tensors of this type we wouldn't be able to gather/scatter since we look at the view flag (which is not set here). This change looks at `storage_offset` only, rather than the is_view flag, which is not being set.
- **reduction sum** must return a zeroed-out output if passed an input with 0 elements (e.g. a shape of `(0, 5)`).
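A sketch of the kind of tensor involved (note that `as_strided` below yields a view, so this only illustrates the shape/offset combination; bilinear produces it on a tensor whose view flag is not set, which is exactly why the old check missed it):
```python
import torch

base = torch.arange(16.0, device="mps")
# Contiguous [1, 1, 1, 1] tensor reading from storage offset 12:
t = base.as_strided((1, 1, 1, 1), (1, 1, 1, 1), storage_offset=12)
print(t.is_contiguous(), t.storage_offset())  # True 12
```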
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94892
Approved by: https://github.com/kulinseth
- The backward pass has to provide an explicit bias tensor of zeros if none is passed to the op, or the bias gradient will not be calculated.
- Fixed the bias tensor mistakenly getting overwritten to zeros
- Fixes a crash when the lstm op is called with `has_biases` set to false. The change takes into account the changed shape of the input params TensorList depending on the bias flag.
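A minimal sketch of the previously crashing call:
```python
import torch

# Without biases, the flattened params TensorList contains no bias
# tensors, so its layout differs; the MPS path now accounts for that.
lstm = torch.nn.LSTM(4, 8, num_layers=1, bias=False).to("mps")
x = torch.randn(3, 2, 4, device="mps")
out, (h, c) = lstm(x)
out.sum().backward()
```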
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94889
Approved by: https://github.com/DenisVieriu97
- To check for memory leaks in `test_mps.py`, set the env variable `PYTORCH_TEST_MPS_MEM_LEAK_CHECK=1` when running test_mps.py (the CUDA code was used as a reference).
- Added support for the following new python interfaces in MPS module:
`torch.mps.[empty_cache(), set_per_process_memory_fraction(), current_allocated_memory(), driver_allocated_memory()]`
- Renamed `_is_mps_on_macos_13_or_newer()` to `_mps_is_on_macos_13_or_newer()`, and `_is_mps_available()` to `_mps_is_available()` to be consistent in naming with prefix `_mps`.
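A usage sketch of the memory interfaces listed above:
```python
import torch

torch.mps.set_per_process_memory_fraction(0.5)  # cap MPS allocations
print(torch.mps.current_allocated_memory())     # bytes held by live tensors
print(torch.mps.driver_allocated_memory())      # bytes allocated by the driver
torch.mps.empty_cache()                         # release cached blocks
```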
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94646
Approved by: https://github.com/malfet
- This PR is a prerequisite for the upcoming Memory Leak Detection PR.
- Enable global manual seeding via `torch.manual_seed()` + test case
- Add `torch.mps.synchronize()` to wait for MPS stream to finish + test case
- Enable the following python interfaces for MPS:
`torch.mps.[get_rng_state(), set_rng_state(), synchronize(), manual_seed(), seed()]`
- Added some test cases in test_mps.py
- Added `mps.rst` to document the `torch.mps` module.
- Fixed the failure with `test_public_bindings.py`
Description of new files added:
- `torch/csrc/mps/Module.cpp`: implements `torch._C` module functions for `torch.mps` and `torch.backends.mps`.
- `torch/mps/__init__.py`: implements Python bindings for `torch.mps` module.
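A usage sketch of the RNG and synchronization interfaces enabled above:
```python
import torch

torch.manual_seed(1234)             # global seeding now also seeds MPS
state = torch.mps.get_rng_state()   # snapshot the MPS RNG state
torch.mps.manual_seed(42)
torch.mps.set_rng_state(state)      # restore the snapshot
torch.mps.synchronize()             # wait for the MPS stream to finish
```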
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94417
Approved by: https://github.com/albanD
Fixes#87219
Implements the new `repeat_interleave` function in `aten/src/ATen/native/mps/operations/Repeat.mm`
Adds it to `aten/src/ATen/native/native_functions.yaml`
Adds a new test `test_repeat_interleave` to `test/test_mps.py`
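For example:
```python
import torch

x = torch.tensor([1, 2, 3], device="mps")
print(torch.repeat_interleave(x, 2))
# tensor([1, 1, 2, 2, 3, 3], device='mps:0')
print(x.repeat_interleave(torch.tensor([3, 1, 2], device="mps")))
# tensor([1, 1, 1, 2, 3, 3], device='mps:0')
```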
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88649
Approved by: https://github.com/kulinseth
Summary:
- Remove redundant bool casts from scatter/gather
- Make the workarounds for scatter/gather (for bool/uint8 data types) OS-specific: use them only on macOS Monterey, and ignore them starting with macOS Ventura
- Make all tensors ranked in scatter
Fixes the following tests:
```
test_output_match_slice_scatter_cpu_bool
test_output_match_select_scatter_cpu_bool
test_output_match_diagonal_scatter_cpu_bool
test_output_match_repeat_cpu_bool
test_output_match_rot90_cpu_bool
etc..
```
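For instance, a sketch of one of the newly passing bool cases:
```python
import torch

base = torch.zeros(8, dtype=torch.bool, device="mps")
src = torch.ones(2, dtype=torch.bool, device="mps")
# Writes src into base[6:8] out of place:
print(torch.slice_scatter(base, src, start=6))
```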
Still failing on macOS Monterey (needs additional investigation):
```
test_output_match_scatter_cpu_bool
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94464
Approved by: https://github.com/kulinseth
- Also fix FP16 correctness issues in several other ops by lowering their FP16 test precision via the new `FP16_LOW_PRECISION_LIST`.
- Add atol/rtol to the `assertEqual()` calls of the Gradient tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94567
Approved by: https://github.com/kulinseth
Fixes batchnorm forward/backward pass and layer_norm:
Batchnorm Forward pass:
```
- fix batch_norm_mps_out key
- return 1/sqrt(var+epsilon) instead of var
- return empty tensor for mean and var if train is not enabled
- remove native_batch_norm from block list
```
Batchnorm Backward pass:
```
- add revert calculation for save_var used in the backward path
- add backward test for native_batch_norm and _native_batch_norm_legit
```
Layer norm:
```
- remove the duplicate calculation from layer_norm_mps
- enable native_layer_norm backward test
- raise atol rtol for native_layer_norm
```
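A consistency sketch for the fixed forward pass (the tolerances here are assumptions):
```python
import torch

bn = torch.nn.BatchNorm2d(3)
x = torch.randn(4, 3, 8, 8)
y_cpu = bn(x)
bn_mps = torch.nn.BatchNorm2d(3).to("mps")
bn_mps.load_state_dict(bn.state_dict())
torch.testing.assert_close(y_cpu, bn_mps(x.to("mps")).cpu(), atol=1e-5, rtol=1e-5)
```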
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94351
Approved by: https://github.com/razarmehr
Calculate the nonzero count directly in the nonzero op.
Additionally, synchronize before entering the nonzero op to make sure all previous operations have finished (the output shape is allocated based on the count_nonzero result).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94442
Approved by: https://github.com/kulinseth
- fix num_output_dims calculation
- fix median_out_mps key
- cast the tensors sent to sortWithTensor and argSortWithTensor
- note down the same issue for unique
- remove median from the blocklist
- add test_median_int16 test
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94489
Approved by: https://github.com/razarmehr
- Fix wrong results in AvgPool2D when `count_include_pad=True`
- Fix issues with adaptive average and max pool2d
- Remove the redundant blocking copies from `AdaptiveMaxPool2d`
- Add `divisor` to cached string key to avoid conflicts
- Add a test case where both `ceil_mode` and `count_include_pad` are True (previously failing).
- Clean up redundant code
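A sketch of the previously failing combination:
```python
import torch

pool = torch.nn.AvgPool2d(3, stride=2, padding=1,
                          ceil_mode=True, count_include_pad=True)
x = torch.randn(1, 1, 7, 7)
# MPS should now match the CPU result:
torch.testing.assert_close(pool(x), pool(x.to("mps")).cpu())
```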
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94348
Approved by: https://github.com/kulinseth
Skip gather/blit calls in case of strided output - this prevents:
- allocating additional memory for the output
- additional transpose for both the input and output
Fixes:
```
x = torch.rand((256,10), device='mps')
x = x.permute(1,0)
x.exp()
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94260
Approved by: https://github.com/razarmehr
Fixes TestConsistency masked_fill for the bool data type.
Casting a value > 1 to MPSDataTypeBool will result in 0 instead of 1. This change manually casts the scalar to a value of 0 or 1 when casting a non-boolean tensor to a boolean tensor:
```
(inputDataType == MPSDataTypeBool) ? !!value.to<double>() : value.to<double>()
```
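An illustration of the underlying cast behavior the fix targets (the failing test itself was masked_fill with a bool destination):
```python
import torch

x = torch.tensor([0, 1, 2, 3], device="mps")
# Values > 1 previously ended up as False on MPS:
print(x.to(torch.bool))  # expected, matching CPU: [False, True, True, True]
```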
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94263
Approved by: https://github.com/razarmehr
There are cases where the arrayViewTensor API cannot be used to solve the view operations, such as when a view dimension is bigger than the base dimension of the tensor, e.g.:
```
base shape: [1, 768, 512, 2] // we cannot slice the base shape in any way to result in first dimension `2`
view shape: [2, 384, 512, 1]
```
In such cases, we need to fall back to the gather code (which detects that this is a slice followed by a reshape) to solve the issue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94278
Approved by: https://github.com/razarmehr
- Fix correctness issues with nll_loss_backward(), smooth_l1_loss_backward() and cross_entropy_backward() by taking grad_output into account when computing those loss ops
- Add numel()==0 check to prevent crashes
- Clean up and formatting
Pull Request resolved: https://github.com/pytorch/pytorch/pull/94226
Approved by: https://github.com/kulinseth
Attempts to fix #92656
BC-breaking! This changes the default of `zero_grad` in optim and in nn to set grads to `None` instead of zero tensors. We are changing the default because there are proven perf wins and existing code has typically not regressed due to this change.
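A short sketch of the new default behavior:
```python
import torch

m = torch.nn.Linear(2, 2)
m(torch.randn(1, 2)).sum().backward()
opt = torch.optim.SGD(m.parameters(), lr=0.1)
opt.zero_grad()                   # now equivalent to zero_grad(set_to_none=True)
print(m.weight.grad)              # None
opt.zero_grad(set_to_none=False)  # explicit opt-in to the old zeroing behavior
```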
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92731
Approved by: https://github.com/ngimel
Fixes https://github.com/pytorch/pytorch/issues/86975
If the destination is a strided MPS tensor and the source is a CPU tensor, we cannot perform a blit directly to copy the memory from the CPU tensor into the MPS tensor. We need to scatter the data into the right indices.
```
a1 = torch.Tensor([[1,2],[3,4], [5,6]]).to(torch.device("mps"))
b1 = torch.Tensor([-1, -1])
a1[1:,1] = b1 # strided MPS destination / contiguous CPU source
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91784
Approved by: https://github.com/kulinseth
Currently, most of the reduction ops flatten the input tensor to 1D to perform the operation.
This change removes the flattening of the tensors / the unranked placeholders and adds support for multiple axes in all the reduction ops.
- Fixes reduction ops with correctness and shape issues.
- Fixes masked.argmax / masked.argmin. When inf is passed to argmax / argmin, MPS will return nan as the index for those values. Casting this nan to Long turns it into -1. This change avoids negative values by clamping them to 0 (matching CPU results).
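A sketch of the multi-axis reductions this enables:
```python
import torch

x = torch.randn(2, 3, 4, device="mps")
print(x.sum(dim=(0, 2)))                 # shape: (3,)
print(x.amax(dim=(1, 2), keepdim=True))  # shape: (2, 1, 1)
```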
TestConsistency issues fixed:
```
std
var
amax
amin
sum
prod
mean
count_nonzero
masked.amax
masked.amin
masked.mean
masked.prod
masked.std
masked.sum
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91734
Approved by: https://github.com/kulinseth
- Fixed the memory leak with `malloc()`
- Introduced shortened data type strings (optional) to avoid getting extra-long cached graph string keys with ops such as cat_out()
- Fixed data type issues in Monterey
- Removed the unused `use_scalar_value` argument from `getTensorsStringKey()`
- Clean up and refactoring
Fixes #89353
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91786
Approved by: https://github.com/kulinseth
- Workaround for MaxPool when ceilMode=true
- Workaround for ChannelsLast memory format
- Workaround for divisor_override in AvgPool ops
- Enabled count_include_pad parameter for AvgPool
- Refactoring and clean up of duplicate code
- Enable MaxPool tests in TestConsistency
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91519
Approved by: https://github.com/kulinseth, https://github.com/malfet
- Implemented the following new ops: `upsample_nearest1d_backward`, `upsample_nearest_exact1d`, `upsample_nearest_exact1d_backward`
- Moved Upsample code from Shape.mm to Upsample.mm
- Fallback to CPU for nearest mode on Monterey
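A sketch exercising the new 1D nearest-exact path, forward and backward:
```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 3, 8, device="mps", requires_grad=True)
y = F.interpolate(x, scale_factor=2, mode="nearest-exact")  # upsample_nearest_exact1d
y.sum().backward()                  # exercises upsample_nearest_exact1d_backward
```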
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91669
Approved by: https://github.com/malfet
Fixes copies into slices where the input data type is different from the output dtype.
This change removes the cast done before scatter, so we don't have to allocate additional memory to perform the casting. Scatter handles the casting directly now:
device = "mps"
shape = (4, 4)
tensor = torch.randint(10, shape, device=device)
tensor_before = tensor.clone()
res = torch.empty(shape[0], shape[1] * 2, device=device)[:, ::2].copy_(tensor)
torch.testing.assert_close(tensor, tensor_before)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91197
Approved by: https://github.com/razarmehr
Use Prims to implement group_norm, group_norm_backward and mean_var.
Use `torch._ops.ops` instead of `torch.ops` in numerous subpackages in order to make them importable from `torch/backends/mps/__init__.py`, since that alias, defined in 15af4b1cee/torch/__init__.py (L1095), is executed last during the init process.
Add `__all__` to `torch/backends/mps/__init__.py`, and alias all imports as private.
Add `TestNNMPS.test_group_norm_backward`, which validates that no NaNs are generated during the backward pass.
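A regression sketch along the lines of that test:
```python
import torch

x = torch.randn(2, 6, 4, 4, device="mps", requires_grad=True)
y = torch.nn.functional.group_norm(x, num_groups=3)
y.sum().backward()
assert not x.grad.isnan().any(), "group_norm backward produced NaNs"
```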
Fixes https://github.com/pytorch/pytorch/issues/88331
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91190
Approved by: https://github.com/albanD
The `multiplicationWithPrimaryTensor` and/or `scatterWithDataTensor` APIs have issues handling two f16 tensor inputs, resulting in all-zero outputs. There are issues with int16 or int64 inputs as well.
This PR conditionally casts inputs to f32 if they aren't already, and then casts the output back to the source's data type.
Fixes #82645.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88542
Approved by: https://github.com/kulinseth
Preparation for the next PR in this stack: #89559.
I replaced
- `self.assertTrue(torch.equal(...))` with `self.assertEqual(..., rtol=0, atol=0, exact_device=True)`,
- the same for `self.assertFalse(...)` with `self.assertNotEqual(...)`, and
- `assert torch.equal(...)` with `torch.testing.assert_close(..., rtol=0, atol=0)` (note that we don't need to set `check_device=True` here since that is the default).
There were a few instances where the result of `torch.equal` was used directly. In those cases I've replaced it with `(... == ...).all().item()`, sometimes also dropping the `.item()` depending on the context.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/89527
Approved by: https://github.com/mruberry
## Summary ⚡
**Aim**: Add support for aten::median for the MPS backend (Fixes #87220)
This is a fresh, clean PR from the previous [PR](https://github.com/pytorch/pytorch/pull/88554)
- Implementing the new median function in aten/src/ATen/native/mps/operations/ReduceOps.mm
- Adding it to aten/src/ATen/native/native_functions.yaml
- Adding it to existing test_median
### **This works like this** 🪶
Median of the entire input tensor on MPS:
`torch.median(mps_inputTensor)`
Median along a dim:
`torch.median(mps_inputTensor, dim=[int], keepdim=[Bool])`
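Concretely:
```python
import torch

x = torch.tensor([[1., 5., 3.],
                  [2., 4., 6.]], device="mps")
print(torch.median(x))                    # median of the whole tensor
values, indices = torch.median(x, dim=1)  # per-row medians and their indices
```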
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88807
Approved by: https://github.com/kulinseth
Various code cleanup in MPS operations:
- Per @kulinseth's suggestion, move `mpsSupportsCumsum` to `MPSDevice.h` and rename it to `is_macos_13_or_newer()`
- Move Ventura MPSGraph new operators to `MPSGraphVenturaOps.h` header
- Use `LookupAs` and `CreateCachedGraphAs` to make code more compact
- Formatting
Pull Request resolved: https://github.com/pytorch/pytorch/pull/88529
Approved by: https://github.com/kulinseth
Fixes #86744
- Implementing the new `expm1_out_mps` function in `aten/src/ATen/native/mps/operations/UnaryOps.mm`
- Adding it to `aten/src/ATen/native/native_functions.yaml`
- Adding it to existing `test.test_mps.TestNLLLoss.test_unary_ops`
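For example:
```python
import torch

# expm1 keeps precision for small inputs, where exp(x) - 1 loses digits:
x = torch.tensor([1e-8, 0.5, -0.5], device="mps")
print(torch.expm1(x))
```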
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87147
Approved by: https://github.com/kulinseth
Enable a test that would have caught https://github.com/pytorch/pytorch/issues/86239
Prior to the fix for that bug, this test failed with
```
_____________________________ TestCommonMPS.test_numpy_ref_mps_where_mps_float32 _____________________________
Traceback (most recent call last):
File "/Users/alex/git/pytorch/test/test_ops.py", line 197, in test_numpy_ref_mps
self.compare_with_reference(
File "/Users/alex/git/pytorch/torch/testing/_internal/common_utils.py", line 2366, in compare_with_reference
actual = torch_fn(t_inp, *t_args, **t_kwargs)
File "/Users/alex/git/pytorch/torch/testing/_internal/opinfo/core.py", line 1068, in __call__
return self.op(*args, **kwargs)
File "/Users/alex/git/pytorch/torch/testing/_internal/common_methods_invocations.py", line 15167, in <lambda>
op=lambda self, condition, other: torch.where(condition, self, other),
RuntimeError: 0'th index 3 of x tensor does not match the other tensors
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/87342
Approved by: https://github.com/albanD
Tensor's view in linear storage is represented by the following parameters: `.shape`, `.stride()` and `.storage_offset()`.
Only tensors that are representable as 1d-views can be copied from host to device (and vice versa) using single [`copy(from:sourceOffset:to:destinationOffset:size:)`](https://developer.apple.com/documentation/metal/mtlblitcommandencoder/1400767-copyfrombuffer?language=objc) call.
Modify `copy_to_mps_` function to do the following steps:
- Cast `src` tensor to dst data type if needed
- Expand `src` tensor to `dst` tensor shape
- Clone `src` tensor if it is not stride-contiguous (i.e. cannot be represented by `src.view(src.numel())`)
- Create an empty tensor if `dst` is not stride-contiguous or if its strides are different from the potentially cloned `src` strides
- Do a 1d copy of `src` to the (potentially temporary) `dst`
- Finally, do re-striding/copy on MPS if needed
Add a test to cover the cases where a stride-contiguous permuted tensor is copied to MPS, a non-stride-contiguous tensor is copied to MPS, and a permuted CPU tensor is copied to a differently permuted MPS tensor
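A sketch of one of the covered cases (shapes are illustrative):
```python
import torch

src = torch.randn(3, 4).t()                # permuted, stride-contiguous CPU view
dst = torch.empty(3, 4, device="mps").t()  # permuted MPS destination
dst.copy_(src)
torch.testing.assert_close(dst.cpu(), src)
```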
Fixes https://github.com/pytorch/pytorch/issues/86954
Pull Request resolved: https://github.com/pytorch/pytorch/pull/86956
Approved by: https://github.com/kulinseth
Also, make sure it raises catchable errors if invoked with integral types.
Otherwise, it used to fail with the following fatal error when invoked for `torch.half`, and with similar errors when invoked for integral types:
```
loc("mps_multiply"("(mpsFileLoc): /AppleInternal/Library/BuildRoots/4883e71d-37bd-11ed-b0ef-b25c5e9b9057/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShadersGraph/mpsgraph/MetalPerformanceShadersGraph/Core/Files/MPSGraphUtilities.mm":228:0)): error: input types 'tensor<2xf16>' and 'tensor<1xf32>' are not broadcast compatible
LLVM ERROR: Failed to infer result type(s).
```
Modified `test_gelu_simple` to check both forward and backward gradients for gelu.
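A sketch of both fixed behaviors (the exact error text is environment-dependent):
```python
import torch

# Previously hit the MPSGraph broadcast error for torch.half:
x = torch.randn(4, device="mps", dtype=torch.half, requires_grad=True)
torch.nn.functional.gelu(x).sum().backward()
# Integral inputs now raise a catchable error instead of aborting:
try:
    torch.nn.functional.gelu(torch.ones(4, dtype=torch.int32, device="mps"))
except RuntimeError as e:
    print("raised:", e)
```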
Fixes #82566, #80800
- Handle mps->cpu casts from a smaller dtype to a bigger dtype, and mps->mps casts from a smaller/bigger dtype to another dtype in the case of scatter
- For mps->cpu copies where we don't have a source/destination offset, we can save the cast result directly into the destTensor, so we can skip the additional overhead of the blit.
- In case we can return the data without doing the blit, we need to check if it's a blocking call, in which case we'd need a `synchronize(SyncType::COMMIT_AND_WAIT)` call (previously this was done by the blit).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/84928
Approved by: https://github.com/razarmehr
Due to an indentation error, the return statement happened after just one iteration of `for test_size in test_sizes`, so only one shape was ever tested.
This also revealed several cases where the provided shapes don't work, so I've disabled the generation of those sample inputs.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/84452
Approved by: https://github.com/Lezcano, https://github.com/zou3519
Follow up:
- ~Remove non-float dtypes from allow-list for gradients~
- ~Map dtypes to short-hand so there aren't so many lines, i.e. float16 should be f16.~
- ~There were a lot of linting issues that flake8 wouldn't format for me, so I reformatted with black. This makes the diff a little trickier to parse.~
Observations:
- there are entries in the allow-list that weren't there before
- some forwards that were previously passing now fail with requires_grad=True
- because the allow-list does not know about variants, a special skip was added for that in the block list
Pull Request resolved: https://github.com/pytorch/pytorch/pull/84242
Approved by: https://github.com/kulinseth, https://github.com/malfet
Fixes https://github.com/pytorch/pytorch/issues/82543, https://github.com/pytorch/pytorch/issues/83230
The current Placeholder code relies on finding a gather graph in order to make the data contiguous; otherwise we'll try calling into tensor.contiguous() directly, which, for slice elements, won't do anything.
E.g. consider the following basic case where we index a 2-element tensor:
```
tensor_list = torch.tensor([1.2, 1.0], device="mps")
for scalar in tensor_list:
r_mps = torch.ceil(scalar)
r_cpu = torch.ceil(scalar.to("cpu"))
self.assertEqual(r_mps.cpu(), r_cpu)
```
The second element 1.0 is a contiguous view tensor (similar to slicing), but it has no gather graph created behind it. In the Placeholder, we won't be able to find the graph, thus relying on the fallback case where we call `_tensor = src.contiguous();`. For an already contiguous tensor, this won't do anything, so we end up creating the NDArray with all the values of the tensor (1.2 and 1.0 instead of just 1.0). Doing a clone instead of contiguous will actually perform a blit behind the scenes and take the view's storage_offset into consideration when performing the copy.
Similarly, the following basic case is also failing because of this issue:
```
x = torch.tensor([1.0, 0.49], device="mps")
print(x) # prints 1.0 and 0.0
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/83744
Approved by: https://github.com/razarmehr
* Add scatter support for view operations; #78074, #78886, #79672
* Update test_slicing_replace_column to properly test different sizes
* Handle in-place changes for binary ops; add new testcase
* Add new view ops testing scatter; add MPSDebugConfig.h config file for debugging purposes
* Merge gatherViewTensor and scatterViewTensor into a generic function
* Add scatter on demand in scatterViewOperation instead of caching it into a generic graph
* Create separate graphs for scatter and gather;
* Create scatter graph at scatter time
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79939
Approved by: https://github.com/razarmehr
Which is, in essence, a composite of `eq`->`all`->`item`.
`native/mps/operators/Equal.cpp` is an almost verbatim copy of `native/cuda/Equal.cpp`
Fix codegen by generating MPSFunctions headers
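For example:
```python
import torch

a = torch.tensor([1, 2, 3], device="mps")
b = torch.tensor([1, 2, 3], device="mps")
print(torch.equal(a, b))      # True
print((a == b).all().item())  # the composite it boils down to
```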
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80195
Approved by: https://github.com/albanD
For some reason, tensor *op* scalar does not follow the normal binary promotion rules, so cast the output tensor to the expected type if needed.
It seems that one should have cast the input tensors to the expected output tensor type, but that does not really work for boolean binary ops, so...
Add the output tensor type/shape to the cached graph key.
Extend `TestMPS.test_add_scalars` to test for this regression.
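A sketch of the expected promotion behavior (the exact cases from the issue may differ):
```python
import torch

x = torch.tensor([1.0, 2.0], device="mps", dtype=torch.half)
print((x + 1).dtype)     # torch.float16, not float32
b = torch.tensor([True, False], device="mps")
print((b | True).dtype)  # torch.bool
```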
Fixes #79835
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80220
Approved by: https://github.com/albanD
By passing the `storage_offset` of the source and destination Tensors.
This fixes the following simple use case:
```
python3 -c "import torch;x=torch.zeros(3, 3, device='mps'); x[1, 1]=1;print(x)"
```
Add a test to validate that it will not regress in the future.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78428
Approved by: https://github.com/kulinseth