A test was failing in inductor (`test_pointwise_zeta`), and I realized the operation was also missing from eager. Implemented it for both, leveraging the same kernel. Happy to split this into two PRs (one for eager, one for inductor) if folks prefer.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/146465
Approved by: https://github.com/malfet
This enforces the invariant that every backend implements the same set of ops and removes a layer of indirection for BasicMathOps.
Interestingly, this is a small compile-time win:
```
...
WIN: benchmark ('add_loop_inductor', 'compile_time_instruction_count') failed, actual result 30151159301 is -6.13% lower than expected 32120000000 ±1.50% please update the expected results.
please update all results that changed significantly, and not only the failed ones
PASS: benchmark ('add_loop_inductor_dynamic_gpu', 'compile_time_instruction_count') pass, actual result 44447549162 -1.69% is within expected 45210000000 ±2.50%
WIN: benchmark ('add_loop_inductor_gpu', 'compile_time_instruction_count') failed, actual result 26743557195 is -2.25% lower than expected 27360000000 ±1.50% please update the expected results.
please update all results that changed significantly, and not only the failed ones
PASS: benchmark ('basic_modules_ListOfLinears_eager', 'compile_time_instruction_count') pass, actual result 945129734 +0.93% is within expected 936400000 ±1.50%
WIN: benchmark ('basic_modules_ListOfLinears_inductor', 'compile_time_instruction_count') failed, actual result 18984384503 is -3.19% lower than expected 19610000000 ±1.50% please update the expected results.
please update all results that changed significantly, and not only the failed ones
WIN: benchmark ('basic_modules_ListOfLinears_inductor_gpu_force_shape_pad', 'compile_time_instruction_count') failed, actual result 17258025389 is -1.94% lower than expected 17600000000 ±1.50% please update the expected results.
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/146235
Approved by: https://github.com/shunting314
ghstack dependencies: #146225, #146226
- Add a `threadgroup_sum` template to `c10/metal/reduction_utils.h` that so far uses a barrier to compute the reduction (see the sketch after the TODO list below)
TODOs:
- Implement efficient reduction using cooperative functions such as `simd_shuffle_down`
- Figure out how to merge several sum reductions together
- Implement `reduction_store` that will only write results from the first thread
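For context, here is a minimal sketch of the barrier-based approach, assuming a power-of-two threadgroup size and a caller-provided scratch buffer; the name and signature are illustrative, not the actual `reduction_utils.h` code:
```metal
#include <metal_stdlib>
using namespace metal;

// Tree reduction over threadgroup memory: each step halves the number of
// active threads, and a barrier makes the partial sums visible to all.
template <typename T>
T threadgroup_sum_sketch(threadgroup T* scratch, T val, uint tid, uint size) {
    scratch[tid] = val;
    threadgroup_barrier(mem_flags::mem_threadgroup);
    for (uint stride = size / 2; stride > 0; stride >>= 1) {
        if (tid < stride) {
            scratch[tid] += scratch[tid + stride];
        }
        threadgroup_barrier(mem_flags::mem_threadgroup);
    }
    return scratch[0]; // every thread observes the final sum
}
```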
Pull Request resolved: https://github.com/pytorch/pytorch/pull/146380
Approved by: https://github.com/jansel, https://github.com/dcci
ghstack dependencies: #146369, #146370
Using Philox4 as the PRNG.
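For reference, a compact sketch of the Philox4x32-10 round function with the standard constants from the Random123 paper; this is illustrative, not the shipped Metal kernel:
```cpp
#include <array>
#include <cstdint>

// One Philox4x32-10 draw: ten rounds of multiply/xor over a 4-word counter
// and a 2-word key, yielding four independent 32-bit outputs per counter.
std::array<uint32_t, 4> philox4x32(std::array<uint32_t, 4> ctr,
                                   std::array<uint32_t, 2> key) {
    constexpr uint32_t M0 = 0xD2511F53, M1 = 0xCD9E8D57; // round multipliers
    constexpr uint32_t W0 = 0x9E3779B9, W1 = 0xBB67AE85; // Weyl key increments
    for (int i = 0; i < 10; ++i) {
        uint64_t p0 = uint64_t(M0) * ctr[0];
        uint64_t p1 = uint64_t(M1) * ctr[2];
        ctr = {uint32_t(p1 >> 32) ^ ctr[1] ^ key[0], uint32_t(p1),
               uint32_t(p0 >> 32) ^ ctr[3] ^ key[1], uint32_t(p0)};
        key = {key[0] + W0, key[1] + W1};
    }
    return ctr;
}
```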
Test plan (other than CI):
Run
```python
import torch

def foo(x):
    return x * torch.randn_like(x)

foo_c = torch.compile(foo)
x = torch.ones(100, 100, device="mps")
y = foo_c(x)
print(y.mean().item(), y.std().item())
for i in range(25):
    print(y[i].mean(), y[i].std())
```
And observe that the printed means are close to 0 and the standard deviations close to 1.
TODO: Better `randint` algorithm for large ranges
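One standard approach for that TODO is rejection sampling, sketched below; `next_random_u64` is a hypothetical stand-in for the raw PRNG output (e.g. bits from Philox), not an existing function:
```cpp
#include <cstdint>

uint64_t next_random_u64(); // assumed: raw 64-bit PRNG output

// Unbiased integer in [0, range) for large ranges: reject draws that fall
// into the final partial cycle so every residue class is equally likely.
uint64_t randint_unbiased(uint64_t range) {
    const uint64_t limit = UINT64_MAX - (UINT64_MAX % range);
    uint64_t draw;
    do {
        draw = next_random_u64();
    } while (draw >= limit);
    return draw % range;
}
```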
Pull Request resolved: https://github.com/pytorch/pytorch/pull/145705
Approved by: https://github.com/dcci, https://github.com/jansel
It may later be reused from the eager op as well.
Also, I didn't know that Metal already has type_traits.
And use `metal::isunordered(a, b)` instead of `metal::isnan(a + b)`, as it is defined as a function equivalent to `a != a || b != b`, but I suspect it might have a better native implementation on specific architectures.
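For example, in a hypothetical NaN-propagating maximum (illustrative only; `a + b` is used here to produce a NaN whenever either operand is one):
```metal
#include <metal_stdlib>

// If either operand is NaN the pair is unordered, so return a NaN
// (a + b is NaN whenever either operand is); otherwise the larger value.
float nan_propagating_max(float a, float b) {
    return ::metal::isunordered(a, b) ? (a + b) : ::metal::max(a, b);
}
```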
Pull Request resolved: https://github.com/pytorch/pytorch/pull/145157
Approved by: https://github.com/dcci
`metal::isnan` is only defined for floats, so provide a generic wrapper
that is false for integral types
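A minimal sketch of such a wrapper, assuming Metal's `<metal_type_traits>` supplies `enable_if_t` and `is_floating_point_v` like their C++ counterparts (names are illustrative, not the exact PR code):
```metal
#include <metal_stdlib>

// Floating-point overload defers to the builtin.
template <typename T,
          ::metal::enable_if_t<::metal::is_floating_point_v<T>, bool> = true>
bool generic_isnan(T x) {
    return ::metal::isnan(x);
}

// Everything else (i.e. integral types) is never NaN.
template <typename T,
          ::metal::enable_if_t<!::metal::is_floating_point_v<T>, bool> = true>
bool generic_isnan(T) {
    return false;
}
```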
TODO: Figure out why type propagation is not working (or should it?)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144665
Approved by: https://github.com/dcci
Now the error message looks as follows:
```
% python ../test/inductor/test_torchinductor.py -v -k test_cat_unbacked_2d_mps
test_cat_unbacked_2d_mps (__main__.GPUTests) ... inline_call []
stats [('calls_captured', 6)]
inductor [('extern_calls', 2), ('fxgraph_cache_miss', 1)]
aot_autograd [('total', 1), ('autograd_cache_bypass', 1), ('not_ok', 1)]
ERROR
======================================================================
ERROR: test_cat_unbacked_2d_mps (__main__.GPUTests)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/Users/malfet/git/pytorch/pytorch/torch/testing/_internal/common_utils.py", line 3126, in wrapper
method(*args, **kwargs)
File "/Users/malfet/git/pytorch/pytorch/build/../test/inductor/test_torchinductor.py", line 12254, in new_test
return value(self)
File "/Users/malfet/miniconda3/lib/python3.10/contextlib.py", line 79, in inner
return func(*args, **kwds)
File "/Users/malfet/git/pytorch/pytorch/build/../test/inductor/test_torchinductor.py", line 5885, in test_cat_unbacked_2d
self.common(
File "/Users/malfet/miniconda3/lib/python3.10/contextlib.py", line 79, in inner
return func(*args, **kwds)
File "/Users/malfet/git/pytorch/pytorch/build/../test/inductor/test_torchinductor.py", line 620, in check_model_gpu
check_model(
File "/Users/malfet/git/pytorch/pytorch/build/../test/inductor/test_torchinductor.py", line 461, in check_model
actual = run(*example_inputs, **kwargs)
File "/Users/malfet/git/pytorch/pytorch/torch/_dynamo/eval_frame.py", line 580, in _fn
raise e.remove_dynamo_frames() from None # see TORCHDYNAMO_VERBOSE=1
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/compile_fx.py", line 704, in _compile_fx_inner
raise InductorError(e, currentframe()).with_traceback(
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/compile_fx.py", line 689, in _compile_fx_inner
mb_compiled_graph = fx_codegen_and_compile(
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/compile_fx.py", line 1149, in fx_codegen_and_compile
return scheme.codegen_and_compile(gm, example_inputs, inputs_to_check, graph_kwargs)
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/compile_fx.py", line 1064, in codegen_and_compile
compiled_fn = graph.compile_to_module().call
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/graph.py", line 1977, in compile_to_module
return self._compile_to_module()
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/graph.py", line 2018, in _compile_to_module
mod = PyCodeCache.load_by_key_path(
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/codecache.py", line 2768, in load_by_key_path
mod = _reload_python_module(key, path)
File "/Users/malfet/git/pytorch/pytorch/torch/_inductor/runtime/compile_tasks.py", line 51, in _reload_python_module
exec(code, mod.__dict__, mod.__dict__)
File "/var/folders/sc/2thx6_x95h7_h9qs8s48yh140000gn/T/tmpmyfz2ju8/lt/cltm34ognlgcc6oxoe6bexvtbwcdtdfgnkjj5miz7vhkemitacp7.py", line 40, in <module>
File "/var/folders/sc/2thx6_x95h7_h9qs8s48yh140000gn/T/tmpmyfz2ju8/lt/cltm34ognlgcc6oxoe6bexvtbwcdtdfgnkjj5miz7vhkemitacp7.py", line 32, in _compile_mps_shader
torch._inductor.exc.InductorError: SyntaxError: failed to compile
kernel void generated_kernel(
device float* out_ptr0,
constant float* in_ptr0,
uint xindex [[thread_position_in_grid]]
) {
long x1 = (xindex) / (3);
auto tmp0 = x1;
auto tmp1 = static_cast<long>(tmp0);
auto tmp2 = 0;
auto tmp3 = tmp1 >= tmp2;
auto tmp4 = 2;
auto tmp5 = tmp1 < tmp4;
long x0 = (xindex) % (3);
auto tmp6 = in_ptr0[x0 + 3*(x1)];
auto tmp7 = tmp5 ? tmp6 : 0.0;
auto tmp8 = tmp1 >= tmp4;
auto tmp9 = 2 + ks0;
auto tmp10 = static_cast<long>(tmp9);
auto tmp11 = tmp1 < tmp10;
auto tmp12 = 1.0;
auto tmp13 = tmp8 ? tmp12 : 0.0;
auto tmp14 = tmp5 ? tmp7 : tmp13;
long x2 = xindex;
out_ptr0[x2] = static_cast<float>(tmp14);
}
with program_source:18:25: error: use of undeclared identifier 'ks0'
auto tmp9 = 2 + ks0;
^
Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information
You can suppress this exception and fall back to eager by setting:
import torch._dynamo
torch._dynamo.config.suppress_errors = True
To execute this test, run the following from the base repo dir:
python test/inductor/test_torchinductor.py GPUTests.test_cat_unbacked_2d_mps
This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0
----------------------------------------------------------------------
Ran 1 test in 0.472s
FAILED (errors=1)
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144649
Approved by: https://github.com/Skylion007, https://github.com/jansel, https://github.com/dcci
ghstack dependencies: #144647, #144648
Just pass them (the dynamic shape sizevars, such as `ks0` in the error above) as kernel arguments.
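Illustratively (a hypothetical kernel, not the exact generated code), a dynamic size like `ks0` now arrives as an explicit argument instead of being referenced as an undeclared identifier:
```metal
#include <metal_stdlib>

// ks0 is bound from the host at dispatch time rather than appearing
// undeclared in the kernel body, as in the error above.
kernel void generated_kernel(
    device float* out_ptr0,
    constant float* in_ptr0,
    constant long& ks0,
    uint xindex [[thread_position_in_grid]]
) {
    out_ptr0[xindex] = in_ptr0[xindex % (2 + ks0)];
}
```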
After this change, `pytest test/inductor/test_torchinductor.py -v -k _mps` reports 330 failed, 429 passed, compared to 335 failed, 424 passed before.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144662
Approved by: https://github.com/jansel