Commit Graph

372 Commits

Author SHA1 Message Date
PyTorch MergeBot
2699f5410b Revert "[xpu][feature] Integrate OneDNN SDPA training forward/backward into XPU OVERRIDEABLE Backend (#162454)"
This reverts commit fd68d409ad.

Reverted https://github.com/pytorch/pytorch/pull/162454 on behalf of https://github.com/atalman due to internal build failure ([comment](https://github.com/pytorch/pytorch/pull/162454#issuecomment-3475009089))
2025-10-31 21:58:52 +00:00
fengqing.lu
fd68d409ad [xpu][feature] Integrate OneDNN SDPA training forward/backward into XPU OVERRIDEABLE Backend (#162454)
This is the second PR split from https://github.com/pytorch/pytorch/pull/156272

Pull Request resolved: https://github.com/pytorch/pytorch/pull/162454
Approved by: https://github.com/guangyey, https://github.com/EikanWang, https://github.com/drisspg
2025-10-31 11:20:38 +00:00
Michael Lazos
e8d887ae3f [user-streams] Support streams as contexts (#164507)
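A minimal sketch of the pattern this enables under `torch.compile` (illustrative only; assumes a CUDA device and that stream objects act as context managers, as the title suggests):
```python
import torch

@torch.compile
def f(x):
    s = torch.cuda.Stream()
    with s:  # ops issued inside the block run on `s`, not the default stream
        y = x + 1
    torch.cuda.current_stream().wait_stream(s)  # re-sync before using `y`
    return y

print(f(torch.ones(4, device="cuda")))
```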
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164507
Approved by: https://github.com/williamwen42
ghstack dependencies: #162903, #164343, #164344
2025-10-29 04:46:08 +00:00
linhaifeng
695cb0d342 [2/N][Fix] Fix typo in test folder (#166374)
Fix typo in test folder.

_typos.toml
```toml
[default.extend-words]
nd = "nd"
arange = "arange"
Nd = "Nd"
GLOBALs = "GLOBALs"
hte = "hte"
iy = "iy"
PN = "PN"
Dout = "Dout"
optin = "optin"
gam = "gam"
PTD = "PTD"
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/166374
Approved by: https://github.com/cyyever, https://github.com/ezyang
2025-10-29 03:02:07 +00:00
can-gaa-hou
c201a1cab1 [OpenReg] Update Installation in README.md (#166235)
It is recommended to use `python -m pip install --no-build-isolation .` instead of `pip3 install --no-build-isolation .` because most of us work inside a virtual environment, and the latter may resolve to the system `pip3` rather than the conda or uv one. Using `python -m pip` keeps the installation consistent with the Python interpreter we actually use, and it is also consistent with how `torch` is installed.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/166235
Approved by: https://github.com/fffrog, https://github.com/ezyang
2025-10-29 02:57:26 +00:00
KarhouTam
13413b3b07 [AMP][Refactor] Autocast dtype handling to simplify device-specific c… (#165221)
This PR refactors the autocast context manager in `autocast_mode.py` to simplify and centralize the logic for checking supported dtypes for each device. The previous implementation repeated similar checks for multiple device types. Now, a single mapping, `device_supported_dtypes`, associates device types with their supported dtypes, and the validation logic is unified.
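
A minimal sketch of the centralized shape of the check (illustrative only; names follow the description above, not the exact source):
```python
import warnings

import torch

# Illustrative mapping from device type to its autocast-supported dtypes;
# the real `device_supported_dtypes` table in autocast_mode.py covers more devices.
device_supported_dtypes = {
    "cpu": (torch.bfloat16, torch.float16),
    "cuda": (torch.bfloat16, torch.float16),
}

def _validate_autocast_dtype(device_type, dtype):
    supported = device_supported_dtypes.get(device_type, ())
    if dtype not in supported:
        # Mirrors the warning asserted in the openreg test below.
        warnings.warn(
            f"In {device_type} autocast, but the target dtype {dtype} is not "
            "supported. Disabling autocast."
        )
        return False
    return True
```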

**The former PR #163446 was merged but reverted due to failing CI on `openreg`-related tests.**

This PR additionally makes slight modifications to some test assertions so that the CI tests pass. CI previously failed because an assertion expected the exact same error message. For example:
```
File "/var/lib/jenkins/workspace/test/cpp_extensions/open_registration_extension/torch_openreg/tests/test_autocast.py", line 9, in test_autocast_with_unsupported_type
    with self.assertWarnsRegex(
        AssertionError: "In openreg autocast, but the target dtype torch.float32 is not supported." does not match "In openreg autocast, but the target dtype is not supported. Disabling autocast."
```

Sorry for the inconvenience again.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165221
Approved by: https://github.com/albanD
2025-10-28 06:21:29 +00:00
FFFrog
1d13c314b3 [OpenReg] Remove the Unnecessary Fallback Implementation for AutogradPrivate1 (#165316)
As the title stated.

The fallback for AutogradPrivateUse1 is built into PyTorch, so there is no need to register a general implementation for out-of-tree backends.
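
For context, this is roughly the kind of generic registration that out-of-tree backends no longer need (an illustrative sketch, not the exact code removed):
```cpp
#include <torch/library.h>

// Out-of-tree backends previously registered a generic fallthrough like this;
// PyTorch now ships an equivalent built-in fallback for AutogradPrivateUse1.
TORCH_LIBRARY_IMPL(_, AutogradPrivateUse1, m) {
  m.fallback(torch::CppFunction::makeFallthrough());
}
```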
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165316
Approved by: https://github.com/ezyang, https://github.com/albanD
ghstack dependencies: #165315
2025-10-25 01:27:27 +00:00
PyTorch MergeBot
7773a22cdb Revert "[AMP][Refactor] Autocast dtype handling to simplify device-specific c… (#165221)"
This reverts commit 4be1e3bf92.

Reverted https://github.com/pytorch/pytorch/pull/165221 on behalf of https://github.com/clee2000 due to I think this broke test_openreg [GH job link](https://github.com/pytorch/pytorch/actions/runs/18698271058/job/53322459496) [HUD commit link](4be1e3bf92) note to self: bad TD ([comment](https://github.com/pytorch/pytorch/pull/165221#issuecomment-3430012693))
2025-10-22 00:26:57 +00:00
KarhouTam
4be1e3bf92 [AMP][Refactor] Autocast dtype handling to simplify device-specific c… (#165221)
This PR refactors the autocast context manager in `autocast_mode.py` to simplify and centralize the logic for checking supported dtypes for each device. The previous implementation repeated similar checks for multiple device types. Now, a single mapping, `device_supported_dtypes`, associates device types with their supported dtypes, and the validation logic is unified.

**The former PR #163446 was merged but reverted due to failing CI on `openreg`-related tests.**

This PR additionally makes slight modifications to some test assertions so that the CI tests pass. CI previously failed because an assertion expected the exact same error message. For example:
```
File "/var/lib/jenkins/workspace/test/cpp_extensions/open_registration_extension/torch_openreg/tests/test_autocast.py", line 9, in test_autocast_with_unsupported_type
    with self.assertWarnsRegex(
        AssertionError: "In openreg autocast, but the target dtype torch.float32 is not supported." does not match "In openreg autocast, but the target dtype is not supported. Disabling autocast."
```

Sorry for the inconvenience again.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165221
Approved by: https://github.com/FFFrog, https://github.com/albanD
2025-10-21 21:32:12 +00:00
PyTorch MergeBot
62a263b8d4 Revert "Widen ops support to take in IntHOArrayRef vs only std::vec (#165152)"
This reverts commit e4454947e2.

Reverted https://github.com/pytorch/pytorch/pull/165152 on behalf of https://github.com/clee2000 due to breaking internal tests D84961075 ([comment](https://github.com/pytorch/pytorch/pull/164991#issuecomment-3423058017))
2025-10-20 17:26:42 +00:00
Jane Xu
e4454947e2 Widen ops support to take in IntHOArrayRef vs only std::vec (#165152)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165152
Approved by: https://github.com/mikaylagawarecki
ghstack dependencies: #164991
2025-10-17 18:32:39 +00:00
PyTorch MergeBot
816fb7f48d Revert "Enable ruff rule E721 (#165162)"
This reverts commit 9e7c19f72b.

Reverted https://github.com/pytorch/pytorch/pull/165162 on behalf of https://github.com/pytorch-auto-revert due to Reverted automatically by pytorch's autorevert, to avoid this behaviour add the tag autorevert: disable ([comment](https://github.com/pytorch/pytorch/pull/165162#issuecomment-3393328271))
2025-10-11 13:25:40 +00:00
Yuanyuan Chen
9e7c19f72b Enable ruff rule E721 (#165162)
`E721` flags object type comparisons that use `==` and other comparison operators. This is useful because `is` is the recommended way to compare types.
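
For example (illustrative):
```python
def same_type(a, b):
    # Flagged by E721: type equality via `==`.
    # return type(a) == type(b)

    # Preferred: identity comparison, since type objects are singletons;
    # use isinstance() instead when subclasses should also match.
    return type(a) is type(b)
```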

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165162
Approved by: https://github.com/Skylion007
2025-10-11 06:43:53 +00:00
zeshengzong
77354e22e1 [OpenReg] Add AMP Integration guide for accelerators (#162050)
Fix part of #158917

Add an AMP integration document, using OpenReg code as an example to explain the integration steps.
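
A minimal sketch of what the documented integration enables for an out-of-tree device (here the `openreg` backend used throughout this repo; illustrative only):
```python
import torch

# Once AMP is wired up for the backend, the standard autocast API applies:
with torch.autocast(device_type="openreg", dtype=torch.float16):
    a = torch.randn(8, 8, device="openreg")
    b = torch.randn(8, 8, device="openreg")
    c = a @ b  # matmul runs in the autocast dtype, where supported
```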
Pull Request resolved: https://github.com/pytorch/pytorch/pull/162050
Approved by: https://github.com/albanD

Co-authored-by: FFFrog <ljw1101.vip@gmail.com>
2025-09-30 12:27:11 +00:00
Klaus Zimmermann
50d418f69f Replace setup.py bdist_wheel with python -m build --wheel (#156712)
Previously, we already replaced most uses of `python setup.py develop/install`.

This PR also replaces the use of `setup.py bdist_wheel` with the modern `python -m build --wheel` alternative.
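
The replacement in its simplest form (a sketch; assumes the `build` package is installed, e.g. via `python -m pip install build`):
```bash
# Old, deprecated direct setup.py invocation
python setup.py bdist_wheel

# New PEP 517 build frontend
python -m build --wheel
```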

Pull Request resolved: https://github.com/pytorch/pytorch/pull/156712
Approved by: https://github.com/atalman
ghstack dependencies: #156711
2025-09-29 21:51:32 +00:00
Nikita Shulga
8f32adc90a [MPSHooks] Release pending command encoder (#164093)
Release the pending command encoder before returning a command buffer, as subsequent callers are very likely to allocate their own encoder, which otherwise results in the following runtime error:
```
 tryCoalescingPreviousComputeCommandEncoderWithConfig:nextEncoderClass:]:1090: failed assertion `A command encoder is already encoding to this command buffer'
```

Added regression test to `test_mps_extension`

Please note that `torch::mps::get_command_buffer()` should be called with the dispatch queue held, both before and after this change, though many implementations skip that.

Fixes https://github.com/pytorch/pytorch/issues/163721
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164093
Approved by: https://github.com/atalman, https://github.com/Skylion007
2025-09-29 17:50:12 +00:00
can-gaa-hou
22d5f5ff94 [OpenReg][BE] Replacing explicit prefix/suffix with CMake variables (#163850)
As the title states, suffixes like `.dylib` and `.so` can be replaced by `CMAKE_SHARED_LIBRARY_SUFFIX`, and prefixes like `lib` can be replaced by `CMAKE_SHARED_LIBRARY_PREFIX` on Unix or `CMAKE_IMPORT_LIBRARY_PREFIX` on Windows.
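
For illustration, composing a platform-correct library file name from these variables (a sketch; the `openreg` target name follows this repo):
```cmake
# Expands to libopenreg.so on Linux and libopenreg.dylib on macOS,
# with the platform-appropriate prefix/suffix on Windows.
set(OPENREG_LIBRARY_FILE
    "${CMAKE_SHARED_LIBRARY_PREFIX}openreg${CMAKE_SHARED_LIBRARY_SUFFIX}")
```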

Pull Request resolved: https://github.com/pytorch/pytorch/pull/163850
Approved by: https://github.com/albanD
2025-09-25 16:33:16 +00:00
FFFrog
0bca77951d [Code Clean] Remove deadcodes about Python3.9 [2/N] (#163627)
As the title stated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163627
Approved by: https://github.com/jansel
ghstack dependencies: #163626
2025-09-24 07:30:50 +00:00
KarhouTam
375f3e3a61 [OpenReg][Docs] Correct docs about openreg usage example. (#163235)
## Why this PR?
I've tried to follow the guidance of the `OpenReg` [usage example](https://github.com/pytorch/pytorch/tree/main/test/cpp_extensions/open_registration_extension/torch_openreg/third_party/openreg) and found that the command for compiling `example.cpp` (`g++ -o out example/example.cpp -L ./build -lopenreg`) is not compatible with my `gcc` (v11.4).

Since I installed my `gcc` through `apt install build-essential`, which I think is a common way for developers to install `gcc`, I believe it's necessary to slightly modify the command to add `-I ./` to explicitly indicate the header file search path.

## What I've changed?
- I added `-I ./` to correctly search for `./include/openreg.h`.
- I also added a `pwd` comment for better readability and removed unused imports in `example/example.cpp`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163235
Approved by: https://github.com/FFFrog, https://github.com/albanD

Co-authored-by: Jiawei Li <ljw1101.vip@gmail.com>
2025-09-23 06:16:45 +00:00
Pearu Peterson
8abc2af9b9 [STABLE ABI] Add clone method to torch::stable::Tensor (#161896)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161896
Approved by: https://github.com/janeyx99
2025-09-22 20:39:24 +00:00
Pearu Peterson
f9074c7332 [STABLE ABI] Add copy_ operation. (#161895)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161895
Approved by: https://github.com/janeyx99
2025-09-20 10:30:33 +00:00
FFFrog
a94ddd9b00 [OpenReg] Fix the docs of Accelerator Integration (#162826)
----

- Fixed the redirect link about step 1
- Formatted the autoload and added necessary links
Pull Request resolved: https://github.com/pytorch/pytorch/pull/162826
Approved by: https://github.com/albanD
ghstack dependencies: #161917, #161918, #160101
2025-09-12 23:53:17 +00:00
FFFrog
29f84b0f61 [OpenReg] Improve the Event and Stream capabilities of DeviceGuardImplInterface (#160101)
**Changes:**

- Based on `OpenRegStream` and `OpenRegEvent`, we improve the implementation of Device Guard for `OpenReg`
- Add some related testcases
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160101
Approved by: https://github.com/albanD
ghstack dependencies: #161917, #161918
2025-09-12 23:53:17 +00:00
FFFrog
27daa6af6a [OpenReg] Strengthen Openreg's execution limits to minimize the waste of computing resources (#161918)
Currently, OpenReg supports Linux, Windows, and OS X, ensuring stability and ease of integration with third-party devices across all three platforms. It also doesn't rely on any other accelerators (such as CUDA or MPS).

Therefore, to minimize computational resource usage, `test_openreg` can be added to certain BLOCKLISTS to prevent its execution, limiting OpenReg's execution to only necessary scenarios.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161918
Approved by: https://github.com/albanD
ghstack dependencies: #161917
2025-09-12 23:53:17 +00:00
FFFrog
9b429846e8 [OpenReg] Migrate OpenReg Tests from tests/test_openreg.py into torch_openreg/tests (#161917)
**Background:**

Almost all the tests in `test/test_openreg.py` are designed for `torch_openreg`, so placing these testcases in the test directory is not a good idea. Instead, they should be moved to the `tests` directory under `torch_openreg`, co-locating these tests with their corresponding functional logic.

**How to do:**

So how do we verify the quality of the third-party device integration mechanism?
We will maintain a `test_openreg` entrypoint in `test/run_test.py`.

This entrypoint will install `torch_openreg` and run all the testcases located in `torch_openreg`. As long as all testcases pass, we can guarantee that the out-of-tree backend integration mechanism is available.

**Next:**

We will also improve `torch_openreg`'s test coverage in the future.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161917
Approved by: https://github.com/albanD
2025-09-12 23:53:17 +00:00
Dmitry Rogozhkin
ee53ad2dd0 xpu: test py_limited_api with SyclExtension (#162546)
This commit extends the existing CUDA test to cover the XPU SyclExtension case for the same feature, `py_limited_api`. It required a fix for XPU to install some ATen header files (#145902), which was resolved after the merge of #159621.
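
A sketch of the setup.py side being exercised (illustrative; the extension name and source file are hypothetical, and `SyclExtension` is assumed to accept `py_limited_api` like its CUDA counterpart):
```python
from setuptools import setup
from torch.utils.cpp_extension import BuildExtension, SyclExtension

setup(
    name="xpu_pylimited_ext",  # hypothetical extension name
    ext_modules=[
        SyclExtension(
            "xpu_pylimited_ext",
            ["extension.cpp"],
            py_limited_api=True,  # build against the CPython limited API
        ),
    ],
    cmdclass={"build_ext": BuildExtension},
    # Tag the wheel as abi3 so one binary serves many CPython versions.
    options={"bdist_wheel": {"py_limited_api": "cp39"}},
)
```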

See: https://github.com/pytorch/pytorch/issues/145902
Requires: https://github.com/pytorch/pytorch/pull/159621
Requires: https://github.com/intel/torch-xpu-ops/pull/1743

CC: @guangyey, @EikanWang

Pull Request resolved: https://github.com/pytorch/pytorch/pull/162546
Approved by: https://github.com/guangyey, https://github.com/EikanWang, https://github.com/janeyx99
2025-09-12 21:57:01 +00:00
can-gaa-hou
95191522e0 [OpenReg] Implement device autoload mechanism (#158555)
# Implement OpenReg device autoload mechanism

## Overview
The **Autoload** mechanism in PyTorch simplifies the integration of third-party device backends by enabling automatic discovery and initialization at runtime. Traditionally, integrating a new backend required explicit imports or manual initialization, which could be cumbersome and error-prone. With Autoload, PyTorch dynamically detects and initializes device backends, providing a seamless user experience.

This mechanism leverages Python entry points (e.g., `torch.backends`) and dynamic module loading. When PyTorch starts, it scans for registered entry points and invokes their initialization hooks, ensuring that all available backends are ready for use without requiring explicit imports.
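
Concretely, the packaging side of the mechanism looks roughly like this (a sketch; the `torch.backends` entry-point group and `_autoload` hook name follow the autoload proposal referenced below):
```python
# setup.py of the out-of-tree backend (illustrative sketch)
from setuptools import setup

setup(
    name="torch_openreg",
    entry_points={
        # PyTorch scans this group at import time and invokes the hook,
        # so users no longer need to `import torch_openreg` themselves.
        "torch.backends": [
            "torch_openreg = torch_openreg:_autoload",
        ],
    },
)
```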

## Motivation

This PR aims to apply [device autoload mechanism](https://github.com/pytorch/pytorch/issues/122468) to the OpenReg module with some simple changes.

## Change
### Before
```python
import torch
import torch_openreg

x = torch.tensor([1, 2, 3], device="openreg")
print(x)
```
### After
```python
import torch

# No need to import torch_openreg manually!
x = torch.tensor([1, 2, 3], device="openreg")
print(x)
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/158555
Approved by: https://github.com/FFFrog, https://github.com/albanD

Co-authored-by: Jiawei Li <ljw1101.vip@gmail.com>
2025-09-12 04:24:11 +00:00
FFFrog
b93f87d67b [OpenReg] Integrate Event&Stream from OpenReg Backend into PyTorch (#160100)
We integrated the openreg backend's `Stream` and `Event` into PyTorch; both are similar to those of other accelerators such as `CUDA`, `XPU`, etc.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160100
Approved by: https://github.com/albanD
ghstack dependencies: #161603, #160099, #161773
2025-08-30 13:21:28 +00:00
FFFrog
6284881b2a [OpenReg] Add tests of device and memory for OpenReg (#161773)
As the title stated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161773
Approved by: https://github.com/albanD
ghstack dependencies: #161603, #160099
2025-08-30 13:21:28 +00:00
FFFrog
aae9cbb6c0 [OpenReg] Add Event&Stream Support for OpenReg Backend (#160099)
Referring to the signatures and functions of `Stream` and `Event` in CUDA, we use CPU multithreading and condition variables to implement equivalent capabilities as the underlying foundation of torch_openreg.
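
A minimal sketch of the underlying idea — an event built from a mutex and a condition variable (illustrative, not the torch_openreg source):
```cpp
#include <condition_variable>
#include <mutex>

// Host-side "event": record() marks it signaled, wait() blocks until then.
class FakeEvent {
 public:
  void record() {
    std::lock_guard<std::mutex> lk(m_);
    signaled_ = true;
    cv_.notify_all();
  }
  void wait() {
    std::unique_lock<std::mutex> lk(m_);
    cv_.wait(lk, [this] { return signaled_; });
  }

 private:
  std::mutex m_;
  std::condition_variable cv_;
  bool signaled_ = false;
};
```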

**Changes:**

- Add stream capabilities for OpenReg
- Add event capabilities for OpenReg
- Add kernel launch entrypoint for OpenReg
- Add testcases about stream and event for OpenReg
- Add example for OpenReg
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160099
Approved by: https://github.com/albanD
ghstack dependencies: #161603
2025-08-30 13:21:21 +00:00
FFFrog
dad2e50ac5 [OpenReg] Rename cpu_fallback_blacklist to cpu_fallback_blocklist (#161603)
As the title stated.

Related Infos: https://github.com/pytorch/pytorch/pull/158644#discussion_r2301460839
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161603
Approved by: https://github.com/albanD
2025-08-30 13:21:15 +00:00
Jane Xu
63632fc7ee Add new_zeros dtype variant to the shim and as a stable op (#161597)
In case we want this before 2.9
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161597
Approved by: https://github.com/mikaylagawarecki
2025-08-28 13:57:24 +00:00
Mikayla Gawarecki
d3d9eb4777 Error when TORCH_STABLE_ONLY is defined in TensorBase.h (#161658)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/161658
Approved by: https://github.com/albanD
2025-08-28 04:36:31 +00:00
FFFrog
ec21cafd85 [OpenReg] Refactor and Optimize the OpenReg for Preparation of Docs (#159640)
As the title stated.

**Changes:**

- Fixed a bug where abs_stub could not be triggered
- Refactor registration to prepare for documentation
- Add meta, fallback for openreg
Pull Request resolved: https://github.com/pytorch/pytorch/pull/159640
Approved by: https://github.com/albanD
2025-08-26 01:44:21 +00:00
FFFrog
56ebed627a [OpenReg] Add OSX/Windows Support for OpenReg (#159441)
As the title stated.

**Changes:**

- Abstract platform-specific APIs
- Add OSX/Windows support
- Set default symbol visibility to "hidden"

Co-authored-by: @can-gaa-hou

Original PR: https://github.com/pytorch/pytorch/pull/159029
Pull Request resolved: https://github.com/pytorch/pytorch/pull/159441
Approved by: https://github.com/albanD

Co-authored-by: jiahaochen666 <jiahaochen535@gmail.com>
2025-08-25 08:03:27 +00:00
Mikayla Gawarecki
78a8e6a671 Add new_empty (with dtype argument only) to torch::stable (#159508)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/159508
Approved by: https://github.com/janeyx99
ghstack dependencies: #160557
2025-08-20 00:50:42 +00:00
Jane Xu
8f766d6839 Add ScalarType -> shim conversion, add stable::Tensor.scalar_type (#160557)
TL;DR: Moving to ScalarType in user extensions and removing deprecated dtypes.

This change _modifies_ the from/to behavior between ScalarType and StableValue! Whereas before, user extensions could only pass around opaque dtypes appearing as `int32_t`s, now users can confidently use `torch::headeronly::ScalarType` in their extensions for the major scalar types. This PR enables ABI stability by adding a translation layer through the shim, so that even if the ScalarType enum values change in the future, user extensions need not fear.

Then we add a Tensor scalar_type API which reuses the from/to logic to return to the user a nice ScalarType (vs an abstracted int32_t).
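
A sketch of what extension code can now write (the include paths are assumptions, not confirmed by this log):
```cpp
// Hypothetical include paths for the stable Tensor and header-only ScalarType.
#include <torch/csrc/stable/tensor.h>
#include <torch/headeronly/core/ScalarType.h>

// Branch on a real enum rather than an opaque int32_t dtype code.
bool is_float32(const torch::stable::Tensor& t) {
  return t.scalar_type() == torch::headeronly::ScalarType::Float;
}
```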

I then changed the test to test the scalar_type API.

This code change required some refactoring because of circular dependencies.

## BC Breaking note
This commit is (narrowly) BC-breaking for unpopular dtypes: `quint*`s, `qint*`s, `Bits*`, `dummy_uint*`s, `dummy_int*`s, `Float8_e8m0fnu`, and `Float4_e2m1fn_x2` in the narrow use case where an extension retrieves a Tensor dtype of the above and passes it into `aoti_torch_call_dispatcher`. As of now, I believe there are 0 users of this use case, so the benefits of this change significantly justify BC-breaking this API.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/160557
Approved by: https://github.com/mikaylagawarecki, https://github.com/malfet
2025-08-19 22:13:47 +00:00
Sam Anklesaria
0a5ab612dd Port amax to stable ABI (#160214)
To enable porting torchaudio to the stable ABI, we need the `amax` operation to be accessible. This PR ports the op and adds tests verifying that it behaves correctly.
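
A sketch of the intended call shape (hypothetical signature; the actual stable `amax` overload may differ):
```cpp
#include <torch/csrc/stable/ops.h>     // stable ops header, per this log
#include <torch/csrc/stable/tensor.h>  // assumed header for stable Tensor

// Hypothetical: reduce to the maximum along dim 0, not keeping the dim.
torch::stable::Tensor column_max(const torch::stable::Tensor& t) {
  return torch::stable::amax(t, /*dim=*/0, /*keepdim=*/false);
}
```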

Pull Request resolved: https://github.com/pytorch/pytorch/pull/160214
Approved by: https://github.com/mikaylagawarecki
2025-08-19 17:24:53 +00:00
Sam Anklesaria
c0a1ae4404 Add is_cpu method to stable tensor type (#160212)
Porting torchaudio to use the stable API requires the `is_cpu` and `dtype` functions. It would be more convenient if these were methods of the stable tensor class rather than utilities one needed to call from the C API. This PR adds them as methods, mirroring how `is_cuda` and `get_device` are already defined.
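
A sketch of the resulting method-style call (header path assumed):
```cpp
#include <torch/csrc/stable/tensor.h>  // assumed header for stable Tensor

// A method on the stable Tensor itself, instead of a free C-API utility.
bool resides_on_cpu(const torch::stable::Tensor& t) {
  return t.is_cpu();
}
```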
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160212
Approved by: https://github.com/janeyx99
2025-08-18 17:42:43 +00:00
Mikayla Gawarecki
50a8c11875 Add getCurrentDeviceIndex to torch::stable::accelerator (#160453)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160453
Approved by: https://github.com/janeyx99
ghstack dependencies: #159679
2025-08-13 23:42:24 +00:00
Mikayla Gawarecki
e4e4dbd2f8 Add beginnings of torch::stable::accelerator (#159679)
Adds
- `torch::stable::accelerator::DeviceGuard`: `std::unique_ptr` to `DeviceGuardOpaque`, mostly copied from the below (but made generic)

   50eac811a6/torch/csrc/inductor/aoti_runtime/utils_cuda.h (L30-L46)
    - constructor `DeviceGuard(DeviceIndex)` (**this matches aoti but differs from the actual c10 DeviceGuard constructor, which takes in a device**)
    - `set_index(DeviceIndex)`
- `torch::stable::accelerator::Stream`: `std::shared_ptr` to `StreamOpaque`
     - constructor `Stream(StreamHandle stream)` (similar to torch::stable::Tensor)
     - `id() -> StreamId`

- `getCurrentStream(DeviceIndex device_index) -> stable::accelerator::Stream`
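
A usage sketch of the API listed above (header path, namespace aliasing, and the index type are assumptions):
```cpp
#include <torch/csrc/stable/accelerator.h>  // assumed header for this API

namespace acc = torch::stable::accelerator;

void do_work_on(int32_t device_index) {  // DeviceIndex in the real API
  // RAII: switch to `device_index`; the previous device is restored on exit.
  acc::DeviceGuard guard(device_index);
  acc::Stream stream = acc::getCurrentStream(device_index);
  (void)stream.id();  // the StreamId of that device's current stream
}
```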

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159679
Approved by: https://github.com/guangyey, https://github.com/janeyx99
2025-08-13 23:42:24 +00:00
Jane Xu
355462e127 Add stable Tensor get_device_index, use more stable DeviceIndex (#160143)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/160143
Approved by: https://github.com/mikaylagawarecki
2025-08-13 03:27:10 +00:00
Mikayla Gawarecki
4d419a7461 Add pad and narrow to torch/csrc/stable/ops.h (#159328)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/159328
Approved by: https://github.com/janeyx99
ghstack dependencies: #159507
2025-08-12 21:29:49 +00:00
Mikayla Gawarecki
655137b678 Update torch::stable::Tensor() default constructor (#159507)
Allows things like

```cpp
Tensor cu_seqlens_q;
if (...) {
   cu_seqlens_q = ...
}
...
```

Also adds `torch::stable::Tensor.defined()`

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159507
Approved by: https://github.com/janeyx99
2025-08-12 21:29:49 +00:00
can-gaa-hou
c03a734ba1 [OpenReg] Disable automatic inclusion of data files (#159845)
# Background

After I built torch_openreg, I noticed that the wheel package contained the `stub.c` file under the `csrc` directory, which is not used at runtime.

# Motivation

This PR aims to remove `stub.c` and any other files that are unused when running torch_openreg.

**Changes:**

- Setting **include_package_data** keyword to false in the setup function
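
The relevant `setup()` call, as a sketch:
```python
from setuptools import find_packages, setup

setup(
    name="torch_openreg",
    packages=find_packages(),
    # Don't auto-bundle tracked non-Python files (e.g. csrc/stub.c) into
    # the wheel; only explicitly declared package data is included.
    include_package_data=False,
)
```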

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159845
Approved by: https://github.com/albanD
2025-08-06 10:35:13 +00:00
Mikayla Gawarecki
e65ab9a868 Enable generating generic c_shim that doesn't bypass dispatcher (#158974)
Adds `c_shim_aten.{h/cpp}` and uses this for `fill_`

This is the generated `c_shim_aten.cpp` for reference

```cpp

// WARNING: THIS FILE IS AUTOGENERATED BY torchgen. DO NOT MODIFY BY HAND.
// See 7e86a7c015/torchgen/gen.py (L2424-L2436) for details

// This file corresponds to the aten_shimified_ops list in torchgen/aoti/fallback_ops.py

#include <torch/csrc/inductor/aoti_torch/generated/c_shim_aten.h>
#include <torch/csrc/inductor/aoti_torch/utils.h>

#ifndef AT_PER_OPERATOR_HEADERS
#include <ATen/Functions.h>
#include <ATen/CompositeExplicitAutogradFunctions.h>
#include <ATen/CompositeExplicitAutogradNonFunctionalFunctions.h>
#include <ATen/CompositeImplicitAutogradFunctions.h>
#else
#include <ATen/ops/fill.h>

#endif // AT_PER_OPERATOR_HEADERS

using namespace torch::aot_inductor;

AOTITorchError aoti_torch_aten_fill__Scalar(AtenTensorHandle self, double value) {
    AOTI_TORCH_CONVERT_EXCEPTION_TO_ERROR_CODE({
        at::fill_(
            *tensor_handle_to_tensor_pointer(self), value
        );
    });
}
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/158974
Approved by: https://github.com/albanD, https://github.com/janeyx99
2025-07-25 21:59:14 +00:00
FFFrog
b635359e4c [OpenReg] add pyproject.toml for openreg (#158440)
As the title stated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/158440
Approved by: https://github.com/albanD
ghstack dependencies: #158415
2025-07-25 02:39:41 +00:00
FFFrog
f1a1aa9490 [OpenReg] Improve README.md and optimize some codes for OpenReg (#158415)
----

- add description for DSO dependencies
- remove unnecessary code
Pull Request resolved: https://github.com/pytorch/pytorch/pull/158415
Approved by: https://github.com/albanD
2025-07-25 02:39:41 +00:00
Mikayla Gawarecki
fef236da69 Add zero_() and empty_like(t) to torch/csrc/stable/ops.h (#158866)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/158866
Approved by: https://github.com/janeyx99
2025-07-23 18:31:05 +00:00
Jane Xu
e882c761dd Add STD_TORCH_CHECK to headeronly (#158377)
Differential Revision: [D78366519](https://our.internmc.facebook.com/intern/diff/D78366519/)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/158377
Approved by: https://github.com/albanD
2025-07-18 14:35:20 +00:00