pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
FFFrog	1b389025ba	Refactor and Improve the OpenReg Module (#158090 ) ---- # Refactor and Improve the OpenReg Module ## Background Since PrivateUse1 has become the main path for integrating new devices with PyTorch, there have been some feature requests related to PrivateUse1 regarding interfaces, documentation, reference examples, etc., such as the following: - https://github.com/pytorch/pytorch/issues/155864 - https://github.com/pytorch/pytorch/issues/144955 - https://github.com/pytorch/pytorch/issues/144845 Taking these requests into consideration and combining them with the position of OpenReg, which is currently used as the test backend for PrivateUse1, I'm planning to make the following optimizations: - Optimize the implementation of OpenReg to make it align with the standard specifications for real backend (C++) access, serving as a reference for new device integration code. - Add comprehensive documentation to the [developer notes](https://docs.pytorch.org/docs/main/notes.html) to guide new accelerator integration, functioning as a reference manual. ## Design Principles: - Minimization Principle: Keep the code small and clear; only implement the minimum set of code required for verification and as an integration reference. - Authenticity Principle: Integrate OpenReg in the same way that real accelerators access PyTorch. ## More Infos: Pleaes refer to [this](`6b8020f1ab/test/cpp_extensions/open_registration_extension/torch_openreg/README.md`) for more information about `OpenReg`. ## Current Progress: - Refer to the implementation of [torch_xla](https://github.com/pytorch/xla) to refactor all of OpenReg's code, making it easier to understand. - Ensure all tests in [test/test_openreg.py](https://github.com/FFFrog/pytorch/blob/openreg/test/test_openreg.py) pass after refactoring. ## Next Steps: - Add more features to cover all integration points. - Gradually add user guides and documentation to the [developer notes](https://docs.pytorch.org/docs/main/notes.html). Pull Request resolved: https://github.com/pytorch/pytorch/pull/158090 Approved by: https://github.com/seemethere, https://github.com/albanD	2025-07-15 08:10:05 +00:00
Xuehai Pan	4dce5b71a0	[build] modernize build-frontend: `python setup.py develop/install` -> `[uv ]pip install --no-build-isolation [-e ].` (#156027 ) Modernize the development installation: ```bash # python setup.py develop python -m pip install --no-build-isolation -e . # python setup.py install python -m pip install --no-build-isolation . ``` Now, the `python setup.py develop` is a wrapper around `python -m pip install -e .` since `setuptools>=80.0`: - pypa/setuptools#4955 `python setup.py install` is deprecated and will emit a warning during run. The warning will become an error on October 31, 2025. - `9c4d383631/setuptools/command/install.py (L58-L67)` > ```python > SetuptoolsDeprecationWarning.emit( > "setup.py install is deprecated.", > """ > Please avoid running ``setup.py`` directly. > Instead, use pypa/build, pypa/installer or other > standards-based tools. > """, > see_url="https://blog.ganssle.io/articles/2021/10/setup-py-deprecated.html", > due_date=(2025, 10, 31), > ) > ``` - pypa/setuptools#3849 Additional Resource: - [Why you shouldn't invoke setup.py directly](https://blog.ganssle.io/articles/2021/10/setup-py-deprecated.html) Pull Request resolved: https://github.com/pytorch/pytorch/pull/156027 Approved by: https://github.com/ezyang	2025-07-09 11:24:27 +00:00
FFFrog	a730c65fe3	[OpenReg][1/N] Migrate cpp_extensions_open_device_registration to OpenReg (#156588 ) ---- - fake tensor - named tensor - custom autograd function Pull Request resolved: https://github.com/pytorch/pytorch/pull/156588 Approved by: https://github.com/albanD	2025-06-26 03:59:50 +00:00
Xuehai Pan	6d5c789ad5	[BE][PYFMT] migrate PYFMT for `test/[a-h]*/` to `ruff format` (#144555 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144555 Approved by: https://github.com/ezyang ghstack dependencies: #144551, #144554	2025-06-24 04:53:54 +00:00
FFFrog	1d522325b4	[OpenReg][1/N] Migrate cpp_extensions_open_device_registration to OpenReg (#156400 ) As the title stated. Changes: - add resize_ for OpenReg - migrate related tests into test_openreg.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/156400 Approved by: https://github.com/albanD	2025-06-22 18:40:38 +00:00
Jane Xu	55dae0bf7a	Add a basic shim and stable::Tensor is_contiguous API (#156228 ) Add a limited is_contiguous in shim, stable::Tensor API with a test case Pull Request resolved: https://github.com/pytorch/pytorch/pull/156228 Approved by: https://github.com/desertfire	2025-06-20 17:59:52 +00:00
Jane Xu	9a5c59368d	Replace all RAIIATH with Tensor in libtorch_agnostic test, test some APIs (#155977 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/155977 Approved by: https://github.com/albanD ghstack dependencies: #155367	2025-06-17 17:36:31 +00:00
Jane Xu	b115a4c03a	torch::stable::Tensor beginnings, mainly mem mgmt (#155367 ) ``` // The torch::stable::Tensor class is a highlevel C++ header-only wrapper around // the C shim Tensor APIs. We've modeled this class after TensorBase, as custom // op kernels only really need to interact with Tensor metadata (think sizes, // strides, device, dtype). Other functions on Tensor (like empty_like) should // live like the ATen op that they are and exist outside of this struct. // // There are several goals of this class over AtenTensorHandle and // RAIIAtenTensorHandle: // 1. torch::stable::Tensor is a nicer UX much closer to torch::Tensor than the // C APIs with AtenTensorHandle. Under the hood we still call to these C shim // APIs to preserve stability. // 2. RAIIAtenTensorHandle boils down to a uniq_ptr that forces the user to pass // around ownership. This makes it difficult to pass one input into 2 // different functions, e.g., doing something like c = a(t) + b(t) for // stable::Tensor t. Thus, we use a shared_ptr here. ``` This PR: - exemplifies the above - adds test cases in libtorch_agnostic to make sure the file actually works - includes the results of a battle with template specialization Pull Request resolved: https://github.com/pytorch/pytorch/pull/155367 Approved by: https://github.com/albanD	2025-06-17 17:36:31 +00:00
FFFrog	187828dcb4	[OpenReg][5/N] add set_.source_Storage for openreg (#155191 ) Changes: - add set_.source_Storage for openreg to support torch.load & torch.serialization - uncomment some related tests in the test_openreg.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/155191 Approved by: https://github.com/albanD ghstack dependencies: #153947, #154018, #154019, #154106, #154181, #155101	2025-06-14 03:44:32 +00:00
FFFrog	e4fd0bf771	[OpenReg][4/N] Migrate cpp_extensions_open_device_registration to OpenReg (#155101 ) As the title stated. Involved testcases: - test_open_device_storage_pin_memory - test_open_device_serialization Pull Request resolved: https://github.com/pytorch/pytorch/pull/155101 Approved by: https://github.com/albanD ghstack dependencies: #153947, #154018, #154019, #154106, #154181	2025-06-14 03:44:32 +00:00
FFFrog	1e7989cad5	[OpenReg][3/N] Migrate cpp_extensions_open_device_registration to OpenReg (#154181 ) As the title stated. Involved testcases: - test_open_device_quantized - test_open_device_random - test_open_device_tensor - test_open_device_packed_sequence - test_open_device_storage Pull Request resolved: https://github.com/pytorch/pytorch/pull/154181 Approved by: https://github.com/albanD ghstack dependencies: #153947, #154018, #154019, #154106	2025-06-14 03:44:32 +00:00
FFFrog	7e5f29b2de	[OpenReg][2/N] Migrate cpp_extensions_open_device_registration to OpenReg (#154106 ) As the title stated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/154106 Approved by: https://github.com/nareshrajkumar866, https://github.com/albanD ghstack dependencies: #153947, #154018, #154019	2025-06-14 03:44:32 +00:00
FFFrog	676abded4b	[OpenReg][1/N] Migrate cpp_extensions_open_device_registration to OpenReg (#154019 ) As the title stated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/154019 Approved by: https://github.com/albanD ghstack dependencies: #153947, #154018	2025-06-14 03:44:32 +00:00
FFFrog	cafd2344d6	[OpenReg] add manual_seed related capabilities (#153947 ) Changes: - Add manual_seed manual_seed_all initial_seed and so on - Delay execution of self._lazy_init more deeply Pull Request resolved: https://github.com/pytorch/pytorch/pull/153947 Approved by: https://github.com/albanD	2025-06-14 03:44:31 +00:00
Nikita Shulga	ce9ba071fd	[BE] Fix warning in open_registration_extension.cpp (#155755 ) Namely ``` /Users/nshulga/git/pytorch/pytorch/test/cpp_extensions/open_registration_extension.cpp:306:33: warning: left operand of comma operator has no effect [-Wunused-value] 306 \| at::Tensor first = at::empty((2,3)).to(at::DeviceType::PrivateUse1); ``` Or switching between Python and C++ is hard In Python `(2, 3)` creates a tuple, in C/C++ it's just a integral literal 3 P.S. I could have vibe-coded the fix with Claude: https://claude.ai/share/82479e88-84cb-4299-aa2f-dafd28ee2d55 Pull Request resolved: https://github.com/pytorch/pytorch/pull/155755 Approved by: https://github.com/huydhn, https://github.com/atalman	2025-06-12 03:01:30 +00:00
PyTorch MergeBot	8347268edc	Revert "Make open device registration tests standalone (#153855 )" This reverts commit `8823138e47`. Reverted https://github.com/pytorch/pytorch/pull/153855 on behalf of https://github.com/clee2000 due to causing some linux aarch64 tests to fail [GH job link](https://github.com/pytorch/pytorch/actions/runs/15566289293/job/43832373302) [HUD commit link](`8823138e47`), should be easy fix, rename in places where its mentioned, there might be more than just aarch64 though ([comment](https://github.com/pytorch/pytorch/pull/153855#issuecomment-2960191503))	2025-06-10 18:11:24 +00:00
Joel Schlosser	8823138e47	Make open device registration tests standalone (#153855 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/153855 Approved by: https://github.com/janeyx99	2025-06-10 17:33:26 +00:00
Jane Xu	41a9aa6564	Remove janky (though at times useful) dlclose test (#153975 ) This test was never the shining star in class but it helped check that we can properly delete a stable library. But now that we are running it in CI this is not a good test to annoy people with as dlclose + parallelism is likely not the move. I will miss it locally though. Pull Request resolved: https://github.com/pytorch/pytorch/pull/153975 Approved by: https://github.com/jbschlosser	2025-05-20 23:26:42 +00:00
Joel Schlosser	7587350458	Make python_agnostic cpp extension tests standalone (#153274 ) Related: #148920 This PR: * Introduces a new file `test/cpp_extensions/python_agnostic_extension/test/test_python_agnostic.py` with testing that follows the usual python testing patterns * This replaces the testing for python_agnostic in `test/test_cpp_extensions_aot.py` After this PR, it is now possible to run: ``` python test/cpp_extensions/python_agnostic_extension/test/test_python_agnostic.py ``` and the test will build the prerequisite wheel before running the tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/153274 Approved by: https://github.com/janeyx99, https://github.com/cyyever ghstack dependencies: #153264	2025-05-20 19:18:09 +00:00
Joel Schlosser	3ecd444004	Support independent builds for cpp extension tests + apply to libtorch_agnostic tests (#153264 ) Related: #148920 This PR: * Provides a helper `install_cpp_extension(extension_root)` for building C++ extensions. This is intended to be used in `TestMyCppExtension.setUpClass()` * Updates libtorch_agnostic tests to use this * Deletes preexisting libtorch_agnostic tests from `test/test_cpp_extensions_aot.py` * Fixes `run_test.py` to actually run tests in `test/cpp_extensions/libtorch_agnostic_extension/test/test_libtorch_agnostic.py` to avoid losing coverage. This wasn't being run due to logic excluding tests that start with "cpp"; this is fixed now After this PR, it is now possible to run: ``` python test/cpp_extensions/libtorch_agnostic_extension/test/test_libtorch_agnostic.py ``` and the test will build the `libtorch_agnostic` extension before running the tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/153264 Approved by: https://github.com/janeyx99	2025-05-20 19:18:09 +00:00
FFFrog	29c8ae825f	[OpenReg] Move SDPA to OpenReg from open_registration_extension.cpp (#153309 ) As the title stated. Next Chages: - Migrate remaining functionality to OpenReg Pull Request resolved: https://github.com/pytorch/pytorch/pull/153309 Approved by: https://github.com/albanD	2025-05-13 03:49:19 +00:00
FFFrog	fd8fd01d25	[OpenReg] Add _lazy_init and rng_state support for OpenReg (#151914 ) As the title stated. Changes: - Add get_rng_state & set_rng_state support for OpenReg - Add _lazy_init support for OpenReg - Remove redundant code for cuda/Module.cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/151914 Approved by: https://github.com/albanD	2025-05-04 09:42:08 +00:00
PyTorch MergeBot	3962b8f1e0	Revert "[OpenReg] Add _lazy_init and rng_state support for OpenReg (#151914 )" This reverts commit `64a55b531f`. Reverted https://github.com/pytorch/pytorch/pull/151914 on behalf of https://github.com/malfet due to Looks like breaks number of ROCM jobs, see `797768cd90/1` ([comment](https://github.com/pytorch/pytorch/pull/151914#issuecomment-2839691038))	2025-04-29 17:36:12 +00:00
FFFrog	64a55b531f	[OpenReg] Add _lazy_init and rng_state support for OpenReg (#151914 ) As the title stated. Changes: - Add get_rng_state & set_rng_state support for OpenReg - Add _lazy_init support for OpenReg - Remove redundant code for cuda/Module.cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/151914 Approved by: https://github.com/albanD	2025-04-29 11:18:12 +00:00
FFFrog	1cc5a8452b	[Openreg][PrivateUse1] Fix releasing tensor issue when using pin_memory (#151091 ) As the title stated. Related PR: https://github.com/pytorch/pytorch/pull/147066 Co-authored-by: Zhenbin Lin <lin-zhenbin@qq.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/151091 Approved by: https://github.com/albanD ghstack dependencies: #151007	2025-04-18 02:40:07 +00:00
FFFrog	3528488061	[Openreg][PrivateUse1] Enable CI for openreg (#151007 ) Changes: - move test_openreg.py from test/cpp_extensions/open_registration_extension/ to test/ - update README.md for openreg - enable CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/151007 Approved by: https://github.com/albanD	2025-04-18 02:40:07 +00:00
PyTorch MergeBot	f252f9df5e	Revert "[Openreg][PrivateUse1] Enable CI for openreg (#151007 )" This reverts commit `abbca37fe8`. Reverted https://github.com/pytorch/pytorch/pull/151007 on behalf of https://github.com/clee2000 due to At least test_record_event needs to also be skipped on dynamo too, its failing and then somehow causing a hang? https://github.com/pytorch/pytorch/actions/runs/14487625709/job/40637535027#step:25:73 ([comment](https://github.com/pytorch/pytorch/pull/151007#issuecomment-2810789483))	2025-04-16 21:05:17 +00:00
PyTorch MergeBot	e0535e823f	Revert "[Openreg][PrivateUse1] Fix releasing tensor issue when using pin_memory (#151091 )" This reverts commit `e229ce34c4`. Reverted https://github.com/pytorch/pytorch/pull/151091 on behalf of https://github.com/clee2000 due to At least test_record_event needs to also be skipped on dynamo too, its failing and then somehow causing a hang? https://github.com/pytorch/pytorch/actions/runs/14487625709/job/40637535027#step:25:73 ([comment](https://github.com/pytorch/pytorch/pull/151007#issuecomment-2810789483))	2025-04-16 21:05:17 +00:00
FFFrog	e229ce34c4	[Openreg][PrivateUse1] Fix releasing tensor issue when using pin_memory (#151091 ) As the title stated. Related PR: https://github.com/pytorch/pytorch/pull/147066 Co-authored-by: Zhenbin Lin <lin-zhenbin@qq.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/151091 Approved by: https://github.com/albanD ghstack dependencies: #151005, #151007	2025-04-16 13:12:17 +00:00
FFFrog	abbca37fe8	[Openreg][PrivateUse1] Enable CI for openreg (#151007 ) Changes: - move test_openreg.py from test/cpp_extensions/open_registration_extension/ to test/ - update README.md for openreg - enable CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/151007 Approved by: https://github.com/albanD ghstack dependencies: #151005	2025-04-16 07:55:51 +00:00
FFFrog	a9dbbe1aee	[OpenReg][PrivateUse1] Refactoring the csrc files of pytorch_openreg (#151005 ) As the title stated. Changes: - Remove unnecessary header file - Remove unnecessary registry logic about PrivateUse1HooksRegistry，such as TORCH_DECLARE_REGISTRY, C10_DEFINE_REGISTRY, etc,. - using static + global variable to do initialization instead of call_one Next Step: Enable test_openreg.py in CI/CD to guard the quality of PrivateUse1 Pull Request resolved: https://github.com/pytorch/pytorch/pull/151005 Approved by: https://github.com/albanD	2025-04-16 07:55:50 +00:00
Aleksei Nikiforov	067a7b1d4a	Disable -Werror for s390x test module compilation (#150413 ) This change should make nightly testsuite green again for s390x. Pull Request resolved: https://github.com/pytorch/pytorch/pull/150413 Approved by: https://github.com/seemethere	2025-04-16 02:15:17 +00:00
FFFrog	2653498ff3	[Openreg][PrivateUse1] Refactor csrc files of Pytorch_openreg (#151004 ) I want to format and refactor the csrc file of pytorch_openreg. To make the code review clearer and easier to understand, I divide the code refactoring into two parts: - Part 1: Code formatting - Part 2: Code refactoring and optimization (Next PR) Pull Request resolved: https://github.com/pytorch/pytorch/pull/151004 Approved by: https://github.com/albanD ghstack dependencies: #151000	2025-04-12 17:22:28 +00:00
FFFrog	c181403063	[Openreg][PrivateUse1] Improve openreg module capabilities (#151000 ) ---- - Add more functionalities for openreg in openreg module - Remove related functionalities from test_cpp_extensions_open_device_registration.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/151000 Approved by: https://github.com/albanD	2025-04-12 17:21:35 +00:00
FFFrog	0c59a031c8	[OpenReg][PrivateUse1] add device context for OpenReg Module (#150997 ) Add device context support for OpenReg Module, which is depended by some tests such as ``torch.serialization.default_restore_location`` Pull Request resolved: https://github.com/pytorch/pytorch/pull/150997 Approved by: https://github.com/albanD	2025-04-12 06:32:30 +00:00
Youseok Yang	b99e0c5412	Fix mtia_extension.cpp setDevice() to correctly set current_device (#149398 ) We referred to this code and found that there was a minor bug. Fix for future reference for others. Pull Request resolved: https://github.com/pytorch/pytorch/pull/149398 Approved by: https://github.com/janeyx99	2025-03-31 06:07:22 +00:00
Wei-Sheng Chin	bca75fe97a	[MAIA] [Autocast] Enable autocast on MAIA device (#148511 ) Fixes #148510. Pull Request resolved: https://github.com/pytorch/pytorch/pull/148511 Approved by: https://github.com/albanD	2025-03-18 03:46:22 +00:00
Jane Xu	cccdf860e2	[BE] Add STABLE_LIBRARY test for multiple returns (#149230 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/149230 Approved by: https://github.com/albanD, https://github.com/zou3519 ghstack dependencies: #149052	2025-03-18 02:40:54 +00:00
Jane Xu	988827cdfb	Use schema as source of truth + support ones_like/empty_like (#149052 ) This change does 2 important things: (a) Instead of relying on IValue type as source of truth, we use the schema as the source of truth, which is important as IValue types are overloaded and can ambiguously convert incorrectly. For example, a MemoryFormat will look like an int + get converted to an int64_t vs a MemoryFormat! (b) This PR expands support for many more types to encompass way more schemas, e.g., Optional, Device, dtype, etc. The main win from this PR is the ability for aoti_torch_call_dispatcher to call TensorFactory ops like ones_like/empty_like! Pull Request resolved: https://github.com/pytorch/pytorch/pull/149052 Approved by: https://github.com/albanD	2025-03-18 02:40:54 +00:00
Jane Xu	e6ef0620cc	Add shim.h C API to call dispatcher on our own aten ops (#148832 ) This PR still needs testing through some cpp extension Pull Request resolved: https://github.com/pytorch/pytorch/pull/148832 Approved by: https://github.com/albanD, https://github.com/atalman ghstack dependencies: #148124	2025-03-11 21:02:04 +00:00
Jane Xu	971606befa	Add a stable TORCH_LIBRARY to C shim (#148124 ) This PR adds two main parts: - shim.h stable C APIs into torch::Library APIs - a higher level API in torch/csrc/stable/library.h that calls into this shim.h + otherwise is self contained Goal: custom kernel writers should be able to call the apis in the directories above in order to register their library in a way that allows their custom extension to run with a different libtorch version than it was built with. Subplots resolved: - Do we want a whole separate StableLibrary or do we want to freeze torch::Library and add `m.stable_impl(cstring, void (fn)(void , int64_t, int64_t)` into it - Yes, we want a separate StableLibrary. We cannot freeze Library and it is NOT header only. - Should I use unint64_t as the common denominator instead of void to support 32bit architectures better? - Yes, and done - Should I add a stable `def` and `fragment` when those can be done in python? - I think we do want these --- and now they're done - Where should library_stable_impl.cpp live? -- no longer relevant - I need some solid test cases to make sure everything's going ok. I've intentionally thrown in a bunch of random dtypes into the signature, but I still haven't tested returning multiple things, returning nothing, complex dtypes, etc. - Have since tested all the torch library endpoints. the others can be tested in a followup to separate components that need to be in shim.h vs can be added later Pull Request resolved: https://github.com/pytorch/pytorch/pull/148124 Approved by: https://github.com/albanD, https://github.com/zou3519, https://github.com/atalman	2025-03-11 19:12:46 +00:00
PyTorch MergeBot	275a7c5dbb	Revert "Add a stable TORCH_LIBRARY to C shim (#148124 )" This reverts commit `327e07ac1d`. Reverted https://github.com/pytorch/pytorch/pull/148124 on behalf of https://github.com/malfet due to Sorry for reverting your PR, but somehow it caused test failures in newly introduced tests, see https://hud.pytorch.org/hud/pytorch/pytorch/main/1?per_page=50&name_filter=pull%20%2F%20linux-focal-cuda12.6-py3.10-gcc11-sm89%20%2F%20test%20(default%2C%201&mergeLF=true ([comment](https://github.com/pytorch/pytorch/pull/148124#issuecomment-2709057833))	2025-03-09 20:44:56 +00:00
Jane Xu	327e07ac1d	Add a stable TORCH_LIBRARY to C shim (#148124 ) This PR adds two main parts: - shim.h stable C APIs into torch::Library APIs - a higher level API in torch/csrc/stable/library.h that calls into this shim.h + otherwise is self contained Goal: custom kernel writers should be able to call the apis in the directories above in order to register their library in a way that allows their custom extension to run with a different libtorch version than it was built with. Subplots resolved: - Do we want a whole separate StableLibrary or do we want to freeze torch::Library and add `m.stable_impl(cstring, void (fn)(void , int64_t, int64_t)` into it - Yes, we want a separate StableLibrary. We cannot freeze Library and it is NOT header only. - Should I use unint64_t as the common denominator instead of void to support 32bit architectures better? - Yes, and done - Should I add a stable `def` and `fragment` when those can be done in python? - I think we do want these --- and now they're done - Where should library_stable_impl.cpp live? -- no longer relevant - I need some solid test cases to make sure everything's going ok. I've intentionally thrown in a bunch of random dtypes into the signature, but I still haven't tested returning multiple things, returning nothing, complex dtypes, etc. - Have since tested all the torch library endpoints. the others can be tested in a followup to separate components that need to be in shim.h vs can be added later Pull Request resolved: https://github.com/pytorch/pytorch/pull/148124 Approved by: https://github.com/albanD, https://github.com/zou3519	2025-03-09 10:07:25 +00:00
Dmitry Rogozhkin	d27ecf85db	xpu: support sycl with torch.utils.cpp_extension APIs (#132945 ) This patch adds support for sycl kernels build via `torch.utils.cpp_extension.load`, `torch.utils.cpp_extension.load_inline` and (new) `class SyclExtension` APIs. Files having `.sycl` extension are considered to have sycl kernels and are compiled with `icpx` (dpc++ sycl compiler from Intel). Files with other extensions, `.cpp`, `.cu`, are handled as before. API supports building sycl along with other file types into single extension. Note that `.sycl` file extension is a PyTorch convention for files containing sycl code which I propose to adopt. We did follow up with compiler team to introduce such file extension in the compiler, but they are opposed to this. At the same time discussion around sycl file extension and adding sycl language support into such tools as cmake is ongoing. Eventually cmake also considers to introduce some file extension convention for sycl. I hope we can further influence cmake and compiler communities to broader adopt `.sycl` file extension. By default SYCL kernels are compiled for all Intel GPU devices for which pytorch native aten SYCL kernels are compiled. At the moment `pvc,xe-lpg`. This behavior can be overridden by setting `TORCH_XPU_ARCH_LIST` environment variables to the comma separated list of desired devices to compile for. Fixes: #132944 CC: @gujinghui @EikanWang @fengyuan14 @guangyey @jgong5 Pull Request resolved: https://github.com/pytorch/pytorch/pull/132945 Approved by: https://github.com/albanD, https://github.com/guangyey, https://github.com/malfet Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>	2025-02-16 16:50:59 +00:00
PyTorch MergeBot	dd5d0ea6bb	Revert "xpu: support sycl with torch.utils.cpp_extension APIs (#132945 )" This reverts commit `607379960b`. Reverted https://github.com/pytorch/pytorch/pull/132945 on behalf of https://github.com/malfet due to It just broke all the tests, see `b16ae97ad0/1` ([comment](https://github.com/pytorch/pytorch/pull/132945#issuecomment-2661498747))	2025-02-16 16:03:42 +00:00
Dmitry Rogozhkin	607379960b	xpu: support sycl with torch.utils.cpp_extension APIs (#132945 ) This patch adds support for sycl kernels build via `torch.utils.cpp_extension.load`, `torch.utils.cpp_extension.load_inline` and (new) `class SyclExtension` APIs. Files having `.sycl` extension are considered to have sycl kernels and are compiled with `icpx` (dpc++ sycl compiler from Intel). Files with other extensions, `.cpp`, `.cu`, are handled as before. API supports building sycl along with other file types into single extension. Note that `.sycl` file extension is a PyTorch convention for files containing sycl code which I propose to adopt. We did follow up with compiler team to introduce such file extension in the compiler, but they are opposed to this. At the same time discussion around sycl file extension and adding sycl language support into such tools as cmake is ongoing. Eventually cmake also considers to introduce some file extension convention for sycl. I hope we can further influence cmake and compiler communities to broader adopt `.sycl` file extension. By default SYCL kernels are compiled for all Intel GPU devices for which pytorch native aten SYCL kernels are compiled. At the moment `pvc,xe-lpg`. This behavior can be overridden by setting `TORCH_XPU_ARCH_LIST` environment variables to the comma separated list of desired devices to compile for. Fixes: #132944 CC: @gujinghui @EikanWang @fengyuan14 @guangyey @jgong5 Pull Request resolved: https://github.com/pytorch/pytorch/pull/132945 Approved by: https://github.com/albanD, https://github.com/guangyey	2025-02-16 10:16:09 +00:00
Jane Xu	515e55e692	Set -DPy_LIMITED_API flag for py_limited_api=True extensions (#145764 ) This could be BC breaking, because there was a period of time when we use py_limited_api=True but don't enforce the flag, and now that we will start enforcing the flag, people's custom extensions may fail to build. This is strictly still better behavior, as it is sketchy to claim CPython agnosticism without the flag, but calling this out as potential people yelling at us. Ways to mitigate this risk + reasons this may not be too big a deal: - People haven't known about py_limited_api for extensions much due to lack of docs from python so usage is low right now - My current tutorial is in store to make new users of py_limited_api pass this flag, so it'd be a noop for them. Test plan: * Locally i'm confident as I tried rebuilding ao with this change and it reliably failed (cuz importing torch/extension.h is a nono) * Unit test wise, the normal python_agnostic one I added should work Pull Request resolved: https://github.com/pytorch/pytorch/pull/145764 Approved by: https://github.com/ezyang, https://github.com/zou3519, https://github.com/albanD	2025-01-28 20:11:05 +00:00
Zhenbin Lin	a08f7f3266	OpenReg: fix issue of pin_memory (#145046 ) Fix issue of `pin_memory` when rewrapping a storage. Pull Request resolved: https://github.com/pytorch/pytorch/pull/145046 Approved by: https://github.com/albanD	2025-01-28 09:41:04 +00:00
Zhenbin Lin	392dc177a9	OpenReg: Refactor impl_registry (#145465 ) Refactor impl_registry to use `driver.exec` as fallback. Pull Request resolved: https://github.com/pytorch/pytorch/pull/145465 Approved by: https://github.com/albanD	2025-01-25 03:31:49 +00:00
Zhenbin Lin	47e65077b1	OpenReg: Remove REGISTER_GENERATOR_PRIVATEUSE1 (#144841 ) Replace REGISTER_GENERATOR_PRIVATEUSE1 with new API in AcceleratorHooksInterface. Pull Request resolved: https://github.com/pytorch/pytorch/pull/144841 Approved by: https://github.com/albanD	2025-01-24 01:52:10 +00:00

1 2 3 4 5 ...

321 Commits