pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Tom Ritchford	d8c8ba2440	Fix unused Python variables in test/[e-z]* (#136964 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/136964 Approved by: https://github.com/justinchuby, https://github.com/albanD	2024-12-18 23:02:30 +00:00
Yukio Siraichi	565a53d326	Use DLPack for creating tensors out of custom classes, when available. (#138697 ) Fixes #120614 Takes over #120615 In summary, this PR: - Adds a `__dlpack__` attribute check in the tensor creation path (i.e. [`internal_new_from_data` @ tensor_new.cpp](`cdfe1bffd1/torch/csrc/utils/tensor_new.cpp (L266)`)) - Creates the tensor by using the DLPack machinery, instead of an element-by-element copy - No changes since #120615 - Adds a test, making sure the DLPack machinery is used - Wraps a tensor in a fresh `TensorDLPackWrapper` class that implements only the DLPack methods - Creates a new tensor from an instance of `TensorDLPackWrapper` Pull Request resolved: https://github.com/pytorch/pytorch/pull/138697 Approved by: https://github.com/ezyang Co-authored-by: Wenzel Jakob <wenzel.jakob@epfl.ch>	2024-10-26 01:27:05 +00:00
Yuanhao Ji	c165a8e71d	Enable UFMT on `test_decomp.py`, `test_expanded_weights.py` and some files (#125117 ) Part of: #123062 Ran lintrunner on: - test/test_decomp.py - test/test_deploy.py - test/test_determination.py - test/test_dlpack.py - test/test_dynamic_shapes.py - test/test_expanded_weights.py Detail: ```bash $ lintrunner -a --take UFMT --all-files ok No lint issues. Successfully applied all patches. ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/125117 Approved by: https://github.com/jansel	2024-05-07 02:36:40 +00:00
Edward Z. Yang	5b24877663	Improve uint{16,32,64} dlpack/numpy compatibility (#116808 ) Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/116808 Approved by: https://github.com/malfet, https://github.com/albanD	2024-01-11 17:01:54 +00:00
Iman Tabrizian	a81290ccb9	Add DLPack bool support (#108486 ) Fixes #94463 Fixes https://github.com/pytorch/pytorch/issues/67081 - [X] Update DLPack header file - [X] Add testing for DLPack boolean - [X] Add boolean support to PyTorch's DLPack support Pull Request resolved: https://github.com/pytorch/pytorch/pull/108486 Approved by: https://github.com/ezyang	2023-09-08 17:55:33 +00:00
Justin Chu	73e1455327	[BE] Enable ruff's UP rules and autoformat test/ (#105434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105434 Approved by: https://github.com/albanD	2023-07-19 20:36:06 +00:00
Natalia Gimelshein	ecd79b1fef	add additional stream priority for cuda streams (#101956 ) Changes the StreamID encoding to use the last bit to distinguish between external and internal streams, 4 bits for IdType (DEFAULT, EXT or user-created streams possibly with high priority), and 5 bits for index. This allows us to have more stream priorities exposed to user (I'm currently setting 4, but that's easy to change now). Note, we are pre-creating all 32 streams in the pool per each allowed priority, I don't know if it's a problem in practice. Currently cuda 11.8/A100 GPUs allow 6 different stream priorities, the number may be different for the different cards/different cuda versions. Previous callsites explicitly requesting high prioity stream (`isHighPriority=true`) are now getting the highest priority stream. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101956 Approved by: https://github.com/ezyang	2023-05-27 02:36:16 +00:00
eqy	54f38381a0	[CUDA][DLPack] Try ~~bumping sleep interval~~ running on explicit side-stream for Windows `dlpack` test (#102283 ) (attempted fix for Windows failure in #101318) CC @huydhn If this doesn't work, will try adding an explicit side stream in case that is causing the issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102283 Approved by: https://github.com/huydhn	2023-05-26 00:57:55 +00:00
Natalia Gimelshein	5da497cabb	add additional stream priority for cuda streams (#101956 ) Changes the StreamID encoding to use the last bit to distinguish between external and internal streams, 4 bits for IdType (DEFAULT, EXT or user-created streams possibly with high priority), and 5 bits for index. This allows us to have more stream priorities exposed to user (I'm currently setting 4, but that's easy to change now). Note, we are pre-creating all 32 streams in the pool per each allowed priority, I don't know if it's a problem in practice. Currently cuda 11.8/A100 GPUs allow 6 different stream priorities, the number may be different for the different cards/different cuda versions. Previous callsites explicitly requesting high prioity stream (`isHighPriority=true`) are now getting the highest priority stream. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101956 Approved by: https://github.com/ezyang	2023-05-24 23:26:47 +00:00
eqy	66f6e0e605	[CUDA][DLPack] Handle legacy default streams for DLPack conversion (#101318 ) It seems that some legacy default stream logic (e.g., present in `a8ff647e42/torch/utils/dlpack.py (L114)` ) is not handled on the potential receiving end in `torch/_tensor.py`. Open to suggestions on how to make the test case less clunky, as this was the combination we arrived at after discovering flakiness in alternate versions. Thanks to Olga Andreeva for surfacing this issue and providing a repro. CC @Aidyn-A @ngimel Pull Request resolved: https://github.com/pytorch/pytorch/pull/101318 Approved by: https://github.com/ngimel	2023-05-24 16:14:50 +00:00
puririshi98	8aa34602f7	Jetson Update for CI Redo (#94549 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94549 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-02-21 17:13:38 +00:00
mattip	4dfa6d28a1	Normalize DLPack stride to 1 where shape < 2 (#83158 ) Fixes #83069. Also move all the dlpack tests to a new file., `test_dlpack.py`. The fix involves always allocating a "strides" int array when converting to dlPack and deleting the strides when the capsule descructor is called. Then the strides are copied from the tensor, and `strides[i]` is set to `1` where `shape[i] < 2`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83158 Approved by: https://github.com/ezyang	2022-08-23 15:03:29 +00:00

12 Commits