pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 00:20:18 +01:00

Author	SHA1	Message	Date
Wang, Chuanqi	b09fb481e0	[CD] Upgrade GCC version to 13 for XPU build (#162474 ) Follow #152426 Pull Request resolved: https://github.com/pytorch/pytorch/pull/162474 Approved by: https://github.com/zxiiro, https://github.com/atalman	2025-10-31 21:15:37 +00:00
Fadi Arafeh	d2be06f673	[cpu][fix] Update ACL version to fix crashes with tensor sizes > 2^31-1 (#165904 ) ---- - Updates Arm Compute Library (ACL) to v52.6.0 - v52.6.0 contains https://github.com/ARM-software/ComputeLibrary/pull/1201 which fixes crashes with tensors of sizes > 2^31-1 fixes: #165654 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165904 Approved by: https://github.com/malfet	2025-10-31 19:37:26 +00:00
Jeff Daily	239e7b541a	[ROCm][CI] upgrade nightly wheels to ROCm 7.1 (#166730 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/166730 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-31 17:30:47 +00:00
Jeff Daily	24e94e021a	[ROCm][CI] create ROCm 7.1 magma tarball (#166693 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/166693 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-31 15:20:00 +00:00
Wang, Chuanqi	0d3a4f7155	[CD] Enable Inductor performance test for xpu (#166289 ) Add Dynamo benchmark performance tests for XPU backend Pull Request resolved: https://github.com/pytorch/pytorch/pull/166289 Approved by: https://github.com/EikanWang, https://github.com/atalman	2025-10-31 10:52:07 +00:00
dependabot[bot]	cf7756da38	Bump uv from 0.9.5 to 0.9.6 in /.ci/lumen_cli (#166578 ) Bumps [uv](https://github.com/astral-sh/uv) from 0.9.5 to 0.9.6. - [Release notes](https://github.com/astral-sh/uv/releases) - [Changelog](https://github.com/astral-sh/uv/blob/main/CHANGELOG.md) - [Commits](https://github.com/astral-sh/uv/compare/0.9.5...0.9.6) --- updated-dependencies: - dependency-name: uv dependency-version: 0.9.6 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-29 18:28:14 -07:00
Jeff Daily	12c12466b0	[ROCm][CI] remove amdgpu from install_rocm.sh (#166575 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/166575 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-30 01:08:33 +00:00
Andrey Talman	82ff07c788	Add py 3.14 CI docker build pytorch-linux-jammy-py3.14-clang12 (#164791 ) Related to https://github.com/pytorch/pytorch/issues/156856 Pull Request resolved: https://github.com/pytorch/pytorch/pull/164791 Approved by: https://github.com/huydhn, https://github.com/malfet, https://github.com/albanD	2025-10-29 22:21:22 +00:00
Aaron Gokaslan	96b61844a7	[BE]: Update nvshmem to 3.4.5 (#164046 ) Release notes can be found here: https://docs.nvidia.com/nvshmem/release-notes-install-guide/release-notes/release-3405.html main difference is the addition of a CPU assisted IBGDA fallback which should allow NVSHMEM IBGDA to work on way more systems without admin intervention and without using GDRCopy. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164046 Approved by: https://github.com/ezyang, https://github.com/kwen2501	2025-10-29 07:32:05 +00:00
atalman	b903018c26	[CD] Windows builds migrate python 3.14rc1->3.14.0 (#166408 ) Python 3.14 was released, hence we can use official release version now Pull Request resolved: https://github.com/pytorch/pytorch/pull/166408 Approved by: https://github.com/Skylion007, https://github.com/malfet	2025-10-28 19:52:38 +00:00
Bin Bao	687c15c0b3	[AOTI][BE] Change test_aoti_inference to one-pass build (#164277 ) Summary: To fix https://github.com/pytorch/pytorch/issues/159400. Currently, test_aoti_abi_check and test_aoti_inference need to be built in two passes, first build pytorch using the regular `pythonsetup.py develop` and then build with `CMAKE_FRESH=1 BUILD_AOT_INDUCTOR_TEST=1 python setup.py devleop`. This is cumbersome. Fix by rewriting CMakeLists.txt for test_aoti_inference to one-pass build which runs AOTI to compile models at the test time. Also update CI test script to get rid of two-pass build. For test_aoti_abi_check, it is not AOTI specific, so we make it not guarded by BUILD_AOT_INDUCTOR_TEST. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164277 Approved by: https://github.com/janeyx99	2025-10-28 17:43:22 +00:00
PyTorch MergeBot	a4a0378e6b	Revert "[cuDNN] Smoke-test runtime cuDNN version matches compile time version in CI (#165922 )" This reverts commit `2a5f87decf`. Reverted https://github.com/pytorch/pytorch/pull/165922 on behalf of https://github.com/atalman due to cudnn update started to fail, see https://github.com/pytorch/pytorch/pull/165913#issuecomment-3457293475 ([comment](https://github.com/pytorch/pytorch/pull/165922#issuecomment-3457389406))	2025-10-28 16:29:29 +00:00
PyTorch MergeBot	0eacd934bc	Revert "Update cuDNN 9.10.2 in Manylinux 2.28 Docker files (#165913 )" This reverts commit `840d63c12d`. Reverted https://github.com/pytorch/pytorch/pull/165913 on behalf of https://github.com/clee2000 due to I think something here is causing CI tests to segfault at exit on cuda, ex [GH job link](https://github.com/pytorch/pytorch/actions/runs/18857880394/job/53811917713) [HUD commit link](`9a91486e45`) says no tests failed but it segfaulted afterwards. I can't tell if it's because of this change, or an unpinned dependency in docker that got triggered by this. Note to self, would have been bad TD except trunk didn't run either ([comment](https://github.com/pytorch/pytorch/pull/165913#issuecomment-3457293475))	2025-10-28 16:11:07 +00:00
Ting Lu	544b443ea1	[CD] Upgrade to CUDA 13.0.2 for nightly binaries (#165470 ) 13.0.U2 is posted, adding to nightlies Why we want to upgrade: CUDA 13.0.U2 included a new release from cuBLAS that 1. Enabled opt-in fixed-point emulation for FP64 matmuls (D/ZGEMM) which improves performance and power-efficiency. 2. Improved performance on NVIDIA [DGX Spark](https://www.nvidia.com/en-us/products/workstations/dgx-spark/) for FP16/BF16 and FP8 GEMMs. 3. adds BF16x9 FP32 emulation support for SYRK and HERK routines. Reference: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cublas-release-13-0-update-2 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165470 Approved by: https://github.com/atalman	2025-10-28 15:14:43 +00:00
PyTorch MergeBot	74336f8c77	Revert "[CD] Upgrade to CUDA 13.0.2 for nightly binaries (#165470 )" This reverts commit `5e769ff867`. Reverted https://github.com/pytorch/pytorch/pull/165470 on behalf of https://github.com/atalman due to Sorry reverting for now, to restore trunk health ([comment](https://github.com/pytorch/pytorch/pull/165470#issuecomment-3454166879))	2025-10-28 02:21:48 +00:00
Ting Lu	5e769ff867	[CD] Upgrade to CUDA 13.0.2 for nightly binaries (#165470 ) 13.0.U2 is posted, adding to nightlies Why we want to upgrade: CUDA 13.0.U2 included a new release from cuBLAS that 1. Enabled opt-in fixed-point emulation for FP64 matmuls (D/ZGEMM) which improves performance and power-efficiency. 2. Improved performance on NVIDIA [DGX Spark](https://www.nvidia.com/en-us/products/workstations/dgx-spark/) for FP16/BF16 and FP8 GEMMs. 3. adds BF16x9 FP32 emulation support for SYRK and HERK routines. Reference: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cublas-release-13-0-update-2 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165470 Approved by: https://github.com/atalman	2025-10-28 00:21:47 +00:00
Eddie Yan	2a5f87decf	[cuDNN] Smoke-test runtime cuDNN version matches compile time version in CI (#165922 ) Fix and regression test for https://github.com/pytorch/pytorch/issues/165801 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165922 Approved by: https://github.com/malfet, https://github.com/atalman, https://github.com/Skylion007, https://github.com/drisspg Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>	2025-10-27 22:10:45 +00:00
Andrey Talman	840d63c12d	Update cuDNN 9.10.2 in Manylinux 2.28 Docker files (#165913 ) Fixes https://github.com/pytorch/pytorch/issues/165801 Smoke test: https://github.com/pytorch/pytorch/pull/165922/files Pull Request resolved: https://github.com/pytorch/pytorch/pull/165913 Approved by: https://github.com/Camyll, https://github.com/Skylion007	2025-10-27 22:08:06 +00:00
Yuanyuan Chen	7cb467a169	[CI] Update ONNX CI packages to latest (#165883 ) This PR updates ONNX related packages to their latest versions used in CI environments. Pull Request resolved: https://github.com/pytorch/pytorch/pull/165883 Approved by: https://github.com/justinchuby, https://github.com/albanD	2025-10-22 00:25:35 +00:00
dependabot[bot]	dac9ed9790	Bump uv from 0.8.6 to 0.9.5 in /.ci/lumen_cli (#166017 ) Bumps [uv](https://github.com/astral-sh/uv) from 0.8.6 to 0.9.5. - [Release notes](https://github.com/astral-sh/uv/releases) - [Changelog](https://github.com/astral-sh/uv/blob/main/CHANGELOG.md) - [Commits](https://github.com/astral-sh/uv/compare/0.8.6...0.9.5) --- updated-dependencies: - dependency-name: uv dependency-version: 0.9.5 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-21 13:16:30 -07:00
Nikita Shulga	5d62b63a76	[BE] Use Python-3.14 GE build (#165804 ) 3.14 reached general availability on Oct 7th 2025, so we can remove all pre-release workarounds Pull Request resolved: https://github.com/pytorch/pytorch/pull/165804 Approved by: https://github.com/yangw-dev, https://github.com/Skylion007, https://github.com/cyyever	2025-10-19 11:45:10 +00:00
Yuanyuan Chen	3255e7872b	Enable all flake8-logging-format rules (#164655 ) These rules are enabled by removing existing suppressions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164655 Approved by: https://github.com/janeyx99, https://github.com/mlazos	2025-10-19 00:59:28 +00:00
Jeff Daily	da8517fa63	[ROCm][CI] upgrade wheels to 7.0.2 and 6.4.4 patch release (#165756 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/165756 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-17 17:41:19 +00:00
Justin Chu	fcbde24c1c	[ONNX] Remove common imports from torchlib (#165156 ) The Rank and IsScalar functions are no longer used in the torchlib. Requires onnxscript v0.5.4 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165156 Approved by: https://github.com/Skylion007, https://github.com/cyyever	2025-10-17 03:25:34 +00:00
Shangdi Yu	d82527b32a	[Windows] Add AOTI cross-compilation CI (#165573 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/165573 Approved by: https://github.com/malfet ghstack dependencies: #165560	2025-10-17 01:05:35 +00:00
Shangdi Yu	5d9b024276	Add mingw to docker (#165560 ) Add mingw to `pytorch-linux-jammy-cuda12.8-cudnn9-py3-gcc11` docker image to support AOTI cross-compilation This PR will make docker container rebuild, and upgrade python version from 3.13.7 to 3.13.8. and it relies on https://github.com/pytorch/pytorch/pull/165667 Pull Request resolved: https://github.com/pytorch/pytorch/pull/165560 Approved by: https://github.com/malfet	2025-10-17 00:47:01 +00:00
Nikita Shulga	4d833f859b	[BE] [CI] Fix aarch64 arch checks (#165676 ) Instead of relying on `TEST_CONFIG` environment variable to contain `aarch64`, which is prone to errors, use output of `$(uname -m)` that is equal to `aarch64` on Linux ARM systems Pull Request resolved: https://github.com/pytorch/pytorch/pull/165676 Approved by: https://github.com/huydhn, https://github.com/atalman ghstack dependencies: #165583, #165584	2025-10-16 22:19:53 +00:00
PyTorch MergeBot	69b05913fb	Revert "Add mingw to docker (#165560 )" This reverts commit `5e480b8ecf`. Reverted https://github.com/pytorch/pytorch/pull/165560 on behalf of https://github.com/pytorch-auto-revert due to Reverted automatically by pytorch's autorevert, to avoid this behaviour add the tag autorevert: disable ([comment](https://github.com/pytorch/pytorch/pull/165560#issuecomment-3409814274))	2025-10-16 08:42:11 +00:00
Nikita Shulga	23fb7e9f4b	[CI] Add arch prefix in front of op benchmark results (#165584 ) To be able to run x86 and aarch64 benchmarks later on Pull Request resolved: https://github.com/pytorch/pytorch/pull/165584 Approved by: https://github.com/huydhn ghstack dependencies: #165583	2025-10-16 01:50:52 +00:00
Shangdi Yu	5e480b8ecf	Add mingw to docker (#165560 ) Add mingw to `pytorch-linux-jammy-cuda12.8-cudnn9-py3-gcc11` docker image to support AOTI cross-compilation Pull Request resolved: https://github.com/pytorch/pytorch/pull/165560 Approved by: https://github.com/malfet ghstack dependencies: #165574	2025-10-16 01:31:50 +00:00
Huy Do	132ae8e6dd	Don't link with libnvToolsExt when building for 12.9 (#165465 ) This is to bring back this logic from https://github.com/pytorch/pytorch/pull/161916/files#diff-bf46b4a09ca67e50622bf84fefc0d11b584ffcc24ee6cc5019cf0fc7565d81a8L170. Building libtorch on 12.9 is failing otherwise https://github.com/pytorch/pytorch/actions/runs/18458531395/job/52610761895: ``` cp: cannot stat '/usr/local/cuda/lib64/libnvToolsExt.so.1': No such file or directory ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/165465 Approved by: https://github.com/atalman, https://github.com/malfet	2025-10-15 01:45:37 +00:00
PyTorch UpdateBot	a2601630cd	[vllm hash update] update the pinned vllm hash (#164628 ) This PR is auto-generated nightly by [this action](https://github.com/pytorch/pytorch/blob/main/.github/workflows/nightly.yml). Update the pinned vllm hash. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164628 Approved by: https://github.com/pytorchbot Co-authored-by: Huy Do <huydhn@gmail.com>	2025-10-12 18:26:07 +00:00
Huy Do	4400c5d31e	Continue to build nightly CUDA 12.9 for internal (#163029 ) Revert part of https://github.com/pytorch/pytorch/pull/161916 to continue building CUDA 12.9 nightly Pull Request resolved: https://github.com/pytorch/pytorch/pull/163029 Approved by: https://github.com/malfet	2025-10-11 08:26:47 +00:00
Aidyn-A	d41aa187ec	Add more B200 smoke test (#165133 ) A follow up to #159494. This PR adds additional `test_scaled_matmul_cuda` to smoke tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/165133 Approved by: https://github.com/drisspg	2025-10-10 16:46:26 +00:00
FFFrog	e0abcee3b5	[Code Clean] Remove support of python3.9 (#163846 ) As the title stated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/163846 Approved by: https://github.com/ezyang	2025-10-10 11:11:56 +00:00
Jithun Nair	c7b57d9349	Add gfx1100 to build target for ROCm docker builds (#165103 ) Fixes issue of gfx1100 test jobs timing out Pull Request resolved: https://github.com/pytorch/pytorch/pull/165103 Approved by: https://github.com/jeffdaily	2025-10-10 01:18:56 +00:00
Nikita Shulga	e7fd296930	[CI] Add full debug build to trunk (#164974 ) But not test, just import torch, as regression test for https://github.com/pytorch/pytorch/issues/164297 Test plan: Re-apply #164974 on top of this change and observer the failure in the workflows: https://github.com/pytorch/pytorch/actions/runs/18383302153/job/52375282838 Pull Request resolved: https://github.com/pytorch/pytorch/pull/164974 Approved by: https://github.com/seemethere, https://github.com/clee2000, https://github.com/atalman ghstack dependencies: #164968, #164969	2025-10-09 20:12:16 +00:00
PyTorch MergeBot	91040f4934	Revert "[Code Clean] Remove support of python3.9 (#163846 )" This reverts commit `bc1690c7e8`. Reverted https://github.com/pytorch/pytorch/pull/163846 on behalf of https://github.com/izaitsevfb due to breaks distributed tests ([comment](https://github.com/pytorch/pytorch/pull/163846#issuecomment-3386855437))	2025-10-09 17:27:08 +00:00
Catherine Lee	eaa02655ea	[CI] Run cpp tests on windows in one run_tests call (#164861 ) The windows cpp tests take ~1 hour according to logs. Each has run_test called on them individually, so I tried batching them together so it's just one run_test call for all of them. I believe it now takes 30min. I turned off TD since I don't think cpp tests are included in TD stuff. As always with batch, I'm not sure if the errorlevel/error surfacing stuff is correct This code is written with a lot of help from chatgpu and copilot Pull Request resolved: https://github.com/pytorch/pytorch/pull/164861 Approved by: https://github.com/huydhn	2025-10-09 16:07:28 +00:00
FFFrog	bc1690c7e8	[Code Clean] Remove support of python3.9 (#163846 ) As the title stated. Pull Request resolved: https://github.com/pytorch/pytorch/pull/163846 Approved by: https://github.com/ezyang	2025-10-09 11:54:10 +00:00
Boyuan Feng	90b4e130d6	[Benchmark] cleanup torchbench models (#164816 ) Prune models from TorchInductor dashboard to reduce ci cost. This PR prunes torchbench models according to the [doc](https://docs.google.com/document/d/1nLPNNAU-_M9Clx9FMrJ1ycdPxe-xRA54olPnsFzdpoU/edit?tab=t.0), which removes timm and huggingface models from torchbench. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164816 Approved by: https://github.com/anijain2305, https://github.com/seemethere, https://github.com/huydhn, https://github.com/malfet	2025-10-09 00:31:25 +00:00
Laith Sakka	0b85236477	Fix refine_ranges corner case (#164075 ) (#164846 ) Summary: address https://github.com/pytorch/pytorch/issues/161360 u0>0 should update the range of u0 to start from [1, ..] this fix it. it was not doing that. Test Plan: contbuild & OSS CI, see `27234792ad` D84038721 Pull Request resolved: https://github.com/pytorch/pytorch/pull/164846 Approved by: https://github.com/izaitsevfb, https://github.com/ezyang	2025-10-08 18:42:37 +00:00
Adnan Akhundov	73adac05d1	Triton 3.5.x pin update to 7416ffc (#164587 ) Updates triton pin to latest: https://github.com/triton-lang/triton/commits/release/3.5.x/ This updates contains 1 cherry-pick to fix flex_attention_fwd regression on B200: - https://github.com/triton-lang/triton/pull/8366 Pull Request resolved: https://github.com/pytorch/pytorch/pull/164587 Approved by: https://github.com/atalman	2025-10-08 16:07:18 +00:00
Jeff Daily	f46ddb1e65	[ROCm][CI] add gfx1150 gfx1151 to docker images for binary builds (#164854 ) Fixes #164346. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164854 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-08 14:34:22 +00:00
Boyuan Feng	f76fdcaaf8	[Benchmark] cleanup huggingface models (#164815 ) Prune models from TorchInductor dashboard to reduce ci cost. This PR prunes for hugging face models according to the [doc](https://docs.google.com/document/d/1nLPNNAU-_M9Clx9FMrJ1ycdPxe-xRA54olPnsFzdpoU/edit?tab=t.0), which reduces from 46 to 27 models. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164815 Approved by: https://github.com/anijain2305, https://github.com/seemethere, https://github.com/huydhn, https://github.com/malfet	2025-10-08 03:21:04 +00:00
amdfaa	955f21dc2c	[ROCm][CI] Add support for gfx1100 in rocm workflow + test skips (#148355 ) This PR adds infrastructure support for gfx1100 in the rocm workflow. Nodes have been allocated for this effort. @dnikolaev-amd contributed all the test skips. Pull Request resolved: https://github.com/pytorch/pytorch/pull/148355 Approved by: https://github.com/jeffdaily Co-authored-by: Dmitry Nikolaev <dmitry.nikolaev@amd.com> Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-07 22:36:25 +00:00
Aleksei Nikiforov	87c9fbda22	Follow up to PR 163980 for s390x (#164464 ) Now with same updates propagated to s390x it works on s390x runners too. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164464 Approved by: https://github.com/atalman	2025-10-07 12:02:29 +00:00
Yuanyuan Chen	50e077beaa	Fix outdated info in requirements-ci.txt (#164441 ) Fixes installation instructions and descriptions for `numba` and `scikit-image` Pull Request resolved: https://github.com/pytorch/pytorch/pull/164441 Approved by: https://github.com/albanD	2025-10-07 02:10:41 +00:00
Jeff Daily	8f54e27e5d	[ROCm][CI] rebuild magma binary for gfx1150 gfx1151 (#164782 ) After #164763 added gfx1150 gfx1151 to list of targets, this PR will trigger rebuild of magma binary for ROCm 7 with the new targets. Pull Request resolved: https://github.com/pytorch/pytorch/pull/164782 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>	2025-10-06 23:29:21 +00:00
PyTorch MergeBot	3912ba3e94	Revert "Fix refine_ranges corner case (#164075 )" This reverts commit `27234792ad`. Reverted https://github.com/pytorch/pytorch/pull/164075 on behalf of https://github.com/izaitsevfb due to fails executorch builds, see [D83938444](https://www.internalfb.com/diff/D83938444) ([comment](https://github.com/pytorch/pytorch/pull/164075#issuecomment-3374430964))	2025-10-06 22:09:39 +00:00

1 2 3 4 5 ...

1733 Commits