Wang, Chuanqi
b09fb481e0
[CD] Upgrade GCC version to 13 for XPU build ( #162474 )
...
Follow #152426
Pull Request resolved: https://github.com/pytorch/pytorch/pull/162474
Approved by: https://github.com/zxiiro , https://github.com/atalman
2025-10-31 21:15:37 +00:00
Fadi Arafeh
d2be06f673
[cpu][fix] Update ACL version to fix crashes with tensor sizes > 2^31-1 ( #165904 )
...
----
- Updates Arm Compute Library (ACL) to v52.6.0
- v52.6.0 contains https://github.com/ARM-software/ComputeLibrary/pull/1201 which fixes crashes with tensors of sizes > 2^31-1
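The `2^31-1` boundary above is the largest value a signed 32-bit integer can hold. The sketch below only illustrates that boundary; the commit message does not state the cause of the crash, so signed 32-bit indexing is an assumption here, and `fits_in_int32` and `wrap_to_int32` are hypothetical helper names, not ACL or PyTorch APIs.

```python
INT32_MAX = 2**31 - 1  # 2147483647, the boundary named in the commit message


def fits_in_int32(num_elements: int) -> bool:
    """Return True if an element count is representable as a signed 32-bit int."""
    return 0 <= num_elements <= INT32_MAX


def wrap_to_int32(value: int) -> int:
    """Simulate storing a Python int in a signed 32-bit integer (two's complement)."""
    return ((value + 2**31) % 2**32) - 2**31


# A hypothetical tensor shape whose element count is exactly 2**31.
shape = (65536, 32768)
numel = shape[0] * shape[1]

print(fits_in_int32(numel))   # the count no longer fits
print(wrap_to_int32(numel))   # the wrapped value becomes negative
```

A size that wraps to a negative number is one plausible way an index computation could go wrong past this boundary.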
Fixes: #165654
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165904
Approved by: https://github.com/malfet
2025-10-31 19:37:26 +00:00
Jeff Daily
239e7b541a
[ROCm][CI] upgrade nightly wheels to ROCm 7.1 ( #166730 )
...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166730
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-31 17:30:47 +00:00
Jeff Daily
24e94e021a
[ROCm][CI] create ROCm 7.1 magma tarball ( #166693 )
...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166693
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-31 15:20:00 +00:00
Wang, Chuanqi
0d3a4f7155
[CD] Enable Inductor performance test for xpu ( #166289 )
...
Add Dynamo benchmark performance tests for XPU backend
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166289
Approved by: https://github.com/EikanWang , https://github.com/atalman
2025-10-31 10:52:07 +00:00
dependabot[bot]
cf7756da38
Bump uv from 0.9.5 to 0.9.6 in /.ci/lumen_cli ( #166578 )
...
Bumps [uv](https://github.com/astral-sh/uv ) from 0.9.5 to 0.9.6.
- [Release notes](https://github.com/astral-sh/uv/releases )
- [Changelog](https://github.com/astral-sh/uv/blob/main/CHANGELOG.md )
- [Commits](https://github.com/astral-sh/uv/compare/0.9.5...0.9.6 )
---
updated-dependencies:
- dependency-name: uv
dependency-version: 0.9.6
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-29 18:28:14 -07:00
Jeff Daily
12c12466b0
[ROCm][CI] remove amdgpu from install_rocm.sh ( #166575 )
...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166575
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-30 01:08:33 +00:00
Andrey Talman
82ff07c788
Add py 3.14 CI docker build pytorch-linux-jammy-py3.14-clang12 ( #164791 )
...
Related to https://github.com/pytorch/pytorch/issues/156856
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164791
Approved by: https://github.com/huydhn , https://github.com/malfet , https://github.com/albanD
2025-10-29 22:21:22 +00:00
Aaron Gokaslan
96b61844a7
[BE]: Update nvshmem to 3.4.5 ( #164046 )
...
Release notes can be found here: https://docs.nvidia.com/nvshmem/release-notes-install-guide/release-notes/release-3405.html . The main difference is the addition of a CPU-assisted IBGDA fallback, which should allow NVSHMEM IBGDA to work on many more systems without admin intervention and without using GDRCopy.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164046
Approved by: https://github.com/ezyang , https://github.com/kwen2501
2025-10-29 07:32:05 +00:00
atalman
b903018c26
[CD] Windows builds migrate python 3.14rc1->3.14.0 ( #166408 )
...
Python 3.14 was released, so we can now use the official release version
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166408
Approved by: https://github.com/Skylion007 , https://github.com/malfet
2025-10-28 19:52:38 +00:00
Bin Bao
687c15c0b3
[AOTI][BE] Change test_aoti_inference to one-pass build ( #164277 )
...
Summary: To fix https://github.com/pytorch/pytorch/issues/159400 . Currently, test_aoti_abi_check and test_aoti_inference need to be built in two passes: first build pytorch using the regular `python setup.py develop`, and then build with `CMAKE_FRESH=1 BUILD_AOT_INDUCTOR_TEST=1 python setup.py develop`. This is cumbersome. Fix by rewriting CMakeLists.txt for test_aoti_inference as a one-pass build which runs AOTI to compile models at test time. Also update the CI test script to get rid of the two-pass build. test_aoti_abi_check is not AOTI specific, so it is no longer guarded by BUILD_AOT_INDUCTOR_TEST.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164277
Approved by: https://github.com/janeyx99
2025-10-28 17:43:22 +00:00
PyTorch MergeBot
a4a0378e6b
Revert "[cuDNN] Smoke-test runtime cuDNN version matches compile time version in CI ( #165922 )"
...
This reverts commit 2a5f87decf .
Reverted https://github.com/pytorch/pytorch/pull/165922 on behalf of https://github.com/atalman due to cudnn update started to fail, see https://github.com/pytorch/pytorch/pull/165913#issuecomment-3457293475 ([comment](https://github.com/pytorch/pytorch/pull/165922#issuecomment-3457389406 ))
2025-10-28 16:29:29 +00:00
PyTorch MergeBot
0eacd934bc
Revert "Update cuDNN 9.10.2 in Manylinux 2.28 Docker files ( #165913 )"
...
This reverts commit 840d63c12d .
Reverted https://github.com/pytorch/pytorch/pull/165913 on behalf of https://github.com/clee2000 due to I think something here is causing CI tests to segfault at exit on cuda, ex [GH job link](https://github.com/pytorch/pytorch/actions/runs/18857880394/job/53811917713 ) [HUD commit link](9a91486e45 ) says no tests failed but it segfaulted afterwards. I can't tell if it's because of this change, or an unpinned dependency in docker that got triggered by this. Note to self, would have been bad TD except trunk didn't run either ([comment](https://github.com/pytorch/pytorch/pull/165913#issuecomment-3457293475 ))
2025-10-28 16:11:07 +00:00
Ting Lu
544b443ea1
[CD] Upgrade to CUDA 13.0.2 for nightly binaries ( #165470 )
...
13.0.U2 is posted, adding to nightlies
Why we want to upgrade: CUDA 13.0.U2 included a new release from cuBLAS that
1. Enabled opt-in fixed-point emulation for FP64 matmuls (D/ZGEMM) which improves performance and power-efficiency.
2. Improved performance on NVIDIA [DGX Spark](https://www.nvidia.com/en-us/products/workstations/dgx-spark/ ) for FP16/BF16 and FP8 GEMMs.
3. Added BF16x9 FP32 emulation support for SYRK and HERK routines.
Reference: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cublas-release-13-0-update-2
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165470
Approved by: https://github.com/atalman
2025-10-28 15:14:43 +00:00
PyTorch MergeBot
74336f8c77
Revert "[CD] Upgrade to CUDA 13.0.2 for nightly binaries ( #165470 )"
...
This reverts commit 5e769ff867 .
Reverted https://github.com/pytorch/pytorch/pull/165470 on behalf of https://github.com/atalman due to Sorry reverting for now, to restore trunk health ([comment](https://github.com/pytorch/pytorch/pull/165470#issuecomment-3454166879 ))
2025-10-28 02:21:48 +00:00
Ting Lu
5e769ff867
[CD] Upgrade to CUDA 13.0.2 for nightly binaries ( #165470 )
...
13.0.U2 is posted, adding to nightlies
Why we want to upgrade: CUDA 13.0.U2 included a new release from cuBLAS that
1. Enabled opt-in fixed-point emulation for FP64 matmuls (D/ZGEMM) which improves performance and power-efficiency.
2. Improved performance on NVIDIA [DGX Spark](https://www.nvidia.com/en-us/products/workstations/dgx-spark/ ) for FP16/BF16 and FP8 GEMMs.
3. Added BF16x9 FP32 emulation support for SYRK and HERK routines.
Reference: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cublas-release-13-0-update-2
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165470
Approved by: https://github.com/atalman
2025-10-28 00:21:47 +00:00
Eddie Yan
2a5f87decf
[cuDNN] Smoke-test runtime cuDNN version matches compile time version in CI ( #165922 )
...
Fix and regression test for https://github.com/pytorch/pytorch/issues/165801
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165922
Approved by: https://github.com/malfet , https://github.com/atalman , https://github.com/Skylion007 , https://github.com/drisspg
Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
2025-10-27 22:10:45 +00:00
Andrey Talman
840d63c12d
Update cuDNN 9.10.2 in Manylinux 2.28 Docker files ( #165913 )
...
Fixes https://github.com/pytorch/pytorch/issues/165801
Smoke test: https://github.com/pytorch/pytorch/pull/165922/files
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165913
Approved by: https://github.com/Camyll , https://github.com/Skylion007
2025-10-27 22:08:06 +00:00
Yuanyuan Chen
7cb467a169
[CI] Update ONNX CI packages to latest ( #165883 )
...
This PR updates ONNX related packages to their latest versions used in CI environments.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165883
Approved by: https://github.com/justinchuby , https://github.com/albanD
2025-10-22 00:25:35 +00:00
dependabot[bot]
dac9ed9790
Bump uv from 0.8.6 to 0.9.5 in /.ci/lumen_cli ( #166017 )
...
Bumps [uv](https://github.com/astral-sh/uv ) from 0.8.6 to 0.9.5.
- [Release notes](https://github.com/astral-sh/uv/releases )
- [Changelog](https://github.com/astral-sh/uv/blob/main/CHANGELOG.md )
- [Commits](https://github.com/astral-sh/uv/compare/0.8.6...0.9.5 )
---
updated-dependencies:
- dependency-name: uv
dependency-version: 0.9.5
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-10-21 13:16:30 -07:00
Nikita Shulga
5d62b63a76
[BE] Use Python-3.14 GE build ( #165804 )
...
3.14 reached general availability on Oct 7th 2025, so we can remove all pre-release workarounds
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165804
Approved by: https://github.com/yangw-dev , https://github.com/Skylion007 , https://github.com/cyyever
2025-10-19 11:45:10 +00:00
Yuanyuan Chen
3255e7872b
Enable all flake8-logging-format rules ( #164655 )
...
These rules are enabled by removing existing suppressions.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164655
Approved by: https://github.com/janeyx99 , https://github.com/mlazos
2025-10-19 00:59:28 +00:00
Jeff Daily
da8517fa63
[ROCm][CI] upgrade wheels to 7.0.2 and 6.4.4 patch release ( #165756 )
...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165756
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-17 17:41:19 +00:00
Justin Chu
fcbde24c1c
[ONNX] Remove common imports from torchlib ( #165156 )
...
The Rank and IsScalar functions are no longer used in the torchlib. Requires onnxscript v0.5.4
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165156
Approved by: https://github.com/Skylion007 , https://github.com/cyyever
2025-10-17 03:25:34 +00:00
Shangdi Yu
d82527b32a
[Windows] Add AOTI cross-compilation CI ( #165573 )
...
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165573
Approved by: https://github.com/malfet
ghstack dependencies: #165560
2025-10-17 01:05:35 +00:00
Shangdi Yu
5d9b024276
Add mingw to docker ( #165560 )
...
Add mingw to `pytorch-linux-jammy-cuda12.8-cudnn9-py3-gcc11` docker image to support AOTI cross-compilation
This PR will trigger a rebuild of the docker container and upgrade the Python version from 3.13.7 to 3.13.8. It relies on https://github.com/pytorch/pytorch/pull/165667
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165560
Approved by: https://github.com/malfet
2025-10-17 00:47:01 +00:00
Nikita Shulga
4d833f859b
[BE] [CI] Fix aarch64 arch checks ( #165676 )
...
Instead of relying on `TEST_CONFIG` environment variable to contain `aarch64`, which is prone to errors, use output of `$(uname -m)` that is equal to `aarch64` on Linux ARM systems
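A minimal Python sketch of the same idea, for illustration only: the actual change is to CI scripts and uses `$(uname -m)`. The helper names `is_aarch64` and `is_aarch64_from_config` are hypothetical, and `platform.machine()` is used here as the Python counterpart of `uname -m`.

```python
import platform
from typing import Optional


def is_aarch64(machine: Optional[str] = None) -> bool:
    """Detect 64-bit ARM Linux from the machine name the system reports."""
    name = platform.machine() if machine is None else machine
    return name == "aarch64"


def is_aarch64_from_config(test_config: str) -> bool:
    """The error-prone alternative: infer the architecture from a job label."""
    return "aarch64" in test_config


# A job labelled "default" running on an ARM host is missed by the label check,
# while the machine-name check does not depend on how the job was named.
print(is_aarch64_from_config("default"))
print(is_aarch64("aarch64"))
```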
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165676
Approved by: https://github.com/huydhn , https://github.com/atalman
ghstack dependencies: #165583 , #165584
2025-10-16 22:19:53 +00:00
PyTorch MergeBot
69b05913fb
Revert "Add mingw to docker ( #165560 )"
...
This reverts commit 5e480b8ecf .
Reverted https://github.com/pytorch/pytorch/pull/165560 on behalf of https://github.com/pytorch-auto-revert due to Reverted automatically by pytorch's autorevert, to avoid this behaviour add the tag autorevert: disable ([comment](https://github.com/pytorch/pytorch/pull/165560#issuecomment-3409814274 ))
2025-10-16 08:42:11 +00:00
Nikita Shulga
23fb7e9f4b
[CI] Add arch prefix in front of op benchmark results ( #165584 )
...
To be able to run x86 and aarch64 benchmarks later on
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165584
Approved by: https://github.com/huydhn
ghstack dependencies: #165583
2025-10-16 01:50:52 +00:00
Shangdi Yu
5e480b8ecf
Add mingw to docker ( #165560 )
...
Add mingw to `pytorch-linux-jammy-cuda12.8-cudnn9-py3-gcc11` docker image to support AOTI cross-compilation
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165560
Approved by: https://github.com/malfet
ghstack dependencies: #165574
2025-10-16 01:31:50 +00:00
Huy Do
132ae8e6dd
Don't link with libnvToolsExt when building for 12.9 ( #165465 )
...
This is to bring back this logic from https://github.com/pytorch/pytorch/pull/161916/files#diff-bf46b4a09ca67e50622bf84fefc0d11b584ffcc24ee6cc5019cf0fc7565d81a8L170 . Building libtorch on 12.9 is failing otherwise https://github.com/pytorch/pytorch/actions/runs/18458531395/job/52610761895 :
```
cp: cannot stat '/usr/local/cuda/lib64/libnvToolsExt.so.1': No such file or directory
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165465
Approved by: https://github.com/atalman , https://github.com/malfet
2025-10-15 01:45:37 +00:00
PyTorch UpdateBot
a2601630cd
[vllm hash update] update the pinned vllm hash ( #164628 )
...
This PR is auto-generated nightly by [this action](https://github.com/pytorch/pytorch/blob/main/.github/workflows/nightly.yml ).
Update the pinned vllm hash.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164628
Approved by: https://github.com/pytorchbot
Co-authored-by: Huy Do <huydhn@gmail.com>
2025-10-12 18:26:07 +00:00
Huy Do
4400c5d31e
Continue to build nightly CUDA 12.9 for internal ( #163029 )
...
Revert part of https://github.com/pytorch/pytorch/pull/161916 to continue building CUDA 12.9 nightly
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163029
Approved by: https://github.com/malfet
2025-10-11 08:26:47 +00:00
Aidyn-A
d41aa187ec
Add more B200 smoke test ( #165133 )
...
A follow up to #159494 . This PR adds additional `test_scaled_matmul_cuda` to smoke tests.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165133
Approved by: https://github.com/drisspg
2025-10-10 16:46:26 +00:00
FFFrog
e0abcee3b5
[Code Clean] Remove support of python3.9 ( #163846 )
...
As the title stated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163846
Approved by: https://github.com/ezyang
2025-10-10 11:11:56 +00:00
Jithun Nair
c7b57d9349
Add gfx1100 to build target for ROCm docker builds ( #165103 )
...
Fixes issue of gfx1100 test jobs timing out
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165103
Approved by: https://github.com/jeffdaily
2025-10-10 01:18:56 +00:00
Nikita Shulga
e7fd296930
[CI] Add full debug build to trunk ( #164974 )
...
It does not run tests, just imports torch, as a regression test for https://github.com/pytorch/pytorch/issues/164297
Test plan: Re-apply #164974 on top of this change and observe the failure in the workflows: https://github.com/pytorch/pytorch/actions/runs/18383302153/job/52375282838
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164974
Approved by: https://github.com/seemethere , https://github.com/clee2000 , https://github.com/atalman
ghstack dependencies: #164968 , #164969
2025-10-09 20:12:16 +00:00
PyTorch MergeBot
91040f4934
Revert "[Code Clean] Remove support of python3.9 ( #163846 )"
...
This reverts commit bc1690c7e8 .
Reverted https://github.com/pytorch/pytorch/pull/163846 on behalf of https://github.com/izaitsevfb due to breaks distributed tests ([comment](https://github.com/pytorch/pytorch/pull/163846#issuecomment-3386855437 ))
2025-10-09 17:27:08 +00:00
Catherine Lee
eaa02655ea
[CI] Run cpp tests on windows in one run_tests call ( #164861 )
...
The windows cpp tests take ~1 hour according to logs. Each has run_test called on them individually, so I tried batching them together so it's just one run_test call for all of them. I believe it now takes 30min. I turned off TD since I don't think cpp tests are included in TD stuff.
As always with batch, I'm not sure if the errorlevel/error surfacing stuff is correct
This code was written with a lot of help from ChatGPT and Copilot
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164861
Approved by: https://github.com/huydhn
2025-10-09 16:07:28 +00:00
FFFrog
bc1690c7e8
[Code Clean] Remove support of python3.9 ( #163846 )
...
As the title stated.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163846
Approved by: https://github.com/ezyang
2025-10-09 11:54:10 +00:00
Boyuan Feng
90b4e130d6
[Benchmark] cleanup torchbench models ( #164816 )
...
Prune models from the TorchInductor dashboard to reduce CI cost. This PR prunes torchbench models according to the [doc](https://docs.google.com/document/d/1nLPNNAU-_M9Clx9FMrJ1ycdPxe-xRA54olPnsFzdpoU/edit?tab=t.0 ), which removes timm and huggingface models from torchbench.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164816
Approved by: https://github.com/anijain2305 , https://github.com/seemethere , https://github.com/huydhn , https://github.com/malfet
2025-10-09 00:31:25 +00:00
Laith Sakka
0b85236477
Fix refine_ranges corner case ( #164075 ) ( #164846 )
...
Summary:
address https://github.com/pytorch/pytorch/issues/161360
u0>0 should update the range of u0 to start from 1, i.e. [1, ..]. It was not doing that; this fixes it.
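The intended refinement can be sketched as follows. This is an illustration of the rule only, not the actual `refine_ranges` code; `IntRange` and `refine_greater_than` are hypothetical names, and the starting range `[0, inf)` for `u0` is an assumption.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class IntRange:
    """Inclusive bounds for an integer symbol; upper may be math.inf."""
    lower: int
    upper: float


def refine_greater_than(r: IntRange, bound: int) -> IntRange:
    """Tighten a range given that the integer symbol is strictly greater than bound."""
    # For integers, sym > bound is equivalent to sym >= bound + 1.
    return IntRange(max(r.lower, bound + 1), r.upper)


u0 = IntRange(0, math.inf)            # assumed starting range for u0
refined = refine_greater_than(u0, 0)  # apply the constraint u0 > 0
print(refined.lower)                  # lower bound after refinement
```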
Test Plan: contbuild & OSS CI, see 27234792ad
D84038721
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164846
Approved by: https://github.com/izaitsevfb , https://github.com/ezyang
2025-10-08 18:42:37 +00:00
Adnan Akhundov
73adac05d1
Triton 3.5.x pin update to 7416ffc ( #164587 )
...
Updates triton pin to latest: https://github.com/triton-lang/triton/commits/release/3.5.x/
This update contains 1 cherry-pick to fix a flex_attention_fwd regression on B200:
- https://github.com/triton-lang/triton/pull/8366
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164587
Approved by: https://github.com/atalman
2025-10-08 16:07:18 +00:00
Jeff Daily
f46ddb1e65
[ROCm][CI] add gfx1150 gfx1151 to docker images for binary builds ( #164854 )
...
Fixes #164346 .
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164854
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-08 14:34:22 +00:00
Boyuan Feng
f76fdcaaf8
[Benchmark] cleanup huggingface models ( #164815 )
...
Prune models from the TorchInductor dashboard to reduce CI cost. This PR prunes Hugging Face models according to the [doc](https://docs.google.com/document/d/1nLPNNAU-_M9Clx9FMrJ1ycdPxe-xRA54olPnsFzdpoU/edit?tab=t.0 ), which reduces the set from 46 to 27 models.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164815
Approved by: https://github.com/anijain2305 , https://github.com/seemethere , https://github.com/huydhn , https://github.com/malfet
2025-10-08 03:21:04 +00:00
amdfaa
955f21dc2c
[ROCm][CI] Add support for gfx1100 in rocm workflow + test skips ( #148355 )
...
This PR adds infrastructure support for gfx1100 in the rocm workflow. Nodes have been allocated for this effort.
@dnikolaev-amd contributed all the test skips.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/148355
Approved by: https://github.com/jeffdaily
Co-authored-by: Dmitry Nikolaev <dmitry.nikolaev@amd.com>
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-07 22:36:25 +00:00
Aleksei Nikiforov
87c9fbda22
Follow up to PR 163980 for s390x ( #164464 )
...
With the same updates propagated to s390x, it now works on s390x runners too.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164464
Approved by: https://github.com/atalman
2025-10-07 12:02:29 +00:00
Yuanyuan Chen
50e077beaa
Fix outdated info in requirements-ci.txt ( #164441 )
...
Fixes installation instructions and descriptions for `numba` and `scikit-image`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164441
Approved by: https://github.com/albanD
2025-10-07 02:10:41 +00:00
Jeff Daily
8f54e27e5d
[ROCm][CI] rebuild magma binary for gfx1150 gfx1151 ( #164782 )
...
After #164763 added gfx1150 gfx1151 to list of targets, this PR will trigger rebuild of magma binary for ROCm 7 with the new targets.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/164782
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
2025-10-06 23:29:21 +00:00
PyTorch MergeBot
3912ba3e94
Revert "Fix refine_ranges corner case ( #164075 )"
...
This reverts commit 27234792ad .
Reverted https://github.com/pytorch/pytorch/pull/164075 on behalf of https://github.com/izaitsevfb due to fails executorch builds, see [D83938444](https://www.internalfb.com/diff/D83938444 ) ([comment](https://github.com/pytorch/pytorch/pull/164075#issuecomment-3374430964 ))
2025-10-06 22:09:39 +00:00