As in title
I'm not sure how the install_cmake script is used: I see it being called with 3.18, but when I look at the build jobs, some say 3.18 and others 3.31.
Just make everything install cmake via requirements-ci.txt. I don't know if the comment at 5d36485b4a/.ci/docker/common/install_conda.sh (L78) still holds, but pretty much every build has CONDA_CMAKE set to true, so I'm defaulting to installing through pip.
Also defaulting to 4.0.0 everywhere except the executorch docker build, because executorch reinstalls 3.31.something.
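A minimal sketch of what the pip-based install amounts to, assuming the pin lands in .ci/docker/requirements-ci.txt (the exact pin and file layout follow the description above and are not verified against the repo):
```
# Pin CMake in the shared requirements file and install it through pip instead
# of the install_cmake / conda path (4.0.0 is the default mentioned above):
echo "cmake==4.0.0" >> .ci/docker/requirements-ci.txt
pip install -r .ci/docker/requirements-ci.txt
cmake --version   # should report cmake version 4.0.0
```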
Pull Request resolved: https://github.com/pytorch/pytorch/pull/152537
Approved by: https://github.com/cyyever, https://github.com/atalman, https://github.com/malfet
Spot-checked builds for a line like `Found CUSPARSELT: /usr/local/cuda/lib64/libcusparseLt.so`; I don't know if there's another way to verify it.
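A minimal sketch of that spot check, assuming the build output was captured to a log file (the file name is hypothetical):
```
# Look for CMake's cuSPARSELt detection line in the captured build log:
grep "Found CUSPARSELT" build.log
# expected: Found CUSPARSELT: /usr/local/cuda/lib64/libcusparseLt.so
```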
I am slowly trying to reduce the duplicated code in docker image installs
Pros:
* less dup code
Cons:
* more docker copies
Pull Request resolved: https://github.com/pytorch/pytorch/pull/150600
Approved by: https://github.com/atalman
This was failing due to pybind being strict about its CMake version requirements.
This resolves errors like:
```
652.1 Compatibility with CMake < 3.5 has been removed from CMake.
652.1
652.1 Update the VERSION argument <min> value. Or, use the <min>...<max> syntax
652.1 to tell CMake that the project requires at least <min> but has been updated
652.1 to work with policies introduced by <max> or earlier.
652.1
652.1 Or, add -DCMAKE_POLICY_VERSION_MINIMUM=3.5 to try configuring anyway.
652.1
652.1
652.1 -- Configuring incomplete, errors occurred!
```
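For reference, these are the two workarounds the error itself suggests, sketched as if configuring the offending project directly (whether the image build uses the flag or bumps the minimum is not shown here):
```
# Option 1: override at configure time
cmake -DCMAKE_POLICY_VERSION_MINIMUM=3.5 -S . -B build
# Option 2: raise the minimum in the project's CMakeLists.txt, e.g.
#   cmake_minimum_required(VERSION 3.5...4.0)
```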
Tested this locally with the following command:
```
./build.sh pytorch-linux-jammy-py3.12-halide -t 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-jammy-py3.12-halide:8a8989876ff1aa1d5b0e465177afebbc7a9da921
```
Closes https://github.com/pytorch/pytorch/issues/150420
Signed-off-by: Eli Uriegas <eliuriegas@meta.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/150560
Approved by: https://github.com/clee2000, https://github.com/ZainRizvi, https://github.com/atalman, https://github.com/malfet
Install nccl in the docker image (which is already being done in some docker images), and use USE_SYSTEM_NCCL=1 in CI builds
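A minimal sketch of what the CI build change amounts to, assuming NCCL is already present in the image (the actual build invocation in CI differs):
```
# Link against the NCCL that the docker image already ships instead of
# building the bundled copy from source:
USE_SYSTEM_NCCL=1 python setup.py bdist_wheel
```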
It takes some time to build nccl and the build doesn't happen in parallel, so there's less benefit in switching to a bigger runner and using more processes.
The other changes in this PR are because there are an install_cuda script and an install_cuda_aarch64 script that both build nccl from source and define their own pins for the nccl version. There are also .ci/docker/nccl-cu11.txt and nccl-cu12.txt files that define the pins, and this is an attempt to unify them. Unfortunately this leads to a lot of files needing to be copied into the docker build.
Generally this seems to increase docker pull times by <1 min (P1768456379), but it's hard to tell what the real increase is.
15761 MiB -> 16221 MiB for [linux-focal-cuda11.8-py3.10-gcc9 / test (distributed)](https://github.com/pytorch/pytorch/actions/runs/14114171729/job/39545500161#logs), computed with
`jq '[.layers[].size, .config.size] | add / 1024 / 1024'`
Example 6eb3c2e282 (39520169577-box)
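The jq filter above just sums the layer and config blob sizes from an image manifest; a sketch of how it might be run (the manifest-fetching command and image name are assumptions):
```
docker manifest inspect <image:tag> | jq '[.layers[].size, .config.size] | add / 1024 / 1024'
# prints the total image size in MiB
```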

TODO:
* Figure out a way to verify that nccl was built and works properly when it is expected (this time I just checked torch.distributed.is_nccl_available; see the sketch after this list)
* Merge the cusparse installation scripts
* Merge the cuda installation scripts
* Either always split the nccl, cuda, and cusparse installations, or always do them together in one bash script
distributed/test_distributed_spawn
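A minimal sketch of the spot check from the TODO above, run inside the built image (assumes a CUDA-enabled torch install is importable):
```
# The NCCL backend should be reported as available...
python -c "import torch; print(torch.distributed.is_nccl_available())"
# ...and the runtime NCCL version should match the pinned one
python -c "import torch; print(torch.cuda.nccl.version())"
```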
Pull Request resolved: https://github.com/pytorch/pytorch/pull/150226
Approved by: https://github.com/seemethere, https://github.com/atalman
Seems to reduce docker pull times by ~3 min when triton is requested, and some compressed docker sizes seem to have decreased by about a third.
Also add a check that triton is installed (or not installed) as expected.
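A minimal sketch of such a check, assuming an environment flag like TRITON marks whether the image is expected to ship triton (the flag name is hypothetical; the real CI check may differ):
```
if [ "${TRITON:-}" = "yes" ]; then
  # triton was requested: the import must succeed
  python -c "import triton; print('triton', triton.__version__)"
else
  # triton was not requested: the import must fail
  ! python -c "import triton" 2>/dev/null
fi
```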
Pull Request resolved: https://github.com/pytorch/pytorch/pull/149413
Approved by: https://github.com/malfet
I'm not sure if this is the right thing to do, but cmake 4.0.0 got released on PyPI and our builds are failing with it
Example:
aa70d62041 (39555975425-box)
I guess we have to go change all the cmake_minimum_required calls to >=3.5?
Backwards compat is still failing because it's building with the base commit, which this PR can't really change until it gets merged, but at least the manywheel binary builds got past where they were originally failing.
Also pin the conda installation, though the most recent cmake version on conda is 3.31.2.
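A sketch of the two pins described above (the exact version specs are assumptions; the point is just keeping CMake below 4.0.0 until the cmake_minimum_required calls are updated):
```
# pip-installed cmake: cap below 4
pip install 'cmake<4'
# conda-installed cmake: the newest version available there is 3.31.2 anyway
conda install -y cmake=3.31.2
```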
Pull Request resolved: https://github.com/pytorch/pytorch/pull/150158
Approved by: https://github.com/cyyever, https://github.com/malfet
* When we try to install [libstdcxx-ng 12.3.0 from conda-forge](595293316d/.ci/docker/common/install_conda.sh (L65)), conda 24.7.1 updates the dependencies of that package, including libgcc-ng package to the following: `libgcc-ng-14.2.0 | h69a702a_2 52 KB conda-forge`
* However, conda updated their installer script on Feb 6 2025 to version 25.1.1, which behaves differently from previous versions when installing conda packages.
* conda 25.1.1 does *not* update any dependencies in the above step, and hence the same installation of libgcc-ng from "defaults" channel is present: `libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1`
* Adding the "--update-deps" flag to the conda install command installs a newer libgcc-ng package from the "conda-forge" conda channel: `libgcc-ng-12.3.0 | h77fa898_13 762 KB conda-forge`, which is compatible with the libstdcxx-ng 12.3.0 package (see the sketch after the error output below)
* Compare this [Feb 4 docker build](https://github.com/pytorch/pytorch/actions/runs/13148456164/job/36691412387#step:6:5179) to this [Feb 10 docker build](https://github.com/pytorch/pytorch/actions/runs/13247023578/job/36975931849#step:6:5451), which shows that the latter does *not* update libgcc-ng.
* This creates linking issues when trying to use a library that was built with a newer libgcc_s.so.1 (from the libgcc-ng package) in the PyTorch conda environment. E.g. ONNX-RT:
```
2025-02-13 10:18:38.492434704 [W:onnxruntime:Default, migraphx_execution_provider.cc:167 get_flags_from_env]
[MIGraphX EP] MIGraphX ENV Override Variables Set:
2025-02-13 10:18:38.628064251 [E:onnxruntime:Default, provider_bridge_ort.cc:2028 TryGetProviderInfo_ROCM] /onnxruntime/onnxruntime/core/session/provider_bridge_ort.cc:1636 onnxruntime::Provider& onnxruntime::ProviderLibrary::Get() [ONNXRuntimeError] : 1 : FAIL : Failed to load library libonnxruntime_providers_rocm.so with error: /opt/conda/envs/py_3.10/bin/../lib/libgcc_s.so.1: version `GCC_12.0.0' not found (required by /opt/conda/envs/py_3.10/lib/python3.10/site-packages/onnxruntime/capi/libonnxruntime_providers_rocm.so)
```
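A sketch of the fix described in the list above, assuming it applies to the libstdcxx-ng install step in install_conda.sh (the exact invocation there may differ):
```
# --update-deps lets conda pull in the matching libgcc-ng 12.x from conda-forge
# instead of keeping the older libgcc-ng from the defaults channel:
conda install -y --update-deps -c conda-forge libstdcxx-ng=12.3.0
```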
Pull Request resolved: https://github.com/pytorch/pytorch/pull/149599
Approved by: https://github.com/malfet
Related to #149153
This updates some build scripts to hopefully fix the nightly builds, which are somehow building against nccl 2.25.1 while using 2.26.2 from pip.
Test plan:
After merging, rerun the nightly Linux jobs and validate that the nccl versions match.
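A minimal sketch of that validation, assuming a CUDA nightly wheel is installed (which pip NCCL package applies depends on the CUDA variant):
```
# NCCL version torch was built against:
python -c "import torch; print(torch.cuda.nccl.version())"
# NCCL version pulled in from pip (cu12 variant shown; cu11 uses nvidia-nccl-cu11):
pip show nvidia-nccl-cu12 | grep -i version
```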
Pull Request resolved: https://github.com/pytorch/pytorch/pull/149778
Approved by: https://github.com/Skylion007, https://github.com/atalman
Co-authored-by: Andrey Talman <atalman@fb.com>
Adds sccache to our manylinux images; these are purposefully built without the sccache-dist binary since we're not expecting to use that.
Another caveat of these builds is that they are built with the vendored version of openssl.
This is to set the stage for us to be able to build binaries
sequentially.
Signed-off-by: Eli Uriegas <github@terriblecode.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/148419
Approved by: https://github.com/atalman