pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Han, Xu	3e382456c1	Fix compiler check (#120492 ) Fixes #119304 1. Add try catch to handle the compiler version check. 2. Retry to query compiler version info. 3. Return False if can't get compiler info twice. Pull Request resolved: https://github.com/pytorch/pytorch/pull/120492 Approved by: https://github.com/ezyang	2024-02-25 02:41:20 +00:00
Wang, Xiao	c83af673bc	Allow CUDA extension builds to skip generating cuda dependencies during compile time (#119936 ) nvcc flag `--generate-dependencies-with-compile` doesn't seem to be supported by `sccache` for now. Builds with this flag enabled will not benefit from sccache. This PR adds an environment variable that allows users to set this flag and skip those nvcc dependencies to speed up their build with compiler caches. If everything is "fresh build" in CI, we don't care if there are unnecessary recompile during incremental builds. related: https://github.com/pytorch/pytorch/pull/49344 - [ ] todo: raise an issue to sccache Pull Request resolved: https://github.com/pytorch/pytorch/pull/119936 Approved by: https://github.com/ezyang	2024-02-15 07:03:59 +00:00
Mark Saroufim	7fd6b1c558	s/print/warn in arch choice in cpp extension (#119463 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/119463 Approved by: https://github.com/malfet	2024-02-08 20:38:51 +00:00
Nikolay Bogoychev	46ef73505d	Clarify how to get extra link flags when building CUDA/C++ extension (#118743 ) Make it a bit more explicit how one parse linker arguments to the build and point to the superclass documentation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/118743 Approved by: https://github.com/ezyang	2024-02-01 22:35:25 +00:00
Catherine Lee	4f5785b6b3	Enable possibly-undefined error code (#118533 ) Fixes https://github.com/pytorch/pytorch/issues/118129 Suppressions automatically added with ``` import re with open("error_file.txt", "r") as f: errors = f.readlines() error_lines = {} for error in errors: match = re.match(r"(.):(\d+):\d+: error:.\[(.*)\]", error) if match: file_path, line_number, error_type = match.groups() if file_path not in error_lines: error_lines[file_path] = {} error_lines[file_path][int(line_number)] = error_type for file_path, lines in error_lines.items(): with open(file_path, "r") as f: code = f.readlines() for line_number, error_type in sorted(lines.items(), key=lambda x: x[0], reverse=True): code[line_number - 1] = code[line_number - 1].rstrip() + f" # type: ignore[{error_type}]\n" with open(file_path, "w") as f: f.writelines(code) ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Co-authored-by: Catherine Lee <csl@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118533 Approved by: https://github.com/Skylion007, https://github.com/zou3519	2024-01-30 21:07:01 +00:00
PyTorch MergeBot	40ece2e579	Revert "Enable possibly-undefined error code (#118533 )" This reverts commit `4f13f69a45`. Reverted https://github.com/pytorch/pytorch/pull/118533 on behalf of https://github.com/clee2000 due to sorry i'm trying to figure out a codev merge conflict, if this works i'll be back to rebase and merge ([comment](https://github.com/pytorch/pytorch/pull/118533#issuecomment-1917695185))	2024-01-30 19:00:34 +00:00
Edward Z. Yang	4f13f69a45	Enable possibly-undefined error code (#118533 ) Fixes https://github.com/pytorch/pytorch/issues/118129 Suppressions automatically added with ``` import re with open("error_file.txt", "r") as f: errors = f.readlines() error_lines = {} for error in errors: match = re.match(r"(.):(\d+):\d+: error:.\[(.*)\]", error) if match: file_path, line_number, error_type = match.groups() if file_path not in error_lines: error_lines[file_path] = {} error_lines[file_path][int(line_number)] = error_type for file_path, lines in error_lines.items(): with open(file_path, "r") as f: code = f.readlines() for line_number, error_type in sorted(lines.items(), key=lambda x: x[0], reverse=True): code[line_number - 1] = code[line_number - 1].rstrip() + f" # type: ignore[{error_type}]\n" with open(file_path, "w") as f: f.writelines(code) ``` Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118533 Approved by: https://github.com/Skylion007, https://github.com/zou3519	2024-01-30 05:08:10 +00:00
lancerts	e6f3a4746c	include a print for _get_cuda_arch_flags (#118503 ) Related to #118494, it is not clear to users that the default behavior is to include all feasible archs (if the 'TORCH_CUDA_ARCH_LIST' is not set). In these scenarios, a user may experience a long build time. Adding a print statement to reflect this behavior. [`verbose` arg is not available and not feeling necessary to add `verbose` arg to this function and all its parent functions...] Co-authored-by: Edward Z. Yang <ezyang@mit.edu> Pull Request resolved: https://github.com/pytorch/pytorch/pull/118503 Approved by: https://github.com/ezyang	2024-01-29 07:03:56 +00:00
Kunal Tyagi	6c02520466	Remove unneeded comment and link for `BuildExtension` (#115496 ) `BuildExtension` is no longer derived from object, but from `build_ext`. Py2 is also deprecated, so this comment wouldn't be required anyways Pull Request resolved: https://github.com/pytorch/pytorch/pull/115496 Approved by: https://github.com/Skylion007	2024-01-01 08:29:48 +00:00
Jeff Daily	8bff59e41d	[ROCm] add hipblaslt support (#114329 ) Disabled by default. Enable with env var DISABLE_ADDMM_HIP_LT=0. Tested on both ROCm 5.7 and 6.0. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114329 Approved by: https://github.com/malfet	2023-12-20 19:09:25 +00:00
PyTorch MergeBot	47908a608f	Revert "[ROCm] add hipblaslt support (#114329 )" This reverts commit `b062ea3803`. Reverted https://github.com/pytorch/pytorch/pull/114329 on behalf of https://github.com/jeanschmidt due to Reverting due to inconsistencies on internal diff ([comment](https://github.com/pytorch/pytorch/pull/114329#issuecomment-1861933267))	2023-12-19 01:04:58 +00:00
Jeff Daily	b062ea3803	[ROCm] add hipblaslt support (#114329 ) Disabled by default. Enable with env var DISABLE_ADDMM_HIP_LT=0. Tested on both ROCm 5.7 and 6.0. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114329 Approved by: https://github.com/malfet	2023-12-15 15:36:46 +00:00
PyTorch MergeBot	59f7355f86	Revert "[ROCm] add hipblaslt support (#114329 )" This reverts commit `bb2bb8cca1`. Reverted https://github.com/pytorch/pytorch/pull/114329 on behalf of https://github.com/atalman due to OSSCI oncall, trunk tests are failing ([comment](https://github.com/pytorch/pytorch/pull/114329#issuecomment-1857003155))	2023-12-14 23:53:30 +00:00
Jeff Daily	bb2bb8cca1	[ROCm] add hipblaslt support (#114329 ) Disabled by default. Enable with env var DISABLE_ADDMM_HIP_LT=0. Tested on both ROCm 5.7 and 6.0. Pull Request resolved: https://github.com/pytorch/pytorch/pull/114329 Approved by: https://github.com/malfet	2023-12-14 21:41:22 +00:00
vfdev-5	a43c757275	Fixed error with cuda_ver in cpp_extension.py (#113555 ) Reported in `71ca42787f (r132390833)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/113555 Approved by: https://github.com/ezyang	2023-11-14 00:12:22 +00:00
ChanBong	5e10dd2c78	fix docstring issues in torch.utils (#113335 ) Fixes #112634 Fixes all the issues listed except in `torch/utils/_pytree.py` as the file no longer exists. ### Error counts \|File \| Count Before \| Count now\| \|---- \| ---- \| ---- \| \|`torch/utils/collect_env.py` \| 39 \| 25\| \|`torch/utils/cpp_extension.py` \| 51 \| 13\| \|`torch/utils/flop_counter.py` \| 25 \| 8\| \|`torch/utils/_foreach_utils.py.py` \| 2 \| 0\| \|`torch/utils/_python_dispatch.py.py` \| 26 \| 25\| \|`torch/utils/backend_registration.py` \| 15 \| 4\| \|`torch/utils/checkpoint.py` \| 29 \| 21\| Pull Request resolved: https://github.com/pytorch/pytorch/pull/113335 Approved by: https://github.com/ezyang	2023-11-13 19:37:25 +00:00
Nikita Shulga	0a7eef9bcf	[BE] Remove stale CUDA version check from cpp_extension.py (#113447 ) As at least CUDA-11.x is needed to build PyTorch on latest trunk. But still skip `--generate-dependencies-with-compile` if running on ROCm Pull Request resolved: https://github.com/pytorch/pytorch/pull/113447 Approved by: https://github.com/Skylion007, https://github.com/atalman, https://github.com/PaliC, https://github.com/huydhn	2023-11-11 00:20:08 +00:00
PyTorch MergeBot	ae2c219de2	Revert "[BE] Remove stale CUDA version check from cpp_extension.py (#113447 )" This reverts commit `7ccca60927`. Reverted https://github.com/pytorch/pytorch/pull/113447 on behalf of https://github.com/malfet due to Broke ROCM ([comment](https://github.com/pytorch/pytorch/pull/113447#issuecomment-1806407892))	2023-11-10 20:46:13 +00:00
Nikita Shulga	7ccca60927	[BE] Remove stale CUDA version check from cpp_extension.py (#113447 ) As at least CUDA-11.x is needed to build PyTorch on latest trunk Pull Request resolved: https://github.com/pytorch/pytorch/pull/113447 Approved by: https://github.com/Skylion007, https://github.com/atalman, https://github.com/PaliC, https://github.com/huydhn	2023-11-10 18:54:19 +00:00
vfdev	71ca42787f	Replaced deprecated pkg_resources.packaging with packaging module (#113023 ) Usage of `from pkg_resources import packaging` leads to a deprecation warning: ``` DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html ``` and in strict tests where warnings are errors, this leads to CI breaks, e.g.: https://github.com/pytorch/vision/pull/8092 Replacing `pkg_resources.package` with `package` as it is now a pytorch dependency: `fa9045a872/requirements.txt (L19)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/113023 Approved by: https://github.com/Skylion007, https://github.com/malfet	2023-11-10 15:06:03 +00:00
PyTorch MergeBot	aef9e43fe6	Revert "Replaced deprecated pkg_resources.packaging with packaging module (#113023 )" This reverts commit `81ea7a489a`. Reverted https://github.com/pytorch/pytorch/pull/113023 on behalf of https://github.com/atalman due to breaks nightlies ([comment](https://github.com/pytorch/pytorch/pull/113023#issuecomment-1802720774))	2023-11-08 21:39:59 +00:00
Alexander Grund	21b6030ac3	Don't set CUDA_HOME when not compiled with CUDA support (#106310 ) It doesn't make sense to set this (on import!) as CUDA cannot be used with PyTorch in this case but leads to messages like > No CUDA runtime is found, using CUDA_HOME='/usr/local/cuda' when CUDA happens to be installed which is at least confusing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106310 Approved by: https://github.com/ezyang	2023-11-06 21:48:49 +00:00
vfdev	81ea7a489a	Replaced deprecated pkg_resources.packaging with packaging module (#113023 ) Usage of `from pkg_resources import packaging` leads to a deprecation warning: ``` DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html ``` and in strict tests where warnings are errors, this leads to CI breaks, e.g.: https://github.com/pytorch/vision/pull/8092 Replacing `pkg_resources.package` with `package` as it is now a pytorch dependency: `fa9045a872/requirements.txt (L19)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/113023 Approved by: https://github.com/Skylion007	2023-11-06 20:26:32 +00:00
Shaun Walbridge	0adb28b77d	Show CUDAExtension example commands as code (#112764 ) The default rendering of these code snippets renders the `TORCH_CUDA_ARCH_LIST` values with typographic quotes which prevent the examples from being directly copyable. Use code style for the two extension examples. Fixes #112763 Pull Request resolved: https://github.com/pytorch/pytorch/pull/112764 Approved by: https://github.com/malfet	2023-11-02 21:47:50 +00:00
Jeff Daily	28c0b07d19	[ROCm] remove HCC references (#111975 ) - rename `__HIP_PLATFORM_HCC__` to `__HIP_PLATFORM_AMD__` - rename `HIP_HCC_FLAGS` to `HIP_CLANG_FLAGS` - rename `PYTORCH_HIP_HCC_LIBRARIES` to `PYTORCH_HIP_LIBRARIES` - workaround in tools/amd_build/build_amd.py until submodules are updated These symbols have had a long deprecation cycle and will finally be removed in ROCm 6.0. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111975 Approved by: https://github.com/ezyang, https://github.com/hongxiayang	2023-10-26 02:39:10 +00:00
Aleksei Nikiforov	ba04d84089	S390x inductor support (#111367 ) Use arch compile flags. They are needed for vectorization support on s390x. Implement new helper functions for inductor. This change fixes multiple tests in test_cpu_repro.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/111367 Approved by: https://github.com/ezyang	2023-10-20 19:38:46 +00:00
Aaron Gokaslan	cb856b08b2	[BE]: Attach cause to some exceptions and enable RUFF TRY200 (#111496 ) Did some easy fixes from enabling TRY200. Most of these seem like oversights instead of intentional. The proper way to silence intentional errors is with `from None` to note that you thought about whether it should contain the cause and decided against it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111496 Approved by: https://github.com/malfet	2023-10-19 21:56:36 +00:00
Kent Gauen	bb89a9e48c	Skipped CUDA Flags if C++ Extension Name includes "arch" Substring (#111211 ) The CUDA architecture flags from TORCH_CUDA_ARCH_LIST will be skipped if the TORCH_EXTENSION_NAME includes the substring "arch". A C++ Extension should be allowed to have any name. I just manually skip the TORCH_EXTENSION_NAME flag when checking if one of the flags is "arch". There is probably a better fix, but I'll leave this to experts. Pull Request resolved: https://github.com/pytorch/pytorch/pull/111211 Approved by: https://github.com/ezyang	2023-10-14 00:10:01 +00:00
Dmytro Dzhulgakov	a0cea517e7	Add 9.0a to cpp_extension supported compute archs (#110587 ) There's an extended compute capability 9.0a for Hopper that was introduced in Cuda 12.0: https://docs.nvidia.com/cuda/archive/12.0.0/cuda-compiler-driver-nvcc/index.html#gpu-feature-list E.g. Cutlass leverages it: `5f13dcad78/python/cutlass/emit/pytorch.py (L684)` This adds it to the list of permitted architectures to use in `cpp_extension` directly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110587 Approved by: https://github.com/ezyang	2023-10-05 17:41:06 +00:00
QuarticCat	20812d69e5	Fix extension rebuilding on Linux (#108613 ) On Linux, CUDA header dependencies are not correctly tracked. After you modify a CUDA header, affected CUDA files won't be rebuilt. This PR will fix this problem. ```console $ ninja -t deps rep_penalty.o: #deps 2, deps mtime 1693956351892493247 (VALID) /home/qc/Workspace/NotMe/exllama/exllama_ext/cpu_func/rep_penalty.cpp /home/qc/Workspace/NotMe/exllama/exllama_ext/cpu_func/rep_penalty.h rms_norm.cuda.o: #deps 0, deps mtime 1693961188871054130 (VALID) rope.cuda.o: #deps 0, deps mtime 1693961188954388632 (VALID) cuda_buffers.cuda.o: #deps 0, deps mtime 1693961188797719768 (VALID) ... ``` Historically, this line of code has been changed twice. It was first implemented in #49344 and there's no `if IS_WINDOWS`, just like now. Then in #56015 someone added `if IS_WINDOWS` for unknown reason. That PR has no description so I don't know what bug he encountered. I don't think there's any bug with these flags on Linux, at least for today. CMake generates exactly the same flags for CUDA. ```ninja ############################################# # Rule for compiling CUDA files. rule CUDA_COMPILER__cpp_cuda_unscanned_Debug depfile = $DEP_FILE deps = gcc command = ${LAUNCHER}${CODE_CHECK}/opt/cuda/bin/nvcc -forward-unknown-to-host-compiler $DEFINES $INCLUDES $FLAGS -MD -MT $out -MF $DEP_FILE -x cu -c $in -o $out description = Building CUDA object $out ``` where `-MD` is short for `--generate-dependencies-with-compile` and `-MF` is short for `--dependency-output`. My words can be verified by `nvcc --help`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108613 Approved by: https://github.com/ezyang	2023-09-06 17:58:21 +00:00
Aaron Gokaslan	660e8060ad	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-22 23:16:38 +00:00
PyTorch MergeBot	d59a6864fb	Revert "[BE]: Update ruff to 0.285 (#107519 )" This reverts commit `88ab3e4322`. Reverted https://github.com/pytorch/pytorch/pull/107519 on behalf of https://github.com/ZainRizvi due to Sorry, but this PR breaks internal tests. @ezyang, can you please hep them get unblocked? It seems like one of the strings was prob accidentally modified ([comment](https://github.com/pytorch/pytorch/pull/107519#issuecomment-1688833480))	2023-08-22 19:53:32 +00:00
Xu Han	3f3479e85e	reduce header file to boost cpp_wrapper build. (#107585 ) 1. Reduce cpp_wrapper un-used header files. 2. Clean pch cache, when use_pch is False. The first change will reduce the build time from 7.35s to 4.94s. Before change: ![image](https://github.com/pytorch/pytorch/assets/8433590/fc5c1d37-ec40-44f3-8d4d-bf26bdc674bb) After change: ![image](https://github.com/pytorch/pytorch/assets/8433590/c7ccadd2-bf3a-4d30-bf56-6e3b0230a194) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107585 Approved by: https://github.com/ezyang, https://github.com/jansel, https://github.com/jgong5	2023-08-22 11:58:47 +00:00
Han, Xu	5ed60477a7	Optimize load inline via pch (#106696 ) Add PreCompiled Header(PCH) to reduce load_inline build time. PCH is gcc built-in mechanism: https://gcc.gnu.org/onlinedocs/gcc-4.0.4/gcc/Precompiled-Headers.html Add PCH for '#include <torch/extension.h>'. This file will used in all load_inline modules. All load_inline modules can take benifit from this PR. Changes: 1. Add PCH signature to guarantee PCH(gch) file take effect. 2. Unification get cxx compiler funtions. 3. Unification get build flags funtions. Before this PR: ![image](https://github.com/pytorch/pytorch/assets/8433590/f190cdcb-236c-4312-b165-d419a7efafe3) Added this PR: ![image](https://github.com/pytorch/pytorch/assets/8433590/b45c5ad3-e902-4fc8-b450-743cf73505a4) Compiling time is reduced from 14.06s to 7.36s. Pull Request resolved: https://github.com/pytorch/pytorch/pull/106696 Approved by: https://github.com/jgong5, https://github.com/jansel	2023-08-21 10:08:30 +00:00
Aaron Gokaslan	88ab3e4322	[BE]: Update ruff to 0.285 (#107519 ) This updates ruff to 0.285 which is faster, better, and have fixes a bunch of false negatives with regards to fstrings. I also enabled RUF017 which looks for accidental quadratic list summation. Luckily, seems like there are no instances of it in our codebase, so enabling it so that it stays like that. :) Pull Request resolved: https://github.com/pytorch/pytorch/pull/107519 Approved by: https://github.com/ezyang	2023-08-20 01:36:18 +00:00
Nikita Shulga	bcc0f4bcab	Move ASAN to clang12 and Ubuntu-22.04 (Jammy) (#106355 ) - Modify `install_conda` to remove libstdc++ from libstdcxx-ng to use one from OS - Modify `install_torchvision` to workaround weird glibc bug, where malloc interposers (such as ASAN) are causing a hang in internationalization library, see https://sourceware.org/bugzilla/show_bug.cgi?id=27653 and https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90589 - Modify `torch.utils.cpp_extension` to recognize Ubuntu's clang as supported compiler Extracted from https://github.com/pytorch/pytorch/pull/105260 Pull Request resolved: https://github.com/pytorch/pytorch/pull/106355 Approved by: https://github.com/huydhn ghstack dependencies: #106354	2023-08-03 05:36:04 +00:00
Justin Chu	4cc1745b13	[BE] f-stringify torch/ and scripts (#105538 ) This PR is a follow up on the pyupgrade series to convert more strings to use f-strings using `flynt`. - https://docs.python.org/3/reference/lexical_analysis.html#f-strings - https://pypi.org/project/flynt/ Command used: ``` flynt torch/ -ll 120 flynt scripts/ -ll 120 flynt tools/ -ll 120 ``` and excluded `collect_env.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105538 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-07-21 19:35:24 +00:00
Justin Chu	abc1cadddb	[BE] Enable ruff's UP rules and autoformat utils/ (#105424 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105424 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-07-18 20:17:25 +00:00
lcskrishna	004ff536e8	[ROCm] Fix circular recursion issue in hipification (#104085 ) This PR fixes the circular issue during hipification process by introducing current_state to track whether a file is processed for hipification. (Iterative DFS) The issue arises when two header files try to include themselves, which leads to a circular recursion or an infinite loop. Fixes the related issues such as : https://github.com/pytorch/pytorch/issues/93827 https://github.com/ROCmSoftwarePlatform/hipify_torch/issues/39 Error log: ``` File "/opt/conda/lib/python3.8/posixpath.py", line 471, in relpath start_list = [x for x in abspath(start).split(sep) if x] File "/opt/conda/lib/python3.8/posixpath.py", line 375, in abspath if not isabs(path): File "/opt/conda/lib/python3.8/posixpath.py", line 63, in isabs sep = _get_sep(s) File "/opt/conda/lib/python3.8/posixpath.py", line 42, in _get_sep if isinstance(path, bytes): RecursionError: maximum recursion depth exceeded while calling a Python object ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/104085 Approved by: https://github.com/jithunnair-amd, https://github.com/malfet	2023-07-01 03:25:51 +00:00
Felix Erkinger	e140c9cc92	Fixes ROCM_HOME detection in case no hipcc is found in path (#95634 ) if ROCM_HOME is not set as environment variable, it tries to find hipcc in the path, but fails with an empty string instead of an exception, returning an empty string instead of harcoded '/opt/rocm' as third case Fixes #95633 Pull Request resolved: https://github.com/pytorch/pytorch/pull/95634 Approved by: https://github.com/jithunnair-amd, https://github.com/ezyang	2023-06-28 19:39:26 +00:00
albanD	b81f1d1bee	Speed up cpp extensions re-compilation (#104280 ) Fixes https://github.com/pytorch/pytorch/issues/68066 to a large extend. This is achieved by not touching files that don't need changing to make sure the ninja caching works as expected. Pull Request resolved: https://github.com/pytorch/pytorch/pull/104280 Approved by: https://github.com/fmassa	2023-06-28 17:06:07 +00:00
Nikita Shulga	347463fddf	[cpp-extensions] Add clang to the list of supported Linux compilers (#103349 ) Not sure, why was it excluded previous (oversight I guess). Also, please note, that `clang++` is already considered acceptable compiler (as it ends with `g++` ;)) <!-- copilot:poem --> ### <samp>🤖 Generated by Copilot at 55aa7db</samp> > _`clang` or `gcc`, we don't care what you use_ > _We'll build our extensions with the tools we choose_ > _Don't try to stop us with your version string_ > _We'll update our logic and make our code sing_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/103349 Approved by: https://github.com/seemethere	2023-06-10 02:53:38 +00:00
Li-Huai (Allan) Lin	3c0072e7c0	[MPS] Prerequisite for MPS C++ extension (#102483 ) in order to add mps kernels to torchvision codebase, we need to expose mps headers and allow objc++ files used in extensions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102483 Approved by: https://github.com/malfet	2023-06-07 17:28:31 +00:00
Matthew Hoffman	29da75cc55	Enable mypy allow redefinition (#102046 ) Related #101528 I tried to enable this in another PR but it uncovered a bunch of type errors: https://github.com/pytorch/pytorch/actions/runs/4999748262/jobs/8956555243?pr=101528#step:10:1305 The goal of this PR is to fix these errors. --- This PR enables [allow_redefinition = True](https://mypy.readthedocs.io/en/stable/config_file.html#confval-allow_redefinition) in `mypy.ini`, which allows for a common pattern: > Allows variables to be redefined with an arbitrary type, as long as the redefinition is in the same block and nesting level as the original definition. `allow_redefinition` allows mypy to be more flexible by allowing reassignment to an existing variable with a different type... for instance (from the linked PR): `4a1e9230ba/torch/nn/parallel/data_parallel.py (L213)` A `Sequence[Union[int, torch.device]]` is narrowed to `Sequence[int]` thru reassignment to the same variable. Pull Request resolved: https://github.com/pytorch/pytorch/pull/102046 Approved by: https://github.com/ezyang	2023-05-24 07:05:30 +00:00
pminimd	59a3759d97	Update cpp_extension.py (#101285 ) When we need to link extra libs, we should notice that 64-bit CUDA may be installed in "lib", not in "lib64". <!-- copilot:summary --> ### <samp>🤖 Generated by Copilot at 05c1ca6</samp> Improve CUDA compatibility in `torch.utils.cpp_extension` by checking for `lib64` or `lib` directory. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101285 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-05-15 22:47:41 +00:00
Richard Barnes	5f92909faf	Use correct standard when compiling NVCC on Windows (#100031 ) Test Plan: Sandcastle Differential Revision: D45129001 Pull Request resolved: https://github.com/pytorch/pytorch/pull/100031 Approved by: https://github.com/ngimel	2023-05-01 16:28:23 +00:00
Aaron Gokaslan	e2a3817dfd	[BE] Enable C419 rule for any all shortcircuiting (#99890 ) Apparently https://github.com/pytorch/pytorch/pull/78142 made torch.JIT allow for simple generator expressions which allows us to enable rules that replace unnecessary list comprehensions with generators in any/all. This was originally part of #99280 but I split it off into this PR so that it can be easily reverted should anything break. Pull Request resolved: https://github.com/pytorch/pytorch/pull/99890 Approved by: https://github.com/justinchuby, https://github.com/kit1980, https://github.com/malfet	2023-04-25 15:02:13 +00:00
PyTorch MergeBot	cfacb5eaaa	Revert "Use correct standard when compiling NVCC on Windows (#99492 )" This reverts commit `db6944562e`. Reverted https://github.com/pytorch/pytorch/pull/99492 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally	2023-04-19 20:51:26 +00:00
Richard Barnes	db6944562e	Use correct standard when compiling NVCC on Windows (#99492 ) Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D45108690 Pull Request resolved: https://github.com/pytorch/pytorch/pull/99492 Approved by: https://github.com/ezyang	2023-04-19 20:36:05 +00:00
Pruthvi Madugundu	08f125bcac	[ROCm] Remove usage of deprecated ROCm component header includes (#97620 ) - clang parameter 'amdgpu-target' changed to 'offload-arch' - HIP and MIOpen includes path updated for extensions Pull Request resolved: https://github.com/pytorch/pytorch/pull/97620 Approved by: https://github.com/ezyang, https://github.com/jithunnair-amd	2023-03-28 19:28:38 +00:00
Stas Bekman	8275e5d2a8	[cpp_extension.py] fix bogus `_check_cuda_version` (#97602 ) Currently if `setuptools<49.4.0` and there is a minor version mismatch `_check_cuda_version` fails with a misleading non-actionable error: ``` 2023-03-24T20:21:35.0625644Z RuntimeError: 2023-03-24T20:21:35.0628441Z The detected CUDA version (11.2) mismatches the version that was used to compile 2023-03-24T20:21:35.0630681Z PyTorch (11.3). Please make sure to use the same CUDA versions. ``` This condition shouldn't be failing since minor version match isn't required. It fails because the other condition to have a certain version of `setuptools` isn't met. But that condition is written in a comment (!!!). So this PR changes it to actually tell the user how to fix the problem. While at it, I adjusted the version number as a lower `setuptools>=49.4.0` is sufficient for this to work. Thanks. p.s. this problem manifests on `nvidia/cuda:11.2.2-cudnn8-devel-ubuntu20.04` docker image. Pull Request resolved: https://github.com/pytorch/pytorch/pull/97602 Approved by: https://github.com/ezyang	2023-03-27 15:15:57 +00:00
mikey dagitses	461f088c96	add -std=c++17 to windows cuda compilations (#97515 ) add -std=c++17 to windows cuda compilations Summary: We're using C++17 in headers that are compiled by C++ extensions. Support for this was not added when we upgraded to C++17. Test Plan: Rely on CI. --- Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/pytorch/pytorch/pull/97515). * #97175 * __->__ #97515 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97515 Approved by: https://github.com/ezyang	2023-03-26 15:23:52 +00:00
Kazuaki Ishizaki	622a11d512	Fix typos under torch/utils directory (#97516 ) This PR fixes typos in comments and messages of `.py` files under `torch/utils` directory Pull Request resolved: https://github.com/pytorch/pytorch/pull/97516 Approved by: https://github.com/ezyang	2023-03-24 16:53:39 +00:00
mikey dagitses	bcff4773da	add /std:c++17 to windows compilations when not using Ninja (#97445 ) add /std:c++17 to windows compilations when not using Ninja Summary: This was overlooked when we upgraded to C++17. Test Plan: Rely on CI. Reviewers: ezyang Subscribers: Tasks: Tags: --- Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/pytorch/pytorch/pull/97445). * #96603 * #97473 * #97175 * #97515 * __->__ #97445 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97445 Approved by: https://github.com/ezyang	2023-03-24 14:52:29 +00:00
mikey dagitses	bdaf402565	build C++ extensions on windows with /std:c++17 (#97413 ) build C++ extensions on windows with /std:c++17 Summary: We added -std=c++17 to Posix builds, but neglected to add this for Windows. This just brings back parity. Test Plan: Rely on CI. Reviewers: ezyang Subscribers: Tasks: Tags: --- Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/pytorch/pytorch/pull/97413). * #97175 * __->__ #97413 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97413 Approved by: https://github.com/ezyang	2023-03-23 13:31:29 +00:00
Xiao Wang	44d7bbfe22	[cpp extension] Allow setting PYTORCH_NVCC to a customized nvcc in torch cpp extension build (#96987 ) per title I can write a script named `nvcc` like this ```bash #!/bin/bash /opt/cache/bin/sccache /usr/local/cuda/bin/nvcc $@ ``` and set its path to `PYTORCH_NVCC` (added in this PR), along with another `sccache-g++` script to env var `CXX`. `cfa6b52e02/torch/utils/cpp_extension.py (L2106-L2109)` With ninja, I can fully enable c-cached build on my cuda extensions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96987 Approved by: https://github.com/ezyang	2023-03-17 17:05:17 +00:00
Aaron Gokaslan	dd5e6e8553	[BE]: Merge startswith calls - rule PIE810 (#96754 ) Merges startswith, endswith calls to into a single call that feeds in a tuple. Not only are these calls more readable, but it will be more efficient as it iterates through each string only once. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96754 Approved by: https://github.com/ezyang	2023-03-14 22:05:20 +00:00
cyy	a32be76a53	Disable more warnings on Windows CI test (#95933 ) These warnings are disabled to avoid long log on Windows tests. They are also disabled on CMake buildings currently. '/wd4624': MSVC complains "destructor was implicitly defined as delete" on c10::optional and other templates '/wd4076': "unexpected tokens following preprocessor directive - expected a newline" on some header '/wd4068': "The compiler ignored an unrecognized [pragma]" Pull Request resolved: https://github.com/pytorch/pytorch/pull/95933 Approved by: https://github.com/ezyang	2023-03-03 07:11:13 +00:00
Eddie Yan	db8e91ef73	[CUDA] Split out compute capability 8.7 and 7.2 from others (#95803 ) Follow up of #95008 to avoid building Jetson compute capabilities unnecessarily, also adds missing 7.2. CC @ptrblck @malfet Pull Request resolved: https://github.com/pytorch/pytorch/pull/95803 Approved by: https://github.com/ezyang	2023-03-02 14:13:15 +00:00
Eddie Yan	13ebffe088	[CUDA] `sm_87` / Jetson Orin support (#95008 ) Surfaced from #94438 CC @ptrblck @ngimel Pull Request resolved: https://github.com/pytorch/pytorch/pull/95008 Approved by: https://github.com/ezyang	2023-02-17 02:22:23 +00:00
PyTorch MergeBot	36dfbb08f3	Revert "Update Cutlass to v2.11 (#94188 )" This reverts commit `a0f9abdcb6`. Reverted https://github.com/pytorch/pytorch/pull/94188 on behalf of https://github.com/ezyang due to bouncing this to derisk branch cut	2023-02-13 19:03:36 +00:00
Aaron Gokaslan	a0f9abdcb6	Update Cutlass to v2.11 (#94188 ) Now that we are on CUDA 11+ exclusively, we can update Nvidia's Cutlass to the next version. We also had to remove the cuda build flag : "-D__CUDA_NO_HALF_CONVERSIONS__" since Cutlass no longer builds without it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94188 Approved by: https://github.com/ezyang, https://github.com/jansel	2023-02-12 20:45:03 +00:00
Aaron Gokaslan	67d9790985	[BE] Apply almost all remaining flake8-comprehension checks (#94676 ) Applies the remaining flake8-comprehension fixes and checks. This changes replace all remaining unnecessary generator expressions with list/dict/set comprehensions which are more succinct, performant, and better supported by our torch.jit compiler. It also removes useless generators such as 'set(a for a in b)`, resolving it into just the set call. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676 Approved by: https://github.com/ezyang	2023-02-12 01:01:25 +00:00
Xuehai Pan	5b1cedacde	[BE] [2/3] Rewrite `super()` calls in functorch and torch (#94588 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94588 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-10 21:16:33 +00:00
Aaron Gokaslan	3ce1ebb6fb	Apply some safe comprehension optimizations (#94323 ) Optimize unnecessary collection cast calls, unnecessary calls to list, tuple, and dict, and simplify calls to the sorted builtin. This should strictly improve speed and improve readability. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94323 Approved by: https://github.com/albanD	2023-02-07 23:53:46 +00:00
Aaron Gokaslan	8fce9a09cd	[BE]: pyupgrade Python to 3.8 - imports and object inheritance only (#94308 ) Apply parts of pyupgrade to torch (starting with the safest changes). This PR only does two things: removes the need to inherit from object and removes unused future imports. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94308 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-07 21:10:56 +00:00
bxia	70b3ea59ae	[ROCM] Modify transcoding: absolute path ->relative path (#91845 ) Fixes https://github.com/pytorch/pytorch/issues/91797 This PR compiles the transcoded file with a relative path to ensure that the written transcoded file is written to SOURCE.txt as a relative path. Ensure successful packaging. Pull Request resolved: https://github.com/pytorch/pytorch/pull/91845 Approved by: https://github.com/jithunnair-amd, https://github.com/ezyang	2023-01-13 23:00:57 +00:00
cyy	9710ac6531	Some CMake and CUDA cleanup given recent update to C++17 (#90599 ) The main changes are: 1. Remove outdated checks for old compiler versions because they can't support C++17. 2. Remove outdated CMake checks because it now requires 3.18. 3. Remove outdated CUDA checks because we are moving to CUDA 11. Almost all changes are in CMake files for easy audition. Pull Request resolved: https://github.com/pytorch/pytorch/pull/90599 Approved by: https://github.com/soumith	2022-12-30 11:19:26 +00:00
joncrall	ad782ff7df	Enable xdoctest runner in CI for real this time (#83816 ) Builds on #83317 and enables running the doctests. Just need to figure out what is causing the failures. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83816 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-12-29 05:32:42 +00:00
Nikita Shulga	36ac095ff8	Migrate PyTorch to C++17 (#85969 ) With CUDA-10.2 gone we can finally do it! This PR mostly contains build system related changes, invasive functional ones are to be followed. Among many expected tweaks to the build system, here are few unexpected ones: - Force onnx_proto project to be updated to C++17 to avoid `duplicate symbols` error when compiled by gcc-7.5.0, as storage rule for `constexpr` changed in C++17, but gcc does not seem to follow it - Do not use `std::apply` on CUDA but rely on the built-in variant, as it results in test failures when CUDA runtime picks host rather than device function when `std::apply` is invoked from CUDA code. - `std::decay_t` -> `::std::decay_t` and `std::move`->`::std::move` as VC++ for some reason claims that `std` symbol is ambigious - Disable use of `std::aligned_alloc` on Android, as its `libc++` does not implement it. Some prerequisites: - https://github.com/pytorch/pytorch/pull/89297 - https://github.com/pytorch/pytorch/pull/89605 - https://github.com/pytorch/pytorch/pull/90228 - https://github.com/pytorch/pytorch/pull/90389 - https://github.com/pytorch/pytorch/pull/90379 - https://github.com/pytorch/pytorch/pull/89570 - https://github.com/facebookincubator/gloo/pull/336 - https://github.com/facebookincubator/gloo/pull/343 - `919676fb32` Fixes https://github.com/pytorch/pytorch/issues/56055 Pull Request resolved: https://github.com/pytorch/pytorch/pull/85969 Approved by: https://github.com/ezyang, https://github.com/kulinseth	2022-12-08 02:27:48 +00:00
Alexander Grund	5b51ca6808	Update CUDA compiler matrix (#86360 ) Switch GCC/Clang max versions to be exclusive as the `include/crt/host_config.h` checks the major version only for the upper bound. This allows to be less restrictive and match the checks in the aforementioned header. Also update the versions using that header in the CUDA SDKs. Follow up to #82860 I noticed this as PyTorch 1.12.1 with CUDA 11.3.1 and GCC 10.3 was failing in the `test_cpp_extensions*` tests. Example for CUDA 11.3.1 from the SDK header: ``` #if __GNUC__ > 11 // Error out ... #if (__clang_major__ >= 12) \|\| (__clang_major__ < 3) \|\| ((__clang_major__ == 3) && (__clang_minor__ < 3)) // Error out ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/86360 Approved by: https://github.com/ezyang	2022-11-23 03:07:22 +00:00
Nikita Shulga	575e02df53	Fix CUDNN_PATH handling on Windows (#88898 ) Fixes https://github.com/pytorch/pytorch/issues/88873 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88898 Approved by: https://github.com/kit1980	2022-11-11 21:19:26 +00:00
Eddie Yan	a7420d2ccb	Hopper (`sm90`) support (#87736 ) Essentially a followup of #87436 CC @xwang233 @ptrblck Pull Request resolved: https://github.com/pytorch/pytorch/pull/87736 Approved by: https://github.com/xwang233, https://github.com/malfet	2022-11-09 01:49:50 +00:00
Greg Hogan	71fe069d98	ada lovelace (arch 8.9) support (#87436 ) changes required to be able to compile https://github.com/pytorch/vision and https://github.com/nvidia/apex for `sm_89` architecture Pull Request resolved: https://github.com/pytorch/pytorch/pull/87436 Approved by: https://github.com/ngimel	2022-10-24 21:25:36 +00:00
Nikita Shulga	c28cdb53ea	[BE] Delete BUILD_SPLIT_CUDA option (#87502 ) As we are linking with cuDNN and cuBLAS dynamically for all configs anyway, as statically linked cuDNN is different library than dynamically linked one, increases default memory footprint, etc, and libtorch_cuda even if compiled for all GPU architectures is no longer approaching 2Gb binary size limit, so BUILD_SPLIT_CUDA can go away. Pull Request resolved: https://github.com/pytorch/pytorch/pull/87502 Approved by: https://github.com/atalman	2022-10-22 06:00:59 +00:00
Alexander Grund	fe87ae692f	Fix `check_compiler_ok_for_platform` on non-English locales (#85891 ) The function checks the output of e.g. `c++ -v` for "gcc version". But on another locale than English it might be "gcc-Version" which makes the check fail. This causes the function to wrongly return false on systems where `c++` is a hardlink to `g++` and the current locale returns another output format. Fix this by setting `LC_ALL=C`. I found this as `test_utils.py` was failing in `test_cpp_compiler_is_ok` Pull Request resolved: https://github.com/pytorch/pytorch/pull/85891 Approved by: https://github.com/ezyang	2022-09-29 18:36:36 +00:00
anjali411	0183c1e336	Add __all__ to torch.utils submodules (#85331 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/85331 Approved by: https://github.com/albanD	2022-09-27 14:45:26 +00:00
chengscott	1bf2371365	Rename path on Windows from lib/x64 to lib\x64 (#83417 ) Use `os.path.join` to join path Pull Request resolved: https://github.com/pytorch/pytorch/pull/83417 Approved by: https://github.com/ezyang	2022-08-15 14:47:19 +00:00
joncrall	4618371da5	Integrate xdoctest - Rebased (#82797 ) This is a new version of #15648 based on the latest master branch. Unlike the previous PR where I fixed a lot of the doctests in addition to integrating xdoctest, I'm going to reduce the scope here. I'm simply going to integrate xdoctest, and then I'm going to mark all of the failing tests as "SKIP". This will let xdoctest run on the dashboards, provide some value, and still let the dashboards pass. I'll leave fixing the doctests themselves to another PR. In my initial commit, I do the bare minimum to get something running with failing dashboards. The few tests that I marked as skip are causing segfaults. Running xdoctest results in 293 failed, 201 passed tests. The next commits will be to disable those tests. (unfortunately I don't have a tool that will insert the `#xdoctest: +SKIP` directive over every failing test, so I'm going to do this mostly manually.) Fixes https://github.com/pytorch/pytorch/issues/71105 @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/82797 Approved by: https://github.com/ezyang	2022-08-12 02:08:01 +00:00
Nikita Shulga	737fa85dd2	Update CUDA compiler matrix (#82860 ) Update CUDA compiler versions to match ones defined in https://docs.nvidia.com/cuda/archive/11.4.1/cuda-installation-guide-linux/index.html#system-requirements https://docs.nvidia.com/cuda/archive/11.5.0/cuda-installation-guide-linux/index.html#system-requirements https://docs.nvidia.com/cuda/archive/11.6.0/cuda-installation-guide-linux/index.html#system-requirements https://docs.nvidia.com/cuda/archive/11.7.0/cuda-installation-guide-linux/index.html#system-requirements Special case 11.4.0, where maximum GCC supported version are similar to 11.3 rather that to 11.4.1+ Fixes https://github.com/pytorch/pytorch/issues/81039 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82860 Approved by: https://github.com/huydhn	2022-08-06 00:46:30 +00:00
Xuehai Pan	e849ed3d19	Redirect print messages to `stderr` in `torch.utils.cpp_extension` (#82097 ) ### Description <!-- What did you change and why was it needed? --> Listed in the commit message: > The user may want to use `python3 -c "..."` to get the torch library > path and the include path. Printing messages to stdout will mess up > the output. I'm using the command: ```bash LIBTORCH_PATH="$( python3 -c 'print(":".join(__import__("torch.utils.cpp_extension", fromlist=[None]).library_paths()))' )" export LD_LIBRARY_PATH="${LIBTORCH_PATH}${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}" ``` To let the command line tools find the torch shared libraries. I think this would be a common use case for users who writing C/C++ extensions. I got: ```console $ LIBTORCH_PATH="$(python3 -c 'print(":".join(__import__("torch.utils.cpp_extension", fromlist=[None]).library_paths()))')" $ export LD_LIBRARY_PATH="${LIBTORCH_PATH}${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}" $ echo "LD_LIBRARY_PATH=${LD_LIBRARY_PATH}" LD_LIBRARY_PATH=No CUDA runtime is found, using CUDA_HOME='/usr/local/cuda-11.6' /opt/hostedtoolcache/Python/3.7.13/x64/lib/python3.7/site-packages/torch/lib:/usr/local/cuda-11.6/lib64: $ ls -alh "${LIBTORCH_PATH}" ls: cannot access 'No CUDA runtime is found, using CUDA_HOME='\''/usr/local/cuda-11.6'\'''$'\n''/opt/hostedtoolcache/Python/3.7.13/x64/lib/python3.7/site-packages/torch/lib': No such file or directory ``` This PR prints messages in `torch.utils.cpp_extension` to `stderr`, which allows users to get correct result using `VAR="$(python3 -c '...')"` ### Issue <!-- Link to Issue ticket or RFP --> N/A ### Testing <!-- How did you test your change? --> N/A Pull Request resolved: https://github.com/pytorch/pytorch/pull/82097 Approved by: https://github.com/ezyang	2022-07-25 21:55:15 +00:00
Nikita Shulga	95c148e502	[BE] Turn `_check_cuda_version` into a function (#81603 ) It was class method, but does not use any of the class properties/called other class methods Pull Request resolved: https://github.com/pytorch/pytorch/pull/81603 Approved by: https://github.com/ezyang	2022-07-17 05:49:39 +00:00
Nikita Shulga	7e274964d3	[BE] Disamntle pyramid of doom in _check_cuda_version (#81602 ) Replace `if stmt: doSmth; else: raise_or_return` with `if not stmt: raise_or_return; doSmth` Pull Request resolved: https://github.com/pytorch/pytorch/pull/81602 Approved by: https://github.com/ezyang	2022-07-17 05:49:39 +00:00
Jithun Nair	71ee384924	[ROCm] Use torch._C._cuda_getArchFlags to get list of gfx archs pytorch was built for (#80498 ) even if no GPUs are available When building PyTorch extensions for ROCm Pytorch, if the user doesn't specify a list of archs using PYTORCH_ROCM_ARCH env var, we would like to use the list of gfx archs that PyTorch was built for as the default value. To do this successfully even in an environment where no GPUs are available eg. a build-only CPU node, we need to be able to get the list of archs. `torch.cuda.get_arch_list()` doesn't work here because it calls `torch.cuda.available()` first: `0922cc024e/torch/cuda/__init__.py (L463)`, which will return `False` if no GPUs are available, resulting in an empty list being returned by `torch.cuda.get_arch_list()`. To get around this issue, we call the underlying API `torch._C._cuda_getArchFlags()` Pull Request resolved: https://github.com/pytorch/pytorch/pull/80498 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-07-07 16:06:12 +00:00
PyTorch MergeBot	ec4be38ba9	Revert "To add hipify_torch as a submodule in pytorch/third_party (#74704 )" This reverts commit `93b0fec39d`. Reverted https://github.com/pytorch/pytorch/pull/74704 on behalf of https://github.com/malfet due to broke torchvision	2022-06-21 23:54:00 +00:00
Bhavya Medishetty	93b0fec39d	To add hipify_torch as a submodule in pytorch/third_party (#74704 ) `hipify_torch` as a submodule in `pytorch/third_party` Pull Request resolved: https://github.com/pytorch/pytorch/pull/74704 Approved by: https://github.com/jeffdaily, https://github.com/malfet	2022-06-21 18:56:49 +00:00
Xiao Wang	ef0332e36d	Allow relocatable device code linking in pytorch CUDA extensions (#78225 ) Close https://github.com/pytorch/pytorch/issues/57543 Doc: check `Relocatable device code linking:` in https://docs-preview.pytorch.org/78225/cpp_extension.html#torch.utils.cpp_extension.CUDAExtension Pull Request resolved: https://github.com/pytorch/pytorch/pull/78225 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-06-02 21:35:56 +00:00
Michael Suo	fb0f285638	[lint] upgrade mypy to latest version Fixes https://github.com/pytorch/pytorch/issues/75927. Had to fix some bugs and add some ignores. To check if clean: ``` lintrunner --paths-cmd='git grep -Il .' --take MYPY,MYPYSTRICT ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76753 Approved by: https://github.com/malfet	2022-05-03 20:51:34 +00:00
PyTorch MergeBot	3d7428d9ac	Revert "[lint] upgrade mypy to latest version" This reverts commit `9bf18aab94`. Reverted https://github.com/pytorch/pytorch/pull/76753 on behalf of https://github.com/suo	2022-05-03 20:01:18 +00:00
Michael Suo	9bf18aab94	[lint] upgrade mypy to latest version Fixes https://github.com/pytorch/pytorch/issues/75927. Had to fix some bugs and add some ignores. To check if clean: ``` lintrunner --paths-cmd='git grep -Il .' --take MYPY,MYPYSTRICT ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76753 Approved by: https://github.com/malfet	2022-05-03 19:43:28 +00:00
rraminen	7422ccea8b	Hipify fixes for a successful DeepSpeed build These commits are required to build DeepSpeed on ROCm without the hipify errors. `a41829d9ed` `663c718462` cc: @jeffdaily Pull Request resolved: https://github.com/pytorch/pytorch/pull/76141 Approved by: https://github.com/jeffdaily, https://github.com/pruthvistony, https://github.com/albanD	2022-04-28 13:19:59 +00:00
Min Si	9562aedb58	ROCm: add HIP_HOME/include,lib in cpp_extensions (#75548 ) Summary: hip/hip_runtime.h and libamdhip64.so may be required to compile extension such as torch_ucc. They are in $ROCM_HOME/hip by default, and may not be symlinked to $ROCM_HOME/include and $ROCM_HOME/lib. This commit defines $ROCM_HOME/hip as $HIP_HOME, and adds its include and lib paths when building hipified extension. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75548 Test Plan: ## Verify OSS pytorch + TorchUCC on an AMD GPU machine (MI100) - step 1. Install OSS pytorch ``` export ROCM_PATH=/opt/rocm-4.5.2 git clone https://github.com/pytorch/pytorch.git cd pytorch python3 tools/amd_build/build_amd.py USE_NCCL=0 USE_RCCL=0 USE_KINETO=0 with-proxy python3 setup.py develop USE_NCCL=0 USE_RCCL=0 USE_KINETO=0 with-proxy python3 setup.py install ``` - step2. Install torchUCC extension ``` # /opt/rocm-4.5.2/include/hip does not exist, need include /opt/rocm-4.5.2/hip/include at compile time export ROCM_PATH=/opt/rocm-4.5.2 export RCCL_INSTALL_DIR=/opt/rccl-rocm-rel-4.4-rdc git clone https://github.com/facebookresearch/torch_ucc.git cd torch_ucc UCX_HOME=$RCCL_INSTALL_DIR UCC_HOME=$RCCL_INSTALL_DIR WITH_CUDA=$ROCM_PATH python setup.py ``` Build log before fix (error "hip/hip_runtime.h: No such file or directory"): P493038915 Build log after fix: P493037572 Reviewed By: ezyang Differential Revision: D35506098 Pulled By: minsii fbshipit-source-id: 76cbb6d4eaa6549a00898c9d9ebaca47a55330e9 (cherry picked from commit d684c080edf1fbd293e3321151976812c1da8533)	2022-04-19 20:51:37 +00:00
provefar	7a243ddd19	Add import to `importlib.abc` Fixes #70525 ``` --------------------------------------------------------------------------- AttributeError Traceback (most recent call last) <ipython-input-3-334d309cf512> in <module> ----> 1 lltm_cpp = load(name="lltm_cpp", sources=["lltm.cpp"]) /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in load(name, sources, extra_cflags, extra_cuda_cflags, extra_ldflags, extra_include_paths, build_directory, verbose, with_cuda, is_python_module, is_standalone, keep_intermediates) 1122 verbose=True) 1123 ''' -> 1124 return _jit_compile( 1125 name, 1126 [sources] if isinstance(sources, str) else sources, /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in _jit_compile(name, sources, extra_cflags, extra_cuda_cflags, extra_ldflags, extra_include_paths, build_directory, verbose, with_cuda, is_python_module, is_standalone, keep_intermediates) 1360 return _get_exec_path(name, build_directory) 1361 -> 1362 return _import_module_from_library(name, build_directory, is_python_module) 1363 1364 /usr/lib/python3.10/site-packages/torch/utils/cpp_extension.py in _import_module_from_library(module_name, path, is_python_module) 1751 spec = importlib.util.spec_from_file_location(module_name, filepath) 1752 module = importlib.util.module_from_spec(spec) -> 1753 assert isinstance(spec.loader, importlib.abc.Loader) 1754 spec.loader.exec_module(module) 1755 return module AttributeError: module 'importlib' has no attribute 'abc' ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/75736 Approved by: https://github.com/ezyang	2022-04-14 03:32:30 +00:00
Edgar Andrés Margffoy Tuay	86deecd7be	Check clang++/g++ version when compiling CUDA extensions (#63230 ) Summary: See https://github.com/pytorch/pytorch/issues/55267 Pull Request resolved: https://github.com/pytorch/pytorch/pull/63230 Reviewed By: soulitzer Differential Revision: D34159119 Pulled By: malfet fbshipit-source-id: 6eef7582388bf6a42dcc1d82b6e4b1f40f418dd7 (cherry picked from commit 2056d0a0be7951602de22f8d3b4efc28dd71b6c2)	2022-02-24 08:32:32 +00:00
Andrey Talman	46f9e16afe	Documenting cuda 11.5 windows issue (#73013 ) Summary: Adding documentation about compiling extension with CUDA 11.5 and Windows Example of failure: https://github.com/pytorch/pytorch/runs/4408796098?check_suite_focus=true Note: Don't use torch/extension.h In CUDA 11.5 under windows in your C++ code: Use aten instead of torch interface in all cuda 11.5 code under windows. It has been failing with errors, due to a bug in nvcc. Example use: >>> #include <ATen/ATen.h> >>> at::Tensor SigmoidAlphaBlendForwardCuda(....) Instead of: >>> #include <torch/extension.h> >>> torch::Tensor SigmoidAlphaBlendForwardCuda(...) Currently open issue for nvcc bug: https://github.com/pytorch/pytorch/issues/69460 Complete Workaround code example: `cb170ac024` Pull Request resolved: https://github.com/pytorch/pytorch/pull/73013 Reviewed By: malfet, seemethere Differential Revision: D34306134 Pulled By: atalman fbshipit-source-id: 3c5b9d7a89c91bd1920dc63dbd356e45dc48a8bd (cherry picked from commit `87098e7f17`)	2022-02-19 02:34:59 +00:00
Jithun Nair	8dfdc3df82	[ROCm] Refactor how to specify AMD gpu targets using PYTORCH_ROCM_ARCH (#61706 ) Summary: Remove all hardcoded AMD gfx targets PyTorch build and Magma build will use rocm_agent_enumerator as backup if PYTORCH_ROCM_ARCH env var is not defined PyTorch extensions will use same gfx targets as the PyTorch build, unless PYTORCH_ROCM_ARCH env var is defined torch.cuda.get_arch_list() now works for ROCm builds PyTorch CI dockers will continue to be built for gfx900 and gfx906 for now. PYTORCH_ROCM_ARCH env var can be a space or semicolon separated list of gfx archs eg. "gfx900 gfx906" or "gfx900;gfx906" cc jeffdaily sunway513 jithunnair-amd ROCmSupport KyleCZH Pull Request resolved: https://github.com/pytorch/pytorch/pull/61706 Reviewed By: seemethere Differential Revision: D32735862 Pulled By: malfet fbshipit-source-id: 3170e445e738e3ce373203e1e4ae99c84e645d7d	2021-12-13 15:41:40 -08:00
Nikita Shulga	bede18b061	Add support for C++ frontend wrapper on Linux (#69094 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69094 Partially addresses https://github.com/pytorch/pytorch/issues/68768 Test Plan: Imported from OSS Reviewed By: seemethere Differential Revision: D32730079 Pulled By: malfet fbshipit-source-id: 854e4215ff66e087bdf354fed7a17e87f2649c87	2021-12-02 16:47:00 -08:00
Nikita Shulga	c08e95dd9c	Introduce `IS_LINUX` and `IS_MACOS` global vars (#69093 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69093 Test Plan: Imported from OSS Reviewed By: samdow Differential Revision: D32730080 Pulled By: malfet fbshipit-source-id: aa3f218d09814b4edd96b01c7b57b85fd58c47fc	2021-12-01 09:47:38 -08:00
Nikita Shulga	f6f1b580f8	Fix mypy in cpp_extension.py (#69101 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69101 Test Plan: Imported from OSS Reviewed By: atalman, janeyx99 Differential Revision: D32730081 Pulled By: malfet fbshipit-source-id: 76ace65b51850b74b175a3c4688c05e107873e8d	2021-11-30 16:01:55 -08:00
Jane Xu	78f970568c	Add dummy op to use instead of searchsorted (#66964 ) Summary: Would help unblock https://github.com/pytorch/pytorch/issues/66818 if this actually works Pull Request resolved: https://github.com/pytorch/pytorch/pull/66964 Reviewed By: mruberry Differential Revision: D31817942 Pulled By: janeyx99 fbshipit-source-id: 9e9a2bcb0c0479ec7000ab8760a2e64bf0e85e95	2021-10-21 12:56:22 -07:00

1 2 3 4 5 ...

320 Commits