pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
FFFrog	8562457cba	Make torch/csrc/utils.h to be device-agnostic (#152521 ) `torch/csrc/utils.h` should be device-independent. Currently, it contains CUDA-related implementations, which indirectly causes the [failure of ROCm testing](https://github.com/pytorch/pytorch/pull/151914#issuecomment-2839691038) (The reason is that the ROCm test environment shouldn`t expose HIP-related header files, which causes the JIT compilation to fail during testing) Therefore, move CUDA-related implementations to `torch/csrc/cuda/utils.h`. Question: This change may introduce BC-breack. I searched for this function globally on github and I think the impact is very small. Pull Request resolved: https://github.com/pytorch/pytorch/pull/152521 Approved by: https://github.com/Skylion007, https://github.com/albanD ghstack dependencies: #152512, #152513	2025-05-04 07:15:11 +00:00
Kurt Mohler	32cf6c6fb0	Remove `THPTensor` defs, override macros, and `GenerateByteType.h` (#82503 ) ### Description These are old definitions and files that aren't used anymore. ### Issue Fixes #82502 ### Testing N/A Pull Request resolved: https://github.com/pytorch/pytorch/pull/82503 Approved by: https://github.com/ezyang	2022-07-30 19:40:16 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
Kurt Mohler	aea6e2c396	Merge torch.cuda._UntypedStorage into torch._UntypedStorage (#75459 ) Fixes #74933 Pull Request resolved: https://github.com/pytorch/pytorch/pull/75459 Approved by: https://github.com/ezyang	2022-05-19 13:54:39 +00:00
Peter Bell	e279963eef	Remove remaining THC code (#69039 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/69039 Test Plan: Imported from OSS Reviewed By: anjali411 Differential Revision: D32872476 Pulled By: ngimel fbshipit-source-id: 7972aacc24aef9450fb59b707ed6396c501bcb31	2021-12-08 12:18:08 -08:00
Thomas Viehmann	ac1fe91dc9	Clean up some THC includes (#69024 ) Summary: These seem to not be needed and cause ninja to rebuild the files at every build. (There also is THCStorage.cu, but hopefully this will go away with https://github.com/pytorch/pytorch/issues/68556 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/69024 Reviewed By: soulitzer Differential Revision: D32705309 Pulled By: ngimel fbshipit-source-id: 5255297f213fdcf36e7203de7460a71291f8c9a0	2021-11-29 20:55:27 -08:00
Edward Yang	a5d356cb39	Delete THP_CORE macro; partially replace with THP_BUILD_MAIN_LIB (#29143 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/29143 THP_CORE macro is a very old macro that appeared to have served two purposes: 1. The torch-python equivalent of CAFFE2_BUILD_MAIN_LIB, to toggle symbol visibility headers 2. Some sort of ad hoc way of hiding certain definitions from headers so external clients can't get at them. It did (2) in a very confusing manner, because we set THP_CORE in both torch and torch-python (it shouldn't do anything in torch). In this PR I just get rid of use case (2) entirely (so everything shows up in headers all the time), and then redo (1) using a new THP_BUILD_MAIN_LIB macro. This cleans up some of the macro definitions and makes my life easier for working on #27215. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D18309594 Pulled By: ezyang fbshipit-source-id: adcb6d7cb387cd818480137e2b94e5e761dbfefc	2019-11-06 15:02:02 -08:00
Shen Li	24f4d3987e	Move all Stream and Event Python implementation to C++ (#15937 ) Summary: 1. Added `torch/csrc/cuda/Event.h` and `torch/csrc/cuda/Event.cpp` to bind Python Event class to C++ implementation. 2. Move all CUDA runtime invocations from `torch/cuda/streams.py` to C++ 3. Added tests to cover Stream and Event APIs. ~(event IPC handle tests is introduced in #15974)~ Pull Request resolved: https://github.com/pytorch/pytorch/pull/15937 Differential Revision: D13649001 Pulled By: mrshenli fbshipit-source-id: 84ca58f35f6ba679a4ba33150ceba678d760d240	2019-01-17 07:29:22 -08:00
Edward Yang	517c7c9861	Canonicalize all includes in PyTorch. (#14849 ) Summary: Anywhere we used #include "foo.h", we now say #include <foo.h> Paths are adjusted to be rooted out of aten/src, torch/lib, or the root level directory. I modified CMakeLists.txt by hand to remove TH and THC from the include paths. I used the following script to do the canonicalization: ``` import subprocess import re import os.path files = subprocess.check_output(['git', 'ls-files']).decode('utf-8').rstrip().split('\n') for fn in files: if not any(fn.endswith(suff) for suff in ['.cu', '.cpp', '.in', '.h', '.hpp', '.cu', '.cuh', '.cc']): continue if not any(fn.startswith(pref) for pref in ["aten/", "torch/"]): continue with open(fn, 'r') as f: c = f.read() def fmt(p): return "#include <{}>".format(p) def repl(m): p = m.group(1) if p in ["dlfcn.h", "unistd.h", "nvrtc.h", "cuda.h", "cuda_runtime.h", "cstdint", "cudnn.h", "Python.h", "cusparse.h", "cuda_runtime_api.h", "cuda_fp16.h", "cublas_v2.h", "stdint.h", "curand_kernel.h"]: return fmt(p) if any(p.startswith(pref) for pref in ["torch/csrc", "c10/", "ATen/", "caffe2/", "TH/", "THC/", "Eigen/", "gtest/", "zdl/", "gloo/", "onnx/", "miopen/"]): return fmt(p) for root in ["aten/src", "torch/lib", ""]: for bad_root in [os.path.dirname(fn), "aten/src/TH", "aten/src/THC", "torch/csrc"]: new_p = os.path.relpath(os.path.join(bad_root, p), root) if not new_p.startswith("../") and (os.path.exists(os.path.join(root, new_p)) or os.path.exists(os.path.join(root, new_p + ".in"))): return fmt(new_p) print("ERROR: ", fn, p) return m.group(0) new_c = re.sub(r'#include "([^"]+)"', repl, c) if new_c != c: print(fn) with open(fn, 'w') as f: f.write(new_c) ``` Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/14849 Reviewed By: dzhulgakov Differential Revision: D13363445 Pulled By: ezyang fbshipit-source-id: 52361f878a672785f9306c9e9ab2513128092b68	2018-12-08 19:38:30 -08:00
Roy Li	f00f99ebcc	use at::Half in THC (#11322 ) Summary: - use Half instead of half in THC - clean up TH_float2half, TH_half2float, etc. conversions Pull Request resolved: https://github.com/pytorch/pytorch/pull/11322 Differential Revision: D9799553 Pulled By: li-roy fbshipit-source-id: 9aa3e003bff73d9df6224a393f3ec0624b1f44ed	2018-09-12 17:39:37 -07:00
Edward Z. Yang	3598356420	Port THCS to ATen. (#8689 ) * Port THCS to ATen. General structure of the sparse implementation: - SparseCUDATensor.{cpp, cu} and SparseCUDATensorMath.cu contain the same functions as their CPU analogues - SparseCUDAApplyUtils.cuh contains what used to be in THCSTensor.cu - SparseCUDABlas.cu contains what used to be THCSparse.cu Unrelated improvements: - Forward declared CUDA types in Context.h are now moved exclusively to CUDAHooks - New getCurrentCUDASparseHandle in Context - Support for printing CUSPARSE_STATUS_ZERO_PIVOT error message directly Some unusual pieces: - get_device got the LegacyBridge makeover, as it needs special logic on sparse tensors (defer to the inner tensors). - I noticed that I need to turn off device_guard codegen for many functions in sparse, noticed because get_device became a native function, and resulted in an infinite recursion. This was done by adding device_guard: False to the native definitions. An alternative strategy might be to make the heuristic for deciding when to put in a device guard more clever. Scaffolding removal: - LegacyBridge now special-cases only on sparse versus dense; no more CUDA test (hooray!) - Native bindings get CUDA/SparseCUDA dispatch entries. CPU sparse refactoring: - New SparseUtils.h header, with all of the utility functions that used to live in SparseTensor.cpp - new_with_tensor_sparse now correctly handles both CPU and CUDA - transpose functions in sparse/ turned out to be dead, so I killed them Bugs I noticed while working on this: - I used accessor<...>() on a CUDA tensor, because I thought it does the CUDA-CPU sync. It does not. Last mile changes: - I killed all of the THS/THCS directories, build scripts, bindings everything. It is now no more! - A bunch of trampolines in LegacyBridge are no more; anything that was "sparse only" is now done natively. - `sparse_coo_tensor` is implemented a little funny, but we think it's a good idea. - HIP is handled by explicitly ifdef'ing out all kernels; we'll add support for this at some later point in time. - TH_INDEX_BASE is now unconditionally set to 0. - Some uses of x.type() now replaced with x.options(), the new way of doing it. - More notes about checked_cast_tensor, and eliminate Storage/Tensor fields in the code gen env when they are dead. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2018-06-24 15:14:09 -04:00
gchanan	045e7435c3	Have a single THTensor / THCTensor type. (#8288 ) * Remove remaining TensorTypeUtils functions. Mostly what's remaining is copy utilities -- these are now provided in THCTensorCopy.hpp and templatized on the ScalarType rather than the TensorType. * Have a single THTensor / THCTensor type. As was previously done with Storages, have only a single (dtype-independent) THTensor / THCTensor. For documentation and backwards compatibility purposes, the old names, e.g. TH(Cuda)LongTensor alias the new TH(C)Tensor type. * undef GENERATE_SPARSE.	2018-06-08 17:57:44 -04:00
Zachary DeVito	d985cf46f1	Add workaround to fix include warnings in Python 2 builds. (#6716 )	2018-04-24 12:30:19 -07:00
Sam Gross	b38ed69441	Delete unused files (#5500 )	2018-03-01 14:28:06 -05:00
Sam Gross	48a3349c29	Delete dead Tensor code paths (#5417 ) This deletes most of the dead Tensor code paths, including the TensorMethods cwrap and generic/Tensor.cpp. This also moves the THNN.cwrap/.cpp generation to generate_code which can use ninja if installed.	2018-02-27 17:58:09 -05:00
Edward Z. Yang	3ada9da808	Make csrc -Werror clean. (#1795 ) Primary things I had to fix: - Suppress _XOPEN_SOURCE warnings by ensuring that Python.h is included first, because it always unconditionally defines this macro. - Turn off strict aliasing, because Python 2 doesn't work with strict aliasing. - Workaround setuptools bug, where it's incorrectly passing -Wstrict-prototypes to C++ compilers (where this doesn't make any sense) To compile csrc with -Werror, run `CFLAGS="-Werror" python setup.py build_ext` Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-06-13 20:18:09 -04:00
Zeming Lin	59d66e6963	Sparse Library (#333 )	2017-01-05 00:43:41 +01:00
Adam Paszke	2fd78112ab	Add half copy/conversions	2016-11-17 14:34:33 -08:00
Adam Paszke	ef557761dd	Allow to not use all function outputs in autograd	2016-10-31 22:47:09 +01:00
Sam Gross	79ead42ade	Add CUDA Stream and Event API (#133 )	2016-10-18 12:15:57 -04:00
Adam Paszke	06ab3f962f	Refactor _C extension to export some utilities	2016-09-21 08:36:54 -07:00
Adam Paszke	686e8d32e2	Add torch.save and torch.load	2016-08-23 07:51:55 -07:00
Adam Paszke	d7504b1f52	Fix type checks in cwrap	2016-08-02 09:45:22 -07:00
Adam Paszke	3a44259b32	Add support for CUDA	2016-07-19 10:45:59 -04:00

24 Commits