pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
cyy	8a3c241094	Remove unused header inclusion (#119667) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/119667 Approved by: https://github.com/Skylion007	2024-02-12 05:36:25 +00:00
Yuxin Wu	e6996ea172	Don't redefine __STDC_FORMAT_MACROS (#89310) Similar to https://github.com/pytorch/pytorch/pull/39608 and https://github.com/pytorch/pytorch/pull/6676 This causes a compile error in our internal build. Pull Request resolved: https://github.com/pytorch/pytorch/pull/89310 Approved by: https://github.com/kit1980	2022-11-19 02:24:21 +00:00
Kurt Mohler	32cf6c6fb0	Remove `THPTensor` defs, override macros, and `GenerateByteType.h` (#82503) ### Description These are old definitions and files that aren't used anymore. ### Issue Fixes #82502 ### Testing N/A Pull Request resolved: https://github.com/pytorch/pytorch/pull/82503 Approved by: https://github.com/ezyang	2022-07-30 19:40:16 +00:00
Nikita Shulga	e895672b35	Followup fix after #78828 (#79554) Will be skipped when imported internally, for more details see https://www.internalfb.com/diff/D37114156?src_version_fbid=3331368873807344 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79554 Approved by: https://github.com/albanD	2022-06-14 20:20:18 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
xiaobing.zhang	c2c835dd95	Port sigmoid backward to Aten(CPU+CUDA) (#29185)

Summary: VitalyFedyunin, this PR ports sigmoid backward to ATen.

Test script:
```
import torch
import torch.nn as nn
import time

torch.manual_seed(0)

def _time():
    if torch.cuda.is_available():
        torch.cuda.synchronize()
    return time.time()

device = "cpu"
if torch.cuda.is_available():
    device = "cuda"

#warm up
for n in [100, 10000]:
    input = torch.randn(128, n, requires_grad=True, device=device)
    for i in range(1000):
        output = input.sigmoid().sum()
        output.backward()

#get running time
for n in [100, 10000]:
    bwd_t = 0
    input = torch.randn(128, n, requires_grad=True, device=device)
    for i in range(10000):
        output = input.sigmoid().sum()
        t1 = _time()
        output.backward()
        t2 = _time()
        bwd_t = bwd_t + (t2 - t1)
    bwd_avg = bwd_t / 10000 * 1000
    print("input size(128, %d), backwad avg time is %.2f (ms)." % (n, bwd_avg))
```
Test Device: CPU: skx-8280, GPU: Tesla P40

Performance:

Before:
```
GPU:
input size(128, 100), backwad avg time is 0.14 (ms).
input size(128, 10000), backwad avg time is 0.17 (ms).
CPU:
OMP_NUM_THREADS=56
input size(128, 100), backwad avg time is 0.06 (ms).
input size(128, 10000), backwad avg time is 4.21 (ms).
OMP_NUM_THREADS=1
input size(128, 100), backwad avg time is 0.06 (ms).
input size(128, 10000), backwad avg time is 2.30 (ms).
```
After:
```
GPU:
input size(128, 100), backwad avg time is 0.14 (ms).
input size(128, 10000), backwad avg time is 0.17 (ms).
CPU:
OMP_NUM_THREADS=56
input size(128, 100), backwad avg time is 0.05 (ms).
input size(128, 10000), backwad avg time is 0.48 (ms).
OMP_NUM_THREADS=1
input size(128, 100), backwad avg time is 0.04 (ms).
input size(128, 10000), backwad avg time is 0.86 (ms).
```
How to set the number of threads? Use the following script:
```
num_threads=$1
script=$2
last_core=`expr $num_threads - 1`

echo "using $num_threads OMP threads"
echo "bind cores to 0~$last_core"

export OMP_NUM_THREADS=$num_threads
export KMP_AFFINITY=granularity=fine,compact,1,0
numactl --physcpubind=0-$last_core --membind=0 python $script
```
and run `./run.sh num_threads test.py`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/29185 Differential Revision: D18587352 Pulled By: VitalyFedyunin fbshipit-source-id: 8167ca261960399f795d35a83fa8c4be365bc4da	2019-11-20 07:31:42 -08:00
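As a reading aid for the timings quoted in the commit message above, the following sketch computes the CPU backward-pass speedup for the (128, 10000) input from the reported before and after numbers. The dictionary layout and variable names are illustrative only and are not part of the PR.

```python
# Average backward times in ms for input size (128, 10000), copied from the
# commit message above. Keys are the OMP_NUM_THREADS settings that were tested.
before_ms = {56: 4.21, 1: 2.30}
after_ms = {56: 0.48, 1: 0.86}

# Speedup is the old time divided by the new time for the same thread count.
speedup = {threads: before_ms[threads] / after_ms[threads] for threads in before_ms}

for threads, ratio in speedup.items():
    print("OMP_NUM_THREADS=%d: %.1fx faster" % (threads, ratio))
```

The GPU timings are identical before and after (0.14 ms and 0.17 ms), so they are left out of the calculation.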
Edward Yang	517c7c9861	Canonicalize all includes in PyTorch. (#14849)

Summary: Anywhere we used #include "foo.h", we now say #include <foo.h>. Paths are adjusted to be rooted out of aten/src, torch/lib, or the root level directory. I modified CMakeLists.txt by hand to remove TH and THC from the include paths. I used the following script to do the canonicalization:
```
import subprocess
import re
import os.path

files = subprocess.check_output(['git', 'ls-files']).decode('utf-8').rstrip().split('\n')
for fn in files:
    if not any(fn.endswith(suff) for suff in ['.cu', '.cpp', '.in', '.h', '.hpp', '.cu', '.cuh', '.cc']):
        continue
    if not any(fn.startswith(pref) for pref in ["aten/", "torch/"]):
        continue
    with open(fn, 'r') as f:
        c = f.read()
    def fmt(p):
        return "#include <{}>".format(p)
    def repl(m):
        p = m.group(1)
        if p in ["dlfcn.h", "unistd.h", "nvrtc.h", "cuda.h", "cuda_runtime.h", "cstdint", "cudnn.h", "Python.h", "cusparse.h", "cuda_runtime_api.h", "cuda_fp16.h", "cublas_v2.h", "stdint.h", "curand_kernel.h"]:
            return fmt(p)
        if any(p.startswith(pref) for pref in ["torch/csrc", "c10/", "ATen/", "caffe2/", "TH/", "THC/", "Eigen/", "gtest/", "zdl/", "gloo/", "onnx/", "miopen/"]):
            return fmt(p)
        for root in ["aten/src", "torch/lib", ""]:
            for bad_root in [os.path.dirname(fn), "aten/src/TH", "aten/src/THC", "torch/csrc"]:
                new_p = os.path.relpath(os.path.join(bad_root, p), root)
                if not new_p.startswith("../") and (os.path.exists(os.path.join(root, new_p)) or os.path.exists(os.path.join(root, new_p + ".in"))):
                    return fmt(new_p)
        print("ERROR: ", fn, p)
        return m.group(0)
    new_c = re.sub(r'#include "([^"]+)"', repl, c)
    if new_c != c:
        print(fn)
        with open(fn, 'w') as f:
            f.write(new_c)
```
Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/14849 Reviewed By: dzhulgakov Differential Revision: D13363445 Pulled By: ezyang fbshipit-source-id: 52361f878a672785f9306c9e9ab2513128092b68	2018-12-08 19:38:30 -08:00
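The core of the canonicalization described in the commit above is a single `re.sub` call. A minimal sketch of that substitution on an in-memory string follows. It is a simplification: it rewrites every quoted include unconditionally and skips the path resolution against aten/src, torch/lib, and the repository root that the commit's script performs. The helper name `canonicalize` and the sample input are hypothetical.

```python
import re

def canonicalize(source: str) -> str:
    """Rewrite `#include "x"` as `#include <x>` without resolving paths."""
    return re.sub(
        r'#include "([^"]+)"',
        lambda m: "#include <{}>".format(m.group(1)),
        source,
    )

# Hypothetical input: one quoted include and one that is already canonical.
sample = '#include "torch/csrc/THP.h"\n#include <Python.h>\n'
print(canonicalize(sample))
```

Includes that already use angle brackets do not match the pattern and pass through unchanged.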
Peter Goldsborough	d6c53328f9	Large scale fix of python-related files in torch/csrc/ Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14515 Differential Revision: D13247966 Pulled By: goldsborough fbshipit-source-id: 7a127c508fc576a7a92626dd6b729f660162d628	2018-12-07 13:04:46 -08:00
Zachary DeVito	d985cf46f1	Add workaround to fix include warnings in Python 2 builds. (#6716)	2018-04-24 12:30:19 -07:00
Sam Gross	d0cabbde74	Implement Variable.from_numpy (#4043) Implements from_numpy using ATen tensors. Variable.from_numpy is a convenient placeholder for the variant that returns Variables until we merge Tensor and Variable. The behavior is slightly changed: - from_numpy() on an empty array now returns an empty tensor instead of throwing an exception. The shape may not be preserved. - CharTensor(ndarray) used to throw an exception. It now copies the ndarray. Copying is implemented via ATen toType.	2017-12-06 14:08:56 -05:00
andreh7	cc8fd5bde1	added #define __STDC_FORMAT_MACROS to tensor and storage code templates to avoid problems with gcc 4.8.5 (#3629)	2017-11-10 15:21:33 -05:00
Adam Paszke	9169f60a84	Parallelize TensorMethods.cpp builds (#1400)	2017-04-29 09:07:21 -04:00
Soumith Chintala	24e5a9057e	Revert "Parallelize TensorMethods.cpp builds (#1364)" (#1390) This reverts commit `060048bcd8`.	2017-04-28 07:59:40 -04:00
Adam Paszke	060048bcd8	Parallelize TensorMethods.cpp builds (#1364)	2017-04-28 07:45:21 -04:00
Sam Gross	bd5303010d	Refactor autograd package to separate Python dependencies. (#662) The core autograd Variable, Function, and Engine no longer depend on the Python API. This lets us implement functions in C++. In the future, we can also multithread engine and release the GIL for most of the non-Python backwards.	2017-02-13 16:00:16 -08:00
Sam Gross	1af9a9637f	Refactor copy and release GIL during copy (#286)	2016-12-11 21:54:58 +01:00
Adam Paszke	ef557761dd	Allow to not use all function outputs in autograd	2016-10-31 22:47:09 +01:00
Adam Paszke	93b8b5631f	Improve CUDA tensor constructor speed	2016-10-13 17:16:39 -07:00
Sam Gross	e8a5f00866	Auto GPU for CUNN (#71)	2016-09-30 14:04:53 -04:00
Adam Paszke	1828e7c42f	Add async CUDA copy	2016-09-27 15:12:48 -07:00
Adam Paszke	12bed8dc0d	Add CUDA device selection	2016-08-12 07:46:46 -07:00
Adam Paszke	92e983a489	Fixes for Linux and new cutorch	2016-08-02 09:20:18 -07:00
Adam Paszke	3a44259b32	Add support for CUDA	2016-07-19 10:45:59 -04:00

23 Commits