pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Edward Yang	6edf340338	Delete torch/__init__.pyi, deferring to direct extension stubs (#38157 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38157 This removes the error prone process of assembling `torch/__init__.pyi` (and frequently forgetting to expose things), since now we can simply rely on the true source file to get things done. Most of the old codegen in gen_pyi.py is now rerouted to various files: - `torch/_C/__init__.pyi` (the dumping pile of all misc bindings) - `torch/_C/_nn.pyi` (NN function bindings) - `torch/_C/_VariableFunctions.pyi` (torch function bindings) `torch.types` grew a bunch more definitions that previously where defined in `torch/__init__.pyi` Some miscellaneous changes - Fixed a bug where we treat single TensorList argument as implying varargs are accepted. This is actually only supported on IntList. This means we can correctly generate a stub for dequantize. - Add missing manual stub for nonzero - Switched torch/onnx/operators.py to directly refer to _C module, since apparently mypy doesn't think that methods prefixed with underscores get reexported. This may be a recurring theme; maybe we need to find a better way to solve it. Because I was really lazy, I dumped namedtuple definitions in both `torch._C` and `torch._C._VariableFunctions`. This is definitely wrong. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D21497400 Pulled By: ezyang fbshipit-source-id: 07b126141c82efaca37be27c07255cb2b9b3f064	2020-05-11 07:20:13 -07:00
Jerry Zhang	0ed7fc581c	[quant][graphmode][refactor] Split quantization.cpp (#37975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37975 Test Plan: . Imported from OSS Differential Revision: D21468497 fbshipit-source-id: 35cbf98a344ca6e4094d616a4040eacf017fd2de	2020-05-08 12:24:50 -07:00
peter	c5d6f59ab1	Replacing EHa with EHsc (#37235 ) Summary: We should not rely on the async exceptions. Catching C++ only exception is more sensible and may get a boost in both space (1163 MB -> 1073 MB, 0.92x) and performance(51m -> 49m, 0.96x). Pull Request resolved: https://github.com/pytorch/pytorch/pull/37235 Differential Revision: D21256918 Pulled By: ezyang fbshipit-source-id: 572ee96f2e4c48ad13f83409e4e113483b3a457a	2020-04-28 08:20:37 -07:00
Mo Zhou	5b9f7f7b0e	[cmake] Add USE_SYSTEM_{GLOO,FP16,PTHREADPOOL,PSIMD,FXDIV,BENCHMARK} options (#14699 ) (#37277 ) Summary: These options are disabled by default, and are supposed to be used by linux distro developers. With the existing shortcut option USE_SYSTEM_LIBS toggled, these new options will be enabled as well. Additionally, when USE_SYSTEM_LIBS is toggled, setup.py should no longer check the existence of git submodules. ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/37277 Differential Revision: D21256999 Pulled By: ezyang fbshipit-source-id: 84f97d008db5a5e41a289cb7bce94906de3c52cf	2020-04-27 09:37:27 -07:00
Mo Zhou	ff21b15624	cmake: add USE_SYSTEM_{LIBS,CPUINFO,SLEEF} options (#14699 ) (#37137 ) Summary: ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/37137 Differential Revision: D21222632 Pulled By: ezyang fbshipit-source-id: 47624b30f8d07b31a40a26edf665bbec39e45202	2020-04-23 20:43:36 -07:00
Christian Kastner	6df90bcecc	setup.py: Remove conflicting double documentation of USE_FBGEMM (#36993 ) Summary: Line 33+ contains instructions on how to disable use, 108+ on how to enable it. The default in CMakeLists.txt is enabled, so drop the latter. Pull Request resolved: https://github.com/pytorch/pytorch/pull/36993 Differential Revision: D21161793 Pulled By: ngimel fbshipit-source-id: 08c5eecaf8768491f90d4a52c338ecea32a0c35e	2020-04-21 22:33:49 -07:00
David Reiss	3c85f44ce8	Fail setup.py if trying to set up with Python 2 (#35613 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35613 Python 2 has reached end-of-life and is no longer supported by PyTorch. To spare users from a long, doomed setup when trying to use PyTorch with Python 2, detect this case early and fail with a clear message. This commit covers setup.py. Test Plan: Attempted to build PyTorch with Python 2 and saw a clear error quickly. Differential Revision: D20842881 Pulled By: dreiss fbshipit-source-id: caaaa0dbff83145ff668bd25df6d7d4b3ce12e47	2020-04-16 10:24:03 -07:00
peter	b9260bdb7b	Don't build deps for `python setup.py egg_info` (#36208 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/36207. Pull Request resolved: https://github.com/pytorch/pytorch/pull/36208 Differential Revision: D20919649 Pulled By: ezyang fbshipit-source-id: b5242a540181b29dba8987fb5f00332e1e81ca98	2020-04-08 09:02:01 -07:00
Sebastian Messmer	7ee88d61f7	Rename boxing/unboxing files and utilities (#35411 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/35411 The file and class names in ATen/core/boxing were quite confusing. Let's rename them for readability. Also move function schema inference out of the boxing logic into op_registration.h where it belongs. ghstack-source-id: 101539206 Test Plan: waitforsandcastle Differential Revision: D20653621 fbshipit-source-id: 6a79c73d5758bee1e072d543c030913b18a69c7c	2020-04-04 14:13:28 -07:00
Feng Tian	762270c51f	add c10d dynamic loading mechanism and unit test (#28068 ) Summary: The original behavior of pytorch c10d only supports built-in c10d backends, such as nccl/gloo/mpi. This patch is used to extend the c10d capability to support dynamically loading 3rd party communication libraries which are derived from ProcessGroup base class. related RFC is in: https://github.com/pytorch/pytorch/issues/27955 Through this way, user just need specify a 3rd party c10d backend name when invoking torch.distributed.init_process_group(). The proposed logic will try to load corresponding c10d backend cpp extension automatically. as for how to develop a new 3rd party c10d backend through cpp extension, pls refer to test/cpp_extensions/cpp_c10d_extension.cpp Pull Request resolved: https://github.com/pytorch/pytorch/pull/28068 Differential Revision: D19174838 Pulled By: agolynski fbshipit-source-id: 3409a504a43ce7260e6f9d1207c00e87471fac62	2020-04-02 15:46:51 -07:00
Orion Reblitz-Richardson	f101949390	Remove python2 support from setup.py (#35539 ) Summary: As a followup to https://github.com/pytorch/pytorch/pull/35042 this removes python2 from setup.py and adds Python 3.8 to the list of supported versions. We're already testing this in CircleCI. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35539 Differential Revision: D20709060 Pulled By: orionr fbshipit-source-id: 5d40bc14cb885374fec370fc7c5d3cde8769039a	2020-03-27 14:33:11 -07:00
pinzhenx	bd604cb5b7	Upgrade MKL-DNN to DNNL v1.2 (#32422 ) Summary: ## Motivation This PR upgrades MKL-DNN from v0.20 to DNNL v1.2 and resolves https://github.com/pytorch/pytorch/issues/30300. DNNL (Deep Neural Network Library) is the new brand of MKL-DNN, which improves performance, quality, and usability over the old version. This PR focuses on the migration of all existing functionalities, including minor fixes, performance improvement and code clean up. It serves as the cornerstone of our future efforts to accommodate new features like OpenCL support, BF16 training, INT8 inference, etc. and to let the Pytorch community derive more benefits from the Intel Architecture. <br> ## What's included? Even DNNL has many breaking changes to the API, we managed to absorb most of them in ideep. This PR contains minimalist changes to the integration code in pytorch. Below is a summary of the changes: <br> General: 1. Replace op-level allocator with global-registered allocator ``` // before ideep::sum::compute<AllocForMKLDNN>(scales, {x, y}, z); // after ideep::sum::compute(scales, {x, y}, z); ``` The allocator is now being registeted at `aten/src/ATen/native/mkldnn/IDeepRegistration.cpp`. Thereafter all tensors derived from the `cpu_engine` (by default) will use the c10 allocator. ``` RegisterEngineAllocator cpu_alloc( ideep::engine::cpu_engine(), [](size_t size) { return c10::GetAllocator(c10::DeviceType::CPU)->raw_allocate(size); }, [](void* p) { c10::GetAllocator(c10::DeviceType::CPU)->raw_deallocate(p); } ); ``` ------ 2. Simplify group convolution We had such a scenario in convolution where ideep tensor shape mismatched aten tensor: when `groups > 1`, DNNL expects weights tensors to be 5-d with an extra group dimension, e.g. `goihw` instead of `oihw` in 2d conv case. As shown below, a lot of extra checks came with this difference in shape before. Now we've completely hidden this difference in ideep and all tensors are going to align with pytorch's definition. So we could safely remove these checks from both aten and c2 integration code. ``` // aten/src/ATen/native/mkldnn/Conv.cpp if (w.ndims() == x.ndims() + 1) { AT_ASSERTM( groups > 1, "Only group _mkldnn_conv2d weights could have been reordered to 5d"); kernel_size[0] = w.get_dim(0) * w.get_dim(1); std::copy_n( w.get_dims().cbegin() + 2, x.ndims() - 1, kernel_size.begin() + 1); } else { std::copy_n(w.get_dims().cbegin(), x.ndims(), kernel_size.begin()); } ``` ------ 3. Enable DNNL built-in cache Previously, we stored DNNL jitted kernels along with intermediate buffers inside ideep using an LRU cache. Now we are switching to the newly added DNNL built-in cache, and no longer caching buffers in order to reduce memory footprint. This change will be mainly reflected in lower memory usage from memory profiling results. On the code side, we removed couple of lines of `op_key_` that depended on the ideep cache before. ------ 4. Use 64-bit integer to denote dimensions We changed the type of `ideep::dims` from `vector<int32_t>` to `vector<int64_t>`. This renders ideep dims no longer compatible with 32-bit dims used by caffe2. So we use something like `{stride_.begin(), stride_.end()}` to cast parameter `stride_` into a int64 vector. <br> Misc changes in each commit: Commit: change build options Some build options were slightly changed, mainly to avoid name collisions with other projects that include DNNL as a subproject. In addition, DNNL built-in cache is enabled by option `DNNL_ENABLE_PRIMITIVE_CACHE`. Old \| New -- \| -- WITH_EXAMPLE \| MKLDNN_BUILD_EXAMPLES WITH_TEST \| MKLDNN_BUILD_TESTS MKLDNN_THREADING \| MKLDNN_CPU_RUNTIME MKLDNN_USE_MKL \| N/A (not use MKL anymore) ------ Commit: aten reintegration - aten/src/ATen/native/mkldnn/BinaryOps.cpp Implement binary ops using new operation `binary` provided by DNNL - aten/src/ATen/native/mkldnn/Conv.cpp Clean up group convolution checks Simplify conv backward integration - aten/src/ATen/native/mkldnn/MKLDNNConversions.cpp Simplify prepacking convolution weights - test/test_mkldnn.py Fixed an issue in conv2d unit test: it didn't check conv results between mkldnn and aten implementation before. Instead, it compared the mkldnn with mkldnn as the default cpu path will also go into mkldnn. Now we use `torch.backends.mkldnn.flags` to fix this issue - torch/utils/mkldnn.py Prepack weight tensor on module `__init__` to achieve better performance significantly ------ Commit: caffe2 reintegration - caffe2/ideep/ideep_utils.h Clean up unused type definitions - caffe2/ideep/operators/adam_op.cc & caffe2/ideep/operators/momentum_sgd_op.cc Unify tensor initialization with `ideep::tensor::init`. Obsolete `ideep::tensor::reinit` - caffe2/ideep/operators/conv_op.cc & caffe2/ideep/operators/quantization/int8_conv_op.cc Clean up group convolution checks Revamp convolution API - caffe2/ideep/operators/conv_transpose_op.cc Clean up group convolution checks Clean up deconv workaround code ------ Commit: custom allocator - Register c10 allocator as mentioned above <br><br> ## Performance We tested inference on some common models based on user scenarios, and most performance numbers are either better than or on par with DNNL 0.20. ratio: new / old \| Latency (batch=1 4T) \| Throughput (batch=64 56T) -- \| -- \| -- pytorch resnet18 \| 121.4% \| 99.7% pytorch resnet50 \| 123.1% \| 106.9% pytorch resnext101_32x8d \| 116.3% \| 100.1% pytorch resnext50_32x4d \| 141.9% \| 104.4% pytorch mobilenet_v2 \| 163.0% \| 105.8% caffe2 alexnet \| 303.0% \| 99.2% caffe2 googlenet-v3 \| 101.1% \| 99.2% caffe2 inception-v1 \| 102.2% \| 101.7% caffe2 mobilenet-v1 \| 356.1% \| 253.7% caffe2 resnet101 \| 100.4% \| 99.8% caffe2 resnet152 \| 99.8% \| 99.8% caffe2 shufflenet \| 141.1% \| 69.0% † caffe2 squeezenet \| 98.5% \| 99.2% caffe2 vgg16 \| 136.8% \| 100.6% caffe2 googlenet-v3 int8 \| 100.0% \| 100.7% caffe2 mobilenet-v1 int8 \| 779.2% \| 943.0% caffe2 resnet50 int8 \| 99.5% \| 95.5% _Configuration: Platform: Skylake 8180 Latency Test: 4 threads, warmup 30, iteration 500, batch size 1 Throughput Test: 56 threads, warmup 30, iteration 200, batch size 64_ † Shufflenet is one of the few models that require temp buffers during inference. The performance degradation is an expected issue since we no longer cache any buffer in the ideep. As for the solution, we suggest users opt for caching allocator like jemalloc as a drop-in replacement for system allocator in such heavy workloads. Pull Request resolved: https://github.com/pytorch/pytorch/pull/32422 Test Plan: Perf results: https://our.intern.facebook.com/intern/fblearner/details/177790608?tab=Experiment%20Results 10% improvement for ResNext with avx512, neutral on avx2 More results: https://fb.quip.com/ob10AL0bCDXW#NNNACAUoHJP Reviewed By: yinghai Differential Revision: D20381325 Pulled By: dzhulgakov fbshipit-source-id: 803b906fd89ed8b723c5fcab55039efe3e4bcb77	2020-03-26 22:07:59 -07:00
Pavel Belevich	11a40410e7	pybind11 type_caster for at::Generator and custom RNG python test (#34774 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34774 This PR provides pybind11's `type_caster<at::Generator>` that allows mapping `at::Generator` instance returned from user-defined method to python `torch::Generator`, defined as `THPGenerator ` c++ class. This allows 1) defining custom RNG in c++ extension 2) using custom RNG in python code. `TestRNGExtension.test_rng` shows how to use custom RNG defined in `rng_extension.cpp` Test Plan: Imported from OSS Differential Revision: D20549451 Pulled By: pbelevich fbshipit-source-id: 312a6deccf8228f7f60695bbf95834620d52f5eb	2020-03-22 10:57:35 -07:00
Nikita Shulga	d3f5045bf5	PyTorch should always depend on `future` (#35057 ) Summary: Because `past` is used in `caffe2.python.core` Pull Request resolved: https://github.com/pytorch/pytorch/pull/35057 Test Plan: CI Differential Revision: D20547042 Pulled By: malfet fbshipit-source-id: cad2123c7b88271fea37f21e616df551075383a8	2020-03-19 17:31:47 -07:00
Eli Uriegas	275f5c8049	setup.py: Add numpy as required for install_requires (#34510 ) Summary: Was originally not a requirement but we should add it back here since it's required on import and we require it anyways for our conda packages. Tested with: ``` ❯ pkginfo -f requires_dist *.whl requires_dist: ['numpy'] ``` Signed-off-by: Eli Uriegas <eliuriegas@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/34510 Differential Revision: D20352125 Pulled By: seemethere fbshipit-source-id: 383e396fe500ed7043d83c3df57d1772d0fff1e6	2020-03-17 13:31:55 -07:00
Nikita Shulga	6d790c3611	Mark PyTorch incompatible with python-3.6.0 (#34724 ) Summary: Per https://github.com/pytorch/pytorch/issues/19161 PyTorch is incompatible with 3.6.0 due to the missing `PySlice_Unpack` Pull Request resolved: https://github.com/pytorch/pytorch/pull/34724 Test Plan: CI + try to load pytorch binary using python-3.6.0 Differential Revision: D20449052 Pulled By: malfet fbshipit-source-id: 2c787fc64f5d1377c7f935ad2f3c77f46723d7dd	2020-03-13 15:22:34 -07:00
Nikita Shulga	dd7cec680c	Do not use clang if it can not parse system extensions (#34549 ) Summary: Attempt to build pytorch with ASAN on system with gcc-8 fails due to the mismatch system compilation flags. Address the issue by using original compiler to build `torch._C` extension Pull Request resolved: https://github.com/pytorch/pytorch/pull/34549 Test Plan: Run `.jenkins/pytorch/build-asan.sh` on FC-30 Differential Revision: D20373781 Pulled By: malfet fbshipit-source-id: 041c8d25f96b4436385a5e0eb6fc46e9b5fdf3f1	2020-03-10 15:40:08 -07:00
xiaobing.zhang	b678256bfb	Move glu to Aten(CPU) (#33179 ) Summary: This PR move glu to Aten(CPU). Test script: ``` import torch import torch.nn.functional as F import time torch.manual_seed(0) def _time(): if torch.cuda.is_available(): torch.cuda.synchronize() return time.time() device = "cpu" #warm up for n in [10, 100, 1000, 10000]: input = torch.randn(128, n, requires_grad=True, device=device) grad_output = torch.ones(128, n // 2, device=device) for i in range(1000): output = F.glu(input) output.backward(grad_output) for n in [10, 100, 1000, 10000]: fwd_t = 0 bwd_t = 0 input = torch.randn(128, n, requires_grad=True, device=device) grad_output = torch.ones(128, n // 2, device=device) for i in range(10000): t1 = _time() output = F.glu(input) t2 = _time() output.backward(grad_output) t3 = _time() fwd_t = fwd_t + (t2 -t1) bwd_t = bwd_t + (t3 - t2) fwd_avg = fwd_t / 10000 * 1000 bwd_avg = bwd_t / 10000 * 1000 print("input size(128, %d) forward time is %.2f (ms); backwad avg time is %.2f (ms)." % (n, fwd_avg, bwd_avg)) ``` Test device: skx-8180. Before: ``` input size(128, 10) forward time is 0.04 (ms); backwad avg time is 0.08 (ms). input size(128, 100) forward time is 0.06 (ms); backwad avg time is 0.14 (ms). input size(128, 1000) forward time is 0.11 (ms); backwad avg time is 0.31 (ms). input size(128, 10000) forward time is 1.52 (ms); backwad avg time is 2.04 (ms). ``` After: ``` input size(128, 10) forward time is 0.02 (ms); backwad avg time is 0.05 (ms). input size(128, 100) forward time is 0.04 (ms); backwad avg time is 0.09 (ms). input size(128, 1000) forward time is 0.07 (ms); backwad avg time is 0.17 (ms). input size(128, 10000) forward time is 0.13 (ms); backwad avg time is 1.03 (ms). ``` Fix https://github.com/pytorch/pytorch/issues/24707, https://github.com/pytorch/pytorch/issues/24708. Pull Request resolved: https://github.com/pytorch/pytorch/pull/33179 Differential Revision: D19839835 Pulled By: VitalyFedyunin fbshipit-source-id: e4d3438556a1068da2c4a7e573d6bbf8d2a6e2b9	2020-02-28 14:54:38 -08:00
Michael Suo	dbe850af5b	[jit] do the code reorg (#33851 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33851 Rationale and context described in #33828. Script to reproduce the move: https://gist.github.com/suo/16cbefaaeb67ca5a7c6caffd49b7f6e9 ghstack-source-id: 99079645 Test Plan: Make sure CI passes Reviewed By: jamesr66a Differential Revision: D20133869 fbshipit-source-id: 390e9241a9c85366d9005c492ac31f10aa96488e	2020-02-27 13:02:51 -08:00
Pavel Belevich	b1c85dd916	Custom RNG DispatchKey (#32325 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32325 The purpose of this PR is to enable PyTorch dispatching on `at::Generator` parameters and demonstrate how it can be used in cpp extensions to implement custom RNG. 1. `CustomRNGKeyId` value added to DispatchKey enum and `DispatchKeySet key_set_` added to `at::Generator` 2. The overloaded `operator()(at::Generator gen)` added to MultiDispatchKeySet. 3. The existing CPUGenerator and CUDAGenerator class are supplied with CPUTensorId and CUDATensorId dispatch keys 4. The implementation of CPU's `cauchy_kernel`(as an example, because it's already moved to ATen) was templatized and moved to `ATen/native/cpu/DistributionTemplates.h` to make it available for cpp extensions 5. Minor CMake changes to make native/cpu tensors available for cpp extensions 6. RegisterCustomRNG test that demonstrates how CustomCPUGenerator class can be implemented and how custom_rng_cauchy_ native function can be registered to handle Tensor::cauchy_ calls. Test Plan: Imported from OSS Differential Revision: D19604558 Pulled By: pbelevich fbshipit-source-id: 2619f14076cee5742094a0be832d8530bba72728	2020-01-29 11:30:04 -08:00
Pritam Damania	f050b16dd9	Move pytorch distributed tests to separate folder for contbuild. (#30445 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30445 Create distributed and rpc directories under caffe/test for better management of unit tests. Differential Revision: D18702786 fbshipit-source-id: e9daeed0cfb846ef68806f6decfcb57c0e0e3606	2020-01-22 21:16:59 -08:00
ashish	9a4219eb39	Install complete set of headers for ROCm build (#32076 ) Summary: This PR adds a more complete list of pytorch header files to be installed at build time. It also fixes one instance of including a header from local src directory instead of installed directory. A more complete set of headers enable other modules to correctly work with pyTorch built for ROCm. cc: ezyang bddppq iotamudelta Pull Request resolved: https://github.com/pytorch/pytorch/pull/32076 Differential Revision: D19372933 Pulled By: ezyang fbshipit-source-id: 3b5f3241c001fa05ea448c359a706ce9a8214aa0	2020-01-13 08:33:28 -08:00
Edward Yang	4ef9daf7b2	Remove dead CAFFE2_LIBS variable (#31155 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31155 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D19262584 Pulled By: ezyang fbshipit-source-id: 147ac5a9c36e813ea9a2f68b498880942d661be5	2020-01-06 14:39:47 -08:00
zrphercule	c564d794ed	Add ATen/native/ headers to torch target (#30835 ) Summary: We dont have ATen/native/*.h in torch target before, and we would like it to be exposed for external use. Pull Request resolved: https://github.com/pytorch/pytorch/pull/30835 Differential Revision: D18836160 Pulled By: zrphercule fbshipit-source-id: 7330a9c9d8b65f173cc332b1cfeeb18c7dca20a8	2019-12-05 13:24:21 -08:00
Sebastian Messmer	bc2e6d10fa	Back out "Revert D17908478: Switch PyTorch/Caffe2 to C++14" Summary: Original commit changeset: 775d2e29be0b Test Plan: CI Reviewed By: mruberry Differential Revision: D18775520 fbshipit-source-id: a350b3f86b66d97241f208786ee67e9a51172eac	2019-12-03 14:33:43 -08:00
Sebastian Messmer	a2ed50c920	Revert D17908478: Switch PyTorch/Caffe2 to C++14 Test Plan: revert-hammer Differential Revision: D17908478 Original commit changeset: 6e340024591e fbshipit-source-id: 775d2e29be0bc3a0db64f164c8960c44d4877d5d	2019-11-27 14:57:05 -08:00
Sebastian Messmer	d0acc9c085	Switch PyTorch/Caffe2 to C++14 (#30406 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/30406 ghstack-source-id: 94642238 Test Plan: waitforsandcastle Differential Revision: D17908478 fbshipit-source-id: 6e340024591ec2c69521668022999df4a33b4ddb	2019-11-27 10:47:31 -08:00
Thomas Viehmann	7889e1e3f9	Add `torch.version.hip` from cmake (#29815 ) Summary: This adds the HIP_VERSION cmake variable as hip_version. This should help detecting ROCm, e.g. in https://github.com/pytorch/pytorch/issues/22091. To parallel CUDA, hip_version is a string. An alternative variant might be to split by '.' and only take the first two parts. The method suffers a bit from ROCm not being as monolithic as CUDA. Pull Request resolved: https://github.com/pytorch/pytorch/pull/29815 Differential Revision: D18532267 Pulled By: bddppq fbshipit-source-id: 1bde4ad0cfacc47bfd1c0945e130921d8575a5bf	2019-11-15 12:03:15 -08:00
Junjie Bai	b0c245d52d	Consolidate the places that find pybind11 include dirs (#29659 ) Summary: Also move the logic that installs the pybind11 headers from setup.py to cmake (to align with other headers). Pull Request resolved: https://github.com/pytorch/pytorch/pull/29659 Differential Revision: D18458208 Pulled By: bddppq fbshipit-source-id: cfd1e74b892d4a65591626ab321780c8c87b810d	2019-11-12 14:51:56 -08:00
zrphercule	eae4a69069	Add quantized fbgemm headers to torch target (#29418 ) Summary: We dont have ATen/native/quantized/cpu/*.h in torch target before, and we would like it to be exposed for external use. Pull Request resolved: https://github.com/pytorch/pytorch/pull/29418 Differential Revision: D18383534 Pulled By: zrphercule fbshipit-source-id: 72c06ae2c10e8cc49e7256c9e9b89288263bbfde	2019-11-08 14:32:19 -08:00
peter	d05da7dad3	Fix virtualenv builds on Windows (#29273 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/29058. Pull Request resolved: https://github.com/pytorch/pytorch/pull/29273 Differential Revision: D18349822 Pulled By: ezyang fbshipit-source-id: c4d76521cc0742d890f22f1d7f32dede5600b651	2019-11-06 09:02:30 -08:00
qzhong0605	50fd20b64a	fix bug on setup.py to include header files on caffe2/utils/math (#28869 ) Summary: This problem is from issue [https://github.com/pytorch/pytorch/issues/28753](https://github.com/pytorch/pytorch/issues/28753) The header files on directories`math` and `threadpool` should be included on the built package because they are included on the other header files, such as on file `torch/include/caffe2/utils/math.h` ``` #include "caffe2/core/common.h" #include "caffe2/core/types.h" #include "caffe2/utils/math/broadcast.h" #include "caffe2/utils/math/elementwise.h" #include "caffe2/utils/math/reduce.h" #include "caffe2/utils/math/transpose.h" #include "caffe2/utils/math/utils.h" ``` But the `setup.py` doesn't include the header files on `master` branch. The header files on `utils` directory of a built `torch` package are the following: ``` > ls include/caffe2/utils bench_utils.h conversions.h eigen_utils.h map_utils.h murmur_hash3.h proto_wrap.h smart_tensor_printer.h cast.h cpuid.h filler.h math-detail.h proto_convert.h signal_handler.h string_utils.h cblas.h cpu_neon.h fixed_divisor.h math.h proto_utils.h simple_queue.h zmq_helper.h ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/28869 Differential Revision: D18226319 Pulled By: soumith fbshipit-source-id: 51575ddc559181c069b3324aa9b2d1669310ba25	2019-10-30 11:11:15 -07:00
Wanchao Liang	4beaf1cf1c	add typing runtime dependency for py2 Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28442 Test Plan: Imported from OSS Differential Revision: D18075498 fbshipit-source-id: 075f63b1ed2c83d9a64eb81224e0d67c6a63b22c	2019-10-22 22:02:08 -07:00
Hong Xu	a5354adb08	Eliminate the use of CUDA_HOME in setup.py. (#28373 ) Summary: Variables read from CMakeCache.txt are more reliable. Close https://github.com/pytorch/pytorch/issues/28365 Pull Request resolved: https://github.com/pytorch/pytorch/pull/28373 Differential Revision: D18061855 Pulled By: ezyang fbshipit-source-id: c550a365e23464411d75eca167f7e6e053f94872	2019-10-22 14:04:54 -07:00
Rohan Varma	badb08d577	Add clip_grad_norm_ to c++ api (#26140 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26140 Per https://github.com/pytorch/pytorch/issues/25883, we want to work towards C++/Python API parity. This diff adds clip_grad_norm_ to the c++ API to improve parity. ghstack-source-id: 91334333 ghstack-source-id: 91334333 Test Plan: Added a unit test Differential Revision: D17312367 fbshipit-source-id: 753ba3a4d084d01f3cc8919da3108e67c809ad65	2019-10-04 13:50:36 -07:00
Hong Xu	081069e8ca	Remove CUDA_VERSION from Python script (which has already been detected in CMake) (#27316 ) Summary: (Intentionally left blank) Pull Request resolved: https://github.com/pytorch/pytorch/pull/27316 Differential Revision: D17762715 Pulled By: ezyang fbshipit-source-id: 044c0ea6e8c2d12912c946a9a50b934b5253d8c8	2019-10-04 07:49:57 -07:00
Pavel Belevich	493c900810	Extract version to version.txt (#27149 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/27149 Extract version to version.txt and add reading version logic to setup.py and fb/torch_version.py ghstack-source-id: 91271883 Test Plan: N/A Reviewed By: gchanan, ezyang Differential Revision: D17689307 fbshipit-source-id: 21899502027cec71b63d9dc151e09ff5ff3f279d	2019-10-03 12:13:15 -07:00
Hong Xu	5e5cbceeba	remove tools/setup_helpers/cudnn.py (#25876 ) Summary: FindCUDNN.cmake and cuda.cmake have done the detection. This commit deletes `tools/setup_helpers/cudnn.py` as it is no longer needed. Previously in https://github.com/pytorch/pytorch/issues/25482, one test failed because TensorRT detects cuDNN differently, and there may be situations we can find cuDNN but TensorRT cannot. This is fixed by passing our detection result down to TensorRT. Pull Request resolved: https://github.com/pytorch/pytorch/pull/25876 Differential Revision: D17346270 Pulled By: ezyang fbshipit-source-id: c1e7ad4a1cb20f964fe07a72906f2f002425d894	2019-09-24 07:44:33 -07:00
Sebastian Messmer	ed207b53ab	c10::KernelFunction (#26337 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26337 - Factor out boxing and unboxing functionality from the c10 dispatcher into a c10::KernelFunction class - Move that class and everything else it depends on into ATen/core/boxing - This also allows us to get rid of c10::KernelCache. Instead, we now store a pointer to the unboxed functor in c10::KernelFunction. - We're also getting rid of the DispatchTableEntry struct and instead store KernelFunction directly. - To make this work, we need to change the dispatcher calling API from Dispatcher::lookup().callBoxed/callUnboxed and OperatorEntry::lookup().callBoxed/callUnboxed to Dispatcher::callBoxed/callUnboxed and OperatorEntry::callBoxed/callUnboxed. ghstack-source-id: 90459911 Test Plan: unit tests Differential Revision: D17416607 fbshipit-source-id: fd221f1d70eb3f1b4d33092eaa7e37d25684c934	2019-09-20 18:55:25 -07:00
Will Feng	57a4b7c55d	Re-organize C++ API `torch::nn` folder structure (#26262 ) Summary: This PR aims to re-organize C++ API `torch::nn` folder structure in the following way: - Every module in `torch/csrc/api/include/torch/nn/modules/` (except `any.h`, `named_any.h`, `modulelist.h`, `sequential.h`, `embedding.h`) has a strictly equivalent Python file in `torch/nn/modules/`. For example: `torch/csrc/api/include/torch/nn/modules/pooling.h` -> `torch/nn/modules/pooling.py` `torch/csrc/api/include/torch/nn/modules/conv.h` -> `torch/nn/modules/conv.py` `torch/csrc/api/include/torch/nn/modules/batchnorm.h` -> `torch/nn/modules/batchnorm.py` `torch/csrc/api/include/torch/nn/modules/sparse.h` -> `torch/nn/modules/sparse.py` - Containers such as `any.h`, `named_any.h`, `modulelist.h`, `sequential.h` are moved into `torch/csrc/api/include/torch/nn/modules/container/`, because their implementations are too long to be combined into one file (like `torch/nn/modules/container.py` in Python API) - `embedding.h` is not renamed to `sparse.h` yet, because we have another work stream that works on API parity for Embedding and EmbeddingBag, and renaming the file would cause conflict. After the embedding API parity work is done, we will rename `embedding.h` to `sparse.h` to match the Python file name, and move the embedding options out to options/ folder. - `torch/csrc/api/include/torch/nn/functional/` is added, and the folder structure mirrors that of `torch/csrc/api/include/torch/nn/modules/`. For example, `torch/csrc/api/include/torch/nn/functional/pooling.h` contains the functions for pooling, which are then used by the pooling modules in `torch/csrc/api/include/torch/nn/modules/pooling.h`. - `torch/csrc/api/include/torch/nn/options/` is added, and the folder structure mirrors that of `torch/csrc/api/include/torch/nn/modules/`. For example, `torch/csrc/api/include/torch/nn/options/pooling.h` contains MaxPoolOptions, which is used by both MaxPool modules in `torch/csrc/api/include/torch/nn/modules/pooling.h`, and max_pool functions in `torch/csrc/api/include/torch/nn/functional/pooling.h`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/26262 Differential Revision: D17422426 Pulled By: yf225 fbshipit-source-id: c413d2a374ba716dac81db31516619bbd879db7f	2019-09-17 10:07:29 -07:00
Ailing Zhang	079cd4e1fc	Remove requests as dependency (#26083 ) Summary: local build is slow... test in CI... Pull Request resolved: https://github.com/pytorch/pytorch/pull/26083 Differential Revision: D17346949 Pulled By: ailzhang fbshipit-source-id: f552d1a4be55ad4e2bd915af7c5a2c1b6667c446	2019-09-13 08:39:53 -07:00
Hong Xu	8a026d4f74	Remove tools/setup_helpers/dist_check.py (#25879 ) Summary: What dist_check.py does is largely merely determining whether we should use set "USE_IBVERBS" to ON or OFF when the user sets "USE_GLOO_IBVERBS" to ON. But this is unnecessary, because this complicated determination will always be overrided by gloo: `2101e02cea/cmake/Dependencies.cmake (L19-L28)` Since dist_check.py becomes irrelevant, this commit also simplifies the setting of `USE_DISTRIBUTED` (by removing its explicit setting in Python scripts), and deprecate `USE_GLOO_IBVERBS` in favor of `USE_IBVERBS`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/25879 Differential Revision: D17282395 Pulled By: pietern fbshipit-source-id: a10735f50728d89c3d81fd57bcd26764e7f84dd1	2019-09-10 04:33:28 -07:00
Edward Yang	97b432bdf0	Back out "[pytorch][PR] remove tools/setup_helpers/cudnn.py" Summary: Original commit changeset: abd9cd0244ca (Note: this ignores all push blocking failures!) Test Plan: none Reviewed By: nairbv Differential Revision: D17259003 fbshipit-source-id: d7e067eeb36192766c639bfcbc66f540ce8eb77e	2019-09-09 06:47:45 -07:00
Hong Xu	66ac6698f6	remove tools/setup_helpers/cudnn.py (#25482 ) Summary: FindCUDNN.cmake and cuda.cmake have done the detection. This commit deletes `tools/setup_helpers/cudnn.py` as it is no longer needed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/25482 Differential Revision: D17226408 Pulled By: ezyang fbshipit-source-id: abd9cd0244cabea1f5d9f93f828d632d77c8dd5e	2019-09-06 06:54:35 -07:00
Pieter Noordhuis	3556bea5aa	Build torch.distributed with Gloo backend on macOS (#25260 ) Summary: In facebookincubator/gloo#212, a libuv based Gloo transport was introduced, which allows us to use Gloo on macOS (and later perhaps also Windows). This commit updates CMake code to enable building with USE_DISTRIBUTED=1 on macOS. A few notes: * The Caffe2 ops are not compiled, for they depend on `gloo::transport::tcp`. * The process group implementation uses `gloo::transport::tcp` on Linux (because of `epoll(2)` on Linux and `gloo::transport::uv` on macOS). * The TCP store works but sometimes crashes on process termination. * The distributed tests are not yet run. * The nightly builds don't use `USE_DISTRIBUTED=1`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/25260 Reviewed By: mrshenli Differential Revision: D17202381 Pulled By: pietern fbshipit-source-id: ca80a82e78a05b4154271d2fb0ed31c8d9f26a7c	2019-09-05 07:09:50 -07:00
James Reed	f71ddd4292	Switch hub to use `requests` because of SSL (#25083 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/25083 I missed this in the last PR Test Plan: Imported from OSS Differential Revision: D17005372 Pulled By: jamesr66a fbshipit-source-id: 1200a6cd88fb9051aed8baf3162a9f8ffbf65189	2019-08-24 12:06:49 -07:00
Hong Xu	1a9334ea59	Hotpatch CXXFLAGS to be the same as CFLAGS if CXXFLAGS is not set. (#23568 ) Summary: This fixes build regression caused by https://github.com/pytorch/pytorch/issues/23528 because we used to let CXXFLAGS equal CFLAGS. cc suo Pull Request resolved: https://github.com/pytorch/pytorch/pull/23568 Differential Revision: D16568820 Pulled By: suo fbshipit-source-id: 64a0dc923c08ac1751224f42bc4ccdc707341762	2019-08-07 16:25:57 -07:00
Hugo	0f5d071d52	Add python_requires to help pip (#23863 ) Summary: `python_requires` helps the installer choose the correct version of this package for the user's running Python. This is especially necessary when dropping Python 2 (https://github.com/pytorch/pytorch/issues/23795) but is useful now too. Pull Request resolved: https://github.com/pytorch/pytorch/pull/23863 Differential Revision: D16692908 Pulled By: soumith fbshipit-source-id: 3c9ba2eb1d1cf12763d6284daa4f18f605abb373	2019-08-07 12:47:53 -07:00
Edward Yang	a1d945b295	Roll master to 1.3.0 (#23895 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23895 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D16688489 Pulled By: ezyang fbshipit-source-id: a56d0180a0bc57775badd9e31ea3d441d5fd4f88	2019-08-07 08:44:32 -07:00
Soumith Chintala	6313d5e28b	add appropriate install_requires (#23722 ) Summary: This adds: - dependency on numpy if compiled with numpy support - dependency on future if python <= 2.7 Fixes https://github.com/pytorch/pytorch/issues/23670 Pull Request resolved: https://github.com/pytorch/pytorch/pull/23722 Differential Revision: D16643824 Pulled By: soumith fbshipit-source-id: 5cf4d79cd188678cb2328c4286eabd52a2a86fcd	2019-08-04 17:24:19 -07:00
Soumith Chintala	dded794eeb	add setup metadata to help PyPI flesh out content on pypi package page (#22085 ) Summary: add setup metadata to help PyPI flesh out content on pypi package page. Apparently this might help flesh out the "Used By" feature according to driazati Pull Request resolved: https://github.com/pytorch/pytorch/pull/22085 Differential Revision: D16604703 Pulled By: soumith fbshipit-source-id: ddb4f7ba7c24fdf718260aed28cc7bc9afb46de9	2019-08-01 12:15:56 -07:00
Ilia Cherniavskii	74f8094ea5	Rename threading build options Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23407 Test Plan: USE_CUDA=0 ATEN_THREADING=TBB USE_OPENMP=0 USE_TBB=1 MKL_THREADING=TBB BLAS=MKL USE_MKLDNN=1 MKLDNN_THREADING=TBB BUILD_BINARY=1 python setup.py develop install --cmake ./build/bin/parallel_info Imported from OSS Differential Revision: D16522538 Pulled By: ilia-cher fbshipit-source-id: 75c4761d93a7f5936f28e4c5eedcd27d8490d0c5	2019-07-26 13:09:14 -07:00
Hong Xu	82545ecc71	Specify build dir as a global variable in BUILD_DIR in the build system. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/23318 Test Plan: Imported from OSS Differential Revision: D16493987 Pulled By: ezyang fbshipit-source-id: 497e9dd924280f61dde095b4f2b50f5402d9da97	2019-07-25 07:19:47 -07:00
Hong Xu	fd1d06e317	Let Python build scripts accept both CMAKE_BUILD_TYPE and the oldschool DEBUG and REL_WITH_DEB_INFO variables. (#22875 ) Summary: Currently the build type is decided by the environment variable DEBUG and REL_WITH_DEB_INFO. This commit also lets CMAKE_BUILD_TYPE be effective. This makes the interface more consistent with CMake. This also prepares https://github.com/pytorch/pytorch/issues/22776. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22875 Differential Revision: D16281663 Pulled By: ezyang fbshipit-source-id: 952f92aad85ff59f1c7abe8256eca8a4a0936026	2019-07-24 08:07:47 -07:00
Hong Xu	60c46dd4df	Let CMake handle NCCL detection instead of our handcrafted Python script. (#22930 ) Summary: --- How does the current code subsume all detections in the deleted `nccl.py`? - The dependency of `USE_NCCL` on the OS and `USE_CUDA` is handled as dependency options in `CMakeLists.txt`. - The main NCCL detection happens in [FindNCCL.cmake](`8377d4b32c/cmake/Modules/FindNCCL.cmake`), which is called by [nccl.cmake](`8377d4b32c/cmake/External/nccl.cmake`). When `USE_SYSTEM_NCCL` is false, the previous Python code defer the detection to `find_package(NCCL)`. The change in `nccl.cmake` retains this. - `USE_STATIC_NCCL` in the previous Python code simply changes the name of the detected library. This is done in `IF (USE_STATIC_NCCL)`. - Now we only need to look at how the lines below line 20 in `nccl.cmake` are subsumed. These lines list paths to header and library directories that NCCL headers and libraries may reside in and try to search these directories for the key header and library files in turn. These are done by `find_path` for headers and `find_library` for the library files in `FindNCCL.cmake`. * The call of [find_path](https://cmake.org/cmake/help/v3.8/command/find_path.html) (Search for `NO_DEFAULT_PATH` in the link) by default searches for headers in `<prefix>/include` for each `<prefix>` in `CMAKE_PREFIX_PATH` and `CMAKE_SYSTEM_PREFIX_PATH`. Like the Python code, this commit sets `CMAKE_PREFIX_PATH` to search for `<prefix>` in `NCCL_ROOT_DIR` and home to CUDA. `CMAKE_SYSTEM_PREFIX_PATH` includes the standard directories such as `/usr/local` and `/usr`. `NCCL_INCLUDE_DIR` is also specifically handled. * Similarly, the call of [find_library](https://cmake.org/cmake/help/v3.8/command/find_library.html) (Search for `NO_DEFAULT_PATH` in the link) by default searches for libraries in directories including `<prefix>/lib` for each `<prefix>` in `CMAKE_PREFIX_PATH` and `CMAKE_SYSTEM_PREFIX_PATH`. But it also handles the edge cases intended to be solved in the Python code more properly: - It only searches for `<prefix>/lib64` (and `<prefix>/lib32`) if it is appropriate on the system. - It only searches for `<prefix>/lib/<arch>` for the right `<arch>`, unlike the Python code searches for `lib/<arch>` in a generic way (e.g., the Python code searches for `/usr/lib/x86_64-linux-gnu` but in reality systems have `/usr/lib/x86_64-some-customized-name-linux-gnu`, see https://unix.stackexchange.com/a/226180/38242 ). --- Regarding for relevant issues: - https://github.com/pytorch/pytorch/issues/12063 and https://github.com/pytorch/pytorch/issues/2877: These are properly handled, as explained in the updated comment. - https://github.com/pytorch/pytorch/issues/2941 does not changes NCCL detection specifically for Windows (it changed CUDA detection). - `b7e258f81e` A versioned library detection is added, but the order is reversed: The unversioned library becomes preferred. This is because normally unversioned libraries are linked to versioned libraries and preferred by users, and local installation by users are often unversioned. Like the document of [find_library](https://cmake.org/cmake/help/v3.8/command/find_library.html) suggests: > When using this to specify names with and without a version suffix, we recommend specifying the unversioned name first so that locally-built packages can be found before those provided by distributions. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22930 Differential Revision: D16440275 Pulled By: ezyang fbshipit-source-id: 11fe80743d4fe89b1ed6f96d5d996496e8ec01aa	2019-07-23 08:45:51 -07:00
Edward Yang	798d5d9771	Revert D16281714: Add sanity checks for NCCL detection. Differential Revision: D16281714 Original commit changeset: 396bcbf099bd fbshipit-source-id: a22cc112d1b6a62d689f9d8a7f93e8be3abe2a44	2019-07-16 13:58:27 -07:00
Hong Xu	e2046f8c1d	Add sanity checks for NCCL detection. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22819 Test Plan: Imported from OSS Differential Revision: D16281714 Pulled By: ezyang fbshipit-source-id: 396bcbf099bd07b996cf779c6b43092096b52d90	2019-07-16 11:32:32 -07:00
Hui Wu	07ef85e326	Add USE_MKLDNN_CBLAS build option. (#19014 ) Summary: MKL-DNN is the main library for computation when we use ideep device. It can use kernels implemented by different algorithms (including JIT, CBLAS, etc.) for computation. We add the "USE_MKLDNN_CBLAS" (default OFF) build option so that users can decide whether to use CBLAS computation methods or not. Pull Request resolved: https://github.com/pytorch/pytorch/pull/19014 Differential Revision: D16094090 Pulled By: ezyang fbshipit-source-id: 3f0b1d1a59a327ea0d1456e2752f2edd78d96ccc	2019-07-02 12:29:54 -07:00
Hong Xu	b9ede6600e	Remove the USE_MIOPEN build option as MIOpen is always used when built with ROCm. (#22420 ) Summary: Close https://github.com/pytorch/pytorch/issues/22200 Pull Request resolved: https://github.com/pytorch/pytorch/pull/22420 Differential Revision: D16087538 Pulled By: bddppq fbshipit-source-id: ecf3e7eb8213bb093e1c5290d096c233284a2ff9	2019-07-02 00:05:59 -07:00
Jon Malmaud	bfeff1eb8f	Stubs for torch.nn (#19089 ) Summary: Closes https://github.com/pytorch/pytorch/issues/18724 Pull Request resolved: https://github.com/pytorch/pytorch/pull/19089 Differential Revision: D16073654 Pulled By: ezyang fbshipit-source-id: 5642179651ce45ab7c5a46cc1fcc4fd6b37fa71c	2019-07-01 09:50:17 -07:00
Pieter Noordhuis	6ff0c6ca3f	Remove THD (#22065 ) Summary: It's been ~9 months since moving THD to the `torch.distributed.deprecated` namespace (see https://github.com/pytorch/pytorch/issues/11405) and we haven't seen issues related to it, so it's time to remove it. Closes https://github.com/pytorch/pytorch/issues/18967. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22065 Reviewed By: mrshenli Differential Revision: D15983669 Pulled By: pietern fbshipit-source-id: 2a2f5866f9a63040bc7cef3956d5fd215aba7165	2019-06-25 12:19:13 -07:00
Ilia Cherniavskii	6350dbddd1	Fix sequential MKL case (#22062 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22062 ghimport-source-id: a30255d7453c4ffecf40215a785c1e06b7296368 Test Plan: USE_CUDA=0 PARALLEL_BACKEND=OPENMP BLAS=MKL USE_MKLDNN=1 MKL_SEQ=1 MKLDNN_THREADING=SEQ BUILD_BINARY=1 python setup.py develop --cmake ./build/bin/parallel_info Imported from OSS Differential Revision: D15938079 Pulled By: ilia-cher fbshipit-source-id: e7ef0c5bc75ebb845ebe66bf76a4070d45305b35	2019-06-24 12:56:43 -07:00
Hong Xu	0408697317	Followup cleanup in cmake.py and add a comment in setup.py (#21792 ) Summary: Following up `b811b6d5c0` * Use property instead of __setattr__ in CMake. * Add a comment clarifying when built_ext.run is called. --- cc ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/21792 Differential Revision: D15860606 Pulled By: umanwizard fbshipit-source-id: ba1fa07f58d4eac81ac27fa9dc7115d1cdd3dec0	2019-06-17 13:46:25 -07:00
Hong Xu	b811b6d5c0	When building extensions, honor options set in CMake. (#21653 ) Summary: Currently when building extensions, variables such as USE_CUDA, USE_CUDNN are used to determine what libraries should be linked. But we should use what CMake has detected, because: 1. If CMake found them unavailable but the variables say some libraries should be linked, the build would fail. 2. If the first build is made using a set of non-default build options, rebuild must have these option passed to setup.py again, otherwise the extension build process is inconsistent with CMake. For example, ```bash # First build USE_CUDA=0 python setup.py install # Subsequent builds like this would fail, unless "build/" is deleted python setup.py install ``` This commit addresses the above issues by using variables from CMakeCache.txt when building the extensions. --- The changes in `setup.py` may look lengthy, but the biggest changed block is mostly moving them into a function `configure_extension_build` (along with some variable names changed to `cmake_cache_vars['variable name']` and other minor changes), because it must be called after CMake has been called (and thus the options used and system environment detected by CMake become available). Pull Request resolved: https://github.com/pytorch/pytorch/pull/21653 Differential Revision: D15824506 Pulled By: ezyang fbshipit-source-id: 1e1eb7eec7debba30738f65472ccad966ee74028	2019-06-14 08:13:40 -07:00
Ilia Cherniavskii	5485f09f18	Native TBB parallel backend (#20480 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20480 ghimport-source-id: c710f897c4c9b9616fc3dd76d80b4845aea43a1f Differential Revision: D15333692 Pulled By: ilia-cher fbshipit-source-id: 61e476dd5c737fe144e3aec000d8ebb11fbc0547	2019-06-13 10:11:16 -07:00
Karl Ostmo	49481d576d	Torch rename (#20774 ) Summary: This renames the CMake `caffe2` target to `torch`, as well as renaming `caffe2_gpu` to `torch_gpu` (and likewise for other gpu target variants). Many intermediate variables that don't manifest as artifacts of the build remain for now with the "caffe2" name; a complete purge of `caffe2` from CMake variable names is beyond the scope of this PR. The shell `libtorch` library that had been introduced as a stopgap in https://github.com/pytorch/pytorch/issues/17783 is again flattened in this PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20774 Differential Revision: D15769965 Pulled By: kostmo fbshipit-source-id: b86e8c410099f90be0468e30176207d3ad40c821	2019-06-12 20:12:34 -07:00
Hong Xu	646a7f99bb	Move management of calls of "cmake --build" to setup_helper/cmake.py and refactor as a CMake class Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21493 Differential Revision: D15759279 Pulled By: ezyang fbshipit-source-id: 157e1de36f1c5a51caf2a25b363a94369c442012	2019-06-11 07:04:05 -07:00
Hong Xu	240d62fbaa	Move redundant code that checks NumPy during build to a helper module and add an option to disable building with NumPy Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21417 Reviewed By: ezyang Differential Revision: D15694357 Pulled By: fmassa fbshipit-source-id: bc1bda23349ba4531f19619fa4adecb846225c20	2019-06-06 08:15:19 -07:00
Hong Xu	9a989ec469	Add an option to stop the build process once cmake terminates. (#21034 ) Summary: Add an option to setup.py to stop the build process once cmake terminates. This leaves users a chance to fine adjust build options. Also update README accordingly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21034 Differential Revision: D15530096 Pulled By: soumith fbshipit-source-id: 71ac6ff8483c3ee77c38d88f0d059db53a7d3901	2019-05-28 17:11:00 -07:00
Ilia Cherniavskii	580eab6562	Restore TBB module (#20454 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20454 ghimport-source-id: 14aca1dedbe647d41e55e7538a6b7eeab0fc4384 Differential Revision: D15326062 Pulled By: ilia-cher fbshipit-source-id: 02b005a679b10dc7a264978e87a8d2bb98ab972f	2019-05-28 02:49:36 -07:00
Ilia Cherniavskii	82aecfad6a	Native ATen/Parallel backend (#20087 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20087 ghimport-source-id: bcfc8a86abe0893e4a380fe6f6123e2082ba4317 Differential Revision: D15248663 Pulled By: ilia-cher fbshipit-source-id: fdb7a8860c85d8202026b629cb7fa344782bd2c4	2019-05-28 01:40:54 -07:00
Hong Xu	1e8f129a05	In setup.py, also check some submodules of submodules. (#20937 ) Summary: Sometimes users forget using the "--recursive" option when they update submodules. This added check should help expose this issue. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20937 Differential Revision: D15502846 Pulled By: mrshenli fbshipit-source-id: 34c28a2c71ee6442d16b8b741ea44a18733b1536	2019-05-26 18:43:24 -07:00
Gregory Chanan	47043220ee	Update version strings to 1.2 Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20812 Differential Revision: D15451892 Pulled By: gchanan fbshipit-source-id: 07355dbd446053a69b5cf4e3be1842aa1075c71f	2019-05-24 11:07:29 -07:00
Ilia Cherniavskii	c3d05e86cc	Resend "Split ATen/Parallel into interface and backend" (#20825 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20825 ghimport-source-id: 0371fbd37cb37635647d473d5ac9f2859e787061 Differential Revision: D15458073 Pulled By: ilia-cher fbshipit-source-id: cd27d0da1691f6be1183cd152348ac0d93a53996	2019-05-24 02:03:06 -07:00
Hong Xu	795a1a6ffa	When detecting numpy, assign relavant variables outside the try block (#20739 ) Summary: When detecting the presence of NumPy using import, move numpy-related variable assignments outside the try block (i.e., to an else block) to improve readability. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20739 Differential Revision: D15453916 Pulled By: ezyang fbshipit-source-id: d3c37f2b290846be3c6a1462251cbb3e95d493be	2019-05-22 11:27:36 -07:00
Edward Yang	fd95947e68	Revert D15248618: Split ATen/Parallel into interface and backend Differential Revision: D15248618 Original commit changeset: 060879266bc8 fbshipit-source-id: fc5cbb030b87613c9e15100118c3d4a064097c20	2019-05-22 09:55:51 -07:00
Ilia Cherniavskii	c4a3b4d528	Split ATen/Parallel into interface and backend (#20057 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20057 ghimport-source-id: c583f61bf661c994eb4d0625748a299e892a7246 Differential Revision: D15248618 Pulled By: ilia-cher fbshipit-source-id: 060879266bc8616916fe220adef6ae6c0b076fbd	2019-05-21 19:15:47 -07:00
Ilia Cherniavskii	481b6d0268	Allow a non-OpenMP based build (#19749 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19749 ghimport-source-id: a6636c0acddbdc5fd5b0dcb20b9f80cbdb9159b9 Differential Revision: D15141993 Pulled By: ilia-cher fbshipit-source-id: 96085608398b2a4c97c68b2948f5184d07f9ad3d	2019-05-06 19:34:48 -07:00
Bram Wasti	035966d538	Add options to Operator to enable registration of alias analysis passes (#19382 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19382 ghimport-source-id: aeaad3b84ea20dd95b38635ca28c5ff657187909 Differential Revision: D14990873 Pulled By: bwasti fbshipit-source-id: e1292ac8358ca8ff5bad8d8aeaddf06c23e66067	2019-05-06 15:40:13 -07:00
Jon Malmaud	0565141728	Type annotations for `util.data`. (#18963 ) Summary: I haven't had a chance to rigorously try these out yet so don't merge yet. Closes #18725. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18963 Differential Revision: D14832897 Pulled By: ezyang fbshipit-source-id: 4780e7a34126bc66ddbfd9d808dfc9e0edd77e68	2019-04-08 09:52:53 -07:00
Jon Malmaud	1b25fdbcd0	More type stubs (#18511 ) Summary: Added stubs for: * The `device` module * The `cuda` module * Parts of the `optim` module * Began adding stubs for the `autograd` module. I'll annotate more later but `no_grad` and friends are probably the most used exports from it so it seemed like a good place to start. This would close #16996, although comments on that issue reference other missing stubs so maybe it's worth keeping open as an umbrella issue. The big remaining missing package is `nn`. Also added a `py.typed` file so mypy will pick up on the type stubs. That closes #17639. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18511 Differential Revision: D14715053 Pulled By: ezyang fbshipit-source-id: 9e4882ac997063650e6ce47604b3eaf1232c61c9	2019-04-01 16:03:58 -07:00
Shuichi KITAGUCHI	ddbfdc911d	Create torch/lib directory before copying _C.lib on Windows environment. (#18666 ) Summary: `python setup.py develop` fails with following messages. ~~~ ... -- Building with NumPy bindings -- Not using cuDNN -- Not using MIOpen -- Not using CUDA -- Using MKLDNN -- Not using NCCL -- Building without distributed package Copying extension caffe2.python.caffe2_pybind11_state Copying caffe2.python.caffe2_pybind11_state from torch\Lib\site-packages\caffe2\python\caffe2_pybind11_state.cp37-win_amd64.pyd to C:\data\source\pytorch\build\lib.win-amd64-3.7\caffe2\python\caffe2_pybind11_state.cp37-win_amd64.pyd copying torch\Lib\site-packages\caffe2\python\caffe2_pybind11_state.cp37-win_amd64.pyd -> C:\data\source\pytorch\build\lib.win-amd64-3.7\caffe2\python building 'torch._C' extension creating build\temp.win-amd64-3.7 creating build\temp.win-amd64-3.7\Release creating build\temp.win-amd64-3.7\Release\torch creating build\temp.win-amd64-3.7\Release\torch\csrc ... creating C:\data\source\pytorch\build\lib.win-amd64-3.7\torch C:\Program Files (x86)\Microsoft Visual Studio\2017\Professional\VC\Tools\MSVC\14.16.27023\bin\HostX64\x64\link.exe /nologo /INCREMENTAL:NO /LTCG /nodefaultlib:libucrt.lib ucrt.lib /DLL /MANIFEST:EMBED,ID=2 /MANIFESTUAC:NO /LIBPATH:C:\data\source\pytorch\torch\lib /LIBPATH:C:\data\dlenv\libs /LIBPATH:C:\data\dlenv\PCbuild\amd64 "/LIBPATH:C:\Program Files (x86)\Microsoft Visual Studio\2017\Professional\VC\Tools\MSVC\14.16.27023\ATLMFC\lib\x64" "/LIBPATH:C:\Program Files (x86)\Microsoft Visual Studio\2017\Professional\VC\Tools\MSVC\14.16.27023\lib\x64" "/LIBPATH:C:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\lib\um\x64" "/LIBPATH:C:\Program Files (x86)\Windows Kits\10\lib\10.0.17763.0\ucrt\x64" "/LIBPATH:C:\Program Files (x86)\Windows Kits\10\lib\10.0.17763.0\um\x64" shm.lib torch_python.lib /EXPORT:PyInit__C build\temp.win-amd64-3.7\Release\torch/csrc/stub.obj /OUT:build\lib.win-amd64-3.7\torch\_C.cp37-win_amd64.pyd /IMPLIB:build\temp.win-amd64-3.7\Release\torch/csrc\_C.cp37-win_amd64.lib /NODEFAULTLIB:LIBCMT.LIB ライブラリ build\temp.win-amd64-3.7\Release\torch/csrc\_C.cp37-win_amd64.lib とオブジェクト build\temp.win-amd64-3.7\Release\torch/csrc\_C.cp37-win_amd64.exp を作成中コード生成しています。コード生成が終了しました。 copying build\lib.win-amd64-3.7\torch\_C.cp37-win_amd64.pyd -> torch copying build\lib.win-amd64-3.7\caffe2\python\caffe2_pybind11_state.cp37-win_amd64.pyd -> caffe2\python copying build/temp.win-amd64-3.7/Release/torch/csrc/_C.cp37-win_amd64.lib -> build/lib.win-amd64-3.7/torch/lib/_C.lib error: could not create 'build/lib.win-amd64-3.7/torch/lib/_C.lib': No such file or directory ~~~ When `python setup.py install` is executed, `torch/lib` has been created by previous process (copying many files) and this copy succeeds. But in develop mode, that process does not executed and this copy fails. This patch creates `torch/lib` directory if do not exist. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18666 Differential Revision: D14704269 Pulled By: ezyang fbshipit-source-id: b2d7c698a906b945bf34bb78f17b91b4fdfd3294	2019-04-01 07:28:08 -07:00
Edward Yang	173f224570	Turn on F401: Unused import warning. (#18598 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18598 ghimport-source-id: c74597e5e7437e94a43c163cee0639b20d0d0c6a Stack from [ghstack](https://github.com/ezyang/ghstack): * #18598 Turn on F401: Unused import warning. This was requested by someone at Facebook; this lint is turned on for Facebook by default. "Sure, why not." I had to noqa a number of imports in __init__. Hypothetically we're supposed to use __all__ in this case, but I was too lazy to fix it. Left for future work. Be careful! flake8-2 and flake8-3 behave differently with respect to import resolution for # type: comments. flake8-3 will report an import unused; flake8-2 will not. For now, I just noqa'd all these sites. All the changes were done by hand. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D14687478 fbshipit-source-id: 30d532381e914091aadfa0d2a5a89404819663e3	2019-03-30 09:01:17 -07:00
Gao, Xiang	a40e0a7f2d	Add torch.version.git_version (#18299 ) Summary: Fixes: https://github.com/pytorch/pytorch/issues/18293 cc: colesbury Pull Request resolved: https://github.com/pytorch/pytorch/pull/18299 Differential Revision: D14611972 Pulled By: soumith fbshipit-source-id: cdb48ef37c8869713a9a43ea0da08e1bed9279a2	2019-03-25 19:59:40 -07:00
Sebastian Messmer	daa77c6e26	Move schema inference to c10 (#18090 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18090 This schema inference is needed by the c10 operator registration mechanism. Move it to c10. It is going to be used by diffs stacked on top. Reviewed By: ezyang Differential Revision: D14491454 fbshipit-source-id: 0f8ddcdbd91467c8347d315dd443a1ca8b216481	2019-03-21 14:57:30 -07:00
peter	906f9efc57	Revert "Add check for x64 Python before setup (#17707 )" (#17864 ) Summary: This reverts commit `08fb9021da`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17864 Differential Revision: D14404920 Pulled By: soumith fbshipit-source-id: d41fc06e249f3437d4f80d1d6a5fdbd44c90462b	2019-03-11 08:52:13 -07:00
peter	08fb9021da	Add check for x64 Python before setup (#17707 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/17657. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17707 Differential Revision: D14346705 Pulled By: ezyang fbshipit-source-id: 5daafacdb99eb9a9c6517263d10f20c79f920d24	2019-03-06 10:48:16 -08:00
Lu Fang	9e08c998db	Throw exception when foxi is not checked out (#17477 ) Summary: Add check and provide useful warning/error information to user if foxi is not checked out. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17477 Reviewed By: zrphercule Differential Revision: D14212896 Pulled By: houseroad fbshipit-source-id: 557247d5d8fdc016b1c24c2a21503e59f874ad09	2019-02-25 14:39:24 -08:00
Vishwak Srinivasan	9e69703dac	USE_ --> BUILD_ for CAFFE2_OPS and TEST (#17390 ) Differential Revision: D14195572 Pulled By: soumith fbshipit-source-id: 28e4ff3fe03a151cd4ed014c64253389cb85de3e	2019-02-22 17:19:44 -08:00
Zachary DeVito	356a94b64e	Lazily load libcuda libnvrtc from c++ (#17317 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/16860 Pull Request resolved: https://github.com/pytorch/pytorch/pull/17317 Differential Revision: D14157877 Pulled By: zdevito fbshipit-source-id: c37aec2d77c2e637d4fc6ceffe2bd32901c70317	2019-02-22 13:51:45 -08:00
Soumith Chintala	3069c45069	upgrade documentation in setup.py to NO_ -> USE_ (#17333 ) Summary: fixes https://github.com/pytorch/pytorch/issues/17265 Pull Request resolved: https://github.com/pytorch/pytorch/pull/17333 Differential Revision: D14168483 Pulled By: soumith fbshipit-source-id: a79f4f9d9e18cb64e2f56f777caa69ae92d2fa4b	2019-02-21 10:25:43 -08:00
Tri Dao	37890610b0	Include vec256 headers in setup.py (#17220 ) Summary: Fix #16650. Headers such as `ATen/cpu/vml.h` contain `#include <ATen/cpu/vec256/vec256.h>` for example, but these vec256 headers aren't included, due to commit `e4c0bb1`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17220 Differential Revision: D14165695 Pulled By: ezyang fbshipit-source-id: 27b2aa2a734b3719ca4af0565f79623b64b2620f	2019-02-21 07:37:01 -08:00
Elias Ellison	89df22e57b	Lightweight String check Utility (#16858 ) Summary: light weight implementation of LLVM filecheck utility. Currently only handles string matching - regexes & saving a regex to a variable name can be added as needed. Current intended usage is through FileCheckBuilder python handle, and is shown in the tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16858 Differential Revision: D14096244 Pulled By: eellison fbshipit-source-id: c7c8d1457691c105e6ccbb3c1a378d96baac2569	2019-02-19 12:31:57 -08:00
Dmytro Dzhulgakov	5a26579e27	Add more headers to setup.py to make pytorch/benchmark work (#16890 ) Summary: Since we don't do tmp_install any more it's better to include all necessary headers. cc kostmo for better suggestions of how to list all headers here Pull Request resolved: https://github.com/pytorch/pytorch/pull/16890 Differential Revision: D14079848 Pulled By: dzhulgakov fbshipit-source-id: 4522c80d05e5d91f99f6700cde46cac559330d28	2019-02-13 23:14:36 -08:00
Simeon Monov	bad4442a7c	Parse the command line and check the arguments before build_deps() (#16914 ) Summary: This is needed to check for wrong arguments or --help options before `build_deps()` is executed. Otherwise command line arguments are not parsed and checked until `setup()` is run. Fixes: #16707 Pull Request resolved: https://github.com/pytorch/pytorch/pull/16914 Differential Revision: D14041236 Pulled By: soumith fbshipit-source-id: 41f635772ccf47f05114775d5a19ae04c495ab3b	2019-02-12 00:15:42 -08:00
Zachary DeVito	21193bf123	try to get rid of tmp_install (#16414 ) Summary: Rehash of previous attempts. This tries a different approach where we accept the install as specified in cmake (leaving bin/ include/ and lib/ alone), and then try to adjust the rest of the files to this more standard layout. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16414 Differential Revision: D13863635 Pulled By: zdevito fbshipit-source-id: 23725f5c64d7509bf3ca8f472dcdcad074de9828	2019-01-29 17:29:40 -08:00
Thomas Viehmann	6a6983ed7f	create type hint stub files for module torch (#12500 ) Summary: We have: - This is an initial stab at creating a type stub `torch/__init__.pyi` . - This is only tested on Python 3, since that's the only Python version mypy works on. - So far, we only aim at doing this for torch functions and torch.Tensor. - Quite a few methods and functions have to be typed manually. These are done in `torch/__init__.pyi.in` For me, PyCharm (the non-paid one) didn't seem to indicate errors in the .pyi when opening and seemed to be able to get the type hint for the few functions I tried, but I don't use PyCharm for my usual PyTorch activities, so I didn't extensively try this out. An example of a generated PYI is at [this gist](https://gist.github.com/ezyang/bf9b6a5fa8827c52152858169bcb61b1). Pull Request resolved: https://github.com/pytorch/pytorch/pull/12500 Differential Revision: D13695553 Pulled By: ezyang fbshipit-source-id: 4566c71913ede4e4c23ebc4a72c17151f94e8e21	2019-01-29 12:14:17 -08:00
Zachary DeVito	9477a5d9c8	Remove bash from build (#16289 ) Summary: This commit removes the dependency on `build_pytorch_libs.sh` by moving the remaining functionality that is not expressible in cmake into python. Removing the indirection through bash also removes over 300 lines of environment munging code that is incredibly hard to understand because it passes a lot of secret parameters through `os.env`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16289 Reviewed By: ezyang Differential Revision: D13821662 Pulled By: zdevito fbshipit-source-id: d658d26925e3b1169ac1e3d44a159cf8a1f0d9b1	2019-01-25 16:03:53 -08:00
Zachary DeVito	0cd1ab82b0	Remove dead code from setup.py, remove need for build target. (#16162 ) Summary: Now it is only necessary to use 'develop' or 'install' to build. Incremental cmake is on by default. `develop --cmake` forces it to rerun. The NinjaBuilder stuff is dead. It was used to make building _C.so faster but now _C.so is just an empty stub file. Removed a bunch of custom build commands from setup.py that are no longer meaningful now that cmake handles most of the build. Removed unused targets in build_pytorch_lib.sh/bat Pull Request resolved: https://github.com/pytorch/pytorch/pull/16162 Differential Revision: D13744155 Pulled By: zdevito fbshipit-source-id: d836484782c65b7f8e8c7a82620886f7a7777892	2019-01-21 17:27:56 -08:00
Zachary DeVito	b5c733324c	Fix RERUN_CMAKE Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16132 Differential Revision: D13726816 Pulled By: zdevito fbshipit-source-id: 26ad70651b0138642ad5240670f5c452018c13a2	2019-01-18 00:04:31 -08:00

1 2 3 4 5 ...

592 Commits