Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42249
The main change is to bring Caffe2's superior error messages for CUDA initialization into c10 and use them in all code paths.
Basic logic:
| Case | Call to device_count() | init_cuda, e.g. allocating a tensor |
| -- | -- | -- |
| all good | non-zero | just works |
| no GPUs | 0, no warning | throw exception with good message |
| driver issues | 0, produce warning | throw exception with good message |
| out of memory with ASAN | 0, produce warning | throw exception with ASAN message |
Previously, the error thrown from init_cuda was very generic and the ASAN warning (if any) was buried in the logs.
Other cleanup changes:
* always cache device_count() in a static variable (see the sketch after this list)
* move all ASAN macros into c10
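A minimal sketch of the device_count() behavior from the table above (the shape here is assumed for illustration; the real implementation lives in c10/cuda):
```cpp
#include <cuda_runtime.h>
#include <c10/util/Logging.h>

namespace c10 { namespace cuda {

int device_count() {
  // Function-local static: the runtime query runs exactly once and the
  // result is cached for every later call.
  static int count = []() -> int {
    int n = 0;
    cudaError_t err = cudaGetDeviceCount(&n);
    if (err == cudaErrorNoDevice) {
      return 0;  // no GPUs: return 0 silently
    } else if (err != cudaSuccess) {
      // Driver issues (or ASAN-induced failures): return 0 but leave a
      // breadcrumb in the logs; init_cuda later throws a good message.
      LOG(WARNING) << "CUDA initialization failure: "
                   << cudaGetErrorString(err);
      return 0;
    }
    return n;
  }();
  return count;
}

}} // namespace c10::cuda
```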
Test Plan:
Hard to unit-test because of build modes. Verified manually that the behavior from the table above holds by running the following script in different modes (ASAN/no-ASAN, CUDA_VISIBLE_DEVICES=):
```
print('before import')
import torch
print('after import')
print('devices: ', torch.cuda.device_count())
x = torch.tensor([1,2,3])
print('tensor creation')
x = x.cuda()
print('moved to cuda')
```
Reviewed By: ngimel
Differential Revision: D22824329
fbshipit-source-id: 5314007313a3897fc955b02f8b21b661ae35fdf5
Summary:
* Make c10::cuda functions regular non-inlined functions
* Add driver_version() and device_synchronize() functions
With this change I no longer see direct calls to the CUDA API when looking at Modules.cpp.obj.
FYI malfet
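For illustration, the rough shape of such out-of-line wrappers (assumed signatures, not the exact c10 code):
```cpp
// CUDAFunctions.h -- declarations only, so translation units like
// Modules.cpp call into c10 instead of the CUDA runtime directly.
namespace c10 { namespace cuda {
int32_t driver_version();
void device_synchronize();
}} // namespace c10::cuda

// CUDAFunctions.cpp -- the only place that touches the CUDA API.
#include <cuda_runtime.h>
#include <c10/cuda/CUDAException.h>  // C10_CUDA_CHECK

namespace c10 { namespace cuda {
int32_t driver_version() {
  int version = -1;
  C10_CUDA_CHECK(cudaDriverGetVersion(&version));
  return version;
}
void device_synchronize() {
  C10_CUDA_CHECK(cudaDeviceSynchronize());
}
}} // namespace c10::cuda
```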
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42251
Reviewed By: malfet
Differential Revision: D22826505
Pulled By: ziab
fbshipit-source-id: 8dc2f3e209d3710e2ce78411982a10e8c727573c
Summary:
This works around an issue in hipclang with templated kernel names passed as arguments to hipLaunchKernelGGL.
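For context, a minimal sketch of the pitfall templated kernel names pose for hipLaunchKernelGGL (illustrative kernel; the PR's actual workaround may differ):
```cpp
#include <hip/hip_runtime.h>

template <typename T, int N>
__global__ void scale_kernel(T* data, T factor) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < N) data[i] *= factor;
}

void launch(float* data, hipStream_t stream) {
  // hipLaunchKernelGGL is a macro, so the comma inside
  // scale_kernel<float, 256> is parsed as a macro-argument separator:
  //   hipLaunchKernelGGL(scale_kernel<float, 256>, ...);  // breaks
  // HIP's HIP_KERNEL_NAME wrapper keeps the templated name a single
  // macro argument:
  hipLaunchKernelGGL(HIP_KERNEL_NAME(scale_kernel<float, 256>),
                     dim3(1), dim3(256), 0, stream, data, 2.0f);
}
```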
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41022
Differential Revision: D22404183
Pulled By: ngimel
fbshipit-source-id: 63135ccb9e087f4c8e8663ed383979f7e2c1ba06
Summary:
Changes in PR https://github.com/pytorch/pytorch/issues/39759 broke HIP caffe2.
hipify renames CUDA to HIP for caffe2 sources; it does not for torch.
So if caffe2 calls into torch, it needs to use the CUDA-named functions.
CC ezyang xw285cornell sunway513 houseroad dzhulgakov
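A hedged illustration of the mismatch (not this PR's diff): inside a caffe2 file built for ROCm, caffe2's hipify rewrites the raw runtime call, while the torch-side symbol keeps its CUDA spelling:
```cpp
#include <ATen/cuda/CUDAContext.h>

void sync_current_stream() {
  // torch is not renamed by caffe2's hipify, so this call keeps its
  // CUDA name even in the HIP build.
  auto stream = at::cuda::getCurrentCUDAStream();
  // This raw runtime call, by contrast, becomes hipStreamSynchronize
  // after caffe2's hipify pass.
  cudaStreamSynchronize(stream);
}
```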
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39801
Differential Revision: D21982493
Pulled By: xw285cornell
fbshipit-source-id: 8e88e0fb80c71f0342e23ef0214a42d5542bdc70
Summary:
* Gets rid of some in-kernel asserts where they can be replaced with `static_assert`s.
* Replaces a bare in-kernel `assert` with `CUDA_KERNEL_ASSERT` where necessary.
* Replaces host-code `assert`s with `TORCH_INTERNAL_ASSERT`.
Another group of asserts is in the fractional max pooling kernels, which should be fixed regardless (https://github.com/pytorch/pytorch/issues/39044); the problems there are not just the asserts.
I've audited the remaining in-kernel asserts; they behave more like `TORCH_INTERNAL_ASSERT` and should not fire even on invalid user data, so I think it's fine to leave them as is.
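A minimal sketch of the three flavors above (hypothetical kernel, not code from this PR):
```cpp
#include <c10/macros/Macros.h>   // CUDA_KERNEL_ASSERT
#include <c10/util/Exception.h>  // TORCH_INTERNAL_ASSERT

template <typename T, int VecSize>
__global__ void copy_kernel(const T* src, T* dst, int64_t n) {
  // Invariant known at compile time: check it at build time for free.
  static_assert(VecSize > 0, "VecSize must be positive");
  int64_t i = (int64_t)blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) {
    // Data-dependent invariant in device code: CUDA_KERNEL_ASSERT stays
    // active even when NDEBUG compiles a bare assert() away.
    CUDA_KERNEL_ASSERT(src != nullptr && dst != nullptr);
    dst[i] = src[i];
  }
}

void copy(const float* src, float* dst, int64_t n, cudaStream_t stream) {
  // Host-side invariant: report through PyTorch's error machinery
  // instead of aborting the process via assert().
  TORCH_INTERNAL_ASSERT(n >= 0, "negative element count: ", n);
  const int threads = 256;
  const int blocks = static_cast<int>((n + threads - 1) / threads);
  copy_kernel<float, 4><<<blocks, threads, 0, stream>>>(src, dst, n);
}
```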
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39047
Differential Revision: D21750392
Pulled By: ngimel
fbshipit-source-id: e9417523a2c672284de3515933cb7ed166e56719
Summary:
CC ezyang xw285cornell
HIP from ROCm 3.5 renames `hipOccupancyMaxActiveBlocksPerMultiprocessor` to `hipModuleOccupancyMaxActiveBlocksPerMultiprocessor`. In addition, the API parameter types now match CUDA's. Add these changes in a backwards-compatible manner.
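One backwards-compatible shape such a change could take (a sketch only; the version gate shown is an assumption, and the parameter-type difference would still need handling at call sites):
```cpp
// On pre-3.5 ROCm only the old spelling exists; alias it so call sites
// can use the hipModule* name everywhere.
#if defined(__HIP_PLATFORM_HCC__) && ROCM_VERSION < 30500
#define hipModuleOccupancyMaxActiveBlocksPerMultiprocessor \
  hipOccupancyMaxActiveBlocksPerMultiprocessor
#endif
```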
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38551
Differential Revision: D21721832
Pulled By: ezyang
fbshipit-source-id: 6fc971845e363d7495d8be9550e76d0f082c3062
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35615
Python 2 has reached end-of-life and is no longer supported by PyTorch.
Now we can clean up a lot of cruft that we put in place to support it.
These changes were all done manually, and I skipped anything that seemed like it would take more than a few seconds, so I think it makes sense to review it manually as well (though using a side-by-side view and ignoring whitespace changes might be helpful).
Test Plan: CI
Differential Revision: D20842886
Pulled By: dreiss
fbshipit-source-id: 8cad4e87c45895e7ce3938a88e61157a79504aed
Summary:
This enables cpp_extensions.load/load_inline by hipify-ing the CUDA sources.
Also enables tests.
CuDNN/MIOpen extensions aren't supported yet; I propose not to tackle that in this PR.
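For illustration, the kind of CUDA source one might hand to load_inline (hypothetical kernel, not from this PR); when building for ROCm, the hipify pass rewrites the launch and any cuda* runtime calls to HIP before compilation:
```cpp
#include <torch/extension.h>

__global__ void add_one_kernel(float* x, int64_t n) {
  int64_t i = (int64_t)blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) x[i] += 1.0f;
}

torch::Tensor add_one(torch::Tensor x) {
  const int64_t n = x.numel();
  const int threads = 256;
  const int blocks = static_cast<int>((n + threads - 1) / threads);
  // hipify rewrites this <<<...>>> launch for the ROCm build.
  add_one_kernel<<<blocks, threads>>>(x.data_ptr<float>(), n);
  return x;
}
```
On the Python side, a source like this would go into torch.utils.cpp_extension.load_inline's cuda_sources argument, with a declaration in cpp_sources and functions=['add_one'] to generate the binding.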
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35897
Differential Revision: D20983279
Pulled By: ezyang
fbshipit-source-id: a5d0f5ac592d04488a6a46522c58e2ee0a6fd57c
Summary:
This pull request has changes for:
1. Enabling a torch module with HIP code to be compiled by cpp_extensions.py
2. Fixes to the hipify module so it can be used by a torch extension
cc: ezyang iotamudelta jeffdaily
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32669
Differential Revision: D20033893
Pulled By: zou3519
fbshipit-source-id: fd6ddc8cdcd3930f41008636bb2bc9dd26cdb008
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31151
Same as the title. I am not sure why this was not added in the first place.
Test Plan: wait for the build to succeed.
Reviewed By: bddppq, xw285cornell
Differential Revision: D18880216
fbshipit-source-id: 8b17d4fbd5dd08c28c52df8b1da77b69d56d65dc
Summary:
According to https://github.com/pytorch/pytorch/issues/27285, it seems we do not intend to use shebangs as an indication of the Python version, so
we enable the EXE001 flake8 check.
For violations, we either remove the shebang from non-executable Python scripts or grant them executable permission.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27560
Differential Revision: D17831782
Pulled By: ezyang
fbshipit-source-id: 6282fd3617b25676a6d959af0d318faf05c09b26