Summary: `IValue::toString()` constructs a new `c10::intrusive_ptr` (analogous to a `std::shared_ptr`) that `->string()` then immediately dereferences, so the temporary costs an atomic reference-count increment and decrement. Both operations can be skipped by calling `IValue::toStringRef()` instead.
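For illustration, the change amounts to the following (a minimal sketch; `example` is a hypothetical caller):
```cpp
#include <ATen/core/ivalue.h>

void example(const c10::IValue& iv) {
  // Before: toString() returns a temporary c10::intrusive_ptr<ConstantString>,
  // so the refcount is atomically bumped and then dropped just to reach the
  // underlying std::string.
  const std::string& viaPtr = iv.toString()->string();

  // After: toStringRef() borrows the underlying std::string directly,
  // with no intrusive_ptr construction and no refcount traffic.
  const std::string& viaRef = iv.toStringRef();
  (void)viaPtr;
  (void)viaRef;
}
```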
Test Plan: CI
Reviewed By: jaybean-dev
Differential Revision: D39605242
Pull Request resolved: https://github.com/pytorch/pytorch/pull/85437
Approved by: https://github.com/jfix71
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73676
For some reason https://github.com/pytorch/pytorch/pull/72637 got mangled during a rebase, so please refer to that PR for review history.
This PR creates a new workflow called `deploy-linux-xenial-cuda11.3-py3.7-gcc7` for torch::deploy tests.
For testing, go to https://www.torch-ci.com/pytorch/pytorch/pull/73676 and check that a build job and a test job appear under `deploy-linux-xenial-cuda11.3-py3.7-gcc7`.
Test Plan: Imported from OSS
Reviewed By: soulitzer
Differential Revision: D34586702
Pulled By: PaliC
fbshipit-source-id: 5627cf4ff411a4a04030f8b7726f84af979da213
(cherry picked from commit df6dddebb9fe078a6053a31033b5a40cc742fcf3)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73456
Replaces `at::Error` with a simpler implementation of exceptions in order to reduce torch::deploy's dependency on torch.
Note: internal testing/changes are still needed.
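For illustration, a minimal sketch of the kind of lightweight exception type that can stand in for `at::Error` (all names here are hypothetical, not the classes this PR actually introduces):
```cpp
#include <stdexcept>
#include <string>

namespace deploy_sketch {

// A self-contained error type: carries a message plus the throw site,
// without pulling in torch's full c10::Error machinery.
class Error : public std::runtime_error {
 public:
  Error(const std::string& msg, const char* file, int line)
      : std::runtime_error(
            std::string(file) + ":" + std::to_string(line) + ": " + msg) {}
};

} // namespace deploy_sketch

// Convenience check macro in the style of TORCH_CHECK (hypothetical).
#define DEPLOY_CHECK(cond, msg)                              \
  do {                                                       \
    if (!(cond)) {                                           \
      throw deploy_sketch::Error((msg), __FILE__, __LINE__); \
    }                                                        \
  } while (0)
```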
Test Plan: Imported from OSS
Reviewed By: samdow
Differential Revision: D34868005
Pulled By: PaliC
fbshipit-source-id: c8bb1f7a2b169b5a8e3b63a697e0ced748a0524c
(cherry picked from commit 51b3763d16e74458a5cfb8e4d660806dea897617)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67396
`frozen_numpy` did not work on GPU because we didn't add `register_frozennumpy` to the `:builtin_registry_cuda` target.
This was not caught earlier because the unit test added to test_deploy.cpp only runs on CPU; on GPU we run test_deploy_gpu.cpp, which does not contain the added numpy unit tests.
In this diff, I just duplicate the unit tests for numpy (and pyyaml) across test_deploy.cpp and test_deploy_gpu.cpp (see the sketch below).
Ideally we should consolidate these two files into one, so unit tests can be added in a single place and run on both hardware platforms.
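For illustration, the duplicated numpy check looks roughly like this (a sketch modeled on test_deploy.cpp's style; the real test's assertions may differ):
```cpp
#include <gtest/gtest.h>
#include <torch/csrc/deploy/deploy.h>

TEST(TorchDeployGPUTest, FrozenNumpy) {
  torch::deploy::InterpreterManager manager(1);
  auto I = manager.acquireOne();
  // If register_frozennumpy is missing from the builtin registry, this import
  // of the frozen numpy module fails at runtime on the GPU build.
  auto arr = I.global("numpy", "random").attr("rand")({3, 5});
  EXPECT_EQ(arr.attr("size").toIValue().toInt(), 15);
}
```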
Tracking task: T104399180
ghstack-source-id: 141750276
Test Plan: buck test mode/opt :test_deploy_gpu
Reviewed By: suo
Differential Revision: D31978156
fbshipit-source-id: 2f5cd55ca33107cc7d230b72f1353df81f0a3bda
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65861
First in a series. This PR changes the code in deploy.h/cpp and
interpreter_impl.h/cpp to use camelCase instead of snake_case. Starting
with this as it has the most impact on downstream users.
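For illustration, the shape of the rename (representative method names, not an exhaustive list):
```cpp
#include <torch/csrc/deploy/deploy.h>

void example(torch::deploy::ReplicatedObj& replicated) {
  torch::deploy::InterpreterManager manager(2);

  // Before this PR (snake_case):
  //   auto I = manager.acquire_one();
  //   auto obj = I.from_movable(replicated);

  // After this PR (camelCase):
  auto I = manager.acquireOne();
  auto obj = I.fromMovable(replicated);
  (void)obj;
}
```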
Test Plan: Imported from OSS
Reviewed By: shannonzhu
Differential Revision: D31291183
Pulled By: suo
fbshipit-source-id: ba6f74042947c9a08fb9cb3ad7276d8dbb5b2934
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63918
Previously we were building with `USE_DISTRIBUTED` off, because c10d was built as a separate library for historical reasons. Since then, lw has merged the c10d build into libtorch, so this is fairly easy to turn on.
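For illustration, the kind of check the new unit test might perform (a hypothetical sketch against torch::deploy's current API, not the actual test added here):
```cpp
#include <torch/csrc/deploy/deploy.h>

// With USE_DISTRIBUTED on, the embedded interpreter should be able to import
// torch.distributed and report that the c10d bindings are present.
bool distributedAvailableInDeploy() {
  torch::deploy::InterpreterManager manager(1);
  auto I = manager.acquireOne();
  auto noArgs = at::ArrayRef<torch::deploy::Obj>();
  return I.global("torch.distributed", "is_available")(noArgs)
      .toIValue()
      .toBool();
}
```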
Differential Revision: D30492442
**NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.intern.facebook.com/intern/diff/D30492442/)!
Test Plan: added a unit test
Reviewed By: wconstab
Pulled By: suo
fbshipit-source-id: 843b8fcf349a72a7f6fcbd1fcc8961268690fb8c
Summary:
As the GoogleTest `TEST` macro is non-compliant with the `cppcoreguidelines-avoid-non-const-global-variables` check, as is `DEFINE_DISPATCH`, drop the now-redundant `NOLINTNEXTLINE` suppressions for it.
All changes but the ones to `.clang-tidy` are generated using the following script:
```
for i in $(find . -type f \( -iname "*.c*" -o -iname "*.h" \) \
           | xargs grep -l cppcoreguidelines-avoid-non-const-global-variables \
           | sort -u); do
  # Delete the suppression comment in place; the check itself is gone.
  sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" "$i"
done
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008
Reviewed By: driazati, r-barnes
Differential Revision: D29838584
Pulled By: malfet
fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13
Summary:
This PR suppresses clang-tidy warnings in the codebase (for now) so that we can re-enable clang-tidy checks on master.
I ran this script to add the `NOLINTNEXTLINE` comments (on a devserver):
```bash
python3 setup.py develop
# Uses same script that's run on CI and adds the -j (parallel), -s (add comments), -k (continue if diagnostic errors are found) options
python3 tools/clang_tidy.py \
-j \
-s \
-k \
-v \
--paths torch/csrc/ \
-g"-torch/csrc/jit/passes/onnx/helper.cpp" \
-g"-torch/csrc/jit/passes/onnx/shape_type_inference.cpp" \
-g"-torch/csrc/jit/serialization/onnx.cpp" \
-g"-torch/csrc/jit/serialization/export.cpp" \
-g"-torch/csrc/jit/serialization/import.cpp" \
-g"-torch/csrc/jit/serialization/import_legacy.cpp" \
-g"-torch/csrc/onnx/init.cpp" \
-g"-torch/csrc/cuda/nccl.*" \
-g"-torch/csrc/cuda/python_nccl.cpp" \
-g"-torch/csrc/autograd/FunctionsManual.cpp" \
-g"-torch/csrc/generic/*.cpp" \
-g"-torch/csrc/jit/codegen/cuda/runtime/*" \
-g"-torch/csrc/deploy/interpreter/interpreter.cpp" \
-g"-torch/csrc/deploy/interpreter/interpreter.h" \
-g"-torch/csrc/deploy/interpreter/interpreter_impl.h" \
-g"-torch/csrc/deploy/interpreter/test_main.cpp"
```
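For reference, the `-s` option inserts suppression comments of this form above each flagged line (the declaration shown is illustrative):
```cpp
// NOLINTNEXTLINE(cppcoreguidelines-avoid-magic-numbers)
int timeout_ms = 5000;
```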
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60649
Test Plan: Verified changes by re-running the script (without the `-s` option) and seeing no warnings/errors.
Reviewed By: walterddr, janeyx99
Differential Revision: D29504258
Pulled By: 1ntEgr8
fbshipit-source-id: 78310b30ee8213b73ddb4771ad874665323e7a4e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59460
Original commit changeset: 6e01a96d3746
Test Plan: Verify new tests run in sandcastle and existing CI is OK
Reviewed By: H-Huang
Differential Revision: D28900869
fbshipit-source-id: a8962ec48c66bba3b4b8f001ece7231953b29e82
Summary:
Added GPU tests in previous diffs but had to disable them, as they only
passed locally on devgpu, not in sandcastle.
Note: local testing requires mode/dev-nosan, or else ASAN interferes with CUDA.
Test Plan: Verify tests passing in sandcastle.
Reviewed By: malfet
Differential Revision: D28538996
fbshipit-source-id: 1a6ccea07cfe2f150eee068594e636add620cd91
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/58493
In fbcode, we want torch::deploy to be a target that works with or without cuda, depending only on whether cuda is linked in the final binary. To enable this, we build both flavors of libinterpreter, and choose which to load at runtime depending on whether cuda is available in the application. This comes at a cost to binary size, as it includes two copies of libinterpreter instead of one. However, it does not require _loading_ two copies of libinterpreter into memory at runtime, so the memory footprint of the interpreter (which we make N copies of) is not impacted.
In oss/cmake, this change is a no-op: cuda is already handled there by building just one libinterpreter, with cuda enabled or disabled for the whole pytorch build based on a global cmake flag.
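For illustration, the runtime selection described above boils down to something like this (a minimal sketch; the library names are hypothetical, and the real loader in torch/csrc/deploy also handles embedding and extracting the interpreter payloads):
```cpp
#include <dlfcn.h>
#include <torch/cuda.h>

// Pick the CUDA flavor of libinterpreter only when the application actually
// has CUDA available; otherwise fall back to the CPU-only flavor. Both copies
// are linked into the binary, but only one is ever loaded into memory.
void* loadInterpreter() {
  const char* lib = torch::cuda::is_available()
      ? "libtorch_deployinterpreter_cuda.so" // hypothetical name
      : "libtorch_deployinterpreter_cpu.so"; // hypothetical name
  return dlopen(lib, RTLD_LOCAL | RTLD_LAZY);
}
```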
Test Plan: test in fbcode with new gpu mode unit tests, verify existing oss CI passes
Reviewed By: suo
Differential Revision: D28512178
fbshipit-source-id: 61354bf78b1932605a841388fcbc4bafc0c4bbb4