pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Richard Barnes	7ec6d1e857	irange-ify 2 (#62113 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62113 Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D29879507 fbshipit-source-id: 1fb114e44afe8c1407f648b705db7fd4edb9d6e3	2021-07-26 12:00:52 -07:00
Gaoxiang Liu	735f8cc6c2	[DI] Allow explicit taskLauncher for torchscript interpreter (#46865 ) Summary: By default, TorchScript execution is single threaded and uses the caller's thread pool. For the use case of distributed inference, we hope there is a way to customize the behavior where the interpreter in torch script can be executed in other places. This diff allows an explicit taskLauncher for torchscript interpreter. Pull Request resolved: https://github.com/pytorch/pytorch/pull/46865 Test Plan: unit test is passed. fbshipit-source-id: 1d7b003926c0d1f8facc53206efb960cff8897ac Fixes #{issue number} Reviewed By: houseroad Differential Revision: D24616102 Pulled By: garroud fbshipit-source-id: 79202b62f92d0b0baf72e4bf7aa3f05e0da91d59	2020-11-04 17:07:55 -08:00
Elias Ellison	9cbeb0faed	[JIT] Dont optimize shape peepholes on inline (#36404 ) Summary: With https://github.com/pytorch/pytorch/pull/35562, we are running peephole optimization on inlining to reduce the number of nodes that are copied. The tracer encodes the sizes in the graph like: ``` graph(%0 : Double(7)): %1 : Function = prim::Constant[name="tensor_size"]() %2 : Tensor = prim::CallFunction(%1, %0) return (%2) ``` however people would like to reuse the graph with different shapes so running size invalidations would invalidate that. long term it might be better for the tracer to not include shape information but there are downstream users of that. Separates out FuseAddMM from peephole so that now there is a single `disable_size_optimizations` parameter, and onnx explicitly invokes fuseaddmm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/36404 Differential Revision: D20968974 Pulled By: eellison fbshipit-source-id: 56f8f1699e3b0adeeccdfd5a67bb975fd41a2913	2020-04-15 17:49:48 -07:00
Elias Ellison	6bc8ffe824	[JIT] Optimize before inlining (#35562 ) Summary: Resubmit of https://github.com/pytorch/pytorch/pull/35424, only this time I run optimizations in the right order so the PR description is actually true. This speeds up the inlining pass of FairSeq model from 180s -> 13s, and MaskRCNN model from 5s -> 1.5s. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35562 Differential Revision: D20738922 Pulled By: eellison fbshipit-source-id: 1439cf9d1f0bc780e2d64a744694f8b3b7ba4b70	2020-04-07 09:42:26 -07:00
Jeremy Lilley	8d64a3848c	[jit] In RPC Server, handle TorchScript continuations asynchronously (#34109 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/34109 This change adds glue to GraphExecutor to give the RPC server access to the future-based Interpreter::runAsync() api. Previously, if a server encounted a TorchScript continuation-based block with fork/wait, it would simply block in the server thread until the handler completed, since it uses the synchronous Interpreter::run() api. With the ivalue::Future returned by the Interpreter, we can run the TorchScript code asynchronously from c++ simply by connecting its callback to the server callback. We add test cases to cover the new logic, both rpc_async and remote. ghstack-source-id: 101245438 Test Plan: buck test mode/dev-nosan caffe2/test/distributed/rpc/... Differential Revision: D20194321 fbshipit-source-id: 16785ec5d9ed0b16cb1ffab0a9771a77de30fcb0	2020-03-31 17:21:46 -07:00
Alban Desmaison	e00575044e	Revert D20657271: [pytorch][PR] [JIT] Optimize before inlining Test Plan: revert-hammer Differential Revision: D20657271 Original commit changeset: 7a9006858c2f fbshipit-source-id: d77bbc74479ec8ca5d3254eff498e1cbc04add2b	2020-03-26 13:33:44 -07:00
Elias Ellison	3b2b6ae1a8	[JIT] Optimize before inlining (#35424 ) Summary: This speeds up the inlining pass of FairSeq model from 180s -> 13s. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35424 Differential Revision: D20657271 Pulled By: eellison fbshipit-source-id: 7a9006858c2f1b157f5a3f36ed2b3774cc186de8	2020-03-26 11:08:09 -07:00
James Reed	60e8615a6d	[JIT] Virtualize Function (#33921 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33921 NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.intern.facebook.com/intern/diff/D20153092/)! Test Plan: Imported from OSS Differential Revision: D20177227 Pulled By: jamesr66a fbshipit-source-id: 87f3e484c4f873d60f76f50f6789c1b4a73bdfde	2020-03-07 10:03:50 -08:00

8 Commits