Summary:
[Re-review tips: nothing changed other than a typo fix in python_ir.cpp to address a Windows build failure]
Adds code printing for enum type
Enhances enum type to include all contained enum names and values
Adds code parsing for enum type in deserialization
Enables serialization/deserialization tests in most TestCases. (A few dangling issues will be addressed in later PRs to keep this PR from growing too large.)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43460
Reviewed By: albanD
Differential Revision: D23284929
Pulled By: gmagogsfm
fbshipit-source-id: e3e81d6106f18b7337ac3ff5cd1eeaff854904f3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42963
* Adds code printing for enum type
* Enhances enum type to include all contained enum names and values
* Adds code parsing for enum type in deserialization
* Enables serialization/deserialization tests in most TestCases, with a few dangling issues to be addressed in later PRs to keep this PR from growing too large (a rough sketch of the resulting round trip follows below)
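A minimal sketch of the round trip these changes enable (the `Color` enum and module are hypothetical examples, not code from this PR):
```
import io
from enum import Enum
import torch

class Color(Enum):
    RED = 1
    GREEN = 2

class M(torch.nn.Module):
    def forward(self, x: Color) -> int:
        # enum names and values are now part of the printed type
        if x == Color.RED:
            return 1
        return 2

m = torch.jit.script(M())
buf = io.BytesIO()
torch.jit.save(m, buf)        # enum type is printed into the serialized code
buf.seek(0)
loaded = torch.jit.load(buf)  # and parsed back during deserialization
assert loaded(Color.RED) == 1
```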
Test Plan: Imported from OSS
Reviewed By: SplitInfinity
Differential Revision: D23223281
Pulled By: gmagogsfm
fbshipit-source-id: 716d1866b7770dfb7bd8515548cfe7dc4c4585f7
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40719
This is a follow-up patch that turns this feature on in order to handle breaking
forward compatibility.
Test Plan: Imported from OSS
Reviewed By: suo
Differential Revision: D22457952
Pulled By: bzinodev
fbshipit-source-id: fac0dfed8b8b5fa2d52d342ee8cf06742959b3c5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40718
Currently, all constants except tensors must be inlined during serialization;
tensors are stored in the constant table. This patch generalizes the constant
table to hold any IValue. This is particularly useful for non-ASCII string
literals, which cannot be inlined.
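A minimal sketch of the case this helps (hypothetical module, not from the patch):
```
import io
import torch

class M(torch.nn.Module):
    def forward(self) -> str:
        return "héllo ☃"  # non-ASCII literal that cannot be inlined as source

m = torch.jit.script(M())
buf = io.BytesIO()
torch.jit.save(m, buf)  # the string constant is stored in the constant table
buf.seek(0)
assert torch.jit.load(buf)() == "héllo ☃"
```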
Test Plan: Imported from OSS
Differential Revision: D22298169
Pulled By: bzinodev
fbshipit-source-id: 88cc59af9cc45e426ca8002175593b9e431f4bac
Summary:
BC NOTE:
This change makes it so modules saved with torch.jit.save in PyTorch 1.6 can be loaded by previous versions of PyTorch unless they use torch.div or (soon) torch.full. It also lets tensors saved using torch.save be loaded by previous versions. So this is the opposite of BC-breaking, but I'm using that label to highlight this issue since we don't have a "BC-improving" label.
PR NOTE:
When an operator's semantics change in PyTorch we want to do two things:
1) Preserve the semantics of older serialized Torchscript programs that use the operator
2) Ensure the new semantics are respected
Historically, this meant writing a Versioned Symbol that would remap older versions of the operator into current PyTorch code (1), and bumping the produced file format version (2). Unfortunately, bumping the produced file format version is a nuclear option for ensuring semantics are respected, since it also prevents older versions of PyTorch from loading anything (even tensors!) from newer versions.
Dynamic versioning addresses the nuclear consequences of bumping the produced file format version by only bumping it when necessary. That is, when an operator with changed semantics is detected in the serialized Torchscript. This will prevent Torchscript programs that use the changed operator from loading on earlier versions of PyTorch, as desired, but will have no impact on programs that don't use the changed operator.
Note that this change only applies when using torch.jit.save and torch.jit.load. torch.save pickles the given object using pickle (by default), which saves a function's Python definition directly.
No new tests for this behavior are added since the existing tests for versioned division in test_save_load already validate that models with div are loaded correctly at version 4.
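A rough sketch of the observable effect, assuming the saved archive exposes the produced file format version in a `version` entry (module names below are hypothetical):
```
import io
import zipfile
import torch

class UsesDiv(torch.nn.Module):
    def forward(self, x, y):
        return torch.div(x, y)  # operator with changed semantics

class NoDiv(torch.nn.Module):
    def forward(self, x, y):
        return x + y            # no version-gated operators

def produced_version(module):
    buf = io.BytesIO()
    torch.jit.save(torch.jit.script(module), buf)
    buf.seek(0)
    with zipfile.ZipFile(buf) as zf:
        # the archive stores the produced file format version in a
        # 'version' entry
        name = next(n for n in zf.namelist() if n.endswith("version"))
        return int(zf.read(name).decode().strip())

# With dynamic versioning, only the module that actually uses torch.div
# gets the bumped file format version.
assert produced_version(UsesDiv()) > produced_version(NoDiv())
```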
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40279
Reviewed By: dzhulgakov
Differential Revision: D22168291
Pulled By: mruberry
fbshipit-source-id: e71d6380e727e25123c7eedf6d80e5d7f1fe9f95
Summary:
**Summary**
This commit adds support for with statements to PyTorch JIT. Each
of the with items in a with statement is represented in the JIT IR
as a pair of `prim::Enter` and `prim::Exit` nodes that call the
`__enter__` and `__exit__` methods defined on the context manager objects
returned by the expressions in the with item.
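A minimal sketch of what this enables, loosely modeled on the added tests (the `Context` class below is illustrative, not the exact test code):
```
from typing import Any

import torch

@torch.jit.script
class Context(object):
    def __init__(self, start: int):
        self.count = start

    def __enter__(self):
        self.count += 1  # called via prim::Enter
        return self.count

    def __exit__(self, type: Any, value: Any, tb: Any):
        self.count -= 1  # called via prim::Exit

@torch.jit.script
def fn(x: int) -> int:
    with Context(x) as c:
        x = x + c
    return x

assert fn(1) == 3  # Context(1).__enter__() returns 2; 1 + 2 == 3
```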
**Testing**
This commit adds unit tests for with statements with named with items,
nameless with items, and with statements that encounter exceptions.
```
$ python test/test_jit.py TestWith.test_with_as
Fail to import hypothesis in common_utils, tests are not derandomized
.
----------------------------------------------------------------------
Ran 1 test in 0.430s
OK
```
```
$ python test/test_jit.py TestWith.test_with_no_as
Fail to import hypothesis in common_utils, tests are not derandomized
.
----------------------------------------------------------------------
Ran 1 test in 0.264s
OK
```
```
$ python test/test_jit.py TestWith.test_with_exceptions
Fail to import hypothesis in common_utils, tests are not derandomized
Couldn't download test skip set, leaving all tests enabled...
.
----------------------------------------------------------------------
Ran 1 test in 1.053s
OK
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34705
Differential Revision: D22095945
Pulled By: SplitInfinity
fbshipit-source-id: f661565a834786725259b8ea014b4d7532f9419d
Summary:
Clearly expressing that a type was inferred by PyTorch, rather than explicitly annotated by the user, makes many error messages more user-friendly.
Currently, Type has two string conversion methods: str() for IR printing and python_str() for serialization and error message generation. If we want to include more information in type printing while maintaining serialization/deserialization correctness, we need to split python_str() into annotation_str() and repr_str().
annotation_str() is solely responsible for serialization; it strictly matches the format of a Python type annotation. repr_str() is responsible for generating human-readable error messages that can include information like "this type is inferred, not explicitly annotated".
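A rough sketch of where the richer repr_str() output surfaces (hypothetical functions; the exact message text may differ):
```
import torch

def helper(x):
    # `x` has no annotation, so TorchScript infers it as Tensor
    return x + 1

try:
    @torch.jit.script
    def caller(flag: bool):
        return helper(flag)  # bool passed where Tensor was inferred
except RuntimeError as e:
    # The message can now distinguish inferred types, e.g. it may say
    # 'x' was inferred to be Tensor because it was not annotated.
    print(e)
```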
Closes https://github.com/pytorch/pytorch/issues/39449
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39544
Differential Revision: D21978759
Pulled By: gmagogsfm
fbshipit-source-id: 733566f5a62e748b5ca4bb3c5943ebb6d5b664d0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37472
Our convention is for `findX` to return an optional version and `getX`
to assert that the X is there. Fix up `getMethod` to be consistent with
this convention.
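In Python terms, the convention looks roughly like this (the C++ code returns `c10::optional`; the names below are illustrative):
```
from typing import Optional

class Method:
    pass

class Module:
    def __init__(self):
        self._methods = {}

    def find_method(self, name: str) -> Optional[Method]:
        # findX: returns None when X is absent
        return self._methods.get(name)

    def get_method(self, name: str) -> Method:
        # getX: asserts X is present and returns it directly
        method = self.find_method(name)
        assert method is not None, "method '" + name + "' not found"
        return method
```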
Test Plan: Imported from OSS
Differential Revision: D21297543
Pulled By: suo
fbshipit-source-id: b40f56231cc8183e61bbb01fe5c0c113bcb6464d
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37088
For an inlined expression tree like `(e_0, (e_1, e_long))`, the previous
algorithm only scanned the same statement as `e_long`, splitting the
inlined expressions across lines. Because it did not scan `e_0`, `e_0`
would still get emitted inline, causing it to reverse order with `e_1` and
`e_long`. The new algorithm scans starting at `e_long` and goes all
the way back up the expression until it reaches the end of the inlined
statement. Caching of what has already been scanned has been added so that
if there were a second long expression `e_long2` after `e_long`, it would not
rescan and re-inline the statements that were already split.
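A schematic illustration of the reordering problem and the fix (not actual printer output):
```
# Expression tree to emit: t = (e_0, (e_1, e_long))
#
# Old algorithm: only the statement containing e_long is scanned, so
# e_long is split out but e_0 stays inline:
#     _0 = <e_long>
#     t = (<e_0>, (<e_1>, _0))  # e_0 now evaluates after e_long: reordered
#
# New algorithm: scanning walks from e_long back up to the top of the
# statement, so expressions evaluated before e_long are split out too:
#     _0 = <e_0>
#     _1 = <e_1>
#     _2 = <e_long>
#     t = (_0, (_1, _2))        # original evaluation order preserved
```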
Test Plan: Imported from OSS
Differential Revision: D21180394
Pulled By: zdevito
fbshipit-source-id: 4d142c83a04c89a47d04282f67a513f82cf153c0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36188
* Need to remove the O(n^2) behavior when scanning whether to split or not;
otherwise long inline chains take a long time re-scanning.
Test Plan: Imported from OSS
Differential Revision: D20907254
Pulled By: zdevito
fbshipit-source-id: ebfc1a4eefc26d5806381e7afd75b7a9cd4cde97
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35720
When modules are saved, all relevant types are serialized according to
their qualified name with a compilation unit. Since qualified names are
guaranteed to be unique within a compilation unit, this normally works
fine.
On load, all types are registered in a compilation unit owned by the
script::Module. Type names are not unique across compilation units, so
if you load two modules with colliding type names, make them submodules
of yet another module, and save that module, there is the potential for a
name collision. See the added tests for examples if that description is
confusing.
The solution is to uniquify type names when serializing code by mangling
them if we detect a name collision; a rough sketch of the scenario follows.
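A rough sketch of the colliding scenario (hypothetical classes; the exact mangled-name format is illustrative):
```
import io
import torch

def save_and_load(module):
    # round-trip through torch.jit.save/load; each load registers its
    # types in a fresh compilation unit
    buf = io.BytesIO()
    torch.jit.save(torch.jit.script(module), buf)
    buf.seek(0)
    return torch.jit.load(buf)

class A(torch.nn.Module):
    def forward(self, x):
        return x + 1

first = save_and_load(A())

class A(torch.nn.Module):  # same qualified name, different code
    def forward(self, x):
        return x - 1

second = save_and_load(A())

class Parent(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.a = first  # both submodule types claim the same name
        self.b = second

    def forward(self, x):
        return self.a(x) + self.b(x)

# Serializing Parent must detect the collision and mangle one of the
# type names so they stay unique in the combined archive.
torch.jit.save(torch.jit.script(Parent()), io.BytesIO())
```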
Test Plan: Imported from OSS
Differential Revision: D20749423
Pulled By: suo
fbshipit-source-id: a8827ff1d4a89f3e7964dbbb49b4381863da3e6a
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35115
This commit runs the newly added tools/clang_format.py on the JIT
codebase and includes all of the formatting changes thus produced.
Testing:
Ran the script, CI.
Test Plan: Imported from OSS
Reviewed By: eellison
Differential Revision: D20568523
Pulled By: SplitInfinity
fbshipit-source-id: e09bdb982ccf090eecfb7c7b461b8d0681eef82b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34515
Once upon a time we thought this was necessary. In reality it is not, so
removing it.
For backcompat, our public interface (defined in `api/`) still has
typedefs to the old `script::` names.
There was only one collision: `Pass` as a `Stmt` and `Pass` as a graph
transform. I renamed one of them.
Test Plan: Imported from OSS
Differential Revision: D20353503
Pulled By: suo
fbshipit-source-id: 48bb911ce75120a8c9e0c6fb65262ef775dfba93
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33329
# Use case
```
@torch.jit.script
def send_rpc_async(dst_worker_name, user_callable_qual_name, tensor):
    # type: (str, str, Tensor) -> None
    rpc._rpc_async_torchscript(
        dst_worker_name, user_callable_qual_name, args=(tensor,)
    )
```
# Problem
```
torch.jit.frontend.NotSupportedError: keyword-arg expansion is not supported:
File "/data/users/shihaoxu/fbsource/fbcode/buck-out/dev/gen/caffe2/test/distributed/rpc/rpc_spawn#binary,link-tree/torch/distributed/rpc/api.py", line 722
args = args if args else ()
kwargs = kwargs if kwargs else {}
fut = _invoke_rpc_torchscript(to, qualified_name, *args, **kwargs)
~~~~~~ <--- HERE
return fut
```
# Solution
Register `rpc.rpc_async(..)` as a JIT operator to handle variable-length argument lists.
# Plan
This PR makes the changes required to turn `rpc.rpc_async(..)` into a JIT prim operator that can dynamically handle different numbers of arguments.
- Register "prim::rpc_async" as a `Symbol` in "interned_strings.h".
- Add an if branch in "python_sugared_value.cpp"'s `toSugarValue(py::object, ..)` entry utility function to set up how the JIT frontend converts the `torch.distributed.rpc.rpc_async(..)` Python function (a Python object) into a `SpecialFormValue` (an IR SugaredValue).
- Add a switch case for the "prim::rpc_async" Symbol in `emitApplySpecialForm(..)` in "ir_emitter.cpp" to set up how the JIT compiler provides inputs to the "prim::rpc_async" Operator.
- Register "prim::rpc_async" as a `jit::Operator` and provide its implementation in "register_distributed_ops.cpp".
Note that since the distributed module is an optional part of the PyTorch build, the code added in this PR is wrapped in a preprocessor macro:
```
#ifdef USE_DISTRIBUTED
new code here
#endif
```
Test Plan:
Items that need to be confirmed in the test cases
https://fb.quip.com/DCvdA9ZLjeO0
```
buck test mode/dev-nosan //caffe2/test/distributed/rpc/jit:rpc_fork
buck build mode/dev-nosan //caffe2/test/distributed/rpc/jit:rpc_fork \
  && buck-out/gen/caffe2/test/distributed/rpc/jit/rpc_fork\#binary.par -r test_call_python_function_remotely_from_script_not_supported
```
```
buck test mode/dev-nosan //caffe2/test/distributed/rpc/jit:rpc_spawn
```
```
buck test mode/dev-nosan //caffe2/caffe2/python/operator_test:layer_norm_op_test-2.7 -- test_layer_norm_op_jit
```
Differential Revision: D5738300
fbshipit-source-id: a4604fe762e00be062dc8232ca9790df31fb2074
Summary:
**Summary**
This commit adds an implementation of `Tensor.tolist()` to the JIT interpreter.
**Testing**
This commit adds several unit tests verifying that this function works correctly for
0D, 1D, 2D and 3D tensors of type `float`, `int` and `bool`.
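A minimal sketch of scripted usage (a sketch under the assumption that TorchScript requires the list's dimensionality and element type to be pinned by an annotation):
```
from typing import List

import torch

@torch.jit.script
def to_nested_list(x: torch.Tensor) -> List[List[float]]:
    # In TorchScript, the annotation fixes the dimensionality (2D) and
    # element type (float) of the returned list.
    result: List[List[float]] = x.tolist()
    return result

print(to_nested_list(torch.ones(2, 2)))  # [[1.0, 1.0], [1.0, 1.0]]
```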
```
(base) meghanl-mbp:pytorch meghanl$ python test/test_jit.py TestList.test_to_list -v
Fail to import hypothesis in common_utils, tests are not derandomized
test_to_list (jit.test_list_dict.TestList)
Unit tests for Tensor.tolist() function. ... ok
----------------------------------------------------------------------
Ran 1 test in 0.329s
OK
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33472
Differential Revision: D20109738
Pulled By: SplitInfinity
fbshipit-source-id: a6e3fee5e3201d5e1f0c4ca45048488ae2bf5e33