Commit Graph

36 Commits

Xuehai Pan
4ee1cb9b95 [BE][Easy] replace import pathlib with from pathlib import Path (#129426)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/129426
Approved by: https://github.com/malfet
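A minimal sketch of the import-style change described in the title (the `config_dir` variable is just an illustration, not code from the PR):

```python
# Before: module import; call sites spell out pathlib.Path everywhere.
import pathlib

config_dir = pathlib.Path(__file__).parent / "configs"

# After: import the class directly, as the PR title describes.
from pathlib import Path

config_dir = Path(__file__).parent / "configs"
```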
2024-06-30 01:36:07 +00:00
PyTorch MergeBot
2effbcfcd8 Revert "[BE][Easy] replace import pathlib with from pathlib import Path (#129426)"
This reverts commit 6d75604ef1.

Reverted https://github.com/pytorch/pytorch/pull/129426 on behalf of https://github.com/XuehaiPan because `Path` was recognized as a new exported API ([comment](https://github.com/pytorch/pytorch/pull/129426#issuecomment-2198371625))
2024-06-29 23:24:06 +00:00
Xuehai Pan
6d75604ef1 [BE][Easy] replace import pathlib with from pathlib import Path (#129426)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/129426
Approved by: https://github.com/malfet
2024-06-29 15:42:09 +00:00
Aaron Orenstein
57536286e2 Flip default value for mypy disallow_untyped_defs [10/11] (#127847)
See #127836 for details.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/127847
Approved by: https://github.com/oulgen
ghstack dependencies: #127842, #127843, #127844, #127845, #127846
2024-06-08 18:50:06 +00:00
Aaron Gokaslan
c5fafe9f48 [BE]: TRY002 - Ban raising vanilla exceptions (#124570)
Adds a ruff lint rule to ban raising vanilla exceptions. Most of these should at the very least be runtime errors, value errors, type errors, or some other specific error type. There are hundreds of instances of these bad exception types already in the codebase, so I have noqa'd most of them. Hopefully this error code will get committers to rethink what exception type they should raise when they submit a PR.

I also encourage people to gradually go and fix all the existing noqas that have been added, so they can be removed over time and our exception typing can be improved.
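A hedged before/after illustration of the kind of change TRY002 pushes for (the function and message are made up, not taken from the codebase):

```python
def set_ratio(value: float) -> None:
    if not 0.0 <= value <= 1.0:
        # Flagged by TRY002: raising a vanilla Exception tells callers nothing
        # and forces them to catch everything.
        #     raise Exception("bad ratio")
        # Prefer a specific built-in (or custom) exception type instead.
        raise ValueError(f"ratio must be in [0, 1], got {value}")
```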

Pull Request resolved: https://github.com/pytorch/pytorch/pull/124570
Approved by: https://github.com/ezyang
2024-04-21 22:26:40 +00:00
Aaron Gokaslan
dd5e6e8553 [BE]: Merge startswith calls - rule PIE810 (#96754)
Merges startswith/endswith calls into a single call that takes a tuple of prefixes/suffixes. Not only are these merged calls more readable, they are also more efficient, since each string is iterated over only once.
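A small before/after sketch of the merge this rule performs (the variable and prefixes are illustrative):

```python
name = "torch_cuda_backend"

# Before (flagged by PIE810): two separate startswith calls.
if name.startswith("torch_") or name.startswith("caffe2_"):
    print("internal prefix")

# After: one call that takes a tuple of prefixes.
if name.startswith(("torch_", "caffe2_")):
    print("internal prefix")
```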
Pull Request resolved: https://github.com/pytorch/pytorch/pull/96754
Approved by: https://github.com/ezyang
2023-03-14 22:05:20 +00:00
Nikita Shulga
5976f0bdfe Set min supported Python version to 3.8 (#93155)
Also, greps for `if sys.version_info .cond. (3, 8)` and replaces each occurrence with the appropriate action.
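As an illustration (not code from the PR), a guard of this shape becomes unconditional once 3.8 is the minimum supported version:

```python
import sys

# Before: branch on the interpreter version.
if sys.version_info >= (3, 8):
    from importlib import metadata
else:
    import importlib_metadata as metadata

# After: the check is removed and the >= 3.8 branch is kept.
from importlib import metadata
```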

This is the last in a series of PRs that moved CI/CD away from testing PyTorch behavior against Python 3.7.

Fixes https://github.com/pytorch/pytorch/issues/80513

Pull Request resolved: https://github.com/pytorch/pytorch/pull/93155
Approved by: https://github.com/huydhn
2023-01-29 18:28:46 +00:00
kshitij12345
eb9b156019 [fix] MathBits: serialization (#88182)
Fixes #81690

TODO:

* [x] C++ Unpickler Fix (locally tested pickled in Python and unpickled in C++)
* [x] C++ Pickler Fix (locally tested pickled in C++ and unpickled in Python)
* [x] Do quant_tensor, sparse_tensor, etc require similar changes? (Sparse and Quant don't need this)
* [x] Add Comments
* [x] How to make sure C++ and Python are in sync? (Functions in `pickler.h` help in getting and setting Tensor Metadata (math-bits for now) on a tensor. They are the only place which should handle this.)

Notes:
Quantized tensors don't support complex dtypes, and for float they segfault with `_neg_view`: https://github.com/pytorch/pytorch/issues/88484

Sparse Tensor:
```python
>>> a = torch.tensor([[0, 2.], [3j, 0]]).to_sparse()
>>> a.conj().is_conj()
False
>>> a._neg_view()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NotImplementedError: Cannot access storage of SparseTensorImpl
```
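A minimal round-trip check of the kind this fix targets (an illustrative sketch, not the PR's actual test):

```python
import io

import torch

# Serialize a conjugate-view tensor and make sure the math bit survives.
a = torch.tensor([1 + 2j, 3 - 4j]).conj()
buf = io.BytesIO()
torch.save(a, buf)
buf.seek(0)
b = torch.load(buf)
assert b.is_conj() == a.is_conj()
assert torch.equal(a.resolve_conj(), b.resolve_conj())
```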

Pull Request resolved: https://github.com/pytorch/pytorch/pull/88182
Approved by: https://github.com/ezyang, https://github.com/anjali411
2022-11-09 17:15:12 +00:00
anjali411
120987ffeb Fix macos public bindings failures (#80970)
We are seeing unrelated public-bindings test failures on macOS being triggered on random PRs. Here's an attempt to fix some of those.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80970
Approved by: https://github.com/rohan-varma
2022-07-07 14:10:00 +00:00
Han Qi
b34b192d6b Reland "Make debug_pkl smaller by only emitting unique traces." (#73368)
Summary:
## Original commit message:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73368

The debug_pkl file inside PyTorch's .pt file consists of a list of SourceRanges. Each SourceRange points to a Source, which is a stack trace, a filename, and start/end numbers. Those are emitted in the debug_pkl file as strings.
Since many SourceRanges share the same Source, the trace strings can be deduplicated.
The newer format saves the set of unique traces in a tuple; each SourceRange then saves the offset of its trace within that tuple (i.e. manually applied dictionary compression).
The above yields a smaller file size. On loading, if we copied each trace into a Source as a string, runtime memory would still blow up.
To mitigate this, we use SourceView directly instead of Source; it takes a reference to the string inside the Deserializer and wraps it in a string_view. This is safe because the Deserializer is held by the Unpickler via shared_ptr, and the Unpickler is in turn held via shared_ptr by another Source object. That Source object stays alive during model construction.
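A rough Python sketch of the dictionary-compression idea described above (names are illustrative; the real implementation lives in the C++ pickler):

```python
def dedup_traces(source_ranges):
    """source_ranges: iterable of (trace, filename, start, end) tuples."""
    unique_traces = []      # written once as a tuple of unique traces
    trace_to_offset = {}
    compact_ranges = []
    for trace, filename, start, end in source_ranges:
        if trace not in trace_to_offset:
            trace_to_offset[trace] = len(unique_traces)
            unique_traces.append(trace)
        # Each SourceRange stores only the offset of its trace in the tuple.
        compact_ranges.append((trace_to_offset[trace], filename, start, end))
    return tuple(unique_traces), compact_ranges
```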

Test Plan:
## Original Test plan
unit test

Took the original file (312271638_930.predictor.disagg.local); loaded it with `torch.jit.load`, saved it again with `torch.jit.save`. Unzipped both and looked at the contents:
```
[qihan@devvm5585.vll0 ~]$ du archive -h
4.0K    archive/xl_model_weights
3.7M    archive/extra
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform
8.0K    archive/code/__torch__/caffe2/torch/fb
8.0K    archive/code/__torch__/caffe2/torch
8.0K    archive/code/__torch__/caffe2
20M     archive/code/__torch__/torch/fx/graph_module
20M     archive/code/__torch__/torch/fx
8.0K    archive/code/__torch__/torch/classes
20M     archive/code/__torch__/torch
20M     archive/code/__torch__
20M     archive/code
2.7M    archive/constants
35M     archive
[qihan@devvm5585.vll0 ~]$ du resaved -h
4.0K    resaved/extra
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform
8.0K    resaved/code/__torch__/caffe2/torch/fb
8.0K    resaved/code/__torch__/caffe2/torch
8.0K    resaved/code/__torch__/caffe2
1.3M    resaved/code/__torch__/torch/fx/graph_module
1.3M    resaved/code/__torch__/torch/fx
8.0K    resaved/code/__torch__/torch/classes
1.4M    resaved/code/__torch__/torch
1.4M    resaved/code/__torch__
1.4M    resaved/code
2.7M    resaved/constants
13M     resaved
[qihan@devvm5585.vll0 ~]$
```
## Additional test:
`buck test mode/dev-tsan //caffe2/benchmarks/static_runtime:static_runtime_cpptest -- --exact 'caffe2/benchmarks/static_runtime:static_runtime_cpptest - StaticRuntime.to'` passes

 test jest.fbios.startup_cold_start.local.simulator f333356873 -

Differential Revision: D35196883

Pull Request resolved: https://github.com/pytorch/pytorch/pull/74869
Approved by: https://github.com/gmagogsfm
2022-04-18 22:34:21 +00:00
Han Qi
0723639b60 Revert D34455360: Multisect successfully blamed D34455360 for test failures
Summary:
This diff is reverting D34455360 (61d6c43864)
D34455360 (61d6c43864) is making the following tests fail, and this revert diff is either the revert of the blamed diff or the revert of the stack of diffs that need to be reverted in order to revert the blamed diff.

Tests affected:
- https://www.internalfb.com/intern/test/562950004334605/

Multisect link:
https://www.internalfb.com/intern/testinfra/multisect/756170

Test Plan: NA

Reviewed By: zhxchen17

Differential Revision: D34596156

fbshipit-source-id: a465bca0094db3caf6130c80f1ed49eea981359b
(cherry picked from commit ef5e5578c64ce9827570757fb016aafa9c782c6a)
2022-03-08 23:18:54 +00:00
Han Qi
61d6c43864 Make debug_pkl smaller by only emitting unique traces. (#73368)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73368

The debug_pkl file inside PyTorch's .pt file consists of a list of SourceRanges. Each SourceRange points to a Source, which is a stack trace, a filename, and start/end numbers. Those are emitted in the debug_pkl file as strings.
Since many SourceRanges share the same Source, the trace strings can be deduplicated.
The newer format saves the set of unique traces in a tuple; each SourceRange then saves the offset of its trace within that tuple (i.e. manually applied dictionary compression).
The above yields a smaller file size. On loading, if we copied each trace into a Source as a string, runtime memory would still blow up.
To mitigate this, we use SourceView directly instead of Source; it takes a reference to the string inside the Deserializer and wraps it in a string_view. This is safe because the Deserializer is held by the Unpickler via shared_ptr, and the Unpickler is in turn held via shared_ptr by another Source object. That Source object stays alive during model construction.

Test Plan:
unit test

Took the original file (312271638_930.predictor.disagg.local); loaded it with `torch.jit.load`, saved it again with `torch.jit.save`. Unzipped both and looked at the contents:
```
[qihan@devvm5585.vll0 ~]$ du archive -h
4.0K    archive/xl_model_weights
3.7M    archive/extra
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform
8.0K    archive/code/__torch__/caffe2/torch/fb
8.0K    archive/code/__torch__/caffe2/torch
8.0K    archive/code/__torch__/caffe2
20M     archive/code/__torch__/torch/fx/graph_module
20M     archive/code/__torch__/torch/fx
8.0K    archive/code/__torch__/torch/classes
20M     archive/code/__torch__/torch
20M     archive/code/__torch__
20M     archive/code
2.7M    archive/constants
35M     archive
[qihan@devvm5585.vll0 ~]$ du resaved -h
4.0K    resaved/extra
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform
8.0K    resaved/code/__torch__/caffe2/torch/fb
8.0K    resaved/code/__torch__/caffe2/torch
8.0K    resaved/code/__torch__/caffe2
1.3M    resaved/code/__torch__/torch/fx/graph_module
1.3M    resaved/code/__torch__/torch/fx
8.0K    resaved/code/__torch__/torch/classes
1.4M    resaved/code/__torch__/torch
1.4M    resaved/code/__torch__
1.4M    resaved/code
2.7M    resaved/constants
13M     resaved
[qihan@devvm5585.vll0 ~]$
```

Reviewed By: gmagogsfm

Differential Revision: D34455360

fbshipit-source-id: 8cc716f9bba7183746b1b4ecc33a2de34ac503b9
(cherry picked from commit f1a04730fc9ac8fdab6c8e4c44cb5529e42090e4)
2022-03-02 08:37:08 +00:00
Alban Desmaison
3bd1507ff2 Revert D33994011: Make debug_pkl smaller by only emitting unique traces.
Test Plan: revert-hammer

Differential Revision:
D33994011 (3d37f5b052)

Original commit changeset: 8e6224c6e942

Original Phabricator Diff: D33994011 (3d37f5b052)

fbshipit-source-id: 885e739efa1081382e1fcf9c6cccba92c57e9f7a
(cherry picked from commit a6d98c85a736c2eb321a6f38005dd0f5dc43eb87)
2022-02-24 16:38:55 +00:00
Han Qi
3d37f5b052 Make debug_pkl smaller by only emitting unique traces. (#72596)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72596

The debug_pkl file inside PyTorch's .pt file consists of a list of SourceRanges. Each SourceRange points to a Source, which is a stack trace, a filename, and start/end numbers. Those are emitted in the debug_pkl file as strings.

Since many SourceRanges share the same Source, the trace strings can be deduplicated.

The newer format saves the set of unique traces in a tuple; each SourceRange then saves the offset of its trace within that tuple (i.e. manually applied dictionary compression).

The above yields a smaller file size. On loading, if we copied each trace into a Source as a string, runtime memory would still blow up.
To mitigate this, we use SourceView directly instead of Source; it takes a reference to the string inside the Deserializer and wraps it in a string_view. This is safe because the Deserializer is held by the Unpickler via shared_ptr, and the Unpickler is in turn held via shared_ptr by another Source object. That Source object stays alive during model construction.

Test Plan:
unit test

Took the original file (312271638_930.predictor.disagg.local); loaded it with `torch.jit.load`, saved it again with `torch.jit.save`. Unzipped both and looked at the contents:
```
[qihan@devvm5585.vll0 ~]$ du archive -h
4.0K    archive/xl_model_weights
3.7M    archive/extra
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    archive/code/__torch__/caffe2/torch/fb/model_transform
8.0K    archive/code/__torch__/caffe2/torch/fb
8.0K    archive/code/__torch__/caffe2/torch
8.0K    archive/code/__torch__/caffe2
20M     archive/code/__torch__/torch/fx/graph_module
20M     archive/code/__torch__/torch/fx
8.0K    archive/code/__torch__/torch/classes
20M     archive/code/__torch__/torch
20M     archive/code/__torch__
20M     archive/code
2.7M    archive/constants
35M     archive
[qihan@devvm5585.vll0 ~]$ du resaved -h
4.0K    resaved/extra
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform/splitting
8.0K    resaved/code/__torch__/caffe2/torch/fb/model_transform
8.0K    resaved/code/__torch__/caffe2/torch/fb
8.0K    resaved/code/__torch__/caffe2/torch
8.0K    resaved/code/__torch__/caffe2
1.3M    resaved/code/__torch__/torch/fx/graph_module
1.3M    resaved/code/__torch__/torch/fx
8.0K    resaved/code/__torch__/torch/classes
1.4M    resaved/code/__torch__/torch
1.4M    resaved/code/__torch__
1.4M    resaved/code
2.7M    resaved/constants
13M     resaved
[qihan@devvm5585.vll0 ~]$
```

Reviewed By: JasonHanwen

Differential Revision: D33994011

fbshipit-source-id: 8e6224c6e942e91c3403f686c8f0937d1002ed41
(cherry picked from commit a7014dd4029308c95007f362a57c31796d686647)
2022-02-24 09:31:16 +00:00
Chen Lai
cee84f4051 fix model dump for the lowered module (#72866)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72866

https://github.com/pytorch/pytorch/pull/71597 adds a wrapper, `torch.jit.LoweredWrapper`, which breaks model dump. Fix model_dump in the notebook.
ghstack-source-id: 149311636

Test Plan:
CI and test with N509022

Before:

{F701413403}

After:

{F701412963}

Reviewed By: iseeyuan

Differential Revision: D34247216

fbshipit-source-id: 695b02b03675fae596bb450441b327e4cdcffe9c
(cherry picked from commit d46a82a4c1)
2022-02-17 07:09:44 +00:00
David Reiss
e1d963e8fc model_dump: Fix memory computation when both constants and data tensors are present (#66006)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66006

Previously, this was resulting in a key collision and a crash.
ghstack-source-id: 139342089

Test Plan: Ran webdriver test locally.

Reviewed By: dhruvbird

Differential Revision: D31281092

fbshipit-source-id: f31311726c681d6d7e0504ff8e84c888af9054f0
2021-10-01 16:31:06 -07:00
David Reiss
23caeb3f71 model_dump: Add a helper to produce html with a single call (#66005)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66005

ghstack-source-id: 139342091

Test Plan: Unit test, and used in a notebook.

Reviewed By: dhruvbird

Differential Revision: D31281091

fbshipit-source-id: 1e4d0713b9796a3d182de9e676c3b3c3b1610d6e
2021-10-01 16:29:43 -07:00
Jiakai Liu
73c46092f1 [pytorch] sort the output of the model_dump util (#62485)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62485

Make it easier to browse the code section by sorting the files by name.

Test Plan: Imported from OSS

Reviewed By: dhruvbird, malfet

Differential Revision: D30016245

Pulled By: ljk53

fbshipit-source-id: c9cb3c1ad9bcaa5337a6ad5c575ac0c240751f6c
2021-07-30 14:40:07 -07:00
David Reiss
7fdc5f9e08 model_dump: Fix non-counting and double-counting bugs in tensor memory (#60702)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60702

- Instead of traversing and counting all tensor memory, collect a map
  from storage key to storage info while traversing.  Add up sizes at
  the end to avoid double counting.
- Count tensor memory from constants as well.
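A rough sketch of the dedup-by-storage-key idea described above (illustrative; the actual accounting happens while traversing the model's data and constant tensors):

```python
def total_tensor_bytes(tensor_infos):
    """tensor_infos: iterable of (storage_key, storage_nbytes) pairs
    collected while traversing both data and constant tensors."""
    sizes_by_key = {}
    for key, nbytes in tensor_infos:
        # Tensors (views) sharing a storage overwrite the same entry,
        # so each storage is counted exactly once.
        sizes_by_key[key] = nbytes
    return sum(sizes_by_key.values())
```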

Test Plan: Ran webdriver test.

Reviewed By: dhruvbird

Differential Revision: D29380396

Pulled By: dreiss

fbshipit-source-id: 6d0fd66f677fe23c851aa218387aa4dc59502b1e
2021-07-10 15:16:34 -07:00
David Reiss
158d351517 model_dump: Add webdriver test (#60701)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60701

The unit test previously only tested that the dump could complete
successfully.  It was not able to verify that any JS worked properly.
Now we can test the JS as long as webdriver is installed.

Tweaked the implementation of Hider a bit to make it easier for tests to
find and open hiders.

I disabled the tests by default since I don't want to deal with
webdriver in CI.  Enable them with the environment variable
RUN_WEBDRIVER=1.

We could make the tests use headless mode, but it's kind of fun to watch
them run.

Add a test to verify that tensor memory computation is working for the
simple model.

Test Plan: Ran the test.

Reviewed By: dhruvbird

Differential Revision: D29380398

Pulled By: dreiss

fbshipit-source-id: f19d0b05d79ad5a8231e85422976f1889e021c89
2021-07-10 15:16:32 -07:00
David Reiss
cc78c463c0 model_dump: Render constants.pkl similar to data.pkl (#60700)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60700

Test Plan:
Dumped a model with a lot of constants (qconvs produced by optimizing).
Was able to see them rendered nicely.

Reviewed By: dhruvbird

Differential Revision: D29380400

Pulled By: dreiss

fbshipit-source-id: c951508b92bb2717591dd173282157e1a40a30bd
2021-07-10 15:16:31 -07:00
David Reiss
e292f34def model_dump: Make stdout argument for main a keyword-only argument (#60699)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60699

Also add a unit test for main, which brings the test coverage up to
~98%.  Also factor out the "needs importlib.resources" check into a
function for easier reuse.

Test Plan: CI

Reviewed By: dhruvbird

Differential Revision: D29380397

Pulled By: dreiss

fbshipit-source-id: bba16da85bf7bfb4370308e38c844694d01b47eb
2021-07-10 15:16:29 -07:00
David Reiss
2942e9aa80 model_dump: update maintainer comment (#60698)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60698

... to reflect that the Python command should be re-run when changing
the model.

Test Plan: CI

Reviewed By: dhruvbird

Differential Revision: D29380399

Pulled By: dreiss

fbshipit-source-id: 1ec464da4ebe6ddf400eb4a3b14da683369c0039
2021-07-10 15:15:15 -07:00
Martin Yuan
19b6ee4d4e model_dump working with delegate models (#61043)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/61043

Trying to make model_dump work with delegate models
ghstack-source-id: 132809755

Test Plan:
N509022.

The data.pkl in the lowered model:
```
bash-3.2$ python -m torch.utils.show_pickle /Users/myuan/models/backend/lowered_model.pt@*/data.pkl
torch.jit.backend_with_compiler_demo.LoweredModule.__torch__.___torch_mangle_5.ModuleAdd()(state=
 (torch.jit._pickle.restore_type_tag({'forward': torch.jit._pickle.restore_type_tag({'input_shapes': '((1, 1, 320, 240), (1, 3))',
                   'some_other_option': 'True'},
                  'Dict[str, str]')},
    'Dict[str, Any]'),
  torch.jit._pickle.restore_type_tag({'forward': 'prim::Constant#1<debug_handle>271,aten::add<debug_handle>272'},
    'Dict[str, str]'),
  True))
```
Comparing to data.pkl in scripted_model.pt:
```
__torch__.___torch_mangle_7.ModuleAdd()(state=
 {'_is_full_backward_hook': None, 'training': True})
```

Reviewed By: Amyh11325

Differential Revision: D29464860

fbshipit-source-id: d738e98ea518339465f8e3375207cf83e3dac532
2021-07-01 13:39:56 -07:00
Philip Meier
d5988c5eca remove unused type: ignore directives (#60006)
Summary:
During development it is common practice to put `type: ignore` comments on lines that are correct, but that `mypy` doesn't recognize as such. This often stems from the fact that the `mypy` version in use wasn't able to handle the pattern.

With every new release `mypy` gets better at handling complex code. In addition to fixing all the previously accepted but now-failing patterns, we should also revisit all `type: ignore` comments to see if they are still needed. Fortunately, we don't need to do this manually: by adding `warn_unused_ignores = True` to the configuration, `mypy` will error out whenever it encounters a `type: ignore` that is no longer needed.
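An illustrative example of what the setting catches (the function itself is made up):

```python
from typing import List, Optional

def first_or_none(items: List[str]) -> Optional[str]:
    # Suppose an older mypy needed a suppression here but a newer one does
    # not.  With `warn_unused_ignores = True` in the config, mypy reports
    # the stale comment so it can be deleted.
    return items[0] if items else None  # type: ignore
```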

Pull Request resolved: https://github.com/pytorch/pytorch/pull/60006

Reviewed By: jbschlosser, malfet

Differential Revision: D29133237

Pulled By: albanD

fbshipit-source-id: 41e82edc5cd5affa7ccedad044b59b94dad4425a
2021-06-18 07:23:31 -07:00
David Reiss
68df4d40d2 show_pickle/model_dump: Handle invalid UTF-8 in pickles (#57661)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57661

The Pickle "specification" (pickletools.py) states that the argument to
a BINUNICODE opcode must be UTF-8 encoded.  However, if a PyTorch custom
class returns a non-UTF-8 std::string from its pickle method, the
libtorch Pickler will write it to the output pickle without complaining.
Python's _Unpickler (the Python implementation of Unpickler) always
throws an exception when trying to deserialize these invalid pickles.

We still want to be able to dump these pickle files.  Update
DumpUnpickler to create its own opcode dispatch table (initialized as a
clone of the _Unpickler dispatch table) and patch in a custom function
for the BINUNICODE op.  We try to emulate the default behavior, but any
UnicodeDecodeError is caught and replaced with a dummy object.  This
could violate the assumptions of a user that expects a str in that
position, so we disable this behavior by default.

Update model_dump to recognize this special object and allow it to be
rendered.
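A rough sketch of the dispatch-table patch described above. It leans on CPython's private `pickle._Unpickler` internals, and the `InvalidUtf8String` stand-in is made up, so treat it as an illustration rather than the actual DumpUnpickler code:

```python
import pickle
import struct


class InvalidUtf8String:
    """Dummy stand-in for a BINUNICODE payload that is not valid UTF-8."""

    def __init__(self, raw: bytes):
        self.raw = raw


class LenientUnpickler(pickle._Unpickler):
    # Clone the parent dispatch table so other unpicklers are unaffected.
    dispatch = dict(pickle._Unpickler.dispatch)

    def load_binunicode(self):
        (length,) = struct.unpack("<I", self.read(4))
        data = self.read(length)
        try:
            obj = data.decode("utf-8", "surrogatepass")
        except UnicodeDecodeError:
            obj = InvalidUtf8String(data)  # replace with a dummy object
        self.append(obj)

    dispatch[pickle.BINUNICODE[0]] = load_binunicode
```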

Test Plan: Dumped and viewed a model with an invalid string in an object state.

Reviewed By: malfet

Differential Revision: D28531392

Pulled By: dreiss

fbshipit-source-id: ab5aea20975a0ef53ef52a880deaa2c5a626e4a2
2021-06-04 19:42:25 -07:00
David Reiss
a91375432a model_dump: Accept variable-length debug info (#57660)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57660

Ignore trailing elements so we're compatible with both old and new
models.
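For illustration only (the field names are hypothetical), the parsing side simply unpacks the known prefix of each record and drops anything newer formats append:

```python
def parse_debug_record(record):
    # Older models emit exactly the fields we know about; newer ones may
    # append more.  Keep the known prefix and ignore the rest.
    byte_offset, source_range, *_extra = record
    return byte_offset, source_range
```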

Test Plan: Dumped an old model.  Unit test.

Reviewed By: malfet

Differential Revision: D28531391

Pulled By: dreiss

fbshipit-source-id: 197a55ab0e6a7d8e25cbee83852e194afacc988e
2021-05-19 12:25:27 -07:00
David Reiss
ab1fdbefe1 model_dump: Use DumpUnpickler.load instead of .dump (#57659)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57659

Faster, since we don't do an automatic pprint; also shorter, simpler code.

Test Plan: Dumped some models.

Reviewed By: malfet

Differential Revision: D28531398

Pulled By: dreiss

fbshipit-source-id: 47f1f646d4576af9f7e680933e0512f616dab5c0
2021-05-19 12:25:25 -07:00
David Reiss
53078924ad model_dump: Add a section that summarizes tensor memory usage (#57658)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57658

Since there is no Python change here and we only do the analysis when
rendering the open section, this should have no impact on page size or
load time!  (Well, a constant impact on page size due to the added
code.)  Before I made it lazy, I observed that it increased load time by
over 100ms for a large model.

Test Plan: Dumped a CUDA model and saw the size summary.

Reviewed By: malfet

Differential Revision: D28531394

Pulled By: dreiss

fbshipit-source-id: f77012b7bab069de861a4ba23486c665e1306aa0
2021-05-19 12:25:23 -07:00
David Reiss
ef4e6036bc model_dump: Handle dict rendering (#57657)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57657

Test Plan: Clicked around a model with some dicts in it.

Reviewed By: malfet

Differential Revision: D28531397

Pulled By: dreiss

fbshipit-source-id: 069690f147e91eadd76fec5f5ca4eec057abcb98
2021-05-19 12:25:21 -07:00
David Reiss
72ff3163bd model_dump: Handle torch.device objects (#57656)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57656

This came up when dumping a CUDA model.

Test Plan: Dumped a CUDA model.

Reviewed By: malfet

Differential Revision: D28531396

Pulled By: dreiss

fbshipit-source-id: fe0e94248c8085a8b760d253ba0b517f153b3442
2021-05-19 12:25:19 -07:00
David Reiss
a380575f5b model_dump: Refactor renderTensor into a helper method (#57655)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57655

Now lots of code is shared between tensor and qtensor rendering.  The net
line count is actually +1, but it should result in savings if/when
we implement some of those TODOs.

Test Plan: Clicked around in Chrome.

Reviewed By: malfet

Differential Revision: D28531395

Pulled By: dreiss

fbshipit-source-id: 190a04ed587b54d27f3410246763cd636c0634be
2021-05-19 12:25:17 -07:00
David Reiss
3ff76af23c model_dump: Implement "Hider" properly (#57654)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57654

I learned how to use children in React/Preact. :)  Now it's not
necessary to give every hidable section its own id and synchronize the
"shown=false" with "style='display:none;'".

This also means that the hidden elements aren't rendered to the DOM
unless the hider is open.

Test Plan: Clicked around in Chrome.

Reviewed By: malfet

Differential Revision: D28531393

Pulled By: dreiss

fbshipit-source-id: bc86c823ae4b7e80c000f50c5429d89dff6ae64d
2021-05-19 12:23:59 -07:00
Sam Estep
2e26976ad3 Disallow versionless Python shebangs (#58275)
Summary:
Some machines don't have a versionless `python` on their PATH, which breaks these existing shebangs.

I'm assuming that all the existing versionless `python` shebangs are meant to be `python3` and not `python2`; please let me know if my assumption was incorrect for any of these.
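As a sketch, the fix amounts to pinning the interpreter version in the shebang (the script body here is just filler):

```python
#!/usr/bin/env python3
# A versionless "#!/usr/bin/env python" breaks on machines whose PATH has no
# plain `python`; spelling out python3 avoids that.
print("running under an explicit python3 interpreter")
```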

Pull Request resolved: https://github.com/pytorch/pytorch/pull/58275

Test Plan: CI.

Reviewed By: zhouzhuojie

Differential Revision: D28428143

Pulled By: samestep

fbshipit-source-id: 6562be3d12924db72a92a0207b060ef740f61ebf
2021-05-14 08:26:02 -07:00
Kimish Patel
f4a921600a [PyTorch, Mobile] Serialization format change for source range (#54284)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54284

In order to bring mobile deployment, via the lite interpreter, to feature
parity with JIT with respect to model-level debug information, we must make
model-level debug information available to the mobile runtime.
At the moment, model-level debug information is stored in SourceRange,
which associates nodes of the graph with where they come from in the
original Python source code.
This information is serialized as part of debug_pkl and deserialized
when JIT loads the model and reads the model code.
In the lite interpreter, we do not have access to all the functionality of
JIT, and hence we cannot load the model in the same way JIT does, by reading
the code, constructing the module hierarchy, and building graphs for the
corresponding module methods, etc. Instead, in the lite interpreter, only
the bytecode corresponding to the compiled graph (Code) is saved.
Thus, in order to annotate ops in the bytecode with the equivalent
SourceRange information, we do the following:
1. During model serialization, we create a unique tag for each source
range of the model.
2. Create a map of <SourceRange, tag>.
3. During debug_pkl serialization we save the tag along with the
SourceRange, on top of the byte offset.
4. During bytecode generation, the methods of the top module are
lowered. During this process methods are inlined. In the inlined graph,
when a node of the graph is lowered to bytecode, we query the node's
source range and look it up against the map.
5. The resulting source range tag is serialized in module_debug_info.
6. During model deserialization, we read all the debug_pkl records in
the archive and create a map of <tag, SourceRange>.
7. This map can be used to find source code information.

During mobile runtime:
1. We read all the debug_pkl records and create a <tag=debug_handle,
SourceRange> map.
   1.1 This map, MobileDebugInfo, is a member of the mobile Module.
2. The interpreter catches appropriate exceptions, sets the thread-local
debug handle, and rethrows the exception.
3. In Function's run method we catch the exception and query the current
debug handle where the exception happened.
4. Query MobileDebugInfo with the debug handle to retrieve the source range
and augment the error with source range info.

This information is still incomplete as it does not contain the entire
callstack.

In the following diffs we will serialize InlinedCallStack directly.

Note that compilation is gated by the SYMBOLICATE_MOBILE_DEBUG_HANDLE macro,
so that mobile builds can avoid building MobileDebugInfo, source range,
and the source range pickler/unpickler. Later we will add a path where, if
building without debug support, the stack trace will contain only debug
handles. They can be symbolicated later.
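A rough Python sketch of the two maps described above (the real implementation is C++ serialization code; names are illustrative):

```python
# Save side: assign a stable tag to each unique source range so debug_pkl can
# record (tag, SourceRange) pairs on top of the byte offset.
def build_tag_map(source_ranges):
    tag_for_range = {}
    for source_range in source_ranges:
        if source_range not in tag_for_range:
            tag_for_range[source_range] = len(tag_for_range)
    return tag_for_range


# Load / mobile-runtime side: invert the debug_pkl records into a
# <tag (debug handle), SourceRange> map so an exception's debug handle can be
# resolved back to source information.
def build_mobile_debug_info(debug_pkl_records):
    return {tag: source_range for tag, source_range in debug_pkl_records}
```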

Test Plan:
Ported a bunch of source range tests from test_jit.py. Added one more test
in test_lite_interpreter.py.

Imported from OSS

Reviewed By: raziel

Differential Revision: D27174722

fbshipit-source-id: a7b7c6088ce16dec37e823c7fefa4f0b61047e12
2021-05-04 09:19:27 -07:00
David Reiss
89377e3e45 model_dump tool for model inspection (#56868)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56868

See __init__.py for a summary of the tool.
The following sections are present in this initial version
- Model Size.  Show the total model size, as well as a breakdown by
  stored files, compressed files, and zip overhead.  (I expect this
  breakdown to be a bit more useful once data.pkl is compressed.)
- Model Structure.  This is basically the output of
  `show_pickle(data.pkl)`, but as a hierarchical structure.
  Some structures cause this view to crash right now, but it can be
  improved incrementally.
- Zip Contents.  This is basically the output of `zipinfo -l`.
- Code.  This is the TorchScript code.  It's integrated with a blame
  window at the bottom, so you can click "Blame Code", then click a bit
  of code to see where it came from (based on the debug_pkl).  This
  currently doesn't render properly if debug_pkl is missing or
  incomplete.
- Extra files (JSON).  JSON dumps of each json file under /extra/, up to
  a size limit.
- Extra Pickles.  For each .pkl file in the model, we safely unpickle it
  with `show_pickle`, then render it with `pprint` and include it here
  if the size is not too large.  We aren't able to install the pprint
  hack that the show_pickle CLI uses, so we get one-line rendering for
  custom objects, which is not very useful.  Built-in types look fine,
  though.  In particular, bytecode.pkl seems to look fine (and we
  hard-code that file to ignore the size limit).

I'm checking in the JS dependencies to avoid a network dependency at
runtime.  They were retrieved from the following URLs, then passed
through a JS minifier:
  https://unpkg.com/htm@3.0.4/dist/htm.module.js?module
  https://unpkg.com/preact@10.5.13/dist/preact.module.js?module

Test Plan:
Manually ran on a few models I had lying around.
Mostly tested in Chrome, but I also poked around in Firefox.

Reviewed By: dhruvbird

Differential Revision: D28020849

Pulled By: dreiss

fbshipit-source-id: 421c30ed7ca55244e9fda1a03b8aab830466536d
2021-04-28 07:33:10 -07:00