Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62030
Remove dtype tracking from the Python Storage interface, remove all the different `<type>Storage` classes except for `ByteStorage`, and update serialization accordingly, while maintaining as much forward and backward compatibility (FC/BC) as possible.
Fixes https://github.com/pytorch/pytorch/issues/47442
* **THE SERIALIZATION FORMAT IS FULLY FC/BC.** We worked very hard to make sure this is the case. We will probably want to break FC at some point to make the serialization structure of tensors make more sense, but not today.
* There is now only a single torch.ByteStorage class. Methods like `Tensor.set_` no longer check that the dtype of storage is appropriate.
* As we no longer know what the dtype of a storage is, we've **removed** the `size` method from Storage, replacing it with `nbytes`. This helps catch otherwise silent errors where the number of elements is confused with the number of bytes.
* `Storage._new_shared` takes an `nbytes` kwarg and rejects the previous positional-only calls. `Storage._new_with_file` and `_set_from_file` require explicit element size arguments.
* It's no longer possible to convert storages to different types using the float/double/etc. methods. Instead, do the conversion using a tensor (see the sketch after this list).
* It's no longer possible to allocate a typed storage directly using the FloatStorage/DoubleStorage/etc. constructors. Instead, construct a tensor and extract its storage. The classes still exist, but they are used purely for unpickling.
* The preexisting serialization format stores dtype with storage, and in fact this dtype is used to determine the dtype of the tensor overall.
To accommodate this case, we introduce a new TypedStorage concept that exists only during unpickling time which is used to temporarily store the dtype so we can construct a tensor. **If you overrode the handling of pickling/unpickling, you MUST add handling for TypedStorage** or your serialization code will degrade to standard file-based serialization.
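A minimal sketch of the replacement workflow described above (variable names are illustrative; behavior as per this PR):
```python
import torch

t = torch.tensor([1.0, 2.0, 3.0])   # float32 tensor
s = t.storage()                      # byte-level storage, no dtype attached
print(s.nbytes())                    # 12: 3 elements * 4 bytes; size() no longer exists

# dtype conversions now go through a tensor rather than storage methods:
d = t.to(torch.float64).storage()    # storage backing a float64 copy (24 bytes)
```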
Original pull request: https://github.com/pytorch/pytorch/pull/59671
Reviewed By: soulitzer, ngimel
Differential Revision: D29466819
Pulled By: ezyang
fbshipit-source-id: 4a14e5d3c2b08e06e558683d97f7378a3180b00e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59956
Issue #50175. Two things need to be checked that are currently missing:
1. Overload declarations should always have a single `pass` statement as the body.
2. An implementation should always be provided for declarations that don't carry the
   torch.jit._overload decorator, so we need to check whether the function body
   being compiled is preceded by decorated overload declarations (see the sketch below).
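For context, a minimal sketch of the overload pattern these checks enforce (function names are illustrative):
```python
import torch

@torch.jit._overload
def double_up(x: int) -> int:
    pass  # declaration body must be a single `pass` (check 1)

@torch.jit._overload
def double_up(x: float) -> float:
    pass

def double_up(x):
    # undecorated implementation must follow the declarations (check 2)
    return x + x

@torch.jit.script
def use(a: int, b: float):
    return double_up(a), double_up(b)
```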
Test Plan:
python test/test_jit.py TestScript.test_function_overloads
Imported from OSS
Reviewed By: gmagogsfm
Differential Revision: D29106555
fbshipit-source-id: 2d9d7df2fb51ab6db0e1b726f9644e4cfbf733d6
Summary:
Fixes https://github.com/pytorch/pytorch/issues/42163.
## 🔥 Pitch
Currently, the binary outputs produced by `torch.save()` are non-deterministic (as pointed out in https://github.com/pytorch/pytorch/issues/42163). This means that running a simple snippet that creates a tensor (or a model) twice will produce output files with a different `md5` sum.
**Why does this occur?**
The cause of this behavior is that `obj._cdata` is used to identify a tensor and is written to the file, but the `_cdata` attribute is of course non-deterministic:
a80b215a9a/torch/serialization.py (L416)
**Why does this matter?**
Reproducibility is essential for many Machine Learning projects.
For instance, when using [`dvc`](https://dvc.org/) you would expect that if none of the dependencies of a stage of an ML pipeline has changed, then running the same stage again will produce the same binary output. For the reasons explained above, this was not the case with `torch`, so this PR tries to fix this issue.
## 📌 Content of this PR
### What changes?
- The `persistent_id()` function now returns a deterministic value, rather than `obj._cdata` (which varies between runs).
- As a consequence, `torch.save(obj, "output.pt")` produces a deterministic output, i.e. the `md5` hash of `output.pt` is deterministic. See **Test 1** and **Test 2** below.
### What does not change?
- If an `obj` contains several tensors that share the same underlying data (e.g. they are views of the same tensor), the `obj_key` returned by `persistent_id()` is still going to be the same for all of them.
- As a consequence, serialization optimizes disk storage by storing only necessary tensors, rather than writing one tensor per view. See **Test 3** below.
## How to test
### Test 1: snippet from https://github.com/pytorch/pytorch/issues/42163
Consider the following `snippet_1.py` (from https://github.com/pytorch/pytorch/issues/42163).
```python
import hashlib
import torch
def get_sha256_hash(file: str, chunk_size: int = 4096) -> str:
    hasher = hashlib.sha256()
    with open(file, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()

file = "tensor.pt"
hashes = []
for _ in range(5):
    obj = torch.ones(1)
    torch.save(obj, file)
    hashes.append(get_sha256_hash(file)[:8])
    del obj
hash = hashes[0]
assert all(other == hash for other in hashes[1:])
print(hash)
```
On `master` you obtain an error
```bash
$ python snippet_1.py
Traceback (most recent call last):
File "save_tensor.py", line 84, in <module>
assert all(other == hash for other in hashes[1:])
AssertionError
```
while on this PR branch you should get the following consistent behaviour:
```bash
$ for run in {1..2}; do python snippet_1.py; done
600a83cb
600a83cb
```
### Test 2: Deterministic save of `Tensor` and `nn.Module` instances
Consider the following `snippet_2.py`
```python
import torch
torch.manual_seed(0)
x = torch.tensor([8., 8., 5., 0.])
torch.save(x, "out_tensor.pt")
model = torch.nn.Sequential(
    torch.nn.Linear(3, 1),
    torch.nn.Flatten(0, 1)
)
torch.save(model, "out_model.pt")
```
On the `master` branch, the `md5` hashes of `out_tensor.pt` and `out_model.pt` are non-deterministic; for instance you may get
```bash
$ for run in {1..2}; do python snippet_2.py; md5 out_*pt; done
MD5 (out_model.pt) = 92dca4a310b691e893f3cb41d64d5af1
MD5 (out_tensor.pt) = a4ef290583f50a9c203a42d0cfc078af
MD5 (out_model.pt) = de3cb9791a66af8aed77ed7224bd1d5c
MD5 (out_tensor.pt) = 3b8a6009d3a0be5b9dd94152dcc0c7cb
```
while on this PR branch you should get the following consistent behaviour:
```bash
$ for run in {1..2}; do python snippet_2.py; md5 out_*pt; done
MD5 (out_model.pt) = dba75fd50a190e4e7fa89b7a2477bab7
MD5 (out_tensor.pt) = 029f52f0706d6c813cc796d3cdcd3eb0
MD5 (out_model.pt) = dba75fd50a190e4e7fa89b7a2477bab7
MD5 (out_tensor.pt) = 029f52f0706d6c813cc796d3cdcd3eb0
```
### Test 3: Views of the same tensor are not re-written to file
Consider the following `snippet_3.py`.
```python
import torch
torch.manual_seed(0)
x = torch.rand(1_000, 1_000)
y = x.T
z = x.view(1_000_000, 1)
torch.save({"x": x}, "out_tensor_x.pt")
torch.save({"x": x, "y": y, "z": z}, "out_tensor_xyz.pt")
```
Both on the `master` branch and on this PR branch you should get two output files with the same size:
```bash
$ python snippet_3.py && du -sh out_tensor*pt && md5 out_*pt
3.8M out_tensor_x.pt
3.8M out_tensor_xyz.pt
MD5 (out_tensor_x.pt) = eda516d9156177b27bdc2a75c9064d9b
MD5 (out_tensor_xyz.pt) = 333b869f5b93ced7b8649ab1571eb8e3
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/57536
Reviewed By: bdhirsh
Differential Revision: D28304728
Pulled By: ailzhang
fbshipit-source-id: 49788e566a3cd2c6c36dc801e6bdd8f42c9459cb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53424
Fixes https://github.com/pytorch/pytorch/issues/24807 and supersedes the stale https://github.com/pytorch/pytorch/issues/25093 (Cc Microsheep). If you now run the reproduction
```python
import torch
if __name__ == "__main__":
t = torch.tensor([1, 2, 3], dtype=torch.float64)
```
with `pylint==2.6.0`, you get the following output
```
test_pylint.py:1:0: C0114: Missing module docstring (missing-module-docstring)
test_pylint.py:4:8: E1101: Module 'torch' has no 'tensor' member; maybe 'Tensor'? (no-member)
test_pylint.py:4:38: E1101: Module 'torch' has no 'float64' member (no-member)
```
Now `pylint` doesn't recognize `torch.tensor` at all, even though it is promoted in the stub. Given that it also doesn't recognize `torch.float64`, I think fixing this is out of scope for this PR.
---
## TL;DR
This is BC-breaking only for users that rely on unintended behavior. Since `torch/__init__.py` imported `torch/tensor.py`, that module was registered in `sys.modules`; `torch/__init__.py` then overwrote `torch.tensor` with the actual function. As a result, `import torch.tensor as tensor` did not fail, but returned the function rather than the module. Users that rely on this import need to change it to `from torch import tensor`, as illustrated below.
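A quick sketch of the behavior change (assuming this PR is applied):
```python
# Previously this import succeeded by accident and bound the *function*
# torch.tensor, not a module; after this PR it raises ModuleNotFoundError:
# import torch.tensor as tensor

# Supported spelling, working both before and after this change:
from torch import tensor

t = tensor([1, 2, 3])
```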
Reviewed By: zou3519
Differential Revision: D26223815
Pulled By: bdhirsh
fbshipit-source-id: 125b9ff3d276e84a645cd7521e8d6160b1ca1c21
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53139
ghstack-source-id: 123090847
Test Plan:
Sandcastle
Also explicitly verified that this test passes after incorporating the changes from D26656767 and adding a `torch.tensor` -> `torch._tensor` mapping to the `load_module_mapping` dict: `buck test mode/dev //pandora/utils/tests:manifold_utils_tests -- --exact 'pandora/utils/tests:manifold_utils_tests - test_load_dataset_valid_dir (pandora.utils.tests.manifold_utils_tests.TestManifoldUtils)'`
With just D26656767, that test fails. With D26656767 + the changes in this diff, that test passes.
Reviewed By: ezyang
Differential Revision: D26760600
fbshipit-source-id: cb16493b858a358acf468d755740aa272ae9d363
Summary:
This PR addresses [a two-year-old TODO in `test/test_type_hints.py`](12942ea52b/test/test_type_hints.py (L21-L22)) by replacing most of the body of our custom `get_examples_from_docstring` function with [a function from Python's built-in `doctest.DocTestParser` class](https://docs.python.org/3/library/doctest.html#doctest.DocTestParser.get_examples). This mostly made the parser more strict, catching a few errors in existing doctests (see the sketch after this list):
- missing `...` in multiline statements
- missing space after `>>>`
- unmatched closing parenthesis
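For reference, a minimal sketch of the built-in parser that now does the heavy lifting (the docstring content here is illustrative):
```python
import doctest

docstring = """
Example::

    >>> x = 1 + 1
    >>> print(x)
    2
"""

parser = doctest.DocTestParser()
for example in parser.get_examples(docstring):
    # each Example carries the source statement and its expected output
    print(repr(example.source), repr(example.want))
```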
Also, as shown by [the resulting diff of the untracked `test/generated_type_hints_smoketest.py` file](https://pastebin.com/vC5Wz6M0) (also linked from the test plan below), this introduces a few incidental changes as well:
- standalone comments are no longer preserved
- indentation is now visually correct
- [`example_torch_promote_types`](4da9ceb743/torch/_torch_docs.py (L6753-L6772)) is now present
- an example called `example_torch_tensor___array_priority__` is added, although I can't tell where it comes from
- the last nine lines of code from [`example_torch_tensor_align_as`](5d45140d68/torch/_tensor_docs.py (L386-L431)) are now present
- the previously-misformatted third line from [`example_torch_tensor_stride`](5d45140d68/torch/_tensor_docs.py (L3508-L3532)) is now present
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50596
Test Plan:
Check out the base commit, typecheck the doctests, and save the generated file:
```
$ python test/test_type_hints.py TestTypeHints.test_doc_examples
$ cp test/generated_type_hints_smoketest.py /tmp
```
Then check out this PR, do the same thing, and compare:
```
$ python test/test_type_hints.py TestTypeHints.test_doc_examples
$ git diff --no-index {/tmp,test}/generated_type_hints_smoketest.py
```
The test should succeed, and the diff should match [this paste](https://pastebin.com/vC5Wz6M0).
Reviewed By: walterddr
Differential Revision: D25926245
Pulled By: samestep
fbshipit-source-id: 23bc379ff438420e556263c19582dba06d8e42ec
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486
Remove code for Python 3.5 and lower.
There's more that could be removed/modernised, but this sticks mainly to redundant version checks to keep the diff/PR smaller.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579
Reviewed By: zou3519
Differential Revision: D24453571
Pulled By: ezyang
fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e
Summary:
No issue was opened for this (that I can see) and it was a fairly small change, so just opening this PR directly!
The docstring for `torch.load` had some parameter descriptions containing typos like ``:meth`readline` `` instead of ``:meth:`readline` ``. This PR corrects that :)
<img width="811" alt="image" src="https://user-images.githubusercontent.com/30357972/102128240-7fa33500-3e45-11eb-8f54-ce5ca7bba96c.png">
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49350
Reviewed By: glaringlee
Differential Revision: D25543041
Pulled By: mrshenli
fbshipit-source-id: 10db04d58dd5b07777bdd51d3fcb3c45dea4c84b
Summary:
As reported in https://github.com/pytorch/pytorch/issues/46020, something seems to go wrong with the storage._write_file method used with a BytesIO and a GPU buffer.
Given that we were going to create the intermediate buffer (currently via BytesIO) anyway, we might as well use `storage.cpu()` to move the storage to the CPU first. This appears to work better.
This is a hot fix, and further investigation is highly desirable; in particular, I don't have a reproducing test to show.
Fixes https://github.com/pytorch/pytorch/issues/46020
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46028
Reviewed By: bdhirsh
Differential Revision: D24194370
Pulled By: gchanan
fbshipit-source-id: 99d463c4accb4f1764dfee42d7dc98e7040e9ed3
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46036
Previously, this function didn't do error-bounds checking on the GetItem (GET_ITEM) calls, which led to issues like https://github.com/pytorch/pytorch/issues/46020.
A better solution would be to use pybind, but given that writing the file dominates the cost of bounds checking, this is strictly better than before.
Test Plan: Imported from OSS
Reviewed By: mruberry
Differential Revision: D24228370
Pulled By: gchanan
fbshipit-source-id: f5d0a3d21ff12b4380beefe1e9954fa81ea2f567
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45015
torch.package allows you to write packages of code, pickled python data, and
arbitrary binary and text resources into a self-contained package.
torch.package.PackageExporter writes the packages and
torch.package.PackageImporter reads them.
The importers can load this code in a hermetic way, such that code is loaded
from the package rather than the normal python import system. This allows
for the packaging of PyTorch model code and data so that it can be run
on a server or used in the future for transfer learning.
The code contained in packages is copied file-by-file from the original
source when it is created, and the file format is a specially organized
zip file. Future users of the package can unzip the package, and edit the code
in order to perform custom modifications to it.
The importer for packages ensures that code in the module can only be loaded from
within the package, except for modules explicitly listed as external using :meth:`extern_module`.
The file `extern_modules` in the zip archive lists all the modules that a package externally depends on.
This prevents "implicit" dependencies where the package runs locally because it is importing
a locally-installed package, but then fails when the package is copied to another machine.
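A rough usage sketch under these semantics (the model object and resource names are made up for illustration; method names follow the description above and the early API, and may differ in later releases):
```python
import torch
from torch.package import PackageExporter, PackageImporter

my_model = torch.nn.Linear(3, 1)  # stand-in for a real model

# Export code, pickled data, and resources into one self-contained zip package.
exporter = PackageExporter("my_package.pt")
exporter.extern_module("numpy")  # allow numpy to resolve via the normal import system
exporter.save_text("config", "config.txt", "lr=0.01")
exporter.save_pickle("model", "model.pkl", my_model)
exporter.close()

# Import hermetically: code is loaded from the package, not from site-packages.
importer = PackageImporter("my_package.pt")
loaded = importer.load_pickle("model", "model.pkl")
```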
Test Plan: Imported from OSS
Reviewed By: SplitInfinity
Differential Revision: D23824337
Pulled By: zdevito
fbshipit-source-id: 1247c34ba9b656f9db68a83e31f2a0fbe3bea6bd
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/43317
The previous version returned the path with a prefix, so a subsequent `getRecord` call would fail.
There's only one place in PyTorch codebase that uses this function (introduced in https://github.com/pytorch/pytorch/pull/29339 ) and it's unlikely that anyone else is using it - it's not a public API anyway.
Test Plan: unittest
Reviewed By: houseroad
Differential Revision: D23235241
fbshipit-source-id: 6f7363e6981623aa96320f5e39c54e65d716240b
Summary:
solves most of gh-38011 in the framework of solving gh-32703.
These should only be formatting fixes; I did not try to fix grammar and syntax.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41068
Differential Revision: D22411919
Pulled By: zou3519
fbshipit-source-id: 25780316b6da2cfb4028ea8a6f649bb18b746440
Summary:
Move the `Storage` class from `__init__.pyi.in` to `types.py` and make it a protocol, since it is not a real class.
Expose the `PyTorchFileReader` and `PyTorchFileWriter` native classes.
Ignore function attributes, as there is not yet a good way to type-annotate those; see https://github.com/python/mypy/issues/2087.
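A minimal sketch of what the protocol approach looks like (the method names are illustrative, not the exact stub contents):
```python
from typing import Protocol

class Storage(Protocol):
    """Structural type: anything exposing these methods counts as a Storage."""

    def size(self) -> int: ...
    def element_size(self) -> int: ...
    def data_ptr(self) -> int: ...
```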
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40862
Differential Revision: D22344743
Pulled By: malfet
fbshipit-source-id: 95cdb6f980ee79383960f306223e170c63df3232
Summary:
BC NOTE:
This change makes it so modules saved with torch.jit.save in PyTorch 1.6 can be loaded by previous versions of PyTorch unless they use torch.div or (soon) torch.full. It also lets tensors saved using torch.save be loaded by previous versions. So this is the opposite of BC-breaking, but I'm using that label to highlight this issue since we don't have a "BC-improving" label.
PR NOTE:
When an operator's semantics change in PyTorch we want to do two things:
1) Preserve the semantics of older serialized Torchscript programs that use the operator
2) Ensure the new semantics are respected
Historically, this meant writing a Versioned Symbol that would remap older versions of the operator into current PyTorch code (1), and bumping the produced file format version (2). Unfortunately, bumping the produced file format version is a nuclear option for ensuring semantics are respected, since it also prevents older versions of PyTorch from loading anything (even tensors!) from newer versions.
Dynamic versioning addresses the nuclear consequences of bumping the produced file format version by only bumping it when necessary. That is, when an operator with changed semantics is detected in the serialized Torchscript. This will prevent Torchscript programs that use the changed operator from loading on earlier versions of PyTorch, as desired, but will have no impact on programs that don't use the changed operator.
Note that this change is only applicable when using torch.jit.save and torch.jit.load. torch.save pickles the given object using pickle (by default), which saves a reference to a function's Python definition directly.
No new tests for this behavior are added since the existing tests for versioned division in test_save_load already validate that models with div are loaded correctly at version 4.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40279
Reviewed By: dzhulgakov
Differential Revision: D22168291
Pulled By: mruberry
fbshipit-source-id: e71d6380e727e25123c7eedf6d80e5d7f1fe9f95
Summary:
I added the following to the docs:
1. `torch.save`
   1. Added doc for the `_use_new_zipfile_serialization` argument.
   2. Added a note that the file extension does not matter when saving.
   3. Added an example showing the use of the above argument along with `pickle_protocol=5` (see the sketch after this list).
2. `torch.split`
   1. Added an example showing the use of the function.
3. `torch.squeeze`
   1. Added a warning for the batch_size=1 case.
4. `torch.set_printoptions`
   1. Changed the docs of the `sci_mode` argument from
```
sci_mode: Enable (True) or disable (False) scientific notation. If
None (default) is specified, the value is defined by `_Formatter`
```
to
```
sci_mode: Enable (True) or disable (False) scientific notation. If
None (default=False) is specified, the value is defined by
`torch._tensor_str._Formatter`.
```
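For illustration, a small sketch of the documented `torch.save` usage (file name is arbitrary):
```python
import pickle
import torch

x = torch.randn(3)
# the extension does not matter; protocol 5 requires Python 3.8+,
# where it equals pickle.HIGHEST_PROTOCOL
torch.save(x, "checkpoint.data",
           _use_new_zipfile_serialization=True,
           pickle_protocol=pickle.HIGHEST_PROTOCOL)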
Pull Request resolved: https://github.com/pytorch/pytorch/pull/39303
Differential Revision: D21904504
Pulled By: zou3519
fbshipit-source-id: 92a324257d09d6bcfa0b410d4578859782b94488
Summary:
Instead of copying to a buffer and then setting a tensor's storage with that buffer, create a storage directly from the file.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36362
Pulled By: driazati
Differential Revision: D21889537
fbshipit-source-id: edbd430073c2bbf52332fe7b3b2590e7d936dedf
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35615
Python 2 has reached end-of-life and is no longer supported by PyTorch.
Now we can clean up a lot of cruft that we put in place to support it.
These changes were all done manually, and I skipped anything that seemed
like it would take more than a few seconds, so I think it makes sense to
review it manually as well (though using side-by-side view and ignoring
whitespace change might be helpful).
Test Plan: CI
Differential Revision: D20842886
Pulled By: dreiss
fbshipit-source-id: 8cad4e87c45895e7ce3938a88e61157a79504aed
Summary:
This PR implements the following linear algebra algorithms for low-rank matrices (a short usage sketch follows the checklist):
- [x] Approximate `A` as `Q Q^H A` - using Algorithm 4.4 from [Halko et al, 2009](http://arxiv.org/abs/0909.4061).
+ exposed as `torch.lowrank.get_approximate_basis(A, q, niter=2, M=None) -> Q`
+ [x] dense matrices
+ [x] batches of dense matrices
+ [x] sparse matrices
+ [x] documentation
- [x] SVD - using Algorithm 5.1 from [Halko et al, 2009](http://arxiv.org/abs/0909.4061).
+ uses `torch.lowrank.get_approximate_basis`
+ exposed as `torch.svd_lowrank(A, q=6, niter=2, M=None) -> (U, S, V)`
+ [x] dense matrices
+ [x] batches of dense matrices
+ [x] sparse matrices
+ [x] documentation
- [x] PCA - using `torch.svd_lowrank`
+ uses `torch.svd_lowrank`
+ exposed as `torch.pca_lowrank(A, center=True, q=None, niter=2) -> (U, S, V)`
+ [x] dense matrices
+ [x] batches of dense matrices
+ [x] sparse matrices, uses non-centered sparse matrix algorithm
+ [x] documentation
- [x] generalized eigenvalue solver using the original LOBPCG algorithm [Knyazev, 2001](https://epubs.siam.org/doi/abs/10.1137/S1064827500366124)
+ exposed as `torch.lobpcg(A, B=None, k=1, method="basic", ...)`
+ [x] dense matrices
+ [x] batches of dense matrices
+ [x] sparse matrices
+ [x] documentation
- [x] generalized eigenvalue solver using robust LOBPCG with orthogonal basis selection [Stathopoulos, 2002](https://epubs.siam.org/doi/10.1137/S1064827500370883)
+ exposed as `torch.lobpcg(A, B=None, k=1, method="ortho", ...)`
+ [x] dense matrices
+ [x] batches of dense matrices
+ [x] sparse matrices
+ [x] documentation
- [x] generalized eigenvalue solver using the robust and efficient LOBPCG Algorithm 8 from [Duersch et al, 2018](https://epubs.siam.org/doi/abs/10.1137/17M1129830) that switches to orthogonal basis selection automatically
+ the "ortho" method improves iterations so rapidly that in the current test cases it does not make sense to use the basic iterations at all. If users will have matrices for which basic iterations could improve convergence then the `tracker` argument allows breaking the iteration process at user choice so that the user can switch to the orthogonal basis selection if needed. In conclusion, there is no need to implement Algorithm 8 at this point.
- [x] benchmarks
+ [x] `torch.svd` vs `torch.svd_lowrank`, see notebook [Low-rank SVD](https://github.com/Quansight/pearu-sandbox/blob/master/pytorch/Low-rank%20SVD.ipynb). In conclusion, the low-rank SVD is going to be useful only for large sparse matrices where the full-rank SVD will fail due to memory limitations.
+ [x] `torch.lobpcg` vs `scipy.sparse.linalg.lobpcg`, see notebook [LOBPCG - pytorch vs scipy](https://github.com/Quansight/pearu-sandbox/blob/master/pytorch/LOBPCG%20-%20pytorch%20vs%20scipy.ipynb). In conclusion, both implementations give the same results (up to numerical errors from different methods); the scipy lobpcg implementation is generally faster.
+ [x] On very small tolerance cases, `torch.lobpcg` is more robust than `scipy.sparse.linalg.lobpcg` (see `test_lobpcg_scipy` results)
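A brief usage sketch of the new entry points (shapes and parameters are chosen only for illustration):
```python
import torch

A = torch.randn(100, 50)

# approximate rank-6 SVD and PCA of a (possibly large or sparse) matrix
U, S, V = torch.svd_lowrank(A, q=6, niter=2)
U2, S2, V2 = torch.pca_lowrank(A, q=6, center=True)

# largest 3 eigenpairs of a symmetric positive-definite matrix via LOBPCG
B = A.T @ A + 50.0 * torch.eye(50)
eigvals, eigvecs = torch.lobpcg(B, k=3, method="ortho")
```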
Resolves https://github.com/pytorch/pytorch/issues/8049.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29488
Differential Revision: D20193196
Pulled By: vincentqb
fbshipit-source-id: 78a4879912424595e6ea95a95e483a37487a907e
Summary:
Fixes https://github.com/pytorch/pytorch/issues/32289
This has been fixed upstream as of Python 3.8.2. I think the easiest and least invasive way to ameliorate this is to catch the error condition and print a more informative error asking the user to update their Python version. It might be possible to buffer the data into memory and then read from memory, but that would be an invasive change and might cause memory exhaustion for very large models.
Suggestions for alternate fixes or ways to improve the error message wording are very welcome.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/33824
Differential Revision: D20131722
Pulled By: ezyang
fbshipit-source-id: a6e3fbf4bf7f9dcce5772b36f7a622cbf14b5ae4
Summary:
Stacked PRs
* #32958 - Make zip serialization the default
* **#32244 - Fix some bugs with zipfile serialization**
It includes the following changes:
* Split up tests so that we can test both serialization methods
* Loading something within a buffer doesn't work anymore, so those tests run only on the old serialization method (supporting it is possible, but it introduces a big slowdown since it requires a linear scan of the entire zipfile to find the magic number at the end)
* Call `readinto` on a buffer if possible instead of `read` + a copy
* Disable CRC-32 checks on read (there was some issue where miniz said the CRC was wrong but `zipinfo` and `unzip` said the zip file was fine)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32244
Pulled By: driazati
Reviewed By: eellison
Differential Revision: D19418935
fbshipit-source-id: df140854f52ecd04236225417d625374fd99f573
Summary:
For long format strings, it is better to use named fields. When building a dict, a literal is more readable and faster than the `dict()` constructor.
I always appreciate your efforts in creating the world's best frameworks.
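For instance, a tiny illustration of the dict point (not the actual diff):
```python
import timeit

# dict literal: built by the compiler in a single bytecode sequence
d1 = {"protocol": 2, "fix_imports": True}

# dict() constructor: needs a global name lookup plus a function call
d2 = dict(protocol=2, fix_imports=True)

print(timeit.timeit('{"a": 1, "b": 2}'))  # the literal is measurably faster
print(timeit.timeit('dict(a=1, b=2)'))
```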
Pull Request resolved: https://github.com/pytorch/pytorch/pull/31352
Differential Revision: D19191967
Pulled By: ngimel
fbshipit-source-id: 21f063b163b67de8cf9761a4db5991f74318e991
Summary:
This PR updates `torch::pickle_save` to use the new zipfile format introduced in #29232 and adds `torch::pickle_load` which can decode the zipfile format. Now that `torch.save/load` use this format as well (if the `_use_new_zipfile_serialization` flag is `True`), raw values saved in Python can be loaded in C++ and vice versa.
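A sanity-check sketch of the Python side of the roundtrip (the C++ side would use `torch::pickle_load`; the file name is arbitrary):
```python
import torch

data = {"weights": torch.randn(2, 2), "step": 7}
# with the zipfile flag, this file can also be decoded by torch::pickle_load in C++
torch.save(data, "data.pt", _use_new_zipfile_serialization=True)

restored = torch.load("data.pt")
```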
Fixes #20356
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30108
Pulled By: driazati
Differential Revision: D18607087
fbshipit-source-id: 067cdd5b1cf9c30ddc7e2e5021a8cceee62d8a14
Summary:
This PR looks for a `constants.pkl` file at the top level of a zip file
in `torch.load`. If found, it calls `torch.jit.load` instead and issues
a warning telling the user to call `torch.jit.load` directly.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29339
Differential Revision: D18611095
Pulled By: driazati
fbshipit-source-id: f070a02f6b5509054fc3876b3e8356bbbcc183e1