pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Supriya Rao	4db8ca1129	[quant] Create nn.quantized.dynamic.EmbeddingBag (#43088 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/43088 Create a quantized module that the user can use to perform embedding bag quantization. The module uses EmbeddingPackedParams to store the weights, which can be serialized/deserialized using TorchBind custom classes (C++ get/setstate code). A following PR will add support for `from_float` to convert from a float to a quantized module. Test Plan: python test/test_quantization.py TestDynamicQuantizedModule.test_embedding_bag_api Imported from OSS Reviewed By: vkuzo Differential Revision: D23167519 fbshipit-source-id: 029d7bb44debf78c4ef08bfebf267580ed94d033	2020-08-21 11:45:02 -07:00
chengjun	8d570bc708	Decouple DataParallel/DistributedDataParallel from CUDA (#38454 ) Summary: Decouple DataParallel/DistributedDataParallel from CUDA to support more device types. - Move torch/cuda/comm.py to torch/nn/parallel/comm.py with minor changes for common device support. torch.cuda.comm is kept as is for backward compatibility. - Provide common APIs to arbitrary device types without changing existing CUDA APIs in torch.cuda space. - Replace the torch.cuda calls in DataParallel/DistributedDataParallel with the new APIs. Related RFC: [https://github.com/pytorch/pytorch/issues/36160](https://github.com/pytorch/pytorch/issues/36160) Pull Request resolved: https://github.com/pytorch/pytorch/pull/38454 Differential Revision: D22051557 Pulled By: mrshenli fbshipit-source-id: 7842dad0e5d3ca0f6fb760bda49182dcf6653af8	2020-07-07 12:48:16 -07:00
Wojciech Baranowski	fcadca1bda	serialization: validate sparse tensors after loading (#34059 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/33439 This introduces torch._sparse_coo_tensor_unsafe(...) and torch._validate_sparse_coo_tensor_args(...) Pull Request resolved: https://github.com/pytorch/pytorch/pull/34059 Differential Revision: D22161254 Pulled By: ezyang fbshipit-source-id: 994efc9b0e30abbc23ddd7b2ec987e6ba08a8ef0	2020-06-30 22:31:21 -07:00
Jerry Zhang	c32fa465a5	Preserve Backward compatibility of models serialized before #31040 (#33796 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33796 Test Plan: Imported from OSS Differential Revision: D20109662 Pulled By: jerryzh168 fbshipit-source-id: 9bc936a59fd6dd1031fbf05eb90f98ae9677b936	2020-02-26 13:40:38 -08:00
Zhu, Haozhe	bd3c6e8e91	avoid large vector copy when query per_channel q_params (#31040 ) Summary: The quantizer uses std::vector to save per_channel scales and zero_points, but querying scales (zero_points) requires returning a tensor. This means tensors are initialized from std::vector, which costs a lot of time. So I changed the quantizer to save per_channel scales and zero_points as tensors directly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/31040 Differential Revision: D19701070 Pulled By: jerryzh168 fbshipit-source-id: 9043f16c44b74dd8289b8474e540171765a7f92a	2020-02-19 16:24:24 -08:00
Pearu Peterson	b7fb2b8862	Implement pickle support for sparse tensors and torch.layout instances (#27062 ) Summary: Resolves issue https://github.com/pytorch/pytorch/issues/16667 and https://github.com/OpenMined/PySyft/issues/2326 Pull Request resolved: https://github.com/pytorch/pytorch/pull/27062 Differential Revision: D17762932 Pulled By: ezyang fbshipit-source-id: dd99c1f4ac8eb2286eb55aa20ce973f60ce7b7e1	2019-10-04 08:09:32 -07:00
Ailing Zhang	0b79f77a4d	Serialize XLA Tensor (#27041 ) Summary: https://github.com/pytorch/pytorch/issues/25882 Reopening both PRs to master/v1.3.0 to save myself from rebase hell... Pull Request resolved: https://github.com/pytorch/pytorch/pull/27041 Differential Revision: D17676345 Pulled By: ailzhang fbshipit-source-id: 4784c9f8f0723cd3ff081af3336e672812aaad14	2019-10-01 15:05:30 -07:00
Dmytro Dzhulgakov	b93823cb65	Per-channel quantized tensor to have only a single axis (#26675 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26675 Based on offline poll, we're very unlikely to have multi-axis quantized tensors in the foreseeable future. Let's simplify API and just return int instead of list. It also matches the singular `axis` name. Test Plan: Imported from OSS Differential Revision: D17537052 Pulled By: dzhulgakov fbshipit-source-id: 676abc3b251d288468aaed467b5e5ca4063b98b0	2019-09-23 22:29:01 -07:00
Dmytro Dzhulgakov	9aad4d7b5f	Fix _empty_per_channel_affine_quantized to be less hacky (#26243 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26243 This is an attempt to fix _empty_per_channel_affine_quantized to be more sane. It's a factory function that nevertheless receives a Tensor argument, and that throws the codegen off course. Before, people used a hacky workaround of appending _like to the function name to trick codegen, which also required an unnatural argument order. This PR explicitly allows overriding the 'category' of the function to make codegen do the right thing. Now the name and the argument order (in C++) make more sense. Test Plan: Imported from OSS Differential Revision: D17443221 Pulled By: dzhulgakov fbshipit-source-id: c98c1c74473d8cbf637f511d26ceb949d8ae2a1a	2019-09-23 22:28:58 -07:00
Dmytro Dzhulgakov	ebc2365fd3	Serialization for per channel qtensor (#26339 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26339 Serializes per-channel tensor in both torch.serialization and jit. Since we didn't bind Quantizer properly yet, I chose to save a tuple representing quantizer settings. To avoid recursive tensor serialization calls, I'm using tuple instead of tensor to store scales and zero points. driazati - please check the serialization logic. Is there a good test that compares that JIT serialization and python serialization are equivalent? (I haven't tested it yet) Test Plan: Imported from OSS Differential Revision: D17443222 Pulled By: dzhulgakov fbshipit-source-id: a34758de1ffd2ec1cdc5355f5baf95284a4ccf4b	2019-09-23 13:28:11 -07:00
Jan Schlüter	0bc90194fb	Catch and print exception traceback in parallel_apply() workers (#18055 ) Summary: When an exception occurs in one of the modules passed to `parallel_apply()`, it is caught and re-raised in the main thread. This preserves the original exception type and message, but has the traceback point at the position where it's re-raised, rather than the original point of failure. This PR saves the exception information required to generate the traceback, and includes the original traceback in the message of the exception raised in the main thread. Before: ``` ... File ".../torch/nn/parallel/data_parallel.py", line 153, in parallel_apply return parallel_apply(replicas, inputs, kwargs, self.device_ids[:len(replicas)]) File ".../torch/nn/parallel/parallel_apply.py", line 84, in parallel_apply raise output RuntimeError: expected type torch.FloatTensor but got torch.cuda.FloatTensor ``` After: ``` ... File ".../torch/nn/parallel/data_parallel.py", line 153, in parallel_apply return parallel_apply(replicas, inputs, kwargs, self.device_ids[:len(replicas)]) File ".../torch/nn/parallel/parallel_apply.py", line 88, in parallel_apply ''.join(traceback.format_exception(*exc_info))) RuntimeError: Caught exception in replica 0. Original traceback and message: Traceback (most recent call last): ... File "../models/foo.py", line 319, in bar baz = asdf / ghij[:, np.newaxis] RuntimeError: expected type torch.FloatTensor but got torch.cuda.FloatTensor ``` I took care to raise an exception of the original type (in case the main code checks for that), but replaced the message. It helped me find a bug that did not occur outside `data_parallel()`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18055 Differential Revision: D16444972 Pulled By: zhangguanheng66 fbshipit-source-id: ec436c9d4677fad18106a8046cfa835a20a101ce	2019-07-26 11:41:22 -07:00
Jerry Zhang	277bf69fa0	Add torch.load/torch.save for QTensor (#20830 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20830 att Reviewed By: dzhulgakov Differential Revision: D15340701 fbshipit-source-id: 677038c8101f66dec4856c2eccf9f9e394012226	2019-05-30 20:52:19 -07:00
Wanchao Liang	eabd9eac2a	flake8 fix Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18810 Differential Revision: D14758293 Pulled By: wanchaol fbshipit-source-id: 975abe4fc5dc0dc4d43af61ec0f987e2c5670874	2019-04-03 14:14:18 -07:00
Gregory Chanan	a3da3653eb	Use non-legacy constructors for tensor deserialization. (#18750 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18750 ghimport-source-id: f1475cfb67841c41d9867d4429ba9125d5c7dd07 Stack from [ghstack](https://github.com/ezyang/ghstack): * #18751 Disallow changing the device of a tensor via set_. * #18750 Use non-legacy constructors for tensor deserialization. * #18749 Add device and dtype to storage. Deserialization currently uses legacy constructors. This is bad because we need to maintain them, but there is a more immediate problem: 1) We are trying to implement device caching on TensorImpl to get rid of a virtual dispatch 2) This doesn't work if one is able to change the device of a Tensor underlying a Variable. 3) Deserialization does 2) So the plan is to change deserialization, then enforce that we don't change the device out from underneath a Variable. Differential Revision: D14729513 fbshipit-source-id: 090d6cdb375b94dc1bf4f554b2df243952b8cdc6	2019-04-03 07:54:11 -07:00
Thomas Viehmann	6a6983ed7f	create type hint stub files for module torch (#12500 ) Summary: We have: - This is an initial stab at creating a type stub `torch/__init__.pyi` . - This is only tested on Python 3, since that's the only Python version mypy works on. - So far, we only aim at doing this for torch functions and torch.Tensor. - Quite a few methods and functions have to be typed manually. These are done in `torch/__init__.pyi.in` For me, PyCharm (the non-paid one) didn't seem to indicate errors in the .pyi when opening and seemed to be able to get the type hint for the few functions I tried, but I don't use PyCharm for my usual PyTorch activities, so I didn't extensively try this out. An example of a generated PYI is at [this gist](https://gist.github.com/ezyang/bf9b6a5fa8827c52152858169bcb61b1). Pull Request resolved: https://github.com/pytorch/pytorch/pull/12500 Differential Revision: D13695553 Pulled By: ezyang fbshipit-source-id: 4566c71913ede4e4c23ebc4a72c17151f94e8e21	2019-01-29 12:14:17 -08:00
Edward Yang	3bfa7258b3	Don't serialize hooks (#11705 ) Summary: Fixes #11683. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/11705 Differential Revision: D9833057 Pulled By: ezyang fbshipit-source-id: 18af9bcd77b088326738d567100fbe4a4c869dd6	2018-10-16 20:11:03 -07:00
Peter Goldsborough	fb4e8088f3	Remove methods that start with an underscore from at::Tensor (#11152 ) Summary: This PR cleans up the `at::Tensor` class by removing all methods that start with an underscore in favor of functions in the `at::` namespace. This greatly cleans up the `Tensor` class and makes it clearer what is the public and non-public API. For this I changed `native_functions.yaml` and `Declarations.cwrap` to make all underscore methods `variant: function` (or add such a statement to begin with), and then fixed all code locations using the underscore methods. ezyang colesbury gchanan Pull Request resolved: https://github.com/pytorch/pytorch/pull/11152 Differential Revision: D9683607 Pulled By: goldsborough fbshipit-source-id: 97f869f788fa56639c05a439e2a33be49f10f543	2018-09-07 11:55:11 -07:00
Marcin Elantkowski	4d28b65fb8	fix serialization of nn.Parameter with dill (#10296 ) Summary: Should resolve #9981. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10296 Differential Revision: D9196353 Pulled By: soumith fbshipit-source-id: 109b6da42b7240cdbc7a0586745c735bce5e1279	2018-09-01 23:55:40 -07:00
Richard Zou	1709484a40	Restore tensor.type, tensor.type_as docs (#5746 )	2018-03-14 17:59:31 -04:00
Richard Zou	439aae7e94	Add tensor.repeat docs. Remove legacy tensor repeat function. (#5666 ) * Add tensor.repeat docs. Remove legacy tensor repeat function. * Fix nit	2018-03-09 23:51:47 -05:00
gchanan	6ab33a820c	Support type conversion via type(dtype). (#5441 ) * Support type conversion via type(dtype). * Merge overloads.	2018-02-28 13:05:38 -05:00
Sam Gross	30ec06c140	Merge Variable and Tensor classes (#5225 ) This replaces the torch.Tensor constructors with factories that produce Variables. Similarly, functions on the torch module (e.g. torch.randn) now return Variables. To keep the PR to a reasonable size, I've left most of the unused tensor code. Subsequent PRs will remove the dead code, clean-up calls to torch.autograd.Variable, and rename Variable to Tensor everywhere. There are some breaking changes because Variable and Tensors had slightly different semantics. There's a list of those changes here: https://github.com/pytorch/pytorch/wiki/Breaking-Changes-from-Variable-and-Tensor-merge	2018-02-23 18:03:31 -05:00
Sam Gross	895aebac08	Use Variable instead of Tensor in Function.forward (#4786 ) The Tensor and Variable classes are being merged. autograd.Function.forward is now called on Variables, but with "no-grad" mode (torch.no_grad()) enabled. One benefit is that we no longer have to explicitly track shared storages.	2018-02-06 17:24:27 -05:00
Peter Goldsborough	86fd5fd524	Replace async with non_blocking for Python 3.7 (#4999 ) * Replace async with non_blocking for Python 3.7 upgrade * Remove trailing whitespace * Give _cuda and _type kwargs and accept async for compatibility * Rename async to non_blocking in all C++ code * Add entries for async in python_variable_methods * Friendlier backward compatibility for cuda and type	2018-02-02 09:23:51 -05:00
Sam Gross	720c7b1e2c	Move repeat to torch/_utils.py (#4712 ) This moves the implementation of repeat to _utils so that the autograd function can call it directly instead of relying on forward being called on tensors. This also removes _range, which was previously necessary because we shadowed the built-in range() function.	2018-01-17 17:30:43 -05:00
Natalia Gimelshein	ea28deee75	use torch.cat in _flatten	2017-11-29 10:54:57 +01:00
Ozan Çağlayan	dd6d04ddf2	doc: Normalize all true/false in docstrings to ``True|False`` (#3593 ) * doc: Normalize all true/false in docstrings to ``True|False`` This makes them more apparent in the documentation. * doc: fix flake8	2017-11-09 08:12:29 -05:00
SsnL	fa5efab669	comments and case where not all sparse (#3370 )	2017-11-01 06:05:17 -04:00
SsnL	01be4d6b20	sparse broadcast_coalesce and reduce_add_coalesced	2017-10-28 18:52:35 -04:00
SsnL	3a0aee71f3	fix sparse tensor .cpu()	2017-10-28 18:52:35 -04:00
Leonid Vlasenkov	46a868dab7	[Ready] Limit docs line length (#1900 ) * some docs are ready * docs * docs * fix some more * fix some more	2017-07-10 10:24:54 -04:00
Gregory Chanan	bb3779efe8	Add broadcasting to masked_select.	2017-06-24 09:45:21 -04:00
Adam Paszke	12813b88f6	Add DistributedDataParallel	2017-06-12 22:00:22 -04:00
Kai Arulkumaran	ddf6328990	Document type function returns type with no args (#1719 )	2017-06-05 11:54:55 -04:00
Edward Z. Yang	743e4894d2	Prefix values/indices/sparse_mask/nnz with underscore (#1457 ) As discussed in #1441. I also added some docs giving clear guidance about coalescing in sparse tensors. Signed-off-by: Edward Z. Yang <ezyang@fb.com>	2017-05-03 11:14:10 -04:00
Martin Raison	01d84c5f9d	revert sparse cuda index type change	2017-04-18 12:46:54 -07:00
Martin Raison	88b42324e7	spcadd, sparseMask, cadd, csub, cmul + tests	2017-04-18 12:46:54 -07:00
Sam Gross	c4d1318662	Fix map_location in torch.load (#1006 )	2017-03-15 16:54:19 -04:00
Martin Raison	f17cfe4293	sparse tensor operations (#735 )	2017-03-03 18:37:03 +01:00
Sam Gross	6464e69e21	Docs for torch.Storage (#475 )	2017-01-18 03:22:30 -05:00
Sam Gross	d951d5b1cd	Fix tensor.cuda(0) when on non-zero device. (#472 )	2017-01-18 01:08:37 -05:00
Adam Paszke	0325e2f646	Major autograd refactor Improves autograd performance by more than 2x and fixes a couple of bugs. All core functions have been moved to C.	2016-10-13 17:17:49 -07:00
Sam Gross	2bc9da4f5e	Support "device" keyword argument (#79 ) Adds the optional "device" keyword argument to Tensor and Storage constructors and .new methods.	2016-10-01 19:32:55 -04:00
Adam Paszke	e034f258e3	Fix ffi utils in Python 2.7	2016-10-01 15:37:05 -07:00
Adam Paszke	11b38a6895	Add more functions to autograd	2016-09-30 16:37:07 -04:00
Sam Gross	cb5d4e836f	Lazy load CUDA and THNN modules (#64 )	2016-09-28 19:29:53 -04:00
Adam Paszke	3eac7164f4	Add data parallel functions to nn	2016-09-27 15:45:45 -07:00
Adam Paszke	8fdec15a55	Codemod to remove camel case method naming	2016-09-20 08:40:28 -07:00
Adam Paszke	da5bb373e6	Type conversions now use auto gpu	2016-09-15 18:48:27 -07:00

49 Commits
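
Commit 0bc90194fb above describes catching an exception in a `parallel_apply()` worker and re-raising it in the main thread with the original traceback included in the message. The pattern can be sketched roughly as follows. This is a minimal illustration using plain threads, not the PyTorch implementation: `run_in_workers` and `_WorkerFailure` are hypothetical names, and the sketch assumes the exception type can be constructed from a single message string.

```python
import sys
import threading
import traceback


class _WorkerFailure:
    """Hypothetical helper: records an exception raised inside a worker."""

    def __init__(self, index):
        exc_type, exc_value, tb = sys.exc_info()
        self.index = index
        self.exc_type = exc_type
        # Format inside the worker, while the original traceback is available.
        self.text = "".join(traceback.format_exception(exc_type, exc_value, tb))


def run_in_workers(funcs):
    """Run each callable in its own thread and re-raise the first failure."""
    results = [None] * len(funcs)

    def worker(i, fn):
        try:
            results[i] = fn()
        except Exception:
            results[i] = _WorkerFailure(i)

    threads = [threading.Thread(target=worker, args=(i, fn))
               for i, fn in enumerate(funcs)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    for out in results:
        if isinstance(out, _WorkerFailure):
            msg = ("Caught exception in replica {}. "
                   "Original traceback and message:\n{}").format(out.index, out.text)
            # Keep the original exception type, but replace the message.
            # Assumption: the type accepts a single string argument.
            raise out.exc_type(msg)
    return results
```

Callers that check for a specific exception type keep working, because the re-raised exception has the same type; only the message changes to carry the worker's traceback text.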