Summary:
I found a few sentences in the DataParallel docstring confusing, so I suggest this enhancement:
- Arbitrary arguments are allowed to be passed ... *INCLUDING* tensors (not *EXCLUDING* them).
- The original author said that "other types" are shallow-copied, but I think only some built-in types are actually (effectively) shallow-copied; genuinely "other types" are shared across replicas. Here is an example:
```python
import torch
from torch.nn import Module, DataParallel
from collections import deque
class MyModel(Module):
    def forward(self, x):
        x.append(None)

model = MyModel().cuda()
model = DataParallel(model)
d = deque()
model.forward(d)
print(d)  # deque([None]) on a single GPU: the deque was shared with the replica, not copied
```
As a side note: as far as I know, copying objects is not an especially frequent operation in Python, unlike in some other languages. Notably, no copying is involved in assignment or function parameter passing; both are only name bindings, which I guess is the whole point of Python's "everything is an object" philosophy. Keeping this in mind may help when dealing with things like multithreading.
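For instance (a minimal illustration, nothing PyTorch-specific):
```python
d = [1, 2]
alias = d           # assignment binds a new name to the same list object

def mutate(x):      # parameter passing likewise only binds a name
    x.append(3)

mutate(d)
print(alias)        # [1, 2, 3] -- every name sees the in-place mutation
```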
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15993
Differential Revision: D14020404
Pulled By: ezyang
fbshipit-source-id: a38689c94d0b8f77be70447f34962d3a7cd25e2e
Summary:
There were two problems with SN + DP:
1. In SN, the updated `_u` vector is saved back to the module via `setattr`. However, in DP everything runs on replicas, so those updates are lost.
2. In DP, the buffers are broadcast via `broadcast_coalesced`, so on the replicas they are all views. Therefore, the `detach_` call won't work.
Fixes are:
1. Update the `_u` vector in place, so that, through the storage shared between the first replica and the parallelized module, the update is retained (see the sketch after this list).
2. Do not call `detach_`.
3. Added comments in SN about the subtlety.
4. Added a note to the DP doc on this particular behavior of DP.
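To see why fix 1 works, here is a minimal sketch of the mechanism, assuming only that the first replica's buffer shares storage with the original module's buffer (the `Holder` class is a hypothetical stand-in for a module, not PyTorch's actual replication code):
```python
import torch

class Holder:
    """Hypothetical stand-in for a module carrying a _u buffer."""
    def __init__(self):
        self._u = torch.zeros(3)

original = Holder()
replica = Holder()
replica._u = original._u  # replica's buffer shares storage with the original

# setattr-style update: rebinds replica._u to a brand-new tensor,
# so the original module never sees the new value.
replica._u = torch.ones(3)
print(original._u)  # tensor([0., 0., 0.]) -- update lost

# In-place update: writes through the shared storage,
# so the original module observes it.
replica = Holder()
replica._u = original._u
replica._u.copy_(torch.ones(3))
print(original._u)  # tensor([1., 1., 1.]) -- update retained
```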
cc crcrpar taesung89 yaoshengfu
Fixes https://github.com/pytorch/pytorch/issues/11476
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12671
Differential Revision: D10410232
Pulled By: SsnL
fbshipit-source-id: c447951844a30366d8c196bf9436340e88f3b6d9
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
* Update doc of batch size requirements for DP
Fixes #5039
* Delete the recommendation for batch size
There's no significant speed difference between divisible and indivisible batch sizes.
* Add more detail to CUDA documentation
Also adds better cross-linking to the pages that discuss relevant topics.
* Adds recommendation to torch.save docs
* Make the version numbers for the docs dynamic
Might need tweaks for beta, 1.0, etc.
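A minimal sketch of what a dynamic version could look like in a Sphinx `conf.py` (an assumption about the approach, deriving the number from `torch.__version__` rather than hard-coding it; the actual change may differ):
```python
# docs/source/conf.py (sketch)
import torch

version = torch.__version__   # e.g. '0.4.1' -- short version string
release = version             # full release string shown in the docs
```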