pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 00:21:07 +01:00

Author	SHA1	Message	Date
Akifumi Imanishi	b3fda95fe7	Add LazyBatchNormXd (#51862 ) Summary: Same diff with https://github.com/pytorch/pytorch/issues/51548 (cc. albanD) Pull Request resolved: https://github.com/pytorch/pytorch/pull/51862 Reviewed By: izdeby Differential Revision: D26312289 Pulled By: albanD fbshipit-source-id: 9cdec0e0c9021c33d10d85010978c7fa5cb4dc60	2021-02-09 10:29:03 -08:00
Alban Desmaison	a930162c69	Revert D26276903: [pytorch][PR] Add LazyBatchNormXd Test Plan: revert-hammer Differential Revision: D26276903 (`aa1fd6b45a`) Original commit changeset: 0ac706974178 fbshipit-source-id: bfe01b01cd460f1e2845ea5ef1fc1514e6b6ba54	2021-02-05 12:37:29 -08:00
Akifumi Imanishi	aa1fd6b45a	Add LazyBatchNormXd (#51548 ) Summary: This PR implements UninitializedBuffer and LazyBatchnormXd based on https://github.com/pytorch/pytorch/issues/44538. (cc. emcastillo and albanD) Pull Request resolved: https://github.com/pytorch/pytorch/pull/51548 Reviewed By: zhangguanheng66 Differential Revision: D26276903 Pulled By: albanD fbshipit-source-id: 0ac706974178363f8af075e59b41d5989418922f	2021-02-05 10:27:04 -08:00
Nikita Shulga	bf4fcab681	Fix SyncBatchNorm usage without stats tracking (#50126 ) Summary: In `batch_norm_gather_stats_with_counts_cuda` use `input.scalar_type()` if `running_mean` is not defined In `SyncBatchNorm` forward function create count tensor with `torch.float32` type if `running_mean` is None Fix a few typos Pull Request resolved: https://github.com/pytorch/pytorch/pull/50126 Test Plan: ``` python -c "import torch;print(torch.batch_norm_gather_stats_with_counts( torch.randn(1, 3, 3, 3, device='cuda'), mean = torch.ones(2, 3, device='cuda'), invstd = torch.ones(2, 3, device='cuda'), running_mean = None, running_var = None , momentum = .1, eps = 1e-5, counts = torch.ones(2, device='cuda')))" ``` Fixes https://github.com/pytorch/pytorch/issues/49730 Reviewed By: ngimel Differential Revision: D25797930 Pulled By: malfet fbshipit-source-id: 22a91e3969b5e9bbb7969d9cc70b45013a42fe83	2021-01-07 18:31:13 -08:00
Rohan Varma	c0a0845019	Improve new_group example in the context of SyncBatchNorm (#48897 ) Summary: Closes https://github.com/pytorch/pytorch/issues/48804 Improves some documentation/example in SyncBN docs to clearly show that each rank must call into all `new_group()` calls for creating process subgroups, even if they are not going to be part of that particular subgroup. We then pick the right group, i.e. the group that the rank is part of, and pass that into the SyncBN APIs. Doc rendering: <img width="786" alt="syncbn_update" src="https://user-images.githubusercontent.com/8039770/101271959-b211ab80-373c-11eb-8b6d-d56483fd9f5d.png"> Pull Request resolved: https://github.com/pytorch/pytorch/pull/48897 Reviewed By: zou3519 Differential Revision: D25493181 Pulled By: rohan-varma fbshipit-source-id: a7e93fc8cc07ec7797e5dbc356f1c3877342cfa3	2020-12-11 10:28:08 -08:00
Guilherme Leobas	9b52654620	annotate a few torch.nn.modules.* modules (#45772 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/45771 Pull Request resolved: https://github.com/pytorch/pytorch/pull/45772 Reviewed By: mruberry Differential Revision: D24682013 Pulled By: albanD fbshipit-source-id: e32bc4fe9c586c079f7070924a874c70f3d127fa	2020-11-02 13:04:59 -08:00
Vasiliy Kuznetsov	bdf329ef8a	SyncBN: preserve qconfig if it exists (#45317 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45317 Eager mode quantization depends on the presence of the `config` model attribute. Currently converting a model to use `SyncBatchNorm` removes the qconfig - fixing this. This is important if a BN is not fused to anything during quantization convert. Test Plan: ``` python test/test_quantization.py TestDistributed.test_syncbn_preserves_qconfig ``` Imported from OSS Reviewed By: jerryzh168 Differential Revision: D23922072 fbshipit-source-id: cc1bc25c8e5243abb924c6889f78cf65a81be158	2020-09-24 22:52:07 -07:00
Lin.Sung	f77ba0e48c	Change typo 'momemtum' to 'momentum' (#45045 ) Summary: As the title. Pull Request resolved: https://github.com/pytorch/pytorch/pull/45045 Reviewed By: mruberry Differential Revision: D23808563 Pulled By: mrshenli fbshipit-source-id: ca818377f4c23d67b037c146fef667ab8731961e	2020-09-21 19:03:26 -07:00
Xiang Gao	20ac736200	Remove py2 compatible future imports (#44735 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/44735 Reviewed By: mruberry Differential Revision: D23731306 Pulled By: ezyang fbshipit-source-id: 0ba009a99e475ddbe22981be8ac636f8a1c8b02f	2020-09-16 12:55:57 -07:00
Tongzhou Wang	09892de815	Clarify track_running_stats docs; Make SyncBatchNorm track_running_stats behavior consistent (#44445 ) Summary: context: https://github.com/pytorch/pytorch/pull/38084 Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/44445 Reviewed By: colesbury Differential Revision: D23634216 Pulled By: mrshenli fbshipit-source-id: d1242c694dec0e7794651f8031327625eb9989ee	2020-09-11 08:20:34 -07:00
F-G Fernandez	881c1adfcd	Fixed buffer update in BatchNorm when track_running_stats is set to False (#38084 ) Summary: This PR aims at tackling https://github.com/pytorch/pytorch/issues/37823 by: - ensuring that buffers will be used for normalization computation but won't be updated, when buffers are not None, and `track_running_stats=False` - adding a corresponding unittest to ensure expected behaviour Any feedback is welcome! _Note: we might want to update the docstrings of `BatchNorm*d`, feel free to share any suggestion!_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/38084 Differential Revision: D22047871 Pulled By: ezyang fbshipit-source-id: 5acbcad9773e7901f26d625db71d43d7dc236d3e	2020-06-22 08:17:31 -07:00
Edward Yang	eace053398	Move all torch.nn.modules type annotations inline (#38211 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38211 Just because the annotations are inline doesn't mean the files type check; most of the newly annotated files have type errors and I added exclusions for them in mypy.ini. The payoff of moving all of these modules inline is I can delete the relevant code generation logic for the pyi files (which was added ignore annotations that weren't actually relevant anymore.) For the most part the translation was completely mechanical, but there were two hairy issues. First, I needed to work around a Python 3.6 and earlier bug where Generic has a nontrivial metaclass. This fix is in torch/jit/__init__.py. Second, module.py, we need to apply the same fix for avoiding contravariance checks that the pyi file used to have; this is done by declaring forward as a variable (rather than a function), which appears to be sufficient enough to get mypy to not contravariantly check input arguments. Because we aren't actually typechecking these modules in most cases, it is inevitable that some of these type annotations are wrong. I slavishly copied the old annotations from the pyi files unless there was an obvious correction I could make. These annotations will probably need fixing up later. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Differential Revision: D21497397 Pulled By: ezyang fbshipit-source-id: 2b08bacc152c48f074e7edc4ee5dce1b77d83702	2020-06-11 15:59:57 -07:00
Antoine Broyelle	bfa76ff407	[Doc] Clarify that variance estimor is biaised for normalization layers (#39752 ) Summary: Closes https://github.com/pytorch/pytorch/issues/39330 Pull Request resolved: https://github.com/pytorch/pytorch/pull/39752 Differential Revision: D21980097 Pulled By: ngimel fbshipit-source-id: 2bdcb8bf8194768985f5a8787712d215c0c5c1ec	2020-06-10 14:44:44 -07:00
Tongzhou Wang	d1cdf1fd56	update convert_sync_batchnorm docs (#39646 ) Summary: fix some inaccuracies Pull Request resolved: https://github.com/pytorch/pytorch/pull/39646 Differential Revision: D21930023 Pulled By: mrshenli fbshipit-source-id: 9c6b8eeefeb0482a6ae7f825ae055090ce589223	2020-06-08 18:42:42 -07:00
Tongzhou Wang	dfc4be205e	Fix broken reference in sync bn doc (#38890 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38890 Differential Revision: D21722162 Pulled By: ezyang fbshipit-source-id: a7d18239917b2886fe8c1c0aaf42fc8491c8e10c	2020-05-27 11:30:48 -07:00
jiej	5b8a79ab49	fix the device inconsistency for import convert_sync_batchnorm (#38729 ) Summary: This fixes the device inconsistency reported in https://github.com/pytorch/pytorch/issues/37930 Pull Request resolved: https://github.com/pytorch/pytorch/pull/38729 Differential Revision: D21671039 Pulled By: ngimel fbshipit-source-id: 17fdb4eae2ddaf64560dd026fe39958536ab313f	2020-05-20 15:42:53 -07:00
Tongzhou Wang	44cead3a31	Improve syncbn doc format (#38423 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38423 Differential Revision: D21601342 Pulled By: jerryzh168 fbshipit-source-id: dd2bf012831025495e9ece3db08536dd1d515645	2020-05-18 11:52:07 -07:00
Edward Yang	4fef3763dd	Revert "Revert D21337640: [pytorch][PR] Split up documentation into subpages and clean up some warnings" (#37778 ) Summary: Original PR: https://github.com/pytorch/pytorch/pull/37419 cc mattip suo Pull Request resolved: https://github.com/pytorch/pytorch/pull/37778 Differential Revision: D21385774 Pulled By: ezyang fbshipit-source-id: 5de532faab8bae132736b6b5189e0ee2ac9935be	2020-05-04 14:32:35 -07:00
Michael Suo	20f7e62b1d	Revert D21337640: [pytorch][PR] Split up documentation into subpages and clean up some warnings Test Plan: revert-hammer Differential Revision: D21337640 Original commit changeset: d4ad198780c3 fbshipit-source-id: fa9ba6ac542173a50bdb45bfa12f3fec0ed704fb	2020-05-04 10:57:55 -07:00
mattip	f10fbcc820	Split up documentation into subpages and clean up some warnings (#37419 ) Summary: xref gh-32838, gh-34032 This is a major refactor of parts of the documentation to split it up using sphinx's `autosummary` feature which will build out `autofuction` and `autoclass` stub files and link to them. The end result is that the top module pages like torch.nn.rst and torch.rst are now more like table-of-contents to the actual single-class or single-function documentations pages. Along the way, I modified many of the docstrings to eliminate sphinx warnings when building. I think the only thing I changed from a non-documentation perspective is to add names to `__all__` when adding them to `globals()` in `torch.__init__.py` I do not know the CI system: are the documentation build artifacts available after the build, so reviewers can preview before merging? Pull Request resolved: https://github.com/pytorch/pytorch/pull/37419 Differential Revision: D21337640 Pulled By: ezyang fbshipit-source-id: d4ad198780c3ae7a96a9f22651e00ff2d31a0c0f	2020-05-04 09:39:22 -07:00
hello@nicklashansen.com	d3a0bdd06b	proofreading (#29797 ) Summary: two instances of if -> it in torch.nn.modules.batchnorm.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/29797 Differential Revision: D19698613 Pulled By: ezyang fbshipit-source-id: 7312b2333f227113e904dfa91db90d00e525affb	2020-02-04 14:30:36 -08:00
Michael Suo	3552be1090	[jit] fix the NoneType param/buffer hack (#32745 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/32745 Some parameters (like `bias` in conv) are optional. To achieve this previously, you had to add `bias` as a constant, which would invoke some pretty weird behavior in the frontend, summarized as: ``` if bias is not None: add it as a parameter normally else: # bias is None add it as a constant with the value None ``` There are several things bad about this: 1. Bias is not a constant. Marking it `__constants__` is confusing. 2. It basically relies on an implementation detail (the frontend processes parameters before constants) to work. Okay, whatever. I don't even know why we did this originally, but getting rid of it doesn't break anything, so I assume improved NoneType refinement has made this a non-issue. Note on perf: this will make no difference; if bias was `None` it's still folded out today, if bias is a Tensor it would be added as a parameter both before and after this change Test Plan: Imported from OSS Differential Revision: D19628634 Pulled By: suo fbshipit-source-id: d9128a09c5d096b938fcf567b8c23b09ac9ab37f	2020-01-29 17:04:39 -08:00
Alban Desmaison	81048c41ab	remove simple .data from torch/nn Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/31482 Test Plan: Imported from OSS Differential Revision: D19303243 Pulled By: albanD fbshipit-source-id: 5afdfeb4b8382c09b9ec65acd545148ed76d4285	2020-01-15 12:40:38 -08:00
Peter Bell	37ca5a8a64	convert_sync_batchnorm should not convert _InstanceNorm instances (#29985 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/29187 This introduces a new class `_NormBase` that `_InstanceNorm` and `_BatchNorm` inherit from separately. This means the `isinstance(module, _BatchNorm)` check won't falsely pass for `_InstanceNorm`. The suggested fix of adding `and not isinstance(module, _InstanceNorm)` works as well, but requires introducing a cyclic dependency between `instancenorm.py` and `batchnorm.py`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/29985 Differential Revision: D18588104 Pulled By: yf225 fbshipit-source-id: f599da3b902ad9c56836db4d429bfc462ed51338	2019-11-19 09:39:36 -08:00
jiej	9c7e604c60	SyncBatchNorm Update on input dimension checks (#29626 ) Summary: update the requirements on input dimensions for `torch.nn.SyncBatchNorm`: 1. 2D inputs is now permissible, https://github.com/pytorch/pytorch/issues/20204 ; 2. requires at least two element along normalization plane (BatchNorm behavior); Pull Request resolved: https://github.com/pytorch/pytorch/pull/29626 Differential Revision: D18492531 Pulled By: albanD fbshipit-source-id: f008e46a2d520d73c3c2730890a7424eba2ede9e	2019-11-18 16:09:51 -08:00
Igor Fedan	9dcf5191d5	explicitly provide memory format when calling to clone() at batchnorm.py Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/28689 Test Plan: Imported from OSS Differential Revision: D18333368 Pulled By: ifedan fbshipit-source-id: e440c80ce8a64e1aae709cd935b14c7024a17787	2019-11-07 06:42:14 -08:00
jiej	0af60a5c06	(#27299 ) Summary: Removing in-place operator for num_batches_tracked increment. The in-place operator used here turns out to block many optimization opportunities due to alias assumption for inputs. Pull Request resolved: https://github.com/pytorch/pytorch/pull/27299 Differential Revision: D17909341 Pulled By: ngimel fbshipit-source-id: 7d635be94dfd2002af435acf6ea71995adaa40f6	2019-10-14 17:48:27 -07:00
Tongzhou Wang	eeaef217b3	Eliminate outdated comments Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/26933 Differential Revision: D17685153 Pulled By: ezyang fbshipit-source-id: e402a12dc9a172649f153903a3a7834004b5667a	2019-10-02 08:01:49 -07:00
Spandan Tiwari	8986b9e38d	Momentum setting in SyncBatchNorm forward (inference) pass. (#24995 ) Summary: This is a fix for a potential ONNX export issue with SyncBatchNorm where irrespective of the value of momentum, the value for momentum in ONNX BN node is always 0. The details are captured in https://github.com/pytorch/pytorch/issues/18525. The fix in this PR for `SyncBatchNorm` is very similar to the fix that went in https://github.com/pytorch/pytorch/pull/18764 for `BatchNorm` (I think this site was just missed). Please note that there are no ONNX test points added for this, because SyncBatchNorm works exclusively with tensors on GPU and the ONNX test passes are CPU only. If there's a way to add a test point, please let me know. Pull Request resolved: https://github.com/pytorch/pytorch/pull/24995 Differential Revision: D17085570 Pulled By: dzhulgakov fbshipit-source-id: 162d428673c269efca4360fb103854b7319ec204	2019-08-29 23:16:46 -07:00
Yuxin Wu	927fb56ee0	Allow SyncBatchNorm without DDP in inference mode (#24815 ) Summary: Fix https://github.com/pytorch/pytorch/issues/22538 Pull Request resolved: https://github.com/pytorch/pytorch/pull/24815 Test Plan: Can run a detectron2 evaluation without entering DDP. #sandcastle Differential Revision: D16883694 Pulled By: ppwwyyxx fbshipit-source-id: 3195bc4e7f43a994821069f229b26302e2988739	2019-08-19 13:43:42 -07:00
Zhi Tian	6eb3969ac7	keep reuqires_grad unchanged after converting bn to syncbn (#22569 ) Summary: After converting BN layers to SyncBN layers, the function will set all `requires_grad = True` regardless of the original requires_grad states. I think it is a bug and have fixed it in this PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22569 Differential Revision: D16151647 Pulled By: zou3519 fbshipit-source-id: e2ad1886c94d8882485e7fb8be51ad76469ecc67	2019-07-10 08:38:04 -07:00
David Riazati	10c4b98ade	Remove weak script (#22212 ) Summary: * Deletes all weak script decorators / associated data structures / methods * In order to keep supporting the standard library in script, this enables recursive script on any function defined in `torch.nn` * Most changes in `torch/nn` are the result of `ag -Q "weak" torch/nn/ -l \| xargs sed -i '/weak/d'`, only `rnn.py` needed manual editing to use the `ignore` and `export` to continue supporting the overloaded `forward` methods * `Sequential`/`ModuleList` no longer need to be added to constants since they are compiled on demand This should also fix https://github.com/pytorch/pytorch/issues/22212 Pull Request resolved: https://github.com/pytorch/pytorch/pull/22212 Differential Revision: D15988346 Pulled By: driazati fbshipit-source-id: af223e3ad0580be895377312949997a70e988e4f	2019-07-03 17:28:25 -07:00
Kaixhin	c60465873c	Fix batch norm multiplier init (#13774 ) Summary: Fixes #12259, needs to make sure tests (see #13766) don't break due to numerical precision issues. Not sure what would need to be adjusted here... Pull Request resolved: https://github.com/pytorch/pytorch/pull/13774 Differential Revision: D15715021 Pulled By: ezyang fbshipit-source-id: 20ce2beee1b39ebe9f023c5f2b25be53acccb5f3	2019-06-07 07:50:39 -07:00
davidriazati	736bf7b46c	Fix __constants__ for some nn modules (#21071 ) Summary: A bunch of modules were missing entries for `__constants__` which was making their `__repr__`s not work. Others had `__constants__` that were not necessary since it was provided by some parent class instead. Fixes #20978 ](https://our.intern.facebook.com/intern/diff/15539518/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/21071 Pulled By: driazati Differential Revision: D15539518 fbshipit-source-id: 24bdd1ef41ef636eefd5d2bad4ab2d79646ed4f0	2019-05-29 13:55:53 -07:00
Soumith Chintala	6e76813a39	fix SyncBatchNorm doc (#20991 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/19265 Pull Request resolved: https://github.com/pytorch/pytorch/pull/20991 Differential Revision: D15513518 Pulled By: soumith fbshipit-source-id: 9618c0b2442e013e4d37793cdb04cb4f4b1b141c	2019-05-27 14:46:58 -07:00
Zhang Liliang	f7a7868820	add process_group in convert_sync_batchnorm (#19240 ) Summary: In line 508. convert_sync_batchnorm is called recursively to convert the bn to syncbn, thus the process_group also should be passed in the function. Pull Request resolved: https://github.com/pytorch/pytorch/pull/19240 Differential Revision: D15240318 Pulled By: ezyang fbshipit-source-id: 0fc9e856392824814991e5e9e8f9513d57f311af	2019-05-07 06:51:18 -07:00
Spandan Tiwari	df05c7fbac	Fix momentum setting in BatchNorm forward pass. (#18764 ) Summary: This is a fix for issue https://github.com/pytorch/pytorch/issues/18525. The issue is related not only to ONNX export, but can manifest in other scenarios. An existing test point in test/onnx/test_operators.py has been updated to cover this scenario as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18764 Reviewed By: zrphercule Differential Revision: D14735166 Pulled By: houseroad fbshipit-source-id: 5a737c648f64355929ff31eb12bd4869e744768d	2019-04-08 16:30:00 -07:00
Arunava	79533ef097	convert_sync_batch_norm to SyncBatchNorm (#18787 ) Summary: Closes #18382 Please let me know if any changes are required. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18787 Differential Revision: D14821147 Pulled By: soumith fbshipit-source-id: edd98eab1b3f4151c4ae5148146435ddb2ae678d	2019-04-07 00:13:02 -07:00
jiej	39669316a6	(#14267 ) Summary: - Summary: Added synchronized batch normalization, allows synchronization of stats across mini-batches between processes within a process group. Current implementation uses a mixture of extended ATen native functions (cpp cuda extension) + torch.nn.modules (c10d python API) - User-facing api: 1. torch.nn.utils.convert_sync_batchnorm(modules, process_group=None) 2. torch.nn.SyncBatchNorm(num_features, eps=1e-5, momentum=0.1, affine=True, track_running_stats=True, *process_group=None) - supported use case: DistributedDataParallel with single-gpu multi-process* a. User creates model containing `torch.nn.SyncBatchNorm` layers through one of the ways listed below: 1. use layers directly: torch.nn.SyncBatchNorm(...) similar API as with torch.nn.BatchNormXd(...) with added argument `process_group` which is used to limit the scope of synchronization within each process group. Default value is None, which implies synchronization across all GPUs 2. use torch.nn.utils.convert_sync_batchnorm(modules, process_group) recursively convert all `torch.nn.BatchNormXd` into `torch.nn.SyncBatchNorm` preserving values of parameters/buffers. the utility function also allows user to specify process_group value to all converted layers. b. user wraps their model with `torch.distributed.parallel.DataParallelDistributed`, from this point, user should follow the general guidelines for DDP use guide - Error checking For use cases not supported, we error out: 1. Application launched without ddp: > import torch > sbn = torch.nn.SyncBatchNorm(10).cuda() > inp = torch.randn(5, 10, 3, 3).cuda() > sbn(inp) --> Error! > AttributeError: SyncBatchNorm is only supported within torch.nn.parallel.DistributedDataParallel 2. Application launched using DDP with multi-GPU per-process: > ddp_module = nn.parallel.DistributedDataParallel(module, device_ids=device_ids, output_device=args.local_rank) > ValueError: SyncBatchNorm is only supported for DDP with single GPU per process Pull Request resolved: https://github.com/pytorch/pytorch/pull/14267 Differential Revision: D14270035 Pulled By: ezyang fbshipit-source-id: 4956d8fa565c32e9df5408d53719ff9f945f4d6d	2019-03-06 13:39:11 -08:00
James Webber	162ad94590	Fixed typo in batchnorm docstrings Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15975 Differential Revision: D13642271 Pulled By: soumith fbshipit-source-id: 60ffa392bf1f916f2b93c943bb44a642a9815c42	2019-01-11 17:28:37 -08:00
Adam Paszke	c79e305add	Don't DCE PythonOp Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14773 Reviewed By: eellison Differential Revision: D13327673 Pulled By: suo fbshipit-source-id: 236db3407c7eacac470530836e3d4d0dc323110c	2018-12-04 21:37:36 -08:00
Wanchao Liang	d872af9282	Add tests for dropout/batchnorm train/eval, remove training constants (#14780 ) Summary: This PR: 1. add tests for batchnorm/dropout for train/eval parameter mutatino 2. remove training constants from all our standard library Pull Request resolved: https://github.com/pytorch/pytorch/pull/14780 Differential Revision: D13331578 Pulled By: wanchaol fbshipit-source-id: d92ca3ce38cc2888688d50fe015e3e22539a20a5	2018-12-04 18:17:43 -08:00
David Riazati	c3bfa0e52b	BatchNorm support not tracking stats Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14764 Differential Revision: D13325800 Pulled By: driazati fbshipit-source-id: a3e4773dc31b83565e7a4de33614d6efd4a12de9	2018-12-04 15:11:53 -08:00
Wanchao Liang	119f9ec291	enable NoneValue parameter assignment for WeakScriptModule (#14715 ) Summary: This PR: 1. Handle None value attr in the WeakScriptModuleProxy 2. add back module tests that now passing Pull Request resolved: https://github.com/pytorch/pytorch/pull/14715 Differential Revision: D13313573 Pulled By: wanchaol fbshipit-source-id: a6b7892707350290a6d69b6f6270ad089bfc954b	2018-12-03 20:40:55 -08:00
Wanchao Liang	d6bfc53b9e	Export BatchNorm functional and module, add necessary JIT support (#14016 ) Summary: This PR did three things: 1. It export the BatchNorm functional and module, and rewrite some of the components to stay align with the current supported JIT features 2. In the process of export, add necessary compiler support for in_place op aug assign 4. change the test_jit behavior in add_module_test to utilize a single rng state during module initialization Pull Request resolved: https://github.com/pytorch/pytorch/pull/14016 Differential Revision: D13112064 Pulled By: wanchaol fbshipit-source-id: 31e3aee5fbb509673c781e7dbb6d8884cfa55d91	2018-11-20 14:15:06 -08:00
Junjie Bai	4484f67b47	Revert D10203439: [pytorch][PR] Fix batch norm multiplier init Differential Revision: D10203439 Original commit changeset: 999cc134a45e fbshipit-source-id: 7871e384063db2f3788169338e9c965d5f8ac351	2018-11-09 00:37:05 -08:00
Kaixhin	c9be135bb9	Fix batch norm multiplier init (#12325 ) Summary: Fixes #12259 Pull Request resolved: https://github.com/pytorch/pytorch/pull/12325 Differential Revision: D10203439 Pulled By: SsnL fbshipit-source-id: 999cc134a45e2554313adb7eb93ee98e1f84335f	2018-11-08 19:00:00 -08:00
Tongzhou Wang	2cd912bcc2	Fix more spectral norm bugs (#13350 ) Summary: Problems with SN and DP after #12671 : 1. in eval mode, `weight_orig` is not getting correct gradient #12737 . Fix: keep `v` vector around as a buffer and always calculate `W = W_orig / (u @ W_orig @ v)` even in eval. 2. in training mode, the `weight` buffer of the parallelized module is never updated, if someone touches `weight_orig` and/or `weight` and makes them not sharing storage. So in `eval` the weight used is wrong. Fix: Make `weight` not a buffer anymore and always calculate it as above. 3. #12671 changed SN to update `u` in-place to make DP work correctly, but then it breaks backward through two forwards (e.g., the common GAN loss `D(real) - D(fake)`) because the vectors needed to backprop the 1st forward is changed in the 2nd forward. Fix: This PR clones `u` and `v` before using them. To maintain BC, I added a hook interface for producing and loading state_dict. This is ugly and we should really have better interface for spectral_norm. But for the purpose to fix this issue, I make this patch. Even if we have a better interface, BC mechanism for legacy loading legacy state_dict still needs to be done. cc The controller you requested could not be found. crcrpar Pull Request resolved: https://github.com/pytorch/pytorch/pull/13350 Differential Revision: D12931044 Pulled By: SsnL fbshipit-source-id: 8be6f934eaa62414d76d2c644dedd7e1b7eb31ef	2018-11-06 19:16:13 -08:00
vishwakftw	f9a99d5504	Specify default initialization schemes for modules in docs (#9038 ) Summary: This closes #6906 . Reviewed By: ezyang Differential Revision: D8698632 Pulled By: weiyangfb fbshipit-source-id: 259c1dbdc264a8e9f83e196fa72d135babd97d48	2018-07-24 11:58:08 -07:00
Tongzhou Wang	623ae0c07c	Fix loading 0.4 BN checkpoints (#9004 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/8481 Closes https://github.com/pytorch/pytorch/pull/9004 Reviewed By: soumith Differential Revision: D8684017 Pulled By: SsnL fbshipit-source-id: 57820ad5f6b60795358c9447409a364a93ffa1d9	2018-07-01 22:24:21 -07:00

1 2

87 Commits