Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56982
SyncBatchNorm should behave as a regular BN layer in eval mode; this
change ensures that this is the case.
In particular, the bug was that when `track_running_stats=False`, `bn_training` would be set to True even in eval mode, which would trigger a collective sync in SyncBN.
However, in eval mode SyncBN should behave like a regular BN layer and not perform this sync.
Closes https://github.com/pytorch/pytorch/issues/48988
Ensured with a unittest that when used for inference on a single rank, stats sync is not triggered.
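A minimal sketch of the guard (illustrative names, not the exact diff):
```python
def needs_stats_sync(bn) -> bool:
    # With track_running_stats=False there are no running stats, so
    # bn_training is True even in eval mode...
    bn_training = bn.training or (
        bn.running_mean is None and bn.running_var is None
    )
    # ...but the collective stats sync must only fire while actually training,
    # so eval mode falls back to plain local batch norm.
    return bn_training and bn.training
```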
ghstack-source-id: 127544421
Test Plan: CI
Reviewed By: SciPioneer
Differential Revision: D27579297
fbshipit-source-id: 26406e2793f0be14f2daa46ae66f97a8494182ed
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56425
As SPMD mode is gone, `_specify_ddp_gpu_num` has become useless. It only checks whether the module is a GPU module, and this is already checked by the callers of this function (in fairscale and some other codebases).
Additionally, remove the `enable_pytorch_sync_bn` wrapper, which only calls this function and does nothing else.
ghstack-source-id: 126885376
Test Plan: waitforbuildbot
Reviewed By: zhaojuanmao
Differential Revision: D27866440
fbshipit-source-id: d2fd5cf43eda25c0a2bd35f647848ec0dbd6ad0f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55946
As the `ddp_gpu_size` field of `SyncBatchNorm` will always be 1 for GPU modules, remove this field and the relevant code.
ghstack-source-id: 126883498
Test Plan: waitforbuildbot
Reviewed By: zhaojuanmao
Differential Revision: D27746021
fbshipit-source-id: b4518c07e6f0c6943fbd7a7548500a7d4337126c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54208
It seems like it was added to suppress some errors in LazyModules, but I think we should solve those more directly with some type ignores in more surgical places.
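For illustration, a "surgical" suppression annotates the single line that trips the checker instead of excluding the whole module in mypy.ini (the `Config` class below is hypothetical):
```python
from typing import Optional

class Config:  # hypothetical class, purely for illustration
    timeout: Optional[int] = None

cfg = Config()
# mypy would flag this Optional[int] -> int assignment, so we silence exactly
# this error code on this one line rather than adding a file-wide exclusion.
limit: int = cfg.timeout  # type: ignore[assignment]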
Fixes #54087.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Reviewed By: albanD
Differential Revision: D27137363
Pulled By: ezyang
fbshipit-source-id: 017cafcc3350e73cd62436078835b97cd9b3b929
Summary:
Fixes https://github.com/pytorch/pytorch/issues/53366
gchanan albanD
Thanks for the feedback. Did a first pass trying to address the concerns in the original issue.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53495
Reviewed By: mrshenli
Differential Revision: D26914768
Pulled By: albanD
fbshipit-source-id: fa049f1952ef05598f0da2abead9a5a5d3602f75
Summary:
Context: https://github.com/pytorch/pytorch/pull/53299#discussion_r587882857
These are the only hand-written parts of this diff:
- the addition to `.github/workflows/lint.yml`
- the file endings changed in these four files (to appease FB-internal land-blocking lints):
  - `GLOSSARY.md`
  - `aten/src/ATen/core/op_registration/README.md`
  - `scripts/README.md`
  - `torch/csrc/jit/codegen/fuser/README.md`
The rest was generated by running this command (on macOS):
```
git grep -I -l ' $' -- . ':(exclude)**/contrib/**' ':(exclude)third_party' | xargs gsed -i 's/ *$//'
```
I looked over the auto-generated changes and didn't see anything that looked problematic.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53406
Test Plan:
This run (after adding the lint but before removing existing trailing spaces) failed:
- https://github.com/pytorch/pytorch/runs/2043032377
This run (on the tip of this PR) succeeded:
- https://github.com/pytorch/pytorch/runs/2043296348
Reviewed By: walterddr, seemethere
Differential Revision: D26856620
Pulled By: samestep
fbshipit-source-id: 3f0de7f7c2e4b0f1c089eac9b5085a58dd7e0d97
Summary:
per title
This PR does the following:
- Migrates `apex.parallel.SyncBatchNorm` channels_last support to PyTorch's `torch.nn.SyncBatchNorm`
- Fixes a TODO by fusing the `sum` and `div` kernels into the backward elementwise kernel:
b167402e2e/torch/nn/modules/_functions.py (L76-L95)
Todo
- [x] Discuss a regression introduced in https://github.com/pytorch/pytorch/pull/37133#discussion_r512530389, which is the synchronized copy here
b167402e2e/torch/nn/modules/_functions.py (L32-L34)
**Comment**: This PR uses apex version for the size check. Test passed and I haven't seen anything wrong so far.
- [x] The restriction for using the channels_last kernel is as follows:
```
inline bool batch_norm_use_channels_last_kernels(const at::Tensor& self) {
  return self.is_contiguous(at::MemoryFormat::ChannelsLast) || self.ndimension() == 2;
}
```
I think we can relax that for channels_last_3d as well? (A short Python illustration of this layout check follows this list.)
**Comment**: we don't have a benchmark for this now; will check this and add the functionality later when needed.
- [x] Add test
- [x] Add benchmark
Detailed benchmark is at https://github.com/xwang233/code-snippet/tree/master/syncbn-channels-last
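As a quick illustration of the layout condition above (standard PyTorch calls on arbitrary tensors):
```python
import torch

# The channels_last fast path applies when the input is channels_last-contiguous,
# or when it is a plain 2D tensor.
x4d = torch.randn(8, 3, 32, 32).to(memory_format=torch.channels_last)
x2d = torch.randn(8, 3)

print(x4d.is_contiguous(memory_format=torch.channels_last))  # True
print(x2d.ndimension() == 2)                                 # True
```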
Close https://github.com/pytorch/pytorch/issues/50781
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46906
Reviewed By: albanD
Differential Revision: D26771437
Pulled By: malfet
fbshipit-source-id: d00387044e9d43ac7e6c0e32a2db22c63d1504de
Summary:
Some minor improvements for the lazy modules introduced in https://github.com/pytorch/pytorch/issues/44538, https://github.com/pytorch/pytorch/issues/47350 and https://github.com/pytorch/pytorch/issues/51548.
This PR mainly turns the bias into an `UninitializedParameter`. Instead of creating empty tensors like
```python
self.bias = Parameter(torch.Tensor(0))
self.bias = UninitializedParameter()
```
I think it would be better to
```python
self.register_parameter('bias', None)
self.bias = UninitializedParameter()
```
In addition, I changed the constructor of `LazyBatchNorm` from
```python
self.running_mean = UninitializedBuffer()
```
to
```python
self.register_buffer('running_mean', UninitializedBuffer())
```
as the original one would not change the underlying `self._buffers`.
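A small demonstration of why this matters (the `Demo` module is hypothetical):
```python
import torch
from torch import nn

class Demo(nn.Module):  # hypothetical module, for illustration only
    def __init__(self) -> None:
        super().__init__()
        self.plain = torch.zeros(3)                      # ordinary attribute
        self.register_buffer('tracked', torch.zeros(3))  # properly registered

m = Demo()
print('plain' in m._buffers)    # False: missing from _buffers (and state_dict)
print('tracked' in m._buffers)  # True
```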
Thank you for your time on reviewing this PR :).
Gently ping albanD, mruberry
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52212
Reviewed By: jbschlosser
Differential Revision: D26504508
Pulled By: albanD
fbshipit-source-id: 7094d0bb4fa9e2a40a07b79d350ea12a6ebfd080
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/45317
Eager mode quantization depends on the presence of the `qconfig`
model attribute. Currently, converting a model to use `SyncBatchNorm`
removes the qconfig; this PR fixes that. This is important when a BN is not
fused to anything during quantization convert.
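A hedged sketch of the shape of the fix (assuming a conversion helper that builds `module_output` from `module`; not the verbatim diff):
```python
def preserve_qconfig(module, module_output):
    # When swapping a BN module for SyncBatchNorm, carry the quantization
    # config over to the replacement so eager-mode quantization still sees it.
    if hasattr(module, 'qconfig'):
        module_output.qconfig = module.qconfig
    return module_output
```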
Test Plan:
```
python test/test_quantization.py TestDistributed.test_syncbn_preserves_qconfig
```
Imported from OSS
Reviewed By: jerryzh168
Differential Revision: D23922072
fbshipit-source-id: cc1bc25c8e5243abb924c6889f78cf65a81be158
Summary:
This PR aims at tackling https://github.com/pytorch/pytorch/issues/37823 by:
- ensuring that, when buffers are not None and `track_running_stats=False`, they are used for the normalization computation but are not updated (see the sketch after this list)
- adding a corresponding unittest to ensure the expected behaviour
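A minimal sketch of that decision (illustrative names, not the verbatim diff):
```python
import torch.nn.functional as F

def normalize(bn, x):
    # Batch stats are always used in training; in eval mode the buffers are
    # used whenever they exist.
    bn_training = bn.training or (bn.running_mean is None and bn.running_var is None)
    # Buffers are handed over for updating only while tracking is on, so with
    # track_running_stats=False they are read for normalization but never written.
    use_buffers = not bn.training or bn.track_running_stats
    return F.batch_norm(
        x,
        bn.running_mean if use_buffers else None,
        bn.running_var if use_buffers else None,
        bn.weight, bn.bias, bn_training, bn.momentum, bn.eps,
    )
```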
Any feedback is welcome!
_Note: we might want to update the docstrings of `BatchNorm*d`, feel free to share any suggestion!_
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38084
Differential Revision: D22047871
Pulled By: ezyang
fbshipit-source-id: 5acbcad9773e7901f26d625db71d43d7dc236d3e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/38211
Just because the annotations are inline doesn't mean the files type
check; most of the newly annotated files have type errors, and I
added exclusions for them in mypy.ini. The payoff of moving
all of these modules inline is that I can delete the relevant code
generation logic for the pyi files (which added ignore
annotations that weren't actually relevant anymore).
For the most part the translation was completely mechanical, but there
were two hairy issues. First, I needed to work around a Python 3.6 and
earlier bug where Generic has a nontrivial metaclass. This fix is in
torch/jit/__init__.py. Second, in module.py, we need to apply the same
fix for avoiding contravariance checks that the pyi file used to have;
this is done by declaring forward as a variable (rather than a
function), which appears to be sufficient to get mypy to not
contravariantly check input arguments.
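A condensed sketch of that second workaround (simplified from what module.py does):
```python
from typing import Any, Callable

def _forward_unimplemented(self, *input: Any) -> None:
    raise NotImplementedError

class Module:
    # Declaring forward as a Callable-typed class variable, rather than
    # defining it as a method, keeps mypy from contravariantly checking the
    # argument types of subclass overrides.
    forward: Callable[..., Any] = _forward_unimplemented
```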
Because we aren't actually typechecking these modules in most
cases, it is inevitable that some of these type annotations are wrong.
I slavishly copied the old annotations from the pyi files unless there
was an obvious correction I could make. These annotations will probably
need fixing up later.
Signed-off-by: Edward Z. Yang <ezyang@fb.com>
Test Plan: Imported from OSS
Differential Revision: D21497397
Pulled By: ezyang
fbshipit-source-id: 2b08bacc152c48f074e7edc4ee5dce1b77d83702
Summary:
xref gh-32838, gh-34032
This is a major refactor of parts of the documentation to split it up using sphinx's `autosummary` feature, which will build out `autofunction` and `autoclass` stub files and link to them. The end result is that the top module pages like torch.nn.rst and torch.rst are now more like tables of contents linking to the actual single-class or single-function documentation pages.
Along the way, I modified many of the docstrings to eliminate sphinx warnings when building. I think the only thing I changed from a non-documentation perspective is to add names to `__all__` when adding them to `globals()` in `torch/__init__.py`.
I do not know the CI system: are the documentation build artifacts available after the build, so reviewers can preview before merging?
Pull Request resolved: https://github.com/pytorch/pytorch/pull/37419
Differential Revision: D21337640
Pulled By: ezyang
fbshipit-source-id: d4ad198780c3ae7a96a9f22651e00ff2d31a0c0f
Summary:
Fixes two instances of "if" -> "it" in torch/nn/modules/batchnorm.py.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29797
Differential Revision: D19698613
Pulled By: ezyang
fbshipit-source-id: 7312b2333f227113e904dfa91db90d00e525affb
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/32745
Some parameters (like `bias` in conv) are optional. To achieve this
previously, you had to add `bias` as a constant, which would invoke some
pretty weird behavior in the frontend, summarized as:
```
if bias is not None:
    add it as a parameter normally
else:  # bias is None
    add it as a constant with the value None
```
There are several bad things about this:
1. Bias is not a constant; marking it in `__constants__` is confusing.
2. It basically relies on an implementation detail (the frontend
processes parameters before constants) to work.
Okay, whatever. I don't even know why we did this originally, but
getting rid of it doesn't break anything, so I assume improved NoneType
refinement has made this a non-issue.
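For illustration, registering the parameter as None then relies only on that refinement (a hypothetical toy module, not the real conv code):
```python
import torch
from torch import nn

class ToyLinear(nn.Module):  # hypothetical, for illustration only
    def __init__(self, bias: bool = True) -> None:
        super().__init__()
        self.weight = nn.Parameter(torch.randn(4, 4))
        if bias:
            self.bias = nn.Parameter(torch.zeros(4))
        else:
            # An ordinary (absent) parameter -- no __constants__ entry needed.
            self.register_parameter('bias', None)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out = x @ self.weight
        if self.bias is not None:  # NoneType refinement lets script compile this
            out = out + self.bias
        return out

scripted = torch.jit.script(ToyLinear(bias=False))  # compiles without constants
```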
Note on perf: this will make no difference; if bias was `None` it's still
folded out today, and if bias is a Tensor it would be added as a parameter
both before and after this change.
Test Plan: Imported from OSS
Differential Revision: D19628634
Pulled By: suo
fbshipit-source-id: d9128a09c5d096b938fcf567b8c23b09ac9ab37f
Summary:
Fixes https://github.com/pytorch/pytorch/issues/29187
This introduces a new class `_NormBase` that `_InstanceNorm` and `_BatchNorm` inherit from separately. This means the `isinstance(module, _BatchNorm)` check won't falsely pass for `_InstanceNorm`.
The suggested fix of adding `and not isinstance(module, _InstanceNorm)` works as well, but requires introducing a cyclic dependency between `instancenorm.py` and `batchnorm.py`.
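A skeletal sketch of the resulting hierarchy (bodies elided):
```python
from torch import nn

class _NormBase(nn.Module):      # shared implementation lives here
    ...

class _BatchNorm(_NormBase):
    ...

class _InstanceNorm(_NormBase):  # now a sibling, not a subclass, of _BatchNorm
    ...

# The isinstance check no longer falsely matches instance norms:
assert not isinstance(_InstanceNorm(), _BatchNorm)
```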
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29985
Differential Revision: D18588104
Pulled By: yf225
fbshipit-source-id: f599da3b902ad9c56836db4d429bfc462ed51338
Summary:
Updates the requirements on input dimensions for `torch.nn.SyncBatchNorm`:
1. 2D input is now permissible, https://github.com/pytorch/pytorch/issues/20204 ;
2. at least two elements are now required along the normalization plane (matching BatchNorm behavior).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/29626
Differential Revision: D18492531
Pulled By: albanD
fbshipit-source-id: f008e46a2d520d73c3c2730890a7424eba2ede9e
Summary:
Removes the in-place operator for the num_batches_tracked increment. The in-place
operator used here turns out to block many optimization opportunities due to
the aliasing assumptions it forces on inputs.
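For illustration, the change amounts to swapping the in-place add for an out-of-place one (a simplified stand-in for the module buffer):
```python
import torch

num_batches_tracked = torch.tensor(0, dtype=torch.long)

# Before: the in-place add mutates the buffer's storage, so downstream passes
# must assume the tensor is aliased and written to.
num_batches_tracked += 1

# After: an out-of-place add produces a fresh tensor and rebinds the name,
# removing the aliasing assumption.
num_batches_tracked = num_batches_tracked + 1
```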
Pull Request resolved: https://github.com/pytorch/pytorch/pull/27299
Differential Revision: D17909341
Pulled By: ngimel
fbshipit-source-id: 7d635be94dfd2002af435acf6ea71995adaa40f6
Summary:
This is a fix for a potential ONNX export issue with SyncBatchNorm where, irrespective of the value of momentum, the momentum in the exported ONNX BN node was always 0. The details are captured in https://github.com/pytorch/pytorch/issues/18525.
The fix in this PR for `SyncBatchNorm` is very similar to the fix that went into https://github.com/pytorch/pytorch/pull/18764 for `BatchNorm` (I think this site was just missed).
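A hedged sketch of the pattern that earlier fix used and this PR mirrors (not the verbatim diff):
```python
def resolve_momentum(bn):
    # Compute the effective averaging factor up front so tracing/export sees
    # the module's real momentum instead of a hard-coded 0.
    if bn.momentum is None:
        exponential_average_factor = 0.0  # cumulative moving average
    else:
        exponential_average_factor = bn.momentum
    if bn.training and bn.track_running_stats:
        bn.num_batches_tracked += 1
        if bn.momentum is None:
            exponential_average_factor = 1.0 / float(bn.num_batches_tracked)
    return exponential_average_factor
```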
Please note that no ONNX test points are added for this, because SyncBatchNorm works exclusively with tensors on GPU and the ONNX test passes are CPU-only. If there's a way to add a test point, please let me know.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24995
Differential Revision: D17085570
Pulled By: dzhulgakov
fbshipit-source-id: 162d428673c269efca4360fb103854b7319ec204
Summary:
After converting BN layers to SyncBN layers, the function sets `requires_grad = True` on all parameters regardless of their original requires_grad states. I think this is a bug and have fixed it in this PR.
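A minimal sketch of the fix (assuming a converter that copies `old_bn` into a freshly built `new_bn`; illustrative, not the verbatim diff):
```python
import torch

def copy_affine_params(old_bn, new_bn):
    if old_bn.affine:
        with torch.no_grad():
            new_bn.weight.copy_(old_bn.weight)
            new_bn.bias.copy_(old_bn.bias)
        # Restore the original flags instead of leaving the freshly created
        # parameters at the default requires_grad=True.
        new_bn.weight.requires_grad = old_bn.weight.requires_grad
        new_bn.bias.requires_grad = old_bn.bias.requires_grad
    return new_bn
```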
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22569
Differential Revision: D16151647
Pulled By: zou3519
fbshipit-source-id: e2ad1886c94d8882485e7fb8be51ad76469ecc67
Summary:
* Deletes all weak script decorators / associated data structures / methods
* In order to keep supporting the standard library in script, this enables recursive script on any function defined in `torch.nn`
* Most changes in `torch/nn` are the result of `ag -Q "weak" torch/nn/ -l | xargs sed -i '/weak/d'`; only `rnn.py` needed manual editing, using `ignore` and `export` to continue supporting the overloaded `forward` methods
* `Sequential`/`ModuleList` no longer need to be added to constants since they are compiled on demand
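For illustration, recursive scripting compiles the standard library modules on demand (a toy example; the model is hypothetical):
```python
import torch
from torch import nn

class Net(nn.Module):  # hypothetical model, for illustration only
    def __init__(self) -> None:
        super().__init__()
        # No __constants__ entry and no weak-script decorators needed:
        self.layers = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.layers(x)

# Sequential, Linear, and ReLU are compiled recursively on first use.
scripted = torch.jit.script(Net())
```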
This should also fix https://github.com/pytorch/pytorch/issues/22212
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22212
Differential Revision: D15988346
Pulled By: driazati
fbshipit-source-id: af223e3ad0580be895377312949997a70e988e4f