M.L. Croci 1f0223d6bb Fix bug in gaussian_nll_loss (#56469)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/53964. cc albanD almson

## Major changes:
- Overhauled the loss calculation in `functional.py` so that the returned shapes are now correct
- Added the missing doc entry in `nn.functional.rst`

## Minor changes (in functional.py):
- Removed the previous check that input and target have the same shape. This allows broadcasting, say when you have 10 predictions that all share the same target.
- Added comments explaining each shape check in detail. Let me know if these should be shortened/cut.
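
With the shape check relaxed, the target can now broadcast against the input when one of its dimensions is 1. A minimal sketch of that use-case (the tensor shapes here are illustrative, not from the patch):

```python
import torch

loss = torch.nn.GaussianNLLLoss(reduction='none')

# 10 predictions of a 3-dimensional quantity that all share one target:
# the target's leading dimension of 1 broadcasts against the input's 10.
input = torch.randn(10, 3)
target = torch.randn(1, 3)
var = torch.ones(10, 3)

out = loss(input, target, var)
print(out.shape)  # torch.Size([10, 3])
```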

Screenshots of updated docs attached.
Let me know what you think, thanks!

## Edit: Description of change of behaviour (affecting BC):
Backwards compatibility is affected only in the `reduction='none'` mode, which was the source of the bug. For tensors of size (N, D), the returned loss previously had size (N,) because an incorrect summation was applied. It will now have size (N, D) as expected.

### Example
Define input tensors, all with size (2, 3).
`input = torch.tensor([[0., 1., 3.], [2., 4., 0.]], requires_grad=True)`
`target = torch.tensor([[1., 4., 2.], [-1., 2., 3.]])`
`var = 2*torch.ones(size=(2, 3), requires_grad=True)`

Initialise loss with reduction mode 'none'. We expect the returned loss to have the same size as the input tensors, (2, 3).
`loss = torch.nn.GaussianNLLLoss(reduction='none')`

Old behaviour:
`print(loss(input, target, var))`
`# Gives tensor([3.7897, 6.5397], grad_fn=<MulBackward0>). This has size (2,).`

New behaviour:
`print(loss(input, target, var))`
`# Gives tensor([[0.5966, 2.5966, 0.5966], [2.5966, 1.3466, 2.5966]], grad_fn=<MulBackward0>)`
`# This has the expected size, (2, 3).`

To recover the old behaviour, sum along all dimensions except for the 0th:
`print(loss(input, target, var).sum(dim=1))`
`# Gives tensor([3.7897, 6.5397], grad_fn=<SumBackward1>).`
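
For reference, the new unreduced output can be checked elementwise against the closed-form expression `0.5 * (log(var) + (input - target)**2 / var)`. A sketch using the tensors above, assuming the default `full=False` and `var` well above the `eps` clamp:

```python
import torch

input = torch.tensor([[0., 1., 3.], [2., 4., 0.]], requires_grad=True)
target = torch.tensor([[1., 4., 2.], [-1., 2., 3.]])
var = 2 * torch.ones(2, 3, requires_grad=True)

loss = torch.nn.GaussianNLLLoss(reduction='none')
out = loss(input, target, var)

# Elementwise Gaussian negative log-likelihood (constant term omitted
# since full=False): 0.5 * (log(var) + (input - target)**2 / var).
expected = 0.5 * (var.log() + (input - target) ** 2 / var)
print(torch.allclose(out, expected))  # True
```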

![doc1](https://user-images.githubusercontent.com/26558092/115391089-f7f47b00-a1d6-11eb-8726-e4da9057aee0.png)
![doc2](https://user-images.githubusercontent.com/26558092/115391094-f925a800-a1d6-11eb-954b-afd187f42bc7.png)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/56469

Reviewed By: jbschlosser, agolynski

Differential Revision: D27894170

Pulled By: albanD

fbshipit-source-id: 197890189c97c22109491c47f469336b5b03a23f
2021-04-22 07:43:48 -07:00