Fixes #144196
Extends #144106 and #144110
## Open Problems:
- [ ] Annotating with `numbers.Number` is a bad idea; we should consider using `float`, `SupportsFloat`, or some `Protocol` instead. https://github.com/pytorch/pytorch/pull/144197#discussion_r1903324769
# Notes
- `beta.py`: needed to add `type: ignore` since `broadcast_all` is untyped.
- `categorical.py`: converted `else` branches of mutually exclusive arguments to `if` branch[^2].
- ~~`dirichlet.py`: replaced `axis` with `dim` arguments.~~ #144402
- `geometric.py`: converted `else` branches of mutually exclusive arguments to `if` branch[^2].
- ~~`independent.py`: fixed bug in `Independent.__init__` where `tuple[int, ...]` could be passed to `Distribution.__init__` instead of `torch.Size`.~~ **EDIT:** turns out the bug is related to typing of `torch.Size`. #144218
- `independent.py`: made `Independent` a generic class of its base distribution.
- `multivariate_normal.py`: converted `else` branches of mutually exclusive arguments to `if` branch[^2].
- `relaxed_bernoulli.py`: added class-level type hint for `base_dist`.
- `relaxed_categorical.py`: added class-level type hint for `base_dist`.
- ~~`transforms.py`: Added missing argument to docstring of `ReshapeTransform`~~ #144401
- ~~`transforms.py`: Fixed bug in `AffineTransform.sign` (could return `Tensor` instead of `int`).~~ #144400
- `transforms.py`: Added `type: ignore` comments to `AffineTransform.log_abs_det_jacobian`[^1]; replaced `torch.abs(scale)` with `scale.abs()`.
- `transforms.py`: Added `type: ignore` comments to `AffineTransform.__eq__`[^1].
- `transforms.py`: Fixed type hint on `CumulativeDistributionTransform.domain`. Note that this is still an LSP violation, because `Transform.domain` is defined as `Constraint`, but `Distribution.domain` is defined as `Optional[Constraint]`.
- skipped: `constraints.py`, `constraint_registry.py`, `kl.py`, `utils.py`, `exp_family.py`, `__init__.py`.
## Remark
`TransformedDistribution`: `__init__` uses the check `if reinterpreted_batch_ndims > 0:`, which can lead to the creation of `Independent` distributions with only 1 component. This results in awkward code like `base_dist.base_dist` in `LogisticNormal`.
```python
import torch
from torch.distributions import *
b1 = Normal(torch.tensor([0.0]), torch.tensor([1.0]))
b2 = MultivariateNormal(torch.tensor([0.0]), torch.eye(1))
t = StickBreakingTransform()
d1 = TransformedDistribution(b1, t)
d2 = TransformedDistribution(b2, t)
print(d1.base_dist) # Independent with 1 dimension
print(d2.base_dist) # MultivariateNormal
```
One could consider changing this to `if reinterpreted_batch_ndims > 1:`.
[^1]: Usage of `isinstance(value, numbers.Real)` leads to problems with static typing, as the `numbers` module is not supported by `mypy` (see <https://github.com/python/mypy/issues/3186>). This results in us having to add type-ignore comments in several places.
[^2]: Otherwise, we would have to add a bunch of `type: ignore` comments to make `mypy` happy, as it isn't able to perform the type narrowing. Ideally, such code should be replaced with structural pattern matching once support for Python 3.9 is dropped.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144197
Approved by: https://github.com/malfet
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>
Fixes #76772, #144196
Extends #144106
- added type annotations to `lazy_property` (a rough illustrative sketch follows this list).
- added type annotations to all `@property` and `@lazy_property` methods inside the `torch.distributions` module.
- added a simple type-check unit test to ensure type inference is working.
- replaced deprecated annotations like `typing.List` with their modern counterparts.
- simplified `torch.Tensor` hints to plain `Tensor`, since signatures otherwise become very verbose.
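For context, the kind of typing this enables can be sketched as follows; this is a hypothetical simplification, not the actual implementation in `torch.distributions.utils`:
```python
from typing import Callable, Generic, Optional, TypeVar, overload

T = TypeVar("T")
R = TypeVar("R")

class lazy_property(Generic[T, R]):
    """Cache the wrapped method's result on the instance after first access."""

    def __init__(self, wrapped: Callable[[T], R]) -> None:
        self.wrapped = wrapped

    @overload
    def __get__(self, instance: None, obj_type: Optional[type] = None) -> "lazy_property[T, R]": ...
    @overload
    def __get__(self, instance: T, obj_type: Optional[type] = None) -> R: ...

    def __get__(self, instance, obj_type=None):
        if instance is None:
            return self  # accessed on the class: return the descriptor itself
        value = self.wrapped(instance)
        setattr(instance, self.wrapped.__name__, value)  # cache on the instance
        return value
```
With the generic `__get__` overloads, a static checker can infer the wrapped method's return type when the attribute is accessed on an instance.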
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144110
Approved by: https://github.com/Skylion007
This PR comprises a few small contributions:
1. `PowerTransform` returned a sign of `+1` irrespective of exponent. However, it should return the sign of the exponent because the gradient has the same sign as the exponent. That issue has been fixed.
2. Added tests to catch errors akin to 1. in the future.
3. Added an `InverseGamma` distribution as a `TransformedDistribution` with `PowerTransform(-1)` and `Gamma` base distribution. The `InverseGamma` is often used as a prior for the length scale of Gaussian processes to aggressively suppress short length scales (see [here](https://betanalpha.github.io/assets/case_studies/gaussian_processes.html#323_Informative_Prior_Model) for a discussion).
Note: I added a `positive` constraint for the support of the inverse gamma distribution because the `PowerTransform(-1)` can fail for `nonnegative` constraints if the random variable is zero.
```python
>>> torch.distributions.InverseGamma(0.5, 1.0).log_prob(torch.zeros(1))
---------------------------------------------------------------------------
ValueError Traceback (most recent call last)
<ipython-input-8-758aa22deacd> in <module>
----> 1 torch.distributions.InverseGamma(0.5, 1.0).log_prob(torch.zeros(1))
~/git/pytorch/torch/distributions/transformed_distribution.py in log_prob(self, value)
140 """
141 if self._validate_args:
--> 142 self._validate_sample(value)
143 event_dim = len(self.event_shape)
144 log_prob = 0.0
~/git/pytorch/torch/distributions/distribution.py in _validate_sample(self, value)
298 valid = support.check(value)
299 if not valid.all():
--> 300 raise ValueError(
301 "Expected value argument "
302 f"({type(value).__name__} of shape {tuple(value.shape)}) "
ValueError: Expected value argument (Tensor of shape (1,)) to be within the support (GreaterThan(lower_bound=0.0)) of the distribution InverseGamma(), but found invalid values:
tensor([0.])
```
This differs from the scipy implementation.
```python
>>> scipy.stats.invgamma(0.5).pdf(0)
0.0
```
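For reference, the construction described in point 3 can be sketched as follows (a minimal example with illustrative parameter values):
```python
import torch
from torch.distributions import Gamma, InverseGamma, TransformedDistribution
from torch.distributions.transforms import PowerTransform

concentration, rate = torch.tensor(3.0), torch.tensor(2.0)
x = torch.tensor([0.5, 1.0, 2.0])

# InverseGamma written out explicitly as a power-transformed Gamma.
manual = TransformedDistribution(Gamma(concentration, rate), PowerTransform(-1.0))

print(InverseGamma(concentration, rate).log_prob(x))
print(manual.log_prob(x))  # should agree up to floating point error
```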
Pull Request resolved: https://github.com/pytorch/pytorch/pull/104501
Approved by: https://github.com/fritzo, https://github.com/ezyang
`log1p` offers better precision near zero, since computing `1 + x` directly rounds away any `x` smaller than the float epsilon. For `soft_margin_loss` this also requires one fewer kernel invocation, which for `numel=1e7` gives me a 1.2x speedup on CUDA and a 1.1x speedup on CPU.
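As a quick illustration of the precision argument (not the loss-function code itself):
```python
import torch

x = torch.tensor(1e-10)        # float32
print(torch.log(1 + x))        # tensor(0.): 1 + 1e-10 rounds to exactly 1
print(torch.log1p(x))          # tensor(1.0000e-10): small values survive
```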
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92114
Approved by: https://github.com/ngimel, https://github.com/lezcano
The `PositiveDefiniteTransform` is required to transform from an unconstrained space to positive definite matrices, e.g. to support testing the Wishart mode in #76690. It is a simple extension of the `LowerCholeskyTransform`.
I've also added a small test that ensures the generated data belong to the domain of the associated transform. Previously, the data generated for the inverse transform of the `LowerCholeskyTransform` wasn't part of the domain, and the test only passed because the comparison uses `equal_nan=True`.
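A minimal usage sketch of the new transform:
```python
import torch
from torch.distributions.transforms import PositiveDefiniteTransform

t = PositiveDefiniteTransform()
x = torch.randn(3, 3)             # unconstrained square matrix
y = t(x)                          # positive definite matrix
print(torch.linalg.eigvalsh(y))   # all eigenvalues should be strictly positive
```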
Pull Request resolved: https://github.com/pytorch/pytorch/pull/76777
Approved by: https://github.com/lezcano, https://github.com/fritzo, https://github.com/soumith
`Transform` is not currently pickleable if the inverse transform cache `_inv` is not `None` because `_inv` is a `weakref` which cannot be serialized by `pickle`.
The following succeeds.
```python
>>> import torch as th
>>> import pickle
>>> dist = th.distributions.TransformedDistribution(
...     th.distributions.Normal(0, 1),
...     [th.distributions.AffineTransform(2, 3)]
... )
>>> th.save(dist, "some-file.pt")
```
But the transformed distribution can no longer be pickled after evaluating `log_prob` (which implicitly creates `_inv`).
```python
>>> dist.log_prob(th.linspace(0, 1, 10))
>>> th.save(dist, "some-file.pt")
TypeError: cannot pickle 'weakref' object
```
This PR fixes the issue by setting `_inv` to `None` in `__getstate__`. cc @fritzo, @neerajprad
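A minimal sketch of the idea (the actual patch may differ in detail):
```python
class Transform:
    ...  # existing implementation

    def __getstate__(self):
        state = self.__dict__.copy()
        state["_inv"] = None  # drop the weakref; .inv lazily recreates it on demand
        return state
```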
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81707
Approved by: https://github.com/fritzo
Summary:
Fixes hiding of the `CatTransform` docstring due to a misplaced type annotation in https://github.com/pytorch/pytorch/issues/45689.
Also fixes that annotation and adds one to `StackTransform` to keep it consistent with `CatTransform`.
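For context, a small illustration of how a misplaced annotation hides a docstring (class and attribute names here are hypothetical):
```python
class Example:
    transforms: list  # annotated assignment placed before the docstring
    """This string is just an ordinary expression, not the class docstring."""

print(Example.__doc__)  # None: a docstring must be the first statement in the body
```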
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73747
Reviewed By: mruberry
Differential Revision: D34649927
Pulled By: neerajprad
fbshipit-source-id: e4594fd76020a37ac4cada88e5fb08191984e911
(cherry picked from commit cec256c3242d1cf55073a980060af87c1fd59ac9)
Summary:
This pull request introduces `SoftplusTransform` to `torch.distributions.transforms`. `SoftplusTransform` transforms via the mapping `Softplus(x) = log(1 + exp(x))`. Note that the transform is different from [`torch.nn.Softplus`](https://pytorch.org/docs/stable/generated/torch.nn.Softplus.html#torch.nn.Softplus), which has additional `beta` and `threshold` parameters. An inverse and `log_abs_det_jacobian` for a more complex `SoftplusTransform` can be added in the future.
vitkl fritzo
Addresses the issue discussed here: [pyro issue 855](https://github.com/pyro-ppl/numpyro/issues/855)
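A minimal usage sketch (the round trip is only approximate in floating point):
```python
import torch
from torch.distributions.transforms import SoftplusTransform

t = SoftplusTransform()
x = torch.linspace(-5.0, 5.0, 11)
y = t(x)                                        # log(1 + exp(x)), always positive
print(torch.allclose(t.inv(y), x, atol=1e-5))   # inverse recovers x (approximately)
```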
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52300
Reviewed By: albanD, ejguan
Differential Revision: D34082655
Pulled By: neerajprad
fbshipit-source-id: 6114e74ee5d73c1527191bed612a142d691e2094
(cherry picked from commit a181a3a9e53a34214a503d38760ad7778d08a680)
Summary:
This PR adds a transform that uses the cumulative distribution function of a given probability distribution.
For example, the following code constructs a simple Gaussian copula.
```python
import torch
from torch.distributions import (
    LKJCholesky, MultivariateNormal, Normal, TransformedDistribution, Weibull,
)
from torch.distributions.transforms import CumulativeDistributionTransform

# Construct a Gaussian copula from a multivariate normal.
base_dist = MultivariateNormal(
    loc=torch.zeros(2),
    scale_tril=LKJCholesky(2).sample(),
)
transform = CumulativeDistributionTransform(Normal(0, 1))
copula = TransformedDistribution(base_dist, [transform])
```
The following snippet creates a "wrapped" Gaussian copula for correlated positive variables with Weibull marginals.
```python
transforms = [
    CumulativeDistributionTransform(Normal(0, 1)),
    CumulativeDistributionTransform(Weibull(4, 2)).inv,
]
wrapped_copula = TransformedDistribution(base_dist, transforms)
```
cc fritzo
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72495
Reviewed By: ejguan
Differential Revision: D34085919
Pulled By: albanD
fbshipit-source-id: 7917391519a96b0d9b54c52db65d1932f961d070
(cherry picked from commit 572196146e)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/50496
Fixes https://github.com/pytorch/pytorch/issues/34859
Fixes https://github.com/pytorch/pytorch/issues/21596
This fixes many bugs involving `TransformedDistribution` and `ComposeTransform` when the component transforms change their event shapes. Part of the fix is to introduce an `IndependentTransform` analogous to `distributions.Independent` and `constraints.independent`, and to introduce methods `Transform.forward_shape()` and `.inverse_shape()`. I have followed fehiepsi's suggestion and replaced `.input_event_dim` -> `.domain.event_dim` and `.output_event_dim` -> `.codomain.event_dim`. This allows us to deprecate `.event_dim` as an attribute.
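A small sketch of the new pieces (illustrative shapes and values):
```python
from torch.distributions.transforms import ExpTransform, IndependentTransform, ReshapeTransform

# Treat the last dim of an elementwise transform as part of the event.
t = IndependentTransform(ExpTransform(), reinterpreted_batch_ndims=1)
print(t.domain.event_dim, t.codomain.event_dim)   # 1 1

# Shape bookkeeping via forward_shape()/inverse_shape().
r = ReshapeTransform(in_shape=(2, 3), out_shape=(6,))
print(r.forward_shape((5, 2, 3)))                 # (5, 6)
print(r.inverse_shape((5, 6)))                    # (5, 2, 3)
```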
## Summary of changes
- Fixes `TransformedDistribution` and `ComposeTransform` shape errors.
- Fixes a behavior bug in `LogisticNormal`.
- Fixes `kl_divergence(TransformedDistribution, TransformedDistribution)`
- Adds methods `Transform.forward_shape()`, `.inverse_shape()` which are required for correct shape computations in `TransformedDistribution` and `ComposeTransform`.
- Adds an `IndependentTransform`.
- Adds a `ReshapeTransform` which is invaluable in testing shape logic in `ComposeTransform` and `TransformedDistribution` and which will be used by stefanwebb's flowtorch.
- Fixes incorrect default values in `constraints.dependent.event_dim`.
- Documents the `.event_dim` and `.is_discrete` attributes.
## Changes planned for follow-up PRs
- Memoize `constraints.dependent_property` as we do with `lazy_property`, since we now consult those properties much more often.
## Tested
- [x] added a test for `Dist.support` vs `Dist(**params).support` to ensure static and dynamic attributes agree.
- [x] refactoring is covered by existing tests
- [x] add test cases for `ReshapeTransform`
- [x] add a test for `TransformedDistribution` on a wide grid of input shapes
- [x] added a regression test for https://github.com/pytorch/pytorch/issues/34859
cc fehiepsi feynmanliang stefanwebb
Pull Request resolved: https://github.com/pytorch/pytorch/pull/50581
Reviewed By: ezyang, glaringlee, jpchen
Differential Revision: D26024247
Pulled By: neerajprad
fbshipit-source-id: f0b9a296f780ff49659b132409e11a29985dde9b
Summary:
Fixes https://github.com/pytorch/pytorch/issues/44530
As explained in the issue description, CatTransform does not work with event_dim > 0.
This PR fixes this. If this gets approved I am hoping to do the same for StackTransform as well.
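A sketch of the now-working case, using components with different event dims (shapes and choices here are illustrative):
```python
import torch
from torch.distributions.transforms import CatTransform, ExpTransform, SoftmaxTransform

# SoftmaxTransform acts on whole vectors (event_dim 1), ExpTransform elementwise.
t = CatTransform([SoftmaxTransform(), ExpTransform()], dim=1, lengths=[3, 2])
x = torch.randn(4, 5)
y = t(x)          # first 3 columns of each row -> simplex, last 2 -> exp
print(y.shape)    # torch.Size([4, 5])
```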
fritzo, can you take a look at this?
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49111
Reviewed By: neerajprad
Differential Revision: D25526005
Pulled By: ezyang
fbshipit-source-id: e14430093f550d5e0da7a311f9cd44796807830f
Summary:
This adds a transform to convert a real vector of dimension D * (D - 1) / 2 into the Cholesky factor of a D x D correlation matrix. This follows the implementation in [NumPyro](https://github.com/pyro-ppl/numpyro/blob/master/numpyro/distributions/transforms.py) by fehiepsi. This is needed for the LKJDistribution which will be added in a subsequent PR.
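A short usage sketch (here D = 3, so the unconstrained vector has length 3):
```python
import torch
from torch.distributions.transforms import CorrCholeskyTransform

t = CorrCholeskyTransform()
x = torch.randn(3)       # D * (D - 1) / 2 = 3 unconstrained values
L = t(x)                 # lower Cholesky factor of a 3 x 3 correlation matrix
print(L @ L.T)           # unit diagonal, off-diagonal entries in (-1, 1)
```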
Also in line with the ongoing effort to refactor distributions test, this moves the transforms test into its own file that uses pytest with parametrized fixtures.
For review:
fehiepsi - could you help review the math?
fritzo - do you have any suggestions for what to do about the event dimension (more details are in the comment below)?
ezyang - could you review the changes in `run_test.py`? Instead of a separate `PYTEST_TESTS`, I have clubbed these tests in `USE_PYTEST_LIST` to avoid duplicate logic. The only difference is that we do not anymore check if pytest is not installed and exclude the tests in the list. I figured that if existing tests are already using pytest, this should not matter.
TODOs (probably not all can be satisfied at the same time):
- [x] Use operations that are JIT friendly, i.e. the transform works with different sized input under JIT.
- [x] Resolve test failures - currently `arange(scalar_tensor)` fails on certain backends but this is needed for JIT. Maybe we should only support same sized tensor under JIT?
- [x] Add tests to check that the transform gives correct gradients and is in agreement with the `log_det_jacobian`.
- [x] Add `input_event_dim` and `output_event_dim` to `CorrCholeskyTransform`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48041
Reviewed By: zhangguanheng66
Differential Revision: D25262505
Pulled By: neerajprad
fbshipit-source-id: 5a57e1c19d8230b53592437590b9169bdf2f71e9
Summary:
This resolves an issue observed by stefanwebb where the composition of multiple transforms is cached only if all components are cached.
This PR adds a new method `.with_cache()` so that e.g. you can compose a normalizing flow (that needs to be cached) with a `SigmoidTransform` (that wasn't already cached) by calling `.with_cache()` on the latter. This issue also comes up when composing non-cached constraint transforms as returned by `transform_to()` and `biject_to()`: after this PR you can call `transform_to(constraints.positive).with_cache()` to get a cached `ExpTransform`.
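A small sketch of the intended usage:
```python
import torch
from torch.distributions import constraints, transform_to

t = transform_to(constraints.positive)   # ExpTransform with cache_size=0
t_cached = t.with_cache()                # same transform, but with cache_size=1
x = torch.randn(3, requires_grad=True)
y = t_cached(x)
print(t_cached.inv(y) is x)              # True: the cached input tensor is returned
```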
## Tested
- [x] added a unit test
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36882
Differential Revision: D21155914
Pulled By: ezyang
fbshipit-source-id: 3c06e63785ca2503e08a5cd7532aff81882835e9
Summary:
Removes older `torch.stack`-based logic in favor of `torch.diagonal()` and `torch.diag_embed()`.
I see a 100x speedup in my application, where my batched matrix has shape `(800, 32, 32)`.
```py
import torch
from torch.distributions import constraints, transform_to
x = torch.randn(800, 32, 32, requires_grad=True)
# Before this PR:
%%timeit
transform_to(constraints.lower_cholesky)(x).sum().backward()
# 579 ms ± 34.4 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)
# After this PR:
%%timeit
transform_to(constraints.lower_cholesky)(x).sum().backward()
# 4.5 ms ± 241 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
```
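For intuition, a rough sketch of the `diagonal`/`diag_embed` approach (not the exact library code):
```py
import torch

def to_lower_cholesky(x: torch.Tensor) -> torch.Tensor:
    # Keep the strictly lower triangle and re-embed an exponentiated (positive)
    # diagonal; works batched, e.g. for x of shape (800, 32, 32).
    diag = x.diagonal(dim1=-2, dim2=-1).exp()
    return x.tril(-1) + torch.diag_embed(diag)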
Pull Request resolved: https://github.com/pytorch/pytorch/pull/24131
Differential Revision: D16764035
Pulled By: ezyang
fbshipit-source-id: 170cdb0d924cdc94cd5ad3b75d1427404718d437
Summary:
This modernizes distributions code by replacing a few uses of `.contiguous().view()` with `.reshape()`, fixing a sampling bug in the `Categorical` distribution.
The bug is exercised by the following test:
```py
import torch
import torch.distributions as dist

batch_shape = (1, 2, 1, 3, 1)
sample_shape = (4,)
cardinality = 2
logits = torch.randn(batch_shape + (cardinality,))
dist.Categorical(logits=logits).sample(sample_shape)
# RuntimeError: invalid argument 2: view size is not compatible with
# input tensor's size and stride (at least one dimension spans across
# two contiguous subspaces). Call .contiguous() before .view().
# at ../aten/src/TH/generic/THTensor.cpp:203
```
I have verified this works locally, but I have not added this as a regression test because it is unlikely to regress (the code is now simpler).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23328
Differential Revision: D16510678
Pulled By: colesbury
fbshipit-source-id: c125c1a37d21d185132e8e8b65241c86ad8ad04b
Summary:
This PR addresses some numerical issues of SigmoidTransform and StickBreakingTransform, where these transforms return +-inf when the unconstrained values move into the +-20 range.
For example, with
```python
t = torch.distributions.SigmoidTransform()
x = torch.tensor(20.)
t.inv(t(x)), t.log_abs_det_jacobian(x, t(x))
```
the current behaviour is that the inverse returns `inf` and the log-det returns `-inf`, while with this PR they become `15.9424` and `-15.9424` respectively.
And for
```python
t = torch.distributions.StickBreakingTransform()
x = torch.tensor([20., 20.])
t.inv(t(x)), t.log_abs_det_jacobian(x, t(x))
```
the current values are `(inf, nan)` for the inverse and `-inf` for the log-det, while with this PR they become `[16.6355, 71.3942]` and `-47.8272` respectively.
Although these finite values are wrong and this seems unavoidable, it is better than returning `inf` or `nan` in my opinion. This is useful in HMC: even though the gradient will be zero when the unconstrained parameter moves into an unstable region (due to clipping), the velocity variable will force the parameter to move elsewhere, which by chance can take it out of the unstable region. But inf/nan can be useful for stopping inference early, so the changes in this PR might be inappropriate.
I also fix some small issues in the `_Simplex` and `_RealVector` constraints, where the batch shape of the input was not respected during validation.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/20288
Differential Revision: D15742047
Pulled By: ezyang
fbshipit-source-id: b427ed1752c41327abb3957f98d4b289307a7d17
Summary:
This adds calls to `super().__init__()` in three classes in torch.distributions.
This is needed when `Distribution` and `Transform` objects are used with multiple inheritance, as e.g. combined with `torch.nn.Module`s. For example
```py
import torch

class MyModule(torch.distributions.Transform, torch.nn.Module):
    ...
```
cc martinjankowiak esling who have wanted to use this pattern, e.g. in #16756
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16772
Differential Revision: D13978633
Pulled By: soumith
fbshipit-source-id: 8bc6cca1747cd74d32135ee2fe588bba2ea796f1
Summary:
The earlier tests had around 80 warnings, and now there are 6 warnings; these are due to the JIT.
The changes remove the wrapping of a Tensor by a Tensor constructor, which emits warnings due to the changes in https://github.com/pytorch/pytorch/pull/11061.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12038
Differential Revision: D10033392
Pulled By: apaszke
fbshipit-source-id: b1faf368e650d062d7983f9932511bee4702a893
* Codemod to update our codebase to 0.4 standard
* Update some of the test scripts
* remove Variable in test_clip_grad_value
* fix _symbolic_override_wrapper_maker
This avoids promotion from python float to torch.Tensor for AffineTransform. This appears to be needed so that constraint registration works across CPU and all GPUs.
Previous discussion at 3a25db73c8 (r176361909)
Background:
There are three basic types of objects in torch.distributions:
- Distributions are flyweight objects constructed from tensor or float args. They always promote float args to tensors.
- Transforms are longer-lived objects (sometimes cached; some are static globals). They can take float arguments. This PR makes AffineTransform avoid promoting float args to tensors.
- Constraints are long-lived objects. They can take either float or tensor arguments. They do not promote floats to tensors. These are relatively symbolic and are not much more than partially evaluated comparisons, e.g. constraints.positive is basically a symbolic version of lambda x: x > 0 that can be stored in a ConstraintRegistry table.
The Problem:
Sometimes we want to apply transform_to(constraints.positive) to a torch.cuda.FloatTensor. This is fine since
transform_to(constraints.positive)(x)
= ExpTransform()(x)
= x.exp()
which works with any tensor type.
Other times we want to apply transform_to(constraints.greater_than(1.5)) to a torch.cuda.FloatTensor. This is problematic before this PR since
transform_to(constraints.greater_than(1.5))(x)
= ComposeTransform([ExpTransform(), AffineTransform(1.5, 1)])(x)
= AffineTransform(1.5, 1)(x.exp())
= t.loc + t.scale * x.exp() # where t = AffineTransform(1.5, 1)
Before this PR, AffineTransform would promote t.loc and t.scale to tensors. This promotion can happen as early as library load time for some transforms, e.g. transform_to(constraints.unit_interval). Therefore before this PR, the second example would error at t.scale * x.exp() because t.scale is a [default] torch.FloatTensor whereas x.exp() is a torch.cuda.FloatTensor.
Proposed solution:
This PR merely adds support for python floats as the .loc and .scale parameters of AffineTransform. This should suffice for most purposes since only AffineTransform and a handful of parameter-free transforms are ever stored in the global transform_to and biject_to registries.
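As an illustration of what this enables (a sketch; the CUDA branch runs only if a GPU is available):
```python
import torch
from torch.distributions import constraints, transform_to

# ComposeTransform([ExpTransform(), AffineTransform(1.5, 1)]), as described above.
t = transform_to(constraints.greater_than(1.5))
x = torch.randn(3)
print(t(x))                # 1.5 + exp(x), works on CPU tensors
if torch.cuda.is_available():
    print(t(x.cuda()))     # works on CUDA tensors too, since loc/scale stay python floats
```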
Alternative solutions include:
- allowing promotion from torch.FloatTensor to all other tensor types, e.g. torch.cuda.FloatTensor.
- adding a handful of specific parameter-free transforms like NegateTransform() in lieu of AffineTransform(0, -1).
Tested: added a regression test
* Support python floats in AffineTransform
* Update docstrings