pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Aaron Orenstein	086d146f6f	Update ruff linter for PEP585 (#147540 ) This turns on PEP585 enforcement in RUFF. - Updates the target python version - Stops ignoring UP006 warnings (PEP585) - Fixes a few issues which crept into the tree in the last day Pull Request resolved: https://github.com/pytorch/pytorch/pull/147540 Approved by: https://github.com/justinchuby, https://github.com/Skylion007	2025-02-22 04:45:17 +00:00
Aaron Orenstein	7ce4974e50	Fix PEP585 update (#147536 ) Summary: D69920347 causes a pyre failure due to changing a base object from typing.Iterable to abc.Iterable. For now revert that change until it can be dealt with on its own. Test Plan: failures from D69920347 pass locally unit tests pass Reviewed By: oulgen Differential Revision: D69936518 Pull Request resolved: https://github.com/pytorch/pytorch/pull/147536 Approved by: https://github.com/jeanschmidt	2025-02-21 14:37:03 +00:00
Aaron Orenstein	db4ce78d46	PEP585: More UP006 fixes (#146392 ) This should be the final PR before we can enable RUFF UP006. Pull Request resolved: https://github.com/pytorch/pytorch/pull/146392 Approved by: https://github.com/justinchuby, https://github.com/albanD, https://github.com/Skylion007	2025-02-20 06:18:13 +00:00
Zhou32	ecf44d1002	Fixed a typo in dataset.py (#146600 ) Changed word 'Mult' to 'Multi'. Pull Request resolved: https://github.com/pytorch/pytorch/pull/146600 Approved by: https://github.com/Skylion007	2025-02-07 05:09:51 +00:00
Aaron Orenstein	629840e038	Backout PEP585 use of Iterable (#145438 ) Summary: Importing Iterable from collections.abc here causes an internal product to fail MRO discovery causing a collision between Iterable and Generic. This fixes the failure on D68461304 Differential Revision: D68531443 Pull Request resolved: https://github.com/pytorch/pytorch/pull/145438 Approved by: https://github.com/izaitsevfb	2025-01-23 11:45:37 +00:00
Aaron Orenstein	2f9d378f7b	PEP585 update - torch/utils (#145201 ) See #145101 for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/145201 Approved by: https://github.com/bobrenjc93	2025-01-21 21:04:10 +00:00
bobrenjc93	90e81a157a	Migrate from Tuple -> tuple in torch/utils/data (#144255 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/144255 Approved by: https://github.com/andrewkho	2025-01-08 04:09:45 +00:00
Xuehai Pan	f1df13f023	[BE][Easy] Fix `PYI001`: unprefixed-type-param in `torch/utils/data/datapipes` (#129885 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/129885 Approved by: https://github.com/ezyang	2024-07-02 14:56:27 +00:00
Xuehai Pan	f911957573	[BE] sort imports in `torch.utils.data` (#127704 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/127704 Approved by: https://github.com/ezyang ghstack dependencies: #127706	2024-06-27 23:16:24 +00:00
Aaron Orenstein	8db9dfa2d7	Flip default value for mypy disallow_untyped_defs [9/11] (#127846 ) See #127836 for details. Pull Request resolved: https://github.com/pytorch/pytorch/pull/127846 Approved by: https://github.com/ezyang ghstack dependencies: #127842, #127843, #127844, #127845	2024-06-08 18:50:06 +00:00
Xuehai Pan	67ef2683d9	[BE] wrap deprecated function/class with `typing_extensions.deprecated` (#127689 ) Use `typing_extensions.deprecated` for deprecation annotation if possible. Otherwise, add `category=FutureWarning` to `warnings.warn("message")` if the category is missing. Note that only warnings that their messages contain `[Dd]eprecat(ed\|ion)` are updated in this PR. Resolves #126888 - #126888 This PR is split from PR #126898. - #126898 ------ Pull Request resolved: https://github.com/pytorch/pytorch/pull/127689 Approved by: https://github.com/Skylion007	2024-06-02 12:30:43 +00:00
PyTorch MergeBot	033e733021	Revert "[BE] wrap deprecated function/class with `typing_extensions.deprecated` (#126898 )" This reverts commit `749a132fb0`. Reverted https://github.com/pytorch/pytorch/pull/126898 on behalf of https://github.com/fbgheith due to switching typing-extensions=4.3.0 to 4.9.0 causes internal failure ([comment](https://github.com/pytorch/pytorch/pull/126898#issuecomment-2142884456))	2024-05-31 19:47:24 +00:00
Xuehai Pan	749a132fb0	[BE] wrap deprecated function/class with `typing_extensions.deprecated` (#126898 ) Use `typing_extensions.deprecated` for deprecation annotation if possible. Otherwise, add `category=FutureWarning` to `warnings.warn("message")` if the category is missing. Note that only warnings that their messages contain `[Dd]eprecat(ed\|ion)` are updated in this PR. UPDATE: Use `FutureWarning` instead of `DeprecationWarning`. Resolves #126888 - #126888 Pull Request resolved: https://github.com/pytorch/pytorch/pull/126898 Approved by: https://github.com/albanD	2024-05-29 12:09:27 +00:00
Aaron Gokaslan	1d5a9a1c1a	[Easy][BE]: remove itertools.accumulate Python 2 shim and apply UFMT (#116192 ) Removes an unnecessary duplicated utility functions and just have it rely on itertools. Since the file is low traffic, I also added the modified files to UFMT'd files and formatted them. Pull Request resolved: https://github.com/pytorch/pytorch/pull/116192 Approved by: https://github.com/malfet	2023-12-20 20:36:59 +00:00
Aryan Gupta	92e7f79609	Doc: Add and Fix docstrings for torch.util.data files (#112817 ) Fixes #112635 Fix docstrings for `torch.utils.data` files. ``` Before: > pydocstyle torch/utils/data/graph.py --count Before: 5 After: 1 > pydocstyle torch/utils/data/graph_settings.py --count Before: 8 After: 3 > pydocstyle torch/utils/data/dataloader.py --count Before: 12 After: 6 > pydocstyle torch/utils/data/dataset.py --count Before: 28 After: 23 > pydocstyle torch/utils/data/sampler.py --count Before: 24 After: 19 > pydocstyle torch/utils/data/_utils/signal_handling.py --count Before: 1 After: 0 > pydocstyle torch/utils/data/_utils/__init__.py --count Before: 2 After: 0 > pydocstyle torch/utils/data/_utils/collate.py --count Before: 20 After: 6 > pydocstyle torch/utils/data/_utils/fetch.py --count Before: 3 After: 0 > pydocstyle torch/utils/data/_utils/pin_memory.py --count Before: 4 After: 1 > pydocstyle torch/utils/data/datapipes/_decorator.py --count Before: 19 After: 16 > pydocstyle torch/utils/data/datapipes/_hook_iterator.py --count Before: 13 After: 0 > pydocstyle torch/utils/data/datapipes/_typing.py --count Before: 17 After: 4 > pydocstyle torch/utils/data/datapipes/gen_pyi.py --count Before: 19 After: 4 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/112817 Approved by: https://github.com/kit1980	2023-11-07 17:59:56 +00:00
Ramil Nugmanov	91eeb77260	StackDataset batched sampling (#110694 ) Optimization of loading minibatches Pull Request resolved: https://github.com/pytorch/pytorch/pull/110694 Approved by: https://github.com/ejguan	2023-10-10 22:05:51 +00:00
Navid Sheik	96a3a7cc82	[pytorch] make IterableDataset of Iterable type (#109645 ) Summary: Makes `IterableDataset` of `Iterable` type. Test Plan: tests next diff in the stack are all green Differential Revision: D49420146 Pull Request resolved: https://github.com/pytorch/pytorch/pull/109645 Approved by: https://github.com/DanilBaibak, https://github.com/Skylion007	2023-09-25 14:18:15 +00:00
katotaisei	bcda859e34	fix typos (#108006 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/108006 Approved by: https://github.com/Skylion007	2023-08-28 19:49:09 +00:00
Nikita Shulga	5837e95d30	[Reland] Update mypy to 1.4.1 (#105227 ) This PR re-lands - [Typing] Fix PEP 484 Violation (#105022) - Update mypy to 1.4.1 (#91983) That were reverted due to the conflict with internal source repo. Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - Add assert it `torch/optim/optimizer.py` that Optional list is not None TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Unrelated, to bypass CI failures due to the gcc9 dependency update in Ubuntu-18.04: - Add hack to squash older libstdc++ from conda environment in favor one from OS to `.ci/docker/install_conda.sh` - Update bazel cuda builds to focal, as with libstdc++-6.0.32 bazel builds loose the ability to catch exceptions (probably because they link with cupti statically, but I could not found where it is done) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105227 Approved by: https://github.com/atalman, https://github.com/albanD, https://github.com/Skylion007	2023-07-15 20:30:20 +00:00
PyTorch MergeBot	15fd1ea118	Revert "[Reland] Update mypy to 1.4.1 (#105227 )" This reverts commit `c9c4f8efc3`. Reverted https://github.com/pytorch/pytorch/pull/105227 on behalf of https://github.com/atalman due to trying to mitigate ci sev #105248 ([comment](https://github.com/pytorch/pytorch/pull/105227#issuecomment-1636510935))	2023-07-14 22:28:35 +00:00
Nikita Shulga	c9c4f8efc3	[Reland] Update mypy to 1.4.1 (#105227 ) This PR re-lands - [Typing] Fix PEP 484 Violation (#105022) - Update mypy to 1.4.1 (#91983) That were reverted due to the conflict with internal source repo. Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - Add assert it `torch/optim/optimizer.py` that Optional list is not None TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105227 Approved by: https://github.com/atalman, https://github.com/albanD, https://github.com/Skylion007	2023-07-14 20:45:12 +00:00
PyTorch MergeBot	3c5a494d7a	Revert "Update mypy to 1.4.1 (#91983 )" This reverts commit `634659e262`. Reverted https://github.com/pytorch/pytorch/pull/91983 on behalf of https://github.com/malfet due to It's dependent change was reverted, so reverting this one as well, to keep CI clean ([comment](https://github.com/pytorch/pytorch/pull/91983#issuecomment-1636059709))	2023-07-14 15:59:16 +00:00
Nikita Shulga	634659e262	Update mypy to 1.4.1 (#91983 ) Mostly fixes for PEP-484 violation (i.e. when default arg is set to None, but type is not annotated as optional) Plus few real fixes: - Add missing `_get_upgraders_entry_map` to `torch/_C/__init__.pyi` - Add missing return statement to `torch._export. deserialize_graph` - Fix error message in `torch.ao.ns.fx.weight_utils.get_lstm_mod_weights` - TODO (in followup PR): - Fix erroneous `isinstance` check in `torch/ao/quantization/_pt2e/qat_utils.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/91983 Approved by: https://github.com/kit1980, https://github.com/ZainRizvi, https://github.com/huydhn, https://github.com/thiagocrepaldi, https://github.com/aaronenyeshi	2023-07-13 16:30:36 +00:00
Ramil Nugmanov	28098cae6b	[DataLoader] Adding `StackDataset` (#101338 ) Torch wrapping datasets list has: `TensorDataset` `ConcatDataset` `ChainDataset` `TensorDataset` is useful for stacking sets of tensors but can't work with objects without `.size()` method. This PR proposes `StackDataset`, similar to `TensorDataset` but for a general case like `ConcatDataset`. Possible usage of `StackDataset` is multimodal networks with different input like image+text or for staking non-tensor input and property to predict. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101338 Approved by: https://github.com/ejguan, https://github.com/NivekT	2023-05-18 00:57:12 +00:00
Ramil Nugmanov	a2e81a8004	[DataLoader] `__getitems__` added to description of Dataset API and better supported within `Subset` (#100375 ) DataLoader supports batched loading from Mapped Datasets. This is the fetcher's implementation of auto-detection of batch loading support. torch.utils.data._utils.fetch._MapDatasetFetcher ``` class _MapDatasetFetcher(_BaseDatasetFetcher): def fetch(self, possibly_batched_index): if self.auto_collation: if hasattr(self.dataset, "__getitems__") and self.dataset.__getitems__: data = self.dataset.__getitems__(possibly_batched_index) else: data = [self.dataset[idx] for idx in possibly_batched_index] ``` Description of Dataset API now shows this feature. Additionally, Subset dataset now supports `__getitems__` if parent dataset supports it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100375 Approved by: https://github.com/ejguan, https://github.com/NivekT	2023-05-05 15:52:28 +00:00
Chase	2f41bc5465	[DataLoader] Add context to NotImplementedErrors in dataset.py (#100667 ) Add helpful context message to `NotImplementedError`'s thrown by Dataset and IterableDataset, reminding users that they must implement `__getitem__`/`__iter__` in subclasses. Currently, users are presented with a bare `NotImplementedError` without describing the remedy. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100667 Approved by: https://github.com/NivekT	2023-05-05 02:16:42 +00:00
Sergii Dymchenko	46faa79e09	Simplify by using yield from in torch/utils/data (#97839 ) Also see https://github.com/pytorch/pytorch/pull/97831 Pull Request resolved: https://github.com/pytorch/pytorch/pull/97839 Approved by: https://github.com/NivekT, https://github.com/Skylion007	2023-03-29 04:51:26 +00:00
Xuehai Pan	5b1cedacde	[BE] [2/3] Rewrite `super()` calls in functorch and torch (#94588 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94588 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-10 21:16:33 +00:00
joncrall	ad782ff7df	Enable xdoctest runner in CI for real this time (#83816 ) Builds on #83317 and enables running the doctests. Just need to figure out what is causing the failures. Pull Request resolved: https://github.com/pytorch/pytorch/pull/83816 Approved by: https://github.com/ezyang, https://github.com/malfet	2022-12-29 05:32:42 +00:00
joncrall	b136f3f310	More doctest refinements. (#83317 ) Follow up to #82797 Now that the doctests themselves are in a better state, we should be able to enable xdoctest on the CI so they stay that way. @ezyang @vadimkantorov Pull Request resolved: https://github.com/pytorch/pytorch/pull/83317 Approved by: https://github.com/ezyang	2022-08-22 20:07:26 +00:00
joncrall	4618371da5	Integrate xdoctest - Rebased (#82797 ) This is a new version of #15648 based on the latest master branch. Unlike the previous PR where I fixed a lot of the doctests in addition to integrating xdoctest, I'm going to reduce the scope here. I'm simply going to integrate xdoctest, and then I'm going to mark all of the failing tests as "SKIP". This will let xdoctest run on the dashboards, provide some value, and still let the dashboards pass. I'll leave fixing the doctests themselves to another PR. In my initial commit, I do the bare minimum to get something running with failing dashboards. The few tests that I marked as skip are causing segfaults. Running xdoctest results in 293 failed, 201 passed tests. The next commits will be to disable those tests. (unfortunately I don't have a tool that will insert the `#xdoctest: +SKIP` directive over every failing test, so I'm going to do this mostly manually.) Fixes https://github.com/pytorch/pytorch/issues/71105 @ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/82797 Approved by: https://github.com/ezyang	2022-08-12 02:08:01 +00:00
Robert	3064982fb8	Support percentages in random_split (#78877 ) Fixes #78510 This PR adds support for using fractions with `random_split`. This should be completely backwards-compatible as the fractional-style splitting is only applied when the sum across the input lengths is lower than 1.0 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78877 Approved by: https://github.com/ejguan	2022-06-16 02:00:25 +00:00
Erjia Guan	0289ab2cec	Fix data-related public API (#368 ) Summary: X-link: https://github.com/pytorch/data/pull/368 This is PR aims to expose the right data-relate API. There are two more changes made in this PR to convert public api to private api `check_lambda_fn` -> `_check_lambda_fn` `deprecation_warning` -> `_deprecation_warning` Pull Request resolved: https://github.com/pytorch/pytorch/pull/76143 Reviewed By: albanD, NivekT Differential Revision: D35798311 Pulled By: ejguan fbshipit-source-id: b13fded5c88a533c706702fb2070c918c839dca4 (cherry picked from commit 0b534b829a2e90e1e533951c6d334fdeaa9358b9)	2022-04-21 17:27:05 -07:00
Kevin Tse	eec994fc16	[DataPipe] Separating DataPipes from Dataset into different files (#73396 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73396 Separating DataPipes from Dataset into different files. This makes the code more maintainable and simplifies some of the code generation. I have also tried to move `datapipe.py` into `torch.utils.data.datapipes`, but that will lead to circular import and rewriting many import statements. Should I put more time and go down that path some more? Fixes https://github.com/pytorch/data/issues/213 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D34481962 Pulled By: NivekT fbshipit-source-id: 42fb26fe7fc334636852cfd8719fc807bdaa7912 (cherry picked from commit 81e76a64e297cb5c58caa951c554e49526173936)	2022-03-15 14:46:34 +00:00
Kevin Tse	cd4ecce1bb	[DataPipe] Fix issue with DataPipe serialization with `dill` (#72896 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72896 Fixing the issue described here: https://github.com/pytorch/data/issues/214 There will be a follow-up PR in TorchData as well Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D34258669 Pulled By: NivekT fbshipit-source-id: 6dd88250ed14ebe779915dc46139be7e012e9d1b (cherry picked from commit 025b8ed98019e576bfef04c33a3f33ed1a426a66)	2022-02-23 16:31:20 +00:00
Kevin Tse	f5e201e4e9	[DataPipe] Adding usage examples for IterDataPipes (#73033 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73033 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D34313793 Pulled By: NivekT fbshipit-source-id: 51125be2f79d73d02658b2b1c2691f96be8d4769 (cherry picked from commit `3e3c2df7c6`)	2022-02-18 15:12:34 +00:00
Kevin Tse	3e1eff9a0e	[DataPipe] Add docstrings for IterDataPipe and MapDataPipe, along with small doc changes for consistency (#72618 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72618 The major changes are in torch/utils/data/dataset.py Let me know if anything is unclear. I'm open to suggestion. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D34119492 Pulled By: NivekT fbshipit-source-id: 358cb6d33d18501f9042431350f872ebaa9b4070 (cherry picked from commit `53b484f60a`)	2022-02-10 16:25:36 +00:00
Erjia Guan	0721fc6474	Decouple MapDataPipe from Dataset (#70991 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/70991 Test Plan: Imported from OSS Reviewed By: dagitses Differential Revision: D33477680 Pulled By: ejguan fbshipit-source-id: d3e89492e921a96791319f35052a229684ddf7cf	2022-01-07 14:28:41 -08:00
Vitaly Fedyunin	d90012689f	[DataPipe] Control shuffle settings from DataLoader2 (#65756 ) Summary: Makes `shuffle` DataPipe sensitive to DataLoader(2) `shuffle` kwarg. Pull Request resolved: https://github.com/pytorch/pytorch/pull/65756 Reviewed By: albanD Differential Revision: D31344867 Pulled By: VitalyFedyunin fbshipit-source-id: e0084e0ac193ac784d6298328ca1222745681347	2021-12-14 07:35:26 -08:00
Tim Poulsen	486ae5c733	Dataset & IterableDataset attribute errors prints attribute (#69021 ) Summary: The message is the message from a standard attribute error. Thought it would be informative when the error is thrown. Alternatively in python 3.10, one can set the keyword arguments 'name' and 'obj', reference: https://github.com/python/cpython/blob/3.10/Doc/library/exceptions.rst#concrete-exceptions Fixes #{?} Pull Request resolved: https://github.com/pytorch/pytorch/pull/69021 Reviewed By: samdow Differential Revision: D32730362 Pulled By: ejguan fbshipit-source-id: 7132ba612fa6075aeffb9315ce651828e9a8e0bc	2021-12-01 10:16:31 -08:00
Erjia Guan	4c87aa77d1	[DataPipe] Traverse DataPipe graph excluding primitive and callable (#67783 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/67783 Add `getstate_hook` to exclude primitive objects and callable when serialization when `exclude_primitive` is enabled for `traverse`. For graph traversing, we don't have to handle the lambda and other stuff. This is used by `OnDiskCacheHolder` to trace the DataPipe Graph. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D32146697 Pulled By: ejguan fbshipit-source-id: 03b2ce981bb21066e807f57c167b77b2d0e0ce61	2021-11-15 06:46:31 -08:00
Kevin Tse	0b09d62cf3	[hackathon][DataPipe] adding .pyi file generation for torch.utils.data.datapipes (#67374 ) Summary: Stack from [ghstack](https://github.com/ezyang/ghstack): * __->__ https://github.com/pytorch/pytorch/issues/67374 This is a work in progress. Related TorchData issue: https://github.com/pytorch/data/issues/80 cc VitalyFedyunin ejguan NivekT Pull Request resolved: https://github.com/pytorch/pytorch/pull/67374 Reviewed By: H-Huang Differential Revision: D32153211 Pulled By: NivekT fbshipit-source-id: b4c61f191f20fd98ca44bb9e4f972c6d812994a0	2021-11-08 14:43:24 -08:00
Vitaly Fedyunin	ab5e1c69a7	[WIP] Example of DataPipes and DataFrames integration (#60840 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60840 Test Plan: Imported from OSS Reviewed By: wenleix, ejguan Differential Revision: D29461080 Pulled By: VitalyFedyunin fbshipit-source-id: 4909394dcd39e97ee49b699fda542b311b7e0d82	2021-09-13 18:50:15 -07:00
Santiago Castro	69f4401b7b	Make datasets in `ConcatDataset` not need to be sized (#64114 ) Summary: `datasets` needs to be iterable, but also sized because the length is checked. But immediately after it's converted to a list. By changing the order of these 2 lines, it doesn't need to be sized anymore. Pull Request resolved: https://github.com/pytorch/pytorch/pull/64114 Reviewed By: H-Huang Differential Revision: D30641480 Pulled By: ejguan fbshipit-source-id: 7e16548c2123afa65b83845f9929271fa07fe1e8	2021-09-01 15:32:50 -07:00
Santiago Castro	6b85c99ce5	Avoid an unnecessary list creation in `DataChunk` (#64111 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64111 Reviewed By: mruberry Differential Revision: D30639383 Pulled By: ezyang fbshipit-source-id: 96b243307413c99a67d55d862a71937e1ef210f4	2021-08-30 19:25:42 -07:00
Erjia Guan	383a33a0eb	Make DataChunk support list in-place ops (#63422 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63422 Fixes #63095 Make `DataChunk` delegate to list method. Then it will support in-place operations: - `sort` - `reverse` - `append` - `extend` - `random.shuffle` Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D30379027 Pulled By: ejguan fbshipit-source-id: d176bd0cc8b89b915c7bb184ff243ab1f605616d	2021-08-18 08:48:47 -07:00
Erjia Guan	56d609d93e	Replace str by repr for DataChunk (#63184 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63184 Test Plan: Imported from OSS Reviewed By: bdhirsh Differential Revision: D30288892 Pulled By: ejguan fbshipit-source-id: 45c88fdd3987e234f2c22ebbbfd8d5044983c34c	2021-08-16 06:41:38 -07:00
Vitaly Fedyunin	d3bdf345cb	Introducing DataChunk for DataPipes batching (#62768 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/62768 This is part of TorchArrow DF support preparation, separating it to multiple PRs to simplify review process. Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D30149090 Pulled By: VitalyFedyunin fbshipit-source-id: a36b5ff56e2ac6b06060014d4cd41b487754acb8	2021-08-06 08:38:33 -07:00
Vitaly Fedyunin	171598f0e3	[Refactoring] Fix imports order for torch/utils/data/dataset.py (#61328 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/61328 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D29588897 Pulled By: VitalyFedyunin fbshipit-source-id: 63df653fb471532819c83ebcee4f9dc951500ffb	2021-07-22 08:30:08 -07:00
Simon Seo	b505adbb09	Fix typo in ChainDataset docs (#60336 ) Summary: * chainning -> chaining Pull Request resolved: https://github.com/pytorch/pytorch/pull/60336 Reviewed By: bdhirsh Differential Revision: D29265236 Pulled By: anjali411 fbshipit-source-id: 17a9b73af9e094550bd1ee25bc9439fb8d455e2b	2021-06-21 11:47:21 -07:00

1 2

85 Commits