pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
PyTorch MergeBot	dac895c10a	Revert "Multiprocessing support for NT (#110292 )" This reverts commit `f17fe89e14`. Reverted https://github.com/pytorch/pytorch/pull/110292 on behalf of https://github.com/kit1980 due to Causes CUDA memory leaks ([comment](https://github.com/pytorch/pytorch/pull/110292#issuecomment-1749852095))	2023-10-06 01:07:40 +00:00
PyTorch MergeBot	81ce5d5725	Revert "pin_memory support for NT (#110404 )" This reverts commit `3597325bc7`. Reverted https://github.com/pytorch/pytorch/pull/110404 on behalf of https://github.com/kit1980 due to Previous PR in the stack caused CUDA memory leaks ([comment](https://github.com/pytorch/pytorch/pull/110404#issuecomment-1749850211))	2023-10-06 01:03:17 +00:00
Joel Schlosser	3597325bc7	pin_memory support for NT (#110404 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/110404 Approved by: https://github.com/cpuhrsch, https://github.com/albanD ghstack dependencies: #110292	2023-10-05 16:33:22 +00:00
Joel Schlosser	f17fe89e14	Multiprocessing support for NT (#110292 ) Fixes #110161 Allows NTs to be used in DataLoaders with `num_workers > 1`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110292 Approved by: https://github.com/cpuhrsch, https://github.com/albanD	2023-10-05 15:04:48 +00:00
PyTorch MergeBot	7e6cf04a84	Revert "Multiprocessing support for NT (#110292 )" This reverts commit `881e7304d6`. Reverted https://github.com/pytorch/pytorch/pull/110292 on behalf of https://github.com/jbschlosser due to Address review comments ([comment](https://github.com/pytorch/pytorch/pull/110292#issuecomment-1743524901))	2023-10-02 18:27:13 +00:00
Joel Schlosser	881e7304d6	Multiprocessing support for NT (#110292 ) Fixes #110161 Allows NTs to be used in DataLoaders with `num_workers > 1`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110292 Approved by: https://github.com/cpuhrsch ghstack dependencies: #110219	2023-10-02 18:14:34 +00:00
Aaron Gokaslan	3488837ec1	Update ruff to v0.0.286 (#108058 ) Updates ruff to v0.0.286 and fixes one false negative. Pull Request resolved: https://github.com/pytorch/pytorch/pull/108058 Approved by: https://github.com/albanD	2023-08-28 22:55:56 +00:00
PyTorch MergeBot	ecde622649	Revert "reseed all Generators in Dataloader's _worker_loop() -- via GC (#107131 )" This reverts commit `42625da5e1`. Reverted https://github.com/pytorch/pytorch/pull/107131 on behalf of https://github.com/facebook-github-bot due to Diff reverted internally ([comment](https://github.com/pytorch/pytorch/pull/107131#issuecomment-1690325745))	2023-08-23 17:08:07 +00:00
Nicolas Hug	42625da5e1	reseed all Generators in Dataloader's _worker_loop() -- via GC (#107131 ) Alternative to https://github.com/pytorch/pytorch/pull/107034, implements @ezyang's suggestion from https://github.com/pytorch/pytorch/pull/107034#discussion_r1292857201. This PR addresses https://fb.workplace.com/groups/pytorch.oss.dev/posts/1699944830430051 and does a bunch of stacked changes: - Make `Generator` class support GC; this makes all `Generator` instances tracked and accessible through Python's GC. - Use the GC to retrieve all existing Generator instances in Dataloader's `_worker_loop` and re-seed them: this extends what is already applied to the global/default Generator, which is already re-seeded. ~TODO: a bit of docs and justification, which I'll do if this PR is mergeable.~ -- Done CC @albanD @ezyang as previously discussed BC-Breaking Note ------------------- We now re-seed all `Generator` instances within the `Dataloader` workers' loop to ensure that their RNG is different across workers. Previously, the RNG of user-defined `Generators` would be the same across workers, which could lead to wrong training procedures. This only affects user-defined `Generators`, not the default `Generator` (which was already re-seeded). Pull Request resolved: https://github.com/pytorch/pytorch/pull/107131 Approved by: https://github.com/ezyang	2023-08-18 10:23:23 +00:00
shibo19	bb2fcc7659	unify TEST_CUDA (#106685 ) Fixes #ISSUE_NUMBER as title, unify TEST_CUDA Pull Request resolved: https://github.com/pytorch/pytorch/pull/106685 Approved by: https://github.com/zou3519	2023-08-10 09:01:36 +00:00
Justin Chu	4cc1745b13	[BE] f-stringify torch/ and scripts (#105538 ) This PR is a follow up on the pyupgrade series to convert more strings to use f-strings using `flynt`. - https://docs.python.org/3/reference/lexical_analysis.html#f-strings - https://pypi.org/project/flynt/ Command used: ``` flynt torch/ -ll 120 flynt scripts/ -ll 120 flynt tools/ -ll 120 ``` and excluded `collect_env.py` Pull Request resolved: https://github.com/pytorch/pytorch/pull/105538 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-07-21 19:35:24 +00:00
Justin Chu	73e1455327	[BE] Enable ruff's UP rules and autoformat test/ (#105434 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105434 Approved by: https://github.com/albanD	2023-07-19 20:36:06 +00:00
Ramil Nugmanov	28098cae6b	[DataLoader] Adding `StackDataset` (#101338 ) Torch's list of wrapping datasets has: `TensorDataset` `ConcatDataset` `ChainDataset` `TensorDataset` is useful for stacking sets of tensors but can't work with objects without a `.size()` method. This PR proposes `StackDataset`, similar to `TensorDataset` but for the general case, like `ConcatDataset`. A possible use of `StackDataset` is multimodal networks with different inputs, such as image+text, or stacking a non-tensor input with the property to predict. Pull Request resolved: https://github.com/pytorch/pytorch/pull/101338 Approved by: https://github.com/ejguan, https://github.com/NivekT	2023-05-18 00:57:12 +00:00
Zachary DeVito	7ff1f3f3f6	Revert "Revert "Expandable blocks in allocator (#96995 )"" (#99275 ) This reverts commit `851e89c8e8`. Differential Revision: [D45034526](https://our.internmc.facebook.com/intern/diff/D45034526) Pull Request resolved: https://github.com/pytorch/pytorch/pull/99275 Approved by: https://github.com/eellison	2023-04-17 23:46:08 +00:00
PyTorch MergeBot	851e89c8e8	Revert "Expandable blocks in allocator (#96995 )" This reverts commit `6a50b83b73`. Reverted https://github.com/pytorch/pytorch/pull/96995 on behalf of https://github.com/izaitsevfb due to Breaks internal tests	2023-04-16 19:23:37 +00:00
Zachary DeVito	6a50b83b73	Expandable blocks in allocator (#96995 ) Common advice we give for handling memory fragmentation issues is to allocate a big block upfront to reserve memory which will get split up later. For programs with changing tensor sizes this can be especially helpful to avoid OOMs that happen the first time we see a new largest input and would otherwise have to allocate new segments. However the issue with allocating a block upfront is that it is nearly impossible to correctly estimate the size of that block. If too small, space in the block will run out and the allocator will allocate separate blocks anyway. Too large, and other non-PyTorch libraries might stop working because they cannot allocate any memory. This patch provides the same benefits as pre-allocating a block but without having to choose its size upfront. Using the cuMemMap-style APIs, it adds the ability to expand the last block in a segment when more memory is needed. Compared to universally using cudaMallocAsync to avoid fragmentation, this patch can fix this common fragmentation issue while preserving most of the existing allocator behavior. This behavior can be enabled and disabled dynamically. This should allow users to, for instance, allocate long-lived parameters and state in individual buffers, and put temporary state into the large expandable blocks, further reducing fragmentation. See inline comments for information about the implementation and its limitations. Pull Request resolved: https://github.com/pytorch/pytorch/pull/96995 Approved by: https://github.com/eellison	2023-04-14 09:49:11 +00:00
puririshi98	8aa34602f7	Jetson Update for CI Redo (#94549 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/94549 Approved by: https://github.com/ezyang, https://github.com/malfet	2023-02-21 17:13:38 +00:00
Xuehai Pan	046e88a291	[BE] [3/3] Rewrite `super()` calls in test (#94592 ) Rewrite Python built-in class `super()` calls. Only non-semantic changes should be applied. - #94587 - #94588 - #94592 Also, methods with only a `super()` call are removed: ```diff class MyModule(nn.Module): - def __init__(self): - super().__init__() - def forward(self, ...): ... ``` Some cases that change the semantics should be kept unchanged. E.g.: `f152a79be9/caffe2/python/net_printer.py (L184-L190)` `f152a79be9/test/test_jit_fuser_te.py (L2628-L2635)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/94592 Approved by: https://github.com/ezyang, https://github.com/seemethere	2023-02-12 22:20:53 +00:00
Aaron Gokaslan	67d9790985	[BE] Apply almost all remaining flake8-comprehension checks (#94676 ) Applies the remaining flake8-comprehension fixes and checks. This change replaces all remaining unnecessary generator expressions with list/dict/set comprehensions, which are more succinct, performant, and better supported by our torch.jit compiler. It also removes useless generators such as `set(a for a in b)`, resolving them into just the set call. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94676 Approved by: https://github.com/ezyang	2023-02-12 01:01:25 +00:00
Aaron Gokaslan	9171f7d4cd	[BE] Modernize PyTorch even more for 3.8 with pyupgrade (#94520 ) Applies some more pyupgrade fixits to PyTorch Pull Request resolved: https://github.com/pytorch/pytorch/pull/94520 Approved by: https://github.com/ezyang	2023-02-10 18:02:50 +00:00
Aaron Gokaslan	8fce9a09cd	[BE]: pyupgrade Python to 3.8 - imports and object inheritance only (#94308 ) Apply parts of pyupgrade to torch (starting with the safest changes). This PR only does two things: removes the need to inherit from object and removes unused future imports. Pull Request resolved: https://github.com/pytorch/pytorch/pull/94308 Approved by: https://github.com/ezyang, https://github.com/albanD	2023-02-07 21:10:56 +00:00
Jeff Daily	04689ae209	[CI][ROCm] skip multiprocessing tests that trigger hangs (#92101 ) Skip tests affected by #90940. Pull Request resolved: https://github.com/pytorch/pytorch/pull/92101 Approved by: https://github.com/huydhn	2023-01-13 22:39:00 +00:00
Sergii Dymchenko	a775204499	Fix issue 38095 TODO in test_dataloader.py (#90084 ) Fix TODO related to https://github.com/pytorch/pytorch/issues/38095 Pull Request resolved: https://github.com/pytorch/pytorch/pull/90084 Approved by: https://github.com/clee2000, https://github.com/NivekT	2022-12-03 03:01:52 +00:00
Huy Do	21dd311077	Add a mode to rerun all disabled tests (without running anything else) (#88646 ) Rerun all disabled test to gather their latest result so that we can close disabled tickets automatically. When running under this mode (RERUN_DISABLED_TESTS=true), only disabled tests are run while the rest are skipped `<skipped message="Test is enabled but --rerun-disabled-tests verification mode is set, so only disabled tests are run" type="skip"/>` The logic is roughly as follows, the test runs multiple times (n=50) * If the disabled test passes, and it's flaky, do nothing because it's still flaky. In the test report, we'll see the test passes with the following skipped message: ``` <testcase classname="TestMultiprocessing" file="test_multiprocessing.py" line="357" name="test_fs" time="0.000" timestamp="0001-01-01T00:00:00"> <skipped message="{"flaky": True, "num_red": 4, "num_green": 0, "max_num_retries": 3, "rerun_disabled_test": true}" type="skip"/> </testcase> ``` * If the disabled test passes every single time, and it is not flaky anymore, mark it so that it can be closed later. We will see the test runs and passes, i.e. ``` <testcase classname="TestCommonCUDA" name="test_out_warning_linalg_lu_factor_cuda" time="0.170" file="test_ops.py" /> ``` * If the disabled test fails after all retries, this is also expected. So only report this but don't fail the job (because we don't care about red signals here), we'll see the test is skipped (without the `flaky` field), i.e. ``` <testcase classname="TestMultiprocessing" file="test_multiprocessing.py" line="357" name="test_fs" time="0.000" timestamp="0001-01-01T00:00:00"> <skipped message="{"num_red": 4, "num_green": 0, "max_num_retries": 3, "rerun_disabled_test": true}" type="skip"/> </testcase> ``` This runs at the same schedule as `mem_leak_check` (daily). The change to update test stats, and (potentially) grouping on HUD will come in separated PRs. 
### Testing * pull https://github.com/pytorch/pytorch/actions/runs/3447434434 * trunk https://github.com/pytorch/pytorch/actions/runs/3447434928 Pull Request resolved: https://github.com/pytorch/pytorch/pull/88646 Approved by: https://github.com/clee2000	2022-11-15 05:08:26 +00:00
Kevin Tse	be8d88f8d0	[DataLoader] Removing DataLoader2 related code (#88848 ) Removing these lines of code as `DataLoader2` has been added to [TorchData](https://github.com/pytorch/data). I'm importing this to confirm it will not impact internal codes. Differential Revision: [D41201578](https://our.internmc.facebook.com/intern/diff/D41201578) Pull Request resolved: https://github.com/pytorch/pytorch/pull/88848 Approved by: https://github.com/ejguan	2022-11-11 22:27:01 +00:00
erjia	4c19981316	[DataPipe] Reset Shuffler's iterator when NotStarted (#83535 ) This PR changes the behavior of `IterDataPipe` to always invoke `reset` in the `NotStarted` state. The main reason is that we normally put lazy initialization code into the `reset` function. Even in the `NotStarted` state, we should invoke `reset` to initialize those lazy variables. Otherwise, we have to manually determine whether the state is `NotStarted` or `Iterating` in the `__iter__` function and manually invoke `reset` only in the `NotStarted` state. This PR also makes `Shuffler` able to serialize with `buffer` and `rng_state`. The following part is removed: ~I am also add `_snapshot_state` into serialization state and during `__setstate__` only change the state to `Restored` if the original state is `Iterating`. Especially, for the case of deserializing/serializing `NotStarted` DataPipe (multiprocessing), we would invoke `set_seed` for `Shuffler`. We need the `DataPipe` remains as `NotStarted` to properly `reset`.~ The expected state transitions are listed below: - Initial state: `NotStarted` - `iter` -> Call `reset` and change the state to `Iterating` - serialize/deserialize -> Keep the state as `NotStarted` (will `reset` if `iter` is called afterwards) - Initial state: `Iterating` - `iter` -> Call `reset` and keep the state as `Iterating` - serialize/deserialize -> Change the state to `Restored` - Initial state: `Restored` - `iter` -> Only change the state to `Iterating` - serialize/deserialize -> Not allowed Pull Request resolved: https://github.com/pytorch/pytorch/pull/83535 Approved by: https://github.com/NivekT	2022-08-25 19:45:41 +00:00
soulitzer	4f00c7589d	Fix and unskip dataloader tests on ARM (#83125 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/83125 Approved by: https://github.com/albanD	2022-08-12 19:21:59 +00:00
albanD	2255911f8a	Make M1 tests green (#82213 ) This is skipping all the failing tests and add a new master job to test on M1 Pull Request resolved: https://github.com/pytorch/pytorch/pull/82213 Approved by: https://github.com/seemethere, https://github.com/soulitzer, https://github.com/malfet	2022-08-05 16:12:08 +00:00
Alexander Grund	2785818f5a	Choose test affinity based on current affinity (#80327 ) This avoids test failures in cgroup environments Fixes https://github.com/pytorch/pytorch/issues/44368 CC @VitalyFedyunin new PR after #44369 got closed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/80327 Approved by: https://github.com/VitalyFedyunin	2022-07-20 16:45:57 +00:00
erjia	270069cfb9	Fix DataLoader flaky tests that run out of shared memory (#81660 ) Fixes #70517 #70547 #74598 I basically disable the `loader_kill` test, which prevents proper cleanup. In a normal Python script, that cleanup is handled by the Python interpreter on exit. However, under the pytest framework, the Python interpreter never exits, so it never executes the cleanup of shared memory using `/multiprocessing/resource_tracker.py`. As a result, we might run out of shared memory in some cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/81660 Approved by: https://github.com/NivekT	2022-07-19 15:38:20 +00:00
Robert	3064982fb8	Support percentages in random_split (#78877 ) Fixes #78510 This PR adds support for using fractions with `random_split`. This should be completely backwards-compatible as the fractional-style splitting is only applied when the sum across the input lengths is lower than 1.0 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78877 Approved by: https://github.com/ejguan	2022-06-16 02:00:25 +00:00
Michael Suo	c978b609f7	[ci] remove IN_CI env var The conventional env var to set is CI. Both circle and GHA set it, so IN_CI is unnecessary Pull Request resolved: https://github.com/pytorch/pytorch/pull/79229 Approved by: https://github.com/janeyx99	2022-06-11 17:16:30 +00:00
Vitaly Fedyunin	883f8ef62e	[DataLoader] DataLoader now automatically apply sharding to DataPipes Pull Request resolved: https://github.com/pytorch/pytorch/pull/78631 Approved by: https://github.com/ejguan, https://github.com/NivekT	2022-06-02 17:40:29 +00:00
Sergii Dymchenko	e8bf3a9cd4	Remove Python 2-related code from dataloader (#78594 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/78594 Approved by: https://github.com/seemethere	2022-06-01 05:25:23 +00:00
Alban Desmaison	7c3d3b759b	Migrate x86 trunk build/test to macos12 This will enable MPS building but will NOT test mps as the runner do not have AMD gpus Pull Request resolved: https://github.com/pytorch/pytorch/pull/77645 Approved by: https://github.com/malfet, https://github.com/seemethere	2022-05-18 11:59:19 +00:00
Jeeja	45bbc4c028	Update Dataloader with default parameter device (#65402 ) Summary: pin_memory, has optional device parameter to specify which device you want to pin for. With this above change the Dataloader will work only for CUDA backend. To add support for other backend which supports pinned memory, dataloader is updated with device as optional parameter. Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/65402 Reviewed By: zou3519 Differential Revision: D32282204 Pulled By: VitalyFedyunin fbshipit-source-id: e2e09876969af108d0db38af7c2d1b2f1cfa9858 (cherry picked from commit 3b76e151964fce442e27fe8fb5c37af930da4fa1)	2022-04-21 01:33:53 +00:00
Philip Meier	3c10987692	don't add extra shuffle in DataLoader2 if one is present Without this, `DataLoader2` will just add an `Shuffler` to the end of the datapipe if `shuffle=True`: ```py from torch.utils.data.dataloader_experimental import DataLoader2 from torchdata.datapipes.iter import IterableWrapper, IterDataPipe, Shuffler class Sorter(IterDataPipe): def __init__(self, datapipe): self.datapipe = datapipe def __iter__(self): return iter(sorted(self.datapipe)) data = list(range(1000)) dp = IterableWrapper(data) dp = Shuffler(dp).set_shuffle(False) dp = Sorter(dp) dl2 = DataLoader2(dp, shuffle=True, batch_size=None) assert list(dl2) == data # fails unless you hit a lucky random seed ``` This example is somewhat non-sensical, but demonstrates we cannot simply add a `Shuffler`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/75014 Approved by: https://github.com/ejguan	2022-04-05 19:53:08 +00:00
Evren Tumer	7534525735	Reset worker cycle iterator for determinism across runs (#73675 ) Summary: Reset worker cycle iterator for determinism across runs Fixes https://github.com/pytorch/pytorch/issues/73603 Pull Request resolved: https://github.com/pytorch/pytorch/pull/73675 Reviewed By: bdhirsh Differential Revision: D34688704 Pulled By: ejguan fbshipit-source-id: 7bab11f0b9f59645d9b168fa11d92dc7c2c4d34e (cherry picked from commit eb5fd559224988f9967528e154cf37c5031fe7c2)	2022-03-09 14:55:07 +00:00
Kevin Tse	cd4ecce1bb	[DataPipe] Fix issue with DataPipe serialization with `dill` (#72896 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72896 Fixing the issue described here: https://github.com/pytorch/data/issues/214 There will be a follow-up PR in TorchData as well Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D34258669 Pulled By: NivekT fbshipit-source-id: 6dd88250ed14ebe779915dc46139be7e012e9d1b (cherry picked from commit 025b8ed98019e576bfef04c33a3f33ed1a426a66)	2022-02-23 16:31:20 +00:00
Erjia Guan	67a275c293	Fix persistent worker exits before pin_memory thread (#71579 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71579 Fixes #1551 As the comment in the code explains, register a function to terminate persistent workers. Adding a reference to these workers in `atexit` prevents the Python interpreter from killing these persistent worker processes before `pin_memory_thread` exits. If users explicitly kill the DataLoader iterator, the function registered in `atexit` is a no-op. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D33896537 Pulled By: ejguan fbshipit-source-id: 36b57eac7523d8aa180180c2b61fc693ea4638ae (cherry picked from commit `05add2ae0f`)	2022-02-01 23:57:17 +00:00
pyhuang97@gmail.com	16a9ffba4b	Allow specifying num_samples to RandomSampler even when replacement=False (#71568 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/38032 #39214 Hi, I modified the RandomSampler to satisfy the requirement of https://github.com/pytorch/pytorch/issues/38032. I also added and deleted some test cases in the test/test_dataloader.py to match with the new requirement. Pull Request resolved: https://github.com/pytorch/pytorch/pull/71568 Reviewed By: mikaylagawarecki Differential Revision: D33741776 Pulled By: ejguan fbshipit-source-id: 2d25f5096b7b36ad9fb6455107182f387cf8ee43 (cherry picked from commit `9c7e1891c2`)	2022-01-25 15:34:24 +00:00
Nikita Shulga	86aefdc082	Revert D33694867: Fix persistent worker exits before pin_memory thread Test Plan: revert-hammer Differential Revision: D33694867 (`e2191e7084`) Original commit changeset: 0847f4d424a0 Original Phabricator Diff: D33694867 (`e2191e7084`) fbshipit-source-id: 5f28616700d8647cbe468a9e300724a7f0c6cc15 (cherry picked from commit `3d8125ba6d`)	2022-01-22 00:09:28 +00:00
Erjia Guan	e2191e7084	Fix persistent worker exits before pin_memory thread (#71579 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/71579 Fixes #1551 As the comment in the code, register a function to terminate persistent workers. Using `atexit` to make sure termination of persistent workers always happens at the end (after pin_memory_thread exits). We need such mechanism because Python interpreter would clean up worker process before DataLoader iterator in some rare cases. Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D33694867 Pulled By: ejguan fbshipit-source-id: 0847f4d424a0cd6b3c0be8235d505415970254e8 (cherry picked from commit `18ad4621af`)	2022-01-21 20:31:16 +00:00
Vitaly Fedyunin	d90012689f	[DataPipe] Control shuffle settings from DataLoader2 (#65756 ) Summary: Makes `shuffle` DataPipe sensitive to DataLoader(2) `shuffle` kwarg. Pull Request resolved: https://github.com/pytorch/pytorch/pull/65756 Reviewed By: albanD Differential Revision: D31344867 Pulled By: VitalyFedyunin fbshipit-source-id: e0084e0ac193ac784d6298328ca1222745681347	2021-12-14 07:35:26 -08:00
Kevin Tse	39fb855d91	[DataLoader] Implementing communication processes for Map-style DataPipes (#68549 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/68549 cc SsnL VitalyFedyunin ejguan NivekT Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D32922676 Pulled By: NivekT fbshipit-source-id: fd918a342214d617a489ac5acffff15b55e9b255	2021-12-08 07:27:01 -08:00
Santiago Castro	f776f30780	Keep the sequence or mapping type in `default_collate` (#68779 ) Summary: `default_collate`, `default_convert`, and `pin_memory` convert sequences into lists. I believe they should keep the original type when possible (e.g., I have a class that inherits from `list`, which comes from a 3rd party library that I can't change, and provides extra functionality). Note it's easy to do when the type supports an iterable in its creation but it's not always the case (e.g., `range`). Even though this can be accomplished if using a custom `default_collate`/`default_convert`, 1) this is behavior they should support out-of-the-box IMHO, and 2) `pin_memory` still does it. cc VitalyFedyunin ejguan NivekT Pull Request resolved: https://github.com/pytorch/pytorch/pull/68779 Reviewed By: wenleix Differential Revision: D32651129 Pulled By: ejguan fbshipit-source-id: 17c390934bacc0e4ead060469cf15dde815550b4	2021-11-29 13:14:20 -08:00
Jane Xu	39215ddf84	[skip ci] Set test owners for dataloader tests (#66839 ) Summary: Action following https://github.com/pytorch/pytorch/issues/66232 cc SsnL VitalyFedyunin ejguan NivekT Pull Request resolved: https://github.com/pytorch/pytorch/pull/66839 Reviewed By: ejguan Differential Revision: D31761722 Pulled By: janeyx99 fbshipit-source-id: 8315ac03352c11b3215d89856b3cfda6cd78fa0c	2021-10-19 08:31:16 -07:00
Michael Suo	9d13ae450a	[oss/ci] skip all dataloader tests with asan (#66561 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66561 See https://github.com/pytorch/pytorch/issues/66223 for context. Test Plan: Imported from OSS Reviewed By: mruberry Differential Revision: D31617142 Pulled By: suo fbshipit-source-id: 16b280fc47a7c40fa19c5c72192d342dd33680bf	2021-10-13 11:39:41 -07:00
Michael Suo	213c3f45da	[oss/ci] skip TestDataLoaderPersistentWorkers on ASAN (#66236 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66236 it's flaky, see https://github.com/pytorch/pytorch/issues/66223 Test Plan: Imported from OSS Reviewed By: malfet Differential Revision: D31462056 Pulled By: suo fbshipit-source-id: f4362a8020dc05ac8856706c0508d48be026eeb8	2021-10-06 17:56:19 -07:00
Erjia Guan	b777d790ea	Convert Sampler back to lazily construction (#63646 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63646 Fixes #63609 Test Plan: Imported from OSS Reviewed By: NivekT Differential Revision: D30451774 Pulled By: ejguan fbshipit-source-id: 550d77494326446d1a42b5da0559e0d384c47413	2021-09-30 07:32:06 -07:00
Nikita Shulga	4a7a0ea42e	Skip flaky ASAN tests (#65792 ) Summary: See https://github.com/pytorch/pytorch/issues/65727 Pull Request resolved: https://github.com/pytorch/pytorch/pull/65792 Reviewed By: janeyx99 Differential Revision: D31254490 Pulled By: malfet fbshipit-source-id: 76714db30a5566fbab95179236ccdafab22cf551	2021-09-28 22:33:02 -07:00
Nikita Shulga	145202c45b	Define timeout in TestIndividualWorkerQueue (#65742 ) Summary: This test occasionally deadlocks while waiting for the child process to report result. But as the test is small, entire test should never take more than 1-2 sec, but to be on the safe side set timeout to 5 sec Somewhat mitigates https://github.com/pytorch/pytorch/issues/65727 Pull Request resolved: https://github.com/pytorch/pytorch/pull/65742 Reviewed By: janeyx99, ejguan Differential Revision: D31235116 Pulled By: malfet fbshipit-source-id: 0cdd2f7295f6f9fcefee954a14352e18fae20696	2021-09-28 10:01:19 -07:00
Hong Xu	fb8bdb8039	When test set_affinity, don't hardcode the CPU ID (#65042 ) Summary: The setaffinity test always fails when the number of CPUs is smaller than 3. Changed the test to be dynamically based on the number of CPUs of the system. Pull Request resolved: https://github.com/pytorch/pytorch/pull/65042 Reviewed By: jbschlosser Differential Revision: D30960554 Pulled By: ejguan fbshipit-source-id: 55ac12714b4b0964b48c3617b79a7a345d40ebce	2021-09-15 08:10:59 -07:00
Rishi Puri	2ae938e15e	Fixes failure in test_dataloader.py that occurs on jetson boards (#64757 ) Summary: CUDA IPC is not supported for jetsons Pull Request resolved: https://github.com/pytorch/pytorch/pull/64757 Reviewed By: jbschlosser Differential Revision: D30900593 Pulled By: ejguan fbshipit-source-id: c6b2e8a9746276fdb4a009b6412e47cc8aac69f2	2021-09-13 10:11:04 -07:00
Erjia Guan	3cd0a4ac15	Fix test_ind_worker_queue by setting max_num_worker based on system resource (#63779 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63779 Fixes #63657 Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D30494185 Pulled By: ejguan fbshipit-source-id: d1bd24299b25d589889604aaf18ad347bdff4df4	2021-09-02 12:36:56 -07:00
Vitaly Fedyunin	82174330d0	[DataLoader2] Adding Messages, Protocols, Loop wrappers (#63882 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63882 Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D30627452 Pulled By: VitalyFedyunin fbshipit-source-id: 561ea2df07f3572e04401171946154024126387b	2021-08-30 07:57:20 -07:00
Erjia Guan	ad47fb8858	Rename IterableAsDataPipe to IterableWrapper (#63981 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63981 Rename `IterableAsDataPipe` to `IterableWrapper` based on our naming convention `Op-er` Test Plan: Imported from OSS Reviewed By: VitalyFedyunin Differential Revision: D30554197 Pulled By: ejguan fbshipit-source-id: c2eacb20df5645d83ca165d6a1591f7e4791990f	2021-08-26 10:23:25 -07:00
Vitaly Fedyunin	e1bdebf685	Adding DataLoader2 class as future replacement of DataLoader (#63742 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63742 Supports sharding and batching on loader level Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D30494506 Pulled By: VitalyFedyunin fbshipit-source-id: 6648e09d955055ac38e3a4e3973f701acefca762	2021-08-23 18:09:07 -07:00
Alban Desmaison	71da114412	Revert D30426527: Adding DataLoader2 class as future replacement of DataLoader Test Plan: revert-hammer Differential Revision: D30426527 (`5a7133b87f`) Original commit changeset: e5905d3364c4 fbshipit-source-id: 794d8a4e9256ccff8cf894aee10eff6adc30d502	2021-08-20 12:06:52 -07:00
Vitaly Fedyunin	5a7133b87f	Adding DataLoader2 class as future replacement of DataLoader (#63523 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63523 * #63522 Adding IterableAsDataPipe IterDataPipe useful for tests and simple cases Supports sharding and batching on loader level Test Plan: Imported from OSS Reviewed By: ejguan Differential Revision: D30426527 Pulled By: VitalyFedyunin fbshipit-source-id: e5905d3364c4880e720dd62fb066f08881c71a6e	2021-08-20 09:01:55 -07:00
Shen Li	1022443168	Revert D30279364: [codemod][lint][fbcode/c*] Enable BLACK by default Test Plan: revert-hammer Differential Revision: D30279364 (`b004307252`) Original commit changeset: c1ed77dfe43a fbshipit-source-id: eab50857675c51e0088391af06ec0ecb14e2347e	2021-08-12 11:45:01 -07:00
Zsolt Dollenstein	b004307252	[codemod][lint][fbcode/c*] Enable BLACK by default Test Plan: manual inspection & sandcastle Reviewed By: zertosh Differential Revision: D30279364 fbshipit-source-id: c1ed77dfe43a3bde358f92737cd5535ae5d13c9a	2021-08-12 10:58:35 -07:00
DamonDeng	53489bc385	fix for #60319 , forcing to use fork as start method in test/test_dat… (#60868 ) Summary: fix for https://github.com/pytorch/pytorch/issues/60319 , forcing to use fork as start method in test/test_dataloader.py Fixes #{60319} Pull Request resolved: https://github.com/pytorch/pytorch/pull/60868 Reviewed By: mruberry Differential Revision: D29432876 Pulled By: ejguan fbshipit-source-id: 5da25f7cfaf8ea0803c0b1aacf2badd656799e16	2021-06-29 09:30:37 -07:00
Rong Rong (AI Infra)	510334f34b	[BE] clean up IS_PYTORCH_CI and IN_CI (#60279 ) Summary: `IS_PYTORCH_CI` and `IN_CI` are used randomly, however in some cases IN_CI is not currently set because it only exist in .circleci/scripts/setup_ci_environment.sh. This cleans up the 2 flags and only use IN_CI Pull Request resolved: https://github.com/pytorch/pytorch/pull/60279 Test Plan: CI Reviewed By: seemethere Differential Revision: D29239545 Pulled By: walterddr fbshipit-source-id: a069424a2bb8790a3adfdaf0dc460301026bf8c7	2021-06-20 19:45:07 -07:00
TJ-coding	7c29ca7f2b	Fix Subset of a Subset not sliceable issue (#59513 ) Summary: Dataset can be indexed by a list, but a list can not be indexed by a list. This gives error when slicing a Subset initialised with a Subset, instead of a dataset. Fixed the issue by changing the indices to a Tensor which can be indexed by a list. Fixes https://github.com/pytorch/pytorch/issues/59512 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59513 Reviewed By: zou3519 Differential Revision: D29196891 Pulled By: ejguan fbshipit-source-id: ccde6e474fbcbddd2e9c7c107bc8b5de1307cdb9	2021-06-18 07:07:34 -07:00
Erjia Guan	3b977a0d28	[DataLoader] Add `generate_state` for NumPy seeding (#56797 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56797 After adding default seeding strategy for NumPy random module within each worker of DataLoader #56488, two concerns are raised: - We dropped the support for NumPy < 1.17 due to `SeedSequence` - In order to support seeding for NumPy < 1.17, how can we provide seed for `numpy.random`? - First option is set the same seed as `random`. But, the problem is a same algorithm is shared between `numpy.random` and `random`. With the same seed, they will have exact same state sequence. Thanks to rkern, we noticed this so-called [bad things](https://github.com/PyTorchLightning/pytorch-lightning/pull/6960#issuecomment-818393659). - Considering most of users do not aware this problem, we can provide a better seed by default for `numpy.random` using same `SeedSequence` algorithm as numpy. This is just a workaround with hard-coded function to generate an array of four int32 as the seed. To better coping with this problem since there are amount of 3rd party libraries not just `NumPy` having random module. We may at the end need to implement a `SeedSequence` within `torch.random` module, then users can `spawn` a new `SeedSequence` for each library. Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D28000619 Pulled By: ejguan fbshipit-source-id: 5701c8124a38ea5ded69eb8eee70f9680877ffa6	2021-04-27 08:14:02 -07:00
Yukio Siraichi	93bf0ae6fc	Remove legacy constructor calls from pytorch codebase. (#54142 ) Summary: Follow up from https://github.com/pytorch/pytorch/issues/53889 Related to https://github.com/pytorch/pytorch/issues/47112 Removing every occurrence of the legacy constructor call present in PyTorch at: - _docs_ - _benchmarks_ - _test_ - _caffe2_ - _CONTRIBUTING.md_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/54142 Reviewed By: ngimel Differential Revision: D27699450 Pulled By: mruberry fbshipit-source-id: 530aa3f5746cc8bc1407d5d51b2bbd8075e30546	2021-04-11 15:45:17 -07:00
Winston Smith	8ed20b3f65	Leak Caffe2 threadpool in child processes right after fork to prevent segfault (#54895 ) Summary: ## Problem summary Fixes https://github.com/pytorch/pytorch/issues/54752 - when the number of threads is more than 3 and at least one `set_num_threads` invocation has taken place before forking child processes by the dataloader, `set_num_threads(1)` in the child process causes a segfault, as during its invocation, the child process is made to handle the data structures of the Caffe2 thread-pool of the parent process, whose data structures it inherits from the parent process (these threads don't exist in the child process, but some of its data structures do, due to the copy-on-write technique used by `fork`). ## Solution malfet [advised](https://github.com/pytorch/pytorch/issues/54752#issuecomment-810315302) & [authored code](https://github.com/pytorch/pytorch/pull/54895#pullrequestreview-625670122) for adding a `pthread_atfork` handler in `pytorch/caffe2/utils/threadpool/pthreadpool-cpp.cc`, that's invoked in the child process right after fork, to leak the Caffe2 thread-pool (the child inherits the thread-pool's data structures from its parent process, but doesn't actually have those threads, since after `fork` , a child process only has one thread). ## Additional changes Added unittest `test_no_segfault` to test for this issue in `test_dataloader.py` Also enabled `test_segfault` (which actually makes sure that segfaults happen in worker processes in a particular case). Pull Request resolved: https://github.com/pytorch/pytorch/pull/54895 Reviewed By: zhangguanheng66 Differential Revision: D27542253 Pulled By: malfet fbshipit-source-id: 10f9c67ce1ff1aa37d3efebf405bd93f7f9d2489	2021-04-03 10:51:20 -07:00
Nikita Shulga	f2689b1e13	Make ideep honor `torch.set_num_thread` changes (#53871 ) Summary: When compiled with OpenMP support `ideep`'s computational_cache would cache max number of OpenMP workers This number could be wrong after `torch.set_num_threads` call, so clean it after the call. Fixes https://github.com/pytorch/pytorch/issues/53565 Pull Request resolved: https://github.com/pytorch/pytorch/pull/53871 Reviewed By: albanD Differential Revision: D27003265 Pulled By: malfet fbshipit-source-id: 1d84c23070eafb3d444e09590d64f97f99ae9d36	2021-03-13 11:20:44 -08:00
Yi Zhang	fd582af06c	enable coverage test for dataloader on Windows (#52550 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/50661 For coverage, The class qualified name is `'SimpleCustomBatch': <class '__mp_main__.SimpleCustomBatch'>` For pytest The class qualified name is `'SimpleCustomBatch': <class 'test_dataloader.SimpleCustomBatch'>` So move the class to one separate file ![image](https://user-images.githubusercontent.com/16190118/108611869-d6b51f80-741d-11eb-908e-be7a64da916d.png) As malfet suggestion, use __import__ to avoid adding new file. Pull Request resolved: https://github.com/pytorch/pytorch/pull/52550 Reviewed By: walterddr Differential Revision: D26754023 Pulled By: malfet fbshipit-source-id: 34b0fbe7336b9303cedc28ec6116ab752a2d3630	2021-03-02 18:40:47 -08:00
Kyle Chen	a9f7ae5357	[ROCm] Enable test cases in test/test_dataloader.py for ROCm (#52766 ) Summary: Enabling test cases in test_dataloader.py for ROCm because they are passing now. Signed-off-by: Kyle Chen <kylechen@amd.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/52766 Reviewed By: H-Huang Differential Revision: D26706402 Pulled By: ngimel fbshipit-source-id: 63d4ea6d9b16f6244eb0f0f8f7a957bac8469111	2021-03-01 13:32:35 -08:00
Erjia Guan	89b1053413	[DataLoader] Move BufferedShuffle from Dataset to DataPipe (#52141 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52141 Remove BufferShuffleDataSet, as it's not being used anywhere within PyTorch (no usage on Github based on a search) and it's not included in the release of PyTorch 1.7.1. Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D26710940 Pulled By: ejguan fbshipit-source-id: 90023b4bfb105d6aa392753082100f9181ecebd0	2021-03-01 12:54:44 -08:00
Chester Liu	58eb23378f	Clean up usage of torch._six partially (#49785 ) Summary: See https://github.com/pytorch/pytorch/issues/42919 Pull Request resolved: https://github.com/pytorch/pytorch/pull/49785 Reviewed By: mruberry Differential Revision: D25963833 Pulled By: bugra fbshipit-source-id: 11c90d6b8d3f206c9d0a4d8621b773beb10c6ba2	2021-02-08 13:58:34 -08:00
Tongzhou Wang	54ce171f16	Fix persistent_workers + pin_memory (#48543 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/48370 https://github.com/pytorch/pytorch/issues/47445 cc emcastillo who authored the original functionality. Pull Request resolved: https://github.com/pytorch/pytorch/pull/48543 Reviewed By: bdhirsh Differential Revision: D25277474 Pulled By: ejguan fbshipit-source-id: 1967002124fb0fff57caca8982bc7df359a059a2	2021-01-08 07:04:10 -08:00
Hugo van Kemenade	473e78c0fa	Remove redundant code for unsupported Python versions (#49486 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486 Remove code for Python 3.5 and lower. There's more that can be removed/modernised, but sticking mainly to redundant version checks here, to keep the diff/PR smaller. Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579 Reviewed By: zou3519 Differential Revision: D24453571 Pulled By: ezyang fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e	2021-01-06 12:45:46 -08:00
Pritam Damania	21c38e1799	Additional validation for DistributedSampler. (#48865 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48865 If DistributedSampler was provided an invalid rank (ex: https://discuss.pytorch.org/t/distributed-datasets-on-multi-machines/105113), it failed with a cryptic assertion failure. To fix this issue, I've added an additional check to DistributedSampler to validate we provide a valid rank. ghstack-source-id: 117906769 Test Plan: 1) waitforbuildbot 2) Unit test added. Reviewed By: malfet Differential Revision: D25344945 fbshipit-source-id: 7685e00c8b2c200efbd2949fb32ee32ea7232a08	2020-12-11 17:22:22 -08:00
SsnL	4abca9067b	Fix dataloader hang with large sampler (#48669 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/48666 Pull Request resolved: https://github.com/pytorch/pytorch/pull/48669 Reviewed By: zhangguanheng66 Differential Revision: D25255763 Pulled By: VitalyFedyunin fbshipit-source-id: d06421f52bb1d00cdf8025f1a2ba0d1f9284731a	2020-12-02 09:07:30 -08:00
lixinyu	67b7e751e6	add warning if DataLoader is going to create excessive number of thread (#46867 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/46867 Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D24545540 Pulled By: glaringlee fbshipit-source-id: a3bef0d417e535b8ec0bb33f39cfa2308aadfff0	2020-10-30 07:54:23 -07:00
Nikita Shulga	f363a2e106	Mark top 3 slowest tests as slow (#46068 ) Summary: `TCPStoreTest.test_numkeys_delkeys` takes 5+ min (mostly in idle wait for socket timeout) `TestDataLoader.test_proper_exit` and `TestDataLoaderPersistentWorkers.test_proper_exit` take 2.5 min each `TestXNNPACKConv1dTransformPass.test_conv1d_with_relu_fc` takes 2 min to finish Add option to skip reporting test classes that run for less than a second to `print_test_stats.py` and speed up `TestTorchDeviceTypeCUDA.test_matmul_45724_cuda` Pull Request resolved: https://github.com/pytorch/pytorch/pull/46068 Reviewed By: mruberry Differential Revision: D24208660 Pulled By: malfet fbshipit-source-id: 780e0d8be4f0cf69ea28de79e423291a1f3349b7	2020-10-08 21:10:03 -07:00
Erjia Guan	96540e918c	Add ShuffleDataset with buffer (#45290 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/45290 Test Plan: Imported from OSS Reviewed By: gchanan Differential Revision: D24001084 Pulled By: erjia-guan fbshipit-source-id: d8a7455cf3f18e1f8c1edc53c42c1a99c8573c51	2020-09-30 07:58:15 -07:00
Akihiro Nitta	84949672bf	Fix exception chaining in `test/` (#44193 ) Summary: ## Motivation This PR fixes https://github.com/pytorch/pytorch/issues/43770 and is the continuation of https://github.com/pytorch/pytorch/issues/43836. ## Description of the change This PR fixes exception chaining only in files under `test/` where appropriate. To fix exception chaining, I used either: 1. `raise new_exception from old_exception` where `new_exception` itself seems not descriptive enough to debug or `old_exception` delivers valuable information. 2. `raise new_exception from None` where raising both of `new_exception` and `old_exception` seems a bit noisy and redundant. ## List of lines containing `raise` in `except` clause: I wrote [this simple script](https://gist.github.com/akihironitta/4223c1b32404b36c1b349d70c4c93b4d) using [ast](https://docs.python.org/3.8/library/ast.html#module-ast) to list lines where `raise`ing in `except` clause. - [x] `f8f35fddd4/test/test_cpp_extensions_aot.py (L16)` - [x] `f8f35fddd4/test/test_jit.py (L2503)` - [x] `f8f35fddd4/test/onnx/model_defs/word_language_model.py (L22)` - [x] `f8f35fddd4/test/onnx/verify.py (L73)` - [x] `f8f35fddd4/test/onnx/verify.py (L110)` - [x] `f8f35fddd4/test/onnx/test_verify.py (L31)` - [x] `f8f35fddd4/test/distributed/test_c10d.py (L255)` - [x] `f8f35fddd4/test/distributed/test_c10d.py (L2992)` - [x] `f8f35fddd4/test/distributed/test_c10d.py (L3025)` - [x] `f8f35fddd4/test/distributed/test_c10d.py (L3712)` - [x] `f8f35fddd4/test/distributed/test_distributed.py (L3180)` - [x] `f8f35fddd4/test/distributed/test_distributed.py (L3198)` - [x] `f8f35fddd4/test/distributed/test_data_parallel.py (L752)` - [x] `f8f35fddd4/test/distributed/test_data_parallel.py (L776)` - [x] `f8f35fddd4/test/test_type_hints.py (L151)` - [x] `f8f35fddd4/test/test_jit_fuser.py (L771)` - [x] `f8f35fddd4/test/test_jit_fuser.py (L773)` - [x] `f8f35fddd4/test/test_dispatch.py (L105)` - [x] `f8f35fddd4/test/test_distributions.py (L4738)` - [x] `f8f35fddd4/test/test_nn.py (L9824)` - [x] `f8f35fddd4/test/test_namedtensor.py (L843)` - [x] `f8f35fddd4/test/test_jit_fuser_te.py (L875)` - [x] `f8f35fddd4/test/test_jit_fuser_te.py (L877)` - [x] `f8f35fddd4/test/test_dataloader.py (L31)` - [x] `f8f35fddd4/test/test_dataloader.py (L43)` - [x] `f8f35fddd4/test/test_dataloader.py (L365)` - [x] `f8f35fddd4/test/test_dataloader.py (L391)` Pull Request resolved: https://github.com/pytorch/pytorch/pull/44193 Reviewed By: albanD Differential Revision: D23681529 Pulled By: malfet fbshipit-source-id: 7c2256ff17334625081137b35baeb816c1e53e0b	2020-09-14 14:20:16 -07:00
Emilio Castillo	5472426b9f	Reset `DataLoader` workers instead of creating new ones (#35795 ) Summary: This PR needs discussion as it changes the behavior of `DataLoader`. It can be closed if its not considered a good practice. Currently, the `DataLoader` spawns a new `_BaseDataLoaderIter` object every epoch, In the case of the multiprocess DataLoader, every epoch the worker processes are re-created and they make a copy of the original `Dataset` object. If users want to cache data or do some tracking on their datasets, all their data will be wiped out every epoch. Notice that this doesn't happen when the number of workers is 0. giving some inconsistencies with the multiprocess and serial data loaders. This PR keeps the `_BaseDataLoaderIter` object alive and just resets it within epochs, so the workers remain active and so their own `Dataset` objects. People seem to file issues about this often. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35795 Reviewed By: ailzhang Differential Revision: D23426612 Pulled By: VitalyFedyunin fbshipit-source-id: e16950036bae35548cd0cfa78faa06b6c232a2ea	2020-09-01 11:48:00 -07:00
Nikita Shulga	2b70f82737	fix typo in test_dataloader test_multiprocessing_contexts (take 2) (#43588 ) Summary: 2nd attempt to land https://github.com/pytorch/pytorch/pull/43343 Pull Request resolved: https://github.com/pytorch/pytorch/pull/43588 Reviewed By: seemethere Differential Revision: D23332284 Pulled By: malfet fbshipit-source-id: d78faf468c56af2f176dbdd2ce4bd51f0b5df6fd	2020-08-25 21:11:53 -07:00
George Guanheng Zhang	9420c773d0	Revert D23299452: [pytorch][PR] fix typo in test_dataloader test_multiprocessing_contexts Test Plan: revert-hammer Differential Revision: D23299452 (`6a2d7a05c4`) Original commit changeset: 9489c48b83bc fbshipit-source-id: e8c15d338dd89d8e92f3710e9cf149149bd2e763	2020-08-25 12:34:49 -07:00
Jeff Daily	6a2d7a05c4	fix typo in test_dataloader test_multiprocessing_contexts (#43343 ) Summary: https://github.com/pytorch/pytorch/issues/22990 added a multiprocessing_context argument to DataLoader, but a typo in the test causes the wrong DataLoader class to be used. Pull Request resolved: https://github.com/pytorch/pytorch/pull/43343 Reviewed By: glaringlee Differential Revision: D23299452 Pulled By: malfet fbshipit-source-id: 9489c48b83bce36f46d350cad902f7ad96e1eec4	2020-08-25 09:36:56 -07:00
yl-to	1b55e2b043	add prefetch_factor for multiprocessing prefetching process (#41130 ) Summary: fix https://github.com/pytorch/pytorch/issues/40604 Add parameter to Dataloader to configure the per-worker prefetch number. Before this edit, the prefetch process always prefetch 2 * num_workers data items, this commit help us make this configurable, e.x. you can specify to prefetch 10 * num_workers data items. Pull Request resolved: https://github.com/pytorch/pytorch/pull/41130 Reviewed By: izdeby Differential Revision: D22705288 Pulled By: albanD fbshipit-source-id: 2c483fce409735fef1351eb5aa0b033f8e596561	2020-07-24 08:38:13 -07:00
Daiming Yang	ad7133d3c1	Patch for #40026 RandomSampler generates samples one at a time when replacement=True (#41682 ) Summary: Fix https://github.com/pytorch/pytorch/issues/32530 Fix/Patch https://github.com/pytorch/pytorch/pull/40026 Resubmit this patch and fix the type error. Force the input type to `manual_seed()` in `sampler.py` to be `int`. ezyang Pull Request resolved: https://github.com/pytorch/pytorch/pull/41682 Reviewed By: izdeby Differential Revision: D22665477 Pulled By: ezyang fbshipit-source-id: 1725c8aa742c31e74321f20448f4b6a392afb38d	2020-07-22 13:45:09 -07:00
Shen Li	86590f226e	Revert D22519869: [pytorch][PR] RandomSampler generates samples one at a time when replacement=True Test Plan: revert-hammer Differential Revision: D22519869 (`09647e1287`) Original commit changeset: be6585002586 fbshipit-source-id: 31ca5ceb24dd0b291f46f427a6f30f1037252a5d	2020-07-16 12:59:10 -07:00
Daiming Yang	09647e1287	RandomSampler generates samples one at a time when replacement=True (#40026 ) Summary: Fix https://github.com/pytorch/pytorch/issues/32530 I used the next() function to generate samples one at a time. To compensate replacement=False, I added a variable called "sample_list" to RandomSampler for random permutation. cc SsnL Pull Request resolved: https://github.com/pytorch/pytorch/pull/40026 Reviewed By: zhangguanheng66 Differential Revision: D22519869 Pulled By: ezyang fbshipit-source-id: be65850025864d659a713b3bc461b25d6d0048a2	2020-07-16 11:42:32 -07:00
Wojciech Baranowski	0e09511af9	type annotations for dataloader, dataset, sampler (#39392 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/38913 Pull Request resolved: https://github.com/pytorch/pytorch/pull/39392 Reviewed By: anjali411 Differential Revision: D22102489 Pulled By: zou3519 fbshipit-source-id: acb68d9521145f0b047214d62b5bdc5a0d1b9be4	2020-07-07 07:16:18 -07:00
X Wang	063d5b0d3f	Remove get_fail_msg in test_dataloader.test_proper_exit (#40745 ) Summary: Close https://github.com/pytorch/pytorch/issues/40744 Pull Request resolved: https://github.com/pytorch/pytorch/pull/40745 Reviewed By: ezyang Differential Revision: D22308972 Pulled By: colesbury fbshipit-source-id: 4b4847e6b926b2614c8b14f17a9db3b0376baabe	2020-07-06 07:48:32 -07:00
Linyuan Gong	0a75234934	Allow np.memmap objects (numpy arrays based on files) to be processed… (#39847 ) Summary: Allow np.memmap objects to be processed by default_collate np.memmap objects has the same behavior as numpy arrays, and the only difference is that they are stored in a binary file on the disk. However, the default_collate function used by PyTorch DataLoader only accepts np.array, and rejects np.memmap by type checking. This commit allows np.memmap objects to be processed by default_collate. In this way, users can use in-disk large arrays with PyTorch DataLoader. Pull Request resolved: https://github.com/pytorch/pytorch/pull/39847 Reviewed By: ezyang Differential Revision: D22284650 Pulled By: zou3519 fbshipit-source-id: 003e3208a2afd1afc2e4640df14b3446201e00b4	2020-06-30 15:00:20 -07:00
Tongzhou Wang	23db54acdf	[DataLoader] add repr for WorkerInfo (#39975 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/39975 Differential Revision: D22039414 Pulled By: ezyang fbshipit-source-id: 230f68a91fca901bce652fdf88ba88167f39b978	2020-06-16 08:19:32 -07:00
ShawnZhong	c8c53c802e	Add `generator=` kwarg for DataLoader & random samplers (#39737 ) Summary: Fix https://github.com/pytorch/pytorch/issues/39572 Add `generator=` kwarg for DataLoader & random samplers cc: SsnL, deeppatel4557, albanD, mitar Pull Request resolved: https://github.com/pytorch/pytorch/pull/39737 Differential Revision: D22019132 Pulled By: albanD fbshipit-source-id: 835e08b86c5396bc0b0e41057661306b15394d6e	2020-06-15 07:01:20 -07:00
Daiming Yang	0b90b9cdd3	Allow shuffle when auto-batching disabled in DataLoader (#39865 ) Summary: Fix https://github.com/pytorch/pytorch/issues/35761 cc SsnL Note: closed the other PR for this new branch. Pull Request resolved: https://github.com/pytorch/pytorch/pull/39865 Differential Revision: D22003612 Pulled By: ezyang fbshipit-source-id: 26aecd1b298fe99d3924f4c8157cd6cae2561c7c	2020-06-11 15:17:46 -07:00
Hong Xu	283a3ff16d	The exception raised when RandomSampler.replacement is non-boolean should be TypeError (#36547 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36547 Differential Revision: D21818752 Pulled By: ezyang fbshipit-source-id: 7502a24a0df134c44ac72959ba992777c873f8e9	2020-06-02 06:54:02 -07:00
Donna Choi	3d2fce6bc3	Change len(DataLoader) for IterableDataset (#38925 ) Summary: Fix https://github.com/pytorch/pytorch/issues/36176 One-liner change to ensure that ```len(loader) == (len(dataset) // batch_size)``` for IterableDataset. Pull Request resolved: https://github.com/pytorch/pytorch/pull/38925 Differential Revision: D21731587 Pulled By: ezyang fbshipit-source-id: 59a086165a004c0c1c8a1ee0776b1444bd26de23	2020-05-27 11:56:41 -07:00
Donna Choi	8c07a98adc	Error out of default_collate for lists of unequal size (#38492 ) Summary: Fix issue https://github.com/pytorch/pytorch/issues/23141# In the below example ```default_collate``` collates each element of the list. Since the second element isn't present in all samples, it is discarded: ``` from torch.utils.data import Dataset from torch.utils.data import DataLoader import numpy as np class CustomDataset(Dataset): def __len__(self): return 2 def __getitem__(self, idx): tmp = { "foo": np.array([1, 2, 3]), "bar": ["X"] * (idx+1), } return tmp training = CustomDataset() for batch in DataLoader(training, batch_size=2): print(batch) ``` Yields ``` { 'foo': tensor( [ [1, 2, 3], [1, 2, 3] ] ), 'bar': [ ('X', 'X'), ] } ``` Based on discussion in the issue, it seems the best course of action is to error out in this case. This seems consistent with what is done for tensor elements, as seen in [TensorShape.cpp line 1066](https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/TensorShape.cpp#L1060) which is called when ```torch.stack``` is called. In this PR, I introduce a similar message to error out for lists. SsnL Pull Request resolved: https://github.com/pytorch/pytorch/pull/38492 Differential Revision: D21620396 Pulled By: ezyang fbshipit-source-id: 17f59fbb1ed1f0d9b2185c95b9ebe55ece701b0c	2020-05-18 14:53:33 -07:00
SsnL	b5868b2833	Relax sampler check in BatchSampler (#38403 ) Summary: Since the check was added in https://github.com/pytorch/pytorch/pull/6249, one can not pass an iterable as a sampler to the data loader anymore, which was a very handy feature (e.g., https://github.com/pytorch/pytorch/issues/1337). I think the check should be removed for two-fold reasons: 1. It is too strict. There is no reason that it should not be a general iterable. 2. It is inconsistent. In `DataLoader` (the main place where people use samplers), you can pass a general iterable as `batch_sampler` but not `sampler` due to this check. Pull Request resolved: https://github.com/pytorch/pytorch/pull/38403 Differential Revision: D21555958 Pulled By: soumith fbshipit-source-id: c7267bb99a31edd8f2750689205d6edc5dab5cff	2020-05-13 22:24:29 -07:00
Vitaly Fedyunin	57d01be92b	Replacing assertEqual with assertEqualIgnoreType wherever types missmatch (#38102 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/38102 Test Plan: Imported from OSS Differential Revision: D21477060 Pulled By: VitalyFedyunin fbshipit-source-id: 25e0fd837ca9bfccf0ce994c80f7790c894096d4	2020-05-09 14:48:55 -07:00
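
Note on commit 84949672bf (Fix exception chaining in `test/`): the two forms that message describes, `raise new_exception from old_exception` and `raise new_exception from None`, can be illustrated with a minimal standard-library sketch. This is not code from the PR; `ConfigError`, `parse_port`, `lookup`, and `catch` are hypothetical names invented for the illustration.

```python
class ConfigError(Exception):
    """Hypothetical high-level error, used only for this illustration."""


def parse_port(text):
    # Form 1: keep the original exception as the explicit cause, because
    # the low-level ValueError carries useful debugging information.
    try:
        return int(text)
    except ValueError as err:
        raise ConfigError(f"invalid port: {text!r}") from err


def lookup(table, key):
    # Form 2: hide the original exception from the traceback, because
    # the KeyError would only repeat what the new message already says.
    try:
        return table[key]
    except KeyError:
        raise ConfigError(f"unknown setting: {key!r}") from None


def catch(func, *args):
    """Call func(*args) and return the ConfigError it raises, if any."""
    try:
        func(*args)
    except ConfigError as exc:
        return exc
    return None


chained = catch(parse_port, "abc")
suppressed = catch(lookup, {}, "timeout")

# `from err` records the original exception in __cause__.
print(type(chained.__cause__).__name__)
# `from None` leaves __cause__ empty; the KeyError is still reachable via
# __context__, but __suppress_context__ keeps it out of the traceback.
print(suppressed.__cause__, suppressed.__suppress_context__)
```

With the first form the traceback shows both exceptions joined by "The above exception was the direct cause of the following exception"; with the second, only the new exception is printed.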

256 Commits