Commit Graph

228 Commits

Author SHA1 Message Date
AlexanderRadionov
831780390c Fixed non-deterministic preprocessing in DataLoader (#4640)
Added an ind_worker_queue parameter to data.DataLoader. It makes preprocessing deterministic.

DataLoader in multiprocessing mode can produce non-deterministic results. Even if the random seed is frozen, each subprocess may receive its tasks in an unstable order, because I/O times vary while data loads. If you apply augmentation during loading, this makes results unreproducible. See https://discuss.pytorch.org/t/deterministic-non-deterministic-results-with-pytorch/9087

To fix this issue I have added an individual queue for each worker, so each worker receives its tasks in a stable order and therefore produces stable results.

To reproduce the issue, change ind_worker_queue to False and run the script several times.
The reproduction script is in the corresponding PR.
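
A minimal sketch of the mechanism, with hypothetical helper names rather than the PR's actual code: the main process round-robins batches into one queue per worker, so each worker always sees its tasks in the same order no matter how long individual loads take.

    import multiprocessing

    def make_worker_queues(num_workers):
        # One dedicated task queue per worker instead of a single shared one.
        return [multiprocessing.Queue() for _ in range(num_workers)]

    def dispatch(batches, queues):
        # Round-robin: batch i always goes to worker i % num_workers, so the
        # order each worker sees is fixed regardless of I/O timing.
        for i, batch in enumerate(batches):
            queues[i % len(queues)].put((i, batch))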

* TestIndividualWorkerQueue added to DataLoader tests

* Review fixes

* "Simplify" code by removing itertools

* Rebase conflicts fix

* Review fixes

* Fixed shutdown behavior

* Removed ind_worker_queue flag.

* Rebase on master

* Disable tests that use DataLoader with multiple workers (#5322)
2018-03-23 17:43:59 -04:00
Will Feng
0340e46f9b Disable tests that use DataLoader with multiple workers (#5322) 2018-02-21 09:20:37 -05:00
Tongzhou Wang
964707e9b5 Temporarily disable test_segfault until we figure out why it intermittently fails on CUDA CI workers (#4976) 2018-01-31 19:04:44 -05:00
Tongzhou Wang
64a9ecae02 Dataloader issues (#4643)
* EINTR and kill by loader fix

* Addressed @apaszke's comments

* Remove EINTR handling and check that we are in the main thread before setting the SIGCHLD handler
2018-01-29 01:18:17 +01:00
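
A hedged sketch of the main-thread guard described in the last bullet above, assuming Python 3 (installing a signal handler outside the main thread raises ValueError, hence the check):

    import signal
    import threading

    def set_sigchld_handler(handler):
        # signal.signal() may only be called from the main thread, and
        # SIGCHLD only exists on Unix platforms.
        if threading.current_thread() is threading.main_thread():
            signal.signal(signal.SIGCHLD, handler)
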
peterjc123
2dd7039b6b Fix multiprocessing and dataloader tests on Windows (#4453) 2018-01-06 17:41:36 +01:00
Tongzhou Wang
cc9dc3f343 Add lock for SynchronizedSeedDataset; add an additional OS-level close of stderr for tests that launch a failing process (#4463) 2018-01-03 22:45:05 -05:00
Alykhan Tejani
18a866aedd Add random_split to torch.utils.data.dataset (#4435) 2018-01-02 18:56:49 +01:00
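
For reference, typical usage of the new random_split API (the dataset and split sizes here are illustrative):

    import torch
    from torch.utils.data import TensorDataset, random_split

    dataset = TensorDataset(torch.randn(100, 3), torch.zeros(100))
    # The lengths must sum to len(dataset); indices are assigned randomly.
    train_set, val_set = random_split(dataset, [80, 20])
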
Will Feng
1681d07199 Disable tests and fix issues with Windows CUDA build (#4251) 2017-12-20 11:30:21 +01:00
Tongzhou Wang
5cc26c0c90 Add default PyTorch seeding and worker_init_fn to DataLoader (#4018)
* Add default PyTorch seeding and worker_init_fn to DataLoader

* generate seed using current RNG each time

* worker_seed <- main_proc_RNG_generated_seed + worker_id
2017-12-18 02:19:08 -05:00
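
In practice each worker's torch RNG is pre-seeded per the scheme above, and worker_init_fn is the hook for propagating that seed to other libraries; a sketch (seeding numpy is an illustrative assumption, not part of the commit):

    import numpy as np
    import torch
    from torch.utils.data import DataLoader, TensorDataset

    def seed_numpy(worker_id):
        # In a worker, torch.initial_seed() returns the seed derived as
        # main_proc_RNG_generated_seed + worker_id; reuse it for numpy.
        np.random.seed(torch.initial_seed() % 2 ** 32)

    dataset = TensorDataset(torch.randn(8, 3), torch.zeros(8))
    loader = DataLoader(dataset, num_workers=2, worker_init_fn=seed_numpy)
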
Will Feng
db446d69ca Fix issues with Windows 7 & 10 CPU build (#4065) 2017-12-15 10:14:43 +01:00
SsnL
1661370ac5 Signal handling in DataLoader workers; Timeout option (#3474) 2017-11-29 23:52:14 +01:00
Richard Zou
e579ae75b5 Fix error when default_collate is passed a collection of numpy.str_ (#3404)
* Fix error when default_collate is passed a collection of numpy.str_

* Error if default_collate input is a nested ndarray containing non-numbers
2017-11-08 10:02:08 -05:00
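
For context on the commit above: default_collate batches numbers into tensors but passes strings through as-is, which is the path this fix hardens. A small illustration (not code from the PR):

    from torch.utils.data.dataloader import default_collate

    batch = [{'label': 'cat', 'value': 1.0}, {'label': 'dog', 'value': 2.0}]
    out = default_collate(batch)
    # Numbers are batched into a tensor: out['value'] has shape (2,).
    # Strings are passed through untouched: out['label'] == ['cat', 'dog'].
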
Tzu-Wei Huang
618026e999 implements operator + for Dataset class (#3180)
* implements operator + for Dataset class

* Check for exact equivalence
2017-10-29 01:19:59 +05:30
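
The operator added above is essentially sugar over ConcatDataset; roughly (a sketch of the idea, not the exact diff):

    from torch.utils.data.dataset import ConcatDataset

    class Dataset(object):
        # __getitem__ and __len__ omitted; only the new operator is shown.
        def __add__(self, other):
            # d1 + d2 chains the two datasets end to end.
            return ConcatDataset([self, other])
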
Valentin Haenel
d592e188f7 port of ConcatDataset (#1902) 2017-06-27 12:31:56 -04:00
Sam Gross
f09027bc29 Add batch sampler to DataLoader (#1867) 2017-06-22 20:18:31 +02:00
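
A quick illustration of what a batch sampler yields (assuming a sequential sampler over ten items):

    from torch.utils.data.sampler import BatchSampler, SequentialSampler

    # A batch sampler yields lists of indices rather than single indices.
    batches = list(BatchSampler(SequentialSampler(range(10)),
                                batch_size=3, drop_last=False))
    # batches == [[0, 1, 2], [3, 4, 5], [6, 7, 8], [9]]
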
Isac Arnekvist
156fe28666 dataloader can now handle growing datasets (#1575) 2017-05-17 19:23:15 -04:00
Sasank Chilamkurthy
94b147fd41 Allow dict batches in the DataLoader. (#1354)
* Allow dicts in DataLoader

* Use collections.Sequence instead of collections.Iterable in dataloader
2017-04-28 19:14:52 +02:00
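
After the change above, a Dataset may return dicts and the loader collates each key independently; a sketch with a hypothetical dataset:

    import torch
    from torch.utils.data import DataLoader, Dataset

    class DictDataset(Dataset):  # hypothetical example dataset
        def __len__(self):
            return 8

        def __getitem__(self, i):
            return {'x': torch.randn(3), 'y': i}

    for batch in DataLoader(DictDataset(), batch_size=4):
        pass  # batch['x'] has shape (4, 3); batch['y'] holds 4 labels
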
Adam Paszke
605b3c86ce Retain the type of numpy scalars in collate_fn 2017-04-11 14:48:54 -07:00
Eli Stevens
e216f557fd Fixes issue returning strings from a DataLoader with pin_memory=True (#908) 2017-03-13 10:11:07 +01:00
Adam Paszke
7ea6ae57c8 Support numpy arrays in default_collate 2017-02-20 23:28:31 -08:00
zhtvk
4d37ef878c Remove view on data and target tensors of dim 1 in TensorDataset (#609) 2017-02-09 22:06:39 +01:00
Luke Yeager
e7c1e6a8e3 [pep8] Fix most lint automatically with autopep8
Here's the command I used to invoke autopep8 (in parallel!):

    git ls-files | grep '\.py$' | xargs -n1 -P`nproc` autopep8 -i

Several rules are ignored in setup.cfg. The goal is to let autopep8
handle everything which it can handle safely, and to disable any rules
which are tricky or controversial to address. We may want to come back
and re-enable some of these rules later, but I'm trying to make this
patch as safe as possible.

Also configures flake8 to match pep8's behavior.

Also configures TravisCI to check the whole project for lint.
2017-01-28 01:15:51 +01:00
Adam Paszke
a1fa995044 Fixes and improvements (#593)
* Fix error in ELU backward

* Add --seed flag for tests

* Add test for BatchNorm eval

* Fix autograd.backward docs

* Support cc flags in cuDNN search

* Fix IndexSelect backward formula
2017-01-25 22:21:49 -05:00
Sam Gross
ac8a5e7f0d Remove error message assertion (#480)
Depending on how PyTorch is compiled, the source code for DataLoader
might not be fully available, which can cause a spurious error in
test_dataloader.py
2017-01-18 13:16:38 -05:00
Sergey Zagoruyko
89d930335b fix tests for GPU-less setup (#298) 2016-12-12 10:56:57 +01:00
Sam Gross
be3276fcdd Account for batch_size in DataLoader.__len__() (#277) 2016-12-02 01:21:36 -05:00
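
That is, len(loader) counts batches rather than samples; a sketch of the semantics, assuming Python 3 (with drop_last, where supported, the division rounds down instead):

    import math

    # The loader's length is the number of batches, not the number of samples.
    def loader_len(dataset_len, batch_size):
        return math.ceil(dataset_len / batch_size)  # e.g. 10 samples, batch 3 -> 4
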
Sam Gross
aea6ba4bcd Support pinned memory in the DataLoader (#265)
DataLoader now supports the constructor argument 'pin_memory'. When set
to True, tensors in the sample are copied to pinned memory. This happens
in a background thread when num_workers > 1.
2016-11-29 12:35:03 -05:00
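
Typical usage of the option described above (pinning requires CUDA to be available; the dataset is illustrative):

    import torch
    from torch.utils.data import DataLoader, TensorDataset

    dataset = TensorDataset(torch.randn(64, 3), torch.zeros(64))
    loader = DataLoader(dataset, batch_size=8, pin_memory=True)
    for x, y in loader:
        # On a CUDA machine the batch arrives in page-locked memory,
        # making subsequent host-to-GPU copies faster.
        break
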
Sam Gross
6db721b5dd Make DataLoader preserve the ordering of the dataset (#135) 2016-10-21 23:54:16 -04:00