Summary:
`pin_memory` has an optional `device` parameter to specify which device to pin memory for. Even with that change, the DataLoader works only for the CUDA backend. To add
support for other backends that provide pinned memory, the
DataLoader is updated with `device` as an optional parameter as well.
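A minimal usage sketch of the idea (assuming the option is exposed on the DataLoader as `pin_memory_device`, as in recent PyTorch releases, and a CUDA-capable build):
```py
import torch
from torch.utils.data import DataLoader, TensorDataset

dataset = TensorDataset(torch.arange(16, dtype=torch.float32))
loader = DataLoader(dataset, batch_size=4, pin_memory=True,
                    pin_memory_device="cuda")  # pin for the CUDA backend

for (batch,) in loader:
    assert batch.is_pinned()  # batches come out in pinned (page-locked) memory
```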
Fixes #{issue number}
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65402
Reviewed By: zou3519
Differential Revision: D32282204
Pulled By: VitalyFedyunin
fbshipit-source-id: e2e09876969af108d0db38af7c2d1b2f1cfa9858
(cherry picked from commit 3b76e151964fce442e27fe8fb5c37af930da4fa1)
Without this, `DataLoader2` will just add a `Shuffler` to the end of the datapipe if `shuffle=True`:
```py
from torch.utils.data.dataloader_experimental import DataLoader2
from torchdata.datapipes.iter import IterableWrapper, IterDataPipe, Shuffler
class Sorter(IterDataPipe):
    def __init__(self, datapipe):
        self.datapipe = datapipe

    def __iter__(self):
        return iter(sorted(self.datapipe))
data = list(range(1000))
dp = IterableWrapper(data)
dp = Shuffler(dp).set_shuffle(False)
dp = Sorter(dp)
dl2 = DataLoader2(dp, shuffle=True, batch_size=None)
assert list(dl2) == data # fails unless you hit a lucky random seed
```
This example is somewhat nonsensical, but it demonstrates that we cannot simply append a `Shuffler` to the end of the datapipe.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/75014
Approved by: https://github.com/ejguan
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/72896
Fixing the issue described here: https://github.com/pytorch/data/issues/214
There will be a follow-up PR in TorchData as well
Test Plan: Imported from OSS
Reviewed By: gchanan
Differential Revision: D34258669
Pulled By: NivekT
fbshipit-source-id: 6dd88250ed14ebe779915dc46139be7e012e9d1b
(cherry picked from commit 025b8ed98019e576bfef04c33a3f33ed1a426a66)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71579
Fixes #1551
As the comment in the code explains, register a function to terminate the persistent workers.
Keeping a reference to these workers in `atexit` prevents the Python interpreter from killing the persistent worker processes before the `pin_memory_thread` exits.
If the user explicitly shuts down the DataLoader iterator first, the function registered in `atexit` becomes a no-op.
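The general pattern looks roughly like the sketch below; names such as `_persistent_workers` are illustrative, not the actual DataLoader internals:
```py
import atexit

# Hypothetical registry of persistent worker processes kept alive across epochs.
_persistent_workers = []

def _register_persistent_worker(worker):
    _persistent_workers.append(worker)

def _terminate_persistent_workers():
    # No-op if the iterator already shut the workers down explicitly.
    for w in _persistent_workers:
        if w.is_alive():
            w.terminate()
            w.join()

# Registering the hook keeps a reference alive until interpreter shutdown,
# so the workers are not reaped before the pin_memory thread finishes.
atexit.register(_terminate_persistent_workers)
```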
Test Plan: Imported from OSS
Reviewed By: VitalyFedyunin
Differential Revision: D33896537
Pulled By: ejguan
fbshipit-source-id: 36b57eac7523d8aa180180c2b61fc693ea4638ae
(cherry picked from commit 05add2ae0f)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/71579
Fixes #1551
As the comment in the code explains, register a function to terminate the persistent workers, using `atexit` to make sure termination always happens at the very end (after `pin_memory_thread` exits).
We need such a mechanism because in some rare cases the Python interpreter cleans up the worker processes before the DataLoader iterator.
Test Plan: Imported from OSS
Reviewed By: VitalyFedyunin
Differential Revision: D33694867
Pulled By: ejguan
fbshipit-source-id: 0847f4d424a0cd6b3c0be8235d505415970254e8
(cherry picked from commit 18ad4621af)
Summary:
`default_collate`, `default_convert`, and `pin_memory` convert sequences into lists. I believe they should keep the original type when possible (e.g., I have a class that inherits from `list`, comes from a 3rd-party library that I can't change, and provides extra functionality).
Note that this is easy to do when the type can be constructed from an iterable, but that's not always the case (e.g., `range`).
Even though this can be accomplished with a custom `default_collate`/`default_convert`, 1) this is behavior they should support out of the box IMHO, and 2) `pin_memory` would still convert to a list.
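A small sketch of the motivating case; `TaggedList` is hypothetical and stands in for the 3rd-party list subclass:
```py
import torch
from torch.utils.data.dataloader import default_collate

class TaggedList(list):
    """Hypothetical 3rd-party list subclass with extra functionality."""

batch = [
    TaggedList([torch.tensor(1), torch.tensor(2)]),
    TaggedList([torch.tensor(3), torch.tensor(4)]),
]

out = default_collate(batch)
# Previously the container came back as a plain `list`; the intent here is to
# preserve the original sequence type when it can be rebuilt from an iterable.
print(type(out))
```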
cc VitalyFedyunin ejguan NivekT
Pull Request resolved: https://github.com/pytorch/pytorch/pull/68779
Reviewed By: wenleix
Differential Revision: D32651129
Pulled By: ejguan
fbshipit-source-id: 17c390934bacc0e4ead060469cf15dde815550b4
Summary:
This test occasionally deadlocks while waiting for the child process to report its result.
The test is small, so the entire run should never take more than 1-2 seconds, but to be on the safe side set the timeout to 5 seconds.
Somewhat mitigates https://github.com/pytorch/pytorch/issues/65727
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65742
Reviewed By: janeyx99, ejguan
Differential Revision: D31235116
Pulled By: malfet
fbshipit-source-id: 0cdd2f7295f6f9fcefee954a14352e18fae20696
Summary:
The setaffinity test always fails when the number of CPUs is smaller
than 3. Changed the test to depend dynamically on the number of CPUs
available on the system.
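For reference, a minimal sketch of adapting an affinity check to the host (Linux-only, using the standard `os.sched_getaffinity`/`os.sched_setaffinity` calls; not the test's exact code):
```py
import os

available = os.sched_getaffinity(0)              # CPUs this process may run on
subset = set(list(available)[: max(1, len(available) - 1)])
os.sched_setaffinity(0, subset)                  # pin to a subset that exists here
assert os.sched_getaffinity(0) == subset
```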
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65042
Reviewed By: jbschlosser
Differential Revision: D30960554
Pulled By: ejguan
fbshipit-source-id: 55ac12714b4b0964b48c3617b79a7a345d40ebce
Summary:
`IS_PYTORCH_CI` and `IN_CI` are used inconsistently; in some cases `IN_CI` is not set at all because it is only exported in .circleci/scripts/setup_ci_environment.sh. This cleans up the two flags and uses only `IN_CI`.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/60279
Test Plan: CI
Reviewed By: seemethere
Differential Revision: D29239545
Pulled By: walterddr
fbshipit-source-id: a069424a2bb8790a3adfdaf0dc460301026bf8c7
Summary:
A Dataset can be indexed by a list, but a list cannot be indexed by a list. This raises an error when slicing a Subset that was initialised with another Subset instead of a Dataset.
Fixed the issue by converting the indices to a Tensor, which can be indexed by a list.
Fixes https://github.com/pytorch/pytorch/issues/59512
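A minimal reproduction of the pre-fix failure (using a `TensorDataset` purely for illustration):
```py
import torch
from torch.utils.data import TensorDataset, Subset

ds = TensorDataset(torch.arange(10))
inner = Subset(ds, [2, 3, 4, 5, 6])   # Subset of a Dataset: fine
outer = Subset(inner, [0, 1, 2, 3])   # Subset of a Subset

# Slicing `outer` hands a list of indices to `inner`, which previously tried to
# index its own plain-list `indices` with that list and raised a TypeError.
print(outer[1:3])
```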
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59513
Reviewed By: zou3519
Differential Revision: D29196891
Pulled By: ejguan
fbshipit-source-id: ccde6e474fbcbddd2e9c7c107bc8b5de1307cdb9
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56797
After adding a default seeding strategy for the NumPy random module within each worker of DataLoader #56488, two concerns were raised:
- We dropped support for NumPy < 1.17 due to `SeedSequence`.
- In order to support seeding for NumPy < 1.17, how can we provide a seed for `numpy.random`?
  - The first option is to set the same seed as `random`. The problem is that the same algorithm is shared between `numpy.random` and `random`, so with the same seed they produce the exact same state sequence. Thanks to rkern, we noticed this so-called [bad thing](https://github.com/PyTorchLightning/pytorch-lightning/pull/6960#issuecomment-818393659).
  - Considering that most users are not aware of this problem, we can provide a better default seed for `numpy.random` using the same `SeedSequence` algorithm as NumPy. This is just a workaround, with a hard-coded function that generates an array of four int32 values as the seed.
To better cope with this problem (a number of 3rd-party libraries, not just NumPy, have their own random module), we may eventually need to implement a `SeedSequence` within the `torch.random` module, so users can `spawn` a new `SeedSequence` for each library.
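For users who want explicit control regardless of the default strategy, a minimal sketch of per-worker NumPy seeding via `worker_init_fn` (the function name is illustrative):
```py
import numpy as np
import torch
from torch.utils.data import DataLoader, TensorDataset

def seed_numpy_worker(worker_id):
    # Derive a distinct per-worker seed from the worker's torch seed.
    np.random.seed(torch.initial_seed() % 2**32)

dataset = TensorDataset(torch.arange(8))
loader = DataLoader(dataset, num_workers=2, worker_init_fn=seed_numpy_worker)
```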
Test Plan: Imported from OSS
Reviewed By: H-Huang
Differential Revision: D28000619
Pulled By: ejguan
fbshipit-source-id: 5701c8124a38ea5ded69eb8eee70f9680877ffa6
Summary:
## Problem summary
Fixes https://github.com/pytorch/pytorch/issues/54752 - when the number of threads is more than 3 and at least one `set_num_threads` invocation has taken place before the dataloader forks its child processes, `set_num_threads(1)` in a child process causes a segfault. During that invocation, the child process touches the data structures of the parent process's Caffe2 thread-pool, which it inherits via the copy-on-write semantics of `fork`; the threads themselves don't exist in the child process, but some of their data structures do.
## Solution
malfet [advised](https://github.com/pytorch/pytorch/issues/54752#issuecomment-810315302) & [authored code](https://github.com/pytorch/pytorch/pull/54895#pullrequestreview-625670122) for adding a `pthread_atfork` handler in `pytorch/caffe2/utils/threadpool/pthreadpool-cpp.cc`, that's invoked in the child process right after fork, to leak the Caffe2 thread-pool (the child inherits the thread-pool's data structures from its parent process, but doesn't actually have those threads, since after `fork` , a child process only has one thread).
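By way of analogy only (the actual fix is a `pthread_atfork` handler in C++), a sketch of the same idea using Python's at-fork hook, with a hypothetical `_pool` handle:
```py
import os

# Hypothetical stand-in for a thread pool created in the parent process.
_pool = object()

def _leak_parent_pool():
    # "Leak" the inherited pool in the child: drop the reference and never
    # try to join/destroy threads that only ever existed in the parent.
    global _pool
    _pool = None

os.register_at_fork(after_in_child=_leak_parent_pool)
```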
## Additional changes
Added a unittest, `test_no_segfault`, to `test_dataloader.py` to test for this issue.
Also enabled `test_segfault` (which actually makes sure that segfaults happen in worker processes in a particular case).
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54895
Reviewed By: zhangguanheng66
Differential Revision: D27542253
Pulled By: malfet
fbshipit-source-id: 10f9c67ce1ff1aa37d3efebf405bd93f7f9d2489
Summary:
When compiled with OpenMP support, `ideep`'s `computational_cache` caches the max number of OpenMP workers.
This number can become stale after a `torch.set_num_threads` call, so clear the cache after the call.
Fixes https://github.com/pytorch/pytorch/issues/53565
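A sketch of the user-visible behavior this protects (illustrative only; the sizes and ops are arbitrary):
```py
import torch

torch.set_num_threads(4)
x = torch.randn(256, 256)
y = x @ x                      # uses up to 4 OpenMP threads

torch.set_num_threads(2)       # with the cache cleared, this now takes effect
z = x @ x                      # subsequent MKL-DNN/ideep work uses 2 threads
print(torch.get_num_threads())
```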
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53871
Reviewed By: albanD
Differential Revision: D27003265
Pulled By: malfet
fbshipit-source-id: 1d84c23070eafb3d444e09590d64f97f99ae9d36
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52141
Remove BufferShuffleDataSet, as it's not being used anywhere within PyTorch (no usage on Github based on a search) and it's not included in the release of PyTorch 1.7.1.
Test Plan: Imported from OSS
Reviewed By: H-Huang
Differential Revision: D26710940
Pulled By: ejguan
fbshipit-source-id: 90023b4bfb105d6aa392753082100f9181ecebd0
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/49486
Remove code for Python 3.5 and lower.
There's more that can be removed/modernised, but sticking mainly to redundant version checks here, to keep the diff/PR smaller.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46579
Reviewed By: zou3519
Differential Revision: D24453571
Pulled By: ezyang
fbshipit-source-id: c2cfcf05d6c5f65df64d89c331692c9aec09248e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/48865
If DistributedSampler was provided an invalid rank (ex:
https://discuss.pytorch.org/t/distributed-datasets-on-multi-machines/105113),
it failed with a cryptic assertion failure.
To fix this issue, I've added an additional check to DistributedSampler to
validate that the provided rank is within the valid range.
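A short sketch of the intended behavior (the dataset and error type shown are illustrative):
```py
import torch
from torch.utils.data import TensorDataset
from torch.utils.data.distributed import DistributedSampler

ds = TensorDataset(torch.arange(8))

# num_replicas=2 only allows ranks 0 and 1, so an out-of-range rank should
# fail fast with a clear error instead of a cryptic assertion later on.
try:
    DistributedSampler(ds, num_replicas=2, rank=5)
except ValueError as e:
    print(e)
```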
ghstack-source-id: 117906769
Test Plan:
1) waitforbuildbot
2) Unit test added.
Reviewed By: malfet
Differential Revision: D25344945
fbshipit-source-id: 7685e00c8b2c200efbd2949fb32ee32ea7232a08
Summary:
`TCPStoreTest.test_numkeys_delkeys` takes 5+ min (mostly idle, waiting for a socket timeout)
`TestDataLoader.test_proper_exit` and `TestDataLoaderPersistentWorkers.test_proper_exit` take 2.5 min each
`TestXNNPACKConv1dTransformPass.test_conv1d_with_relu_fc` takes 2 min to finish
Add an option to `print_test_stats.py` to skip reporting test classes that run for less than a second, and speed up `TestTorchDeviceTypeCUDA.test_matmul_45724_cuda`
Pull Request resolved: https://github.com/pytorch/pytorch/pull/46068
Reviewed By: mruberry
Differential Revision: D24208660
Pulled By: malfet
fbshipit-source-id: 780e0d8be4f0cf69ea28de79e423291a1f3349b7