Summary:
The goals of this diff are (both sketched after this list):
1) Enable checkpointing to honor batches_per_epoch
2) Resume hive_readers mid-split
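A minimal standalone sketch of both behaviors (plain Python; HiveSplitReader, its fields, and read_epoch are hypothetical names, not the actual Caffe2 implementation):

    class HiveSplitReader:
        """Reads batches from a list of splits and can resume mid-split."""

        def __init__(self, splits, batches_per_epoch):
            self.splits = splits              # e.g. a list of lists of batches
            self.batches_per_epoch = batches_per_epoch
            self.split_idx = 0                # which split we are in
            self.offset = 0                   # position inside that split

        def read_epoch(self):
            """Yield at most batches_per_epoch batches per epoch."""
            for _ in range(self.batches_per_epoch):
                if self.split_idx >= len(self.splits):
                    return                    # out of data
                yield self.splits[self.split_idx][self.offset]
                self.offset += 1
                if self.offset == len(self.splits[self.split_idx]):
                    self.split_idx += 1       # advance to the next split
                    self.offset = 0

        # Checkpoint state includes the mid-split offset, so a restored
        # job resumes exactly where the previous epoch stopped.
        def state(self):
            return {"split_idx": self.split_idx, "offset": self.offset}

        def load_state(self, s):
            self.split_idx, self.offset = s["split_idx"], s["offset"]

Because the checkpoint captures (split_idx, offset) rather than just a split index, a restart does not re-read the partially consumed split.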
Reviewed By: azzolini
Differential Revision: D5004212
fbshipit-source-id: 2ff5df30ba946eefadd109d80056cde67398a080
Summary:
Previously, we didn't propagate the 'out-of-data' signal when splits_per_epoch wasn't specified.
For now it's a hacky fix (we just reuse ReaderWithLimit). azzolini - any suggestions for a more elegant solution? I could create an extra reader that just exports an 'is empty' signal (sketched below).
Overall, I guess we need to turn global_queue into a more sustainable unittest that verifies all possible combinations - I'm still not sure it's correct :-\
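For reference, the 'extra reader' alternative could look roughly like this (plain-Python sketch; EmptySignalReader and data_finished are hypothetical names, and the actual fix reuses ReaderWithLimit instead):

    class EmptySignalReader:
        """Pass-through reader whose only job is to export an 'is empty'
        signal when the wrapped reader runs out of data."""

        def __init__(self, reader):
            self.reader = reader
            self.data_finished = False  # the exported out-of-data signal

        def read(self):
            batch = self.reader.read()  # delegate unchanged
            if batch is None:
                self.data_finished = True  # propagate 'out of data' upstream
            return batch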
Reviewed By: xianjiec
Differential Revision: D4665677
fbshipit-source-id: fe44d10ee82c3383145635e67dea1d9b666e061f
Summary: This makes sure dper_example is compatible with the new way of defining checkpoint epochs. See D4499320.
Reviewed By: xianjiec
Differential Revision: D4511618
fbshipit-source-id: f5188010cdefe3739f87f6049d1ea6aee765c514
Summary: For customers like Ads, Feeds, and MarketPlace, the training data is extremely large, and it is unnecessary and costly to scan all of it just to compute meta information. This diff adds a numSample option to preCompute so that users can control how many samples are used when computing meta information.
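The idea behind numSample, as a rough standalone sketch (the function name, signature, and the particular meta information computed here are illustrative, not the real preCompute code):

    import itertools

    def compute_meta(rows, num_samples=None):
        """Compute meta information (here: per-feature min/max) over at
        most num_samples rows instead of scanning the full dataset."""
        meta = {}
        for row in itertools.islice(rows, num_samples):  # None = all rows
            for feature, value in row.items():
                lo, hi = meta.get(feature, (value, value))
                meta[feature] = (min(lo, value), max(hi, value))
        return meta

    # E.g. cap the scan at the first 10000 rows of a huge stream:
    # meta = compute_meta(stream_of_rows, num_samples=10000)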
Differential Revision: D4492399
fbshipit-source-id: 7199381d226ee6300a959fc5e116d39984d199fc
(1) nccl submodule, cnmem submodule
(2) mpi ops fallback test
(3) a bit more blob interface
(4) fixed tests
(5) caffe2.python.io -> caffe2.python.dataio to avoid name conflicts (see the import example after this list)
(6) In the build system, autogenerate __init__.py instead of having manual rules just to copy over an empty __init__.py.
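For (5), call sites only change the module name; a before/after sketch of a typical import (assuming the module contents are otherwise unchanged):

    # Before the rename (could collide with Python's built-in io module):
    #   from caffe2.python import io
    # After:
    from caffe2.python import dataio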