pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Aapo Kyrola	ad62e82179	fast simple-net memonger for C++ Summary: To be used with predictor "online": C++ version of memonger for simple nets. Very simple greedy algorithm. Works well at least on Resnet-50 inference graph: only 3 shared blobs are used. Next I will integrate this with predictor and run canary (separate diff). Reviewed By: asaadaldien Differential Revision: D5375392 fbshipit-source-id: d36e419e39a32e568e105657c27fb00c85a2535d	2017-07-06 15:17:07 -07:00
Luke Yeager	fe9b0bfd27	Fix some typos Summary: Closes https://github.com/caffe2/caffe2/pull/882 Differential Revision: D5341277 Pulled By: harouwu fbshipit-source-id: bb5595c65c05ca7ea1a1d060d61d14fbfe008241	2017-06-28 13:50:48 -07:00
Alexander Sidorov	c8410859d9	Operator python stacktraces, attempt 2 Summary: Last time I used uuid filled into OperatorDef. And operator_tracebacks was populated using traceback.extract_stack. There were several issues with this approach: 1. A random field in OperatorDef breaks workflows relying on memoization, i.e. when computation is skipped based on already computed result before. 2. Adding one more field revealed RNNs being non forward compatible wrt to new fields in there. prototxt format seems to not allow forward compatibility (thanks jamesr66a for the investigation!). For RNNs we need to swtich them to a more resilient approach. azzolini's proposed change to OperatorDef / NetDef would allow that by just nesting NetDef dirrectly inside OperatorDef without need for extra serialization. 3. traceback.extract_stack is very slow when executable is on a remote filesystem. It does one or more os.stat for each frame on the stack. For some cases it ended up being up to 15 extra minutes on model construction. In this diff I use a different approach which should fix all those problems above. 1.2. are solved by not adding a new field at all. Instead I report operator idx wrt to a net it runs in. Thanks akyrola and dzhulgakov for the idea. Downside here is that operator list manipulation breaks the logic and separately created ops are not covered at all. 3. I solved this by operating on raw frames without using traceback and inspect modules which end up doing a lot of file system calls. See function extract_stacktace in core.py with additional comments. Reviewed By: dzhulgakov Differential Revision: D5286285 fbshipit-source-id: 626dd0f5f6b8b1d86bd6bf519078b122f43ddcaa	2017-06-25 19:32:58 -07:00
Thomas Dudziak	342de07231	Core unit test fixes for Python 3 Summary: As title Differential Revision: D5291327 fbshipit-source-id: 7dd9279c53ba55d3422c31973ffcec5705787fdf	2017-06-23 13:22:16 -07:00
Alisson Gusatti Azzolini	7d482742fd	Allow tasks/execution_steps to be cloned at runtime Summary: Advantages of cloning the tasks/execution_steps at runtime: - Less complexity on the python side: no need to clone nets and add prefixes to blob names - Faster start-up: we had cases of complex plans that took up to 30min to be created. - Better isolation: each task cloned at runtime has its own child workspace, preventing false sharing of blobs. - Opens up possibility for dynamic scheduling: Number of threads per task can be increased on the fly, at runtime. Reviewed By: dzhulgakov Differential Revision: D5100730 fbshipit-source-id: 71b83193b135da4e6eaf2536d8fc266528e1fdcc	2017-06-20 22:32:07 -07:00
Alexander Sidorov	83e6a0bec8	Revert uuid change to OperatorDef protobuf Summary: a few issues: 1. Randomization hurts memoization 1. Even if we make it non random, then we can get key colisions when loading it back. 2. RNNs use prototxt for step net and apparently its not forward compatible like normal protobuf is I am thinking of a better less invasive solution now. Reviewed By: jamesr66a Differential Revision: D5272118 fbshipit-source-id: ab577fad04fbfc632e1fceffa923377a0d3da1be	2017-06-19 16:47:31 -07:00
Alexander Sidorov	eebda50b79	Operator python traceback Summary: This is going to show a python Caffe2 user where a failed operator was created. Motivation for having this information not right in protobuf is to avoid having it too verboose and keep ability to read protobufs of a net after a simple print() call. Reviewed By: jamesr66a Differential Revision: D5226047 fbshipit-source-id: 7edfe850e05a2ec209577142aa3368664a57a108	2017-06-13 18:50:02 -07:00
Alisson Gusatti Azzolini	d3ec6e8f55	Run python op builder at op creation time Summary: This allows to construct a python op by passing a pickled "builder function call" as an argument to the op. The builder function is called at PythonOp construction time and returns a function that will be called when the op is run. This way we allow to drop the dependency on 'tokens', which didn't work properly for protobufs that get distributed to other processes. Now, the PythonOp definition is self-contained: as long as the build dependencies are right, sharding the protobuf is enough to execute the net remotely. Reviewed By: dzhulgakov Differential Revision: D5080833 fbshipit-source-id: a5deaca5d3143024cdb121519689224e9dbec5ce	2017-06-13 16:29:22 -07:00
Thomas Dudziak	c7f5bf282b	Revert py::bytes -> std::string Summary: As title Reviewed By: salexspb Differential Revision: D5229338 fbshipit-source-id: 3bc9442c76061436db8f3217c1ba8edfd9581f8b	2017-06-12 14:11:37 -07:00
Yiming Wu	8cd208ad6f	Infer input and output device from OperatorDef through OperatorSchema Summary: Infer input and output device from OperatorDef through OperatorSchema. This is inspired by shape inference. With this feature, we can easily analysis device information for all blobs in the net in a generic way. It is really helpful for auto cross device execution. Reviewed By: akyrola, dzhulgakov Differential Revision: D5161065 fbshipit-source-id: ee656123112171a4ca00f2fb3f6940f32ddf3135	2017-06-05 23:47:33 -07:00
Ross Girshick	8e99824ce7	Allow subsets of gradient outputs / inputs in Python ops Summary: I'm using Python ops in a project and need corresponding Python gradient ops. For my use case, only a subset of the forward op outputs have gradients and only a subset of forward op inputs have gradients. However the current implementation of `GetPythonGradient` forces all grad inputs and outputs to exist. This diff allows one to specify that only a subset of grad inputs / outputs are used when constructing the Python op. I'm not sure if this is up to caffe2 standards, so please push back on style and content as needed. Reviewed By: dzhulgakov Differential Revision: D4897004 fbshipit-source-id: 96fffe8634c51a49b6bce7339a46c6235f7d4bbd	2017-06-05 12:52:01 -07:00
Thomas Dudziak	3ccbf23132	String-related fixes for Python 3 Summary: This diff is one step towards enabling python 3 build by making it be more diligent in its handling of strings. Reviewed By: salexspb Differential Revision: D4893083 fbshipit-source-id: 28b8adf3280e8d1f0a7dc9b0fee5ad53f2fada57	2017-05-26 16:04:32 -07:00
Dmytro Dzhulgakov	35eaf444c0	Quickly hack sparsenn_benchmarks to also do BenchmarkNet Summary: Makes benchmark a bit hacky, but it's a benchmark after all :) Specifically ports functionality of proper BenchmarkNet run from the ads_benchmarks so that we can see training net perf. Also adds --report_interval parameter to print stats more often when running in hogwild mode kdub0 - hopefully if you have time you can integrate it properly with the Flow's workflow harouwu -shouldn't conflict too much with your current diff Reviewed By: rayleichen Differential Revision: D5125183 fbshipit-source-id: 9c6f1663bc85e26d6609f0f2f23aa280731939db	2017-05-26 10:48:45 -07:00
Aapo Kyrola	658c337f41	Error status for Gloo ops, and handling in elastic dpm Summary: Add a RandomFailureOp and handling to elastic data parallel model of the status code Reviewed By: andrewwdye Differential Revision: D5065936 fbshipit-source-id: 24224f9ea414ee535c9e90cc28add5189354b0ef	2017-05-17 00:16:52 -07:00
Alisson Gusatti Azzolini	75bc9f5e77	Relax requirement on token uniqueness Summary: Relax requirement on token uniqueness since a few use cases broke after the uniqueness requirement was added in a previous diff. Reviewed By: kittipatv Differential Revision: D5034132 fbshipit-source-id: 327eb065923e6ea152a360324316f81b7fb9564b	2017-05-09 19:36:00 -07:00
Alisson Gusatti Azzolini	bd8ed6641c	Stabilize PythonOp token name Summary: For distributed jobs, we were relying on the order the PythonOps were registered, which was very fragile. Reviewed By: dzhulgakov Differential Revision: D5016847 fbshipit-source-id: f5601467c5b0569d5e8a0efdd76abad0d703c5f5	2017-05-09 11:19:44 -07:00
Aapo Kyrola	5c52392229	opsify AccumulateInputGradients Summary: Part of project to make all gradient accumulation business ops in RecurrentNetworkGradientOp, this makes the accumulateInputGradients ops. Also added way to mark operators private so they don't appear in docs. Reviewed By: salexspb Differential Revision: D5006698 fbshipit-source-id: 226d7afb473290c8d0f936d2cc87640be3e06615	2017-05-05 09:13:39 -07:00
Yangqing Jia	cf317d1106	create_net: explicitly specify if one wants to overwrite the network. Summary: This is from discussion with dzhulgakov : as a step towards revisiting the core.Net autonaming, we will first guard against accidental overwrites of existing networks in the workspace. ajtulloch since we are doing Predictors in mobile, this should be safe right? azzolini - I assume this would be safe, but would love to get your approval. akyrola - would this hurt xray? Reviewed By: dzhulgakov Differential Revision: D4897725 fbshipit-source-id: aa41271927ad6671f07a53b9505283623f8c49e5	2017-04-17 21:46:53 -07:00
Dongsheng Fang	3c0dc06ac8	Add __builtin_cpu_supports function def in windows Summary: Closes https://github.com/caffe2/caffe2/pull/253 Differential Revision: D4892628 Pulled By: Yangqing fbshipit-source-id: 45d49121027454d9259c4a753438d8f0771cf042	2017-04-14 19:46:19 -07:00
Yangqing Jia	ca0c8e5b25	remove import_array() help and use import_array1 Summary: TSIA. See https://github.com/numpy/numpy/blob/master/numpy/core/code_generators/generate_numpy_api.py Reviewed By: jamorton Differential Revision: D4893002 fbshipit-source-id: 4b6bee1bdf8ae905e4c0952a3e8bbbacd4129a50	2017-04-14 19:46:19 -07:00
Fei Sun	e2323ad688	Add CAFFE_ENFORCE to protobuf parsing Summary: Add CAFFE_ENFORCE to make sure the protobuf parsing is successful. Reviewed By: salexspb Differential Revision: D4843662 fbshipit-source-id: 20cab7180e6b0e5afb5e29ff3333591659e41f7a	2017-04-06 14:34:30 -07:00
Fei Sun	95657ea1e8	Protobuf is binary string. Use bytes instead. Summary: Prepare for the Protobuf change. Reviewed By: dzhulgakov Differential Revision: D4784884 fbshipit-source-id: 86219eecefaf7637e70339437c9274c526ebd6fe	2017-03-28 19:03:23 -07:00
Alexander Sidorov	56f324d191	Added predictor bindings to python interface Summary: from caffe2.python import workspace; p = workspace.Predictor(init_net, predict_net); outputs = p.run(inputs) Reviewed By: Yangqing Differential Revision: D4576793 fbshipit-source-id: b829bbcaf2e7c34dad85024177433207bd96a234	2017-03-15 11:17:54 -07:00
Kittipat Virochsiri	f0d78753ae	Make ModelExporter.load_from_db() load to specific workspace Summary: In case of distributed task, load_from_db() loads to wrong workspace (when used inside a Python op). Passing which workspace to use explicitly so that it loads to the one Python op is being run. Reviewed By: kennyhorror Differential Revision: D4653692 fbshipit-source-id: 94585c012b05ee38b9ce5e8ef0efdd50aa41dd2b	2017-03-08 09:31:42 -08:00
Zachary Mirman	1c92e85dae	Added editDistance helper to caffe2 operators Summary: Added editDistance helper to caffe2 operators Differential Revision: D4622152 fbshipit-source-id: 4d6246b8226c1283d5883edfaa27e8f7748fdc4c	2017-02-28 13:31:56 -08:00
Yangqing Jia	47b65b6d8d	Add a create your own dataset tutorial Summary: bwasti - will follow up via email. Closes https://github.com/caffe2/caffe2/pull/166 Differential Revision: D4596858 Pulled By: Yangqing fbshipit-source-id: 6d088ccf1604e0dc9b94cbf0a75b51587e734d95	2017-02-22 03:31:47 -08:00
Yangqing Jia	8ca1b3baea	import_array python3 compatibility Summary: TSIA Reviewed By: salexspb Differential Revision: D4535571 fbshipit-source-id: 61ce724d4fc3c79fac551e8622a2d45cda67f80a	2017-02-09 10:08:13 -08:00
Andrew Dye	306fde233a	Accept optional blob map for InferShapesAndTypes Summary: Shape inference allows Caffe2 to compute shapes of blobs without running a model. Update InferShapesAndTypes() to accept an optional blob:dimensions map so that external input blobs do not need to be part of the workspace. InferShapesAndTypes() in workspace.py conditionally calls the ...from_workspace or ...from_map bindings. Note I favored a small amount of code duplication here for the sake of readability. InferShapesAndTypes() in operator.cc has been refactored into mirrored entry points, invoking a common helper. Other minor changes to address linter warnings. Reviewed By: dzhulgakov Differential Revision: D4524873 fbshipit-source-id: 56f863b759c016d7f23523f06fda3aa5bba22357	2017-02-08 15:04:24 -08:00
Aapo Kyrola	6a03641cde	Add num_iters to RunNet() Summary: Running RunNet() in python in a loop can be a performance issue if the python code is doing a lot of other processing, such as data input, because python's Global Interpreter lock (GIL) will prevent the RunNet() to be called. This can easily be fixed by making RunNet() run multiple iterations inside the C++ land. (Another way to accomplish the same thing is to use Caffe2's "execution plans", but that requires more setup). + fixed timing reporting in my OC workflow + improved one error log in data_workers.py Sorry for piggypagging those small changes, but landing diffs currently is slow... Reviewed By: rpenggithub Differential Revision: D4523575 fbshipit-source-id: 039a647576efad5dd9afda74df478ac22b43c103	2017-02-07 14:16:14 -08:00
Aapo Kyrola	dcefc74a0c	Shape and Type Inference Part1 Summary: This is a bit large diff, sorry about it. It includes basic shape and type inference functionality, based on YQ's Schema scaffolding. I added some helper functions to make it easier to write simple translations. Bigger refactoring was needed for ConvPoolBase so that we could use the shape inference already there in the schema. I annotated enough operators to be able to infer forward-pass of shapes for basic convnet, and added test for that. I intend to bootcamp some annotations and annotate enough to handle Resnets fully. Need to think about gradients, if they could be annotated in an easier way. Only shapes are now exposed to Python, types will follow later. Also the inference is not called yet anywhere but unit test. Also I am not sure if everything is in the best location in the code, but shouldn't be hard to move stuff around. Reviewed By: dzhulgakov Differential Revision: D4436818 fbshipit-source-id: eebee5937ccc9ac09c245465302388a1fae6933c	2017-02-02 22:29:22 -08:00
Yangqing Jia	8553bd3f68	Ensure we are not using Eigen LGPL code, and build on raspbian. Summary: Turns out that building on raspbian is easy as a cake for caffe2 - cmake is awesome. Closes https://github.com/caffe2/caffe2/pull/112 Differential Revision: D4480985 Pulled By: Yangqing fbshipit-source-id: 5dbe5e1e71d8680dea7a5ec8a9ce7fbe6aa5270a	2017-01-30 09:44:27 -08:00
Fei Sun	cc65cc64c8	Create function ParseProtobufFromLargeString to parse strings more than 64MB Summary: Replace ParseFromString with ParseProtobufFromLargeString to get around the limitation of the 64MB limit. Reviewed By: Yangqing Differential Revision: D4466226 fbshipit-source-id: b68a6efc76955db294ddb0d23bbaf03b69e4952a	2017-01-27 10:29:22 -08:00
Dmytro Dzhulgakov	864f561525	Make BlobDeserialization throw exceptions instead of returning bool Summary: Makes it much nicer to spot errors, especially in iPython notebook. Reviewed By: kennyhorror Differential Revision: D4465726 fbshipit-source-id: c0adaf5168248a70987ff9d5dfce54a622ff2219	2017-01-26 09:44:19 -08:00
Ahmed Taei	9ad10959ee	Enable large PlanDef protobuf message. Summary: Enable cases where PlanDef message is bigger than protobuf string decoding limits. Differential Revision: D4412736 fbshipit-source-id: 91ee02d7a8ab85b1c8169683a6c1dccd4c79be40	2017-01-13 09:29:29 -08:00
Bram Wasti	737000b166	Linter fix up to sync fbsource and github	2017-01-06 15:36:17 -08:00
Bram Wasti	3833dad5f6	manual sync of old never sync'd files	2017-01-06 15:28:45 -08:00
Yangqing Jia	5bfd6c4cd1	semicolon	2017-01-04 14:36:16 -08:00
Yangqing Jia	311ae2ba33	build file fix and avx2 on mac fix	2017-01-04 14:35:15 -08:00
bwasti	9ce23cbb71	Fix false positive for non-clang compilers.	2016-12-29 11:39:50 -08:00
Bram Wasti	b48f1ff810	OS X build	2016-12-29 12:25:53 -05:00
Dmytro Dzhulgakov	119b687994	Allow PythonOp to access the workspace Summary: DPER has very strange python ops that play with Workspace - they are somewhat similar to LoadOp/SaveOp, so I guess the semantics is fine. Thus it makes sense to allow python operators to receive workspace pointer similarly to regular Operators. I didn't figure out a better way to implement optional argument than just checking the number of args function receives on python side. Reviewed By: ajtulloch Differential Revision: D4242943 fbshipit-source-id: d97d4227815b741c8f884cfe254b06d2b56b5a41	2016-12-05 11:53:26 -08:00
Yangqing Jia	0e298ec399	Expose MKLMemory to the Python Feed and Fetch interface, and misc changes Summary: This is #2 of a series of changes. It did the following: (1) a few refactor of the MKL memory interface (2) an initial MKLContext to deal with MKL specific computations (3) Provide MKLMemory access in Python with the blob feeder/fetcher registration. Reviewed By: dzhulgakov Differential Revision: D4210123 fbshipit-source-id: adea1f1ffbd0b9ffdd55092676468c16bec08992	2016-11-29 15:18:36 -08:00
Yangqing Jia	589398950f	fbsync at f5a877	2016-11-18 15:41:06 -08:00
Yangqing Jia	238ceab825	fbsync. TODO: check if build files need update.	2016-11-15 00:00:46 -08:00
Yangqing Jia	d1e9215184	fbsync	2016-10-07 13:08:53 -07:00
Yangqing Jia	3d54e7b40e	fbsync: changes to implement operator schema	2016-09-08 18:07:01 -07:00
Yangqing Jia	b23e51d467	chunky sync	2016-09-06 15:55:19 -07:00

47 Commits