Summary: We are about to start our API migration process. Before that, I want to make sure people don't add new CNNModelHelper instances to our open-source code, so I am adding a deprecation warning here in advance.
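Illustrative only -- roughly the kind of warning being added (the exact message and mechanism in this diff may differ):
import warnings

warnings.warn(
    "CNNModelHelper is deprecated; new code should use ModelHelper and brew instead.",
    DeprecationWarning,
)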
Reviewed By: salexspb
Differential Revision: D5093556
fbshipit-source-id: 74bf4a7782c2d882f72f202d48c72255d152b68a
Summary: Based on our discussion, we want an arg_map in ModelHelper and an arg_scope for that model within brew. This diff implements that.
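A minimal usage sketch of what this enables (argument values and the helper call are illustrative, not necessarily the final API):
from caffe2.python import brew, model_helper

model = model_helper.ModelHelper(
    name="example",
    arg_scope={"order": "NCHW", "use_cudnn": True},  # kept in the model's arg_map
)
# brew helpers can then pick these arguments up from the model's scope instead
# of requiring them on every call:
brew.conv(model, "data", "conv1", dim_in=3, dim_out=16, kernel=5)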
Reviewed By: salexspb
Differential Revision: D5042983
fbshipit-source-id: ddd2c7e9bca1be2f08a32f7252b44d3b60a57996
Summary:
cuDNN versions of dropout and LRN (for native fp16 support), port of Caffe's max pooling algo that uses an explicit mask to store locations (also supports fp16 storage)
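A small numpy sketch of the explicit-mask idea (1-D, single channel, non-overlapping windows for brevity; the actual op is a CUDA port working on NCHW tensors):
import numpy as np

def maxpool_forward(x, k):
    n = len(x) // k
    y = np.empty(n, dtype=x.dtype)
    mask = np.empty(n, dtype=np.int64)        # stores the winning input location
    for i in range(n):
        window = x[i * k:(i + 1) * k]
        mask[i] = i * k + int(np.argmax(window))
        y[i] = x[mask[i]]
    return y, mask

def maxpool_backward(dy, mask, input_size):
    dx = np.zeros(input_size, dtype=dy.dtype)
    np.add.at(dx, mask, dy)                   # route each gradient to the stored location
    return dx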
Closes https://github.com/caffe2/caffe2/pull/396
Reviewed By: akyrola
Differential Revision: D4990880
Pulled By: asaadaldien
fbshipit-source-id: a716acffb656843e9b31e3e6808bd2d8aa959d03
Summary: Adding a simple video data layer that can read video data from frames or video files and output a 5D tensor. It also supports multiple labels. The current implementation is based on ffmpeg.
Differential Revision: D4801798
fbshipit-source-id: 46448e9c65fb055c2d71855447383a33ade0e444
Summary:
Adding add_weight_decay and image_input to the brew module, and removing `getWeights` and `getBias` from CNNModelHelper.
Searching with fbgs for `useWeights` shows that no one but add_weight_decay is using this function. I checked with the Oculus people; their getWeights is a different function.
kennyhorror, please check whether this is going to affect you :)
Reviewed By: salexspb
Differential Revision: D4945392
fbshipit-source-id: 4ef350fd81dd40a91847e9f3ebc5421eb564df32
Summary:
Rename model_helpers to brew. This is a big diff; I did the following:
1. replace model_helpers with brew:
find . -type f -exec sed -i 's/model_helpers/brew/g' {} +
2. rename model_helpers.py and model_helpers_test.py
3. rename ModelHelpersTest to BrewTest
4. lowercase all the helper functions to distinguish them from single ops (see the sketch after this list)
5. run my unit tests
6. run convergence tests
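For illustration, step 4 in practice (a sketch with placeholder blob names and dims): helpers are now lowercase calls into brew, while capitalized single ops remain direct net calls.
from caffe2.python import brew, model_helper

model = model_helper.ModelHelper(name="example")
brew.fc(model, "data", "fc1", dim_in=256, dim_out=10)  # lowercase helper (was FC before this diff)
model.net.Relu("fc1", "fc1")                           # a plain single op stays as-is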
Reviewed By: salexspb
Differential Revision: D4930465
fbshipit-source-id: f420a1b03238df1cbe9f4426e0b9c43a12119661
Summary:
Rename ModelHelperBase to ModelHelper.
This is the result of running:
find . -type f -exec sed -i 's/ModelHelperBase/ModelHelper/g' {} +
fbgs for ModelHelperBase returned 19 results; there are 20 instances here because I added one test in model_helpers_test.py.
Reviewed By: salexspb
Differential Revision: D4928337
fbshipit-source-id: bc4c12b60b90c167e717de50ea9fe17521e142e3
Summary:
Add conv helpers. The migration of these functions assumes that people should not do:
cnn_model = CNNModelHelper(use_cudnn=True)
cnn_model.Conv(..., use_cudnn=False, ...)
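For contrast, a minimal sketch of the assumed pattern (blob names and dimensions are placeholders): decide on cuDNN once, when the model is created, and don't override it per op.
from caffe2.python.cnn import CNNModelHelper

cnn_model = CNNModelHelper(use_cudnn=True)
cnn_model.Conv("data", "conv1", dim_in=3, dim_out=16, kernel=5)  # inherits use_cudnn=True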
Reviewed By: salexspb
Differential Revision: D4884974
fbshipit-source-id: 12af6e2a5863eba789232cd4a4771f95d05f9227
Summary:
For newly trained models that pass kernels=2*[kernel], running inference with
old code will not work, because the `kernels` argument isn't supported there
and we are no longer passing `kernel`.
Reviewed By: salexspb
Differential Revision: D4888795
fbshipit-source-id: 1649b073c4e1da1d59da9cea581b4dcab6dbaf5c
Summary: Add Algebra and train helpers and proxy them to CNNMH
Reviewed By: salexspb
Differential Revision: D4855040
fbshipit-source-id: d948ea913f674a6e47c4b72629a2d33253cb3130
Summary: Softmax was not in the model helper, so I added it there so that we can set the CUDNN engine, which is the preferred implementation.
Reviewed By: asaadaldien
Differential Revision: D4835624
fbshipit-source-id: 7f0c84b7a73653119901795782709a6a617345c5
Summary:
Uses the cudnnTransformTensor function. It works by shuffling the strides according to the transpose axes. This gives a significant speedup over the current GPU version.
+ moves the transpose test under utility_ops, because hypothesis_test is too big
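For illustration, a rough Python sketch of the stride-shuffling idea above (the actual op sets up cuDNN tensor descriptors in C++; this is not the implementation):
def transpose_descriptors(in_shape, axes):
    # row-major (contiguous) strides of the original input
    strides = [1] * len(in_shape)
    for i in reversed(range(len(in_shape) - 1)):
        strides[i] = strides[i + 1] * in_shape[i + 1]
    out_dims = [in_shape[a] for a in axes]      # dims in transposed order
    src_strides = [strides[a] for a in axes]    # input strides permuted the same way
    # describing the source with (out_dims, src_strides) and the destination with
    # (out_dims, contiguous strides) makes a plain cudnnTransformTensor copy
    # produce the transposed layout
    return out_dims, src_strides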
Reviewed By: jamesr66a
Differential Revision: D4810993
fbshipit-source-id: 82577c4ced1389e70bd5992820ae4d8297a3817f
Summary:
Add a ConvNd interface for N-d convolution and keep Conv for 2-d convolution.
I added _BaseConv to share code between ConvNd and Conv.
Reviewed By: Yangqing
Differential Revision: D4660822
fbshipit-source-id: 8339421351ce9a36ce5a165f7fa455cfcc61733d
Summary: We don't use this one any more, except in a few tests.
Reviewed By: urikz
Differential Revision: D4731401
fbshipit-source-id: c5c28b7594e3251f501fc28455dfc9bd2093a836
Summary: These Python helpers will provide sufficient bookkeeping when adding quantization for conv layers.
Reviewed By: Yangqing
Differential Revision: D4671478
fbshipit-source-id: 292e2f633dd30969c0afbe7a8075b340ce9a6d12
Summary:
If init_params is False, the parameters should not be initialized.
This is particularly important when testing a model that provides values for these BN parameters.
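A minimal sketch of the intended use, assuming a test harness that feeds trained values itself (parameter blob names below are illustrative):
import numpy as np
from caffe2.python import workspace
from caffe2.python.cnn import CNNModelHelper

# with init_params=False the helper references its BN parameters but adds no
# initializer ops for them to param_init_net
model = CNNModelHelper(name="bn_test", init_params=False)
model.SpatialBN("data", "bn", 16, is_test=True)

# the caller supplies the trained parameter values instead, e.g. the scale blob:
workspace.FeedBlob("bn_s", np.ones(16, dtype=np.float32))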
Closes https://github.com/caffe2/caffe2/pull/174
Differential Revision: D4621791
Pulled By: Yangqing
fbshipit-source-id: 518443925990a12c1d5729b0971ebe19ba5d8998
Summary: D4348953 added support for accuracy with top_k > 1, which is only supported on CPU and therefore requires data to be copied off the GPU. But that diff did not take into account that we have a top_k=1 version of AccuracyOp for CUDA. This diff ensures we use the CUDA version when top_k=1.
Differential Revision: D4607767
fbshipit-source-id: 8becda23890343043eb79ad04e4c6196e9010f0c
Summary:
This diff adds the ability to train a multiclass classifier on a sampled subset of classes. It basically implements what is described in https://arxiv.org/abs/1412.2007, but without the sampling-probability correction; since we use uniform sampling, the sampling probabilities cancel out in the softmax anyway.
The trick to make this work is to have two different nets for prediction and training that share parameters. The model is built normally up to the last layer; if sampling is needed, class sampling is applied at the last layer.
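A rough numpy illustration of the general idea (uniform sampling, no probability correction); names and shapes are illustrative, not the actual layer implementation:
import numpy as np

def sampled_softmax_probs(h, W, b, true_class, num_sampled):
    # uniformly sample negative classes and always keep the true class
    num_classes = W.shape[0]
    negatives = np.random.choice(num_classes, size=num_sampled, replace=False)
    classes = np.unique(np.append(negatives, true_class))
    # use only the sampled rows of the last layer's parameters
    logits = h @ W[classes].T + b[classes]
    probs = np.exp(logits - logits.max())
    return classes, probs / probs.sum()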
Reviewed By: xianjiec
Differential Revision: D4512859
fbshipit-source-id: ab537bcac81d5e5877a8795045e8682c8064da68
Summary:
Pass through the h-value recurrent output unchanged at each LSTM step beyond the valid part of a sequence (computed based on seqLengths, allowing batching of sequences of different lengths). This enables using the final-step output of each sequence as the output when one vector is desired for the entire sequence. The gradient is also passed back unchanged.
Also made some cosmetic changes to recurrent_network_test.py (seq_lengths offset corrected, should be in [1, T] rather than [0, T-1]).
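A one-function numpy illustration of the padding rule described above (not the operator code):
import numpy as np

def step_with_padding(h_prev, h_candidate, t, seq_lengths):
    # keep the freshly computed state only while t is inside each sequence's
    # valid range; otherwise copy the previous state through unchanged, so the
    # last time step holds every sequence's final valid output
    valid = (t < seq_lengths)[:, None]          # (batch, 1) mask
    return np.where(valid, h_candidate, h_prev)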
Reviewed By: urikz
Differential Revision: D4540307
fbshipit-source-id: 73a9f6326069d713dcb0cdc8d17869317c6dbe96
Summary:
(Caffe2) Modified the RecurrentNetworkGradient operator so that training is possible with any of the output blob(s) receiving gradient during the backward pass. This is realized through a new argument for the RecurrentNetwork op, outputs_with_grads, which takes a list of the indices of the output blobs that will receive gradient. The default behavior (only the first output blob receives gradient) is unchanged.
A new unit test covers the case where outputs_with_grads = [1, 2], using the Python LSTM wrapper.
Reviewed By: urikz
Differential Revision: D4518516
fbshipit-source-id: 5c531582b20f3cf727d1aa91239b4d5a2b8a7c1f
Summary:
I had forgotten to remove this one. The rest of the switch to indexing
instead of string names is coming after D4446813 lands, as scratches
aren't inputs or outputs and thus can't be indexed.
Reviewed By: urikz
Differential Revision: D4465748
fbshipit-source-id: 2ccbedfb35541ef4a2231d1480eef59025bd5290
Summary: This diff uses stack workspaces in RecurrentNetwork, which allows us to simplify the implementation and get rid of scratches.
Reviewed By: salexspb
Differential Revision: D4446813
fbshipit-source-id: 514eec7e4300bdf492a9cb192b40cf4f89acf656
Summary: Remove usage of recurrent_sizes so that recurrent states' sizes can depend on the input (e.g., the attention matrix for the beam decoder). I removed recurrent_sizes from both the forward and backward steps.
Reviewed By: salexspb
Differential Revision: D4427688
fbshipit-source-id: 580420a294d309c86ec5cb4e677058623b7228e1
Summary:
In this diff I stop passing parameters by name and also remove the hardcoded output ids which were there specifically to make LSTM work. This also allows us to avoid using recurrent_sizes in the backward pass (for the forward pass this was done in D4427688).
Using a similar technique, it should be simple enough to eliminate blob-name passing entirely. Then we can fix scoping. These can be done in a follow-up diff.
Reviewed By: urikz
Differential Revision: D4444614
fbshipit-source-id: 3580a76365502b9f2f09e3d8b7e78084ca739f00
Summary:
Adds a thread pool for image decode, and optional GPU-based data conversion, mean subtraction and std division
Closes https://github.com/caffe2/caffe2/pull/56
Reviewed By: Yangqing
Differential Revision: D4341326
Pulled By: bwasti
fbshipit-source-id: 6485616ea7d212c7701274a40fae912db30dff4a
Summary:
I was testing the performance difference between the naive group conv and the cuDNN group conv. I am doing no_bias conv and added support for it in the naive implementation.
Although it's deprecated, I thought it would be nice to have working things in our code.
Differential Revision: D4363168
fbshipit-source-id: 29719013d79b449fd359884709c7a1195be51ae3
Summary: As per discussion in D4355529
Reviewed By: prigoyal
Differential Revision: D4362162
fbshipit-source-id: 795fcf1507235a7dc3c7a10b0453037936d057aa
Summary:
It used to be that only the cuDNN engine supported it; now it should be fully
supported by any conv engine.
To ignore the bias, simply use a convolution op that has two inputs instead of
three. The gradient operator will automatically figure out that it does not
need to compute the bias gradient.
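For illustration, the two forms side by side (a minimal sketch with placeholder blob names, not taken from the diff):
from caffe2.python import core

# with bias: three inputs (X, W, b)
conv_with_bias = core.CreateOperator("Conv", ["X", "W", "b"], ["Y"], kernel=3, pad=1)

# without bias: drop the third input; the gradient operator then skips the bias gradient
conv_no_bias = core.CreateOperator("Conv", ["X", "W"], ["Y"], kernel=3, pad=1)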
Reviewed By: prigoyal
Differential Revision: D4354183
fbshipit-source-id: cf71b6289a254d15a6a663a85df63fbbaec3702b
Summary: As discussed, this improves performance a lot and is no longer a memory hog. In any case, anyone can also turn it off.
Differential Revision: D4338798
fbshipit-source-id: bf0fdb594427ebe90e1e94b2effdc63196096b3f