Summary:
Change log
- Support rectangular cropping, where the height and width of the clip crop can be set separately. This is useful because most video resolutions are non-square, e.g. 240p, 360p, and 480p, where width is significantly larger than height.
- Comparison of training on UCF-101 using 112x112 crops vs. 112x144 crops:
- https://fburl.com/i0rw6y1k
- Support 14-crop testing per video clip to improve classification accuracy: take the left-top, central-top, right-top, left-bottom, central-bottom, right-bottom, and central crops as well as their mirrored versions, 14 crops in total (see the sketch after this change log).
- Comparison on the same model trained on UCF-101, using 1 clip per video:
- RGB: f41014306 (w/o) vs. f41014868 (w/ multi-cropping): `0.64099 vs 0.65796`
- OF: f41014889 (w/o) vs. f41014913 (w/ multi-cropping): `0.65796 vs 0.67624`
- Support color jittering and color lighting on RGB data for training-data augmentation.
- Comparison of training on UCF-101 from scratch with and without color jittering and lighting:
- https://fburl.com/k69zatul
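For illustration, a minimal pure-Python sketch of how the 14 rectangular test-time crops can be enumerated (independent of the actual implementation in this diff; all names here are ours):

    def fourteen_crops(frame_h, frame_w, crop_h, crop_w):
        """Return 14 (y, x, mirrored) crop specs: 7 positions x {plain, mirrored}."""
        ys = {"top": 0, "bottom": frame_h - crop_h, "center": (frame_h - crop_h) // 2}
        xs = {"left": 0, "right": frame_w - crop_w, "center": (frame_w - crop_w) // 2}
        positions = [
            ("top", "left"), ("top", "center"), ("top", "right"),
            ("bottom", "left"), ("bottom", "center"), ("bottom", "right"),
            ("center", "center"),
        ]
        # Rectangular crops (crop_h != crop_w) suit the non-square inputs above.
        return [(ys[r], xs[c], mirrored)
                for (r, c) in positions
                for mirrored in (False, True)]

    assert len(fourteen_crops(240, 320, 112, 144)) == 14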
Reviewed By: HengCV
Differential Revision: D6962620
fbshipit-source-id: 9b43478945874142727fea351ee04417218e6606
Summary:
Main changes:
1. Move reader creation to brew, for consistency and to avoid wild use of param_init_net
2. Use the optimizer module to build the training function instead of constructing the optimizer manually (see the sketch below)
3. Add an MLP mode (the default)
4. Trim a bunch of overly verbose comments and add a bit of new explanation
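A rough sketch of the resulting pattern, in the style of the MNIST tutorial (blob names and hyperparameters here are illustrative, not taken from this diff):

    from caffe2.python import brew, model_helper, optimizer

    model = model_helper.ModelHelper(name="mnist_train")
    # Reader created through brew instead of poking at model.param_init_net.
    data, label = brew.db_input(
        model, blobs_out=["data", "label"],
        batch_size=64, db="mnist-train-nchw-lmdb", db_type="lmdb",
    )
    fc1 = brew.fc(model, data, "fc1", dim_in=28 * 28, dim_out=10)
    softmax = brew.softmax(model, fc1, "softmax")
    xent = model.net.LabelCrossEntropy([softmax, label], "xent")
    loss = model.net.AveragedLoss(xent, "loss")
    model.AddGradientOperators([loss])
    # Optimizer built via the optimizer module, not constructed by hand.
    optimizer.build_sgd(model, base_learning_rate=0.1)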
Closes https://github.com/caffe2/caffe2/pull/1760
Differential Revision: D6749059
Pulled By: salexspb
fbshipit-source-id: 9dfbbb2d9772a74a0300c2e404a92e791f7cc593
Summary:
Adding If and While control ops to brew, along with unit tests.
Note: unlike net_builder, where we can figure out which blobs are external and which are local to the subnets, in brew we need to use the external_blobs param explicitly to point at the external blobs.
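A hedged sketch of the intended usage; the helper name (`brew.cond`) and its exact signature are assumptions based on the description above, not verified against the final API:

    from caffe2.python import brew, model_helper

    main = model_helper.ModelHelper(name="main")
    then_branch = model_helper.ModelHelper(name="then")
    then_branch.net.ConstantFill([], ["x"], shape=[1], value=1.0)
    else_branch = model_helper.ModelHelper(name="else")
    else_branch.net.ConstantFill([], ["x"], shape=[1], value=0.0)

    # Unlike net_builder, brew cannot infer which blobs outlive the
    # subnets, so external_blobs must list them explicitly (assumed
    # signature).
    brew.cond(main, cond_blob="flag", external_blobs=["x"],
              then_model=then_branch, else_model=else_branch)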
Reviewed By: harouwu
Differential Revision: D6440508
fbshipit-source-id: c920f0af84b77ccb2d8462ffc7567bb1908c844a
Summary: Quick fix for a unit test broken by D6454290. This is my fault for approving while the tests covering the single call site were broken.
Reviewed By: goldsborough
Differential Revision: D6466566
fbshipit-source-id: 2683be3d6bb184286e64fbde3e572946e39030c7
Summary:
While working on layer normalization for LSTMs I encountered an issue where the layer norm parameters (which are the scale/gain and bias/shift from the paper) were not registered in the model for `brew.layer_norm`. salexspb explained that this is because it was using the `init_net_param` API instead of `create_param`. This diff fixes this.
While fixing this I noticed that `brew.layer_norm` actually had a bug: it was multiplying by the bias instead of adding it. Another issue was that the function gave the scale and bias a shape of `[1]`; however, the paper (https://arxiv.org/pdf/1607.06450.pdf) specifies that, as with batch norm, there is one scale and one bias parameter per neuron, i.e. the shape should be `[1, axis_dimension]`. The API now takes an explicit `dim_in` parameter (also more consistent with the other normalization functions in that module) so that this can be specified. See the tests for how this now looks.
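A minimal numpy sketch of the corrected computation, with per-neuron scale and bias of shape `[1, dim_in]` and the bias added rather than multiplied:

    import numpy as np

    def layer_norm(x, scale, bias, epsilon=1e-4):
        # x: [batch, dim_in]; scale, bias: [1, dim_in] -- one parameter
        # per neuron, per https://arxiv.org/pdf/1607.06450.pdf
        mean = x.mean(axis=1, keepdims=True)
        std = np.sqrt(x.var(axis=1, keepdims=True) + epsilon)
        normalized = (x - mean) / std
        # The fixed bug: the bias is *added* after scaling, not multiplied in.
        return normalized * scale + bias

    x = np.random.randn(4, 8).astype(np.float32)
    out = layer_norm(x, scale=np.ones((1, 8)), bias=np.zeros((1, 8)))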
Reviewed By: jhcross
Differential Revision: D6454290
fbshipit-source-id: fc00ca614de3190c40ab743e8984bec9e85fb58c
Summary: Updated brew SpatialBN to use initializers, similar to other brew ops such as conv and fc, instead of initializing all of its parameters itself within the brew call.
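A hedged sketch of the resulting call pattern (blob names and dimensions illustrative):

    from caffe2.python import brew, model_helper

    model = model_helper.ModelHelper(name="bn_example")
    # The helper's scale, bias and running-stat parameters are now
    # registered via Initializer objects, as conv/fc do, so they show
    # up in model.params and respect parameter handling.
    brew.spatial_bn(model, "data", "bn_out", dim_in=64,
                    epsilon=1e-3, order="NCHW", is_test=False)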
Reviewed By: asaadaldien
Differential Revision: D5840359
fbshipit-source-id: 9f3d688d4957605eaf7ecd2488bc26bfb1da3f78
Summary: Allow the GEMMs in the FC/FCGradient Op to do FP16 compute instead of FP32 if the appropriate op flag is set.
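A hedged sketch; the flag name (`float16_compute`) is an assumption here, not confirmed from this diff:

    from caffe2.python import brew, model_helper

    model = model_helper.ModelHelper(name="fp16_fc")
    # With the flag set, the FC GEMM may run its math in FP16 rather
    # than FP32 (flag name assumed for illustration).
    brew.fc(model, "data", "fc1", dim_in=1024, dim_out=1024,
            float16_compute=True)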
Reviewed By: asaadaldien
Differential Revision: D5839777
fbshipit-source-id: 8051daedadf72bf56c298c1cf830b019b7019f43
Summary: The cudnn version of the DropoutOp was taking a significant (and unwarranted) amount of time in our RNN training. Further investigation showed that setting the cudnn dropout descriptors was an extremely expensive operation (https://pxl.cl/99nT), much more so than the dropout operation itself. This diff adds to the DropoutCell the option to disable cudnn. The non-cudnn version uses a raw curand call that elides all of the expensive descriptor setting.
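A hedged sketch of opting out of cuDNN for the dropout inside an RNN; the LSTMCell arguments and the `use_cudnn` flag shown are assumptions for illustration:

    from caffe2.python import rnn_cell

    lstm = rnn_cell.LSTMCell(input_size=128, hidden_size=256,
                             forget_bias=0.0, memory_optimization=False,
                             name="lstm")
    # Skips the expensive cudnnSetDropoutDescriptor path and uses the
    # raw curand-based dropout instead (flag name assumed).
    drop = rnn_cell.DropoutCell(internal_cell=lstm, dropout_ratio=0.5,
                                name="drop", use_cudnn=False)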
Reviewed By: jmp84, akyrola
Differential Revision: D5972022
fbshipit-source-id: 6325ec5d6569f8b94d776cbb2554cc8ddb28f699
Summary: PR 1175 caused a build error because gemmBatched was only defined under a specific #ifdef. It is now moved outside the #ifdef, and things work.
Reviewed By: asaadaldien
Differential Revision: D5834868
fbshipit-source-id: 072a64c8f4b259ff7504104121766115b46b8aa0
Summary: Implement a brew wrapper for the LayerNorm op. This adds the scalar weight and bias terms to the op.
Reviewed By: jmp84
Differential Revision: D5595836
fbshipit-source-id: 467b2e1158b0c454a149d4b26c47719826e98752
Summary:
Add support for TensorCore convolution and gemm on Volta hardware.
Currently built on top of #1055
Closes https://github.com/caffe2/caffe2/pull/1056
Differential Revision: D5604068
Pulled By: Yangqing
fbshipit-source-id: 100f67e26ed5fabb1dbb31dcd77f7ecb84de4ee7
Summary: This allows users to add an arbitrary number of additional outputs to ImageInputOp. These are populated by reading additional TensorProto values from the TensorProtos supplied by the DBReader and converting them into Tensors. As with labels, only ints and floats are supported, and multiple values per output are supported.
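For context, a sketch of one DB record with extra values at the TensorProtos level (field usage per caffe2.proto; the image bytes are a placeholder):

    from caffe2.proto import caffe2_pb2

    protos = caffe2_pb2.TensorProtos()
    img = protos.protos.add()
    img.data_type = caffe2_pb2.TensorProto.STRING
    img.string_data.append(b"<encoded jpeg bytes>")  # placeholder
    label = protos.protos.add()
    label.data_type = caffe2_pb2.TensorProto.INT32
    label.int32_data.append(7)
    # Extra TensorProto entries become additional ImageInputOp outputs;
    # ints and floats only, multiple values per entry are allowed.
    extra = protos.protos.add()
    extra.data_type = caffe2_pb2.TensorProto.FLOAT
    extra.float_data.extend([0.25, 0.75])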
Reviewed By: panshen1
Differential Revision: D5502019
fbshipit-source-id: 5a8b61b3a8549272a112e8e02cd613d8f9a271ba
Summary:
Given the parameter init_params=False, the weight blob (*_w) and bias blob (*_b) should be suppressed in model.param_init_net. Without this fix, init_params=False doesn't take effect in brew.conv the way it does in brew.fc and other ops. This issue is the root cause of #790 (https://github.com/caffe2/caffe2/pull/790).
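A minimal sketch of the fixed behavior (names illustrative):

    from caffe2.python import brew, model_helper

    # With init_params=False, brew.conv must not emit fill ops for
    # conv1_w / conv1_b into model.param_init_net; the blobs are
    # expected to come from a checkpoint or a shared workspace instead.
    model = model_helper.ModelHelper(name="finetune", init_params=False)
    brew.conv(model, "data", "conv1", dim_in=3, dim_out=16, kernel=3)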
Closes https://github.com/caffe2/caffe2/pull/824
Reviewed By: harouwu
Differential Revision: D5276676
Pulled By: akyrola
fbshipit-source-id: 8f7088a8e1976658f67e027223e555375b3a2392
Summary:
Add a helper function for the parametric op ElementwiseLinear.
The typical syntax is model.ElementwiseLinear(input, output, dimension).
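A small usage sketch, following the call shape quoted above (blob names and the dimension are illustrative):

    from caffe2.python import model_helper

    model = model_helper.ModelHelper(name="ew_linear")
    # ElementwiseLinear computes y[n][d] = w[d] * x[n][d] + b[d]; the
    # helper creates the per-dimension w and b parameters itself.
    model.ElementwiseLinear("data", "ew_out", 256)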
Reviewed By: harouwu, akyrola
Differential Revision: D5114152
fbshipit-source-id: 8e8c691f824f518ae510a72ab0c12de1b018f3b5
Summary:
This diff creates a new type of Initializer - ExternalInitializer. This initializer is meant to be used when the parameter blob is already expected to exist in the workspace.
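A sketch of the intended usage with brew.fc; the WeightInitializer/BiasInitializer kwargs follow the existing fc helper, and blob names are illustrative:

    from caffe2.python import brew, model_helper
    from caffe2.python.modeling.initializers import ExternalInitializer

    model = model_helper.ModelHelper(name="pretrained")
    # fc1_w / fc1_b are expected to already exist in the workspace
    # (e.g. loaded from a checkpoint), so no fill ops are emitted for
    # them, yet the blobs are still registered as model parameters.
    brew.fc(model, "data", "fc1", dim_in=256, dim_out=128,
            WeightInitializer=ExternalInitializer(),
            BiasInitializer=ExternalInitializer())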
Reviewed By: dzhulgakov
Differential Revision: D5171322
fbshipit-source-id: d27861f0f80afdea93c235d49f63da19adccc92c
Summary:
This diff is the first step in the effort to refactor all parameters. As a first step, I'm merging the concepts of params and computed_params, which are going to be based on tags instead (the first version still uses the old data structs to store all the BlobReferences).
Renaming computed_params to non-trainable/non-backprop params should be done in some other diff.
Reviewed By: salexspb
Differential Revision: D5171159
fbshipit-source-id: 68031ca779f053fb266a7c4a2e5b482a3bd9c832
Summary:
This diff is the first step in the effort to refactor all parameters. As a first step, I'm merging the concepts of params and computed_params, which are going to be based on tags instead (the first version still uses the old data structs to store all the BlobReferences).
Renaming computed_params to non-trainable/non-backprop params should be done in some other diff.
Reviewed By: salexspb
Differential Revision: D5119830
fbshipit-source-id: 2001090a37346eb12abbb234e13e727c288eb8a7
Summary:
Adds support for generating and training pfp16 models. Adds an SGD optimizer for multi-precision trainers and a new callback to data_parallel_model to help multi-precision models keep their different copies of parameters in sync during training.
Closes https://github.com/caffe2/caffe2/pull/697
Differential Revision: D5159712
Pulled By: salexspb
fbshipit-source-id: 60a889494d2e2f4df1d720331e19f638c5eb95cc
Summary:
This is going to unblock Nvidia in their work on adding fp16
support to Caffe2. I discussed this with kennyhorror before to make
sure this fits into his work on parameter sharing.
Reviewed By: kennyhorror
Differential Revision: D5127797
fbshipit-source-id: 4db155d320b1862570c23b77c4252bdacbf2296f
Summary:
Schema generation was previously broken, leading to invalid gradient op creation.
This also showed up in model_device_helper, where invalid schemas were being created on the CPU when kwargs['engine'] == 'CUDNN'.
Closes https://github.com/caffe2/caffe2/pull/617
Reviewed By: asaadaldien
Differential Revision: D5097062
Pulled By: akyrola
fbshipit-source-id: e22181f857deccb7b4395e87271e2cbf1226eb64
Summary:
Update rnn_cell.py and the char_rnn.py example to the new `brew` model:
- Deprecate CNNModelHelper
- Replace all helper functions with brew helper functions
- Use the `model.net.<SingleOp>` format to create bare-bones operators, for better clarity (see the sketch below)
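A minimal illustration of the convention (names illustrative):

    from caffe2.python import brew, model_helper

    model = model_helper.ModelHelper(name="char_rnn_style")
    # Parameterized layer: goes through a brew helper.
    brew.fc(model, "hidden", "logits", dim_in=512, dim_out=256)
    # Parameter-free operator: spelled out as a bare op for clarity.
    model.net.Softmax("logits", "probs")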
Reviewed By: salexspb
Differential Revision: D5062963
fbshipit-source-id: 254f7b9059a29621027d2b09e932f3f81db2e0ce
Summary:
cuDNN versions of dropout and LRN (for native fp16 support), port of Caffe's max pooling algo that uses an explicit mask to store locations (also supports fp16 storage)
Closes https://github.com/caffe2/caffe2/pull/396
Reviewed By: akyrola
Differential Revision: D4990880
Pulled By: asaadaldien
fbshipit-source-id: a716acffb656843e9b31e3e6808bd2d8aa959d03
Summary: Adding a simple video data layer that can read video data from frames or videos and output a 5D tensor. It also supports multiple labels. The current implementation is based on ffmpeg.
Differential Revision: D4801798
fbshipit-source-id: 46448e9c65fb055c2d71855447383a33ade0e444
Summary:
Adding add_weight_decay and image_input to the brew module & removing `getWeights` and `getBias` from CNNModelHelper.
An fbgs search for `useWeights` shows that no one but add_weight_decay uses this function. I checked with the Oculus people; their getWeights is a different function.
kennyhorror, please check whether this is going to affect you :)
Reviewed By: salexspb
Differential Revision: D4945392
fbshipit-source-id: 4ef350fd81dd40a91847e9f3ebc5421eb564df32
Summary:
Rename model_helpers to brew. This is a big diff. I did these things:
1. Replaced model_helpers with brew:
   find . -type f -exec sed -i 's/model_helpers/brew/g' {} +
2. Renamed model_helpers.py and model_helpers_test.py
3. Renamed ModelHelpersTest to BrewTest
4. Lowercased all the helper functions to distinguish them from single ops
5. Ran my unit tests
6. Ran convergence tests
Reviewed By: salexspb
Differential Revision: D4930465
fbshipit-source-id: f420a1b03238df1cbe9f4426e0b9c43a12119661
Summary:
An arg_scope module for model_helpers.
Some coding examples with it:

    with model_helpers.arg_scope([model_helpers.FC], kwargs):
        model_helpers.FC(model, "x", "out_1", n, n)

    with model_helpers.arg_scope([myhelper], n=-3):
        with model_helpers.arg_scope([myhelper], n=-2):
            with model_helpers.arg_scope([myhelper], n=n):
                res = model_helpers.myhelper(None)

    with model_helpers.arg_scope([myhelper], n=-3), \
         model_helpers.arg_scope([myhelper], n=-2), \
         model_helpers.arg_scope([myhelper], n=n):
        res = model_helpers.myhelper(None)
Reviewed By: salexspb
Differential Revision: D4837180
fbshipit-source-id: 2cbd81681779d6cd1e61ee189edcc1cf3bb07d15
Summary:
Add conv helpers. The migration of the functions assumes that people should not do:

    cnn_model = CNNModelHelper(use_cudnn=True)
    cnn_model.Conv(..., use_cudnn=False, ...)
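With brew, the engine choice travels with each helper call instead; a short sketch (assuming the conv helper forwards a use_cudnn kwarg):

    from caffe2.python import brew, model_helper

    model = model_helper.ModelHelper(name="conv_example")
    # Each call states its own engine preference; there is no model-wide
    # use_cudnn default to silently override per op.
    brew.conv(model, "data", "conv1", dim_in=3, dim_out=16, kernel=3,
              use_cudnn=True)
    brew.conv(model, "conv1", "conv2", dim_in=16, dim_out=32, kernel=3,
              use_cudnn=False)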
Reviewed By: salexspb
Differential Revision: D4884974
fbshipit-source-id: 12af6e2a5863eba789232cd4a4771f95d05f9227