pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Kittipat Virochsiri	5c32c82a6d	Add option to subtract log odd from sampled trained prediction. Summary: Useful for sampled softmax training Differential Revision: D4782673 fbshipit-source-id: 88195de60070a0bc16f5e06b9aad4dffd0484546	2017-04-03 17:50:58 -07:00
Xianjie Chen	9fc56793dd	fix trunk for push and small cleanup Summary: multiple places broken, blocking the push :( - fix the weighted training for ads and feeds - fix the publishing if no exporter model is selected - fix the feeds retrieval evaluation - added the default config for retrieval workflows. plan to use for flow test (in next diff) - clean up not used code - smaller hash size for faster canary test Reviewed By: chocjy Differential Revision: D4817829 fbshipit-source-id: e3d407314268b6487c22b1ee91f158532dda8807	2017-04-02 23:35:49 -07:00
Kittipat Virochsiri	3eb3507367	uniform_sampling layer Summary: This layer will be used to sample negative labels for sampled softmax. Differential Revision: D4773444 fbshipit-source-id: 605a979c09d07531293dd9472da9d2fa7439c619	2017-03-29 14:36:12 -07:00
Andrey Malevich	7cc92b1260	Add eval net for layer_model_helper Summary: This diff is adding eval nets to layer model helper. It should be useful for the cases when train/eval nets need some extra input (usually some supervision) for train/eval. For example various sampled layers, etc. Differential Revision: D4769453 fbshipit-source-id: 7a8ec7024051eab73b8869ec21e20b5f10fd9acb	2017-03-29 04:03:40 -07:00
Kittipat Virochsiri	da36212259	SamplingTrain layer Summary: `SamplingTrain` layer is a wrapper around another layer subclassing `SamplingTrainableMixin`. When initiated in the training context, `SamplingTrain` produces sparse output of the wrapped layer. Output can be paired with `indices` to create Map schema. When initiated in prediction context, the full output of the wrap layer is produced. This is liked the SampledFC function in model helper, https://fburl.com/gi9g1awh, with the ability to initiated in both trainig and prediction context. I'd like to get consensus whether we should introduce the `SamplingTrain` layer and the accompaying mixin. This can probably be accomplished in some other way, but I think this is not too bad. Reviewed By: xianjiec Differential Revision: D4689887 fbshipit-source-id: 7be8a52d82f3a09a053378146262df1047ab26a8	2017-03-27 23:31:55 -07:00
Huazhong Ning	8168e8ac25	allows to specify output names for functional layers Summary: currently the output schema and blobs are names as "field_i" which is bad for debugging. This diff allows us to specify output names. Reviewed By: kennyhorror Differential Revision: D4744949 fbshipit-source-id: 8ac4d3c75cacbb4c9b5f55793ac969fe1cf20467	2017-03-23 13:18:58 -07:00
Kittipat Virochsiri	4829bdb1ea	BatchSoftmaxLoss layer Summary: Similar to BatchLRLoss layer Reviewed By: xianjiec Differential Revision: D4689609 fbshipit-source-id: 89fa4b9d4145ce77cb2aaa7a5c0c1a24f901d88f	2017-03-17 10:19:06 -07:00
Kittipat Virochsiri	cea16ff7cd	BatchSigmoidCrossEntropyLoss Summary: To support feed interset team Reviewed By: kdub0 Differential Revision: D4719213 fbshipit-source-id: 8deb3544377fb06593399b101de66f3f845f93b5	2017-03-17 09:35:51 -07:00
Kittipat Virochsiri	61dd35f1d6	FCWithoutBias layer Summary: For some embedding task, we don't want to include bias term in embedding computation. Reviewed By: xianjiec Differential Revision: D4689620 fbshipit-source-id: 4168584681d30c0eaa1d17ceaf68edda11924644	2017-03-15 11:03:37 -07:00
Kittipat Virochsiri	25b1221579	Allow scalar output in functional layer Summary: Some operators, e.g., SoftmaxWithLoss, returns scalar-typed tensor. This would allow us to use those ops without having to write layer manually. Reviewed By: xianjiec, kennyhorror Differential Revision: D4703982 fbshipit-source-id: f33969971c57fc037c9b44adb37af1caba4084b6	2017-03-14 15:32:47 -07:00
Andrey Malevich	a3726759c6	Add a way do describe layers in a more AdHoc manner. Summary: This diff is trying to address one of the concerns that Xianjie have had - requirements create a layer for all operators and attach pass shapes and other info around. The basic idea of the diff: 1. Try to create a layer with a given name, but if it's not available try to fallback on operator with that name (that is expected to have no parameters). 2. For all operators that we're adding through this functional style of creation - try to use C2 Shape/Type inference logic to get output type. If we fail to get - it just return untyped record and expect user to annotate it when it's really needed. Reviewed By: xianjiec Differential Revision: D4408771 fbshipit-source-id: aced7487571940d726424269970df0eb62670c39	2017-02-27 23:30:39 -08:00

11 Commits