Commit Graph

9 Commits

Author SHA1 Message Date
Ahmed Aly
22446a3619 Productionize CRF layer in PyText (#10362)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/10362

This diff implements a manual export from PyText's CRF module to the caffe2 CRF layer.
Note that most of the changes in caffe2/python/crf.py are just formatting changes, the only relevant change is the new class CRFUtils.

Reviewed By: hikushalhere

Differential Revision: D9234126

fbshipit-source-id: 1a67d709034660e8b3d5ac840560b56de63e3f69
2018-08-22 00:25:26 -07:00
Orion Reblitz-Richardson
1d5780d42c Remove Apache headers from source.
* LICENSE file contains details, so removing from individual source files.
2018-03-27 13:10:18 -07:00
Yangqing Jia
8286ce1e3a Re-license to Apache
Summary: Closes https://github.com/caffe2/caffe2/pull/1260

Differential Revision: D5906739

Pulled By: Yangqing

fbshipit-source-id: e482ba9ba60b5337d9165f28f7ec68d4518a0902
2017-09-28 16:22:00 -07:00
Ahmed Aly
2f381bf6a4 Joint intent-slots modeling workflow initial diff
Summary:
This is a prototype for joint intents + slots modeling workflow, it has the following:

1- New data readers and data processors to process joint labels in parallel
2 - New JointNN model
3- New Fblearner workflow (jointnn) for joint modeling experimentations

This is still work in progress, sending the diff to start the discussion about the interface and what we need to support in our joint modeling efforts.

P.S. The number of lines in this diff is multiplied by 3 since caffe2 is mirrored in both fbandroid and fbobjc.  I will highlight the most important parts so that people are not confused.

Differential Revision: D5725243

fbshipit-source-id: ecc5322f937ad0fddaf200a9e090b3573a69f994
2017-09-28 03:47:34 -07:00
Junjie Bai
c1f974aa9f Deprecate CNNModelHelper in python/crf.py
Reviewed By: harouwu

Differential Revision: D5241631

fbshipit-source-id: 3dc448355bc2a766ae9eda1dc579e501743b35cf
2017-06-14 08:49:27 -07:00
Ahmed Aly
58bc830660 Integrate CRF in DeepText + New caffe2 operator for viterbi decode
Summary: Inegration of the CRF Layer in DeepText wordmodels + Implementing the viterbi decode operator in C++ instead of python so that the CRF models can be deployed in production.

Differential Revision: D4912196

fbshipit-source-id: 64f499a1bd47e811e7a96dde839904dcd05cacb3
2017-05-01 20:39:41 -07:00
Aapo Kyrola
e13e9c1302 cuDNN version of TransposeOp
Summary:
Uses the cudnnTransformTensor function. It works by shuffling the strides according to the transpose axis. Significant speedup over current GPU version .
+ moves the transpose test under utility_ops, because hypothesis_test is too big

Reviewed By: jamesr66a

Differential Revision: D4810993

fbshipit-source-id: 82577c4ced1389e70bd5992820ae4d8297a3817f
2017-04-03 13:33:10 -07:00
Aaron Markham
58f7f2b441 doxygen python block added
Summary: Closes https://github.com/caffe2/caffe2/pull/226

Differential Revision: D4793550

Pulled By: JoelMarcey

fbshipit-source-id: cc33e58186304fa8dcac2ee9115dcc271d785b1e
2017-03-29 06:46:16 -07:00
Ahmed Aly
99bfd36a04 CRF layer in caffe2
Summary:
This is implementation of a CRF layer in caffe2 according to this paper: https://arxiv.org/abs/1603.01360
Currently this implementation works only for batch_size = 1

Reference implementations:

- Tensorflow:
 63a21e0540/tensorflow/contrib/crf/python/ops/crf.py

- Theano:
https://github.com/glample/tagger/blob/master/model.py#L286

Differential Revision: D4644004

fbshipit-source-id: bf0801fd8562d11dca3fefe371c3d85e1dd69ccc
2017-03-23 22:02:02 -07:00