pytorch/caffe2/python
Yury Zemlyanskiy 4a53ab3cb6 LSTMWithAttention implementation in Caffe2
Summary:
Implementation of LSTMWithAttention

Still TBD:
1. Backpropagation does not work yet, because gradients are not implemented for ops with broadcasting
2. initial_recurrent_state should have shape [dim] rather than [1, batch_size, dim], so that callers do not need to provide batch_size to LSTMWithAttention

Differential Revision: D4298735

fbshipit-source-id: 8903fcff4d6a66647ee6d45a6ef28803fc3091e5
2017-02-23 04:08:34 -08:00
docs Documentation generation to wiki 2017-02-15 16:00:44 -08:00
examples Char RNN bug fix for batching 2017-02-22 17:34:30 -08:00
layers model and preprocessor can handle empty dense inputs 2017-02-22 11:19:15 -08:00
mint goodbye old brewery 2017-01-04 20:58:35 -08:00
models Added model downloader 2017-02-22 12:47:15 -08:00
operator_test LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
_import_c_extension.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
attention.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
caffe_translator_test.py protected legacy_pad_, replace DeleteDropout with is_test=True 2016-07-29 11:44:55 -07:00
caffe_translator.py translator fix to solve Aaron's issue 2017-02-13 11:19:13 -08:00
checkpoint_test.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
checkpoint.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
CMakeLists.txt CMake completions work 2017-01-11 16:59:22 -08:00
cnn.py Support for sample softmax 2017-02-17 09:31:54 -08:00
context_test.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
context.py Make ContextManager thread-safe 2017-02-13 19:45:35 -08:00
control_test.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
control.py Better visualization for gpu training plan 2016-12-21 09:29:43 -08:00
convnet_benchmarks_test.py chunky sync - build scripts to be written 2016-07-21 10:16:42 -07:00
convnet_benchmarks.py Use ws_nbytes_limit (called cudnn_ws in args). 2017-02-22 12:19:16 -08:00
core_gradients_test.py Implement EnsureDenseOp and EnsureDenseGradientOp. 2017-02-22 07:16:26 -08:00
core_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
core.py Add a create your own dataset tutorial 2017-02-22 03:31:47 -08:00
data_parallel_model_test.py lr_scale to loss_scale 2017-02-03 07:44:40 -08:00
data_parallel_model.py Use fbcollective 2017-02-07 10:48:33 -08:00
data_workers_test.py a couple small reliability improvements 2016-12-15 21:29:29 -08:00
data_workers.py Implemented logging of inputs per second 2017-02-16 12:02:05 -08:00
dataio_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
dataio.py fix typo in TextFileReader 2017-02-21 14:02:48 -08:00
dataset.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
db_test.py Fix db_test under tsan 2016-11-29 15:18:37 -08:00
device_checker.py chunky sync 2016-09-06 15:55:19 -07:00
dyndep.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
experiment_util.py use Pieter-MPI and fb.distributed 2016-11-29 15:18:36 -08:00
extension_loader.py fbsync 2016-10-07 13:08:53 -07:00
gradient_check_test.py Fix test cases: tensor of size 0 not supported by GPU ops yet. 2016-12-15 19:59:24 -08:00
gradient_checker.py fbsync 2016-10-07 13:08:53 -07:00
hsm_util.py Generate huffman tree 2017-01-19 16:14:23 -08:00
hypothesis_test_util.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
hypothesis_test.py allow recurrent network gradient op to receive gradient on any combination of network output blobs 2017-02-15 16:00:45 -08:00
introspect_vis.py Dump data for DocNN visualization 2017-02-09 13:47:07 -08:00
layer_model_helper.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
layer_model_instantiator.py Add model graph to dper_example 2017-02-07 13:03:54 -08:00
load_save_test.py Add name_overrides argument to SaveOp 2017-02-16 12:32:51 -08:00
memonger_test.py Gradient Input memory sharing using memonger blob sharing 2017-01-09 19:44:23 -08:00
memonger.py Fixes to topological sort, canonical blob naming, sharing final blob 2017-01-25 15:14:26 -08:00
mkl_test_util.py MKLDevice and MKLOperator 2016-12-15 19:59:24 -08:00
model_device_test.py Comment out NHWC Alexnet test for now 2017-01-23 13:59:29 -08:00
model_helper.py Gradient for Copy operator 2017-02-16 06:11:27 -08:00
mpi_python.cc Move mpi_python.cc to the python folder to be more consistent about source file locations. 2017-01-09 10:59:39 -08:00
muji_test.py chunky sync - build scripts to be written 2016-07-21 10:16:42 -07:00
muji.py fbsync 2016-10-07 13:08:53 -07:00
net_builder_test.py Improvements+fixes for NetBuilder 2017-01-03 16:59:24 -08:00
net_builder.py Improve "reporter net" design 2017-02-21 20:17:40 -08:00
net_drawer.py Add model graph to dper_example 2017-02-07 13:03:54 -08:00
net_printer_test.py Debug/Analysis tools for Jobs/ExecutionSteps 2017-02-06 17:31:20 -08:00
net_printer.py Improve "reporter net" design 2017-02-21 20:17:40 -08:00
pipeline.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
pybind_state_gpu.cc fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
pybind_state_mkl.cc Expose MKLMemory to the Python Feed and Fetch interface, and misc changes 2016-11-29 15:18:36 -08:00
pybind_state.cc Add a create your own dataset tutorial 2017-02-22 03:31:47 -08:00
pybind_state.h Allow PythonOp to access the workspace 2016-12-05 11:53:26 -08:00
python_op_test.py Allow PythonOp to access the workspace 2016-12-05 11:53:26 -08:00
queue_util.py Better names for nets, steps and tasks 2017-02-09 16:33:54 -08:00
record_queue.py chunky sync 2016-09-06 15:55:19 -07:00
recurrent.py LSTMWithAttention implementation in Caffe2 2017-02-23 04:08:34 -08:00
schema_test.py schema.Struct.__add__ 2017-02-06 13:47:58 -08:00
schema.py more clear about supported output dimension 2017-02-16 21:01:52 -08:00
scope_test.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
scope.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
session_test.py NextScopedBlob with well-defined behavior and respect namescope 2017-02-16 17:16:36 -08:00
session.py Fix issues pickling jobs 2017-02-21 20:47:27 -08:00
sgd_test_util.py Move tests of build_sgd, build_adagrad, and build_adam to python directory 2017-02-14 15:32:14 -08:00
sgd_test.py Move build_ftrl to open source directory 2017-02-16 18:02:15 -08:00
sgd.py Move build_ftrl to open source directory 2017-02-16 18:02:15 -08:00
sparse_to_dense_mask_test.py Fix few more operators to handle empty batches correctly. 2016-11-29 15:18:37 -08:00
task.py Gather perf counters for distributed jobs 2017-02-21 22:06:25 -08:00
test_util.py MKL convolution operator 2017-01-23 09:59:30 -08:00
text_file_reader.py fix typo in TextFileReader 2017-02-21 14:02:48 -08:00
timeout_guard.py fbsync. TODO: check if build files need update. 2016-11-15 00:00:46 -08:00
toy_regression_test.py sync 2016-08-10 11:02:15 -07:00
tt_core_test.py sync 2016-08-10 11:02:15 -07:00
tt_core.py sync 2016-08-10 11:02:15 -07:00
utils.py Add a create your own dataset tutorial 2017-02-22 03:31:47 -08:00
visualize.py chunky sync 2016-05-13 14:43:48 -07:00
workspace_test.py Remove redundant and failing test of FeedBlob asserts 2016-12-22 14:59:28 -08:00
workspace.py Remove Model API (unused) 2017-02-21 17:19:05 -08:00