pytorch/caffe2/python
sf-wind 602a09dde7 Update caffe2 from facebook 4f527ef46abf (#2234)
* [GanH]: two_task_discriminator

as titled

and adding label smooth

* [Dper2] Simplified UI options needed for blob magnitude visualization

* [GanH]: fix tags

as titled

* Added type and shape inference for GatherRange operator

This helps with type / shape inference when using this operator in layers.
Also just a nice to have in general.

* Demonstrate Caffe2 exception handling with StoreHandlerTimeoutError in Python

We'd like to catch and recover from certain Caffe2 net exceptions. Use this diff to demonstrate a pattern of registering a pybind exception mapping and catching in Pythonusing caffe2::StoreHandlerTimeoutException.

* Bind Gloo IoException to IoError in Python

Allow peer failure handling and recovery using an exception based mechanism. This diff registers gloo::IoException with pybind.

* [GanH]: add label smoothing to softmax with loss

as titled

* [C2] Enable LARS in Adagrad and hook it to DPER

* [DPER] Don't pass LayerModelHelper in create_trainer_nodes

Since we're planning to get rid of it eventually and I want to get access to
NetDef only interface ASAP - I'm looking towards removing all references to
LMH, where we don't really need them.

* fix bugs in LambdaRankNdcgOp

the loss and gradient in LambdaRankNdcgOp are incorrect. The loss should be negative log of probs instead of log.

* Restrict thread pool on iOS to only big cores

Historically, iPhones exposed only one type of cores, and Caffe2 thread pool used all of them.
However, iPhone 8/iPhone X exposes 2 big + 4 LITTLE cores. As our thread pool doesn't support work stealing or other forms of load balancing, fast cores end up waiting for the slow ones, and it may be better to restrict execution to only 2 fast cores, like we do on Android.

* Remove SparseLength Sum/WeightedSum/Mean operators with fp16 engine

Remove SparseLength Sum/WeightedSum/Mean operators with fp16 engine

* make clang happy and get fewer warnings

make clang happy and get fewer warnings

* [Personalization] Support add_output_schema() in layer_model_helper

Problem:
Currently the output_schema of sparse_nn can only be set once. https://fburl.com/efth5zer.

Solution:
For flexibility, we want to add fields to output_schema incrementally.

Plan:
Wrap the change of `model._output_schema` into a new function `add_output_schema()` for adding additional output_schema.

Callsite:
The add_output_schema() should be called instead at https://fburl.com/efth5zer

Reference:
The newly added `add_output_schema()` will be similar to `add_loss()` in https://fburl.com/t2ii8njh
2018-03-12 12:22:59 -07:00
..
docs Build doxygen docs with cmake and fix catalog generation 2018-01-18 18:47:59 -08:00
examples Fix one more naming issue in resnet50_trainer.py for PR 2205 2018-03-09 13:51:42 -08:00
helpers Work on fp16 conv op 2018-03-05 21:13:03 -08:00
layers Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
mint Re-license to Apache 2017-09-28 16:22:00 -07:00
mkl Add SqueezeOp in MKLDNN 2018-01-22 18:39:42 -08:00
modeling Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
models no_bias in resnet32x32 (#1817) 2018-02-24 16:58:23 -08:00
onnx Report all errors during ONNX backend translation rather than failing fast (#2210) 2018-03-09 10:58:22 -08:00
operator_test Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
predictor Update caffe2 from facebook (#2178) 2018-03-07 16:41:22 -08:00
rnn Fix RNN scoping situation 2018-02-07 17:35:29 -08:00
test Fix occasional test timeouts 2018-01-19 20:08:58 -08:00
__init__.py Experimental support for setup.py develop mode install 2018-02-12 23:36:18 -08:00
_import_c_extension.py Re-license to Apache 2017-09-28 16:22:00 -07:00
allcompare_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
attention.py Re-license to Apache 2017-09-28 16:22:00 -07:00
benchmark_generator.py Re-license to Apache 2017-09-28 16:22:00 -07:00
binarysize.py Re-license to Apache 2017-09-28 16:22:00 -07:00
brew_test.py Add if and while ops to brew 2017-12-05 17:33:34 -08:00
brew.py Revamp MNIST tutorial 2018-01-26 09:17:31 -08:00
build.py Expose CMake options in the binary 2017-10-04 02:33:02 -07:00
cached_reader.py Cached reader 2017-11-15 12:38:49 -08:00
caffe_translator_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
caffe_translator.py Fix for wrong newline in caffe_translator.py (Crop layer translation) 2018-01-12 16:17:53 -08:00
checkpoint_test.py Return an empty TaskGroup if node managers exist in MultiNodeCheckpointManager 2018-01-30 19:20:50 -08:00
checkpoint.py [DT] [4/n] Make epoch_group explicit for JobRunner (#2018) 2018-02-23 10:41:52 -08:00
CMakeLists.txt Fix OSS build 2017-12-21 19:04:25 -08:00
cnn.py Re-license to Apache 2017-09-28 16:22:00 -07:00
context_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
context.py Re-license to Apache 2017-09-28 16:22:00 -07:00
control_ops_grad.py Backpropagation for While op 2017-12-18 16:03:45 -08:00
control_ops_util.py Backpropagation for While op 2017-12-18 16:03:45 -08:00
control_test.py Revert D6026557: [caffe2][PR] Fix "No handlers could be found for logger" 2017-10-12 20:21:52 -07:00
control.py Re-license to Apache 2017-09-28 16:22:00 -07:00
convnet_benchmarks_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
convnet_benchmarks.py Re-license to Apache 2017-09-28 16:22:00 -07:00
core_gradients_test.py fix all the broken tests from adding debug info (#2013) 2018-02-22 17:43:53 -08:00
core_test.py hacks to test exception handling and python operator backtraces 2018-03-07 15:09:17 -08:00
core.py hacks to test exception handling and python operator backtraces 2018-03-07 15:09:17 -08:00
crf.py Re-license to Apache 2017-09-28 16:22:00 -07:00
data_parallel_model_test.py Add support for multi-device batch normalization through an option to data_parallel_model 2018-01-24 13:24:06 -08:00
data_parallel_model_utils.py Allow shifting of activations / ops to other GPUs in data parallel model 2017-11-29 21:17:00 -08:00
data_parallel_model.py Add support for multi-device batch normalization through an option to data_parallel_model 2018-01-24 13:24:06 -08:00
data_workers_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
data_workers.py move print to logger 2017-11-17 18:03:44 -08:00
dataio_test.py Adding a time limit reader 2018-01-02 11:33:53 -08:00
dataio.py Pass in task groups to PipedReaderBuilder (#2182) 2018-03-08 16:16:57 -08:00
dataset.py Re-license to Apache 2017-09-28 16:22:00 -07:00
db_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
device_checker.py Re-license to Apache 2017-09-28 16:22:00 -07:00
dlpack.h Support for DLPack in Python op 2017-12-21 17:02:16 -08:00
dyndep.py Re-license to Apache 2017-09-28 16:22:00 -07:00
embedding_generation_benchmark.py Re-license to Apache 2017-09-28 16:22:00 -07:00
experiment_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
extension_loader.py Re-license to Apache 2017-09-28 16:22:00 -07:00
functional_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
functional.py Re-license to Apache 2017-09-28 16:22:00 -07:00
fused_8bit_rowwise_conversion_ops_test.py Add float32 <-> fused_rowwise_8bit conversion Caffe2 operators 2018-01-19 15:44:33 -08:00
gradient_check_test.py Backpropagation for While op 2017-12-18 16:03:45 -08:00
gradient_checker.py Re-license to Apache 2017-09-28 16:22:00 -07:00
gru_cell.py GRU sequence lengths: allow unspecified sequence lengths 2018-02-05 13:20:05 -08:00
hsm_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
hypothesis_test_util.py Ensure indices list in sparse optimizer tests is unique 2018-01-03 12:19:14 -08:00
hypothesis_test.py Patch cuda-convnet2 from internal Facebook changes. 2018-02-28 14:20:48 -08:00
layer_model_helper.py Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
layer_model_instantiator.py [Dper2] Add NetModifier abstraction and support for plotting the norm of blobs (#2201) 2018-03-08 13:41:32 -08:00
layer_parameter_sharing_test.py Add shape checks and print more info in parameter sharing 2017-10-27 01:22:06 -07:00
layer_test_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
layers_test.py Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
lengths_reducer_fused_8bit_rowwise_ops_test.py Integrate Fused8BitRowwise ops with DPER 2018-01-25 15:02:42 -08:00
lengths_reducer_rowwise_8bit_ops_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
lstm_benchmark.py Re-license to Apache 2017-09-28 16:22:00 -07:00
memonger_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
memonger.py Revert D6026557: [caffe2][PR] Fix "No handlers could be found for logger" 2017-10-12 20:21:52 -07:00
mkl_test_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
model_device_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
model_helper.py Add regularization for sparse features 2018-02-02 16:03:32 -08:00
modifier_context.py Re-license to Apache 2017-09-28 16:22:00 -07:00
mpi_python.cc Upgrade to 2.2.1 2017-10-22 13:26:56 -07:00
muji_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
muji.py Re-license to Apache 2017-09-28 16:22:00 -07:00
net_builder_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
net_builder.py Minor documentation fix in NetBuiler 2017-11-15 16:22:22 -08:00
net_drawer.py Revert D6026557: [caffe2][PR] Fix "No handlers could be found for logger" 2017-10-12 20:21:52 -07:00
net_printer_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
net_printer.py Separate parameter downloading tasks from training tasks and run them in a different group 2018-01-22 14:04:12 -08:00
numa_benchmark.py [core] Graph-level NUMA awareness in Caffe2 2018-03-06 00:33:11 -08:00
numa_test.py [core] NUMA-aware pinned allocator 2018-03-06 00:33:11 -08:00
observer_test.py Fix RNN scoping situation 2018-02-07 17:35:29 -08:00
optimizer_context.py Re-license to Apache 2017-09-28 16:22:00 -07:00
optimizer_test_util.py Clean up naming of FP16-related code, add comments 2018-03-09 13:51:42 -08:00
optimizer_test.py Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
optimizer.py Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
parallel_workers_test.py Add shutdown_fun to parallel_workers 2017-10-10 12:02:24 -07:00
parallel_workers.py Caffe2: serialize init for parallel workers 2018-01-31 17:50:10 -08:00
parallelize_bmuf_distributed_test.py BMUF cpu support 2017-11-19 23:41:25 -08:00
pipeline_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
pipeline.py Re-license to Apache 2017-09-28 16:22:00 -07:00
predictor_constants.py Re-license to Apache 2017-09-28 16:22:00 -07:00
pybind_state_dlpack.cc Support for DLPack in Python op 2017-12-21 17:02:16 -08:00
pybind_state_dlpack.h Support for DLPack in Python op 2017-12-21 17:02:16 -08:00
pybind_state_gpu.cc [C2 OSS][GPU]exposing totalGlobalMem info to workspace python 2018-02-26 10:26:25 -08:00
pybind_state_mkl.cc Update caffe2 from facebook 4f527ef46abf (#2234) 2018-03-12 12:22:59 -07:00
pybind_state.cc [core] Graph-level NUMA awareness in Caffe2 2018-03-06 00:33:11 -08:00
pybind_state.h Support for DLPack in Python op 2017-12-21 17:02:16 -08:00
python_op_test.py Throw Python exception from PythonOp instead of logging 2017-11-20 09:03:17 -08:00
queue_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
record_queue.py Re-license to Apache 2017-09-28 16:22:00 -07:00
recurrent.py Remove scoping assertion because it is not useful and causing errors 2017-12-11 18:03:45 -08:00
regularizer_context.py Re-license to Apache 2017-09-28 16:22:00 -07:00
regularizer_test.py add dense regularization 2018-01-08 13:03:17 -08:00
regularizer.py Add regularization for sparse features 2018-02-02 16:03:32 -08:00
rnn_cell.py Scope MultiRNN blobs with name as well as layers (#2025) 2018-03-02 22:21:07 -08:00
schema_test.py add struct get method 2017-12-19 12:35:56 -08:00
schema.py change schema's __repr__() flat output to pprint style indented output 2018-02-16 16:26:11 -08:00
scope_test.py Add a EmptyDeviceScope (i.e. allow setting CurrentDeviceScope() to None) 2017-11-02 11:25:48 -07:00
scope.py Fix RNN scoping situation 2018-02-07 17:35:29 -08:00
session_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
session.py Return an empty TaskGroup if node managers exist in MultiNodeCheckpointManager 2018-01-30 19:20:50 -08:00
sparse_to_dense_mask_test.py Skip negative indices 2017-10-09 16:09:50 -07:00
sparse_to_dense_test.py add sparse_to_dense_test 2018-02-01 08:14:37 -08:00
task.py Re-license to Apache 2017-09-28 16:22:00 -07:00
test_util.py Re-license to Apache 2017-09-28 16:22:00 -07:00
text_file_reader.py Re-license to Apache 2017-09-28 16:22:00 -07:00
timeout_guard.py Generate a core dump when CompleteInTimeOrDie forcefully quits 2018-02-08 14:08:51 -08:00
toy_regression_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
tt_core_test.py Re-license to Apache 2017-09-28 16:22:00 -07:00
tt_core.py Re-license to Apache 2017-09-28 16:22:00 -07:00
utils.py fix Caffe2TensorToNumpyArray for py3 2018-03-06 00:33:11 -08:00
visualize.py Re-license to Apache 2017-09-28 16:22:00 -07:00
workspace_test.py Remove Set/GetDefaultGPUID and move to use current gpu id instead. 2018-01-19 18:03:21 -08:00
workspace.py hacks to test exception handling and python operator backtraces 2018-03-07 15:09:17 -08:00