Commit Graph

18 Commits

Author SHA1 Message Date
Zachary DeVito
92314c83fa re-enable copy of python files, but be careful that the copy is only … (#14982)
Summary:
…done once

This allows no-op builds to work correctly even when BUILD_CAFFE2_OPS is on.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14982

Differential Revision: D13413960

Pulled By: zdevito

fbshipit-source-id: 6e5412a8c375af8a47c76f548cdd31cff15f3853
2018-12-11 16:54:08 -08:00
Orion Reblitz-Richardson
9ec0a2aef4 fbshipit-source-id: ba600fcd2b5cefc7621357bdeb05e24cea02e5af 2018-06-27 04:50:56 -07:00
Orion Reblitz-Richardson
1d5780d42c Remove Apache headers from source.
* LICENSE file contains details, so removing from individual source files.
2018-03-27 13:10:18 -07:00
Yangqing Jia
8286ce1e3a Re-license to Apache
Summary: Closes https://github.com/caffe2/caffe2/pull/1260

Differential Revision: D5906739

Pulled By: Yangqing

fbshipit-source-id: e482ba9ba60b5337d9165f28f7ec68d4518a0902
2017-09-28 16:22:00 -07:00
Aapo Kyrola
65e675e3e1 Fix net construct bench
Summary: Net construct bench was using an old version of the data_parallel_model API.

Reviewed By: bddppq

Differential Revision: D5453281

Tags: easy

fbshipit-source-id: 93e1ba58511c7b25235ee50d9862fd0614b344c9
2017-07-19 11:23:39 -07:00
Junjie Bai
4fddc04054 Use the same scheme of switching to device reduce sum for SumSqrElements
Summary: Based on the benchmark script located at `caffe2/experiments/python/device_reduce_sum_bench.py`, device reduce sum is slower for N <= 10000, so we only switch to device reduce for large N in SumElements. This diff applies the same scheme to SumSqrElements.

Reviewed By: jamesr66a

Differential Revision: D5369868

fbshipit-source-id: ae13a611aff9d3464d1c4950ee155c740a2da339
2017-07-05 10:52:17 -07:00
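The size-based dispatch described in the commit above can be sketched as follows. This is a hypothetical illustration, not the actual caffe2 implementation: the function names are invented, and the 10000 threshold is the one cited in the commit message.

```python
import math

# Threshold from the benchmark cited in the commit message: the device
# reduce path only wins for inputs larger than this.
DEVICE_REDUCE_THRESHOLD = 10000

def simple_sum_sqr(x):
    # Naive square-and-accumulate loop, cheap enough for small N.
    total = 0.0
    for v in x:
        total += v * v
    return total

def device_sum_sqr(x):
    # Stand-in for the cub::DeviceReduce path; in the real code this is a
    # single GPU reduction kernel. Here math.fsum plays the "fast path".
    return math.fsum(v * v for v in x)

def sum_sqr_elements(x):
    # Switch implementations based on input size, as the commit describes.
    if len(x) <= DEVICE_REDUCE_THRESHOLD:
        return simple_sum_sqr(x)
    return device_sum_sqr(x)
```

Both paths compute the same result; only the crossover point (measured by the benchmark script) decides which one runs.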
Junjie Bai
f3a59aedff Use cub::DeviceReduce for faster math::Sum CUDA version
Summary: Port SumElements and softmax_ops.cu to use device reduce sum

Reviewed By: akyrola

Differential Revision: D5351881

fbshipit-source-id: ca9604186c261ffcb1480da2a17baab8a4809372
2017-06-30 15:04:06 -07:00
haracejacob
2ec294a8bb Fix a few typos and grammatical errors in comments
Summary:
Fix a few typos and grammatical errors in comments

using language-check, a Python grammar-checking library.
The spell_checker source code is here: https://github.com/17-1-SKKU-OSS/011A/blob/master/spell_checker/spell_checker.py
Here is the text file which indicates what should be fixed: https://github.com/17-1-SKKU-OSS/011A/tree/master/spell_checker/fix/caffe2
Closes https://github.com/caffe2/caffe2/pull/719

Differential Revision: D5165118

Pulled By: aaronmarkham

fbshipit-source-id: 7fb8ef7a99d03cd5fd2f9ebdb01b9865e90fc37b
2017-06-14 18:22:39 -07:00
Thomas Dudziak
60c78d6160 Fixes range/xrange for Python 3
Summary: As title

Differential Revision: D5151894

fbshipit-source-id: 7badce5d3122e8f2526a7170fbdcf0d0b66e2638
2017-06-07 00:04:26 -07:00
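The range/xrange fix above follows a standard porting pattern: Python 2's lazy `xrange` was renamed to `range` in Python 3. A common compatibility shim (a sketch of the general technique, not necessarily the exact change in this diff) looks like this:

```python
import sys

# Compatibility shim: on Python 2, prefer the lazy xrange; on Python 3,
# the built-in range is already lazy and xrange no longer exists.
if sys.version_info[0] >= 3:
    range_ = range
else:
    range_ = xrange  # noqa: F821 -- only defined on Python 2

def sum_first_n(n):
    # Iterates lazily on both Python 2 and Python 3.
    total = 0
    for i in range_(n):
        total += i
    return total
```

Projects of this era often pulled the same alias from `six.moves` instead of defining it by hand.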
Pieter Noordhuis
bbd7aee9ab Revert D4952993: [Caffe2] fix mkl_sparse and migrate sparsity experiments
Summary: This reverts commit 86c03676ab4e47f04d2d0dd438a4a1c849bbbff0

Differential Revision: D4952993

fbshipit-source-id: 5c213c48ac44ce6aefccacc6d80534648d3c516a
2017-05-17 14:46:56 -07:00
Yiming Wu
f359d70ae7 fix mkl_sparse and migrate sparsity experiments
Summary:
Migrate the experiments folder to the fb/sparse folder. Keep FunHashOp and SparseFunHashOp because they are now assumed to be default ops in depr. What I did:

  1. Migrate FunHashOp and SparseFunHashOp and their unit tests to core caffe2; make sure the tests pass.
  2. Migrate the other ops in the experiments folder to the fb/sparse folder and write new TARGETS files for them; make sure the tests pass.
  3. Make sure all related tests pass.
  4. Fix the MKL definition along the way; make sure that FC_Sparse is not compiled when there is no MKL support.

Reviewed By: salexspb

Differential Revision: D4952993

fbshipit-source-id: 86c03676ab4e47f04d2d0dd438a4a1c849bbbff0
2017-05-16 18:33:51 -07:00
Alexander Sidorov
fc77ae1736 remove some experimental files from the open-source repo
Differential Revision: D4948835

fbshipit-source-id: 1115914a19d70ae214557132f24e4c302470f47e
2017-04-25 13:31:50 -07:00
Aaron Markham
58f7f2b441 doxygen python block added
Summary: Closes https://github.com/caffe2/caffe2/pull/226

Differential Revision: D4793550

Pulled By: JoelMarcey

fbshipit-source-id: cc33e58186304fa8dcac2ee9115dcc271d785b1e
2017-03-29 06:46:16 -07:00
Fei Sun
fc2b6e8ed6 Migrate build_sgd to python directory
Summary:
Currently build_sgd is in a Facebook-specific directory. It needs to move to the python directory so that
the open-source world can use it.

Reviewed By: salexspb

Differential Revision: D4547016

fbshipit-source-id: d699b7b1ab8051afdeadedb4d247ec2a04a7a3e7
2017-02-13 13:31:37 -08:00
Bram Wasti
3833dad5f6 manual sync of old, never-synced files 2017-01-06 15:28:45 -08:00
Aapo Kyrola
d38499f727 Optimize BlobIsDefined() + benchmark --> net construction 95 secs to 8.2 secs!
Summary:
I have noticed that constructing the Xray model takes quite a while. To measure this, I wrote a benchmark script that creates a resnet-50 model on 8 gpus. This takes about 95 secs -- which is kind of annoying when you want to quickly debug stuff.

Profiling (using Python's cProfile), I was able to see that most of the time is spent in net.BlobIsDefined(), which does a linear search over external inputs and operator outputs. Thus it gets slower and slower with large nets. This can be fully optimized by keeping a separate lookup table of operator inputs and outputs (and external inputs and outputs). It is a bit annoying to keep this separate data structure, but I set up the unit tests to ensure things are done correctly over Clones.

After the optimization, the net construction drops from 95 secs to 8.2 secs!

Reviewed By: azzolini

Differential Revision: D4288307

fbshipit-source-id: 0bb82c8bde9d86a2702b298f4aa706cba509346e
2016-12-15 12:01:30 -08:00
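The optimization the commit above describes — replacing a per-call linear scan with a lookup table maintained as the net is built — can be sketched as follows. The class and method names are hypothetical and do not reflect the actual caffe2 Net API.

```python
class NetSketch:
    """Hypothetical sketch of the BlobIsDefined() optimization: keep a
    set of known blob names updated as ops are added, so the membership
    test is O(1) instead of a linear scan over all operators per call."""

    def __init__(self):
        self._ops = []                # (inputs, outputs) per operator
        self._external_inputs = []
        self._defined_blobs = set()   # the separate lookup table

    def add_external_input(self, name):
        self._external_inputs.append(name)
        self._defined_blobs.add(name)

    def add_op(self, inputs, outputs):
        self._ops.append((list(inputs), list(outputs)))
        self._defined_blobs.update(outputs)

    def blob_is_defined_slow(self, name):
        # The old behavior: linear search, cost grows with net size.
        if name in self._external_inputs:
            return True
        return any(name in outs for _, outs in self._ops)

    def blob_is_defined(self, name):
        # The optimized behavior: constant-time set membership.
        return name in self._defined_blobs
```

The commit notes the cost of this approach: the table must stay consistent with the op list (including across Clones), which is why the unit tests guard that invariant.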
Yangqing Jia
f09d2b2b35 changes to make c2 build. 2016-07-21 16:39:08 -07:00
Yangqing Jia
09bed67e4f add untracked files 2016-07-21 11:26:41 -07:00