pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Orion Reblitz-Richardson	1d5780d42c	Remove Apache headers from source. * LICENSE file contains details, so removing from individual source files.	2018-03-27 13:10:18 -07:00
Pieter Noordhuis	d4db1b90a1	Resuppress adagrad health checks Summary: Commit 479e4ce5 didn't end up solving the health checks firing and they are likely still caused by the remaining `assume` calls. Closes https://github.com/caffe2/caffe2/pull/1625 Differential Revision: D6573036 Pulled By: pietern fbshipit-source-id: eeb21bdd61dca0a632eb1ba9e529177ac2569bfd	2017-12-14 16:34:41 -08:00
Dong Li	c16a21b67d	Removed the device_type assumption in adagrad_test Summary: The "assume" statement in adagrad_test leads to a health check failure. Here we remove it by checking dc == hu.gpu_do Reviewed By: pietern Differential Revision: D6513314 fbshipit-source-id: 4caf2d938e5f5935a95cca8abd99185182223d63	2017-12-13 03:35:51 -08:00
Pieter Noordhuis	fca617c62f	Suppress hypothesis health check in adagrad_test.py Summary: PR #1536 suppressed test_sparse_adagrad but test_row_wise_sparse_adagrad also filters too many examples. Suppress health checks for this test as well. Closes https://github.com/caffe2/caffe2/pull/1599 Differential Revision: D6530850 Pulled By: pietern fbshipit-source-id: c73f30d2e104565421e3e381b1cf66185edc833e	2017-12-10 11:47:15 -08:00
Pieter Noordhuis	6f218cef25	Suppress hypothesis health check in adagrad_test.py Summary: With some test seeds this warning starts firing. Should be addressed in a better way, not generating as many invalid examples. Closes https://github.com/caffe2/caffe2/pull/1536 Reviewed By: bddppq Differential Revision: D6437138 Pulled By: pietern fbshipit-source-id: c619d928a585e3d887f686db5d98f841af10c56b	2017-11-29 11:35:04 -08:00
Dong Li	3bfabb4d5f	Support float16 input for operator SparseAdagrad Summary: Implemented a new CUDA class for operator SparseAdagrad. The param and moment inputs can now be float or float16. The functions for mixed-precision add/mult/store are defined in a separate header file ("caffe2/core/float16_util.h") for reuse. Reviewed By: azzolini Differential Revision: D5880200 fbshipit-source-id: dca227f38629a03a9d771f42efe2c0b673075c4d	2017-10-30 19:32:30 -07:00
Yangqing Jia	8286ce1e3a	Re-license to Apache Summary: Closes https://github.com/caffe2/caffe2/pull/1260 Differential Revision: D5906739 Pulled By: Yangqing fbshipit-source-id: e482ba9ba60b5337d9165f28f7ec68d4518a0902	2017-09-28 16:22:00 -07:00
Frank Jiang	0a5ee1e806	Implemented RowWiseSparseAdagrad operator that only keeps one moment term per embedding Summary: Implemented version of SparseAdagrad that only keeps track of an average sum of squared gradients term for each row of the parameter tensor, rather than a sum of squared gradients term for each individual parameter. Differential Revision: D5881918 fbshipit-source-id: bd96ccf25554b457baaaca9309fc8048adbb37f7	2017-09-26 13:34:44 -07:00
Dmytro Dzhulgakov	cf1ce29631	Fix GPU SparseAdaGrad with empty tensors Summary: CUDA doesn't like 0-sized grids :) Reviewed By: Yangqing Differential Revision: D5495805 fbshipit-source-id: 6819513024978ee6bb70a39b25d23ced06465750	2017-07-25 23:50:54 -07:00
Luke Yeager	a47652379f	Fix SparseAdagrad for indices.ndim>1 Summary: Same fix as https://github.com/caffe2/caffe2/pull/249, but for SparseAdagrad. Also update the tests for both ops to test this functionality. Closes https://github.com/caffe2/caffe2/pull/675 Differential Revision: D5148750 Pulled By: akyrola fbshipit-source-id: d30b722429bc547fd53400c1a29e4ee9e2e6ed18	2017-05-30 12:02:18 -07:00
Luke Yeager	8bd0522c20	Add tests and GPU impls for sparse optimizers Summary: These GPU paths are probably even buggier than the CPU paths for sparse gradients with duplicate indices. Both paths cause multiple momentum updates in a single iteration, but only the GPU path is non-deterministic. Depending on how we decide to address the issues on the CPU path, pooyadavoodi has a good idea for how to match dense behavior with the sparse GPU ops. Closes https://github.com/caffe2/caffe2/pull/254 Reviewed By: bwasti Differential Revision: D4871680 Pulled By: dzhulgakov fbshipit-source-id: 220be57a0f699a22ea85ed4f7022d92d362d06b3	2017-04-13 11:07:40 -07:00

11 Commits
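
The RowWiseSparseAdagrad commit (0a5ee1e806) describes keeping one averaged sum-of-squared-gradients term per row of the parameter tensor instead of one term per parameter. The following is a minimal pure-Python sketch of that difference, not the Caffe2 implementation: the function names, the `lr` and `eps` parameters, the placement of `eps` outside the square root, and the sign convention of the update are all assumptions made for illustration.

```python
import math


def sparse_adagrad_step(param, moment, indices, grads, lr=0.1, eps=1e-5):
    """Per-element variant (hypothetical helper): moment has the same shape as param."""
    for idx, g_row in zip(indices, grads):
        for j, g in enumerate(g_row):
            # One accumulated squared-gradient term per parameter.
            moment[idx][j] += g * g
            param[idx][j] -= lr * g / (math.sqrt(moment[idx][j]) + eps)


def row_wise_sparse_adagrad_step(param, moment, indices, grads, lr=0.1, eps=1e-5):
    """Row-wise variant (hypothetical helper): moment holds one float per row of param."""
    for idx, g_row in zip(indices, grads):
        # One term per row: the mean of the squared gradients in that row.
        moment[idx] += sum(g * g for g in g_row) / len(g_row)
        scale = lr / (math.sqrt(moment[idx]) + eps)
        for j, g in enumerate(g_row):
            param[idx][j] -= scale * g


# Example: a 3-row embedding table where only rows 0 and 2 receive gradients.
param = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]
row_moment = [0.0, 0.0, 0.0]
row_wise_sparse_adagrad_step(param, row_moment, [0, 2], [[1.0, -1.0], [0.5, 0.5]])
```

In this sketch, an index that appears twice in `indices` triggers two sequential moment updates within one call, which is the kind of behaviour the 8bd0522c20 message flags for sparse gradients with duplicate indices.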