pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Richard Barnes	1433160a36	use irange for loops 6 (#66742 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66742 Modified loops in files under fbsource/fbcode/caffe2/ from the format `for(TYPE var=x0;var<x_max;x++)` to the format `for(const auto var: irange(xmax))` This was achieved by running r-barnes's loop upgrader script (D28874212) with some modification to exclude all files under /torch/jit and a number of reversions or unused variable suppression warnings added by hand. Test Plan: Sandcastle Reviewed By: malfet Differential Revision: D31705366 fbshipit-source-id: be58222426c192406a7f93c21582c3f6f2082401	2021-12-07 16:07:50 -08:00
Xue Li	2f099c7555	Revert D30652629: use irange for loops Test Plan: revert-hammer Differential Revision: D30652629 (`687c2267d4`) Original commit changeset: 0ae6c4bbbb55 fbshipit-source-id: 5c4f067b584a021c8c9656454d1ee60999600fb3	2021-10-15 15:23:10 -07:00
Richard Barnes	687c2267d4	use irange for loops (#66234 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/66234 Modified loops in files under fbsource/fbcode/caffe2/ from the format `for(TYPE var=x0;var<x_max;x++)` to the format `for(const auto var: irange(xmax))` This was achieved by running r-barnes's loop upgrader script (D28874212) with some modification to exclude all files under /torch/jit and a number of reversions or unused variable suppression warnings added by hand. bypass_size_limit allow-large-files Test Plan: Sandcastle Reviewed By: ngimel Differential Revision: D30652629 fbshipit-source-id: 0ae6c4bbbb554bad42e372792a6430e1acf15e3e	2021-10-15 13:50:33 -07:00
Nikita Shulga	4cb534f92e	Make PyTorch code-base clang-tidy compliant (#56892 ) Summary: This is an automatic change generated by the following script: ``` #!/usr/bin/env python3 from subprocess import check_output, check_call import os def get_compiled_files_list(): import json with open("build/compile_commands.json") as f: data = json.load(f) files = [os.path.relpath(node['file']) for node in data] for idx, fname in enumerate(files): if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'): files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')] return files def run_clang_tidy(fname): check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"]) changes = check_output(["git", "ls-files", "-m"]) if len(changes) == 0: return check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"]) def main(): git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n") compiled_files = get_compiled_files_list() for idx, fname in enumerate(git_files): if fname not in compiled_files: continue if fname.startswith("caffe2/contrib/aten/"): continue print(f"[{idx}/{len(git_files)}] Processing {fname}") run_clang_tidy(fname) if __name__ == "__main__": main() ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892 Reviewed By: H-Huang Differential Revision: D27991944 Pulled By: malfet fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179	2021-04-28 14:10:25 -07:00
Hector Yuen	ac8d1a1f76	fix some issues found by enabling -Wshorten-64-to-32 (#18187 ) Summary: when enabling this flag, there were a lot of warnings, this pr focuses on the warnings where this comparison could be affecting array indices, which could be ones most prone to fail the good news is that I didn't find anything obviously concerning one degenerate case could be when the matrices we work with are too skinny could run into issues (dim1=1, dim2 needs to hold a big number) Pull Request resolved: https://github.com/pytorch/pytorch/pull/18187 Differential Revision: D14527182 Pulled By: hyuen fbshipit-source-id: b9f46b6f68ab912c55368961758a7a5af1805555	2019-06-14 16:29:32 -07:00
Hector Yuen	7bb36ada1f	fix -Wsign-compare warnings for some files inside c2 (#18123 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18123 the motivation of this fix is to resolve things like: for(auto i = 0; i < N; i++) where N is bigger than int32 These instances of comparison were found by enabling -Wsign-compare There are way too many things to fix, so issuing this as a series of fixes The plan is to fix all these issues and then enable this flag into Caffe2 to catch future instances Reviewed By: ZolotukhinM Differential Revision: D14497094 fbshipit-source-id: bca3927a2188bd33a508fa503ba221c220cdaefe	2019-03-19 10:39:20 -07:00
Xiaomeng Yang	4ae9ab24b6	Update conv_base to support empty batch (#16603 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16603 Update conv_base to support empty batch Reviewed By: houseroad Differential Revision: D13894111 fbshipit-source-id: fc4370ff16ba6046f374e77bd845d28e6af05ea3	2019-01-31 23:46:18 -08:00
Jerry Zhang	2af95d8e3e	Back out "[pt1][tensor] Change ConvPoolOp<Context>::SetOutputSize to ConvPoolOp<Context>::GetOutputSize" (#16516 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16516 Original commit changeset: 64abce3dbaed Reviewed By: dzhulgakov Differential Revision: D13863715 fbshipit-source-id: f1923fdca4a1a82768d9c280a8493ff15a7eb2ba	2019-01-30 12:50:38 -08:00
Jerry Zhang	ff963d4b9f	Change ConvPoolOp<Context>::SetOutputSize to ConvPoolOp<Context>::GetOutputSize (#16273 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16273 Previously we have SetOutputSize which accept a partially initialized Output Tensor and set it to the correct size, the diff change this to GetOutputSize that returns the correct size instead. e.g. ``` auto* Y = Output(0); ConvPoolOp<Context>::SetOutputSize(X, Y, channels); ... Y->mutable_data<T>... ``` --> ``` auto sizes = ConvPoolOp<Context>::GetOutputSize(X, channels); auto* Y = Output(0, sizes, at::dtype<T>()); ``` Reviewed By: dzhulgakov Differential Revision: D13736281 fbshipit-source-id: 64abce3dbaed0b375098463333dfd0ea5a3b1945	2019-01-28 15:56:34 -08:00
Jerry Zhang	eb15587c99	Tensor reinitialization codemod - 2/5 (#15947 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15947 Codemod generated with clangr shard mode, 25 files per diff, To eliminiate partially initialized Tensor, we split the initialization of local Tensor variables into two steps, first declare un uninitialized Tensor, and call `ReinitializeTensor` to initialize it. motivation: https://github.com/pytorch/pytorch/pull/12407 Reviewed By: smessmer Differential Revision: D13586732 fbshipit-source-id: 5295ab27ca0155f96a4fccf9c0ba8a609101ba24	2019-01-11 15:05:01 -08:00
Jongsoo Park	1159302ab1	bug fix in 3d group conv (#15625 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15625 3D group conv (both NCHW and NHWC layout) was not correct. Added group=2 in test_1d_convolution and test_3d_convolution in conv_test Reviewed By: protonu Differential Revision: D13562099 fbshipit-source-id: 586e8a7574a2764f2a3b559db6c2415b3ab90453	2019-01-03 09:46:49 -08:00
Jerry Zhang	f1f7c16c90	Tensor construction codemod(ResizeLike) - 4/7 (#15088 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/15088 Codemod generated with clangr shard mode, 25 files per diff, motivation: https://github.com/pytorch/pytorch/pull/12407 Reviewed By: ezyang Differential Revision: D13419682 fbshipit-source-id: 3e59403bc1c0e71e5cb66df932ed0c6a0a72e643	2018-12-13 13:39:56 -08:00
Jerry Zhang	1c2ed4eb23	Tensor construction: combine Resize+mutable_data - 1/4 (#13942 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13942 Codemod generated with clangr shard mode, 25 files per diff, motivation: https://github.com/pytorch/pytorch/pull/12407 Reviewed By: smessmer Differential Revision: D13054770 fbshipit-source-id: a9e86e5dfcb4f7cebf5243e1d359fad064561bed	2018-11-19 15:33:50 -08:00
Junjie Bai	fbd50bbfb9	Revert D13007246: [codemod][caffe2] Tensor construction: combine Resize+mutable_data - 1/4 Differential Revision: D13007246 Original commit changeset: 230de42a3843 fbshipit-source-id: 40ce266826f00d320f7215169188ef4ead232660	2018-11-13 16:41:52 -08:00
Jerry Zhang	9d36c37bdb	Tensor construction: combine Resize+mutable_data - 1/4 (#13853 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13853 Codemod generated with clangr shard mode, 25 files per diff, motivation: https://github.com/pytorch/pytorch/pull/12407 Reviewed By: smessmer Differential Revision: D13007246 fbshipit-source-id: 230de42a3843d71599e812d5511f52f3af47f59b	2018-11-13 12:26:02 -08:00
Jerry Zhang	b652c2de50	Rename dim(i) -> size(i) Summary: Codemod generated with clangr shard mode, 50 files per diff, clangr code(dim(i)->size(i)): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp Reviewed By: ezyang Differential Revision: D12935287 fbshipit-source-id: 700050640c756d7064c8db4fd50fe6a1421a61ef	2018-11-07 11:07:26 -08:00
Jerry Zhang	3c32f897ca	Rename ndim() -> dim() - 3/6 Summary: Codemod generated with clangr shard mode, 50 files per diff, clangr code(ndim()->dim()): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp Reviewed By: dzhulgakov Differential Revision: D12935748 fbshipit-source-id: fccec04e28ec049789f772e70d691382cb8927e0	2018-11-05 23:21:40 -08:00
Jongsoo Park	54e8623d26	3D Conv in NHWC layout (#12733 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12733 Conv in NHWC layout only works for 2D images. This has been a pain point when implementing quantized 3D convolution because we need NHWC layout for best performance (note that NHWC layout in general gives better performance in CPU not just for quantized operators). For example, our quantized ops have a functionality to measure quantized error operator by operator but this needs running a shadow fp32 operator, but this is not easy when there's no 3D conv in NHWC layout is available (currently we're doing layout conversion on the fly for the shadow fp32 operator which is error prone). Some of Caffe2 frameworks like brew generates error when we try to create a 3D conv op in NHWC layout. This was also a blocker for using aibench because aibench is using brew. i-am-not-moving-c2-to-c10 Reviewed By: houseroad Differential Revision: D10333829 fbshipit-source-id: 2d203ee1db833cd3f9d39353219e3894b46c4389	2018-11-04 21:50:09 -08:00
Jongsoo Park	f000101b81	add a few comments on layout after im2col (#12429 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12429 Comments to clarify layout after NHWC im2col for group convolution. i-am-not-moving-c2-to-c10 Reviewed By: houseroad Differential Revision: D10233284 fbshipit-source-id: 996a69f2f932e02c978abaade7571b00741b6ae8	2018-11-04 11:02:58 -08:00
Jerry Zhang	dcbca53e58	Renaming size() to numel() - 1/6 Summary: Codemod generated with clangr shard mode, 50 files per diff Reviewed By: li-roy Differential Revision: D10866373 fbshipit-source-id: 589194164d4fea93b74d83fa7fc4c59558c41f4a	2018-10-29 11:11:19 -07:00
Jerry Zhang	07c0f4a097	Tensor dims() -> sizes() (caffe2/operators) - 1/5 (#13028 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13028 Codemod generated with clangr shard mode, 25 files per diff, for renaming dims() to sizes() Reviewed By: ezyang Differential Revision: D10476220 fbshipit-source-id: 3c3b3d5e2082cd6a1f0ff4a3c8641b30e6f16896	2018-10-24 14:18:18 -07:00
Yangqing Jia	7d5f7ed270	Using c10 namespace across caffe2. (#12714 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12714 This is a short change to enable c10 namespace in caffe2. We did not enable it before due to gflags global variable confusion, but it should have been mostly cleaned now. Right now, the plan on record is that namespace caffe2 and namespace aten will fully be supersets of namespace c10. Most of the diff is codemod, and only two places of non-codemod is in caffe2/core/common.h, where ``` using namespace c10; ``` is added, and in Flags.h, where instead of creating aliasing variables in c10 namespace, we directly put it in the global namespace to match gflags (and same behavior if gflags is not being built with). Reviewed By: dzhulgakov Differential Revision: D10390486 fbshipit-source-id: 5e2df730e28e29a052f513bddc558d9f78a23b9b	2018-10-17 12:57:19 -07:00
Yangqing Jia	38f3d1fc40	move flags to c10 (#12144 ) Summary: still influx. Pull Request resolved: https://github.com/pytorch/pytorch/pull/12144 Reviewed By: smessmer Differential Revision: D10140176 Pulled By: Yangqing fbshipit-source-id: 1a313abed022039333e3925d19f8b3ef2d95306c	2018-10-04 02:09:56 -07:00
Christian Puhrsch	a6630e25af	Remove many caffe2::TIndex and replace them with int64_t (#11943 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11943 See title Reviewed By: ezyang Differential Revision: D9992645 fbshipit-source-id: e8f80d6ea762971513e5e8072975ceea53e1f11a	2018-09-22 18:11:04 -07:00
Jongsoo Park	cc53807be5	group conv with NHWC layout (#10585 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10585 group conv with NHWC layout Reviewed By: BIT-silence Differential Revision: D7547497 fbshipit-source-id: da0ec5a4512c15a0a0d7b79e6ce00c1f8f77f661	2018-08-17 00:39:23 -07:00
Xiaomeng Yang	87cac4c2f1	Update Im2Col related to make preparation for group conv in NHWC order. (#10439 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10439 Update Im2Col related to make preparation for group conv in NHWC order. Reviewed By: houseroad Differential Revision: D9285344 fbshipit-source-id: 1377b0243acb880d2ad9cf73084529a787dcb97d	2018-08-15 17:10:24 -07:00
Jongsoo Park	bd497809e2	CAFFE_ENFORCE -> CAFFE_ENFORCE_EQ for error with more information (#10244 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10244 Use CAFFE_ENFORCE_EQ(x, y) instead of CAFFE_ENFORCE(x == y) in conv_op_impl.h for error messages with more information. Reviewed By: viswanathgs Differential Revision: D9177091 fbshipit-source-id: cf8d10afec1ce6793d3ae0b62f05648722a4130b	2018-08-14 12:24:44 -07:00
Jerry Zhang	aebf3b47ae	Remove template parameter from Tensor (#9939 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/9939 Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13 Pull Request resolved: https://github.com/pytorch/translate/pull/166 Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125 Closes https://github.com/pytorch/pytorch/pull/9125 Use inheritance for polymorphism, and remove template parameter This is to change the templating in call sites, the core implementations will change later Before Caffe2 Tensor class was compile-time fixed to bind to a particular device/context. With this change, we're making it a runtime property (stored inside the tensor), but preserve the same semantics. For example, one has to specify device type in order to create a Tensor - there are no uninitialized tensors. More specifically the changes are: 1. We added an extra argument DeviceType to most of the constructors of the tensor, e.g. (Tensor(DeviceType type)), 2. Semantics of constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context); is changed, in this constructor, the second context is passed in to enable us to call the templated Copy function, it could be in a different context as source and target previously, now we'll enforce that the context should have same device type as src, if it is provided. 3. To preserve 'get-or-construct' semantics of Blob, we added specialized getter Blob::GetMutableTensor that verifies both that Blob contains a Tensor and that it's of a correct type 4. Specifically, Tensor type is not default-constructible any more (as we don't have unknown device tensors) and thus some of the code handling STL containers needs to change Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s. Reviewed By: ezyang, houseroad Differential Revision: D9024330 fbshipit-source-id: e0b8295d2dc6ebe2963383ded5af799ad17164ba	2018-07-27 10:56:39 -07:00
Jerry Zhang	969b62f276	Revert D8121878: Remove template parameter from Tensor Differential Revision: D8121878 Original commit changeset: 4a5e9a677ba4 fbshipit-source-id: d8e2c0bb145b52fbcca323b22d1d3346f0b3249e	2018-07-26 14:02:04 -07:00
Jerry Zhang	cd5adc7b5f	Remove template parameter from Tensor (#13 ) Summary: Pull Request resolved: https://github.com/facebookresearch/weakly-supervised-action-detection/pull/13 Pull Request resolved: https://github.com/pytorch/translate/pull/166 Pull Request resolved: https://github.com/pytorch/pytorch/pull/9125 Closes https://github.com/pytorch/pytorch/pull/9125 Use inheritance for polymorphism, and remove template parameter This is to change the templating in call sites, the core implementations will change later Before Caffe2 Tensor class was compile-time fixed to bind to a particular device/context. With this change, we're making it a runtime property (stored inside the tensor), but preserve the same semantics. For example, one has to specify device type in order to create a Tensor - there are no uninitialized tensors. More specifically the changes are: 1. We added an extra argument DeviceType to most of the constructors of the tensor, e.g. (Tensor(DeviceType type)), 2. Semantics of constructor Tensor(const Tensor<SrcContext>& src, ContextForCopy* context); is changed, in this constructor, the second context is passed in to enable us to call the templated Copy function, it could be in a different context as source and target previously, now we'll enforce that the context should have same device type as src, if it is provided. 3. To preserve 'get-or-construct' semantics of Blob, we added specialized getter Blob::GetMutableTensor that verifies both that Blob contains a Tensor and that it's of a correct type 4. Specifically, Tensor type is not default-constructible any more (as we don't have unknown device tensors) and thus some of the code handling STL containers needs to change Note: Some changes are postponed just to keep this diff a bit smaller. Please see `TODO`s. Reviewed By: xw285cornell Differential Revision: D8121878 fbshipit-source-id: 4a5e9a677ba4ac82095df959851a054c81eccf81	2018-07-26 10:25:23 -07:00
Xiaomeng Yang	5df3eae89e	Add 1x1 specialization for conv with NCHW order (#9671 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/9671 Add 1x1 specialization for conv with NCHW order Reviewed By: houseroad Differential Revision: D8944686 fbshipit-source-id: 94bf44f69498b1934b7dfff4c0e989342c7bb61c	2018-07-23 18:54:58 -07:00
Xiaomeng Yang	921dece2d7	Update Im2ColNd functions (#7505 ) Update Im2ColNd functions	2018-05-12 15:59:50 -07:00
Orion Reblitz-Richardson	1d5780d42c	Remove Apache headers from source. * LICENSE file contains details, so removing from individual source files.	2018-03-27 13:10:18 -07:00
Xiaomeng Yang	afafe8a466	Add LC Layer Summary: Add the 1st version of LC layer. Reviewed By: Yangqing Differential Revision: D6788647 fbshipit-source-id: ebee9215a1d6e1e567548a0fef771802851682a3	2018-01-24 16:51:17 -08:00
Yangqing Jia	8286ce1e3a	Re-license to Apache Summary: Closes https://github.com/caffe2/caffe2/pull/1260 Differential Revision: D5906739 Pulled By: Yangqing fbshipit-source-id: e482ba9ba60b5337d9165f28f7ec68d4518a0902	2017-09-28 16:22:00 -07:00
Viswanath Sivakumar	a4cc9f2fbf	Per-workspace mutex for shared im2col buffer Summary: Shared im2col buffer needs a mutex only to protect it from ops within a workspace (since the shared buffer is created per workspace). The current implementation has a global mutex which affects perf when running multiple nets in parallel. I don't feel great about adding a mutex for this in workspace, let me know if anyone has better suggestions. Reviewed By: akyrola Differential Revision: D5341476 fbshipit-source-id: 1c9a92ef488ffb0c0013a7656bcb3d530bc7208b	2017-06-29 10:19:37 -07:00
Luke Yeager	b229b7ff11	Fix typo 'convlution' Summary: Closes https://github.com/caffe2/caffe2/pull/470 Differential Revision: D5003850 Pulled By: pietern fbshipit-source-id: 62ba13f58dae9f19a434f2075ff3ac143d34feb5	2017-05-04 17:02:35 -07:00
Ahmed Taei	e41d35909a	Conv-ND NCHW CUP/CUDA implementation Summary: Migrate caffe1 ConvNd implementation to caffe2. Reviewed By: Yangqing Differential Revision: D4659868 fbshipit-source-id: 14b178af3faa2c0b12e5a9f7aa76c1d8945419ea	2017-03-20 14:01:07 -07:00
Ahmed Taei	771d169c7c	Extend conv params to handel nd inputs Summary: Extend ConvOp parameters to handel ND convlution input parameters. Differential Revision: D4659838 fbshipit-source-id: 920f40dd80acfd03e04fcc04221209302232906d	2017-03-20 13:18:39 -07:00
Yangqing Jia	642b5a863f	Adding changes that enable MSVC build Summary: MSVC 2015 has known bugs about template functions so these changes aim to fix them - no functional differences introduced. Closes https://github.com/caffe2/caffe2/pull/179 Reviewed By: ajtulloch Differential Revision: D4635241 Pulled By: Yangqing fbshipit-source-id: a282a96e1e626e9440c1e3f3cb15b5b1fa710887	2017-03-01 16:47:58 -08:00
Yangqing Jia	2c6a579859	Make all convolution operators allow optional bias term Summary: It used to be that only the cudnn engine supports it, and now it should be fully supported by any conv engine. To ignore bias, simply use a convolution op that has two inputs instead of 3. The gradient operator will automatically figure out that it does not compute the bias gradient. Reviewed By: prigoyal Differential Revision: D4354183 fbshipit-source-id: cf71b6289a254d15a6a663a85df63fbbaec3702b	2016-12-21 15:14:24 -08:00
Yangqing Jia	6abf5c99dc	Implement group convolution in the cudnn interface. Summary: This is an ongoing work - currently the forward pass is implemented, but backward is yet to be done. We might want a CPU counterpart as well. I will wait for D4341288 to land and then make bias optional. Reviewed By: prigoyal Differential Revision: D4342210 fbshipit-source-id: 51bb0e98d917970bdc040d076b535beb8e994d9a	2016-12-20 13:44:44 -08:00
Yangqing Jia	fd5da05a05	yes it is right. Summary: TSIA Reviewed By: bwasti Differential Revision: D4348965 fbshipit-source-id: 60b95328086cf0bcf9690cef1e8cddfcbe997c72	2016-12-19 15:29:27 -08:00
Simon Layton	05233cd5b8	Make bias optional in cuDNN conv op Summary: Yangqing This seems to work for me, not sure if it's implemented in the right way for you to accept :) Allows user to specify "no_bias" as an option for convolution layers (only cuDNN at this point), so that the bias associated with that operator is not allocated or computed. This is useful in particular for conv + BatchNorm combinations (such as ResNets), as the bias term can be handled by both conv and Batch Norm, wasting memory and computation. Closes https://github.com/caffe2/caffe2/pull/50 Reviewed By: Yangqing Differential Revision: D4341288 Pulled By: bwasti fbshipit-source-id: e6138d0024c83ed876dff2f83ffbebe7de502fd8	2016-12-19 14:59:49 -08:00
Yangqing Jia	44509f9f91	fbsync: mostly lint changes, added mkl files	2016-10-11 22:45:06 -07:00
Yangqing Jia	d1e9215184	fbsync	2016-10-07 13:08:53 -07:00
Yangqing Jia	05512d1e10	sync	2016-08-10 11:02:15 -07:00
Yangqing Jia	6463eebc7b	chunky sync - build scripts to be written	2016-07-21 10:16:42 -07:00
Yangqing Jia	559053d3a8	chunky sync	2016-05-13 14:43:48 -07:00
Yangqing Jia	98c5b86ef7	A few changes: (1) cudnn for conv (2) cublas: after going through the work I feel it's beter to use HOST pointer mode, so changed it. (3) storage order: despite that googlenet and multibox uses NHWC, it seems better to be still using NCHW as default to be consistent with caffe and cudnn; moved to NCHW as default.	2015-10-21 22:37:11 -07:00

1 2

53 Commits