Commit Graph

78 Commits

Nikita Shulga
e4ee5ca698 Revert D31326599: [pytorch][PR] Compile without -Wno-unused-variable
Test Plan: revert-hammer

Differential Revision:
D31326599 (a6280ab653)

Original commit changeset: 924155f1257a

fbshipit-source-id: b8ee5bc0298637443232f5ee9ec79e51ed256faf
2021-10-01 20:40:47 -07:00
Nikita Shulga
a6280ab653 Compile without -Wno-unused-variable (#65954)
Summary:
Delete `-Wno-unused-variable` from the top-level `CMakeLists.txt`
Still suppress those warnings for tests and `torch_python`

Delete a number of unused variables from caffe2 code
Use `(void)var;` to suppress unused-variable warnings in range loops
Use `C10_UNUSED` for global constructors and use `constexpr` instead of `static` for global constants
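For illustration, a minimal sketch of the three suppression patterns named above (the surrounding code is hypothetical; `C10_UNUSED` comes from `c10/macros/Macros.h`):
```
#include <map>
#include <string>

#include <c10/macros/Macros.h>

// `constexpr` instead of `static` for a global constant:
constexpr int kMaxRetries = 3;

// C10_UNUSED keeps a registration-style global from triggering
// -Wunused-variable:
C10_UNUSED static bool registered = [] { return true; }();

int CountEntries(const std::map<std::string, int>& m) {
  int count = 0;
  for (const auto& kv : m) {
    (void)kv;  // suppress the unused-variable warning in the range loop
    ++count;
  }
  return count;
}
```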

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65954

Reviewed By: ngimel

Differential Revision: D31326599

Pulled By: malfet

fbshipit-source-id: 924155f1257a2ba1896c50512f615e45ca1f61f3
2021-10-01 17:40:47 -07:00
Nikita Shulga
a9b0a921d5 Disable avoid-non-const-global-variables lint check (#62008)
Summary:
The GoogleTest `TEST` macro is non-compliant with this check, as is `DEFINE_DISPATCH`

All changes except the ones to `.clang-tidy` were generated using the following script:
```
for i in `find . -type f -iname "*.c*" -or -iname "*.h" | xargs grep cppcoreguidelines-avoid-non-const-global-variables | cut -f1 -d: | sort | uniq`; do
  sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" "$i"
done
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008

Reviewed By: driazati, r-barnes

Differential Revision: D29838584

Pulled By: malfet

fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13
2021-07-22 18:04:40 -07:00
Adam Simpkins
fadaa52f64 [caffe2] add an EstimateAllBlobSizes operator (#59775)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/59775

This operator is similar to `GetAllBlobNames` but also returns the estimated
size required to serialize each blob.

One goal of this operator is to allow checkpoint saving logic to estimate the
amount of space/bandwidth required to save a checkpoint when first starting
training, without actually serializing any blobs yet.  Currently the
checkpointing logic uses `GetAllBlobNames` to determine the blobs to
checkpoint.  It can instead be updated to use `EstimateAllBlobSizes` to also
get an estimate for how much space will be required for the checkpoint.
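As a hedged sketch (not from this PR), checkpoint code could invoke the operator from C++ roughly like this, assuming it mirrors `GetAllBlobNames` and adds a second output with per-blob size estimates; the output names here are illustrative:
```
#include "caffe2/core/operator.h"
#include "caffe2/core/workspace.h"

// Run EstimateAllBlobSizes and leave its outputs in the workspace.
void EstimateCheckpointSize(caffe2::Workspace* ws) {
  caffe2::OperatorDef def;
  def.set_type("EstimateAllBlobSizes");
  def.add_output("blob_names");  // assumed output: one name per blob
  def.add_output("blob_sizes");  // assumed output: estimated bytes per blob
  auto op = caffe2::CreateOperator(def, ws);
  op->Run();
  // Sum the entries of "blob_sizes" to estimate total checkpoint size.
}
```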
ghstack-source-id: 132275153

Test Plan: Included a new unit test.

Reviewed By: mraway

Differential Revision: D29020227

fbshipit-source-id: 811e5d86c4b59183e84e6424c48c97739be09043
2021-06-24 16:55:22 -07:00
Michael Voznesensky
d1d24304ee [Caffe2] [Easy] Fix comment on caffe2_serialize_using_bytes_as_holder to reflect correct types
Summary:
The logic is:

```
template <typename T>
typename std::enable_if<
    std::is_same<T, bool>::value || std::is_same<T, uint8_t>::value ||
        std::is_same<T, int8_t>::value || std::is_same<T, uint16_t>::value ||
        std::is_same<T, int16_t>::value,
    void>::type
```
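To make the quoted fragment concrete, here is a self-contained sketch of how such a SFINAE guard is typically attached to a serialization helper; the function name and body are illustrative, not the actual caffe2 code:
```
#include <cstddef>
#include <cstdint>
#include <string>
#include <type_traits>

// Enabled only for the small integral types held in a bytes field.
template <typename T>
typename std::enable_if<
    std::is_same<T, bool>::value || std::is_same<T, uint8_t>::value ||
        std::is_same<T, int8_t>::value || std::is_same<T, uint16_t>::value ||
        std::is_same<T, int16_t>::value,
    void>::type
SerializeUsingBytesAsHolder(const T* data, size_t n, std::string* out) {
  // Pack the raw bytes of data[0..n) into the proto's byte_data holder.
  out->append(reinterpret_cast<const char*>(data), n * sizeof(T));
}
```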

Test Plan: N/A

Reviewed By: simpkins

Differential Revision: D28587311

fbshipit-source-id: 970c673a9c1256600ec8bdd5f9ca53333a60d588
2021-05-20 18:03:34 -07:00
Nikita Shulga
4cb534f92e Make PyTorch code-base clang-tidy compliant (#56892)
Summary:
This is an automatic change generated by the following script:
```
#!/usr/bin/env python3
from subprocess import check_output, check_call
import os

def get_compiled_files_list():
    import json
    with open("build/compile_commands.json") as f:
        data = json.load(f)
    files = [os.path.relpath(node['file']) for node in data]
    for idx, fname in enumerate(files):
        if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'):
            files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')]
    return files

def run_clang_tidy(fname):
    check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname,"-s"])
    changes = check_output(["git", "ls-files", "-m"])
    if len(changes) == 0:
        return
    check_call(["git", "commit","--all", "-m", f"NOLINT stubs for {fname}"])

def main():
    git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n")
    compiled_files = get_compiled_files_list()
    for idx, fname in enumerate(git_files):
        if fname not in compiled_files:
            continue
        if fname.startswith("caffe2/contrib/aten/"):
            continue
        print(f"[{idx}/{len(git_files)}] Processing {fname}")
        run_clang_tidy(fname)

if __name__ == "__main__":
    main()
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892

Reviewed By: H-Huang

Differential Revision: D27991944

Pulled By: malfet

fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179
2021-04-28 14:10:25 -07:00
Adam Simpkins
87989a6cf9 [caffe2] support serializing float data as bfloat16 (#53735)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53735

Add an option to BlobSerializationOptions to request that float data be
serialized as bfloat16.  This reduces the serialized data size at the expense
of some loss in precision.
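A minimal sketch of the underlying conversion (not the PR's code): bfloat16 keeps the sign, exponent, and top 7 mantissa bits of an IEEE-754 float, so encoding is a 16-bit truncation that halves storage at the cost of low-order mantissa precision:
```
#include <cstdint>
#include <cstring>

// Encode: keep the high 16 bits of the float's bit pattern.
inline uint16_t FloatToBFloat16(float f) {
  uint32_t bits;
  std::memcpy(&bits, &f, sizeof(bits));
  return static_cast<uint16_t>(bits >> 16);  // drop low mantissa bits
}

// Decode: restore the high bits; the lost low bits read back as zero.
inline float BFloat16ToFloat(uint16_t b) {
  uint32_t bits = static_cast<uint32_t>(b) << 16;
  float f;
  std::memcpy(&f, &bits, sizeof(f));
  return f;
}
```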
ghstack-source-id: 124317910

Test Plan: Included a new unit test.

Reviewed By: mraway

Differential Revision: D26658205

fbshipit-source-id: 74521ed161059066355a3f208488ed01a344dbb5
2021-03-24 13:27:22 -07:00
frdong
92770d25cd fix comparison of narrow type with wide type in loop condition (#53951)
Summary:
Fix Semmle warning: comparison of narrow type with wide type in loop condition.

For example, consider the following piece of code:
```
for (int i=0; i<array.size(); ++i) {}
```

The problem is that `array.size()` returns `size_t`, which can be a wider type than `int` depending on the implementation, so there is a chance that `i` overflows (for a very large array whose size is beyond the range of `int`) and the loop never terminates.
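The usual fix, sketched below, is to give the index the same wide type as the bound so the comparison no longer mixes widths:
```
#include <cstddef>
#include <vector>

void Process(const std::vector<int>& array) {
  // size_t matches the type of array.size(), so i can always reach the bound.
  for (size_t i = 0; i < array.size(); ++i) {
    // ... use array[i] ...
  }
}
```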

Pull Request resolved: https://github.com/pytorch/pytorch/pull/53951

Reviewed By: zou3519

Differential Revision: D27181495

Pulled By: malfet

fbshipit-source-id: 0612c5cedcdc656c193085e7fbb87dd163f20688
2021-03-22 16:40:35 -07:00
Adam Simpkins
ccdcfba5de [caffe2] Refactor tensor serialization function (#53404)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53404

This refactors `TensorSerializer::Serialize()` so that we have a separate
helper function for each data type.

This should make it slightly easier in the future to add new serialization
formats for specific data types.
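The shape of the refactor, as a self-contained sketch with hypothetical names (the real helpers operate on caffe2's `Tensor` and `TensorProto`):
```
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>

enum class DataType { kFloat, kInt32 };
struct Proto { std::string byte_data; };

// One small helper per data type...
void SerializeFloatData(const float* d, size_t n, Proto* p);
void SerializeInt32Data(const int32_t* d, size_t n, Proto* p);

// ...so the top-level Serialize() reduces to a per-type dispatch.
void Serialize(DataType dt, const void* data, size_t n, Proto* p) {
  switch (dt) {
    case DataType::kFloat:
      return SerializeFloatData(static_cast<const float*>(data), n, p);
    case DataType::kInt32:
      return SerializeInt32Data(static_cast<const int32_t*>(data), n, p);
  }
  throw std::runtime_error("unsupported data type");
}
```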
ghstack-source-id: 124085413

Test Plan:
Confirmed the existing tests pass.  This diff is not expected to have any
behavior changes.

Reviewed By: mraway, glamtechie

Differential Revision: D26658204

fbshipit-source-id: 232776262db6486ba845a7ba223e3987053dac27
2021-03-17 12:36:31 -07:00
Adam Simpkins
33aaea912a [caffe2] Support deserializing tensors using alternate serialization formats (#53403)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53403

This updates `TensorProto` to track the data type of the in-memory
(deserialized) data independently from the serialized data format.

This will allow us to support multiple different serialization formats in the
future.  For instance, we could choose to perform quantization of floating
point data types, or varint encoding for integer fields.

For now this diff does not actually change the serialization code path yet,
and does not introduce any new serialization formats, but only refactors the
deserialization code path to make it easier to introduce new formats.

I'm not really that thrilled with the heavy use of macros and templates here,
but I didn't really see better alternatives that made it as simple to specify
new deserialization function implementations.
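In miniature, the separation looks like this (field and enum names are hypothetical; the real fields live in `TensorProto`):
```
// The logical dtype of the deserialized tensor and the physical encoding of
// the serialized bytes are recorded independently, so the reader can pick a
// decoder per (data_type, format) pair.
enum class DataType { FLOAT, INT32 };
enum class SerializationFormat { RAW, BFLOAT16, VARINT };

struct SerializedTensorHeader {
  DataType data_type;          // what the in-memory tensor will hold
  SerializationFormat format;  // how the bytes were actually written
};
```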
ghstack-source-id: 123594220

Test Plan:
Confirmed that the existing unit tests pass.  This diff only touches the
deserialization code path and not the serialization code to help ensure that
the deserialization code works with the existing serialization logic, and that
there are no changes to the current serialization format.

Reviewed By: mraway

Differential Revision: D26658206

fbshipit-source-id: d7297d600aee28b92fd9f4ece437b7f519060942
2021-03-12 11:35:15 -08:00
Adam Simpkins
7e5ffbfa94 [caffe2] add a SerializationOptions field for the save operator (#53402)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53402

Add an `options` field to the `Save` operator which accepts options for how to
serialize different blobs.  At the moment this simply allows controlling the
existing `chunk_size` behavior, but in the future we can add other options,
such as the ability to control compression settings or other serialization
formats.
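Mechanically, the existing `chunk_size` behavior amounts to something like the following sketch (the helper and callback are hypothetical):
```
#include <algorithm>
#include <cstdint>
#include <functional>

// Serialize [0, numel) in chunks of at most chunk_size elements; each chunk
// becomes its own serialized record.
void SerializeInChunks(
    int64_t numel,
    int64_t chunk_size,
    const std::function<void(int64_t begin, int64_t end)>& serialize_chunk) {
  if (chunk_size <= 0) {
    chunk_size = numel;  // non-positive: everything in one chunk
  }
  for (int64_t begin = 0; begin < numel; begin += chunk_size) {
    serialize_chunk(begin, std::min(begin + chunk_size, numel));
  }
}
```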
ghstack-source-id: 123567034

Test Plan:
Added a new test to `load_save_test.py` that passes in options and verifies
that blobs were serialized with the expected number of chunks.

  buck test caffe2/caffe2:caffe2_test_cpu \
    caffe2/caffe2/core:serialization_test \
    caffe2/caffe2/python/operator_test:load_save_test

Reviewed By: mraway

Differential Revision: D26502577

fbshipit-source-id: 6e302e530bb96990517c2e35c505db7f14a56284
2021-03-11 13:02:58 -08:00
Adam Simpkins
27d89057f8 [caffe2] fix deserialization of unknown tensor data_type values (#52411)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52411

The `TensorDeserializer` code previously did not correctly handle unknown
`data_type` values.  It attempted to deserialize the data as floats, rather
than recognizing that it did not understand the data type and erroring out.

Google protobuf will never return unknown values for enum fields.  If an
unknown value is found in serialized data, the protobuf code discards it.
As a result `has_data_type()` will return false, but `get_data_type()` will
simply return the default value, which happens to be `FLOAT`. Consequently,
if we ever encounter a serialized blob with an unknown data type, the
previous code would incorrectly treat the data type as `FLOAT`.

This fixes the code to check if the `data_type` value is present before
reading it.
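A sketch of the guard (the accessors are the proto2-generated ones; the exact error message is illustrative):
```
#include "caffe2/core/logging.h"     // CAFFE_THROW
#include "caffe2/proto/caffe2_pb.h"  // caffe2::TensorProto

void CheckDataTypePresent(const caffe2::TensorProto& tensor_proto) {
  // Protobuf drops unknown enum values, so has_data_type() is the only
  // reliable signal; data_type() alone silently reports the default (FLOAT)
  // for a blob written with a data type this reader does not know about.
  if (!tensor_proto.has_data_type()) {
    CAFFE_THROW("Cannot deserialize tensor: data_type field is not set");
  }
}
```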
ghstack-source-id: 121915981

Test Plan:
Included a unit test that verifies this behavior.  Confirmed that without this
fix the code proceeded with the float deserialization code path.  When
deserializing int32_t data it fortunately did fail later due to an unexpected
field length check, but this isn't guaranteed to be the case.  In some cases
it potentially could incorrectly succeed and return wrong data.

Reviewed By: mraway

Differential Revision: D26375502

fbshipit-source-id: 4f84dd82902e18df5e693f4b28d1096c96de7916
2021-02-17 19:13:43 -08:00
Kai Yang
fc314350ad Make RebatchingBuffer compatible with auto shape inference
Summary: No-op for operator behavior; resolves https://fburl.com/wte0v7tf

Test Plan: buck test

Reviewed By: huangyi1979

Differential Revision: D26333212

fbshipit-source-id: d237e8caf5977bc19fcced6aeedc6464fc905457
2021-02-09 12:37:26 -08:00
Hao Lu
da6f249a10 [caffe2] DeserializeToNDArray (#49135)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49135

Differential Revision: D25417845

fbshipit-source-id: 4d8efd440bc2577fb717f911a401e7b81d48b907
2020-12-10 21:59:25 -08:00
Rohith Menon
4e16be9073 [MemLeak] Fix memory leak from releasing unique ptr (#41883)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/41883

Fix memory leak from releasing unique ptr
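The general shape of such a leak, as a minimal illustration (not the actual call site):
```
#include <cstddef>
#include <memory>

void Consume(char* p);  // uses but does not take ownership of p

void Leaky(size_t n) {
  std::unique_ptr<char[]> buf(new char[n]);
  Consume(buf.release());  // BUG: release() abandons ownership; buffer leaks
}

void Fixed(size_t n) {
  std::unique_ptr<char[]> buf(new char[n]);
  Consume(buf.get());      // non-owning pointer; buf still frees the memory
}
```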

Test Plan:
Tested serialization with and without the change.

Heap profile without change:
```
Welcome to jeprof!  For help, type 'help'.
(jeprof) top
Total: 7298.4 MB
  4025.2  55.2%  55.2%   4025.2  55.2% c10::alloc_cpu (inline)
  3195.3  43.8%  98.9%   3195.3  43.8% caffe2::SerializeUsingBytesOrInt32
    63.6   0.9%  99.8%     63.6   0.9% __gnu_cxx::new_allocator::allocate (inline)
     5.0   0.1%  99.9%      5.0   0.1% google::protobuf::RepeatedField::Reserve
     2.5   0.0%  99.9%      2.5   0.0% folly::aligned_malloc (inline)
     1.2   0.0%  99.9%      1.2   0.0% caffe2::detail::CopyFromProtoWithCast (inline)
     1.0   0.0%  99.9%      1.0   0.0% __new_exitfn
     1.0   0.0% 100.0%      1.0   0.0% std::_Function_base::_Base_manager::_M_init_functor (inline)
     0.5   0.0% 100.0%      0.5   0.0% folly::HHWheelTimerBase::newTimer (inline)
     0.5   0.0% 100.0%      0.5   0.0% std::__detail::_Hashtable_alloc::_M_allocate_node
```

Heap profile with change:
```
Welcome to jeprof!  For help, type 'help'.
(jeprof) top
Total: 6689.2 MB
  4025.2  60.2%  60.2%   4025.2  60.2% c10::alloc_cpu (inline)
  2560.0  38.3%  98.4%   2560.0  38.3% caffe2::::HugePagesArena::alloc_huge (inline)
    90.9   1.4%  99.8%     90.9   1.4% __gnu_cxx::new_allocator::allocate (inline)
     5.0   0.1%  99.9%      5.0   0.1% google::protobuf::RepeatedField::Reserve
     2.0   0.0%  99.9%      2.0   0.0% prof_backtrace_impl (inline)
     1.0   0.0%  99.9%     20.3   0.3% std::__cxx11::basic_string::_M_construct (inline)
     1.0   0.0%  99.9%      1.0   0.0% std::_Function_base::_Base_manager::_M_init_functor (inline)
     0.5   0.0%  99.9%      0.5   0.0% folly::UnboundedQueue::allocNextSegment (inline)
     0.5   0.0% 100.0%      0.5   0.0% folly::aligned_malloc (inline)
     0.5   0.0% 100.0%      0.5   0.0% __new_exitfn
```

Reviewed By: yinghai

Differential Revision: D22662093

fbshipit-source-id: d0b8ff1ed26c72b14bb02fb1146c51ef11a7e519
2020-07-22 16:54:19 -07:00
Sean Lynch
64689c2474 Remove unnecessary copy within blob serialization (#40096)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40096

Declaring `tensor_proto` with type `auto` means that it copies the entire `TensorProto` instead of just keeping a reference. This changes it to use a const reference instead.
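The pitfall in isolation (the message type is simplified for illustration):
```
#include <string>
#include <vector>

struct TensorProto {
  std::vector<std::string> chunks;  // stands in for a large protobuf message
};

struct BlobProto {
  TensorProto tensor_;
  const TensorProto& tensor() const { return tensor_; }
};

void Illustrate(const BlobProto& blob) {
  auto copied = blob.tensor();      // deduces TensorProto: deep copy
  const auto& ref = blob.tensor();  // deduces const TensorProto&: no copy
  (void)copied;
  (void)ref;
}
```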

Test Plan:
Using the model loader benchmark to measure model loading performance:

### `tensor_proto` is of type `const auto&`
```
============================================================================
caffe2/caffe2/fb/predictor/ModelLoaderBenchmark.cpp  relative  time/iter  iters/s
============================================================================
BlobProtoInt32DeserializationFloat16                        11.08ms    90.27
BlobProtoByteDeserializationFloat16             1509.73%   733.73us    1.36K
----------------------------------------------------------------------------
BlobProtoInt32DeserializationUInt8                          10.48ms    95.45
BlobProtoByteDeserializationUInt8               2974.57%   352.22us    2.84K
============================================================================
```

### `tensor_proto` is of type `auto`
```
============================================================================
caffe2/caffe2/fb/predictor/ModelLoaderBenchmark.cpp  relative  time/iter  iters/s
============================================================================
BlobProtoInt32DeserializationFloat16                        13.84ms    72.26
BlobProtoByteDeserializationFloat16              658.85%     2.10ms   476.08
----------------------------------------------------------------------------
BlobProtoInt32DeserializationUInt8                          17.09ms    58.51
BlobProtoByteDeserializationUInt8               3365.98%   507.80us    1.97K
============================================================================
```

Reviewed By: marksantaniello

Differential Revision: D21959644

fbshipit-source-id: 6bc2dfbde306f88bf7cd4f9b14b95ac69c2e1b4d
2020-06-16 14:45:59 -07:00
Rohith Menon
879a90b322 [ModelLoading] Use byte encoding for uint8, fp16 etc. instead of int32 (#34343)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34343

Use byte encoding for uint8, fp16 etc. instead of int32 in TensorProto serialization/deserialization

tl;dr
- fp16 tensor deserialization 12x faster, serialized size 25% lower
- uint8 tensor deserialization 36x faster, serialized size 25% lower
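Roughly why the bytes encoding wins, as an illustrative helper (not the PR's code): a bytes field stores each element at its natural width, whereas the repeated `int32` field spends a varint encoding on every element:
```
#include <cstddef>
#include <cstdint>
#include <string>

// Append n fp16 values (raw uint16_t bit patterns) to a bytes holder:
// exactly 2 bytes per element, with no per-element varint encoding.
void AppendFloat16AsBytes(const uint16_t* data, size_t n, std::string* bytes) {
  bytes->append(reinterpret_cast<const char*>(data), n * sizeof(uint16_t));
}
```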

Test Plan:
```
============================================================================
caffe2/caffe2/fb/predictor/ModelLoaderBenchmark.cpp  relative  time/iter  iters/s
============================================================================
BlobProtoInt32DeserializationFloat16                        12.37ms    80.82
BlobProtoByteDeserializationFloat16             1125.46%     1.10ms   909.64
----------------------------------------------------------------------------
BlobProtoInt32DeserializationUInt8                          17.57ms    56.92
BlobProtoByteDeserializationUInt8               3629.45%   484.02us    2.07K
============================================================================
```

Reviewed By: yinghai

Differential Revision: D20137451

fbshipit-source-id: 8ed4be2286a6d4c7e134fcb0832f22bc645039a1
2020-03-06 11:58:30 -08:00
Shunting Zhang
7f5f2e8871 add ZERO_COLLISION_HASH to caffe2 data type (#30912)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30912

Add a new data type ZERO_COLLISION_HASH .

Test Plan: ci

Reviewed By: boryiingsu

Differential Revision: D18843626

fbshipit-source-id: b2d8280f13c78b4a656cf95822198df59de7b64c
2019-12-10 21:36:24 -08:00
Edward Yang
1e6acc676f Replace caffe2::DeviceGuard with c10::cuda::CUDAGuard (#17623)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/17623

Despite its generic-sounding name, caffe2::DeviceGuard actually
only worked on CUDA devices.  Rename it to something that more
clearly spells out its applicability.

I'm not sure if it's the right call, but in this patch I added
'using CUDAGuard = c10::cuda::CUDAGuard', as this seems to be more
in-line with how the Caffe2 codebase is currently written.  More
idiomatic c10 namespace style would be to say cuda::CUDAGuard.
Willing to change this if people shout.

This is a respin of D13156470 (#14284)
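For reference, typical RAII usage of the guard under its new name (a minimal sketch):
```
#include <c10/cuda/CUDAGuard.h>

void RunOnDevice(c10::DeviceIndex device_index) {
  // Switches the current CUDA device for this scope; the previous device is
  // restored when the guard is destroyed.
  c10::cuda::CUDAGuard guard(device_index);
  // ... launch work on device_index ...
}
```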

Reviewed By: dzhulgakov

Differential Revision: D14285504

fbshipit-source-id: 93b8ab938b064572b3b010c307e1261fde0fff3d
2019-03-06 10:48:15 -08:00
Michael Liu
5f866d0ea2 Apply modernize-use-override (2nd iteration)
Summary:
Use C++11’s override and remove virtual where applicable.
Changes are automatically generated.
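The transformation in miniature:
```
struct Base {
  virtual ~Base() = default;
  virtual void Run();
};

// Before: the derived class repeated `virtual void Run();`.
// After: `override` both documents and compiler-checks the override.
struct Derived : Base {
  void Run() override;
};
```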

Reviewed By: Orvid

Differential Revision: D14086124

fbshipit-source-id: 2005227d095d776ca3b4309a57f54e25782b9b58
2019-02-14 16:52:57 -08:00
Shahzad Lone
53ae8bc64d Reserve vectors that we know the size in advance for. (#16201)
Summary:
Save reallocation costs by reserving vectors according to how many elements we expect to put in.
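The pattern, minimally:
```
#include <cstddef>
#include <vector>

std::vector<int> MakeSquares(std::size_t n) {
  std::vector<int> v;
  v.reserve(n);  // one allocation up front instead of repeated regrowth
  for (std::size_t i = 0; i < n; ++i) {
    v.push_back(static_cast<int>(i * i));
  }
  return v;
}
```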
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16201

Differential Revision: D13762594

Pulled By: ezyang

fbshipit-source-id: 7e3bfe421489dde48a2ddb0920dd155f69baecc0
2019-01-22 08:02:40 -08:00
Edward Yang
f4c59c5fdf Replace SwitchToDevice(0) with SwitchToDevice() (#15126)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/15126

I want to make people stop manufacturing StreamId from thin air,
and a first step is to make people use the default stream.

Reviewed By: dzhulgakov

Differential Revision: D13432922

fbshipit-source-id: 9f0d8d70646c50d979bde5ba3c3addeebac48a3d
2018-12-17 15:15:00 -08:00
Jerry Zhang
9b272c08cf Remove partially initialized Tensor in Deserialization (#14197)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14197

Pull Request resolved: https://github.com/pytorch/pytorch/pull/13642

Previously we passed a partially initialized Tensor to Deserialize and it filled
it with the result of deserializing a tensor proto. Now we want it to return
a Tensor directly, since a Tensor is just a shared pointer to TensorImpl.

Reviewed By: dzhulgakov

Differential Revision: D12874357

fbshipit-source-id: 12b80a763375da23cfa64a74d6bc186d8d03b94f
2018-12-10 17:17:29 -08:00
Dmytro Dzhulgakov
0cfbbceac3 Change Tensor::CopyFrom to a simple double dispatch (#14268)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14268

Removes the need for Context in Tensor by doing simple dispatch for CopyBytes. It'd eventually be subsumed by Roy Li's changes for a proper copy_ op, but before that is done, let's get a clear picture of how copies are implemented and clean up some cruft in the CopyFrom implementation.
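A self-contained sketch of the double-dispatch idea (names illustrative, not the actual caffe2 code): a table indexed by source and destination device type picks the copy routine at runtime:
```
#include <cstddef>
#include <cstring>

enum DeviceType { CPU = 0, CUDA = 1, kNumDeviceTypes = 2 };

using CopyBytesFn = void (*)(size_t nbytes, const void* src, void* dst);

// One registered function per (source, destination) device-type pair.
static CopyBytesFn g_copy_bytes[kNumDeviceTypes][kNumDeviceTypes];

void CopyBytes(size_t nbytes, const void* src, DeviceType src_dev,
               void* dst, DeviceType dst_dev) {
  g_copy_bytes[src_dev][dst_dev](nbytes, src, dst);  // dispatch on both types
}

// Example registration for CPU -> CPU:
static void CopyBytesCPU(size_t n, const void* s, void* d) {
  std::memcpy(d, s, n);
}
static bool g_cpu_registered = (g_copy_bytes[CPU][CPU] = CopyBytesCPU, true);
```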

Note that with these changes, one can probably get rid of Context::CopyFromCPU/CopyToCPU, but that's a matter for follow-up diffs.

This diff doesn't change the API of Tensor yet, but relies on the fact that passing `Context` to CopyFrom makes the copy async if the device is CUDA and has no effect otherwise (that's how Context methods are implemented).

This doesn't change the semantics of the async copy implementation - as before, it blindly calls cudaMemcpyAsync, which probably means it can be misused if invoked separately outside of an operator body. I'll leave it for the follow-up copy_ unification.

For Extend() we always do an async copy - this makes sense as it's an in-place device-to-device operation whose effect is only observable through further ops.

Note: there are now three ways of invoking a copy in C2 code - the templated CopyBytes, the virtual CopyFromCPU/etc, and the double-dispatch free function here. Hopefully we can get rid of the second one.

Also, please advise whether it's c10-worthy :)

Reviewed By: ezyang

Differential Revision: D13117987

fbshipit-source-id: a6772d6dcf3effaf06717da3a656fc9873b310b5
2018-11-28 15:45:37 -08:00
Jerry Zhang
a228a95b94 Rename ndim() -> dim() - 1/6
Summary:
Codemod generated with clangr shard mode, 50 files per diff,
clangr code(ndim()->dim()): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: ezyang

Differential Revision: D12935693

fbshipit-source-id: f24f1c10cd5bbb9e63cda0a0da989e6e3766380a
2018-11-07 07:30:11 -08:00
Jerry Zhang
2e1b7a6f4f Renaming dim() to size() - 1/3 (#13434)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13434

Codemod generated with clangr shard mode, 50 files per diff,
clangr code(dim->size): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: ezyang

Differential Revision: D12867223

fbshipit-source-id: 3e05be1a370ebd1a273bd4c70499d019fd056ac4
2018-10-31 17:43:52 -07:00
Jerry Zhang
edd902594a Renaming meta() to dtype() - 1/2 (#13333)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13333

Codemod generated with clangr shard mode, 50 files per diff,
clangr code(meta->dtype): diffusion/FBS/browse/master/fbcode/caffe2/caffe2/fb/codemods/TensorMethodRename.cpp

Reviewed By: ezyang

Differential Revision: D12845168

fbshipit-source-id: 492091963d2211ea80215200e981965767566135
2018-10-31 17:14:08 -07:00
Dmytro Dzhulgakov
47c0d88739 Bring back warning for dtype uninitialized in serialization (#13239)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/13239

Previous diff missed the if (dtype_initialized) check, duh.

Also, to guard against log spam - using LOG_EVERY_MS if it's available

Reviewed By: kennyhorror

Differential Revision: D12818938

fbshipit-source-id: 76590bd1b28010fb13f5d33423c8eac1395e9f76
2018-10-29 22:09:54 -07:00
Jerry Zhang
eea2ee6d29 Renaming size() to numel() - 1/17
Summary: Codemod generated with clangr shard mode, 25 files per diff

Reviewed By: li-roy

Differential Revision: D10866237

fbshipit-source-id: 020fcfdf52083430c5b674eda8e07ad3adfcc838
2018-10-26 15:36:59 -07:00
Edward Yang
f282fa1afe Comment out LOG(ERROR) for legacy no-dtype serialization behavior
Reviewed By: wylqc

Differential Revision: D12569279

fbshipit-source-id: 46def8ca163bcf9070a1179166fd8970e07ee229
2018-10-26 13:18:27 -07:00
Dmytro Dzhulgakov
c95fa4b904 fix dtype uninitialized tensor serialization
Summary:
See D10380678 for the discussion.

Caffe2 serialization code was able to handle dtype-uninitialized tensors as long as their numel was 0 O_O.

For safety, to unblock the push, I'm preserving this behavior with a critical. As we fix all occurrences of the old API, we can delete this test.

Reviewed By: kennyhorror

Differential Revision: D10866562

fbshipit-source-id: e172bd045fdfca660ff05b426e001f5f2f03f408
2018-10-26 01:30:47 -07:00
Michael Antonov
a6949abb15 Guard all Caffe2 protobuf string serializations with CAFFE_ENFORCE (fixed reverted bug) (#12848)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12848

Updated all non-test uses of protobuf::MessageLite::SerializeAsString to call
SerializeAsString_EnforceCheck so that the return value is checked and an
exception is thrown on failure.

Most of the affected code was called from classes derived from BlobSerializeBase.
Didn't touch most tests and ENFORCE calls because they usually do checks
anyway.
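What the checked wrapper amounts to, sketched (an illustrative implementation; the real helper lives in caffe2's proto utilities):
```
#include <string>

#include <google/protobuf/message_lite.h>

#include "caffe2/core/logging.h"  // CAFFE_ENFORCE

std::string SerializeAsString_EnforceCheck(
    const google::protobuf::MessageLite& msg) {
  std::string out;
  // SerializeToString() returns false on failure (e.g. a message over the
  // 2GB protobuf limit), which SerializeAsString() would silently swallow.
  CAFFE_ENFORCE(
      msg.SerializeToString(&out), "Failed to serialize protobuf message");
  return out;
}
```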

Original commit changeset: c0760e73ecc7

Reviewed By: dzhulgakov

Differential Revision: D10453456

fbshipit-source-id: d2f2b7b4578e721924354149f08f627c7e3bf070
2018-10-23 16:21:26 -07:00
Junjie Bai
805f4d5cb8 Revert D10416438: Guard all Caffe2 protobuf string serializations with CAFFE_ENFORCE
Differential Revision:
D10416438

Original commit changeset: cb842e3e26b0

fbshipit-source-id: c0760e73ecc76ca9b1b74f6844e243c2df5260a2
2018-10-18 13:46:33 -07:00
Michael Antonov
63cd051867 Guard all Caffe2 protobuf string serializations with CAFFE_ENFORCE (#12799)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12799

Updated all non-test uses of protobuf::MessageLite::SerializeAsString to call
SerializeAsString_EnforceCheck so that the return value is checked and an
exception is thrown on failure.

Most of the affected code was called from classes derived from BlobSerializeBase.
Didn't touch most tests and ENFORCE calls because they usually do checks
anyway.

Reviewed By: ezyang

Differential Revision: D10416438

fbshipit-source-id: cb842e3e26b0918829d71267a375d4dd40600d58
2018-10-18 12:49:01 -07:00
Yangqing Jia
7d5f7ed270 Using c10 namespace across caffe2. (#12714)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12714

This is a short change to enable the c10 namespace in caffe2. We did not enable
it before due to gflags global variable confusion, but that should have been
mostly cleaned up by now. Right now, the plan on record is that namespace caffe2 and
namespace aten will both be full supersets of namespace c10.

Most of the diff is a codemod; the only two non-codemod changes are in caffe2/core/common.h, where

```
using namespace c10;
```

is added, and in Flags.h, where instead of creating aliasing variables in the c10 namespace, we put them directly in the global namespace to match gflags (and the same behavior holds if gflags is not being built with).

Reviewed By: dzhulgakov

Differential Revision: D10390486

fbshipit-source-id: 5e2df730e28e29a052f513bddc558d9f78a23b9b
2018-10-17 12:57:19 -07:00
Sebastian Messmer
dd7501e3a8 Remove Blob::ShareExternal from serialization (#11926)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11926

With the preparation work of diffs stacked below, we're now able to remove this call to Blob::ShareExternal(),
preparing for removing that function from Blob,

Reviewed By: dzhulgakov

Differential Revision: D9884563

fbshipit-source-id: 7dd5c5fe02be0df7a44be45587c1dd7c474126ef
2018-10-17 11:50:35 -07:00
Sebastian Messmer
6cbf1992bd Serialization takes pointers instead of Blob (#11925)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11925

This is step 1 in the refactoring to remove Blob::ShareExternal(), i.e. Blob would then always own its contents.

ShareExternal() is for example used to pass non-owning blobs to serialization. This diff prepares removing that.

Reviewed By: ezyang

Differential Revision: D9884177

fbshipit-source-id: d01df9a613a4fc62e5679fe45bfc47e2c899b818
2018-10-17 11:50:34 -07:00
Lu Fang
30aaa07594 New serialization format (#12384)
Summary:
Addressed Dima's feedback.

The proposal is here: https://fb.quip.com/TbQmAuqIznCf
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12384

Reviewed By: dzhulgakov

Differential Revision: D10246743

Pulled By: houseroad

fbshipit-source-id: c80db0c35d60ca32965275da705f2b1dfb2a7265
2018-10-16 16:36:58 -07:00
Yangqing Jia
713e706618 Move exception to C10 (#12354)
Summary:
There is still some work to be done:

- Move logging and unify AT_WARN with LOG(ERROR).
- A few header files are still being plumbed through and need cleaning.
- caffe2::EnforceNotMet aliasing is not done yet.
- Need to unify the macros. See c10/util/Exception.h

This is mainly a codemod and does not cause functional changes. If you find your job failing and trace it back to this diff, it can usually be fixed by one of the following approaches:

(1) add //caffe2/c10:c10 to your dependency (or transitive dependency).
(2) change objects such as at::Error, at::Optional to the c10 namespace.
(3) change functions to the c10 namespace. In particular, caffe2::MakeString is not overridden by the unified c10::str function. Nothing else changes.

Please kindly consider not reverting this diff - it involves multiple rounds of rebasing and the fix is usually simple. Contact jiayq@ or AI Platform Dev for details.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/12354

Reviewed By: orionr

Differential Revision: D10238910

Pulled By: Yangqing

fbshipit-source-id: 7794d5bf2797ab0ca6ebaccaa2f7ebbd50ff8f32
2018-10-15 13:33:18 -07:00
Jerry Zhang
7724807551 Remove ExtractDeviceOption from StaticContext (#12304)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12304

- Make ExtractDeviceOption a free function.
- Add a Storage(at::Device) constructor in order to preserve the device_id.

Reviewed By: dzhulgakov

Differential Revision: D10069839

fbshipit-source-id: a5f3994a39bdf1b7503b39bb42c228e438b52bfa
2018-10-10 14:12:16 -07:00
Yangqing Jia
38f3d1fc40 move flags to c10 (#12144)
Summary:
Still in flux.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12144

Reviewed By: smessmer

Differential Revision: D10140176

Pulled By: Yangqing

fbshipit-source-id: 1a313abed022039333e3925d19f8b3ef2d95306c
2018-10-04 02:09:56 -07:00
Dmytro Dzhulgakov
1d3f650ce4 Revert D10098106: [pytorch][PR] [WIP] New version of PT1 model format
Differential Revision:
D10098106

Original commit changeset: 94ec7fc57c84

fbshipit-source-id: 38f729b0970618f38359797b806cbbcd865f4715
2018-10-02 00:43:40 -07:00
Lu Fang
35becd1879 New version of PT1 model format (#12149)
Summary:
Considered four different existing formats: 1) static graph, 2) torch script, 3) pickle files, 4) PyTorch C++ serialize APIs
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12149

Reviewed By: BIT-silence

Differential Revision: D10098106

Pulled By: houseroad

fbshipit-source-id: 94ec7fc57c842e50fae5286ddeda657a4967a07a
2018-10-01 15:57:02 -07:00
Jerry Zhang
006171fffc Back out "[pytorch][PR] Revert "Move CreateContext to global registry (#11688)"" (#12121)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12121

Pull Request resolved: https://github.com/pytorch/pytorch/pull/12055

Original commit changeset: 6ca9de65b707

Reviewed By: ezyang

Differential Revision: D10033396

fbshipit-source-id: ca9f4b2f7ef0561f619b833415d394a8b9972bf4
2018-10-01 11:10:46 -07:00
Yangqing Jia
9c49bb9ddf Move registry fully to c10 (#12077)
Summary:
This does 6 things:

- add c10/util/Registry.h as the unified registry util
  - cleaned up some APIs such as export condition
- fully remove aten/core/registry.h
- fully remove caffe2/core/registry.h
- remove a bogus aten/registry.h
- unify all macros
- set up registry testing in c10

Also, an important note: we used to mark the templated Registry class as EXPORT - this should not happen, because one should almost never export a template class. This PR fixes that.
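For reference, a small example of the unified registry macros in `c10/util/Registry.h` (the kernel classes are hypothetical):
```
#include <memory>

#include <c10/util/Registry.h>

class BaseKernel {
 public:
  virtual ~BaseKernel() = default;
};

// Declare and define a registry keyed by string, producing BaseKernel.
C10_DECLARE_REGISTRY(KernelRegistry, BaseKernel);
C10_DEFINE_REGISTRY(KernelRegistry, BaseKernel);

class FastKernel : public BaseKernel {};
C10_REGISTER_CLASS(KernelRegistry, FastKernel, FastKernel);

// Construct a registered class by key at runtime.
std::unique_ptr<BaseKernel> MakeFast() {
  return KernelRegistry()->Create("FastKernel");
}
```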
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12077

Reviewed By: ezyang

Differential Revision: D10050771

Pulled By: Yangqing

fbshipit-source-id: 417b249b49fed6a67956e7c6b6d22374bcee24cf
2018-09-27 03:09:54 -07:00
Sebastian Messmer
8f0db9bbbb Removing some dependency edges from Blob to other caffe2 (#12043)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12043

Re-trying D9979976, this time with all call sites fixed.

D9979976 got reverted because there was a call site that wasn't covered by sandcastle it seems.
I fixed it and used 'grep' to ensure there aren't any more call sites in fbsource.

Reviewed By: ezyang

Differential Revision: D10026392

fbshipit-source-id: cd341514a8e53a40147ea0ee3e52f63bb6444157
2018-09-25 11:40:24 -07:00
Edward Yang
d7e11e3aae Revert "Move CreateContext to global registry (#11688)" (#12049)
Summary:
This reverts commit 3ae6ee4ebd.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/12049

Differential Revision: D10030954

Pulled By: ezyang

fbshipit-source-id: 6ca9de65b707c5b4c68280fc6f1b8e5ad7251efc
2018-09-25 10:13:43 -07:00
Maciej Bargiel
2cdf98a74d Back out "Removing some dependency edges from Blob to other caffe2"
Summary: Original commit changeset: 2ea17724e223

Differential Revision:
D10026321
Ninja: stable broken

fbshipit-source-id: faf87cb7cc0f78c2c10d4aa6fceea279cd27acd6
2018-09-25 01:11:14 -07:00
Sebastian Messmer
17a65bf9b6 Removing some dependency edges from Blob to other caffe2 (#11923)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11923

This is pre-work to allow moving Blob to ATen/core, which cannot depend on caffe2 anymore.
(1) Removing the Blob -> Tensor dependency allows us to move Blob to ATen/core and use it inside IValue without having to wait for the Tensor merge to be complete.
(2) In the final Blob design, we want it to be a very small class that doesn't have any special treatment for Tensor (or to be more correct, doesn't allow storing Tensor anymore), so this is anyhow the direction we want to go.

This changes call sites that will have to be moved to IValue later, but they cannot be moved to IValue directly, because for that, IValue first needs to be able to store Blob, which in turn first needs this diff and some other changes coming up in future diffs.

Codemods:
$ codemod --extensions h,hpp,c,cpp,cc "([a-zA-Z0-9_]+)\\.IsTensorType\\(" "BlobIsTensorType(\\1, "
$ codemod --extensions h,hpp,c,cpp,cc "([a-zA-Z0-9_]+)->IsTensorType\\(" "BlobIsTensorType(*\\1, "
$ codemod --extensions h,hpp,c,cpp,cc "([a-zA-Z0-9_]+)\\.GetMutableTensor\\(" "BlobGetMutableTensor(\\1, "
$ codemod --extensions h,hpp,c,cpp,cc "([a-zA-Z0-9_]+)->GetMutableTensor\\(" "BlobGetMutableTensor(*\\1, "

It is, however, not only these codemods, because regex-based refactoring was only able to match a small number of the call sites. To catch more, I would've needed an AST-aware tool like clangr, which I didn't figure out how to use.

Reviewed By: ezyang

Differential Revision: D9979976

fbshipit-source-id: 2ea17724e223b5b73b44f99362727759ca689e61
2018-09-24 22:57:05 -07:00
Jerry Zhang
3ae6ee4ebd Move CreateContext to global registry (#11688)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/11688

As a first step to remove the static context (merge it with the allocator), we'll create a
global registry for context constructors, and remove the CreateContext function from tensor.

Reviewed By: ezyang, dzhulgakov

Differential Revision: D9779821

fbshipit-source-id: 8b239ea50af7a0556fde2382f58f79194f0e3dc1
2018-09-24 17:07:50 -07:00