pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Michael Suo	3b2844eeea	Make CompilationUnit own Functions (#22202 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22202 ghimport-source-id: de6c963af1df76d2d6357155e64a5913ab879f76 Test Plan: Imported from OSS Differential Revision: D15998761 Pulled By: suo fbshipit-source-id: 5414a6424953738d823b265d20dc67dde6e5b2d8	2019-07-04 17:12:00 -07:00
Wanchao Liang	799633e4cd	move casting ops from prim to aten Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22275 Test Plan: Imported from OSS Differential Revision: D16060597 Pulled By: wanchaol fbshipit-source-id: a11d8ad3b037e15bd670cc7cd3fefd4f0abd0bba	2019-07-03 22:22:28 -07:00
Sebastian Messmer	e68dc899d1	Fix compiler warnings (#22162 ) Summary: Fix various compiler warnings Pull Request resolved: https://github.com/pytorch/pytorch/pull/22162 Differential Revision: D16085339 Pulled By: smessmer fbshipit-source-id: d36a4b334315f1a5942cac46443a7d166ca36d0d	2019-07-02 14:12:55 -07:00
Sebastian Messmer	6d5871300b	Use concrete types on call sites for Dict/List (#22004 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22004 In future, we want all dicts/lists to store information about the types they contain. This is only possible if the creation API doesn't allow creating lists/dicts without type information. This diff removes some call sites that don't specify type information and have it specify type information. Reviewed By: dzhulgakov Differential Revision: D15906387 fbshipit-source-id: 64766a2534b52c221e8a5501a85eaad13812e7bd	2019-07-02 11:52:35 -07:00
xzhu1900	f0f2331a1c	Add support for cross-chunk shuffling in ChunkDataset (#22347 ) Summary: This change adds one advanced support for cross-chunk shuffling. For training with static dataset, the default configuration is at user's disposal. However, in some user cases, over each epoch, new data is added to the current dataset, thus the dataset's size is dynamically changing/increasing. In order to mix the new data and the old data for better random sampling, one approach is to shuffle examples from more than 1 chunks. This feature is supported with this change. By specifying the `cross_chunk_shuffle_count_` on construction, advanced user can specify how many chunks to shuffle example from. Pull Request resolved: https://github.com/pytorch/pytorch/pull/22347 Differential Revision: D16081378 Pulled By: zhangguanheng66 fbshipit-source-id: fd001dfb9e66947839adecfb9893156fbbce80d0	2019-07-01 19:13:34 -07:00
Roy Li	6c454ff14c	Stop using Type in Python bindings (#21963 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21963 ghimport-source-id: 4d9d66ba2c8587503d892b67f535cc2a62e2d19e Test Plan: Imported from OSS Differential Revision: D15897423 Pulled By: li-roy fbshipit-source-id: 2dd55ceb80971df7c86545b7bfff733387f13572	2019-06-30 04:11:32 -07:00
xzhu1900	f39b6624ba	ChunkDataset checkpoint support (#21889 ) Summary: When dealing with large scale dataset, it is handy if we can save the dataset status and resume later. Especially in cases where some unexpected crash happens, user don't need to start over the whole dataset from begining. Instead, they can reload it from the last checkpoint. This change adds support for checkpoint save/load logic in ChunkDataset. On ChunkDataset construction, user can specify a file name from which to load the checkpoint. If it is empty, default to start from fresh; otherwise the ChunkDataset will 'fast forward' the chunk sampler to the corresponding checkpoint. The user can also call ChunkDataset::save() to serialize current status to a file, which can be used later. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21889 Differential Revision: D16024582 Pulled By: ailzhang fbshipit-source-id: 1862ab5116f94c9d29da174ce04a91041d06cad5	2019-06-26 22:54:14 -07:00
Sebastian Messmer	de85abf226	Allow default construction of Dict/List (#22084 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22084 For DictPtr/ListPtr, default construction was disallowed because it was ambigious if it's supposed to create an empty list or a nullptr. But since we renamed them to Dict/List, we can now allow default construction without ambiguity. Differential Revision: D15948098 fbshipit-source-id: 942a9235b51608d1870ee4a2f2f0a5d0d45ec6e6	2019-06-25 17:40:48 -07:00
Zachary DeVito	5b87049c66	remove uses of std::shared_ptr<Module> (#21934 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21934 ghimport-source-id: e64ab9096f43749ead3ac5567675b815da295664 Test Plan: Imported from OSS Differential Revision: D15892401 Pulled By: zdevito fbshipit-source-id: 6424139206593ff944556c69d8a54723884eacaf	2019-06-25 13:24:38 -07:00
Nikolay Korovaiko	a256b09ce9	Backout Liveness Tests again :-( Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22100 Differential Revision: D15956214 Pulled By: Krovatkin fbshipit-source-id: 9b0c8ecf5b479bf878ffc31acc416bd8dbfe4b50	2019-06-22 00:18:21 -07:00
James Reed	f7b2778cb1	s/uniqueName/debugName/ (#22096 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22096 ghimport-source-id: 8f1d994b98432942b5beeb10bf6d30e447d51997 Test Plan: Imported from OSS Differential Revision: D15956004 Pulled By: jamesr66a fbshipit-source-id: 319d2d20ef0863249a8a2bdd228b4f792d37bfab	2019-06-21 20:54:53 -07:00
Ailing Zhang	856268c716	Revert D15947873: [JIT] s/uniqueName/debugName Differential Revision: D15947873 Original commit changeset: 31a2b30d0ce9 fbshipit-source-id: ef1c0f120c1835184d8106d176cea58ec6ad40b7	2019-06-21 18:51:03 -07:00
James Reed	36e4b54420	s/uniqueName/debugName (#22048 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/22048 ghimport-source-id: a82d80ceec1d8055ce4cf62df10ade4a224109f8 Test Plan: Imported from OSS Differential Revision: D15947873 Pulled By: jamesr66a fbshipit-source-id: 31a2b30d0ce911edf5791ca10040a1e968750b06	2019-06-21 17:59:38 -07:00
Nikolay Korovaiko	f164c01f9c	Adding liveness test cases back Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21762 Differential Revision: D15943509 Pulled By: Krovatkin fbshipit-source-id: 4b65bf63ab15a2347da5f7269cc0f2dbb226b330	2019-06-21 15:09:09 -07:00
Nikolay Korovaiko	a3fc6ed046	Hook up liveness into profiling pipeline. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21881 Differential Revision: D15931627 Pulled By: Krovatkin fbshipit-source-id: dc825a563c7aceb5f66a2ed2a600d550b70941b2	2019-06-20 21:23:16 -07:00
Sebastian Messmer	275087383b	ListPtr->List DictPtr->Dict step 2 (#21937 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21937 This changes call sites to use the new naming scheme Reviewed By: zdevito Differential Revision: D15892404 fbshipit-source-id: 8d32aa90a0ead1066688166478f299fde9c2c133	2019-06-19 18:02:05 -07:00
James Reed	dd046bef8d	NamedTuple serialization (#21839 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21839 ghimport-source-id: b9d82018fbf26b22d58cad3a033cbfe4e879a8fe Test Plan: Imported from OSS Reviewed By: zdevito Differential Revision: D15860002 Pulled By: jamesr66a fbshipit-source-id: 0fc97c4adefa9ae4937f21179c7afa817f4099e5	2019-06-19 10:43:55 -07:00
Michael Suo	4f75da3b41	change ClassType::compilation_unit to return owning ptr (#21787 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21787 ghimport-source-id: eed7b98b0f02745066164b8ef3906291931e2ecb Test Plan: Imported from OSS Differential Revision: D15831353 Pulled By: suo fbshipit-source-id: 50695c35dba8ffea710cbc9aca8aba6a75512fa0	2019-06-16 02:37:07 -07:00
peter	794ee6d00c	Switch to out-source builds for LibTorch Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21772 Differential Revision: D15839332 Pulled By: yf225 fbshipit-source-id: 017cf61c5682c6a8ffeaf2ca952e1418c27be30e	2019-06-14 21:00:18 -07:00
davidriazati	220efdbdc4	Refactor pybind_utils.h (#21550 ) Summary: This refactors pybind_utils so we can have all our type-inferring stuff in 1 place (e.g. for #21379) There is some follow up work to make the error messages better, but I think that's fine to save for another PR. ](https://our.intern.facebook.com/intern/diff/15727002/) Pull Request resolved: https://github.com/pytorch/pytorch/pull/21550 Pulled By: driazati Differential Revision: D15727002 fbshipit-source-id: a6974f2e1e5879f0503a18efc138da31cda7afa2	2019-06-14 17:27:45 -07:00
Nikolay Korovaiko	a85305fdea	Hook up profiled execution in the interpreter (#21799 ) Summary: Rebasing https://github.com/pytorch/pytorch/pull/21616 onto master Pull Request resolved: https://github.com/pytorch/pytorch/pull/21799 Differential Revision: D15832854 Pulled By: Krovatkin fbshipit-source-id: 88d754446df2abc25ea86e46764848d48ee3a5fc	2019-06-14 16:56:13 -07:00
James Reed	4bcc72fe95	Support for NamedTuple (#21428 ) Summary: Resolves https://github.com/pytorch/lockdown/issues/18 This implements NamedTuple by taking advantage of the existing `names` field in `TupleType`. TODO: This currently doesn't retain the NamedTuple-ness through serialization. Discussed with suo offline, we can probably make a way to define an anonymous NamedTuple in script (e.g. `NamedTuple('Foo', [('a', int), ('b', float), ('c', List[float])])` and serialize that TODO: implement support for calling the constructor with kwargs Pull Request resolved: https://github.com/pytorch/pytorch/pull/21428 Differential Revision: D15741564 Pulled By: jamesr66a fbshipit-source-id: c077cbcea1880675ca6deb340a9ec78f824a136c	2019-06-14 16:45:56 -07:00
Mikhail Zolotukhin	fbecb4621f	schema_matching.cpp: improve error messages. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21141 Differential Revision: D15808354 Pulled By: ZolotukhinM fbshipit-source-id: 16d938fd5acafb445a0c433cabc9a55cab563165	2019-06-13 17:04:38 -07:00
Will Feng	d3b3cbe26e	Revert D15769066: [pytorch][PR] schema_matching.cpp: improve error messages. Differential Revision: D15769066 Original commit changeset: 5853e0360581 fbshipit-source-id: ac6fa8429136abf4c7835919009f936eea11ea7b	2019-06-12 20:17:38 -07:00
Karl Ostmo	49481d576d	Torch rename (#20774 ) Summary: This renames the CMake `caffe2` target to `torch`, as well as renaming `caffe2_gpu` to `torch_gpu` (and likewise for other gpu target variants). Many intermediate variables that don't manifest as artifacts of the build remain for now with the "caffe2" name; a complete purge of `caffe2` from CMake variable names is beyond the scope of this PR. The shell `libtorch` library that had been introduced as a stopgap in https://github.com/pytorch/pytorch/issues/17783 is again flattened in this PR. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20774 Differential Revision: D15769965 Pulled By: kostmo fbshipit-source-id: b86e8c410099f90be0468e30176207d3ad40c821	2019-06-12 20:12:34 -07:00
Nikolay Korovaiko	e9121e27ce	remove liveness tests Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21719 Differential Revision: D15797628 Pulled By: Krovatkin fbshipit-source-id: 87742bdde0b05aff4341ababb1f55c51991768ec	2019-06-12 19:04:41 -07:00
Nikolay Korovaiko	8dd670657b	Liveness for BailOut graphs Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21615 Differential Revision: D15793434 Pulled By: Krovatkin fbshipit-source-id: d89f1bf61ea57a1e3b75f8e2b200c27beb8b46cf	2019-06-12 17:22:33 -07:00
Sebastian Messmer	b527e48588	Use c10::List (#21177 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21177 - Integrate c10::ListPtr into IValue and the c10 dispatcher. - Streamline conversion to/from IValue. Before, we had IValue::to<> and kernel_functor.h had its own ivalue_to_arg_type and return_type_to_ivalue. They are now unified. Also, this means that nested types like Dicts of Lists of Optional of Dict of ... do work as expected now Differential Revision: D15476433 fbshipit-source-id: bde9df80df20091aa8e6ae17ba7e90abd149b954	2019-06-12 13:58:24 -07:00
Mikhail Zolotukhin	96910251e0	schema_matching.cpp: improve error messages. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21141 Differential Revision: D15769066 Pulled By: ZolotukhinM fbshipit-source-id: 5853e0360581c44e42b068add3bf2bc68e671b2b	2019-06-12 12:43:12 -07:00
Will Feng	8cc8e15473	Back out "[pytorch][PR] [Re-landing] Fix caffe2 windows CI for new Windows AMI" (#21670 ) Summary: Original commit changeset: e65c1d6bfcc9 Pull Request resolved: https://github.com/pytorch/pytorch/pull/21670 Differential Revision: D15776087 Pulled By: yf225 fbshipit-source-id: cbb55cbbcb133cae1aeb2fe75cc52e7350cc6c88	2019-06-12 10:37:19 -07:00
Michael Suo	a436822c40	Consider contained types in alias analysis (#21431 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21431 ghimport-source-id: d86ce974a065ec572e71cfa14a8f6bdf48216da7 Reviewed By: jamesr66a Differential Revision: D15718560 Pulled By: suo fbshipit-source-id: a36ce907ab26be22f12bab6175797fe8b34721f1	2019-06-10 12:42:10 -07:00
Nikolay Korovaiko	30d6933016	BailOut Graphs Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21381 Differential Revision: D15724412 Pulled By: Krovatkin fbshipit-source-id: 18e4a1916c7cd1baea76953d0087d6257e58c55b	2019-06-10 11:49:38 -07:00
Elias Ellison	e4fae884f6	Change compiler to use Load/Stores, then transform to SSA (#21101 ) Summary: This changes our compiler so it first emits Loads & Stores, and then transforms the graph to SSA in a follow up pass. When a variable is set, we emit a prim::Store, and when a variable is referenced, we emit a prim::Load. ``` a = 1 print(a) ``` becomes: ``` %a.1 : int = prim::Constant[value=1]() prim::Store[name="a"](%a.1) %a : int = prim::Load[name="a"]() prim::Print(%a) ``` In the follow up pass, convertToSSA, the values are turned into SSA form with the Loads & Stores removed. This change will enable breaks and continues because you can transform the graph with the variable naming information still intact. There are still some remaining jitter and edge cases issues that I have to look through, but I think is still ready for eview. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21101 Differential Revision: D15723353 Pulled By: eellison fbshipit-source-id: 3269934d4bc24ddaf3a87fdd20620b0f954d83d0	2019-06-10 10:26:43 -07:00
Ilia Cherniavskii	6fc702f384	Per-callback sampling (#21394 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21394 ghimport-source-id: 2607c7b456031a1ddb19fabc3b6fe2585c276d76 Differential Revision: D15639723 Pulled By: ilia-cher fbshipit-source-id: 938d02c1daf5bec5afa5d3cd021d2dae7e7160ce	2019-06-06 13:46:48 -07:00
peter	bb788631ce	Fix caffe2 windows CI for new Windows AMI (#21452 ) Summary: The alternative of #21410. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21452 Differential Revision: D15701767 Pulled By: ezyang fbshipit-source-id: e65c1d6bfcc98e88460f4a57e5b99c2f395c0ceb	2019-06-06 13:46:45 -07:00
Shen Li	b7b6b612a7	Fix C++ data parallel (#20910 ) Summary: Fixes #19540 CC nmerrill67 C++ data parallel was using Module.clone() to create module replicas on every destination device. However, clone() does not set up gradient edges to point from replicas to the original module. As a result, the gradient will not be aggregated into the original module. This commit fixes the the problem by manually setting gradient edges from every parameter X in every replica to the same parameter X in the original module. ## Failed Attempt Initially I tried implementing what we did in `replicate.py`, which 1. create module replicas 2. use Python `Broadcast` autograd function to broadcast every parameter in the original module to all destination devices. 3. assign the broadcast result params to module replicas' `_parameters` dict. This works in Python because derived module member field params (e.g., `Linear.weight`) and base module `_parameters` (e.g., `Linear._parameters['weight']`) are referencing the same parameter instance. Assigning one of them will apply to both. However, in C++, even though I can modify Module's `parameters_ `values and gradient edges to point to the broadcast source, I cannot touch the weight and bias member fields in Linear, because replicate cannot (and should not) add special-case handlers to every different module. (See `Linear` [.h](https://github.com/pytorch/pytorch/blob/master/torch/csrc/api/include/torch/nn/modules/linear.h), [.cpp](https://github.com/pytorch/pytorch/blob/master/torch/csrc/api/src/nn/modules/linear.cpp)) Although they initially point to the same `TensorImpl` instance, after assigning to `Module.parameters_['weight']`, it will be different from `Linear.weight`. ## Solution Options gchanan and I had several discussions on this issue and figured two solutions to this problem. ### Option One [implemented in this PR] Replicate the module in two steps: 1. call `Module.clone()` to create a module replica on every destination device. 2. manually setting gradient edges from every parameter in every replica to the same parameter in the original module. * Pro: Does not need to change any existing module, and relatively easier to implement * Con: It is a little hackish. ### Options Two Implement a `Replicatable` class (similar to `Cloneable`), and make it a friend class of `Module`. For more details see `Note [Replicating Modules]` in the code change. * Pro: Maybe this aligns more with our existing approach implemented in `Cloneable`? * Con: Require changes to every existing module. I am inclined to go with option one, because `replicate` will only be used on data parallel. I feel it is too big an overkill if we have to change all existing module implementations due to a data parallel requirement. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20910 Differential Revision: D15556426 Pulled By: mrshenli fbshipit-source-id: aa836290ec657b32742e2bea80bd0ac2404ef3b0	2019-06-06 11:57:31 -07:00
Will Feng	c8083e0292	Include named_any.h in modules.h (#21437 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/19462. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21437 Differential Revision: D15684880 Pulled By: yf225 fbshipit-source-id: db23c7e4e0f62d22b0b6c18f15420c3bb66af366	2019-06-06 09:57:33 -07:00
Michael Suo	403ca41142	make analyzeConservative more conservative (#21227 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21227 ghimport-source-id: cac97ba20cb020f3edc4e83e7641201f0826f40a Reviewed By: jamesr66a Differential Revision: D15592316 Pulled By: suo fbshipit-source-id: b311f73a5d81d6d0b0331678b6a625e446588ebd	2019-06-04 15:09:46 -07:00
Nikolay Korovaiko	21113c2d36	EliminateGuards Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21070 Differential Revision: D15603561 Pulled By: Krovatkin fbshipit-source-id: 03056688e8b99eddcb30d80cc20ab37ad3f13af2	2019-06-03 09:39:45 -07:00
James Reed	619261d7a7	Add file-line info for jit.load and string frontend (#21217 ) Summary: This makes file-line reporting also work for things loaded using `torch.jit.load()` as well as the string frontend (via `CompilationUnit`) Pull Request resolved: https://github.com/pytorch/pytorch/pull/21217 Differential Revision: D15590838 Pulled By: jamesr66a fbshipit-source-id: 6b6a12574bf9eca0b83f24f0b50535fda5863243	2019-05-31 23:43:15 -07:00
Sebastian Messmer	834d678eb8	Remove old custom op implementation (#21085 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21085 Now that torch::jit::RegisterOperators() always passes through to torch::RegisterOperators() (see diffs stacked below this), we can remove the old custom op implementation. Reviewed By: dzhulgakov Differential Revision: D15542261 fbshipit-source-id: ef437e6c71950e58fdd237d6abd035826753c2e4	2019-05-31 13:51:14 -07:00
Sebastian Messmer	384d828ea5	Add aliasAnalysis to torch::RegisterOperators() (#21084 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21084 - Now AliasAnalysisKind can be set using the torch::RegisterOperators() API - This also allows us to remove the last place in torch::jit::RegisterOperators that didn't use c10 yet. Reviewed By: dzhulgakov Differential Revision: D15542097 fbshipit-source-id: ea127ecf051a5c1e567e035692deed44e04faa9e	2019-05-31 13:51:07 -07:00
Owen Anderson	181792176d	Implement various AliasAnalysis operations directly on top of MemoryLocations. (#21203 ) Summary: This reduces DenseNet load time by about 25% (down to 5.3s on my laptop) and gets AliasAnalysis out of the profile top hits entirely. Pull Request resolved: https://github.com/pytorch/pytorch/pull/21203 Differential Revision: D15578155 fbshipit-source-id: ddbb1ad25c9540b5214702830084aa51cc6fd3cb	2019-05-31 13:38:32 -07:00
Michael Suo	b6d1a72f48	improve error message on inferred type (#21058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21058 ghimport-source-id: 7fad3a0567022dd417f4bd079a50a22e3c1dc020 Differential Revision: D15547218 Pulled By: suo fbshipit-source-id: 5dbd567c79e6d01e9af4b8552777f7f0043df5b2	2019-05-30 10:50:34 -07:00
Michael Suo	154029a6ff	Revert D15534670: [jit] improve error message on inferred type Differential Revision: D15534670 Original commit changeset: 8bbfd6e9c1af fbshipit-source-id: fe62cf954292e8ef1d00a3cc569206f73cedcd31	2019-05-29 14:56:08 -07:00
Michael Suo	5dacf6b048	improve error message on inferred type (#21058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21058 ghimport-source-id: e7d6e082b0faf4f3d3e683f2c98863ee269439f0 Differential Revision: D15534670 Pulled By: suo fbshipit-source-id: 8bbfd6e9c1afbc3006d7d55ed633e18618e05021	2019-05-29 14:47:00 -07:00
Wanchao Liang	2cd1c78632	Revert D15523444: [jit] move casting ops from prim to aten Differential Revision: D15523444 Original commit changeset: 642342bf1cce fbshipit-source-id: 29de1c7e19cbb3273230c280346e786e61d2d445	2019-05-29 13:42:05 -07:00
Wanchao Liang	a0111aaf0d	move casting ops from prim to aten (#21002 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/21002 ghimport-source-id: 4c88a54a3ecb76c5ca3c2c328b749350860a166d Differential Revision: D15523444 Pulled By: wanchaol fbshipit-source-id: 642342bf1ccea83c88897bc023979a32ee01addf	2019-05-29 12:36:47 -07:00
Dmytro Dzhulgakov	f2199a34eb	Hook to store additional metadata about environment (#20863 ) Summary: In larger system environment, there's usually a need to store some information about how the model was created (e.g. from which process, workflow, by which user, etc). It's almost like JPEG metadata written by camera. This PR adds a low-level c++ hook to allow population of additional files in zip container based on environment. The reason to have it a low-level hook instead of top-level API wrapper (e.g. `m.save_with_metadata`) is to capture all usages of the saving API transparently for user. Let me know if there are concerns. Pull Request resolved: https://github.com/pytorch/pytorch/pull/20863 Differential Revision: D15487941 Pulled By: dzhulgakov fbshipit-source-id: 120c5a4c9758aa82846bb51a1207f923e3da1333	2019-05-29 10:11:58 -07:00
Thomas Viehmann	17941f9979	JIT: Eliminate SumToSize by using Optional Lists (#18697 ) Summary: This PR is a eliminates unneeded grad_sum_to_size and in particular speeds up the LSTM backward by allowing better fusion. It consists of two parts: - In AutoDiff, record broadcasting sizes only if the broadcast output size is different from the input size, otherwise record None. - The specialization of Optional arguments (#18407) allows us to then eliminate ` _grad_sum_to_size(t, None)` in the peephole optimization step. Thus, in the LSTM case, no SumToSize remain in the crucial fusion group. The trick here is that we can specialize on the runtime information from the forward. I'm testing that different broadcasting situations lead to different graphs. I didn't move all symbolic_script _grad_sum_to_size to the new logic, but it might be better to do this incrementally, anyway. Pull Request resolved: https://github.com/pytorch/pytorch/pull/18697 Differential Revision: D15482076 Pulled By: wanchaol fbshipit-source-id: 7f89367e35b8729910077c95c02bccefc8678afb	2019-05-24 11:24:17 -07:00

1 2 3 4 5 ...

353 Commits