pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
krshrimali	ef40757de3	OpInfo: `zero_` (#58731 ) Summary: See https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58731 Reviewed By: ngimel Differential Revision: D28784083 Pulled By: mruberry fbshipit-source-id: f06de8045afd3728b1fedc014c091d8fd1955a9f	2021-05-30 21:49:29 -07:00
kshitij12345	445e838210	OpInfo: resize_, resize_as_ (#59176 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59176 Reviewed By: ngimel Differential Revision: D28780083 Pulled By: mruberry fbshipit-source-id: 472584e8faa4cb1031908df097849d2d4167fdf5	2021-05-30 18:53:17 -07:00
kshitij12345	d68df54269	OpInfo: fill_ (#59138 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/59138 Reviewed By: ngimel Differential Revision: D28776451 Pulled By: mruberry fbshipit-source-id: 2e8e9f1805ec7d900223ea749a4a0b86a1bedb54	2021-05-29 00:35:02 -07:00
kshitij12345	c9af4c2636	OpInfo: where (#58349 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58349 Reviewed By: mrshenli Differential Revision: D28744220 Pulled By: mruberry fbshipit-source-id: 893a2fb88a48a60df75c7d6e2f58a42ca949daa7	2021-05-28 18:22:03 -07:00
kshitij12345	f9e8dc005a	OpInfo: clone, contiguous (#58390 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/58390 Reviewed By: soulitzer Differential Revision: D28567821 Pulled By: mruberry fbshipit-source-id: bcf42cb4a9a57d8a15a76819b8a9e2df97cf00be	2021-05-22 18:25:31 -07:00
Heitor Schueroff	9ac0bd23a2	Fix bug in test_fx_experimental codegen (#58587 ) Summary: This PR fixes a bug in test_fx_experimental where code generated for ops with kwarg-only Tensor parameters would fail to execute because they would be called as positional parameters. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58587 Reviewed By: ailzhang Differential Revision: D28548365 Pulled By: heitorschueroff fbshipit-source-id: 8f1746053cbad1b11e817b0099db545d8dd22232	2021-05-20 07:49:08 -07:00
Akifumi Imanishi	3113a1de4a	Fix some tensor operators to return `NotImplemented` for invalid inputs (#58216 ) Summary: Same as https://github.com/pytorch/pytorch/issues/57934. (cc/ albanD) Pull Request resolved: https://github.com/pytorch/pytorch/pull/58216 Reviewed By: ailzhang Differential Revision: D28494886 Pulled By: albanD fbshipit-source-id: 380205867ee1cde90e1c6fcfe2a31749e1243530	2021-05-19 13:09:57 -07:00
James Reed	7b73fdf597	[FX] Fix retracing wrapped functions (#58061 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58061 Test Plan: Imported from OSS Reviewed By: yuhc Differential Revision: D28358801 Pulled By: jamesr66a fbshipit-source-id: c7c9a8a80e5bfe1eb1f6d2cf858ac7e57153a860	2021-05-17 19:50:16 -07:00
Shiyan Deng	bcacf91a71	[fx_glow]Add Support for importing quantized linear in FXIRImporter (#57483 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/57483 Pull Request resolved: https://github.com/pytorch/glow/pull/5622 Quantized linear has packed parameters. We want to unpack it so that it would be easier for graph optimization and importer to deal with the weight and bias. A customized remapping function is used to unpack quantized linear and map it to acc_op.linear. Test Plan: `buck test glow/fb/fx/nnpi_importer:test_importer` Reviewed By: gcatron, jfix71, khabinov Differential Revision: D27451237 fbshipit-source-id: e46e961734788fd5333e227ca6143fd37c33204e	2021-05-14 18:48:31 -07:00
Horace He	84d8e3b0f6	[FX] Finished prepare_for_inference API for release (#58293 ) Summary: Added an ability to configure which passes to run. Pull Request resolved: https://github.com/pytorch/pytorch/pull/58293 Reviewed By: bdhirsh Differential Revision: D28435948 Pulled By: Chillee fbshipit-source-id: dfc7f1ef6b38e6f49c2423a5efe8477a645171d0	2021-05-14 14:10:07 -07:00
Alban Desmaison	5e83c62a9e	Revert D28351931: [pytorch][PR] Fix some tensor operators to return `NotImplemented` for invalid inputs Test Plan: revert-hammer Differential Revision: D28351931 (`35521a2629`) Original commit changeset: 985457a44dba fbshipit-source-id: 10724c219e53648f10a70719e25bcf774c6c7852	2021-05-12 13:58:03 -07:00
Akifumi Imanishi	35521a2629	Fix some tensor operators to return `NotImplemented` for invalid inputs (#57934 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/57719. This PR fixes `torch.Tensor{__rsub__, __rdiv__, __rtruediv__, __pow__, __rmatmul__}` to return `NotImplemented` instead of raising a `TypeError`. cc/ mruberry: The first commit of this PR is the same as `1d209db1cc` except for the commit message. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57934 Reviewed By: mruberry Differential Revision: D28351931 Pulled By: albanD fbshipit-source-id: 985457a44dba24d2496794dfb8c1661cbcd4ff8f	2021-05-12 11:03:23 -07:00
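The behaviour described in the entry above (a reflected operator returning `NotImplemented` rather than raising) can be illustrated without PyTorch. This is a minimal sketch of the Python operator protocol only; the `Meters` class is hypothetical and is not taken from the PR.

```python
class Meters:
    """Hypothetical value type, used only to illustrate the protocol."""

    def __init__(self, value):
        self.value = value

    def __rsub__(self, other):
        # Reflected subtraction: Python calls this for `other - self`
        # after `other.__sub__` has declined the operation.
        if not isinstance(other, (int, float)):
            # Decline instead of raising, so the interpreter can try any
            # remaining candidates and raise its own TypeError if none fit.
            return NotImplemented
        return Meters(other - self.value)


result = 5 - Meters(2)               # handled by Meters.__rsub__
declined = Meters(2).__rsub__("x")   # the NotImplemented sentinel, no exception
```

When every candidate method declines, the interpreter itself raises `TypeError`, so callers still see an error for unsupported operands.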
kshitij12345	ff982ef73d	OpInfo: reshape, reshape_as and minor clean-up (#57460 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/57460 Reviewed By: nairbv Differential Revision: D28151675 Pulled By: anjali411 fbshipit-source-id: 2b3bcadab3ff5d1761b2922b63afd70a354e785c	2021-05-12 06:05:21 -07:00
Ilqar Ramazanli	8b816e9010	To implement gradient for Pytorch (#54617 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/56129 Pull Request resolved: https://github.com/pytorch/pytorch/pull/54617 Reviewed By: anjali411 Differential Revision: D28057452 Pulled By: iramazanli fbshipit-source-id: 9bd86679282d34f5e5393e6447121586517eb4f0	2021-05-11 18:52:20 -07:00
kshitij12345	9e6b7e6e6e	OpInfo: expand and expand_as (#57606 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/57606 Reviewed By: albanD Differential Revision: D28249191 Pulled By: mruberry fbshipit-source-id: d985ab4e8a99b116c45953e621092929a9a8028e	2021-05-07 02:50:00 -07:00
kshitij12345	154eca0309	OpInfo: ravel, view, view_as (#56910 ) Summary: Reference: https://github.com/pytorch/pytorch/issues/54261 Pull Request resolved: https://github.com/pytorch/pytorch/pull/56910 Reviewed By: ngimel Differential Revision: D28141867 Pulled By: mruberry fbshipit-source-id: bff49d40d7e3bb36bc83d1405bd77f5529eeffe9	2021-05-02 22:10:36 -07:00
Yukio Siraichi	ce4449918a	Port reverse binary ops to `OpInfo` (#56471 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/54296 Tracking Issue https://github.com/pytorch/pytorch/issues/54261 Summary: - `rsub` (aten function) was already ported - Ported tests for its dunder version: `__rsub__` - Ported tests for the other dunder functions: `__radd__`, `__rmul__`, `__rdiv__`, `__rpow__` Pull Request resolved: https://github.com/pytorch/pytorch/pull/56471 Reviewed By: ngimel Differential Revision: D28142843 Pulled By: mruberry fbshipit-source-id: 3d1bd88a4f124774f48d33a7ca7bfc7f796360df	2021-05-02 16:01:12 -07:00
Horace He	786b0a8091	[FX] fix normalization issues with lists of tensors (#57004 ) Summary: Fixes issue with lists of tensors not being normalized correctly. Pull Request resolved: https://github.com/pytorch/pytorch/pull/57004 Reviewed By: jamesr66a Differential Revision: D28034559 Pulled By: Chillee fbshipit-source-id: f935f0b73a8356acd8a2ae93fcfc0417f0eab224	2021-04-27 20:02:00 -07:00
Heitor Schueroff	57e37080cd	Added OpInfo for torch.einsum (#56276 ) Summary: Adds OpInfo testing for torch.einsum. Pull Request resolved: https://github.com/pytorch/pytorch/pull/56276 Reviewed By: mruberry Differential Revision: D27967095 Pulled By: heitorschueroff fbshipit-source-id: 60524273d2ca885e7eeb932db3e7fd697ae5ca8e	2021-04-27 07:39:38 -07:00
iramazanli	3e006fc57e	Adding hsplit,vsplit and dsplit methods (#53536 ) Summary: Fixes #{issue number} Pull Request resolved: https://github.com/pytorch/pytorch/pull/53536 Reviewed By: albanD Differential Revision: D27938880 Pulled By: iramazanli fbshipit-source-id: f741119517783ec2bafa296622ee518b587dd127	2021-04-26 09:39:09 -07:00
Jordan Fix	4ef8205104	[fx][normalize] Allow for args to be left as args (#55995 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55995 Normalization is kind of broken currently. But making default arguments visible still appears to work, and is nice functionality to still be able to rely on/use. Adds an option to `NormalizeArgs`'s `__init__` called `normalize_to_only_use_kwargs` which defaults to true, which if set to false will keep using the same signature as provided, but additionally set kwargs in kwargs. Test Plan: Added test to `test_fx_experimental`. Reviewed By: 842974287 Differential Revision: D27759448 fbshipit-source-id: 620061fcf46d8549ac70b62aede8b6740aee3778	2021-04-24 08:15:17 -07:00
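As a rough illustration of the two normalization modes described in the entry above, the following sketch uses `inspect.signature` on a plain Python function. It is not the FX implementation: `normalize_call`, `only_kwargs`, and `scale` are hypothetical names, and the sketch assumes a target without `*args` or `**kwargs` parameters.

```python
import inspect


def normalize_call(fn, args, kwargs, only_kwargs=True):
    """Return (args, kwargs) with default values made explicit."""
    sig = inspect.signature(fn)
    bound = sig.bind(*args, **kwargs)
    bound.apply_defaults()  # fill in every parameter the caller omitted
    if only_kwargs:
        # Everything, including former positionals, becomes a keyword.
        return (), dict(bound.arguments)
    # Keep the caller's positional arguments as they were and put the
    # remaining parameters (including defaults) into kwargs.
    positional_names = list(sig.parameters)[: len(args)]
    new_kwargs = {
        name: value
        for name, value in bound.arguments.items()
        if name not in positional_names
    }
    return tuple(args), new_kwargs


def scale(x, factor=2.0, *, bias=0.0):
    return x * factor + bias


all_kwargs = normalize_call(scale, (3,), {})
# -> ((), {'x': 3, 'factor': 2.0, 'bias': 0.0})
keep_args = normalize_call(scale, (3,), {}, only_kwargs=False)
# -> ((3,), {'factor': 2.0, 'bias': 0.0})
```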
Horace He	0df239e550	[FX] Make arg normalization a method on Node and not a pass (also augment tests to be exhaustive) (#55992 ) Summary: Commandeered from https://github.com/pytorch/pytorch/pull/54563 Primary changes from first PR: 1. Refactored primary `normalize_function` logic into `operator_schemas.py` so that non-FX users can use it. 2. Refactored tests a bit, and added a path to call `normalize_function` directly. 3. Moved check for `boolean_dispatch` so that `torch.lu` also gets properly handled. Pull Request resolved: https://github.com/pytorch/pytorch/pull/55992 Reviewed By: mruberry Differential Revision: D27774396 Pulled By: Chillee fbshipit-source-id: 7f65632e1d608e4abd55aec5ccbfdc3f67f52b8e	2021-04-22 03:53:41 -07:00
Jordan Fix	5eadc243f3	Preserve node meta info in split_module (#56212 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56212 The current design doesn't make it easy to use `node.copy()`. Explicitly copy over the node's meta. Test Plan: Updated `test_subgraph_creation` in `test_fx_experimental` Reviewed By: jamesr66a Differential Revision: D27808477 fbshipit-source-id: 7fe7b6428c830307dbd1e395f16fa2774936d3b3	2021-04-16 18:02:50 -07:00
James Reed	2236f43da0	[FX] Put tensor metadata into a NamedTuple in ShapeProp (#55930 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55930 Test Plan: Imported from OSS Reviewed By: ansley Differential Revision: D27741730 Pulled By: jamesr66a fbshipit-source-id: 0a0a1b94beed6c482add9e9551f316f3b4220ab2	2021-04-13 22:21:50 -07:00
Yukio Siraichi	93bf0ae6fc	Remove legacy constructor calls from pytorch codebase. (#54142 ) Summary: Follow up from https://github.com/pytorch/pytorch/issues/53889 Related to https://github.com/pytorch/pytorch/issues/47112 Removing every occurrence of the legacy constructor call present in PyTorch at: - _docs_ - _benchmarks_ - _test_ - _caffe2_ - _CONTRIBUTING.md_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/54142 Reviewed By: ngimel Differential Revision: D27699450 Pulled By: mruberry fbshipit-source-id: 530aa3f5746cc8bc1407d5d51b2bbd8075e30546	2021-04-11 15:45:17 -07:00
Shiyan Deng	43ede4c2e3	Add Per Tensor Quantization Support to FXIRImporter (#55405 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/55405 Pull Request resolved: https://github.com/pytorch/glow/pull/5516 Allows FXIRImporter to import a quantized model. This diff does not include support for per-channel weights, linear, or conv; those will be addressed in the next diff. Test Plan: buck test glow/fb/fx/nnpi_importer:test_importer Reviewed By: jackm321, jfix71 Differential Revision: D27313543 fbshipit-source-id: bf5c96ef5f2ff1835c09db981e0ceefaec56dd5b	2021-04-09 10:49:48 -07:00
James Reed	bcb4583170	[FX] Add a metadata dict to Node and switch shapeprop to use that (#54926 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54926 Test Plan: Imported from OSS Reviewed By: ansley Differential Revision: D27417801 Pulled By: jamesr66a fbshipit-source-id: 68a5155120a235065f58aa64ba1a6a97818dd0c1	2021-03-31 14:36:54 -07:00
Horace He	24bfcd537e	[FX] Added FX prepare_for_inference for Intel CPUs (#53805 ) Summary: Part of https://github.com/pytorch/pytorch/issues/48209 Taken from the docstring: Performs a set of optimization passes to optimize a model for the purposes of inference. Specifically, the passes that are run are:
1. Conv/BN fusion
2. Dropout removal
3. MKL layout optimizations
The third optimization takes a function `use_mkl_heuristic` that's used to determine whether a subgraph should be explicitly run in MKL layout. I implemented 2 heuristics:
1. Runs the subgraph in MKL layout if the subgraph is larger than 2.
2. Benchmarks each subgraph with MKL layout and without, and keeps the subgraph in MKL layout only if it's faster.
### Batch size of 10 and multi-threaded.
Results with the second heuristic are generally as strong as the "jit.freeze" version, except in `densenet` and `vgg`, where it's faster, likely due to the heuristic being better. With the first heuristic, there are some notable gaps, particularly on `inception_v3` and `alexnet`.
```
model         Eager      FX         FX Auto    jit.mkldnn
------------  ---------  ---------  ---------  ----------  -
custom        0.195614   0.14686    0.15929    0.156442    6
resnet18      0.172012   0.114007   0.119678   0.12945     6
resnet50      0.486463   0.294308   0.299518   0.318121    6
densenet161   0.955309   0.893502   0.882798   1.29315     6
inception_v3  0.38454    0.307076   0.239513   0.233083    6
googlenet     0.229388   0.237486   0.170458   0.174106    6
shufflenet    0.0513613  0.0286739  0.0292908  0.0267209   6
alexnet       0.0709602  0.0768137  0.0660831  0.0650399   6
vgg16         1.053993   0.9013264  0.9360212  1.082820    6
mobilenet     0.12264    0.0970935  0.0936568  0.106314    6
mnasnet       0.0989875  0.0412083  0.0424499  0.0472336   6
resnext       0.476811   0.315428   0.314422   0.343156    6
```
For single-threaded (still running...)
```
model         eager      FX         FX auto    mkl        threads
------------  ---------  ---------  ---------  ---------  ---------
custom        0.0401415  0.259863   0.0263152  0.200667   1
resnet18      0.499931   0.382113   0.383711   0.396335   1
resnet50      1.10353    0.911865   0.923645   0.992125   1
densenet161   2.20158    2.39421    2.08204    2.30124    1
inception_v3  0.79161    0.849207   0.703546   0.724492   1
googlenet     0.66896    0.820965   0.515927   0.529414   1
shufflenet    0.0987308  0.0689343  0.0629298  0.0617193  1
alexnet       0.198795   0.19862    0.19325    0.211934   1
vgg16         3.744      3.2499     3.28503    3.31576    1
mobilenet     0.152725   0.14505    0.135555   0.159754   1
mnasnet       0.141983   0.089406   0.089599   0.0956167  1
resnext       1.13778    0.97016    0.955417   0.965376   1
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/53805 Reviewed By: gmagogsfm Differential Revision: D27424611 Pulled By: Chillee fbshipit-source-id: a39137159de962fba7ca15121dfa9e78c1e01223	2021-03-31 10:15:01 -07:00
James Reed	c656a5befa	[FX] Normalize Python operators to `torch.` ops when called with Tensors (#54236 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/54236 Test Plan: Imported from OSS Reviewed By: zdevito Differential Revision: D27149411 Pulled By: jamesr66a fbshipit-source-id: fe9c468f7c84c254dbb1b70163d08b343725861a	2021-03-25 22:27:49 -07:00
James Reed	a27f46bbe3	[FX] Experimental type annotation pass using Python signatures (#53831 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53831 Test Plan: Imported from OSS Reviewed By: suo Differential Revision: D26982804 Pulled By: jamesr66a fbshipit-source-id: 17db9f71e729206f29ee231e34723d9616f128b7	2021-03-17 20:43:17 -07:00
Jordan Fix	1053c96693	[GraphModule] Back out changes to module root version of __init__ (#53791 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53791 Reviewed By: houseroad Differential Revision: D26970869 fbshipit-source-id: 80684516f57fd2d1aca794f17fe488b2fe2b2f64	2021-03-10 23:18:56 -08:00
Jordan Fix	3b0e4a6ed4	[GraphModule] Improve buffer registration during init (#53444 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/53444 GraphModule construction has two options when constructing the base nn.Module: a dict of names to attrs to assign to the GraphModule, or another nn.Module to copy attrs from. - For the dict case, add logic to explicitly register `nn.Tensors` that are not `nn.Parameter` as buffers on the GraphModule, else fall back to `__setattr__`. - For the other `nn.Module` case, update so that it checks in the other module whether the attr to copy in is a buffer, and register it as such, else fall back to `__setattr__`. Test Plan: Added tests for fetching params and buffers from a GraphModule using both dict and module `__init__`s Reviewed By: jamesr66a Differential Revision: D26860055 fbshipit-source-id: 8d9999f91fef20aaa10969558006fc356247591f	2021-03-09 21:05:01 -08:00
Ansley Ussery	85109ce427	Support submodule manipulation in GraphModule (#52358 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/52358 Test Plan: Imported from OSS Reviewed By: jamesr66a Differential Revision: D26759260 Pulled By: ansley fbshipit-source-id: 25d2b9124a7d957704f1700a45dca143aaed391d	2021-03-04 14:52:35 -08:00
Michael Suo	ecf3ca00d8	[fx] Separate globals assignment from code generation (#51974 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51974 Right now, when an FX `Graph` references an external object, we will emit code like: `import foo` `def forward(input: foo.bar.baz): ...` This is problematic in a world with `torch.package`, since then the name `foo.bar.baz` may reference a name from any number of packages. This PR lays the groundwork for FX-package integration by separating the resolution of external references from the generation of the function code. When generating a Graph's Python source, we keep track of all external references and assign them unique names. At the end, we have a dictionary mapping names -> actual objects. This becomes the `globals` namespace we pass to `exec` when installing the forward function in a `GraphModule`. This is nice because we can always be sure that `exec` is seeing the same objects that were referenced from the `Graph`, no import statements needed. At serialization time, we use a `ModuleEnv` to resolve the globals dict to a set of import statements that can be run to reproduce the `globals` namespace. This is only used on serialization/deserialization, and those functions are expected to check that the import statements are producing the correct results. Concretely, the code above will now look like: `from foo.bar import baz as foo_bar_baz` `def forward(input: foo_bar_baz): ...` Test Plan: Imported from OSS Reviewed By: jamesr66a Differential Revision: D26340593 Pulled By: suo fbshipit-source-id: fe247f75205d0a03fd067bdd0f95491e8edf1436	2021-02-23 13:48:03 -08:00
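The separation described in the entry above can be sketched in a few lines of plain Python. This is only an illustration of the idea (unique names, a globals dict passed to `exec`, imports derived from that dict at serialization time). The helpers `generate`, `install`, and `to_imports` are hypothetical and do not correspond to FX or `ModuleEnv` APIs.

```python
import math


def generate(qualname, obj):
    """Emit source that refers to `obj` through a unique global name."""
    unique_name = qualname.replace(".", "_")  # "math.sqrt" -> "math_sqrt"
    globals_ns = {unique_name: obj}           # name -> actual object
    src = f"def forward(x):\n    return {unique_name}(x)\n"
    return src, globals_ns


def install(src, globals_ns):
    """Run the generated source against the recorded objects, no imports."""
    namespace = dict(globals_ns)
    exec(src, namespace)
    return namespace["forward"]


def to_imports(globals_ns):
    """Serialization-time view: import lines that rebuild the globals dict."""
    return [
        f"from {obj.__module__} import {obj.__name__} as {name}"
        for name, obj in globals_ns.items()
    ]


src, globals_ns = generate("math.sqrt", math.sqrt)
forward = install(src, globals_ns)
imports = to_imports(globals_ns)  # ['from math import sqrt as math_sqrt']
```

Because `exec` receives the recorded objects directly, the generated function does not depend on what an `import` would resolve to in the current environment.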
James Reed	f7a3634466	[WIP][FX] Normalize torch.nn.functional calls (#51816 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51816 Test Plan: Imported from OSS Reviewed By: Chillee Differential Revision: D26290764 Pulled By: jamesr66a fbshipit-source-id: 9c05ff1b7c6f0ab8a13516f7cc2fe279980ebe5d	2021-02-17 15:18:03 -08:00
James Reed	a1c5eba4bd	[FX] Move some heavily used passes out of experimental (#51392 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51392 Test Plan: Imported from OSS Reviewed By: Chillee Differential Revision: D26161172 Pulled By: jamesr66a fbshipit-source-id: 04bfe606555bdf1988f527231d4de2e0196e6b37	2021-02-01 19:02:26 -08:00
Garret Catron	0e8e739a9f	Move AcceleratedGraphModule out of graph_manipulation. (#51220 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51220 testing with OS this time... Reviewed By: jfix71 Differential Revision: D26105140 fbshipit-source-id: b4b7a8f0f4cc8f96f9f8b270277a71061d5e5e84	2021-01-28 02:39:12 -08:00
Nikita Shulga	57484103be	Revert D25675618: Move AcceleratedGraphModule out of graph_manipulation. Test Plan: revert-hammer Differential Revision: D25675618 (`c8a24ebe54`) Original commit changeset: 55636bb2d3d6 fbshipit-source-id: 7b196f7c32830061eca9c89bbcb346cdd66a211e	2021-01-26 15:31:18 -08:00
Garret Catron	c8a24ebe54	Move AcceleratedGraphModule out of graph_manipulation. Test Plan: buck test //caffe2/test:test_fx_experimental buck test //glow/fb/fx_nnpi_importer:test_importer Reviewed By: jfix71 Differential Revision: D25675618 fbshipit-source-id: 55636bb2d3d6102b400f2044118a450906954083	2021-01-26 12:39:49 -08:00
Meghan Lele	11cdb910b4	[fx] Add matrix multiplication fusion pass (#50151 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50151 Summary This commit adds a graph transformation pass that merges several matrix multiplications that use the same RHS operand into one large matrix multiplication. The LHS operands from all of the smaller matrix multiplications are concatenated together and used as an input in the large matrix multiply, and the result is split in order to obtain the same products as the original set of matrix multiplications. Test Plan This commit adds a simple unit test with two matrix multiplications that share the same RHS operand. `python test/test_fx_experimental.py -k merge_matmul -v` Test Plan: Imported from OSS Reviewed By: ngimel Differential Revision: D25809409 Pulled By: SplitInfinity fbshipit-source-id: fb55c044a54dea9f07b71aa60d44b7a8f3966ed0	2021-01-06 21:49:37 -08:00
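The transformation described in the entry above rests on a simple identity: stacking the LHS operands row-wise, multiplying once by the shared RHS, and slicing the result gives the same products as the separate multiplications. The sketch below checks that identity on nested lists. It is not the FX pass itself, and `matmul` and `merged_matmul` are hypothetical helper names.

```python
def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merged_matmul(lhs_list, rhs):
    """Compute every lhs @ rhs with one large multiplication."""
    # Concatenate all LHS operands along the row dimension.
    stacked = [row for lhs in lhs_list for row in lhs]
    product = matmul(stacked, rhs)
    # Split the result back into one block per original LHS operand.
    outputs, start = [], 0
    for lhs in lhs_list:
        outputs.append(product[start:start + len(lhs)])
        start += len(lhs)
    return outputs


a = [[1, 2], [3, 4]]
b = [[5, 6]]
rhs = [[1, 0, 2], [0, 1, 3]]
merged = merged_matmul([a, b], rhs)
separate = [matmul(a, rhs), matmul(b, rhs)]
```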
Natalia Gimelshein	ad7d208ba5	Revert D25239967: [fx] Add matrix multiplication fusion pass Test Plan: revert-hammer Differential Revision: D25239967 (`9b7f3fa146`) Original commit changeset: fb99ad25b7d8 fbshipit-source-id: 370167b5ade8bf2b3a6cccdf4290ea07b8347c79	2021-01-05 23:22:26 -08:00
Meghan Lele	9b7f3fa146	[fx] Add matrix multiplication fusion pass (#50120 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50120 This commit adds a graph transformation pass that merges several matrix multiplications that use the same RHS operand into one large matrix multiplication. The LHS operands from all of the smaller matrix multiplications are concatenated together and used as an input in the large matrix multiply, and the result is split in order to obtain the same products as the original set of matrix multiplications. Test Plan: This commit adds a simple unit test with two matrix multiplications that share the same RHS operand. `buck test //caffe2/test:fx_experimental` Reviewed By: jamesr66a Differential Revision: D25239967 fbshipit-source-id: fb99ad25b7d83ff876da6d19dc4abd112d13001e	2021-01-05 19:37:08 -08:00
Shiyan Deng	107c31f2f5	Add a pass to fetch attributes of nn.Module to fx.node (#47935 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/47935 Fetch the parameters that are needed for lowering from nn.Module to fx.node for leaf_modules. Test Plan: A test `test_fetch` is added to test_fx_experimental.py. Reviewed By: jfix71 Differential Revision: D24957142 fbshipit-source-id: a349bb718bbcb7f543a49f235e071a079da638b7	2020-12-08 18:06:37 -08:00
Wang Xu	6000481473	add a unit test for large node error (#48938 ) Summary: add a unit test to test the situation where a node is too large to fit into any device Pull Request resolved: https://github.com/pytorch/pytorch/pull/48938 Reviewed By: zhangguanheng66 Differential Revision: D25402967 Pulled By: scottxu0730 fbshipit-source-id: a2e2a3dc70d139fa678865ef03e67fa57eff4a1d	2020-12-08 14:45:44 -08:00
Wang Xu	799b700ada	add a unit test for lack of devices (#48858 ) Summary: add a unit test for the situation where the devices do not have enough memory Pull Request resolved: https://github.com/pytorch/pytorch/pull/48858 Reviewed By: malfet, gcatron Differential Revision: D25341254 Pulled By: scottxu0730 fbshipit-source-id: c0524c22717b6c8afd67f5b0ad0f1851b973e4b7	2020-12-05 06:09:04 -08:00
Horace He	092e52a4da	[fx]added prototype of to_folder (#47544 ) Summary: What this does is that given a `FxModule foo`, you can call `foo.to_folder('foo_folder', 'Foo')` and dump the current FX module into runnable Python code. That is
```
foo = <fxModule>
foo = foo.to_folder('bar', 'Foo')
from bar import Foo
foo2 = Foo()
forall x, foo2(x) == Foo(x)
```
This has several use cases, largely lifted from jamesr66a's doc here: https://fb.quip.com/U6KHAFaP2cWa (FB-internal).
1. As we apply more heavy-weight function transformations with FX, figuring out what's going on can be quite a difficult experience. In particular, things that can typically be used for debugging (like `print` or `import pdb; pdb.set_trace()`) no longer work. This is particularly necessary if you're using an FX transform like `grad` or `vmap`. With this, you simply open up the dumped file, and add `print`/`pdb` statements wherever you'd like.
2. This also provides an immense amount of user control. Some potential use-cases:
- Let's say an existing FX transform has some bug, or generates suboptimal code. Instead of needing to modify that FX transform, writing another FX pass that fixes the suboptimal code, or simply giving up on FX, they can work around it by simply modifying the resulting code themselves.
- This allows users to check their FX modules into source control.
- You could even imagine using this as part of some code-gen type workflow, where you write a function, `vmap` it to get the function you actually want, and then simply copy the output of the `vmap` function without needing FX at all in the final code.
An example:
```python
import torch
from torch import fx, nn

class Test(nn.Module):
    def __init__(self):
        super(Test, self).__init__()
        self.W = torch.nn.Parameter(torch.randn(2))
        self.linear = nn.Linear(2, 2)
        self.attr = torch.randn(2)
        self.attr2 = torch.randn(2)

    def forward(self, x):
        return self.linear(self.W + (self.attr + self.attr2) + x)

mod = fx.symbolic_trace(Test())
mod.to_folder('foo', 'Foo')
```
results in
```python
import torch
class Foo(torch.nn.Module):
    def __init__(self):
        super().__init__()
        state_dict = torch.load('foo/state_dict.pt')
        self.linear = torch.load('foo/linear.pt') # Linear(in_features=2, out_features=2, bias=True)
        self.__tensor_constant0 = state_dict['__tensor_constant0']
        self.W = torch.nn.Parameter(state_dict['W'])

    def forward(self, x):
        w = self.W
        tensor_constant0 = self.__tensor_constant0
        add_1 = w + tensor_constant0
        add_2 = add_1 + x
        linear_1 = self.linear(add_2)
        return linear_1
```
Some current issues:
1. How do you actually ... save things like modules or parameters? I don't think FX is in the business of tracking initializations and such. Thus, the only way I see to do it is to dump the parameters/modules as blobs, and then load them in the generated initialization. This is a somewhat subpar user experience, and perhaps prevents it from being used in some cases (i.e., you would need to check the blobs into source control to save the model).
2. Currently, the only "atomic" modules we have are those in `torch.nn`. However, if we want to allow flexibility in this, and for example, allow "atomic" modules that are user-defined, then it's not clear how to allow those to be dumped in a way that we can then load elsewhere.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47544 Reviewed By: jamesr66a Differential Revision: D25232917 Pulled By: Chillee fbshipit-source-id: fd2b61a5f40e614fc94256a2957ed1d57fcf5492	2020-12-04 18:33:27 -08:00
Wang Xu	9af627fda1	fix some typos in the fx ir test_fx_experiemntal (#48847 ) Summary: fix some typos in test_fx_experimental.py Pull Request resolved: https://github.com/pytorch/pytorch/pull/48847 Reviewed By: malfet, gcatron Differential Revision: D25339391 Pulled By: scottxu0730 fbshipit-source-id: 388d9da94259d2b306d59f3f4a167e486ac06d60	2020-12-04 12:18:36 -08:00
Wang Xu	7a59a1b574	add aot_based_partition (#48336 ) Summary: This PR adds support for AOT-based partitioning. Given each node and its corresponding partition id, it generates the partitions, submodules, and DAG. Pull Request resolved: https://github.com/pytorch/pytorch/pull/48336 Reviewed By: gcatron Differential Revision: D25226899 Pulled By: scottxu0730 fbshipit-source-id: 8afab234afae67c6fd48e958a42b614f730a61d9	2020-11-30 19:11:02 -08:00
Horace He	0a3db1d460	[FX] Prototype Conv/BN fuser in FX (#47657 ) Summary: Some interesting stuff going on. All benchmarks are tested with both my implementation as well as the current quantized fuser. For these benchmarks, things like using MKLDNN/FBGEMM make a big difference.
## Manual compilation (everything turned off)
In the small case, things look good
```
non-fused: 1.174886703491211
fused: 0.7494957447052002
```
However, for `torchvision.resnet18`, we see
```
non-fused: 1.2272708415985107
fused: 3.7183213233947754
```
This is because Conv (no bias) -> Batch Norm is actually faster than Conv (bias) if you don't have any libraries...
## Nightly (CPU)
```
Toy
non-fused: 0.45807552337646484
fused: 0.34779977798461914
resnet18
non-fused: 0.14216232299804688
fused: 0.13438796997070312
resnet50
non-fused: 0.2999534606933594
fused: 0.29364800453186035
densenet161
non-fused: 0.6558926105499268
fused: 0.6190280914306641
inception_v3
non-fused: 1.2804391384124756
fused: 1.181272029876709
```
with MKLDNN. We see a small performance gain across the board, with more significant performance gains for smaller models.
## Nightly (CUDA)
```
M
non-fused: 1.2220964431762695
fused: 1.0833759307861328
resnet18
non-fused: 0.09721899032592773
fused: 0.09089207649230957
resnet50
non-fused: 0.2053072452545166
fused: 0.19138741493225098
densenet161
non-fused: 0.6830024719238281
fused: 0.660109281539917
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/47657 Reviewed By: eellison Differential Revision: D25127546 Pulled By: Chillee fbshipit-source-id: ecdf682038def046045fcc09faf9aeb6c459b5e3	2020-11-20 18:51:32 -08:00
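The commit message above does not spell out the folding arithmetic. The sketch below uses the standard inference-time identity for fusing a batch norm into the preceding convolution, shown on a 1x1 convolution at a single pixel so that it stays library-free. All function names are hypothetical, and a convolution without bias is modelled by passing zeros.

```python
import math


def conv1x1(weight, bias, x):
    """One output value per channel: dot(weight[c], x) + bias[c]."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weight, bias)]


def batch_norm(y, mean, var, gamma, beta, eps=1e-5):
    """Inference-mode batch norm using fixed running statistics."""
    return [
        g * (yi - m) / math.sqrt(v + eps) + bt
        for yi, m, v, g, bt in zip(y, mean, var, gamma, beta)
    ]


def fold_bn(weight, bias, mean, var, gamma, beta, eps=1e-5):
    """Return (weight', bias') so that conv' alone equals conv followed by BN."""
    fused_w, fused_b = [], []
    for row, b, m, v, g, bt in zip(weight, bias, mean, var, gamma, beta):
        scale = g / math.sqrt(v + eps)            # per-output-channel scale
        fused_w.append([w * scale for w in row])
        fused_b.append((b - m) * scale + bt)
    return fused_w, fused_b


weight = [[0.5, -1.0], [2.0, 0.25]]
bias = [0.0, 0.0]                                 # "Conv (no bias)" case
mean, var = [0.1, -0.2], [1.5, 0.7]
gamma, beta = [1.2, 0.8], [0.05, -0.3]
x = [3.0, -4.0]

reference = batch_norm(conv1x1(weight, bias, x), mean, var, gamma, beta)
fused_w, fused_b = fold_bn(weight, bias, mean, var, gamma, beta)
fused = conv1x1(fused_w, fused_b, x)
```

Note that the fused convolution always carries a bias, even when the original did not, which is the situation the benchmark discussion above refers to.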
Wang Xu	4b56aef05d	add kl_based_partition (#48197 ) Summary: This is a partition search based on the Kernighan-Lin algorithm. First, the graph is partitioned using size_based_partition, then nodes from different partitions are swapped until the cost reaches a minimum. Pull Request resolved: https://github.com/pytorch/pytorch/pull/48197 Reviewed By: gcatron Differential Revision: D25097065 Pulled By: scottxu0730 fbshipit-source-id: 3a11286bf4e5a712ab2848b92d0b98cd3d6a89be	2020-11-19 17:38:25 -08:00
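The swap-until-no-improvement loop described in the entry above can be sketched as follows. This is a simplified greedy pairwise-swap refinement in the spirit of Kernighan-Lin, not the full algorithm and not the partitioner's code. The cost used here (number of edges crossing partitions) and the names `cut_cost` and `swap_refine` are assumptions made for illustration.

```python
from itertools import product


def cut_cost(edges, part):
    """Assumed cost: number of edges whose endpoints sit in different partitions."""
    return sum(1 for u, v in edges if part[u] != part[v])


def swap_refine(edges, part):
    """Swap node pairs across partitions while a swap lowers the cost."""
    part = dict(part)                 # do not mutate the caller's partition
    best = cut_cost(edges, part)
    improved = True
    while improved:
        improved = False
        nodes = sorted(part)
        for u, v in product(nodes, nodes):
            if part[u] == part[v]:
                continue              # only swap across partitions
            part[u], part[v] = part[v], part[u]
            cost = cut_cost(edges, part)
            if cost < best:
                best = cost           # keep the swap
                improved = True
            else:
                part[u], part[v] = part[v], part[u]  # undo the swap
    return part, best


edges = [("a", "b"), ("c", "d")]
initial = {"a": 0, "c": 0, "b": 1, "d": 1}   # stand-in for an initial partition
refined, cost = swap_refine(edges, initial)
```

Swapping pairs rather than moving single nodes keeps the partition sizes unchanged, and the loop terminates because the cost strictly decreases with every accepted swap.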


73 Commits