pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-06 12:20:52 +01:00

Author	SHA1	Message	Date
Edward Yang	173f224570	Turn on F401: Unused import warning. (#18598 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/18598 ghimport-source-id: c74597e5e7437e94a43c163cee0639b20d0d0c6a Stack from [ghstack](https://github.com/ezyang/ghstack): * #18598 Turn on F401: Unused import warning. This was requested by someone at Facebook; this lint is turned on for Facebook by default. "Sure, why not." I had to noqa a number of imports in __init__. Hypothetically we're supposed to use __all__ in this case, but I was too lazy to fix it. Left for future work. Be careful! flake8-2 and flake8-3 behave differently with respect to import resolution for # type: comments. flake8-3 will report an import unused; flake8-2 will not. For now, I just noqa'd all these sites. All the changes were done by hand. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Differential Revision: D14687478 fbshipit-source-id: 30d532381e914091aadfa0d2a5a89404819663e3	2019-03-30 09:01:17 -07:00
Lu Fang	18b31b73fb	Retain the parameter names in ONNX exporter (#17551 ) Summary: So, we will keep the names of ONNX initializers the same as the names in PyTorch state dict. Later, we will make this as the default behavior. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17551 Reviewed By: dzhulgakov Differential Revision: D14491920 Pulled By: houseroad fbshipit-source-id: f355c02e1b90d7ebbebf4be7c0fb6ae208ec795f	2019-03-20 12:11:23 -07:00
Lara Haidar-Ahmad	001cffed9d	ONNX Export IsNan op Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17698 Reviewed By: zrphercule Differential Revision: D14470646 Pulled By: houseroad fbshipit-source-id: d3e6adc83c4f9fa288c5fe0ae4c6af71fdd47905	2019-03-15 12:19:03 -07:00
Lu Fang	1043ff6d68	Set the default ONNX opset to the latest stable opset (i.e., 9) (#17736 ) Summary: 1) The changes in the new opset won't affect internal pipeline. 2) The CI won't be affected by the ONNX changes. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17736 Reviewed By: zrphercule Differential Revision: D14358710 Pulled By: houseroad fbshipit-source-id: 4ef15d2246b50f6875ee215ce37ecf92d555ca6a	2019-03-07 10:56:06 -08:00
Lara Haidar-Ahmad	3f94fc4862	ONNX Export for Max and Average Pooling in CEIL_MODE Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16769 Differential Revision: D14362175 Pulled By: houseroad fbshipit-source-id: 65cfb1dfba6a43d39cc85374add368fe8e4e5645	2019-03-07 10:10:21 -08:00
Lara Haidar	3dba1285ab	ONNX Export Narrow op Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17550 Differential Revision: D14350401 Pulled By: houseroad fbshipit-source-id: 4d88079bb7a8bbd270b0272009826eb3b202cc33	2019-03-06 22:37:58 -08:00
Lara Haidar-Ahmad	073634612f	ONNX Export Argmin and Argmax ops Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17382 Differential Revision: D14338811 Pulled By: houseroad fbshipit-source-id: be07548d8063d1aa94f1801c18137738365b85fb	2019-03-06 12:11:47 -08:00
Spandan Tiwari	c658d9b21b	Temporarily disable Upsample operator tests in pytorch-onnx tests (#17696 ) Summary: In discussion with houseroad, because Upsample op is being updated in ONNX https://github.com/onnx/onnx/pull/1773 and these tests are blocking it. These tests will be updated once the ONNX PR goes in. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17696 Differential Revision: D14338845 Pulled By: houseroad fbshipit-source-id: cfaf8cf1ab578ae69dd3bf21b1c0681b572b9b6f	2019-03-06 11:25:34 -08:00
Lu Fang	b0c18570ca	add the support for stable ONNX opsets in exporter (#16068 ) Summary: Still wip, need more tests and correct handling for opset 8 in symbolics. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16068 Reviewed By: zrphercule Differential Revision: D14185855 Pulled By: houseroad fbshipit-source-id: 55200be810c88317c6e80a46bdbeb22e0b6e5f9e	2019-02-22 12:05:17 -08:00
Lara Haidar	b8d1f4a423	ONNX Export Maxpool Indices Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16455 Differential Revision: D14140375 Pulled By: houseroad fbshipit-source-id: 12d02c447e7fe0fae49969d1daf40a87660ed416	2019-02-19 21:10:14 -08:00
BowenBao	19addc7eb0	Support nonzero onnx export Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/17036 Differential Revision: D14079676 Pulled By: houseroad fbshipit-source-id: 562b538dd9ab330c26f15fdb34c98dc7a23571a1	2019-02-13 23:52:42 -08:00
Edward Yang	d7e6f9b5a7	Revert D14020906: [pytorch][PR] Extend support for exporting reshape to onnx. Differential Revision: D14020906 Original commit changeset: 168616873044 fbshipit-source-id: 2730bb6990d41f3a9cef6625ea919c219733433d	2019-02-11 06:08:55 -08:00
BowenBao	4335aac6e6	Extend support for exporting reshape to onnx. Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16632 Differential Revision: D14020906 Pulled By: ezyang fbshipit-source-id: 168616873044b980145a3554dab942bdec19efb2	2019-02-10 20:19:35 -08:00
Lara Haidar-Ahmad	dff8165d04	ONNX Export Flatten operator Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16240 Reviewed By: bddppq Differential Revision: D13800025 Pulled By: houseroad fbshipit-source-id: ae4c5e42026477b28ffd44eda2438d93936ea510	2019-01-30 11:05:00 -08:00
Lu Fang	21907b6ba2	Fix the dropout onnx symbolic, and ensure all exported models in test_operators.py are eval mode (#16547 ) Summary: In eval mode, skip dropout operator in onnx exporter. Pull Request resolved: https://github.com/pytorch/pytorch/pull/16547 Reviewed By: houseroad Differential Revision: D13877136 Pulled By: dzhulgakov fbshipit-source-id: c366da156f83677bcf4989b79166aae5b6c36125	2019-01-30 01:16:21 -08:00
Wanchao Liang	c6503a4205	Revert D13540278: [pytorch][PR] Unhide unique from C++, make unique partially scriptable Differential Revision: D13540278 Original commit changeset: 3768c76a90b0 fbshipit-source-id: 7a31c239f9dca6ff467344d99820095addcae9d7	2019-01-22 12:22:40 -08:00
Xiang Gao	bed7db7772	Unhide unique from C++, make unique partially scriptable (#15256 ) Summary: This PR does three things: ~~Allow `int64_t?` in function schema, which provide an elegant way of implementing null-able int arguments, as discussed in https://github.com/pytorch/pytorch/pull/15208#pullrequestreview-185230081~~ ~~Originally implemented in https://github.com/pytorch/pytorch/pull/15235~~ ~~Example:~~ ```yaml - func: myop(Tensor self, int64_t? dim=None) -> Tensor variants: function ``` ~~cc: zou3519~~ Edit: implemented in https://github.com/pytorch/pytorch/pull/15234 Previously tried in https://github.com/pytorch/pytorch/pull/12064. There was a problem that C++ does not have kwarg support, which makes it confusing to know whether `unique(t, 1)` actually means `unique(t, dim=1)` or `unique(t, sorted=1)`. Now I think I have a better idea on how to implement this: there are two ATen operators: `unique` and `unique_dim`. `unique` has the same signature as in python, and exported to both python and C++. `unique_dim` has signature `unique_dim(tensor, dim, sorted=False, return_inverse=False)`, and only exported to C++, which could be used more naturally for a C++ user. Differential Revision: D13540278 Pulled By: wanchaol fbshipit-source-id: 3768c76a90b0881f565a1f890459ebccbdfe6ecd	2019-01-21 12:31:37 -08:00
bddppq	1a09a2a27f	Export PyTorch erf to ONNX Erf and add Caffe2 Erf operator Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16106 Differential Revision: D13709490 Pulled By: bddppq fbshipit-source-id: 1b5b32261f06543371f7bd7ac9b11957a5eb4ad0	2019-01-17 09:18:08 -08:00
Jesse Hellemn	8964a2e6e6	Split Caffe2 CI into cmake-only and python builds (#15917 ) Summary: bypass-lint - Change all Caffe2 builds to use setup.py instead of cmake - Add a -cmake- Caffe2 build configuration that uses cmake and only builds cpp - Move skipIfCI logic from onnx test scripts to the rest of CI logic - Removal of old PYTHONPATH/LD_LIBRARY_PATH/etc. env management Pull Request resolved: https://github.com/pytorch/pytorch/pull/15917 Reviewed By: orionr Differential Revision: D13637583 Pulled By: pjh5 fbshipit-source-id: c5c5639db0251ba12b6e4b51b2ac3b26a8953153	2019-01-14 15:20:44 -08:00
BowenBao	24867a58aa	Add support for exporting onnx split (#15092 ) Summary: * With the update of split output to dynamic list it breaks the export to onnx. Now split ir becomes two ops: 1. Dynamic[] <= Split(), and 2. out1, out2, out3 <= Prim::ListUnpack. In this fix these two consecutive ops get fused when being exported to onnx. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15092 Reviewed By: dzhulgakov Differential Revision: D13583832 Pulled By: houseroad fbshipit-source-id: 3eb18c871e750921ad6d5cc179254bee9bcf4c99	2019-01-07 16:09:24 -08:00
zrphercule	43761e01f5	Temporarily disable all XXXlike operator tests in pytorch-onnx test (#15740 ) Summary: We are going to have some breaking changes in ConstantLike and related operators in onnx, therefore it is better to disable all related tests for these operators for now. These operators are not currently supported by caffe2, and are not included in our most recently released onnx, therefore we do not need to worry about internal/external production breaking. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15740 Differential Revision: D13582528 Pulled By: zrphercule fbshipit-source-id: 92a890c1dc2a833969af69edfea85331bb4d562f	2019-01-04 13:36:09 -08:00
Lu Fang	f4c504593c	Fix the (reduce)min and (reduce)max ONNX exporting (#15241 ) Summary: max and reducemax are smashed together, we need to support one input case. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15241 Reviewed By: yinghai Differential Revision: D13473312 Pulled By: houseroad fbshipit-source-id: 9b8c847286a2631b006ca900271bc0d26574101a	2018-12-18 11:48:06 -08:00
Spandan Tiwari	700271d0e9	Adding ONNX export for torch.expand and torch.ne (#15050 ) Summary: `torch.expand` and `torch.ne` are used often in models and this PR adds ONNX export support for them. ArmenAg has created issue https://github.com/pytorch/pytorch/issues/10882 for this. Pull Request resolved: https://github.com/pytorch/pytorch/pull/15050 Differential Revision: D13453036 Pulled By: houseroad fbshipit-source-id: 4724b4ffcebda6cd6b2acac51d6733cb27318daf	2018-12-17 13:48:14 -08:00
Spandan Tiwari	fa12e1e4d4	Export ones_like, zeros_like and full_like using ONNX ConstantLike op. (#14903 ) Summary: This PR does the following: 1) Updates the ONNX export for `torch.zeros_like` and `torch.full_like` ops to use ONNX op `ConstantLike`. This reduces the export of experimental op `ConstantFill`, which may possibly be removed in future, see https://github.com/onnx/onnx/pull/1434). 2) It also adds export support for `torch.ones_like`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14903 Differential Revision: D13383700 Pulled By: houseroad fbshipit-source-id: 566d00a943e9497172fcd5a034b638a650ab13a2	2018-12-08 22:49:02 -08:00
zrphercule	ddca0442b6	Add LogSigmoid support in ONNX symbolic (#14830 ) Summary: Add LogSigmoid: torch.LogSigmoid(x) = onnx.Log(onnx.Sigmoid(x)) Pull Request resolved: https://github.com/pytorch/pytorch/pull/14830 Differential Revision: D13353891 Pulled By: zrphercule fbshipit-source-id: bf456170b9e6c4edad07b3333cd5797f8e0fa97f	2018-12-06 14:17:33 -08:00
Lu Fang	6fccca4278	improve ONNX tests on torch.Linear Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14821 Reviewed By: zrphercule Differential Revision: D13348773 Pulled By: houseroad fbshipit-source-id: 611ca6e28f715e5518649c8c16f702ac3433308c	2018-12-05 17:07:10 -08:00
zrphercule	e114527d19	Add torch.nn.RReLU support in symbolic (#14781 ) Summary: Now we support exporting torch.nn.RReLU in onnx. Pull Request resolved: https://github.com/pytorch/pytorch/pull/14781 Reviewed By: houseroad Differential Revision: D13343872 Pulled By: zrphercule fbshipit-source-id: 1e96b957de4fc2f5ba3959d42329807975419ae3	2018-12-05 13:10:07 -08:00
James Reed	85bde3801b	Tracer now records Python variable names (#13441 ) Summary: This is probably slow but it should make the traces more understandable and make debugging easier. Any suggestions for how to make it faster (i.e. make it so we don't have to traverse all of locals() and globals()) would be appreciated Pull Request resolved: https://github.com/pytorch/pytorch/pull/13441 Differential Revision: D12879763 Pulled By: jamesr66a fbshipit-source-id: b84133dc2ef9ca6cfbfaf2e3f9106784cc42951e	2018-11-08 13:08:42 -08:00
zrphercule	02d3787a19	Support new upsample in symbolic, caffe2 backend & caffe2 frontend (#13272 ) Summary: We updated the description of upsample_op in onnx: https://github.com/onnx/onnx/pull/1467 Therefore, we need to support the new upsample_op in caffe2-onnx backend as well. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13272 Reviewed By: houseroad Differential Revision: D12833656 Pulled By: zrphercule fbshipit-source-id: 21af5282abaae12d2d044e4018a2b152aff79917	2018-11-05 19:13:57 -08:00
Wanchao Liang	af4a228426	Fix erase_number_type pass, negative indices in c2 and some onnx symbolics (#12888 ) Summary: The PR did two things: 1. fix the bug in erase_number_type on node inputs 2. handle negative indices for dim-reduce in caffe2 Pull Request resolved: https://github.com/pytorch/pytorch/pull/12888 Reviewed By: houseroad Differential Revision: D12833486 Pulled By: wanchaol fbshipit-source-id: c3ceb400d91f0173b73ad95e392b010c3c14db7d	2018-11-05 19:13:49 -08:00
Rui Zhu	479b8266bf	Back out "[pytorch][PR] Support upsample" (#13413 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/13413 Original commit changeset: d5db200365f1 Reviewed By: houseroad Differential Revision: D12870356 fbshipit-source-id: be115d2370636786901c822895664ccace2a9bc2	2018-10-31 15:51:41 -07:00
zrphercule	bb96b6635c	Support upsample (#13152 ) Summary: This will enable the updated attribute and input format of operator upsample. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13152 Reviewed By: houseroad Differential Revision: D12812491 Pulled By: zrphercule fbshipit-source-id: d5db200365f1ab2bd1f052667795841d7ee6beb3	2018-10-29 10:40:35 -07:00
zrphercule	5cbb33f939	Disable upsample optest (#13135 ) Summary: Temporarily disable upsample tests. Pull Request resolved: https://github.com/pytorch/pytorch/pull/13135 Reviewed By: bddppq Differential Revision: D10859926 Pulled By: houseroad fbshipit-source-id: 9eb068198d43ba0939d81a9e41eb6f24ff19cb6d	2018-10-25 20:37:09 -07:00
zrphercule	c6defa0847	Add randn in onnx symbolic (#12880 ) Summary: In this pr we added operator randn in onnx symbolic. Also, related tests are added. Pull Request resolved: https://github.com/pytorch/pytorch/pull/12880 Reviewed By: houseroad Differential Revision: D10501788 Pulled By: zrphercule fbshipit-source-id: ba8bb00ca848c4b95decabf638a1bc13fe11d03e	2018-10-25 14:11:23 -07:00
James Sun	f4944f0f8a	Rename test/common.py to test/common_utils.py (#12794 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12794 common.py is used in base_module for almost all tests in test/. The name of this file is so common that can easily conflict with other dependencies if they happen to have another common.py in the base module. Rename the file to avoid conflict. Reviewed By: orionr Differential Revision: D10438204 fbshipit-source-id: 6a996c14980722330be0a9fd3a54c20af4b3d380	2018-10-17 23:04:29 -07:00
ArmenAg	d5eae90537	update onnx tests (#12619 ) Summary: Fixes #12586 Pull Request resolved: https://github.com/pytorch/pytorch/pull/12619 Reviewed By: ezyang Differential Revision: D10377548 Pulled By: houseroad fbshipit-source-id: 1166e40aa8b98f1fe015fb1bdb2e90acfad3c356	2018-10-15 11:59:19 -07:00
Adam Paszke	78fe149ab9	Fix ONNX bug, add symbolic for full Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/12052 Differential Revision: D10044910 Pulled By: apaszke fbshipit-source-id: 015ef372966d7594e1b450e348d457429f6ef20d	2018-09-26 11:45:25 -07:00
yya007	b91b15d86e	Implementing Matrix Norm for torch.norm (#11261 ) Summary: Currently, norm function only supports vector norm. This PR extends vector norm to matrix norm. Pull Request resolved: https://github.com/pytorch/pytorch/pull/11261 Reviewed By: li-roy Differential Revision: D9652379 Pulled By: yya007 fbshipit-source-id: 519b3fb80b563c17c56a24675c7b0e46bf5a3a1c	2018-09-20 14:43:13 -07:00
Lu Fang	6831d64591	Fix the symbolic for embedding_bag in ONNX_ATEN_FALLBACK (#11840 ) Summary: The ATen interface was changed. Pull Request resolved: https://github.com/pytorch/pytorch/pull/11840 Reviewed By: BIT-silence Differential Revision: D9932452 Pulled By: houseroad fbshipit-source-id: dd2040fcaa0f6052e5856ee19823cf3064124585	2018-09-19 17:40:39 -07:00
Lu Fang	f866574afc	Fix the batchnorm onnx exporting when affine=False Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11249 Reviewed By: Ac2zoom Differential Revision: D9652526 Pulled By: houseroad fbshipit-source-id: 12a9038beddd227a2f9e2178edf4e8d623488c3e	2018-09-05 11:10:25 -07:00
Lu Fang	1b7172a2b9	fix the slice onnx exporting Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/11117 Reviewed By: MisterTea Differential Revision: D9597870 Pulled By: houseroad fbshipit-source-id: 3a2a307ee327397939bedb9150f780682e18a89a	2018-08-31 17:40:03 -07:00
Lu Fang	562fc7631f	Add test cases for ONNX unsqueeze (#10924 ) Summary: PyTorch exporting test and end to end cases. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10924 Reviewed By: Ac2zoom Differential Revision: D9548210 Pulled By: houseroad fbshipit-source-id: 2381d1ad92a4e07f97060eb65c9fd09f60ad3de6	2018-08-29 11:10:21 -07:00
Lu Fang	a5d7abedae	Enable fusing aten::expand on GT, LT, EQ (#10845 ) Summary: GT, LT, EQ all support numpy broadcasting, just enable the fusion. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10845 Reviewed By: bddppq Differential Revision: D9494089 Pulled By: houseroad fbshipit-source-id: 7c65ca06c54dbd476ac7d07b47a413faaed3dd5e	2018-08-28 23:56:50 -07:00
Lu Fang	5ed62ea6fa	Add Upsample example for torch onnx exporting Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/10550 Reviewed By: orionr Differential Revision: D9541932 Pulled By: houseroad fbshipit-source-id: 4d179d189c176482ae919e5cc74607b9d315ed26	2018-08-28 11:39:55 -07:00
Xiang Gao	83066e9b30	Add trigonometry functions for ONNX export (#7540 ) Summary: Trigonometry functions are newly added to ONNX in a recent PR https://github.com/onnx/onnx/pull/869 This PR makes pytorch support exporting graphs with trigonometry functions. This PR might need to wait until it is ready to change ```python _onnx_opset_version = 6 ``` to ```python _onnx_opset_version = 7 ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/7540 Differential Revision: D9395041 Pulled By: bddppq fbshipit-source-id: bdf3e9d212b911c8c4eacf5a0753bb092e4748d2	2018-08-19 23:01:28 -07:00
Lu Fang	bdb11e716a	Split the dependence of ONNX from test_operators.py (#10151 ) Summary: Now, run `python test/onnx/test_operators.py --no-onnx`, we won't introduce any onnx python dependence. (No onnx/protobuf python packages needs to be installed) The major changes: - output pbtxt from C++ exporter directly, so the floating format may be slightly different. (This should be fine, since it's just to guard ONNX exporting.) - ONNX python packages are only imported if we run the ONNX related checks. Those checks are disabled when using `--no-onnx` flag. Pull Request resolved: https://github.com/pytorch/pytorch/pull/10151 Reviewed By: jamesr66a Differential Revision: D9130706 Pulled By: houseroad fbshipit-source-id: ea28cf5db8399929179698ee535137f209e9ce6f	2018-08-14 12:54:44 -07:00
Xiang Gao	6fc75eadf0	Add CELU activation to pytorch (#8551 ) Summary: Also fuse input scale multiplication into ELU Paper: https://arxiv.org/pdf/1704.07483.pdf Pull Request resolved: https://github.com/pytorch/pytorch/pull/8551 Differential Revision: D9088477 Pulled By: SsnL fbshipit-source-id: 877771bee251b27154058f2b67d747c9812c696b	2018-08-01 07:54:44 -07:00
Lu Fang	ee827f6ba3	Fix a testcase in logsoftmax onnx export (#9660 ) Summary: We only support special case. The original dim is not supported by ONNX. Pull Request resolved: https://github.com/pytorch/pytorch/pull/9660 Reviewed By: bddppq Differential Revision: D8965507 Pulled By: houseroad fbshipit-source-id: 021dffdf0489c2d3a50bfd1e0c4cfd00d4a3d776	2018-07-27 17:54:32 -07:00
Sam Gross	829d763c69	Implement add, sub, mul, div using TensorIterator (#8919 ) Summary: ``` This adds TensorIterator, a helper class for computing element-wise operations that's intended to replace the CPU and CUDA apply utils functions. CPU kernels are implemented as functions that operate on strided 1-d tensors compared to CPUApplyUtils which operated individual elements. This allows the kernels to handle vectorization, while TensorIterator handles parallelization and non-coalesced dimensions. GPU kernels continue to operate on elements, but the number of specializations is reduced. The contiguous case remains the same. The non-contiguous case uses a single (reduced) shape for all operands and the fast integer division from THCIntegerDivider. To avoid extra specializations for indexing with 64-bits, large operations are split into smaller operations that can be indexed with 32-bits. Major semantic changes: - No more s_add, s_mul, s_div, or s_sub. Broadcasting is handled by TensorIterator. The autograd engine performs the reduction assuming standard broadcasting if the gradient shape does not match the expected shape. Functions that do not use standard broadcasting rules should either continue to trace the expand calls or handle the reduction in their derivative formula. - Use ONNX v7, which supports broadcasting ops. Performance impact: - Small increased fixed overhead (~0.5 us) - Larger overhead for wrapped numbers (~2.5 us) - No significant change for ops on contiguous tensors - Much faster worst-case performance for non-contiguous GPU tensors - Faster CPU bias addition (~2x) - Faster GPU bias addition (~30% faster) Future work: - Decrease overhead, especially for wrapping numbers in Tensors - Handle general inter-type operations - Extend to unary ops and reductions - Use buffering for compute-bound operations on non-contiguous tensors (pull in from CPUApplyUtils) ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/8919 Differential Revision: D8677600 Pulled By: colesbury fbshipit-source-id: 61bc9cc2a36931dfd00eb7153501003fe0584afd	2018-07-27 14:43:24 -07:00
Adam Paszke	aa7af94656	Make JIT tracing a thread-local property (#9414 ) Summary: As in the title. Lets us simplify a lot of code. Depends on #9363, so please review only the last commit. zdevito Pull Request resolved: https://github.com/pytorch/pytorch/pull/9414 Reviewed By: zdevito Differential Revision: D8836496 Pulled By: apaszke fbshipit-source-id: 9b3c3d1f001a9dc522f8478abc005b6b86cfa3e3	2018-07-19 19:09:39 -07:00

1 2

56 Commits