Commit Graph

1962 Commits

soulitzer
e60f8f4f60 Improve autograd custom function docs (#81340)
Fixes https://github.com/pytorch/pytorch/issues/81223

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81340
Approved by: https://github.com/albanD
2022-07-21 19:54:30 +00:00
Khaled Zaouk
2fb2740ef9 corrects typo in quantization docs (#81687)
Fixes #81686

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81687
Approved by: https://github.com/jerryzh168
2022-07-21 00:17:13 +00:00
Joel Schlosser
8573da59c3 Re-enable C++ doc generation (#81719)
Reverts #80451, as this caused problems reported by many internal and external users. The generated C++ docs are used, even if they are lacking in human-generated content.

Fixes #80505
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81719
Approved by: https://github.com/kit1980, https://github.com/albanD
2022-07-20 19:54:47 +00:00
Adam J. Stewart
92c6690b9c Fix linspace dtype replacement in docs (#81371)
Fixes #81370

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81371
Approved by: https://github.com/ngimel
2022-07-20 13:06:16 +00:00
titaiwang
69608fc598 [ONNX] remove outdated ImplicitCastType QA in onnx.rst (#81268)
Extend work from: https://github.com/pytorch/pytorch/pull/80596
This PR removes the outdated ImplicitCastType Q&A, as coverage has greatly increased with the introduction of ONNX shape inference and scalar type analysis.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81268
Approved by: https://github.com/justinchuby, https://github.com/BowenBao
2022-07-15 16:18:26 +00:00
Danielle Pintz
8926b5b9c2 Fix typos in docs: Profiler and CUDA semantics (#80406)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80406
Approved by: https://github.com/robieta
2022-07-13 18:53:02 +00:00
Jing Xu
3c7044728b Enable Intel® VTune™ Profiler's Instrumentation and Tracing Technology APIs (ITT) to PyTorch (#63289)
A more detailed description of the benefits can be found in #41001. This is Intel's counterpart of NVIDIA's NVTX (https://pytorch.org/docs/stable/autograd.html#torch.autograd.profiler.emit_nvtx).

ITT is a functionality for labeling trace data during application execution across different Intel tools.
For integrating Intel(R) VTune Profiler into Kineto, ITT needs to be integrated into PyTorch first. It works with both the standalone VTune Profiler (https://www.intel.com/content/www/us/en/developer/tools/oneapi/vtune-profiler.html) and, in the future, Kineto-integrated VTune functionality.
It works for both Intel CPU and Intel XPU devices.

Pitch
Add VTune Profiler's ITT API calls to annotate PyTorch ops, as well as developer-customized code scopes, on CPU, analogous to NVTX for NVIDIA GPUs.

This PR rebases the code changes at https://github.com/pytorch/pytorch/pull/61335 to the latest master branch.

Usage example:
```
import torch

# `model` and `input` are assumed to be defined by the user
with torch.autograd.profiler.emit_itt():
    for i in range(10):
        torch.itt.range_push('step_{}'.format(i))
        model(input)
        torch.itt.range_pop()
```

cc @ilia-cher @robieta @chaekit @gdankel @bitfort @ngimel @orionr @nbcsm @guotuofeng @guyang3532 @gaoteng-git
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63289
Approved by: https://github.com/malfet
2022-07-13 13:50:15 +00:00
vspenubarthi
3b00b17f64 [docs] Updated quantization docs to show per channel support for conv1d (#81349)
Summary: Conv1d currently has per-channel quantization support, but the quantization
documentation did not highlight this when discussing which modules have
per-channel quantization support. This adds Conv1d to that list, with evidence
reproducible through the test plan below.

Test Plan:
```
import torch
from torch.ao.quantization import QConfigMapping
from torch.ao.quantization import quantize_fx


class SingleLayerModel(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1d = torch.nn.Conv1d(5, 5, 1).to(dtype=torch.float)

    def forward(self, x):
        x = self.conv1d(x)
        return x

    def get_example_inputs(self):
        return (torch.rand(5, 5, 1),)

torch.backends.quantized.engine = "fbgemm"
model = SingleLayerModel()
example_inputs = model.get_example_inputs()  # prepare_fx expects a tuple of inputs
q_config_mapping = QConfigMapping()
q_config_mapping.set_global(torch.ao.quantization.get_default_qconfig(torch.backends.quantized.engine))

prepared = quantize_fx.prepare_fx(model, q_config_mapping, example_inputs)
# prints the observer class attached to the conv1d weight
print(prepared.conv1d.qconfig.weight.p.func)
```
Running the code above prints that the Conv1d weight uses a
PerChannelMinMaxObserver. To show that this does not hold for every module,
replace the Conv1d with a ConvTranspose1d and run the same code: an error is
thrown about lack of per-channel support.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81349
Approved by: https://github.com/andrewor14
2022-07-12 23:36:37 +00:00
lezcano
e505796a2c [Array API] Add linalg.vecdot (#70542)
This PR adds the function `linalg.vecdot` specified by the [Array
API](https://data-apis.org/array-api/latest/API_specification/linear_algebra_functions.html#function-vecdot)

For the complex case, it chooses to implement \sum x_i y_i. See the
discussion in https://github.com/data-apis/array-api/issues/356
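
A minimal usage sketch on real inputs (names and shapes are illustrative; for complex inputs the convention follows the discussion linked above):

```python
import torch

x = torch.randn(4, 3)
y = torch.randn(4, 3)

# reduces over the last dimension by default: one dot product per batch entry
out = torch.linalg.vecdot(x, y)
assert torch.allclose(out, (x * y).sum(dim=-1))
```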

Edit: when it comes to testing, this function is not quite a binary op, nor a reduction op. As such, we're this close to being able to get the extra testing, but we don't quite make it. It's such a simple op, though, that I think we'll manage without it.

Resolves https://github.com/pytorch/pytorch/issues/18027.

cc @mruberry @rgommers @pmeier @asmeurer @leofang @AnirudhDagar @asi1024 @emcastillo @kmaehashi
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70542
Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry
2022-07-12 14:28:54 +00:00
vitrioil
747b3b311d Fix links in torch.testing docs (#80353)
Fixes #79266

Pull Request resolved: https://github.com/pytorch/pytorch/pull/80353
Approved by: https://github.com/mruberry
2022-07-11 19:15:53 +00:00
albanD
a879cb5865 Update poi based on recent activity (#81097)
cc @Lezcano
Pull Request resolved: https://github.com/pytorch/pytorch/pull/81097
Approved by: https://github.com/Lezcano, https://github.com/b0noI
2022-07-09 14:39:34 +00:00
Zafar
68ec793cfd [ao] Moving the sparsity/experimental to sparsity/_experimental (#81149)
The experimental code under sparsity does not have a user-facing API and
should reside under the private package. This involves the pruner and
base_sparsifier.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81149
Approved by: https://github.com/macandro96
2022-07-09 03:00:11 +00:00
PyTorch MergeBot
39f659c3ba Revert "[Array API] Add linalg.vecdot (#70542)"
This reverts commit 74208a9c68.

Reverted https://github.com/pytorch/pytorch/pull/70542 on behalf of https://github.com/malfet due to Broke CUDA-10.2 for vecdot_bfloat16, see 74208a9c68
2022-07-08 22:56:51 +00:00
Sherlock Huang
fc10a63727 Prims+NvFuser Backend Prototype (#80591)
This PR integrates FX graph partitioner + Aten2Prims DecompositionInterpreter + Prims' TraceExecutor + naive caches for nvFuser.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80591
Approved by: https://github.com/jjsjann123, https://github.com/ezyang
2022-07-08 19:53:03 +00:00
lezcano
74208a9c68 [Array API] Add linalg.vecdot (#70542)
This PR adds the function `linalg.vecdot` specified by the [Array
API](https://data-apis.org/array-api/latest/API_specification/linear_algebra_functions.html#function-vecdot)

For the complex case, it chooses to implement \sum x_i y_i. See the
discussion in https://github.com/data-apis/array-api/issues/356

Edit: when it comes to testing, this function is not quite a binary op, nor a reduction op. As such, we're this close to being able to get the extra testing, but we don't quite make it. It's such a simple op, though, that I think we'll manage without it.

Resolves https://github.com/pytorch/pytorch/issues/18027.

cc @mruberry @rgommers @pmeier @asmeurer @leofang @AnirudhDagar @asi1024 @emcastillo @kmaehashi
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70542
Approved by: https://github.com/IvanYashchuk, https://github.com/mruberry
2022-07-08 15:37:58 +00:00
jjsjann123
d2c726d43c torch.jit doc link for nvfuser readme.md (#77780)
Adds a quick link to the nvFuser README.md in the torch.jit doc.

Note that for the 1.12 release we probably want the link to point to the doc in the release code base. I don't know if we have a tag for the 1.12 release candidate yet, so we might want to update that.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/77780
Approved by: https://github.com/davidberard98
2022-07-07 23:25:35 +00:00
Eddie Yan
ae6dd20ba7 [cuDNN V8 API] (reopen 2) Allow the number of kernels profiled under torch.backends.cudnn.benchmark = True to be limited (#78299)
Reopen of #77002 to address comments by @malfet
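
A rough sketch of the knob this adds, assuming the `torch.backends.cudnn.benchmark_limit` attribute introduced here (on a CUDA build with cuDNN v8):

```python
import torch

torch.backends.cudnn.benchmark = True
# cap how many cuDNN v8 engine configurations are profiled per layer;
# 0 means profile all available engines
torch.backends.cudnn.benchmark_limit = 10
```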

CC @ngimel @ptrblck
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78299
Approved by: https://github.com/ngimel
2022-07-07 23:25:23 +00:00
Christian Puhrsch
c97ff3d51e Update NestedTensor docs (#80963)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80963
Approved by: https://github.com/george-qi
2022-07-07 22:15:39 +00:00
Sahan Paliskara
bd6bea35f8 Update package.rst to not include hermetic claim (#81019)
Summary: Update package.rst to not include the hermeticity claim, as torch.package is not fully hermetic

Test Plan: external CI (docs build)

Differential Revision: D37670779

Pull Request resolved: https://github.com/pytorch/pytorch/pull/81019
Approved by: https://github.com/priyaramani
2022-07-07 18:40:55 +00:00
albanD
6f1d99b79f update nn.init doc to reflect the no_grad (#80882)
Fixes https://github.com/pytorch/pytorch/issues/80839
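
A small illustration of the behavior the doc update describes (a sketch, not code from the PR):

```python
import torch
import torch.nn as nn

w = torch.empty(3, 3, requires_grad=True)
# nn.init functions run under torch.no_grad(), so the in-place fill is not
# recorded by autograd and w remains a leaf tensor
nn.init.xavier_uniform_(w)
print(w.requires_grad, w.is_leaf)  # True True
```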

Pull Request resolved: https://github.com/pytorch/pytorch/pull/80882
Approved by: https://github.com/jbschlosser
2022-07-07 17:19:29 +00:00
lezcano
19f3d4d795 Expose linalg.solve_ex (#80073)
This prepares for making `linalg.inv_ex` just a call into this function
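
A quick sketch of the calling convention (illustrative values; the `info` tensor reports factorization status instead of raising):

```python
import torch

A = torch.randn(3, 3, dtype=torch.float64)
b = torch.randn(3, dtype=torch.float64)

x, info = torch.linalg.solve_ex(A, b)
if info.item() == 0:  # 0 means the factorization succeeded
    assert torch.allclose(A @ x, b)
```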
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80073
Approved by: https://github.com/IvanYashchuk, https://github.com/albanD
2022-07-01 16:09:23 +00:00
Andrew M. James
5a4c9e8394 Add spdiags sparse matrix initialization (#78439)
Similar to [scipy.sparse.spdiags](https://docs.scipy.org/doc/scipy/reference/generated/scipy.sparse.spdiags.html#scipy-sparse-spdiags)

Part of #70926

In other functions (e.g. [torch.diagonal](https://pytorch.org/docs/stable/generated/torch.diagonal.html#torch.diagonal)), diagonals of a tensor are referenced using the offset and the two dimensions that the diagonal is taken with respect to.

The reference implementation from scipy only considers matrix output, so we may only support 2-D output at first. Even so, it may be useful to consider how the dimensions corresponding to each diagonal would be specified for higher-dimensional output.

The proposed torch signature implies that all offsets refer to the diagonals with respect to the only two dimensions of the output:

```
torch.sparse.spdiags(Tensor diagonals, IntTensor offsets, int[] shape, Layout? layout=None) -> SparseTensor
```
Above it is required that `diagonals.ndimension() == 2`, `offsets.ndimension() == 1`, `offsets.shape[0] == diagonals.shape[0]`, and `len(shape) == 2`.

This would need to be altered for the case where `len(shape) > 2`. One option is:
```
torch.sparse.spdiags(Tensor[] diagonals, IntTensor[] offsets, IntTensor dims, int[] shape, Layout? layout=None) -> SparseTensor
```

Here `offsets` and `diagonals` become lists of tensors, and the `IntTensor dims` argument is introduced. This would require that `len(diagonals) == len(offsets) == dims.shape[0]`, `dims.ndimension() == 2`, and `dims.shape[1] == 2`; the same restrictions as in the 2-D case above also apply to the elements of `diagonals` and `offsets` pairwise (that is, `diagonals[i].ndimension() == 2`, `offsets[i].ndimension() == 1`, and `offsets[i].shape[0] == diagonals[i].shape[0]` for all i). This form of the signature would construct the sparse result by placing the values from `diagonals[i][j]` into the diagonal with offset `offsets[i][j]` taken with respect to dimensions `dims[i]`. The specialization back to the original signature for the 2-D case could be seen as allowing the single row of `dims` to default to `[0, 1]` when only one `diagonals`/`offsets` pair is provided and the shape is 2-D. This option allows the rows of an input element `diagonals[i]` to have different lengths, which may be appropriate, as the maximum length of a diagonal along different dimension pairs will differ.

Another option is to specify the dimensions the diagonal is taken with respect to for each offset. This signature would look like:

```
torch.sparse.spdiags(Tensor diagonals, IntTensor offsets, IntTensor dims, int[] shape, Layout? layout=None) -> SparseTensor
```
Here, `diagonals` is still 2-D with dimension 0 matching the length of the 1-D `offsets`, and the tensor input `dims` is also 2-D with dimension 0 matching the length of `offsets` and with its second dimension fixed at `2`. In this case the sparse result is constructed by placing the elements from `diagonals[i]` into the output diagonal `output.diagonal(offsets[i], dim0=dims[i][0], dim1=dims[i][1])` (with some additional consideration that makes it more complicated than simply assigning to that view). The specialization from this back to the 2-D form could be seen as assuming `dims = [[0, 1], [0, 1], ... len(offsets) times]` when `len(shape) == 2`.

In both proposed signatures for the N-D case, the specialization back to the 2-D signature is a bit of a stretch for the typical default-arguments logic; however, I think the first is the better choice, as it offers more flexibility.

I think some discussion is required about:
- [x] Should the N-D output case be implemented from the outset
- [x] If not, should the future addition of the N-D output case be considered when designing the interface.
- [x] Other thoughts on the signature which includes the `dims` information for the N-D output case.

**Resolution**: Since no one has requested N-D output support, I think it is fine to restrict this to sparse matrix generation. Should a request for N-D support come later, an overload accepting the additional `dims` could be added.
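
A minimal sketch of the 2-D signature described above (values are illustrative only):

```python
import torch

diagonals = torch.arange(9, dtype=torch.float64).reshape(3, 3)
offsets = torch.tensor([0, 1, -1])  # one (unique) offset per row of `diagonals`
S = torch.sparse.spdiags(diagonals, offsets, (3, 3))
print(S.to_dense())
```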

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78439
Approved by: https://github.com/nikitaved, https://github.com/cpuhrsch, https://github.com/pearu
2022-07-01 01:11:54 +00:00
PyTorch MergeBot
56e3bc5215 Revert "Add spdiags sparse matrix initialization (#78439)"
This reverts commit cfb2034b65.

Reverted https://github.com/pytorch/pytorch/pull/78439 on behalf of https://github.com/suo due to broke windows builds, see: cfb2034b65
2022-06-30 21:04:36 +00:00
Andrew M. James
cfb2034b65 Add spdiags sparse matrix initialization (#78439)
Similar to [scipy.sparse.spdiags](https://docs.scipy.org/doc/scipy/reference/generated/scipy.sparse.spdiags.html#scipy-sparse-spdiags)

Part of #70926

In other functions (e.g. [torch.diagonal](https://pytorch.org/docs/stable/generated/torch.diagonal.html#torch.diagonal)), diagonals of a tensor are referenced using the offset and the two dimensions that the diagonal is taken with respect to.

The reference implementation from scipy only considers matrix output, so we may only support 2-D output at first. Even so, it may be useful to consider how the dimensions corresponding to each diagonal would be specified for higher-dimensional output.

The proposed torch signature implies that all offsets refer to the diagonals with respect to the only two dimensions of the output:

```
torch.sparse.spdiags(Tensor diagonals, IntTensor offsets, int[] shape, Layout? layout=None) -> SparseTensor
```
Above it is required that `diagonals.ndimension() == 2`, `offsets.ndimension() == 1`, `offsets.shape[0] == diagonals.shape[0]`, and `len(shape) == 2`.

This would need to be altered for the case where `len(shape) > 2`. One option is:
```
torch.sparse.spdiags(Tensor[] diagonals, IntTensor[] offsets, IntTensor dims, int[] shape, Layout? layout=None) -> SparseTensor
```

Here `offsets` and `diagonals` become lists of tensors, and the `IntTensor dims` argument is introduced. This would require that `len(diagonals) == len(offsets) == dims.shape[0]`, `dims.ndimension() == 2`, and `dims.shape[1] == 2`; the same restrictions as in the 2-D case above also apply to the elements of `diagonals` and `offsets` pairwise (that is, `diagonals[i].ndimension() == 2`, `offsets[i].ndimension() == 1`, and `offsets[i].shape[0] == diagonals[i].shape[0]` for all i). This form of the signature would construct the sparse result by placing the values from `diagonals[i][j]` into the diagonal with offset `offsets[i][j]` taken with respect to dimensions `dims[i]`. The specialization back to the original signature for the 2-D case could be seen as allowing the single row of `dims` to default to `[0, 1]` when only one `diagonals`/`offsets` pair is provided and the shape is 2-D. This option allows the rows of an input element `diagonals[i]` to have different lengths, which may be appropriate, as the maximum length of a diagonal along different dimension pairs will differ.

Another option is to specify the dimensions the diagonal is taken with respect to for each offset. This signature would look like:

```
torch.sparse.spdiags(Tensor diagonals, IntTensor offsets, IntTensor dims, int[] shape, Layout? layout=None) -> SparseTensor
```
Here, `diagonals` is still 2-D with dimension 0 matching the length of the 1-D `offsets`, and the tensor input `dims` is also 2-D with dimension 0 matching the length of `offsets` and with its second dimension fixed at `2`. In this case the sparse result is constructed by placing the elements from `diagonals[i]` into the output diagonal `output.diagonal(offsets[i], dim0=dims[i][0], dim1=dims[i][1])` (with some additional consideration that makes it more complicated than simply assigning to that view). The specialization from this back to the 2-D form could be seen as assuming `dims = [[0, 1], [0, 1], ... len(offsets) times]` when `len(shape) == 2`.

In both proposed signatures for the N-D case, the specialization back to the 2-D signature is a bit of a stretch for the typical default-arguments logic; however, I think the first is the better choice, as it offers more flexibility.

I think some discussion is required about:
- [x] Should the N-D output case be implemented from the outset
- [x] If not, should the future addition of the N-D output case be considered when designing the interface.
- [x] Other thoughts on the signature which includes the `dims` information for the N-D output case.

**Resolution**: Since no one has requested N-D output support, I think it is fine to restrict this to sparse matrix generation. Should a request for N-D support come later, an overload accepting the additional `dims` could be added.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/78439
Approved by: https://github.com/nikitaved, https://github.com/cpuhrsch, https://github.com/pearu
2022-06-30 19:54:47 +00:00
Bin Wen
45ae244086 [torch.package][doc] PackageExporter does not have file_structure (#79948)
Summary: Found this issue when testing torch.package. Also found an open issue: https://github.com/pytorch/pytorch/issues/74221. Bootstrapping a fix.

Reviewed By: d4l3k

Differential Revision: D37063748

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79948
Approved by: https://github.com/d4l3k
2022-06-30 19:49:53 +00:00
PyTorch MergeBot
1454515253 Revert "Enable Intel® VTune™ Profiler's Instrumentation and Tracing Technology APIs (ITT) to PyTorch (#63289)"
This reverts commit f988aa2b3f.

Reverted https://github.com/pytorch/pytorch/pull/63289 on behalf of https://github.com/malfet due to broke trunk, see f988aa2b3f
2022-06-30 12:49:41 +00:00
Jing Xu
f988aa2b3f Enable Intel® VTune™ Profiler's Instrumentation and Tracing Technology APIs (ITT) to PyTorch (#63289)
A more detailed description of the benefits can be found in #41001. This is Intel's counterpart of NVIDIA's NVTX (https://pytorch.org/docs/stable/autograd.html#torch.autograd.profiler.emit_nvtx).

ITT is a functionality for labeling trace data during application execution across different Intel tools.
For integrating Intel(R) VTune Profiler into Kineto, ITT needs to be integrated into PyTorch first. It works with both the standalone VTune Profiler (https://www.intel.com/content/www/us/en/developer/tools/oneapi/vtune-profiler.html) and, in the future, Kineto-integrated VTune functionality.
It works for both Intel CPU and Intel XPU devices.

Pitch
Add VTune Profiler's ITT API calls to annotate PyTorch ops, as well as developer-customized code scopes, on CPU, analogous to NVTX for NVIDIA GPUs.

This PR rebases the code changes at https://github.com/pytorch/pytorch/pull/61335 to the latest master branch.

Usage example:
```
import torch

# `model` and `input` are assumed to be defined by the user
with torch.autograd.profiler.emit_itt():
    for i in range(10):
        torch.itt.range_push('step_{}'.format(i))
        model(input)
        torch.itt.range_pop()
```

cc @ilia-cher @robieta @chaekit @gdankel @bitfort @ngimel @orionr @nbcsm @guotuofeng @guyang3532 @gaoteng-git
Pull Request resolved: https://github.com/pytorch/pytorch/pull/63289
Approved by: https://github.com/malfet
2022-06-30 05:14:03 +00:00
Allen Goodman
63ef2a03e5 torch.special.scaled_modified_bessel_k0 (#78900)
```Python
scaled_modified_bessel_k0(input, *, out=None) -> Tensor
```

Scaled modified Bessel function of the second kind of order $0$.
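
Usage follows the usual torch.special pattern (a sketch; the other special-function commits in this log, e.g. scaled_modified_bessel_k1, spherical_bessel_j0, and airy_ai, are called the same way):

```python
import torch

x = torch.tensor([0.5, 1.0, 2.0])
# scaled variant, i.e. exp(x) * K0(x) by the usual convention
y = torch.special.scaled_modified_bessel_k0(x)
```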
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78900
Approved by: https://github.com/mruberry
2022-06-29 14:53:37 +00:00
Joel Benjamin Schlosser
f70bf13c6e Disable doxygen / breathe / exhale generation of C++ API docs (#80451)
Fixes #79992

This PR:
* Removes doxygen / breathe / exhale configuration from the Sphinx config in `source/conf.py` so it no longer runs
* Maintains the human-generated content describing API usage in the various .rst files
    * Exception: `library.rst` is removed, as its main purpose is linking to API docs
* Removes all links to the generated API docs from the human-generated content

The build is nearly instantaneous now and should be much less memory intensive as well.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80451
Approved by: https://github.com/suo
2022-06-28 17:56:41 +00:00
PyTorch MergeBot
602c38ff63 Revert "torch.special.gamma (#78904)"
This reverts commit f563f25efd.

Reverted https://github.com/pytorch/pytorch/pull/78904 on behalf of https://github.com/suo due to This PR appears to have broken mac tests on master f563f25efd
2022-06-28 00:54:22 +00:00
Svetlana Karslioglu
7394de4e1e Add a note on CUDA 11.6 (#80363)
Fixes #79876

Pull Request resolved: https://github.com/pytorch/pytorch/pull/80363
Approved by: https://github.com/atalman
2022-06-27 21:34:24 +00:00
Allen Goodman
ab8797d69b torch.special.spherical_bessel_j0 (#78912)
```Python
spherical_bessel_j0(input, *, out=None) -> Tensor
```

Spherical Bessel function of the first kind of order $0$.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78912
Approved by: https://github.com/mruberry
2022-06-27 20:14:46 +00:00
Allen Goodman
f563f25efd torch.special.gamma (#78904)
```Python
gamma(input, *, out=None) -> Tensor
```

Gamma function $\Gamma\left(\text{input}\right)$.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78904
Approved by: https://github.com/mruberry
2022-06-27 19:36:17 +00:00
migeedz
443db9b58e Introduce Z3 types and utility functions for constraint generation (#80084)
Create Z3 types, in particular dynamic dimensions, a dynamic tensor type, and tensor types up to size 4. Note that for Z3 decidability reasons, we are using uninterpreted functions for tensor types, which means we must explicitly define tensor constructors with a concrete size (for now, up to size 4). We defer lifting this requirement to future work.
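
A minimal z3py sketch of the idea (names, the rank-2 cutoff, and the use of a datatype for dimensions are illustrative assumptions, not the PR's actual encoding):

```python
from z3 import Const, Datatype, DeclareSort, Function, IntSort, Solver

# a dimension is either dynamic or a concrete integer
Dim = Datatype("Dim")
Dim.declare("dyn")
Dim.declare("static", ("value", IntSort()))
Dim = Dim.create()

# tensor types live in an uninterpreted sort, with one constructor per arity
TensorType = DeclareSort("TensorType")
tensor1 = Function("tensor1", Dim, TensorType)
tensor2 = Function("tensor2", Dim, Dim, TensorType)

t = Const("t", TensorType)
s = Solver()
s.add(t == tensor2(Dim.static(3), Dim.dyn))
print(s.check())  # sat
```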
Pull Request resolved: https://github.com/pytorch/pytorch/pull/80084
Approved by: https://github.com/anijain2305
2022-06-25 22:27:33 +00:00
Allen Goodman
b3ca3638be torch.special.scaled_modified_bessel_k1 (#78901)
```Python
scaled_modified_bessel_k1(input, *, out=None) -> Tensor
```

Scaled modified Bessel function of the second kind of order $1$.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78901
Approved by: https://github.com/mruberry
2022-06-24 20:57:38 +00:00
Sherlock Huang
752c06e0e1 FX graph partitioner and fuser (#79439)
This PR introduces two components.

CapabilityBasedPartitioner for FX graphs: given a list of supported operators, this partitioner tries to form the largest subgraphs that contain only supported ops.

Fuser utility: given a list of nodes in FX graph, it lifts them as a sub-GraphModule in the original graph.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79439
Approved by: https://github.com/jjsjann123, https://github.com/davidberard98
2022-06-24 18:49:37 +00:00
HDCharles
0308609b41 [quant] Quantizable documentation (#79957)
Minor documentation entry for the quantizable LSTM and MHA classes.

Due to weird CI issues, the old discussion can be found at: https://github.com/pytorch/pytorch/pull/71191
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79957
Approved by: https://github.com/z-a-f
2022-06-24 16:55:15 +00:00
macandro96
70b7bca423 [ao][sparsity] Base scheduler class for Data Schedulers (#79817)
The BaseDataScheduler is the abstract scheduler class specifically for the
BaseDataSparsifier class. This class controls a specific hyperparameter of
the sparsifier class and varies it across the training process (or across time).

Args:
    data_sparsifier (instance of BaseDataSparsifier)
        The implemented data sparsifier class in which update_mask is implemented
    schedule_param (str)
        A specific hyperparameter of the passed sparsifier that needs to be scheduled/varied
    last_epoch (int, default=-1)
        This is passed specifically when training needs to be resumed from a particular point.
    verbose (bool, default=False)
        Verbosity of the BaseDataScheduler

The *get_schedule_param()* function needs to be implemented by the user.

Test Plan:
```python test/test_ao_sparsity.py TestBaseDataScheduler```

Differential Revision: [D37358608](https://our.internmc.facebook.com/intern/diff/D37358608)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79817
Approved by: https://github.com/jerryzh168, https://github.com/z-a-f
2022-06-24 16:51:52 +00:00
HDCharles
ffdc5eebc7 [ao][docs] tests for quantization docs (#79923)
Summary: per https://github.com/pytorch/pytorch/issues/79135, the code
snippets in the docs don't run. This has been a recurring problem, since
previously there was no unit test to check that these snippets actually ran.
This PR adds support for such a test: it imports each snippet as a string and
evaluates it to make sure that it actually runs. If a snippet relies on
user-defined code, you can pass in dummy versions using global_inputs.
Sometimes the imports of the snippets behave oddly, but you can pass them in
as well, as in test_quantization_doc_custom where nnq is passed in.
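
A hypothetical sketch of the mechanism (not the actual test code; the function name, regex, and indentation handling are assumptions):

```python
import re
from pathlib import Path

def run_doc_snippet(rst_path, global_inputs=None):
    """Run the first indented code block found after a '::' directive in an .rst file."""
    text = Path(rst_path).read_text()
    block = re.search(r"::\n\n((?:(?:    .*)?\n)+)", text).group(1)
    snippet = "\n".join(line[4:] for line in block.splitlines())
    # dummy versions of user-defined names can be injected via global_inputs
    exec(snippet, dict(global_inputs or {}))
```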

Test Plan: python test/test_quantization.py TestQuantizationDocs
also see https://github.com/pytorch/pytorch/pull/79994 to see what shows up in CI when the docs get broken

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79923
Approved by: https://github.com/z-a-f, https://github.com/vspenubarthi
2022-06-23 20:50:31 +00:00
Justin Chu
da33c93169 [ONNX] Clean up onnx_supported_ops (#79424)
- Hide the module from `torch.onnx` public namespace because it is for internal use
- Remove unused variables
- Fix lint errors
- Reformat
- Create `onnx` folder under docs/scripts and add it to the onnx merge rule
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79424
Approved by: https://github.com/thiagocrepaldi, https://github.com/garymm, https://github.com/kit1980, https://github.com/malfet
2022-06-23 20:44:51 +00:00
Allen Goodman
b3308e21bf torch.special.airy_ai (#78902)
```Python
airy_ai(input, *, out=None) -> Tensor
```

Airy function $\text{Ai}\left(\text{input}\right)$.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/78902
Approved by: https://github.com/mruberry, https://github.com/linbinyu, https://github.com/seemethere
2022-06-23 19:33:40 +00:00
Edward Z. Yang
f7ee061638 Wconstab/reland pysymint (#79795)
rebased https://github.com/pytorch/pytorch/pull/79617/ to see if issues are reproducible.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79795
Approved by: https://github.com/malfet
2022-06-20 22:55:06 +00:00
David Berard
8edaf388e5 Fix fx decomposition example
Previously GraphAppendingTracer was appending to the wrong graph.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79807

Approved by: https://github.com/kit1980
2022-06-20 17:26:17 +00:00
eqy
eff74ed7bd [AMP] Use generic autocast in example, specify dtype (#79579)
CC @mruberry @ptrblck
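
The generic form referred to in the title looks roughly like this (tensors and device are placeholders):

```python
import torch

a = torch.randn(8, 8, device="cuda")
b = torch.randn(8, 8, device="cuda")
# device-agnostic autocast with the dtype spelled out, instead of torch.cuda.amp.autocast()
with torch.autocast(device_type="cuda", dtype=torch.float16):
    c = a @ b  # matmul runs in float16 under autocast
```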
Pull Request resolved: https://github.com/pytorch/pytorch/pull/79579
Approved by: https://github.com/mruberry, https://github.com/ngimel
2022-06-17 21:32:51 +00:00
Rhys Goodall
62ba548cac [DOC] Missing line in serialization notes (#79454)
Small typo fix to serialization docs where there was a missing line in one of the examples.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79454
Approved by: https://github.com/mruberry
2022-06-17 18:26:47 +00:00
Orion Reblitz-Richardson
4df76d1df3 Adjust wording for consistency (#79758)
Requested by some of our internal review. @svekars thoughts? Thanks.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79758
Approved by: https://github.com/svekars, https://github.com/kit1980
2022-06-17 01:39:30 +00:00
Olga Andreeva
8a6d83079c Functionality/pickling for commhooks (#79334)
This PR addresses issue #75666.
A stateful communication hook can now be saved and reloaded to resume training.

The current PR adds this functionality for the PowerSGD communication hook and tests that the hook can be properly saved and restored.

The PowerSGD implementation uses ``__slots__``; as a result, the introduced `__getstate__` and `__setstate__` methods are implemented to work with `__slots__` and not `__dict__`.

`__getstate__`

    Returns a dictionary representing the ``PowerSGDState`` that will be pickled and saved.
    ``process_group`` is non-serializable and is excluded from the returned state.

`__setstate__`

    Takes a provided ``state`` and restores the ``PowerSGDState``.
    ``process_group`` is set to the default group, with a warning issued to the user.
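
A generic sketch of this pattern (not the actual PowerSGDState code; field names are illustrative):

```python
class HookState:
    __slots__ = ["process_group", "matrix_approximation_rank", "rng"]

    def __init__(self, process_group, matrix_approximation_rank=1, rng=None):
        self.process_group = process_group
        self.matrix_approximation_rank = matrix_approximation_rank
        self.rng = rng

    def __getstate__(self):
        # __slots__ classes have no __dict__, so build the state dict by hand
        # and drop the non-serializable process group
        return {s: getattr(self, s) for s in self.__slots__ if s != "process_group"}

    def __setstate__(self, state):
        # reloaded hooks fall back to the default process group
        # (the real implementation also warns the user here)
        self.process_group = None
        for slot, value in state.items():
            setattr(self, slot, value)
```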

Unit test

A hook-independent `_test_hook_pickling` is added with this PR, as well as `test_ddp_hook_pickling_powerSGD`, which tests `powerSGD`’s ability to be saved and reloaded.

Currently, the test creates a DDP model with the provided hook, trains it for 10 epochs, and saves the model's state and the hook's state.
During reloading, the unit test makes sure that a warning was logged (only one warning, and the proper one). It then checks that the reloaded hook and the original hook are the same. Finally, it checks that the hook's state was properly initialized:
- it compares slot values (all but two: `process_group` and `rng`) between the original and reloaded state
- it checks that the process group was set to the default group
- it checks that the random state was restored properly: `rng` is an instance of `np.random.RandomState`, represented by a tuple, one entry of which is an `ndarray` of dtype `uint32`, so `np.testing.assert_array_equal` is used for the assertion.

Future To-Do:
- Implement similar `__getstate__` and `__setstate__` methods for other stateful communication hooks
- Add appropriate tests

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79334
Approved by: https://github.com/rohan-varma, https://github.com/awgu
2022-06-16 23:15:34 +00:00
PyTorch MergeBot
44436947bc Revert "Reland PySymInt (#79617)"
This reverts commit 8ef6356f26.

Reverted https://github.com/pytorch/pytorch/pull/79617 on behalf of https://github.com/zengk95 due to this is breaking periodic jobs (and maybe pull) on trunk
2022-06-16 19:40:27 +00:00
macandro96
15828bcfd7 [ao][sparsity] Base class for Data Sparsifier
Base Data Sparsifier class for all data sparsifiers.
The abstract class accepts raw torch tensors / embeddings / embedding bags (refer to SUPPORTED_TYPES above)
to prepare for sparsification.
In this case, the mask (and parametrizations) is owned by the class and not by the user.
Specifically, the container object inside the class maintains the mask and parametrizations of the input data.

Test Plan:
```python test/test_ao_sparsity.py TestBaseDataSparsifier```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79251

Approved by: https://github.com/z-a-f, https://github.com/HDCharles
2022-06-16 17:31:22 +00:00
Nikolay Korovaiko
8ef6356f26 Reland PySymInt (#79617)
Fixes #ISSUE_NUMBER

Pull Request resolved: https://github.com/pytorch/pytorch/pull/79617
Approved by: https://github.com/Chillee
2022-06-16 04:18:06 +00:00