pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
kshitij12345	a732bbea23	[meta] Add meta support for fft ops (#79311 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/79311 Approved by: https://github.com/ezyang	2022-06-13 01:56:42 +00:00
kshitij12345	bd1a35dfc8	[meta] diag ops, trace (#79341 ) meta registration for `diag.out` and update test skips/expectedFailures Pull Request resolved: https://github.com/pytorch/pytorch/pull/79341 Approved by: https://github.com/ezyang	2022-06-12 18:45:03 +00:00
Edward Z. Yang	213a8fc992	Put symint overloads on a different name Due to implicit conversion shenanigans, having both IntArrayRef and SymIntArrayRef overloads makes {} ambiguous. While we could fix this by making a single unified type that accepts all the overloads we want, an easier fix was to just push the SymIntArrayRef overload to its own name. Signed-off-by: Edward Z. Yang <ezyangfb.com> Pull Request resolved: https://github.com/pytorch/pytorch/pull/79281 Approved by: https://github.com/suo	2022-06-12 14:36:39 +00:00
pritam	a81be44410	Fix `shard_module` to appropriately deal with sub process groups. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79264 `shard_module` API didn't work correctly with a sub-pg since `dist.scatter` actually takes the global rank as input for `src`. Fixing this by passing in the appropriate rank to `dist.scatter` Differential Revision: [D37062766](https://our.internmc.facebook.com/intern/diff/D37062766/) Approved by: https://github.com/fduwjj, https://github.com/wanchaol	2022-06-12 03:50:45 +00:00
PyTorch MergeBot	5413580f9e	Revert "[JIT] Propagate profiled information to DifferentiableGraph outputs" This reverts commit `1d2a6c2e94`. Reverted https://github.com/pytorch/pytorch/pull/78875 on behalf of https://github.com/davidberard98 due to Internal failures were bisected to this change	2022-06-12 00:14:08 +00:00
PyTorch MergeBot	66460c4a6a	Revert "Fixes maybe_broadcast to actually broadcast only when needed (#79298 )" This reverts commit `1cb1c2c08c`. Reverted https://github.com/pytorch/pytorch/pull/79298 on behalf of https://github.com/suo due to Broke FakeTensor tests on master, see: `1cb1c2c08c`	2022-06-11 23:36:18 +00:00
Mike Ruberry	1cb1c2c08c	Fixes maybe_broadcast to actually broadcast only when needed (#79298 ) Adds a `same_shape` util and updates maybe_broadcast to use it; previously maybe_broadcast was always broadcasting because its equality check was always failing. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79298 Approved by: https://github.com/ezyang	2022-06-11 22:04:47 +00:00
Michael Suo	30fb2c4aba	[lint] autoformat test/cpp and torch/csrc Let's have some fun. Pull Request resolved: https://github.com/pytorch/pytorch/pull/78828 Approved by: https://github.com/ezyang	2022-06-11 21:11:16 +00:00
Mikayla Gawarecki	1ec30a6647	Add offsets-based reduction to segment_reduce (CPU, CUDA) Pull Request resolved: https://github.com/pytorch/pytorch/pull/78907 Approved by: https://github.com/cpuhrsch	2022-06-11 17:43:42 +00:00
Michael Suo	c978b609f7	[ci] remove IN_CI env var The conventional env var to set is CI. Both circle and GHA set it, so IN_CI is unnecessary Pull Request resolved: https://github.com/pytorch/pytorch/pull/79229 Approved by: https://github.com/janeyx99	2022-06-11 17:16:30 +00:00
Michael Suo	f51d5233f2	[ci] fix GITHUB_ACTIONS env var checks `GITHUB_ACTIONS` is set to `true`, but some of our code checks that it is `1`. Make the checks more general. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79290 Approved by: https://github.com/janeyx99	2022-06-11 17:16:30 +00:00
George Qi	164029f783	masked logasumexp/logaddexp Pull Request resolved: https://github.com/pytorch/pytorch/pull/78291 Approved by: https://github.com/cpuhrsch	2022-06-11 05:46:36 +00:00
lezcano	54949a5abc	Simplify and optimize linalg.solve This PR heavily simplifies the code of `linalg.solve`. At the same time, this implementation saves quite a few copies of the input data in some cases (e.g. A is contiguous) We also implement it in such a way that the derivative goes from computing two LU decompositions and two LU solves to no LU decompositions and one LU solves. It also avoids a number of unnecessary copies the derivative was unnecessarily performing (at least the copy of two matrices). On top of this, we add a `left` kw-only arg that allows the user to solve `XA = B` rather concisely. Pull Request resolved: https://github.com/pytorch/pytorch/pull/74046 Approved by: https://github.com/nikitaved, https://github.com/IvanYashchuk, https://github.com/mruberry	2022-06-11 04:06:40 +00:00
Akshay Parashar	65a37923f9	[Static Runtime] Exception handling during fork subgraph execution (#79292 ) Summary: - Exception handling was not performed in forked subgraph execution - forked subgraph runtime can throw runtime exception. Future returned by prim::fork needs to handle exceptions so that aten::wait handles it. Test Plan: local test cases: - buck test caffe2/benchmarks/static_runtime/fb:test_fb_operators - buck test mode/opt caffe2/benchmarks/static_runtime:static_runtime_cpptest - buck test mode/opt caffe2/test:static_runtime Async execution of the subgraph is tested by adding pytorch profiler hooks on the StaticRuntime execution via below code. Async execution in threadpool is verfiied by checking trace with profile(activities=[ProfilerActivity.CPU]) as prof: static_runtime_module(inputs) prof.export_chrome_trace("trace.json") Differential Revision: D37072493 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79292 Approved by: https://github.com/mikeiovine	2022-06-11 03:11:49 +00:00
Michael Andreas Dagitses	ab2ca95dd1	turn on -Werror=unused-variable in our Bazel CPU build Summary: We also fix any existing issues. Note that we only do this for the CPU build because nvcc is considered a C++ toolchain but it does not have the same flag support. Adding flags to the GPU build will cause nvcc errors. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79156 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-11 02:46:34 +00:00
Mikayla Gawarecki	e727539c29	Support multi-dimensional lengths in segment_reduce to support pytorch_scatter.segment_* functionalities (CUDA) Pull Request resolved: https://github.com/pytorch/pytorch/pull/77061 Approved by: https://github.com/cpuhrsch	2022-06-11 01:45:22 +00:00
anjali411	38350acf8f	Autogen Tags enum, and allow specifying tags while defining an op Pull Request resolved: https://github.com/pytorch/pytorch/pull/79322 Approved by: https://github.com/albanD	2022-06-11 00:29:32 +00:00
drisspg	10dfdf2066	engine changes Pull Request resolved: https://github.com/pytorch/pytorch/pull/79296 Approved by: https://github.com/soulitzer	2022-06-11 00:07:27 +00:00
Michael Andreas Dagitses	606b234336	turn on -Werror=unused-function in our Bazel CPU build Summary: We also fix any existing issues. Note that we only do this for the CPU build because nvcc is considered a C++ toolchain but it does not have the same flag support. Adding flags to the GPU build will cause nvcc errors. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79154 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-10 22:11:54 +00:00
albanD	4aca751921	remove spurious warning in amp (#79203 ) fix https://github.com/pytorch/pytorch/issues/72527 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79203 Approved by: https://github.com/anjali411	2022-06-10 21:53:58 +00:00
PyTorch MergeBot	d28e9e145b	Revert "[nvfuser_upstream_push] nvfuser code base bump 060822 (#79147 )" This reverts commit `49c41b87a2`. Reverted https://github.com/pytorch/pytorch/pull/79147 on behalf of https://github.com/janeyx99 due to Broke 11.3 builds on trunk `49c41b87a2`	2022-06-10 20:55:10 +00:00
PyTorch MergeBot	bcd7a20953	Revert "turn on -Werror=unused-function in our Bazel CPU build" This reverts commit `67d313a032`. Reverted https://github.com/pytorch/pytorch/pull/79154 on behalf of https://github.com/malfet due to Breaks bazel build: `67d313a032`	2022-06-10 20:43:03 +00:00
kshitij12345	5e656eaae5	[refs] ravel (#78421 ) As per title Pull Request resolved: https://github.com/pytorch/pytorch/pull/78421 Approved by: https://github.com/mruberry	2022-06-10 20:20:13 +00:00
kshitij12345	3d77017674	[primTorch] refs: masked_fill (#78132 ) TODO * [x] Add error inputs Pull Request resolved: https://github.com/pytorch/pytorch/pull/78132 Approved by: https://github.com/mruberry	2022-06-10 20:19:48 +00:00
PyTorch MergeBot	b712467cd1	Revert "Add mutation checks for tensor inputs" This reverts commit `83c0a2bc38`. Reverted https://github.com/pytorch/pytorch/pull/79078 on behalf of https://github.com/davidberard98 due to broke bazel build-and-test, see [https://github.com/pytorch/pytorch/runs/6836001002?check_suite_focus=true](https://github.com/pytorch/pytorch/runs/6836001002?check_suite_focus=true%22)	2022-06-10 20:15:30 +00:00
kshitij12345	7b307e5fca	[meta] angle, angle.out (#79278 ) meta registration for `angle, angle.out`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79278 Approved by: https://github.com/anjali411	2022-06-10 20:06:31 +00:00
jjsjann123	49c41b87a2	[nvfuser_upstream_push] nvfuser code base bump 060822 (#79147 ) Syncing nvfuser devel branch to upstream master. https://github.com/csarofeen/pytorch/ Bug fixes and minor refactor Squashed commits to WAR github API Commits that's actually in this PR from the devel branch: ``` 4c60e7dff22a494632370e5df55c011007340d06 Add examples infrastructure for using nvFuser in a standalone program (#1725) 02a05d98334ffa580d73ccb28fdb8c577ad296fe Fix issue #1751 (#1753) 8a69aa320bd7629e1709fe5ceb7104d2c88ec84c Refactor NvFuser transpose API to match eager mode behavior (#1746) ffdf6b7709048170d768217fcd7083fc8387f932 Remove BroadcastWithoutStride. (#1738) 02bab16035e70734450c02124f5cdaa95cf5749d Fix flipping of a boolean flag (#1745) 465d66890c8242e811224359cbdb1c2915490741 cleanup (#1744) 26d354e68720bc7dd2d3b1338ac01b707a230b6a fixing noncontig broadcast (#1742) 856b6b2f9073662dd98ca22ba6c3540e20eb1cdd Add IterDomainBuilder (#1736) 1fd974f912cd4c1e21cbd16e2abb23598d66a02f fixing warning for gcc7 (#1732) de2740a43a869f8272c2648e091d7b8235097db9 disabling complex in python tests for #1730 (#1733) fbbbe0a2e7c7a63e0e2719b8bfccb759b714221a fixing MSVC build (#1728) b5feee5e2b28be688dbddc766f3c0220389c8175 Fix the fused reduction runtime kernel (#1729) 5247682dff5980bb66edf8d3aac25dea2ef2ced5 Re-entrant GroupedGridReduction (#1727) ``` RUN_TORCHBENCH: nvfuser Pull Request resolved: https://github.com/pytorch/pytorch/pull/79147 Approved by: https://github.com/davidberard98	2022-06-10 19:37:42 +00:00
PyTorch MergeBot	35eda5f959	[DataPipe] Correcting deprecation version Pull Request resolved: https://github.com/pytorch/pytorch/pull/79302 Approved by: https://github.com/ejguan	2022-06-10 19:31:29 +00:00
samdow	5e926aafab	add utils for checking that all modes are in the same scope and finding the outermost mode Pull Request resolved: https://github.com/pytorch/pytorch/pull/78847 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-06-10 19:31:05 +00:00
Scott Wolchok	fff1948b02	[PyTorch] intrusive_ptr: don't guarantee release_resources will be called Pull Request resolved: https://github.com/pytorch/pytorch/pull/76767 We're spending a virtual function call in the common case where there are no weak references just to save a small amount of care in intrusive_ptr_target subclasses that override release_resources, of which there aren't very many. Differential Revision: [D36109757](https://our.internmc.facebook.com/intern/diff/D36109757/) NOTE FOR REVIEWERS: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D36109757/)! Approved by: https://github.com/ezyang	2022-06-10 19:30:35 +00:00
Akshay Parashar	26f2376b78	[Static Runtime] support fork and wait operations on Static Runtime (#79211 ) Summary: - Initial support for fork was done on JIT interpreter. This patch enabled the async execution on static runtime - For each forked node, seeprate runtime is created for the execution of subgraph. Async execution is handled by aten::ParallelThreadPoolNative threadpool - aten::wait waits on the future of fork to be completed Test Plan: local test cases: - buck test caffe2/benchmarks/static_runtime/fb:test_fb_operators - buck test mode/opt caffe2/benchmarks/static_runtime:static_runtime_cpptest - buck test mode/opt caffe2/test:static_runtime Async execution of the subgraph is tested by adding pytorch profiler hooks on the StaticRuntime execution via below code. Async execution in threadpool is verfiied by checking trace with profile(activities=[ProfilerActivity.CPU]) as prof: static_runtime_module(inputs) prof.export_chrome_trace("trace.json") Differential Revision: D37044513 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79211 Approved by: https://github.com/mikeiovine	2022-06-10 19:11:03 +00:00
Rohan Varma	ec86070922	Checkpoint util Pull Request resolved: https://github.com/pytorch/pytorch/pull/78704 Approved by: https://github.com/zhaojuanmao	2022-06-10 18:37:36 +00:00
Michael Andreas Dagitses	67d313a032	turn on -Werror=unused-function in our Bazel CPU build Summary: We also fix any existing issues. Note that we only do this for the CPU build because nvcc is considered a C++ toolchain but it does not have the same flag support. Adding flags to the GPU build will cause nvcc errors. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79154 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-10 18:30:08 +00:00
samdow	3734fcc8f8	add ability to push a mode if the current mode is an ancestor Pull Request resolved: https://github.com/pytorch/pytorch/pull/78822 Approved by: https://github.com/ezyang, https://github.com/zou3519	2022-06-10 18:27:04 +00:00
goldenxuett	83c0a2bc38	Add mutation checks for tensor inputs Pull Request resolved: https://github.com/pytorch/pytorch/pull/79078 Approved by: https://github.com/davidberard98, https://github.com/Krovatkin	2022-06-10 18:17:33 +00:00
Brian Hirsh	7b3a0ff87a	Port `index.Tensor` to structured kernels. Tracking issue: #55070 Pull Request resolved: https://github.com/pytorch/pytorch/pull/69607 Approved by: https://github.com/bdhirsh	2022-06-10 17:27:47 +00:00
George Qi	a90f006fe5	add strides to slow path Pull Request resolved: https://github.com/pytorch/pytorch/pull/78610 Approved by: https://github.com/ezyang	2022-06-10 16:59:14 +00:00
kshitij12345	adaecb2cbb	[chalf] index_select: cpu support (#79217 ) Fixes https://github.com/pytorch/pytorch/issues/79204 PR https://github.com/pytorch/pytorch/pull/78173 took care of adding CUDA support. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79217 Approved by: https://github.com/mruberry	2022-06-10 14:06:32 +00:00
Peter Bell	7843a5e882	Move Tensor.grad back into C++ `Tensor.grad` was moved to python in #30531 to add a warning. However, that warning has since been lowered into C++ so this wrapper is no longer necessary. Pull Request resolved: https://github.com/pytorch/pytorch/pull/76675 Approved by: https://github.com/albanD	2022-06-10 13:44:45 +00:00
Tongzhou Wang	dd620c4575	add type annotation to distributions.kl_divergence (#78432 ) Fixes #78431 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78432 Approved by: https://github.com/fritzo, https://github.com/ejguan	2022-06-10 13:39:20 +00:00
PyTorch MergeBot	c99ea0db46	Revert "[PyTorch] Record Sequence Number to Match Forward and Backward Operators (#78795 )" This reverts commit `a299a2fa26`. Reverted https://github.com/pytorch/pytorch/pull/78795 on behalf of https://github.com/janeyx99 due to Broke profiler tests `a299a2fa26`	2022-06-10 13:11:44 +00:00
Michael Andreas Dagitses	f96d96a7fc	turn on -Werror=type-limits in our Bazel CPU build Summary: We also fix any existing issues. Test Plan: Built locally, rely on CI to confirm. Reviewers: malfet Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79139 Approved by: https://github.com/seemethere, https://github.com/osalpekar, https://github.com/albanD	2022-06-10 10:04:08 +00:00
vspenubarthi	28c541776c	[ao] Added fx model report per_channel detector Summary: This code is meant to be a tool to help people get the most out of their backend by hinting them to use per_channel quantization if it's supported, which will help increase accuracy significantly. The code is completed and ready to be reviewed. Test Plan: test/quantization/fx/test_model_report_fx.py Reviewers: Subscribers: Tasks: Tags: Pull Request resolved: https://github.com/pytorch/pytorch/pull/79104 Approved by: https://github.com/HDCharles	2022-06-10 08:09:59 +00:00
pritam	b9e3d722c4	Use appropriate dtype for sharded linear implementation. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79255 We use several collective operations in our sharded linear implementation and for many collectives, we do not set the `dtype` of the output tensor appropriately. As a result, using a datatype like torch.float16 (which is not the default torch.float32) results in errors. Fixing this across the board and adding appropriate tests. Differential Revision: [D37059752](https://our.internmc.facebook.com/intern/diff/D37059752/) Approved by: https://github.com/fduwjj, https://github.com/wanchaol	2022-06-10 07:32:15 +00:00
Louis Feng	a299a2fa26	[PyTorch] Record Sequence Number to Match Forward and Backward Operators (#78795 ) Summary: Add sequence number to map forward and backward operators. Test Plan: ``` buck build mode/dev-nosan cea/ml_perf_model/gpu/scripts: --show-output buck-out/gen/caffe2/test/profiler#binary.par test_profiler.TestExecutionGraph.test_execution_graph_start_stop ``` Outputs with seq_id: P505545974 Differential Revision: D36881999 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78795 Approved by: https://github.com/robieta	2022-06-10 05:51:17 +00:00
Tran Le	43ad2833b9	[pytorch][papaya] Replace parseObject with dummy function in FlatBufferLoader during load parameters (#79167 ) Summary: As titled. Test Plan: ``` buck test //xplat/caffe2:test_lite_trainer_pickle_and_flatbuffer //xplat/caffe2:test_lite_trainer ``` Differential Revision: D37021858 Pull Request resolved: https://github.com/pytorch/pytorch/pull/79167 Approved by: https://github.com/qihqi	2022-06-10 05:41:25 +00:00
Kshiteej K	d837443a6f	[fix] composite compliance: matrix_rank (#78968 ) Ref: https://github.com/pytorch/pytorch/issues/69991 Pull Request resolved: https://github.com/pytorch/pytorch/pull/78968 Approved by: https://github.com/zou3519	2022-06-10 05:41:19 +00:00
PyTorch MergeBot	fefff54cad	Revert "Revert "Revert "Added {logical_not, trace} refs, moved logical ops to use method overloads""" This reverts commit `a2d2981e8e`. Reverted https://github.com/pytorch/pytorch/pull/79224 on behalf of https://github.com/suo due to broke lots of things `a2d2981e8e`	2022-06-10 04:40:43 +00:00
Horace He	a2d2981e8e	Revert "Revert "Added {logical_not, trace} refs, moved logical ops to use method overloads"" This reverts commit `d67309aefb`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/79224 Approved by: https://github.com/mruberry	2022-06-10 03:07:14 +00:00
PyTorch MergeBot	87a5ecced2	Revert "Support multi-dimensional lengths in segment_reduce to support pytorch_scatter.segment_* functionalities (CUDA)" This reverts commit `40f7ef1f3d`. Reverted https://github.com/pytorch/pytorch/pull/77061 on behalf of https://github.com/janeyx99 due to Broke segment_reduce tests on trunk, e.g., `40f7ef1f3d`	2022-06-10 01:57:34 +00:00

1 2 3 4 5 ...

22158 Commits