pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
Luca Wehrstedt	8e53bf010d	Use new TensorPipe functions to create channels (#51550 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51550 ghstack-source-id: 120931213 Test Plan: Export to CircleCI Reviewed By: beauby Differential Revision: D26147946 fbshipit-source-id: edd44b5edf7041efcc9662cc3bfc550663976fc1	2021-02-03 14:20:49 -08:00
Luca Wehrstedt	56ef24bc0f	Use new TensorPipe functions to create transports (#51549 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51549 ghstack-source-id: 120931215 Test Plan: Export to CircleCI Reviewed By: beauby Differential Revision: D26147369 fbshipit-source-id: 43f58f27edec964c24c0bf4ea76f2a47695ee1ea	2021-02-03 14:17:49 -08:00
Helene	47557b95ef	Removed typographical error from tech docs (#51286 ) Summary: Dublications removed from tech docs. ![Screenshot](https://user-images.githubusercontent.com/71665475/106158807-6e5b8100-6184-11eb-9036-bccdf2086c31.png) Pull Request resolved: https://github.com/pytorch/pytorch/pull/51286 Reviewed By: albanD Differential Revision: D26227627 Pulled By: ailzhang fbshipit-source-id: efa0cd90face458673b8530388378d5a7eb0f1cf	2021-02-03 14:09:36 -08:00
Edward Yang	333a0c8b6f	Add support for generating faithful at::cpu signatures (#51499 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51499 I'm going to turn on at::cpu signatures on for all operators; before I do it I want to make sure I'm at feature parity everywhere. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: bhosmer Differential Revision: D26187855 Pulled By: ezyang fbshipit-source-id: 8fdfd9d843fc98435b1f1df8b475d3184d87dc96	2021-02-03 14:03:50 -08:00
Edward Yang	81c7c3bae5	Add api.structured; switch structured kernels to use const Tensor& everywhere (#51490 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51490 Mutable Tensor ref is a source of endless confusion for kernel writers; if we're going to make everyone rewrite their kernels, might as well also get rid of mutable Tensor& while we're at it. This is a refactor-then-small-update double whammy. The refactor is to separate tools.codegen.api.structured from api.native for describing the type signatures of structured kernels (previously, I was naughtily reusing native for this purpose--now I need it to behave differently as Tensor). This started off as a copy paste, but since there are not that many structured kernels so far I could delete all of the legacy logic from native that didn't make sense (without having to go out and fix all the use sites all at once). One more small addition was teaching translate to convert Tensor& to const Tensor&. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: bhosmer Differential Revision: D26182413 Pulled By: ezyang fbshipit-source-id: ed636866add3581179669cf9283f9835fcaddc06	2021-02-03 14:03:46 -08:00
Edward Yang	648cdb7d0a	Relax type signature for tools.codegen.api.translate (#51477 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51477 Passing in a full binding is still OK, but if you have less (e.g., an Expr/CType), that will do too. I'll need this for some codegen patches I'm doing later. Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: bhosmer Differential Revision: D26179560 Pulled By: ezyang fbshipit-source-id: 5730dfb2c91bf5325496e57b0c91eb6823c9194d	2021-02-03 14:00:47 -08:00
Yi Wang	43df03de13	[Gradient Compression] Replace torch.sqrt(torch.sum(col ** 2)) by torch.norm() (#51629 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51629 Leverage the existing util functions as much as possible for potential performance gain. Original PR issue: Investigate Applying PowerSGD to Communication Hook for Gradient Compression #47202 ghstack-source-id: 120919883 Test Plan: buck test mode/dev-nosan caffe2/test/distributed:c10d -- test_powerSGD_ddp_comm_hook_nccl No performance regression: f248664994 uses `torch.norm()` ``` total: 32 GPUs -- 32 GPUs: p25: 1.050 30/s (batch size 32) p50: 1.230 26/s (batch size 32) p75: 1.449 22/s (batch size 32) p90: 1.611 19/s (batch size 32) p95: 1.702 18/s (batch size 32) backward: 32 GPUs -- 32 GPUs: p25: 0.769 41/s (batch size 32) p50: 0.920 34/s (batch size 32) p75: 1.139 28/s (batch size 32) p90: 1.322 24/s (batch size 32) p95: 1.440 22/s (batch size 32) ``` f248678690 does not use `torch.norm()` ``` total: 32 GPUs -- 32 GPUs: p25: 1.056 30/s (batch size 32) p50: 1.249 25/s (batch size 32) p75: 1.443 22/s (batch size 32) p90: 1.608 19/s (batch size 32) p95: 1.711 18/s (batch size 32) backward: 32 GPUs -- 32 GPUs: p25: 0.777 41/s (batch size 32) p50: 0.939 34/s (batch size 32) p75: 1.127 28/s (batch size 32) p90: 1.322 24/s (batch size 32) p95: 1.448 22/s (batch size 32) ``` Reviewed By: pritamdamania87 Differential Revision: D26219835 fbshipit-source-id: 31d8ad3401d4efced4a6069f4f1e169ea3372697	2021-02-03 13:39:11 -08:00
Hector Yuen	00675292ca	replace silufp16 with cubic interpolation (#51645 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51645 added cubic interpolation Test Plan: increase the input domain, reduced the threshold to 0 Reviewed By: kausv Differential Revision: D26212239 fbshipit-source-id: e0813d8a4f3f54cfd0bf62e385cd28fa4a1976e8	2021-02-03 12:58:38 -08:00
Xu Zhao	cae4379826	Enable FLOPS Computation for Experimental Kineto Profiler (#51503 ) Summary: Add the FLOPS metric computation to the experimental Kineto profiler. This includes saving necessary extra arguments and compute flops in the C++ code, and extract the FLOPS value from the Python frontend. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51503 Test Plan: Build PyTorch with USE_KINETO option, then run the unit test: ```python python test/test_profiler.py -k test_flops ``` Reviewed By: ilia-cher Differential Revision: D26202711 Pulled By: xuzhao9 fbshipit-source-id: 7dab7c513f454355a220b72859edb3ccbddcb3ff	2021-02-03 12:15:23 -08:00
Omkar Salpekar	3361d365bd	[Gloo] Use TORCH_CHECK for ensuring tag is nonnegative (#51370 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51370 TORCH_CHECK should be used when confirming the correctness of function arguments like the tag passed to Gloo functions. ghstack-source-id: 120908449 Test Plan: Sandcastle/CI Reviewed By: mingzhe09088 Differential Revision: D26152359 fbshipit-source-id: ddffaa6f11393aaedaf0870759dc526d8d4530ee	2021-02-03 11:48:20 -08:00
Shen Li	a3f2fe0d52	Prevent CUDAFuture from using uninitialized device index (#51505 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51505 Test Plan: Imported from OSS Reviewed By: lw Differential Revision: D26187380 Pulled By: mrshenli fbshipit-source-id: 437bb1244a65ee859458d9a87fdaef9f4dd20b59	2021-02-03 11:04:33 -08:00
Jasha	a651696ab4	fix misspelling in swa_utils.pyi (#51608 ) Summary: Change `avg_fun -> avg_fn` to match the spelling in the `.py` file. (`swa_utils.pyi` should match `swa_utils.py`) Pull Request resolved: https://github.com/pytorch/pytorch/pull/51608 Reviewed By: glaringlee Differential Revision: D26224779 Pulled By: zou3519 fbshipit-source-id: 01ff7173ba0a996f1b7a653438acb6b6b4659de6	2021-02-03 10:51:22 -08:00
Mikhail Zolotukhin	c639513378	[TensorExpr] Resubmit: Introduce ExternalCall nodes to TE IR. (#51594 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51594 ExternalCall nodes represent opaque calls to external functions to fill a tensor (buffer) with values. It could be used to include nodes that are otherwise not-representable as TE, or whose TE representation is currently too slow. To make an external function available in NNC as ExternalCall, one needs to implement a "bridge" function that would take raw (void*) pointers to the data along with the arrays containing dimension info. This function would then internally call the desired external function and make sure the results of the call are correctly placed in the provided raw data buffers. The reason the PR was previously reverted was that the LLVM generated calls to bridge functions were breaking unwind tables. This is now fixed by requiring bridge functions to never throw and setting the corresponding attribute in the LLVM generated code. Differential Revision: D26213882 Test Plan: Imported from OSS Reviewed By: pbelevich, ngimel Pulled By: ZolotukhinM fbshipit-source-id: db954d8338e2d750c2bf0a41e88e38bd494f2945	2021-02-03 10:22:54 -08:00
anjali411	18a7ec7d7d	Update the JIT complex type name to be consistent with Python (#51476 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51476 Test Plan: Imported from OSS Reviewed By: ezyang Differential Revision: D26179237 Pulled By: anjali411 fbshipit-source-id: 6a5c60c8545eb42416583836b8038ceffd3f3244	2021-02-03 09:59:08 -08:00
Vincent Quenneville-Belair	896f82aa92	[optim] make functional api be private (#51316 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51316 Make optim functional API be private until we release with beta Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D26213469 fbshipit-source-id: b0fd001a8362ec1c152250bcd57c7205ed893107	2021-02-03 09:29:33 -08:00
Jane Xu	550c965b2e	Re-enable test_standalone_load for Windows 11.1 (#51596 ) Summary: This fixes the previous erroring out by adding stricter conditions in cpp_extension.py. To test, run a split torch_cuda build on Windows with export BUILD_SPLIT_CUDA=ON && python setup.py develop and then run the following test: python test/test_utils.py TestStandaloneCPPJIT.test_load_standalone. It should pass. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51596 Reviewed By: malfet Differential Revision: D26213816 Pulled By: janeyx99 fbshipit-source-id: a752ce7f9ab9d73dcf56f952bed2f2e040614443	2021-02-03 08:58:34 -08:00
Jeff Daily	727f163bea	caffe2 test.sh pip might not need sudo if pip is root (#50223 ) Summary: Update logic in MAYBE_SUDO check. Assumption was incorrect that if pip was installed as user then sudo is needed. pip could be installed as root and run as root. Assumption was initially pip was root and user was non root. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50223 Reviewed By: H-Huang Differential Revision: D26212127 Pulled By: walterddr fbshipit-source-id: 20b316606b6c210dc705a972c13088fa3d9bfddd	2021-02-03 08:13:03 -08:00
Iurii Zdebskyi	5cf3278723	Refactor ForeachUnaryOps.cu (#49248 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/49248 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D25502940 Pulled By: izdeby fbshipit-source-id: fce2f18a4f62f7a5fdd6747707d006c3588530d1	2021-02-03 07:05:27 -08:00
Erjia Guan	52de407b4b	[DataLoader] Rename Functional DataSet to DataPipe (#51488 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51488 Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D26209888 Pulled By: ejguan fbshipit-source-id: cb8bc852b1e4d72be81e0297308a43954cd95332	2021-02-03 07:01:09 -08:00
Erjia Guan	bea0519b0b	[WIP][DataLoader] Implement BucketBatchIterableDataset (#51126 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51126 BucketBatch: Get a chunk of data as a bucket, and sort the bucket by the specified key, then batching. If sort key is not specified, directly use batchIterableDS.. 1. Implement BucketBatch for bucket sampler 2. Improve BatchDS tests Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D26209890 Pulled By: ejguan fbshipit-source-id: 8519e2e49da158b3fe32913c8f3cadfa6f3ff1fc	2021-02-03 07:01:05 -08:00
Erjia Guan	14ee63f7e6	[WIP][DataLoader] Implement CallableIterableDataset (#50045 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50045 Add CallableIterableDataset Modify CollateIterableDataset as another callable Test Plan: Imported from OSS Reviewed By: H-Huang Differential Revision: D26209889 Pulled By: ejguan fbshipit-source-id: d4773026c1269e43b29a3efb16e36e1865fdd024	2021-02-03 06:54:48 -08:00
Alban Desmaison	c311b8961a	Revert D26113953: [pytorch][PR] [ZeroRedundancyOptimizer] Elastic and pytorch compatible checkpoints Test Plan: revert-hammer Differential Revision: D26113953 (`bbe18e3527`) Original commit changeset: 030bfeee2c34 fbshipit-source-id: 6c1494ad01c2f96a15601329b4fce3fef4b38a01	2021-02-03 06:12:21 -08:00
Yanan Cao	75ee575671	[Usability] Handle repeated jit.script calls on function gracefully (#51545 ) Summary: Repeated calls on `class` is not handled since `class`'s compilation process will change soon in https://github.com/pytorch/pytorch/issues/44324 Pull Request resolved: https://github.com/pytorch/pytorch/pull/51545 Reviewed By: H-Huang Differential Revision: D26207010 Pulled By: gmagogsfm fbshipit-source-id: 5f3f64b0e4b4ab4dbf5c9411d9c143472922a106	2021-02-03 02:09:25 -08:00
Dhruv Matani	7b556db69d	[PyTorch Mobile] Skip inferring function schema from the C++ function type (#50457 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50457 The code to infer function schema from a C++ function relies on templates and code expansion. This uses valuable binary size. We can avoid inferring the schema from the C++ function type (arguments, name, return value) in case that the function implementation is being added to the dispatcher via `m.impl`. In this case, it is assumed that we have a schema registered already. Adding an implementation via `m.def` still triggers schema inferrence. In addition, we don't do schema schema checks on mobile, so the schema is not needed in the first place. ghstack-source-id: 120915259 Test Plan: Auto-unit tests succeed. ### Size test: igios ``` D25853094-V1 (https://www.internalfb.com/intern/diff/D25853094/?dest_number=119632217) igios: Succeeded Change in Download Size for arm64 + 3x assets variation: -21.8 KiB Change in Uncompressed Size for arm64 + 3x assets variation: -45.5 KiB Mbex Comparison: https://our.intern.facebook.com/intern/mbex/bsb:261049318687117@base/bsb:261049318687117@diff/ ``` ### Size test: fbios ``` D25853094-V1 (https://www.internalfb.com/intern/diff/D25853094/?dest_number=119632217) fbios: Succeeded Change in Download Size for arm64 + 3x assets variation: -27.2 KiB Change in Uncompressed Size for arm64 + 3x assets variation: -80.1 KiB Mbex Comparison: https://our.intern.facebook.com/intern/mbex/bsb:454289062251865@base/bsb:454289062251865@diff/ ``` Reviewed By: smessmer Differential Revision: D25853094 fbshipit-source-id: e138d9dff7561d424bfb732f3a5898466f018f60	2021-02-03 00:37:35 -08:00
Gemfield	62f6e55439	Fix the missing parameter in get_sha function (#51290 ) Summary: get_sha() function didn't pass in the pytorch_root argument, so subprocess.check_output always raise exception since pytorch_root is not defined, thus always return 'Unknown'. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51290 Reviewed By: soumith Differential Revision: D26219051 Pulled By: malfet fbshipit-source-id: fee2c4f5fdfc61983559eec1600b9accb344c527	2021-02-02 23:25:57 -08:00
Ansley Ussery	ab4623da16	Document FX debugging (#51530 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51530 Test Plan: Imported from OSS Reviewed By: jamesr66a Differential Revision: D26192641 Pulled By: ansley fbshipit-source-id: c69ab1bb2451d8ee5a729445f52bccc66e6f431b	2021-02-02 23:17:51 -08:00
Nikita Shulga	f7313b3105	Fix Python.h discovery logic on some MacOS platforms (#51586 ) Summary: On all non-Windows platforms we should use 'posix_prefix' schema to discover location of Python.h header Pull Request resolved: https://github.com/pytorch/pytorch/pull/51586 Reviewed By: ezyang Differential Revision: D26208684 Pulled By: malfet fbshipit-source-id: bafa6d79de42231629960c642d535f1fcf7a427f	2021-02-02 21:38:37 -08:00
Kimish Patel	7360ce36e4	[QNNPACK:Sparsity] Add A matrix pretransformed based sparse kernels for FC (#50587 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50587 This diff introduces two kernesl. One is to pretransform A to do block wise transforms. And then the kernel that directly works on top pretransformed weights. Test Plan: ./build/local/q8gemm-sparse-test ./build/local/fully-connected-sparse-test Imported from OSS Reviewed By: AshkanAliabadi Differential Revision: D25925504 fbshipit-source-id: 9b02819405ce587f20e675b154895dc39ecd1bad	2021-02-02 21:33:02 -08:00
Kimish Patel	eb571b33fe	[QNNPACK Sparse] Create fc sparse operator (#50586 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50586 Creates sparse operator for fully connected layer. Test Plan: ./build/local/fully-connected-sparse-test Imported from OSS Reviewed By: AshkanAliabadi Differential Revision: D25925503 fbshipit-source-id: 49042158ba3bf26a716a6d68258fc7ead85ce9d8	2021-02-02 21:32:58 -08:00
Kimish Patel	520f96b8c7	[QNNPACK] Block Sparse kernel. First commit. (#50585 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50585 This diff introduces sparse kenel for sse. Uses 1x4 block sparse pattern. Test Plan: ./build/local/q8gemm-sparse-test Imported from OSS Reviewed By: AshkanAliabadi Differential Revision: D25925500 fbshipit-source-id: e112cafd3226f8c11487c139cd414fa53a58fd0d	2021-02-02 21:30:24 -08:00
wanyu2018umac	444203c52f	Fix torch.cdist backward CUDA error due to illegal gridDim setting (#51569 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/49928 Pull Request resolved: https://github.com/pytorch/pytorch/pull/51569 Reviewed By: mruberry Differential Revision: D26215694 Pulled By: ngimel fbshipit-source-id: 0710417e6a802424e2dcada325f27452c95d042f	2021-02-02 20:41:24 -08:00
Gemfield	b48ee75507	Fix quantization doc issue (#50187 ) Summary: There has a description error in quantization.rst, fixed it. Pull Request resolved: https://github.com/pytorch/pytorch/pull/50187 Reviewed By: mrshenli Differential Revision: D25895294 Pulled By: soumith fbshipit-source-id: c0b2e7ba3fadfc0977ab2d4d4e9ed4f93694cedd	2021-02-02 20:33:21 -08:00
Jeffrey Wan	b18eeaa80a	Implement `np.diff` for single order differences (#50569 ) Summary: Implements `np.diff` for single order differences only: - method and function variants for `diff` and function variant for `diff_out` - supports out variant, but not in-place since shape changes - adds OpInfo entry, and test in `test_torch` - automatic autograd because we are using the `Math` dispatch _Update: we only support Tensors for prepend and append in this PR. See discussion below and comments for more details._ Currently there is a quirk in the c++ API based on how this is implemented: it is not possible to specify scalar prepend and appends without also specifying all 4 arguments. That is because the goal is to match NumPy's diff signature of `diff(int n=1, int dim=-1, Union[Scalar, Tensor] prepend=None, Union[Scalar, Tensor] append)=None` where all arguments are optional, positional and in the correct order. There are a couple blockers. One is c++ ambiguity. This prevents us from simply doing `diff(int n=1, int dim=-1, Scalar? prepend=None, Tensor? append=None)` etc for all combinations of {Tensor, Scalar} x {Tensor, Scalar}. Why not have append, prepend not have default args and then write out the whole power set of {Tensor, Scalar, omitted} x {Tensor, Scalar, omitted} you might ask. Aside from having to write 18 overloads, this is actually illegal because arguments with defaults must come after arguments without defaults. This would mean having to write `diff(prepend, append, n, dim)` which is not desired. Finally writing out the entire power set of all arguments n, dim, prepend, append is out of the question because that would actually involve 2 * 2 * 3 * 3 = 36 combinations. And if we include the out variant, that would be 72 overloads! With this in mind, the current way this is implemented is actually to still do `diff(int n=1, int dim=-1, Scalar? prepend=None, Tensor? append=None)`. But also make use of `cpp_no_default_args`. The idea is to only have one of the 4 {Tensor, Scalar} x {Tensor, Scalar} provide default arguments for the c++ api, and add `cpp_no_default_args` for the remaining 3 overloads. With this, Python api works as expected, but some calls such as `diff(prepend=1)` won't work on c++ api. We can optionally add 18 more overloads that cover the {dim, n, no-args} x {scalar-tensor, tensor-scalar, scalar-scalar} x {out, non-out} cases for c++ api. _[edit: counting is hard - just realized this number is still wrong. We should try to count the cases we do cover instead and subtract that from the total: (2 * 2 * 3 * 3) - (3 + 2^4) = 17. 3 comes from the 3 of 4 combinations of {tensor, scalar}^2 that we declare to be `cpp_no_default_args`, and the one remaining case that has default arguments has covers 2^4 cases. So actual count is 34 additional overloads to support all possible calls]_ _[edit: thanks to https://github.com/pytorch/pytorch/issues/50767 hacky_wrapper is no longer necessary; it is removed in the latest commit]_ hacky_wrapper was also necessary here because `Tensor?` will cause dispatch to look for the `const optional<Tensor>&` schema but also generate a `const Tensor&` declaration in Functions.h. hacky_wrapper allows us to define our function as `const Tensor&` but wraps it in optional for us, so this avoids both the errors while linking and loading. _[edit: rewrote the above to improve clarity and correct the fact that we actually need 18 more overloads (26 total), not 18 in total to complete the c++ api]_ Pull Request resolved: https://github.com/pytorch/pytorch/pull/50569 Reviewed By: H-Huang Differential Revision: D26176105 Pulled By: soulitzer fbshipit-source-id: cd8e77cc2de1117c876cd71c29b312887daca33f	2021-02-02 20:25:16 -08:00
Yanli Zhao	e54cbb8250	Create PyTorch DDP logging APIs for applications to use (#50637 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50637 add APIs for logging pytorch ddp logging data in applications. Test Plan: unit tests Reviewed By: rohan-varma Differential Revision: D25933411 fbshipit-source-id: 57c248a2f002da06a386fc7406d3e5533ebb9124	2021-02-02 18:24:21 -08:00
Natalia Gimelshein	26f9ac98e5	Revert D26105797: [pytorch][PR] Exposing linear layer to fuser Test Plan: revert-hammer Differential Revision: D26105797 (`e488e3c443`) Original commit changeset: 6f7cedb9f6e3 fbshipit-source-id: f0858cefed76d726e9dba61e51e1eaf2af4c99c5	2021-02-02 17:39:17 -08:00
Jeff Daily	5a402274d4	[ROCm] add 4.0.1 to nightly builds (#51257 ) Summary: Depends on https://github.com/pytorch/builder/pull/628. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51257 Reviewed By: H-Huang, seemethere Differential Revision: D26208135 Pulled By: malfet fbshipit-source-id: 8a4386b5661c6f71df28d98279e2771c4044f06c	2021-02-02 16:52:38 -08:00
Richard Barnes	b283ac6da4	"whitelist" -> "allowlist" (#51375 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51375 Test Plan: Sandcastle tests Reviewed By: iseeyuan Differential Revision: D26150609 fbshipit-source-id: 1ca17bc8943598a42f028005d1f6d3f362fe2659	2021-02-02 16:20:34 -08:00
Richard Barnes	c791a30484	Fix warnings in "ForeachOpsKernels" with c10::irange (#50783 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/50783 Compiling currently shows: ``` Jan 13 16:46:28 In file included from ../aten/src/ATen/native/ForeachOpsKernels.cpp:2: Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:28:21: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:44:21: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:149:25: warning: comparison of integers of different signs: 'int64_t' (aka 'long long') and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int64_t i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:164:25: warning: comparison of integers of different signs: 'int64_t' (aka 'long long') and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int64_t i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:183:25: warning: comparison of integers of different signs: 'int64_t' (aka 'long long') and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int64_t i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachUtils.h:198:25: warning: comparison of integers of different signs: 'int64_t' (aka 'long long') and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 for (int64_t i = 0; i < tensors1.size(); i++) { Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:150:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST_ALPHA(add); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:74:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST_ALPHA' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:150:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST_ALPHA(add); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:84:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST_ALPHA' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:151:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST_ALPHA(sub); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:74:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST_ALPHA' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:151:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST_ALPHA(sub); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:84:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST_ALPHA' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:158:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(add); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:31:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:158:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(add); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:40:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:159:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(sub); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:31:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:159:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(sub); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:40:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:160:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(mul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:31:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:160:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(mul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:40:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:161:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(div); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:31:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:161:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_SCALARLIST(div); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:40:21: note: expanded from macro 'FOREACH_BINARY_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < tensors.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:163:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST(mul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:53:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:163:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST(mul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:63:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:164:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST(div); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:53:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:164:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_BINARY_OP_LIST(div); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:63:21: note: expanded from macro 'FOREACH_BINARY_OP_LIST' Jan 13 16:46:28 for (int i = 0; i < tensors1.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:195:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALAR(addcdiv); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:115:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALAR' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:195:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALAR(addcdiv); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:125:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALAR' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:196:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALAR(addcmul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:115:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALAR' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:196:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALAR(addcmul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:125:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALAR' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:198:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALARLIST(addcdiv); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:135:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:198:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALARLIST(addcdiv); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:145:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:199:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALARLIST(addcmul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:135:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { \ Jan 13 16:46:28 ~ ^ ~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:199:1: warning: comparison of integers of different signs: 'int' and 'size_t' (aka 'unsigned long') [-Wsign-compare] Jan 13 16:46:28 FOREACH_POINTWISE_OP_SCALARLIST(addcmul); Jan 13 16:46:28 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Jan 13 16:46:28 ../aten/src/ATen/native/ForeachOpsKernels.cpp:145:21: note: expanded from macro 'FOREACH_POINTWISE_OP_SCALARLIST' Jan 13 16:46:28 for (int i = 0; i < input.size(); i++) { ``` this diff fixes that Test Plan: Sandcastle tests Reviewed By: xush6528 Differential Revision: D25935046 fbshipit-source-id: 9a042367410b3c1ffd27d9f957a623f1bae07d20	2021-02-02 16:13:03 -08:00
jiej	e488e3c443	Exposing linear layer to fuser (#50856 ) Summary: 1. enabling linear in autodiff; 2. remove control flow in python for linear; Pull Request resolved: https://github.com/pytorch/pytorch/pull/50856 Reviewed By: pbelevich Differential Revision: D26105797 Pulled By: eellison fbshipit-source-id: 6f7cedb9f6e3e46daa24223d2a6080880498deb4	2021-02-02 15:39:01 -08:00
Nikita Shulga	5499e839f1	[Fuser] Do not attempt to use OpenMP if build without OpenMP support (#51504 ) Summary: Clang from XCode does not support `-fopenmp` option, no need to try to compile with it. Infer whether OpenMP is supported by checking _OPENMP define. Also, use clang compiler if host app was compiled with clang rather than gcc. Fix few range loop warnings and add static_asserts that range loop variables are raw pointers. This changes makes fuser tests on OS X a bit faster. Before: ``` % python3 test_jit.py -v TestScript.test_batchnorm_fuser_cpu Fail to import hypothesis in common_utils, tests are not derandomized CUDA not available, skipping tests test_batchnorm_fuser_cpu (__main__.TestScript) ... clang: error: unsupported option '-fopenmp' clang: error: unsupported option '-fopenmp' warning: pytorch jit fuser failed to compile with openmp, trying without it... ok ---------------------------------------------------------------------- Ran 1 test in 0.468s OK ``` After: ``` % python3 test_jit.py -v TestScript.test_batchnorm_fuser_cpu Fail to import hypothesis in common_utils, tests are not derandomized CUDA not available, skipping tests test_batchnorm_fuser_cpu (__main__.TestScript) ... ok ---------------------------------------------------------------------- Ran 1 test in 0.435s OK ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/51504 Reviewed By: smessmer Differential Revision: D26186875 Pulled By: malfet fbshipit-source-id: 930b3bcf543fdfad0f493d687072aaaf5f9e2bfc	2021-02-02 15:31:59 -08:00
kshitij12345	38eb836387	[complex] Enable complex autograd and jit tests for `trace` (#51537 ) Summary: Fixes https://github.com/pytorch/pytorch/issues/50381 Now that `index_fill_` supports complex, we can enable complex support for `trace`. Pull Request resolved: https://github.com/pytorch/pytorch/pull/51537 Reviewed By: H-Huang Differential Revision: D26198904 Pulled By: anjali411 fbshipit-source-id: d62bb02549919fe35b0bac44f77af964ebd0e92e	2021-02-02 15:24:38 -08:00
James Reed	209e27eaff	[FX] Add note about more use cases of FX (#51576 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51576 Test Plan: Imported from OSS Reviewed By: ansley Differential Revision: D26203610 Pulled By: jamesr66a fbshipit-source-id: d33a3e7e0f3a959349ed0e29a1aba0592022606d	2021-02-02 14:57:48 -08:00
Jacob Szwejbka	37f1412965	[Pytorch Mobile] Preserved all functions generated by bundled inputs (#51496 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51496 A previous change added the possibility of more functions being generated when bundled inputs are attached. Want to preserve those here in optimize_for_mobile ghstack-source-id: 120862718 Test Plan: Created a dummy model. Augment several methods with bundled inputs. Call optimize for mobile. Verified the functions are still there. Discovered a weird interaction between freeze_module and bundled inputs. If the user does something like inputs =[<inputs>] augment_many_model_functions_with_bundled_inputs( model, inputs={ model.forward : inputs, model.foo : inputs, } ) to attach their bundled inputs, freeze_module within optimize_for_mobile will error out. Instead the user would need to do something like inputs =[<inputs>] inputs2 =[<inputs>] # Nominally the same as the the inputs above augment_many_model_functions_with_bundled_inputs( model, inputs={ model.forward : inputs, model.foo : inputs2, } ) Reviewed By: dhruvbird Differential Revision: D26005708 fbshipit-source-id: 3e908c0f7092a57da9039fbc395aee6bf9dd2b20	2021-02-02 14:57:44 -08:00
Iurii Zdebskyi	cce84b5ca5	[WIP] Update foreach APIs to use scalar lists (#48223 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/48223 Test Plan: Imported from OSS Reviewed By: zou3519 Differential Revision: D25074763 Pulled By: izdeby fbshipit-source-id: 155e3d2073a20d16bdbe358820170bf53f93c7a5	2021-02-02 14:54:28 -08:00
Jagadish Krishnamoorthy	506fdf9abf	[ROCm] disable tests for ROCm 4.0.1 (#51510 ) Summary: These tests are failing for ROCm 4.0/4.0.1 release. Disable the tests until they are fixed. - TestCuda.test_cudnn_multiple_threads_same_device - TestCudaFuser.test_reduction Pull Request resolved: https://github.com/pytorch/pytorch/pull/51510 Reviewed By: H-Huang Differential Revision: D26205179 Pulled By: seemethere fbshipit-source-id: 0c3d29989d711deab8b5046b458c772a1543d8ed	2021-02-02 14:39:08 -08:00
Benjamin Lefaudeux	bbe18e3527	[ZeroRedundancyOptimizer] Elastic and pytorch compatible checkpoints (#50956 ) Summary: - Makes it possible to use non-sharded optimizer checkpoints (as long as the model/param groups are the same, of course) - Makes it possible to save with a given world size, and load with another world size - Use Torch Distributed built-in broadcast object list instead of a ad-hoc version Pull Request resolved: https://github.com/pytorch/pytorch/pull/50956 Reviewed By: malfet Differential Revision: D26113953 Pulled By: blefaudeux fbshipit-source-id: 030bfeee2c34c2d987590d45dc8efe05515f2e5c	2021-02-02 14:32:13 -08:00
Max Balandat	a990ff7001	[SobolEngine] Fix edge case of dtype of first sample (#51578 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51578 https://github.com/pytorch/pytorch/pull/49710 introduced an edge case in which drawing a single sample resulted in ignoring the `dtype` arg to `draw`. This fixes this and adds a unit test to cover this behavior. Test Plan: Unit tests Reviewed By: danielrjiang Differential Revision: D26204393 fbshipit-source-id: 441a44dc035002e7bbe6b662bf6d1af0e2cd88f4	2021-02-02 14:24:56 -08:00
Ivan Yashchuk	4746b3d1fb	Added missing VSX dispatch for cholesky_inverse (#51562 ) Summary: It was overlooked that vsx dispatch is also needed for cholesky_inverse cpu dispatch. See https://github.com/pytorch/pytorch/pull/50269#issuecomment-771688180 Pull Request resolved: https://github.com/pytorch/pytorch/pull/51562 Reviewed By: H-Huang Differential Revision: D26199581 Pulled By: anjali411 fbshipit-source-id: 5d02c6da52ce1d2e9e26001f5d4648a71dd0e829	2021-02-02 13:45:35 -08:00
Ashkan Aliabadi	2565a33c98	[Vulkan] Remove redundant qualifiers on writeonly images. (#51425 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51425 Test Plan: Imported from OSS Reviewed By: SS-JIA Differential Revision: D26179605 Pulled By: AshkanAliabadi fbshipit-source-id: 26358cd4fd23922fed21120e120774eea0b728df	2021-02-02 13:37:59 -08:00
Ashkan Aliabadi	0402df5427	[Vulkan] Improve error handling in a few places. (#51423 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/51423 Test Plan: Imported from OSS Reviewed By: SS-JIA Differential Revision: D26179604 Pulled By: AshkanAliabadi fbshipit-source-id: 2e270423bf7e960e9303b17e0ca1a1530b760ad3	2021-02-02 13:34:43 -08:00

1 2 3 4 5 ...

33381 Commits