Summary:
Currently, we define some C++ functions in one C++ Python extension
which are used by another. This happens to work, but isn't guaranteed to.
This diff moves these functions to a separate C++ library rule to fix this.
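A minimal sketch of the intended layout, using open-source Buck rule names (`cxx_library`, `cxx_python_extension`); the target and file names here are hypothetical:
```
# Hypothetical targets: the shared C++ code gets its own library rule,
# and both extensions depend on it instead of one extension relying on
# symbols that happen to be exported by the other.
cxx_library(
    name = "shared_helpers",
    srcs = ["shared_helpers.cpp"],
    exported_headers = ["shared_helpers.h"],
)

cxx_python_extension(
    name = "ext_a",
    srcs = ["ext_a.cpp"],
    deps = [":shared_helpers"],
)

cxx_python_extension(
    name = "ext_b",
    srcs = ["ext_b.cpp"],
    deps = [":shared_helpers"],
)
```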
Test Plan: CI
Differential Revision: D42552515
Pull Request resolved: https://github.com/pytorch/pytorch/pull/92325
Approved by: https://github.com/kit1980, https://github.com/Skylion007
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65610
- Replace HIP_PLATFORM_HCC with USE_ROCM.
- Don't rely on CUDA_VERSION or HIP_VERSION; use USE_ROCM and ROCM_VERSION instead.
- In the next PR:
  - Remove the mapping from CUDA_VERSION to HIP_VERSION, and from CUDA to HIP, in hipify (see the sketch below).
  - HIP_PLATFORM_HCC is deprecated, so add HIP_PLATFORM_AMD to support HIP host code compilation on gcc.
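For context, hipify is the Python tool that rewrites CUDA sources for ROCm via substitution tables. A toy sketch of the idea; the real tables live under torch/utils/hipify, and these entries are illustrative only:
```
# Toy illustration of hipify-style substitution; not the actual tables.
# The CUDA_VERSION -> HIP_VERSION entry is the kind of mapping the
# follow-up PR removes.
CUDA_TO_HIP = {
    "CUDA_VERSION": "HIP_VERSION",
    "cudaMalloc": "hipMalloc",
    "cudaFree": "hipFree",
}

def hipify(source: str) -> str:
    for cuda_token, hip_token in CUDA_TO_HIP.items():
        source = source.replace(cuda_token, hip_token)
    return source

print(hipify("cudaMalloc(&ptr, n);"))  # -> hipMalloc(&ptr, n);
```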
cc jeffdaily sunway513 jithunnair-amd ROCmSupport amathews-amd
Reviewed By: jbschlosser
Differential Revision: D30909053
Pulled By: ezyang
fbshipit-source-id: 224a966ebf1aaec79beccbbd686fdf3d49267e06
Summary:
Add `-Wno-writable-strings` (which is clang's flavor of `-Wwrite-strings`) to the list of warnings ignored while compiling torch_python.
Avoid unnecessary copies in range loops.
Fix a number of signed-unsigned comparisons.
Found while building locally on M1.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62930
Reviewed By: albanD
Differential Revision: D30171981
Pulled By: malfet
fbshipit-source-id: 25bd43dab5675f927ca707e32737ed178b04651e
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62059
GetOperatorCost in Workspace exposes only flops and bytes_written. Make an additional piece, bytes_read, available from OperatorSchema::Cost.
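A hedged sketch of querying the extended cost from Python; the field names follow the summary above, and the exact return type is whatever `caffe2.python.workspace.GetOperatorCost` yields:
```
import numpy as np
from caffe2.python import core, workspace

# Feed the inputs so cost inference has concrete shapes to work with.
workspace.FeedBlob("X", np.random.rand(4, 8).astype(np.float32))
workspace.FeedBlob("W", np.random.rand(16, 8).astype(np.float32))
workspace.FeedBlob("b", np.random.rand(16).astype(np.float32))

op = core.CreateOperator("FC", ["X", "W", "b"], ["Y"])
cost = workspace.GetOperatorCost(op, ["X", "W", "b"])
# bytes_read is the newly exposed piece alongside flops and bytes_written.
print(cost.flops, cost.bytes_written, cost.bytes_read)
```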
Test Plan:
Added the two additional pieces in the unit test testGetOperatorCost in workspace_test
buck test caffe2/caffe2/python:workspace_test -- testGetOperatorCost
buck test //aml/ml_foundation/exp_platform/large_scale_training/distributed_hogwild/auto_device_placement/tests/...
buck test //aiplatform/training/autotuning/tests/...
buck test //aiplatform/training/pipelining/tests/...
buck test //deeplearning/fblsim/tests/...
Flow tests:
ADP Greedy: f288078287
ADP MILP: f288079278
Reviewed By: CrazySherman, xtaofb
Differential Revision: D29860676
fbshipit-source-id: 8b3a9f2bf17c0dae48cfe2800e8821bf441e0b03
Summary:
The `cppcoreguidelines-avoid-non-const-global-variables` check is disabled, as GoogleTest's `TEST` macro is non-compliant with it, as is `DEFINE_DISPATCH`.
All changes but the ones to `.clang-tidy` are generated using following script:
```
for i in $(find . -type f \( -iname "*.c*" -o -iname "*.h" \) | \
           xargs grep cppcoreguidelines-avoid-non-const-global-variables | \
           cut -f1 -d: | sort | uniq); do
  sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" "$i"
done
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008
Reviewed By: driazati, r-barnes
Differential Revision: D29838584
Pulled By: malfet
fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13
Summary:
This is an automatic change generated by the following script:
```
#!/usr/bin/env python3
from subprocess import check_output, check_call
import os


def get_compiled_files_list():
    import json
    with open("build/compile_commands.json") as f:
        data = json.load(f)
    files = [os.path.relpath(node['file']) for node in data]
    for idx, fname in enumerate(files):
        if fname.startswith('build/') and fname.endswith('.DEFAULT.cpp'):
            files[idx] = fname[len('build/'):-len('.DEFAULT.cpp')]
    return files


def run_clang_tidy(fname):
    check_call(["python3", "tools/clang_tidy.py", "-c", "build", "-x", fname, "-s"])
    changes = check_output(["git", "ls-files", "-m"])
    if len(changes) == 0:
        return
    check_call(["git", "commit", "--all", "-m", f"NOLINT stubs for {fname}"])


def main():
    git_files = check_output(["git", "ls-files"]).decode("ascii").split("\n")
    compiled_files = get_compiled_files_list()
    for idx, fname in enumerate(git_files):
        if fname not in compiled_files:
            continue
        if fname.startswith("caffe2/contrib/aten/"):
            continue
        print(f"[{idx}/{len(git_files)}] Processing {fname}")
        run_clang_tidy(fname)


if __name__ == "__main__":
    main()
```
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56892
Reviewed By: H-Huang
Differential Revision: D27991944
Pulled By: malfet
fbshipit-source-id: 5415e1eb2c1b34319a4f03024bfaa087007d7179
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/56717
The signal_handler was under the caffe2 namespace but was being used
by PyTorch as well.
I've fixed this by moving it to the c10 namespace, where both C2 and PyTorch
can use it.
The signal_handler interface in caffe2/utils/signal_handler.h is kept the same
for backward compatibility with C2, but most of the common code is moved to c10.
ghstack-source-id: 127446929
Test Plan: waitforbuildbot
Reviewed By: ezyang
Differential Revision: D27946738
fbshipit-source-id: d6228d1a0108f4c807d405e7a0bb799c5375388f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/55003
Using the `caffe2::setPrintStackTracesOnFatalSignal` utility in
distributed tests to set a signal handler that dumps the state of all threads
for all processes when it receives a FATAL signal. This would help in debugging
tests further.
I had to revert all the Python faulthandler code, since only one signal handler
function is supported: running Python's faulthandler together with
`setPrintStackTracesOnFatalSignal` doesn't work.
Sample output:
```
SIGSEGV(11), PID: 3492872, Thread 3492872:
[0] ???(0x7fa7b2d1d61b) in libcaffe2_caffe2_caffe2_cpu.so
[1] ???(0x7fa7b2d1d3fb) in libcaffe2_caffe2_caffe2_cpu.so
[2] ???(0x7fa7b2d1d33d) in libcaffe2_caffe2_caffe2_cpu.so
[3] ???(0x7fa7b2d1d167) in libcaffe2_caffe2_caffe2_cpu.so
[4] ???(0x7fa7ce683150) in libpthread.so.0
[5] ???(0x7fa7be2b233c) in libcaffe2__C_impl_cuda.so
[6] ???(0x7fa7be2ce80c) in libcaffe2__C_impl_cuda.so
[7] ???(0x7fa7be2a0512) in libcaffe2__C_impl_cuda.so
[8] torch::distributed::rpc::TensorPipeAgent::send(torch::distributed::rpc::WorkerInfo const&, torch::distributed::rpc::Message&&, float, std::unordered_map<signed char, signed char, std::hash<signed char>, std::equal_to<signed char>, std::allocator<std::pair<signed char const, signed char> > > const&)+0x24f(0x7fa7be29f71f) in libcaffe2__C_impl_cuda.so
[9] torch::distributed::autograd::sendMessageWithAutograd(torch::distributed::rpc::RpcAgent&, torch::distributed::rpc::WorkerInfo const&, torch::distributed::rpc::Message&&, bool, float, bool)+0x393(0x7fa7b602b203) in libcaffe2_libtorch.so
[10] torch::distributed::rpc::pyRpcPythonUdf(torch::distributed::rpc::WorkerInfo const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&, std::vector<at::Tensor, std::allocator<at::Tensor> >&, float, bool)+0x201(0x7fa7bd844971) in libcaffe2__C_impl_cuda.so
```
ghstack-source-id: 125630551
Test Plan: waitforbuildbot
Reviewed By: SciPioneer
Differential Revision: D27419714
fbshipit-source-id: 8aca9a14ef688004053d8798124d9c3a3fbe3489
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/54274
Some of the Python tests need to be aware of whether or not FBGEMM is
available, so expose this setting in the pybind extension.
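A hedged sketch of the intended use in tests; the attribute name below is an assumption, not necessarily the actual binding:
```
import unittest
from caffe2.python import workspace

class FbgemmDependentTest(unittest.TestCase):
    # "has_fbgemm" is a hypothetical name for the exposed setting.
    @unittest.skipIf(not getattr(workspace, "has_fbgemm", False),
                     "FBGEMM not available in this build")
    def test_fbgemm_path(self):
        pass  # exercise FBGEMM-specific behavior here
```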
ghstack-source-id: 124317732
Test Plan: Will use this variable in the tests on D26658205.
Reviewed By: mraway
Differential Revision: D27171780
fbshipit-source-id: 4c94144a959bf8bf0e1553b6e029e94a91794e29
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/52388
Pull Request resolved: https://github.com/pytorch/glow/pull/5364
This allows us to change global variables through onnxifi calls, and adds Python bindings along with it. Note that we supply a dummy backend_id, as it's not needed by Glow since the setting is global.
#codemod
Test Plan:
```
buck test mode/dev //glow/fb/test:test_onnxifi_optionnnpi
```
Reviewed By: jfix71, khabinov
Differential Revision: D26481652
fbshipit-source-id: 19b8201c77f653cf7d93ad68760aa7fb5ec45ff4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/44043
To invoke `cancel` from the net instance in Python, we expose it through pybind state.
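A hedged sketch of what this enables from Python; the exact access pattern through the pybind workspace is an assumption:
```
from caffe2.python import core, workspace

net = core.Net("example")
workspace.CreateNet(net)
# Hypothetical access pattern: fetch the pybind net instance by name and
# invoke the newly exposed cancel() on it.
workspace.C.Workspace.current.nets[net.Name()].cancel()
```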
Reviewed By: dzhulgakov
Differential Revision: D23249660
fbshipit-source-id: 45a1e9062dca811746fcf2e5e42199da8f76bb54
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42927
Added fp16 fusion to net transforms.
Refactored the transforms as well as glow_transform out of opt/custom so that the OSS builds pass.
Test Plan: added net runner tests for this
Reviewed By: yinghai
Differential Revision: D23080881
fbshipit-source-id: ee6451811fedfd07c6560c178229854bca29301f
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42763
Add the fp16 fusions as net transforms:
- layernorm fused with mul+add
- swish int8
Test Plan: added unit test, ran flows
Reviewed By: yinghai
Differential Revision: D23002043
fbshipit-source-id: f0b13d51d68c240b05d2a237a7fb8273e996328b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42249
Main change is to bring Caffe2's superior error messages for cuda initialization into c10 and use them in all code paths.
Basic logic:
| Case | Call to device_count() | init_cuda, e.g. allocating tensor |
| -- | -- | -- |
| all good | non-zero | just works |
| no gpus | 0, no warning | throw exception with good message |
| driver issues | 0, produce warning | throw exception with good message |
| out of memory with ASAN | 0, produce warning| throw exception with ASAN message |
Previously, the error thrown from init_cuda was very generic and the ASAN warning (if any) was buried in the logs.
Other clean up changes:
* always cache device_count() in a static variable
* move all ASAN macros into c10
Test Plan:
Hard to unittest because of build modes. Verified manually that the behavior from the table above holds by running the following script in different modes (ASAN/no-ASAN, CUDA_VISIBLE_DEVICES=):
```
print('before import')
import torch
print('after import')
print('devices: ', torch.cuda.device_count())
x = torch.tensor([1,2,3])
print('tensor creation')
x = x.cuda()
print('moved to cuda')
```
Reviewed By: ngimel
Differential Revision: D22824329
fbshipit-source-id: 5314007313a3897fc955b02f8b21b661ae35fdf5
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42516
As titled. We need it for some scripts.
Reviewed By: houseroad
Differential Revision: D22918112
fbshipit-source-id: 8a1696ceeeda67a34114bc57cb52c925711cfb4c
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/42421
Previously, we could only feed shape info with float dtype and batch-based dim type when doing onnxifi from Python. This diff removes that limitation by using the TensorBoundShapes protobuf as a generic shape info struct, making the onnxifi interface in Python more flexible.
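A hedged sketch of building the generic shape info; the message and field names follow caffe2.proto's TensorBoundShape(s) messages as best recalled and should be treated as assumptions:
```
from caffe2.proto import caffe2_pb2

# One TensorBoundShapes proto can now carry per-blob dtype and dim-type
# info instead of the old float-only, batch-only hints.
tbs = caffe2_pb2.TensorBoundShapes()
entry = tbs.shapes.add()
entry.name = "X"
entry.shape.dims.extend([32, 16])
entry.shape.data_type = caffe2_pb2.TensorProto.INT32  # no longer float-only
entry.dim_type.extend([
    caffe2_pb2.TensorBoundShape.BATCH,     # first dim is the batch dim
    caffe2_pb2.TensorBoundShape.CONSTANT,  # second dim is fixed
])
```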
Reviewed By: ChunliF
Differential Revision: D22889781
fbshipit-source-id: 1a89f3a68c215a0409738c425b4e0d0617d58245
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/40081
Add the ability to time out an OnnxifiOp run, so that if the backend hangs, the op can error out quickly.
Test Plan:
```
buck test glow/fb/test:test_onnxifinnpi -- test_timeout
```
Reviewed By: jackm321
Differential Revision: D22064533
fbshipit-source-id: 25487287c10ab217eb95692f09d48e13e19436ab
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/36741
Create a child workspace that shares the parent workspace's blobs. Register the child workspace in the registrar to enable switching into it and feeding blobs to it alone.
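A hedged sketch of the resulting flow from Python; creating and registering the child happens on the C++ side in this diff, so assume a child named "child" is already registered:
```
import numpy as np
from caffe2.python import workspace

workspace.FeedBlob("weights", np.ones(4, dtype=np.float32))  # parent blob

# Switch into the registered child workspace and feed to it alone.
workspace.SwitchWorkspace("child")
workspace.FeedBlob("act", np.zeros(4, dtype=np.float32))  # child-local
print(workspace.HasBlob("weights"))  # True: shared from the parent
workspace.SwitchWorkspace("default")
```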
Test Plan: numeric suite unit tests in stacked diff
Reviewed By: hx89
Differential Revision: D21055567
fbshipit-source-id: 374b12aef75a4c58452c271f8961ee156ce6c559
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/35555
As titled, so that we can lower the SparseLengthsSum* part of SparseLengthsSum*Sparse. We update the tying policy between Gather and SparseLengthsWeightSum* so that we don't bother lowering a single Gather into the backend, which is inefficient to execute on card and creates bubbles between contiguous lowered graphs.
Test Plan:
```
buck test glow/fb/test:test_onnxifinnpi
```
Reviewed By: ipiszy
Differential Revision: D20688525
fbshipit-source-id: cb8e38239057ff13a8d385ed09d0d019421de78b
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/34515
Once upon a time we thought this was necessary. In reality it is not, so
removing it.
For backcompat, our public interface (defined in `api/`) still has
typedefs to the old `script::` names.
There was only one collision: `Pass` as a `Stmt` and `Pass` as a graph
transform. I renamed one of them.
Test Plan: Imported from OSS
Differential Revision: D20353503
Pulled By: suo
fbshipit-source-id: 48bb911ce75120a8c9e0c6fb65262ef775dfba93
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/30802
Change shape_hints from map<string, TensorShape> to ShapeInfoMap to capture dimType info from the model file.
Reviewed By: ipiszy
Differential Revision: D18821486
fbshipit-source-id: c5d9ed72e158d3698aba38900aeda00f776745b4
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/23848
Problem:
In an experiment running feed model 127607201 (/mnt/public/tracelog/feed_repro2/127607201_0.predictor), we encountered a blob dimensionality mismatch error when running the onnxified net. This is because the model initializes input blobs in the current workspace with blob size 0, and onnxifi() falsely identifies those input blobs as weight blobs and assigns them the wrong dimensions.
Solution:
Add an option to pass the correct weight blob names to onnxifi() instead of using all blobs in the current workspace (see the sketch below).
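A hedged sketch of the new option from Python; the keyword name and net setup here are assumptions:
```
from caffe2.python import core
from caffe2.python.onnx.onnxifi import onnxifi_caffe2_net

pred_net = core.Net("pred").Proto()  # stand-in predict net
shape_hints = {"X": [1, 16]}         # blob name -> shape hints

# Pass the real weight blob names explicitly instead of letting onnxifi()
# treat every blob in the current workspace as a weight.
pred_net_onnxified = onnxifi_caffe2_net(
    pred_net,
    shape_hints,
    weight_names=["fc_w", "fc_b"],   # assumed keyword for the new option
)
```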
Reviewed By: yinghai
Differential Revision: D16661396
fbshipit-source-id: cabe44db6b64e6538bef4b65e380312214b3ba9f
Summary:
As part of the Variable/Tensor merge, we want to be able to pass Variables into Caffe2 without doing extra shallow copy, to improve performance and also allow for in-place mutations in Caffe2 ops. There are a few approaches outlined in https://github.com/pytorch/pytorch/pull/22418, and this PR is the chosen approach.
Specifically, we can have the assumption that we won't be connecting autograd to C2 gradients at any point (as it's too tricky and not that useful). Therefore, we can pass Variable into Caffe2 ops by requiring that all Variables in Caffe2 don't require grad. For code paths in Caffe2 that might potentially track gradients (e.g. `ScriptModuleOp` and `call_caffe2_op_from_c10`), we use the `torch::NoGradGuard` to make sure gradients are not tracked.
This supersedes https://github.com/pytorch/pytorch/pull/22418.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/22473
Differential Revision: D16099042
Pulled By: yf225
fbshipit-source-id: 57efc3c7cfb3048d9abe90e63759acc14ebd2972
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/21718
Add a detection method for whether the package is built for AMD.
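A hedged sketch of the check from Python; whether this exact attribute is what the PR adds is an assumption, though it is the conventional way to detect a ROCm build:
```
import torch

# torch.version.hip is None on CUDA/CPU builds and a version string on
# ROCm builds (assumed here to be the detection this PR introduces).
is_rocm = torch.version.hip is not None
print("built for AMD/ROCm" if is_rocm else "not a ROCm build")
```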
Reviewed By: bddppq
Differential Revision: D15795893
fbshipit-source-id: 91a21ee76b2273b1032507bdebe57e016717181d