pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
cyy	09291817b2	Fix extra semicolon warning (#148291 ) Fixes #ISSUE_NUMBER Pull Request resolved: https://github.com/pytorch/pytorch/pull/148291 Approved by: https://github.com/Skylion007	2025-03-03 18:51:44 +00:00
cyy	a05b64a38f	[5/N] Fix extra warnings brought by clang-tidy-17 (#138403 ) Follows #137983 Pull Request resolved: https://github.com/pytorch/pytorch/pull/138403 Approved by: https://github.com/ezyang	2024-10-21 02:59:54 +00:00
Valentin Andrei	b139b5090f	[pytorch] Name threads in thread pools for better debugging (#130270 ) Threads inside the thread pools are not named, so they inherit the main process name or the name of the first thread. In our case if we set `pt_main_thread` as the thread name when a thread does `import torch`, this name will be inherited by all the threads in the created pools. This PR names the threads in the pools I was able to find. There are other pools created, like OpenMP ones and we need to follow-up on those. Pull Request resolved: https://github.com/pytorch/pytorch/pull/130270 Approved by: https://github.com/d4l3k, https://github.com/albanD	2024-07-09 08:03:47 +00:00
Nikita Shulga	194950c0ca	Default TreadPool size to number of physical cores (#125963 ) TODO: Some benchmarks Pull Request resolved: https://github.com/pytorch/pytorch/pull/125963 Approved by: https://github.com/janeyx99, https://github.com/Skylion007, https://github.com/gajjanag, https://github.com/jgong5	2024-05-24 16:06:48 +00:00
Nikita Shulga	05d949279c	[C10] cpuinfo error handling (#113771 ) If `cpuinfo_initalize` returns false, call to subsequent cpuinfo functions may result in `abort()` Also, `defaultNumThreads()` method is assumption if one method fails then try another, and finally return 1. Alas there are no good way to test it on x86 platform, but on ARM one can replicate it by running `sudo chmod 750 /sys` and then `python3 -c "import torch;torch._C.profiler.gather_traceback(True, True, True)"` <!-- copilot:poem --> ### <samp>🤖 Generated by Copilot at 4d942e8</samp> > _`cpuinfo` fails_ > _avoid undefined behavior_ > _check before you count_ Partially addresses https://github.com/pytorch/pytorch/issues/113568 Pull Request resolved: https://github.com/pytorch/pytorch/pull/113771 Approved by: https://github.com/atalman	2023-11-15 19:49:34 +00:00
vinithakv	36e6b0cfa2	Fix cpuinfo related crash on ppc64 (#110708 ) The "import torch" crashes with following cpuinfo error on powerpc64. ============================================================== >>> import torch Error in cpuinfo: processor architecture is not supported in cpuinfo Fatal error in cpuinfo: cpuinfo_get_processors_count called before cpuinfo is initialized Aborted (core dumped) ================================================================== The patch fixes this by excluding powerpc from using cpuinfo as it is not supported for ppc64. Pull Request resolved: https://github.com/pytorch/pytorch/pull/110708 Approved by: https://github.com/ezyang	2023-10-08 13:31:54 +00:00
Aleksei Nikiforov	b91ba226ce	Don't use cpuinfo on s390x (#109496 ) It doesn't support s390x and just crashes pytorch on init. Pull Request resolved: https://github.com/pytorch/pytorch/pull/109496 Approved by: https://github.com/huydhn	2023-09-21 12:20:49 +00:00
cyy	8cb96f5f2c	[Reland]Use cpuinfo to determine c10::ThreadPool thread number (#107339 ) Relands PR #107010 and fixes BUCK builds. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107339 Approved by: https://github.com/ezyang	2023-09-14 23:44:23 +00:00
PyTorch MergeBot	6c0bba3daf	Revert "Use cpuinfo to determine c10::ThreadPool thread number (#107010 )" This reverts commit `ad0476540d`. Reverted https://github.com/pytorch/pytorch/pull/107010 on behalf of https://github.com/izaitsevfb due to Breaks internal meta builds ([comment](https://github.com/pytorch/pytorch/pull/107010#issuecomment-1679866821))	2023-08-16 02:20:31 +00:00
cyy	ad0476540d	Use cpuinfo to determine c10::ThreadPool thread number (#107010 ) This PR prefers "logical processor number" (the cpu cores shown in htop) returned by cpuinfo for determining c10 thread number. If that fails, it uses hardware_concurrency exactly. The motivation is that in a x86 host with 64 cores and Hyper-Threading disabled, the current behavior uses 32 threads, resulting half of cores being idle. Pull Request resolved: https://github.com/pytorch/pytorch/pull/107010 Approved by: https://github.com/ezyang	2023-08-15 17:26:24 +00:00
cyy	646fa36875	Add const reference in opportunities detected by clang-tidy (#105931 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/105931 Approved by: https://github.com/Skylion007	2023-07-26 21:38:10 +00:00
Benson Ma	66a2600b6a	[T153220354] Fix header inclusions in c10 (#1541 ) (#101846 ) Summary: This is a re-attempt to land the iwyu header changes, by taking the diff from [PR 100304](https://github.com/pytorch/pytorch/pull/100304), and adding the bare minimal changes to make the diff build corectly in the internal builds. X-link: https://github.com/facebookresearch/pytorch3d/pull/1541 X-link: https://github.com/fairinternal/pytorch3d/pull/44 - Re-work D45769819 to fix header inclusions in c10 Test Plan: ``` buck2 build --no-remote-cache mode/dev-nosan //caffe2/c10/... buck2 build --no-remote-cache mode/dev-nosan //deeplearning/fbgemm/fbgemm_gpu/... buck2 build mode/dev-nosan //vision/fair/pytorch3d/pytorch3d:_C ``` Reviewed By: malfet Differential Revision: D45920611 Pull Request resolved: https://github.com/pytorch/pytorch/pull/101846 Approved by: https://github.com/malfet, https://github.com/Skylion007	2023-05-20 19:35:14 +00:00
PyTorch MergeBot	4eaaa08623	Revert "Fix header inclusions in c10 by iwyu (#100304 )" This reverts commit `6037ee8cc9`. Reverted https://github.com/pytorch/pytorch/pull/100304 on behalf of https://github.com/jeanschmidt due to Breaking meta internal builds and fbgemm builds ([comment](https://github.com/pytorch/pytorch/pull/100304#issuecomment-1543919257))	2023-05-11 12:37:35 +00:00
cyy	6037ee8cc9	Fix header inclusions in c10 by iwyu (#100304 ) This work introduces include-what-you-use support for c10 by a CMake option defaulting to off. We also remove some unused header inclusions and fix a trivial inclusion error. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100304 Approved by: https://github.com/ezyang	2023-05-11 05:19:42 +00:00
PyTorch MergeBot	3271413e74	Revert "Fix header inclusions in c10 by iwyu (#100304 )" This reverts commit `39ec5fa722`. Reverted https://github.com/pytorch/pytorch/pull/100304 on behalf of https://github.com/huydhn due to Sorry for reverting your PR, it is almost there but fails on Windows `39ec5fa722`, which is in unstable mode after https://github.com/pytorch/pytorch/pull/100548 ([comment](https://github.com/pytorch/pytorch/pull/100304#issuecomment-1542975714))	2023-05-11 00:37:32 +00:00
cyy	39ec5fa722	Fix header inclusions in c10 by iwyu (#100304 ) This work introduces include-what-you-use support for c10 by a CMake option defaulting to off. We also remove some unused header inclusions and fix a trivial inclusion error. Pull Request resolved: https://github.com/pytorch/pytorch/pull/100304 Approved by: https://github.com/ezyang	2023-05-10 15:42:43 +00:00
Aaron Gokaslan	0247ed27cc	Apply Clang-Tidy readability-container-size-empty (#93236 ) Not only is this change usually shorter and more readable, it also can yield better performance. size() is not always a constant time operation (such as on LinkedLists), but empty() always is. Pull Request resolved: https://github.com/pytorch/pytorch/pull/93236 Approved by: https://github.com/malfet	2023-01-29 23:28:19 +00:00
cyy	f172feae0d	More tidy fixes (#93069 ) Pull Request resolved: https://github.com/pytorch/pytorch/pull/93069 Approved by: https://github.com/Skylion007	2023-01-27 06:40:50 +00:00
Nikita Shulga	a9b0a921d5	Disable `avoid-non-const-global-variables` lint check (#62008 ) Summary: As GoogleTest `TEST` macro is non-compliant with it as well as `DEFINE_DISPATCH` All changes but the ones to `.clang-tidy` are generated using following script: ``` for i in `find . -type f -iname ".c" -or -iname "*.h"\|xargs grep cppcoreguidelines-avoid-non-const-global-variables\|cut -f1 -d:\|sort\|uniq`; do sed -i "/\/\/ NOLINTNEXTLINE(cppcoreguidelines-avoid-non-const-global-variables)/d" $i; done ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/62008 Reviewed By: driazati, r-barnes Differential Revision: D29838584 Pulled By: malfet fbshipit-source-id: 1b2f8602c945bd4ce50a9bfdd204755556e31d13	2021-07-22 18:04:40 -07:00
Pritam Damania	12bb1e86ed	Make c10::ThreadPool::available_ atomic. (#58457 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/58457 This variable had concurrent read/write access without any synchronization. The issue was caught and reported by TSAN. ghstack-source-id: 129311384 Test Plan: 1) Verify test locally. 2) waitforbuildbot. Reviewed By: ezyang Differential Revision: D28498116 fbshipit-source-id: 89af068467fed64c131d743504c0cecf3017d638	2021-05-24 17:29:44 -07:00
Scott Wolchok	44cc873fba	[PyTorch] Autoformat c10 (#56830 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56830 Opt into formatting on GitHub and format everything. This is a trial run before turning on formatting for more and eventually all of the codebase. Test Plan: CI Reviewed By: zertosh Differential Revision: D27979080 fbshipit-source-id: a80f0c48691c08ae8ca0af06377b87e6a2351151	2021-04-30 21:23:28 -07:00
Edward Yang	6c602eb099	Don't hold ThreadPool lock when destructing task (#56817 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/56817 Fix https://github.com/pytorch/pytorch/issues/56701 and https://github.com/pytorch/pytorch/issues/56786 Signed-off-by: Edward Z. Yang <ezyang@fb.com> Test Plan: Imported from OSS Reviewed By: pritamdamania87 Differential Revision: D27975642 Pulled By: ezyang fbshipit-source-id: b7f4a6c18a4fa65c38bacc7c46246f0865c95f86	2021-04-27 12:22:49 -07:00
Nikita Shulga	087049000b	Make c10 clang-tidy clean (#55870 ) Summary: This change was autogenerated by running: ``` % find c10 -iname "*.cpp" -exec python3 tools/clang_tidy.py -c build -x {} -s \; ``` Pull Request resolved: https://github.com/pytorch/pytorch/pull/55870 Reviewed By: janeyx99 Differential Revision: D27728617 Pulled By: malfet fbshipit-source-id: bede4d7f0c106d51394d1e9efddf01bf894421c5	2021-04-14 11:23:28 -07:00
Jeremy Lilley	468a9d448e	[aten] Pass std::function<> to thread_pool by value, instead of const ref. (#37681 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37681 By passing by value, we can std::move, and avoid unnecessarily copying args that are part of any std::function/lambda state (e.g. in the jit interpreter, there is a std::vector<> stack passed in the InterpreterContinuation) This makes the api also consistent with e.g. folly and best practices. Added a minor at::launch() benchmark to test/cpp/, the difference is mostly noticeable when copying the std::function<> internal args is non-trivial. Benchmarks pre/post (min over ~5 runs) NoData: 5.81 us -> 5.63 us (-3.2%) WithData(0): 6.67 us -> 5.88 us (-11.8%) WithData(4): 6.98 us -> 6.51 us (-6.7%) WithData(256): 9.44 us -> 7.89 (-16.5%) ghstack-source-id: 103322321 Test Plan: - perf: buck run mode/opt caffe2/test/cpp/api:parallel_benchmark pre/post - correctness buck test mode/dev-nosan caffe2/test/... Reviewed By: dzhulgakov Differential Revision: D21355148 fbshipit-source-id: 3567e730845106f1991091e4a892d093e00571c3	2020-05-05 08:41:38 -07:00
Ilia Cherniavskii	c206b4398d	Show errors from the tasks in the thread pool (#33938 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/33938 Making sure we don't silently ignore exceptions from the tasks in the thread pool Test Plan: python setup.py clean && python setup.py develop install Differential Revision: D20178603 Pulled By: ilia-cher fbshipit-source-id: 34971032205a1a53fb7419ed84ebb986f9e959ad	2020-03-02 14:49:52 -08:00
Ilia Cherniavskii	6b74856747	Fix init_thread calls in thread pool initialization (#20848 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20848 ghimport-source-id: e542858a198252838c1f3100dbfbe90fd3960f07 Differential Revision: D15466918 Pulled By: ilia-cher fbshipit-source-id: e75d38f51edd5b508c4ca28a292e4141e90f209f	2019-05-24 01:14:31 -07:00
Ilia Cherniavskii	409200df59	Move inter-op settings into ATen/Parallel (#20050 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/20050 ghimport-source-id: cc102bab8abf3e56c099245976786317ed63ea14 Differential Revision: D15248576 Pulled By: ilia-cher fbshipit-source-id: 55ddcb7af387ddfc68a42ac7167de07ea648e249	2019-05-17 03:12:02 -07:00
Ilia Cherniavskii	9f35185b56	Initialize intra-op threads in JIT thread pool (#19058 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19058 ghimport-source-id: 53e87df8d93459259854a17d4de3348e463622dc Differential Revision: D14849624 Pulled By: ilia-cher fbshipit-source-id: 5043a1d4330e38857c8e04c547526a3ba5b30fa9	2019-04-16 18:27:22 -07:00
Owen Anderson	e34abe03a8	Enable threadpool threads to greedily acquire new tasks if available. (#17808 ) Summary: This improves locality and affinity by keeping work on the same threads preferentially to starting work on new ones, and reduces contention on the threadpool lock more generally. Pull Request resolved: https://github.com/pytorch/pytorch/pull/17808 Differential Revision: D14391282 Pulled By: resistor fbshipit-source-id: 3aec81656a50460a725aa4187c61864295d4f46e	2019-03-12 18:05:55 -07:00
James Reed	1d26a3ae7e	Open registration for c10 thread pool (#17788 ) Summary: 1. Move ATen threadpool & open registration mechanism to C10 2. Move the `global_work_queue` to use this open registration mechanism, to allow users to substitute in their own Pull Request resolved: https://github.com/pytorch/pytorch/pull/17788 Reviewed By: zdevito Differential Revision: D14379707 Pulled By: jamesr66a fbshipit-source-id: 949662d0024875abf09907d97db927f160c54d45	2019-03-08 15:38:41 -08:00

30 Commits