pytorch/test/distributed
Ke Wen c5589074e6 [SymmMem] find_path does not search /usr/local/lib (#157695)
This PR uses `find_library` to replace `find_path`.
It also searches for NVSHMEM host lib and device lib separately.

Tested against system install location: /usr/local/lib and /usr/local/include.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/157695
Approved by: https://github.com/Skylion007
ghstack dependencies: #157513
2025-07-08 01:21:59 +00:00
..
_composable [FSDP2] Fix issue with set_reduce_scatter_divide_factor errors and MixedPrecisionPolicy (#155964) 2025-07-07 17:09:29 +00:00
_shard add support for 0 size shardedTensor and recalculate metadata from all_gather (#152583) 2025-05-03 17:26:29 +00:00
_tools Support XPU in memory tracker (#150703) 2025-06-12 21:33:52 +00:00
algorithms [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
bin
checkpoint [oss] Add version to metadata (#155343) 2025-07-07 20:57:30 +00:00
elastic [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
flight_recorder [fr] Fix one error in analysis script when subPG world size is smaller than global size (#156156) 2025-06-17 21:17:58 +00:00
fsdp Fix: Ensure writeback handles NO_SHARD correctly by flattening tensors before copying (#154369) 2025-07-06 09:20:31 +00:00
launcher [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
nn/jit
optim [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
pipelining [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
rpc [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
tensor Fix index_put propagate strategy arg unpack error (#157671) 2025-07-07 20:18:18 +00:00
_test_template.py [Distributed][CI] Rework continuous TestCase (#153653) 2025-05-25 03:49:29 +00:00
argparse_util_test.py
test_backends.py
test_c10d_common.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_c10d_functional_native.py [cpp wrapper] add AOTI shim for collective ops (#154492) 2025-06-25 01:20:05 +00:00
test_c10d_gloo.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_c10d_logger.py add device generalisation support for distributed tests (#152471) 2025-06-20 07:35:42 +00:00
test_c10d_nccl.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_c10d_object_collectives.py Update test_c10d_object_collectives.py with DistributedTestBase class (#145056) 2025-02-13 03:57:59 +00:00
test_c10d_ops_nccl.py [FSDP2] Fix issue with set_reduce_scatter_divide_factor errors and MixedPrecisionPolicy (#155964) 2025-07-07 17:09:29 +00:00
test_c10d_pypg.py c10d/ProcessGroup: cleanup abort and shutdown (#148798) 2025-03-08 18:33:18 +00:00
test_c10d_spawn_gloo.py
test_c10d_spawn_nccl.py
test_c10d_spawn_ucc.py
test_c10d_spawn.py Add __main__ guards to distributed tests (#154628) 2025-06-04 14:39:57 +00:00
test_c10d_ucc.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_collective_utils.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_composability.py [Distributed][CI] Rework continuous TestCase (#153653) 2025-05-25 03:49:29 +00:00
test_compute_comm_reordering.py add device generalisation support for distributed tests (#152471) 2025-06-20 07:35:42 +00:00
test_control_collectives.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_data_parallel.py
test_device_mesh.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_distributed_spawn.py Remove NO_MULTIPROCESSING_SPAWN checks (#146705) 2025-02-28 05:53:19 +00:00
test_dynamo_distributed.py Revert "[dynamo][fsdp] Consistent behavior of int attributes (#157262)" 2025-07-02 08:30:39 +00:00
test_fake_pg.py add device generalisation support for distributed tests (#152471) 2025-06-20 07:35:42 +00:00
test_functional_api.py [inductor][dynamo] Include operator name in size/stride/alignment assertion (#152353) 2025-06-03 19:21:15 +00:00
test_inductor_collectives.py [inductor_collectives] Make reorder_collectives_preserve_peak pass grouping nodes (#157706) 2025-07-07 23:13:58 +00:00
test_launcher.py
test_multi_threaded_pg.py
test_nccl.py [SymmMem] Fix CI name mismatch; remove TORCH_SYMMMEM requirement (#157597) 2025-07-04 01:43:08 +00:00
test_nvshmem_triton.py Revert "[SymmMem] find_path does not search /usr/local/lib (#157695)" 2025-07-08 01:12:01 +00:00
test_nvshmem.py [SymmMem] find_path does not search /usr/local/lib (#157695) 2025-07-08 01:21:59 +00:00
test_pg_wrapper.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_serialization.py distributed/serialization: add experimental streaming torch.save/load methods (#146555) 2025-02-07 18:08:11 +00:00
test_store.py [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555) 2025-06-24 04:53:54 +00:00
test_symmetric_memory.py [SymmMem] Allow selection of allocation backend (#156661) 2025-06-26 21:37:44 +00:00