Maggie Moss
|
c7eee49525
|
Fix pyrefly ignores 1/n (#166239)
First diff adjusting the syntax for pyrefly: ignore suppressions so they only hide one class of type error.
Test:
lintrunner
pyrefly check
Pull Request resolved: https://github.com/pytorch/pytorch/pull/166239
Approved by: https://github.com/oulgen
|
2025-10-26 00:44:10 +00:00 |
|
Yang Wang
|
f76f4abf3f
|
Track monitor (#156907)
Tracking gpu mem allocation, we were tracking the gpu bandwidth memory, the mem allocation is the one reflect wether the gpu is oom or not, upcoming ui fix.
UI fix: https://github.com/pytorch/test-infra/pull/6878/files
Pull Request resolved: https://github.com/pytorch/pytorch/pull/156907
Approved by: https://github.com/huydhn
|
2025-07-18 22:54:13 +00:00 |
|
Yang Wang
|
335c89c6f1
|
[Monitoring] enable local logs and add mac test monitoring (#153454)
Enable to run the upload utilzation logics using local pointer instead of reading from s3, this could be useful for rocm too,
Pull Request resolved: https://github.com/pytorch/pytorch/pull/153454
Approved by: https://github.com/huydhn
|
2025-05-20 17:14:40 +00:00 |
|
Anthony Shoumikhin
|
e2f9759bd0
|
Fix broken URLs (#152237)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/152237
Approved by: https://github.com/huydhn, https://github.com/malfet
|
2025-04-27 09:56:42 +00:00 |
|
Yang Wang
|
fd73ae2068
|
[Utilization] Convert timestamp to str for datetime64 (#145985)
Convert all timestamp(float) to int timestamp during data pipeline for db type datetime64.
float does not work when try to insert into clickhouse using jsonExtract.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/145985
Approved by: https://github.com/huydhn
|
2025-02-03 21:05:18 +00:00 |
|
Yang Wang
|
a9ed7bd78e
|
[utilization] pipeline to create clean db records (#145327)
upload_utilization_script to generate db-ready-insert records to s3
- generate two files: metadata and timeseries in ossci-utilization buckets
- convert log record to db format ones
- add unit test job for tools/stats/
Related Prs:
setup composite action for data pipeline: https://github.com/pytorch/pytorch/pull/145310
add permission for composite action to access S3 bucket: https://github.com/pytorch-labs/pytorch-gha-infra/pull/595
add insert logic in s3 replicator: https://github.com/pytorch/test-infra/pull/6217
Pull Request resolved: https://github.com/pytorch/pytorch/pull/145327
Approved by: https://github.com/huydhn
Co-authored-by: Huy Do <huydhn@gmail.com>
|
2025-01-29 23:48:50 +00:00 |
|
Yang Wang
|
6d4f5f7688
|
[Utilization][Usage Log] Add data model for record (#145114)
Add data model for consistency and data model change in the future.
The data model will be used during the post-test-process pipeline
Pull Request resolved: https://github.com/pytorch/pytorch/pull/145114
Approved by: https://github.com/huydhn
|
2025-01-23 19:04:41 +00:00 |
|