Commit Graph

87 Commits

Author SHA1 Message Date
Eli Uriegas
d0bc01fac2 ci: Migrate hardcoded docker builds to GHA (#67455)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67455

Migrates docker builds that don't have dependent jobs within the pytorch
repository to our new GHA docker build job

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: malfet, janeyx99

Differential Revision: D31997671

Pulled By: seemethere

fbshipit-source-id: 9d6f58fa8ea8731cf12457fe64dc65e70f3d9f25
2021-10-28 14:50:05 -07:00
Nikita Shulga
0c93c8e39a Disable linux-xenial-cuda10.2 config (#67344)
Summary:
linux-xenial-cuda10.2 and linux-bionic-cuda10.2 are very similar, no
need to run both configs

Moved all auxiliary builds from xenial to bionic

Pull Request resolved: https://github.com/pytorch/pytorch/pull/67344

Reviewed By: seemethere, janeyx99

Differential Revision: D31964850

Pulled By: malfet

fbshipit-source-id: d07ce266c843c7fd69b281e678c4247b0bf6da20
2021-10-28 10:10:13 -07:00
Eli Uriegas
81d188101f .github: Use 4xlarge instances for linux gpu (#67264)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67264

Downgrades linux gpu instances from 4xlarge -> 8xlarge

We were seeing capacity issues in terms of scaling 8xlarge instances,
downgrading this to 4xlarge (which only have a single gpu) to see if
that helps resolve some of the capacity issues we were seeing

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: janeyx99

Differential Revision: D31933488

Pulled By: seemethere

fbshipit-source-id: b41922ebb675e663cb035cd3795bc9bae94dcac7
2021-10-26 16:17:33 -07:00
Eli Uriegas
7052c41899 .github: Add workflow to build all docker images (#67215)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/67215

We were regularly seeing gaps in our docker image builds due to specific
workflows not being run when docker builds occurred on PRs, this should
remove that ambiguity and ensure that all docker builds be re-built if a
rebuild is deemed necessary

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D31910422

Pulled By: seemethere

fbshipit-source-id: f346e64f1857e35a995c49bf30521a3acd8af0b1
2021-10-26 11:14:04 -07:00
Alex Zhao
ac948f4f35 .github: Migrate linux-xenial-py3.6-gcc7 to GHA (#67072)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/66888

cc seemethere

Pull Request resolved: https://github.com/pytorch/pytorch/pull/67072

Reviewed By: seemethere

Differential Revision: D31900833

Pulled By: zhaoalex

fbshipit-source-id: 93f8995611169d991f90e07e8c13e08182969577
2021-10-25 11:40:12 -07:00
Nikita Shulga
77beccaedb Do not build PyTorch with caffe2 by default (#66658)
Summary:
CAFFE2 has been deprecated for a while, but still included in every PyTorch build.
We should stop building it by default, although CI should still validate that caffe2 code is buildable.

Build even fewer dependencies when compiling mobile builds without Caffe2
Introduce `TEST_CAFFE2` in torch.common.utils
Skip `TestQuantizedEmbeddingOps` and `TestJit.test_old_models_bc`  is code is compiled without Caffe2
Should be landed after https://github.com/pytorch/builder/pull/864

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66658

Reviewed By: driazati, seemethere, janeyx99

Differential Revision: D31669156

Pulled By: malfet

fbshipit-source-id: 1cc45e2d402daf913a4685eb9f841cc3863e458d
2021-10-21 20:32:47 -07:00
Nikita Shulga
9c4d7d96db Address feedback from #66673 (#66905)
Summary:
Specify both `build_generates_artifacts` and `exclude_tests` properties as suggested in https://github.com/pytorch/pytorch/pull/66673#pullrequestreview-783667960

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66905

Reviewed By: seemethere

Differential Revision: D31779742

Pulled By: malfet

fbshipit-source-id: 21f5543f3b767f38132be8c7e163455f39ff893f
2021-10-19 18:27:45 -07:00
Nikita Shulga
cbd7bac914 Migrate clang5-mobile build to GHA (#66673)
Summary:
`linux-xenial-py3-clang5-mobile-build`, `linux-xenial-py3-clang5-mobile-custom-build-dynamic`, `linux-xenial-py3-clang5-mobile-custom-build-dynamic` and `linux-xenial-py3-clang5-mobile-code-analysis` are just the flavors of regular linux build job with no tests.
`linux-xenial-py3-clang5-mobile-code-analysis` is the master only job

`code-analysis` job is dispatch to `.jenkins/pytorch/build-mobile-code-analysis.sh` in
583217fe37/.jenkins/pytorch/build.sh (L23-L25)
and all `mobile-build` jobs are dispatched to `.jenkins/pytorch/build-mobile.sh` in
583217fe37/.jenkins/pytorch/build.sh (L19-L21)

Rename `is_libtorch` `CIWorkflow` property into `build_generates_artifacts` and change defaults from False to True
Both libtorch and mobile build jobs do not generate build artifacts

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66673

Reviewed By: janeyx99

Differential Revision: D31674434

Pulled By: malfet

fbshipit-source-id: 24d05d55366202cd4d9c25ecab429cb8f670ded0
2021-10-19 14:13:29 -07:00
Eli Uriegas
8492e6bc6a .github: scheduled -> schedule, fix periodic (#66531)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66531

The github.event_name should be schedule not scheduled

Reference, https://docs.github.com/en/actions/learn-github-actions/events-that-trigger-workflows#schedule

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: malfet

Differential Revision: D31598136

Pulled By: seemethere

fbshipit-source-id: 4d67f7731b21e05dabc8f54b4ebf9a5d2d3a4e1e
2021-10-12 19:46:01 -07:00
Jane Xu
d921891f57 GHA: Stop skipping periodic jobs (#66264)
Summary:
they have been skipped for too long
![image](https://user-images.githubusercontent.com/31798555/136433267-f35c0507-23ab-4348-be43-78d299c3d654.png)

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66264

Reviewed By: dagitses, malfet, seemethere

Differential Revision: D31478705

Pulled By: janeyx99

fbshipit-source-id: 1324b123e3f8646e5cd671af4c1850398a6f6e3b
2021-10-12 14:39:47 -07:00
Eli Uriegas
3b4b1b2d23 .github: Remove confusing ciflow_config.enabled variable (#66260)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66260

Every workflow has ciflow enabled so this is not needed anymore

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: dagitses, janeyx99

Differential Revision: D31493340

Pulled By: seemethere

fbshipit-source-id: 8718fe5d22f4be6e0900962576782a9f23162a39
2021-10-11 09:39:31 -07:00
Nikita Shulga
ebe530a9cd Periodic jobs should not have CIFLOW_DEFAULT label (#66300)
Summary:
Noticed that `periodic-pytorch-linux-xenial-cuda10.2-cudnn7-py3-gcc7-slow-gradcheck` job has a `ciflow/default`, but does not have a `ciflow/scheduled` label
Added asserts to enforce that jobs with non-trival is_scheduled property do not have default and do have scheduled labesl

Rename `periodic-pytorch-linux-xenial-cuda10.2-cudnn7-py3-gcc7-slow-gradcheck` to `periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck`

Fixes #{issue number}

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66300

Reviewed By: seemethere

Differential Revision: D31493323

Pulled By: malfet

fbshipit-source-id: 194c1d7a4e659847d94a547b87a0d7d08e66406d
2021-10-07 19:57:32 -07:00
Eli Uriegas
84c5970a77 ci: Migrate slow_gradcheck to GHA (#65730)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65730

This should close out the door on migrating all scheduled workflows we have for CircleCI

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet pytorch/pytorch-dev-infra

Test Plan: Imported from OSS

Reviewed By: albanD

Differential Revision: D31225188

Pulled By: seemethere

fbshipit-source-id: 4c49e88ec017edc30e07325dbc613ff54dd164d8
2021-10-06 14:29:14 -07:00
Nikita Shulga
6d4d636d66 [GHA] Rectify trigger_action_only flag (#66209)
Summary:
No longer needed, as PR can be opened/reopened with specific label

Fixes https://github.com/pytorch/pytorch/issues/66110

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66209

Reviewed By: seemethere

Differential Revision: D31436292

Pulled By: malfet

fbshipit-source-id: 5b6e0875bec261862017dfe0eb3a5ec57fb8c705
2021-10-06 10:46:10 -07:00
Eli Uriegas
d2021e5e68 ci: Migrate vulkan builds to GHA (#66044)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/66044

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: malfet

Differential Revision: D31370889

Pulled By: seemethere

fbshipit-source-id: 399f5f0c184f7856dcddb138c357f1374706e676
2021-10-05 16:11:36 -07:00
Nikita Shulga
1932bc69e9 Move GHA to ONNX (#65975)
Summary:
- Delete CircleCI ONNX config
- Add sharded ONNX job to the list of generated workflows
- Move ONNX runtime installation from `pytorch-job-specs.yml` to `.jenkins/caffe2/test.sh`
- Limit MKLDNN to AVX2 ISA while running  Caffe2 tests

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65975

Reviewed By: seemethere

Differential Revision: D31327206

Pulled By: malfet

fbshipit-source-id: 15aa53e4481e846c62b4ee2db5c03047d68679a4
2021-10-05 09:31:57 -07:00
Jane Xu
69da4b4381 GHA: make obvious when we are running smoke tests to user (#66011)
Summary:
This PR clarifies what's run on PRs by explicitly stating when it runs smoke tests for windows CUDA and makes the logic so that user defined labels override other workflow logic.

1. Move smoke tests to its own config.

2. Make sure that when a user specifies a ciflow label that is not the default, the workflow runs as if it is on trunk.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/66011

Test Plan:
the default on PRs would generate this matrix (default replaced by smoke_tests)
![image](https://user-images.githubusercontent.com/31798555/135672182-64454ea3-ff43-4746-b8e4-09b0b28e9d33.png)
But when retriggered with a label, it looks like (note that there's no smoke_tests config):
![image](https://user-images.githubusercontent.com/31798555/135672601-5aa9a268-bc76-40f1-80c6-62b3fac6601d.png)

Reviewed By: VitalyFedyunin, seemethere

Differential Revision: D31355130

Pulled By: janeyx99

fbshipit-source-id: fed58ade4235b58176e1d1a24101aea0bea83aa4
2021-10-04 07:53:17 -07:00
Nikita Shulga
6a30d83596 Move ASAN to GHA (#65846)
Summary:
- Introduce `ciflow/sanitizers` label
- Modify asan pattern in `.jenkins/pytorch/build.sh`
- Produce wheel in `.jenkins/pytorch/build-asan.sh`
- Increase stack size hard limit to 82Mb in test docker containers

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65846

Reviewed By: seemethere

Differential Revision: D31282654

Pulled By: malfet

fbshipit-source-id: f73e692899cc9bbe106ececc26f1fe430dfeae9d
2021-09-30 09:49:52 -07:00
Nikita Shulga
5950240bdf Stop Win+CUDA-10.2 builds (#65649)
Summary:
See https://github.com/pytorch/pytorch/issues/65612 and https://github.com/pytorch/pytorch/issues/25393

Fixes https://github.com/pytorch/pytorch/issues/65648

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65649

Reviewed By: janeyx99

Differential Revision: D31189692

Pulled By: malfet

fbshipit-source-id: 6ec0548d5833f3428d882071d26c357d89b0a9ba
2021-09-29 15:41:23 -07:00
Eli Uriegas
2f712c452e .github: Remove confusing on_pull_request variable (#65731)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65731

It originally had purpose but after ciflow was introduced every PR had
on_pull_request set so it's not really as useful as it once was

Also removes the equally as confusing only_build_on_pull_request
variable as well

This change should produce no functional changes in our generated workflows

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet pytorch/pytorch-dev-infra

Test Plan: Imported from OSS

Reviewed By: janeyx99

Differential Revision: D31225398

Pulled By: seemethere

fbshipit-source-id: 7bd8e8175794ab7d09b0632321bf52538435e858
2021-09-29 11:56:13 -07:00
Nikita Shulga
72b27bde83 [CIFlow] Modify workflow trigger logic (#65733)
Summary:
CIFLow workflows should always run on push event
On pull-request workflow should run if label conditions are met or if
no `ciflow/` labels are associated with it, workflow is enabled by
default

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65733

Reviewed By: zhouzhuojie

Differential Revision: D31251278

Pulled By: malfet

fbshipit-source-id: 31ce745cb224df7c6fec1682ec4180513e3dadf3
2021-09-28 19:19:49 -07:00
Nikita Shulga
e1340d4282 [GHA] Small refactors (#65647)
Summary:
Introduce `main` method in generate_ci_workflows
Check that all `ciflow/` labels start with the same prefix
Move `ciflow_should_run` defenition to common.yml.j2

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65647

Reviewed By: janeyx99

Differential Revision: D31189537

Pulled By: malfet

fbshipit-source-id: 7cc47f63fb334c57f450034b931ff5bae1c0ed8b
2021-09-27 13:14:49 -07:00
Nikita Shulga
e331beef20 Delete code coverage jobs from CI (#65362)
Summary:
As it does not seem useful to the lots of peope, see https://fb.workplace.com/groups/1144215345733672/posts/2062909540530910

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65362

Reviewed By: janeyx99, bdhirsh

Differential Revision: D31061945

Pulled By: malfet

fbshipit-source-id: 912ed92cc901a370a40448f1127c3ba43640ac43
2021-09-22 05:38:35 -07:00
Jane Xu
9afdf017dc Add force_on_cpu test to win cuda10.2 on GHA (#65094)
Summary:
Part of migrating from Circle.

Once we get a successful force_on_cpu test, we can move it to trunk only.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65094

Reviewed By: seemethere

Differential Revision: D31086289

Pulled By: janeyx99

fbshipit-source-id: e1d135cc844d51f0b243b40efb49edca277d9de8
2021-09-21 11:14:15 -07:00
zhouzhuojie
2abf3594d5 Fix unassigned ciflow trigger (#65354)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/65250#issuecomment-923120764

this is a limitation of github action triggers, it's hard to introduce condition before the workflow, that's why we intentionally pick the rare event ("unassigned"). The fix I think for people didn't opt-in ciflow and manually unassign, is to run all the CI (otherwise we introduce new condition on this and not worth to make things even complex).

`unassigned` event payload looks like this, just to make sure `github.event.assignee.login` is pointing to the right location.

```
  {
    "action": "unassigned",
    "assignee": {
      "avatar_url": "https://avatars.githubusercontent.com/u/658840?v=4",
      "events_url": "https://api.github.com/users/zhouzhuojie/events{/privacy}",
      "followers_url": "https://api.github.com/users/zhouzhuojie/followers",
      "following_url": "https://api.github.com/users/zhouzhuojie/following{/other_user}",
      "gists_url": "https://api.github.com/users/zhouzhuojie/gists{/gist_id}",
      "gravatar_id": "",
      "html_url": "https://github.com/zhouzhuojie",
      "id": 658840,
      "login": "zhouzhuojie",
      "node_id": "MDQ6VXNlcjY1ODg0MA==",
      "organizations_url": "https://api.github.com/users/zhouzhuojie/orgs",
      "received_events_url": "https://api.github.com/users/zhouzhuojie/received_events",
      "repos_url": "https://api.github.com/users/zhouzhuojie/repos",
      "site_admin": false,
      "starred_url": "https://api.github.com/users/zhouzhuojie/starred{/owner}{/repo}",
      "subscriptions_url": "https://api.github.com/users/zhouzhuojie/subscriptions",
      "type": "User",
      "url": "https://api.github.com/users/zhouzhuojie"
    },
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65354

Reviewed By: malfet, seemethere, janeyx99

Differential Revision: D31060212

Pulled By: zhouzhuojie

fbshipit-source-id: ce815cc96e8a00016646d6f02f0917169fa652dc
2021-09-20 12:33:23 -07:00
Nikita Shulga
67570a60ba Disable ParallelTBB (#65092)
Summary:
As ParallelTBB's `at::get_thread_num` is not compatible with general model used by OpenMP and ParallelNative (where it is an contiguous thread index within parallel loop), see https://github.com/pytorch/pytorch/issues/64571#issuecomment-914691883

More examples of similar regressions: https://github.com/pytorch/pytorch/runs/3612142217

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65092

Reviewed By: zhouzhuojie

Differential Revision: D30995936

Pulled By: malfet

fbshipit-source-id: db145b6a850d794f2c954f59f30249b291473e36
2021-09-16 12:38:45 -07:00
Eli Uriegas
3d5923366d .github: Enable only specific workflows for canary (#65099)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65099

Utilizes ciflow to enable only specific workflows for
pytorch/pytorch-canary to reduce noise on that specific repository

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: jbschlosser

Differential Revision: D30973691

Pulled By: seemethere

fbshipit-source-id: 371765535b42a00bd72c2551c4faebf733d759f0
2021-09-15 16:53:12 -07:00
Eli Uriegas
59c486f2f3 ci: Disable jit legacy on circleci, enable on gha (#65106)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/65106

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet lg20987 pytorch/pytorch-dev-infra

Test Plan: Imported from OSS

Reviewed By: malfet, janeyx99

Differential Revision: D30976186

Pulled By: seemethere

fbshipit-source-id: 8958f821eab9aa284496c57915894ed70f6b2fff
2021-09-15 16:11:38 -07:00
Jane Xu
3f27c1ae78 Replace windows 10.2 smoke tests on PRs to be 11.3 (#65090)
Summary:
As we default to linux CUDA 11.3 on PRs, we should do the same thing with Windows (instead of having 10.2 be the default). This means that 10.2 will now be master only, and 11.3 windows smoke tests will run on every PR.

This also copies over the "run smoke tests only" config--removing that will be in a separate PR once there's more certain decision making.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/65090

Reviewed By: seemethere

Differential Revision: D30968382

Pulled By: janeyx99

fbshipit-source-id: c73f9a2cc800b678909365c4d80627d29fc09f94
2021-09-15 16:01:07 -07:00
driazati
ee554e2e96 Re-land Fix test report uploading (#64958)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64958

This is a re-do of #64846 which was missing a path prefix for windows test reports

Test Plan: Imported from OSS

Reviewed By: seemethere

Differential Revision: D30915253

Pulled By: driazati

fbshipit-source-id: d14d0a64d2f8aabc335db9c4d0d2b63512887c66
2021-09-13 18:36:26 -07:00
Eli Uriegas
914e3a861a Revert D30878101: [pytorch][PR] Fix test report uploading
Test Plan: revert-hammer

Differential Revision:
D30878101 (fba40bfc1a)

Original commit changeset: 0730f17fa3f4

fbshipit-source-id: dad89e68b4daf656dd0b592bc9b2758f00af38c6
2021-09-13 15:24:44 -07:00
driazati
fba40bfc1a Fix test report uploading (#64846)
Summary:
Previously we just weren't uploading Windows test report XML files to S3, only to GitHub actions. This was different than Linux where we use both (though maybe we can kill the GHA upload in a follow up PR since I don't think it's very useful anymore). This factors it all out into a macro so they both do the same thing. This also fixes the naming of uploaded files to include info about the job name (the full config, so they can be matched to the job visually or by the included job id).

See https://hud.pytorch.org/pr/64846 for results

Pull Request resolved: https://github.com/pytorch/pytorch/pull/64846

Reviewed By: seemethere

Differential Revision: D30878101

Pulled By: driazati

fbshipit-source-id: 0730f17fa3f46a32c131f52669084c3103b0e616
2021-09-13 13:22:54 -07:00
Eli Uriegas
ff18195df9 .github: Upgrade windows CUDA 10.1 -> 10.2 (#64658)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64658

We don't release 10.1 anymore so let's bump to 10.2

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

Test Plan: Imported from OSS

Reviewed By: malfet, janeyx99

Differential Revision: D30811178

Pulled By: seemethere

fbshipit-source-id: c504ebf7f0d4c0d6229319d774f808b4ba0facd9
2021-09-08 14:43:33 -07:00
Nikita Shulga
a48d83a575 Move ParallelTBB to GHA (take 2) (#64193)
Summary:
2nd attempt to do the same
Skip failing `TestTensorCreationCPU.test_trilu_indices_cpu`

Pull Request resolved: https://github.com/pytorch/pytorch/pull/64193

Reviewed By: mrshenli

Differential Revision: D30779469

Pulled By: malfet

fbshipit-source-id: 5c51fcbb383d0823d0e953d7af181b5f22eda9ab
2021-09-07 15:11:00 -07:00
Nikita Shulga
571a2becf3 Move ParallelNative and PureTorch to GHA (#64452)
Summary:
Separate ParallelTBB move to https://github.com/pytorch/pytorch/pull/64193 as it requires some further investiagation

Pull Request resolved: https://github.com/pytorch/pytorch/pull/64452

Reviewed By: seemethere, janeyx99

Differential Revision: D30738337

Pulled By: malfet

fbshipit-source-id: 81c46423e903058bd1a3e8553e8a10ce978eeefd
2021-09-06 11:40:44 -07:00
Eli Uriegas
3c79e0b314 .github: Migrate pytorch_linux_bionic_py_3_6_clang9 to GHA (#64218)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64218

Relies on https://github.com/fairinternal/pytorch-gha-infra/pull/11

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet walterddr lg20987 pytorch/pytorch-dev-infra bdhirsh

Test Plan: Imported from OSS

Reviewed By: malfet, H-Huang, janeyx99

Differential Revision: D30651516

Pulled By: seemethere

fbshipit-source-id: e5843dfe84f096f2872d88f2e53e9408ad2fe399
2021-09-02 14:51:00 -07:00
Eli Uriegas
09e53c0cfe .github: Adding configuration for backwards_compat (#64204)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64204

Adds backwards_compat to our existing test matrix for github actions

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet walterddr lg20987 pytorch/pytorch-dev-infra

Test Plan: Imported from OSS

Reviewed By: malfet

Differential Revision: D30646764

Pulled By: seemethere

fbshipit-source-id: f0da6027e29fab03aff058cb13466fae5dcf3678
2021-08-30 13:59:00 -07:00
Eli Uriegas
9035a1cb4d .github: Adding configuration for docs_test (#64201)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/64201

Adds docs_test to our existing test matrix for github actions

Signed-off-by: Eli Uriegas <eliuriegas@fb.com>

cc ezyang seemethere malfet walterddr lg20987 pytorch/pytorch-dev-infra

Test Plan: Imported from OSS

Reviewed By: malfet

Differential Revision: D30646765

Pulled By: seemethere

fbshipit-source-id: 946adae01ff1f1f7ebe626e408e161b77b19a011
2021-08-30 13:57:20 -07:00
Nikita Shulga
9db56531f7 Revert D30620966: [pytorch][PR] Move Parallel[Native|TBB] to GHA
Test Plan: revert-hammer

Differential Revision:
D30620966 (223f886032)

Original commit changeset: 9a23e4b3e168

fbshipit-source-id: b9248d377b9a7b850dfb3f10f3350fbc9855acfe
2021-08-29 15:51:27 -07:00
Nikita Shulga
223f886032 Move Parallel[Native|TBB] to GHA (#64123)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/64123

Reviewed By: driazati

Differential Revision: D30620966

Pulled By: malfet

fbshipit-source-id: 9a23e4b3e16870f77bf18df4370cd468603d592d
2021-08-28 11:50:30 -07:00
Sergei Vorobev
f922b58b5f [bazel] GPU-support: add @local_config_cuda and @cuda (#63604)
Summary:
## Context

We take the first step at tackling the GPU-bazel support by adding bazel external workspaces `local_config_cuda` and `cuda`, where the first one has some hardcoded values and lists of files, and the second one provides a nicer, high-level wrapper that maps into the already expected by pytorch bazel targets that are guarded with `if_cuda` macro.

The prefix `local_config_` signifies the fact that we are breaking the bazel hermeticity philosophy by explicitly relaying on the CUDA installation that is present on the machine.

## Testing

Notice an important scenario that is unlocked by this change: compilation of cpp code that depends on cuda libraries (i.e. cuda.h and so on).

Before:
```
sergei.vorobev@cs-sv7xn77uoy-gpu-1628706590:~/src/pytorch4$ bazelisk build --define=cuda=true //:c10
ERROR: /home/sergei.vorobev/src/pytorch4/tools/config/BUILD:12:1: no such package 'tools/toolchain': BUILD file not found in any of the following directories. Add a BUILD file to a directory to mark it as a package.
 - /home/sergei.vorobev/src/pytorch4/tools/toolchain and referenced by '//tools/config:cuda_enabled_and_capable'
ERROR: While resolving configuration keys for //:c10: Analysis failed
ERROR: Analysis of target '//:c10' failed; build aborted: Analysis failed
INFO: Elapsed time: 0.259s
INFO: 0 processes.
FAILED: Build did NOT complete successfully (2 packages loaded, 2 targets configured)
```

After:
```
sergei.vorobev@cs-sv7xn77uoy-gpu-1628706590:~/src/pytorch4$ bazelisk build --define=cuda=true //:c10
INFO: Analyzed target //:c10 (6 packages loaded, 246 targets configured).
INFO: Found 1 target...
Target //:c10 up-to-date:
  bazel-bin/libc10.lo
  bazel-bin/libc10.so
INFO: Elapsed time: 0.617s, Critical Path: 0.04s
INFO: 0 processes.
INFO: Build completed successfully, 1 total action
```

The `//:c10` target is a good testing one for this, because it has such cases where the [glob is different](075024b9a3/BUILD.bazel (L76-L81)), based on do we compile for CUDA or not.

## What is out of scope of this PR

This PR is a first in a series of providing the comprehensive GPU bazel build support. Namely, we don't tackle the [cu_library](11a40ad915/tools/rules/cu.bzl (L2)) implementation here. This would be a separate large chunk of work.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63604

Reviewed By: soulitzer

Differential Revision: D30442083

Pulled By: malfet

fbshipit-source-id: b2a8e4f7e5a25a69b960a82d9e36ba568eb64595
2021-08-27 09:33:42 -07:00
Rong Rong (AI Infra)
7ccc4b5cc8 [CI] move distributed test into its own CI job (#62896)
Summary:
Moving distributed to its own job.

- [x] ensure there should be a distributed test job for every default test job matrix (on GHA)
- [x] ensure that circleci jobs works for distributed as well
- [x] waiting for test distributed to have its own run_test.py launch options, see https://github.com/pytorch/pytorch/issues/63147

Pull Request resolved: https://github.com/pytorch/pytorch/pull/62896

Reviewed By: seemethere

Differential Revision: D30230856

Pulled By: walterddr

fbshipit-source-id: 0cad620f6cd9e56c727c105458d76539a5ae976f
2021-08-26 08:02:20 -07:00
David Riazati
1be1c901aa Remove render_test_results job (#63877)
Summary:
This removes the `render_test_results` job we had before which had been causing some confusion among devs when it failed and isn't really necessary now that we can actually render test results on the PR HUD.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63877

Reviewed By: walterddr, janeyx99

Differential Revision: D30546705

Pulled By: driazati

fbshipit-source-id: 55fdafdb6f80924d941ffc15ee10787cb54f34a1
2021-08-25 15:55:55 -07:00
zhouzhuojie
3d4aabfc48 Fix ciflow/all label generation (#63954)
Summary:
the `ciflow/all` is automatically added but need to be added before we call `gen_root_job_condition`.

- fix the order of adding `ciflow/all`
- refactor all the string into global constants

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63954

Reviewed By: malfet

Differential Revision: D30545596

Pulled By: zhouzhuojie

fbshipit-source-id: 83ab668f0234488afb855a72e3ebd4503f7f1a78
2021-08-25 11:32:32 -07:00
Jane Xu
8c897d254d Swap CUDA 11.1 and 11.3 in CI to make 11.1 periodic (#63900)
Summary:
Preparing for supporting 11.3 in the next release.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63900

Reviewed By: malfet

Differential Revision: D30541437

Pulled By: janeyx99

fbshipit-source-id: a7297da7f7818a4291b1c321d62d76fc2c0f1f90
2021-08-25 09:01:26 -07:00
zhouzhuojie
3926fdbaa4 [skip ci] Add generated comment to ruleset json (#63896)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/63896

Reviewed By: heitorschueroff

Differential Revision: D30529820

Pulled By: zhouzhuojie

fbshipit-source-id: 7529803af23ea36a7bcb673cd399da80da8e3feb
2021-08-25 08:53:33 -07:00
zhouzhuojie
975542c314 Add more ciflow labels for more workflows (#63410)
Summary:
- Add more ciflow labels and enable it for more workflows.
- Only the 'ciflow/default' workflows are run by default on pull_request time
- Other labels can be manually triggered by (adding the labels + unassign pytorchbot), OR wait for pytorchbot's comment opt-in rollout
- The label design is a logical operator `OR`, i.e. adding ('ciflow/cuda' + 'ciflow/win') will trigger the union of them. (design feedback is needed here)

Typical default workflows for normal PRs.

<details>
<summary>Generated label rules</summary>

![image](https://user-images.githubusercontent.com/658840/129779905-eb5e56dd-a696-4040-9eb6-71ecb6487dc1.png)

```
{
  "label_rules": {
    "ciflow/all": [
      "libtorch-linux-xenial-cuda10.2-py3.6-gcc7",
      "libtorch-linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-bionic-cuda10.2-py3.9-gcc7",
      "linux-bionic-py3.8-gcc9-coverage",
      "linux-xenial-cuda10.2-py3.6-gcc7",
      "linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-xenial-py3.6-gcc5.4",
      "linux-xenial-py3.6-gcc7-bazel-test",
      "periodic-libtorch-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-win-vs2019-cuda11.3-py3",
      "win-vs2019-cpu-py3",
      "win-vs2019-cuda10.1-py3",
      "win-vs2019-cuda11.1-py3"
    ],
    "ciflow/bazel": [
      "linux-xenial-py3.6-gcc7-bazel-test"
    ],
    "ciflow/coverage": [
      "linux-bionic-py3.8-gcc9-coverage"
    ],
    "ciflow/cpu": [
      "linux-bionic-py3.8-gcc9-coverage",
      "linux-xenial-py3.6-gcc5.4",
      "linux-xenial-py3.6-gcc7-bazel-test",
      "win-vs2019-cpu-py3"
    ],
    "ciflow/cuda": [
      "libtorch-linux-xenial-cuda10.2-py3.6-gcc7",
      "libtorch-linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-bionic-cuda10.2-py3.9-gcc7",
      "linux-xenial-cuda10.2-py3.6-gcc7",
      "linux-xenial-cuda11.1-py3.6-gcc7",
      "periodic-libtorch-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-win-vs2019-cuda11.3-py3",
      "win-vs2019-cuda10.1-py3",
      "win-vs2019-cuda11.1-py3"
    ],
    "ciflow/default": [
      "linux-bionic-py3.8-gcc9-coverage",
      "linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-xenial-py3.6-gcc5.4",
      "linux-xenial-py3.6-gcc7-bazel-test",
      "win-vs2019-cpu-py3",
      "win-vs2019-cuda10.1-py3"
    ],
    "ciflow/libtorch": [
      "libtorch-linux-xenial-cuda10.2-py3.6-gcc7",
      "libtorch-linux-xenial-cuda11.1-py3.6-gcc7",
      "periodic-libtorch-linux-xenial-cuda11.3-py3.6-gcc7"
    ],
    "ciflow/linux": [
      "libtorch-linux-xenial-cuda10.2-py3.6-gcc7",
      "libtorch-linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-bionic-cuda10.2-py3.9-gcc7",
      "linux-bionic-py3.8-gcc9-coverage",
      "linux-xenial-cuda10.2-py3.6-gcc7",
      "linux-xenial-cuda11.1-py3.6-gcc7",
      "linux-xenial-py3.6-gcc5.4",
      "linux-xenial-py3.6-gcc7-bazel-test",
      "periodic-libtorch-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-linux-xenial-cuda11.3-py3.6-gcc7"
    ],
    "ciflow/scheduled": [
      "periodic-libtorch-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-linux-xenial-cuda11.3-py3.6-gcc7",
      "periodic-win-vs2019-cuda11.3-py3"
    ],
    "ciflow/slow": [
      "linux-bionic-cuda10.2-py3.9-gcc7",
      "linux-xenial-cuda10.2-py3.6-gcc7"
    ],
    "ciflow/win": [
      "periodic-win-vs2019-cuda11.3-py3",
      "win-vs2019-cpu-py3",
      "win-vs2019-cuda10.1-py3",
      "win-vs2019-cuda11.1-py3"
    ]
  },
  "version": "v1"
}
```
</details>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63410

Reviewed By: ngimel

Differential Revision: D30378553

Pulled By: zhouzhuojie

fbshipit-source-id: 4e0953740793e5e72b95018f8ab2ce4a6a364c38
2021-08-17 17:00:09 -07:00
Nikita Shulga
794b04c6c8 Add step to report code coverage from GHA (#63373)
Summary:
Similar to the logic provided in b2069e7d01/.circleci/verbatim-sources/job-specs/pytorch-job-specs.yml (L197-L201)

Fixes https://github.com/pytorch/pytorch/issues/63366

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63373

Reviewed By: walterddr

Differential Revision: D30357737

Pulled By: malfet

fbshipit-source-id: 20b115eb4d6412bd9895680308a9097742d2ae7b
2021-08-16 20:42:38 -07:00
Rong Rong (AI Infra)
0b89e69e7c [BE] delete GHA generated workflow files before regen (#63148)
Summary:
Unlike circle which all workflow goes in one file, GHA legacy generated files will stay silently in once's PR. e.g. when we change build_environment name and that's not ideal.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63148

Reviewed By: bdhirsh

Differential Revision: D30283382

Pulled By: walterddr

fbshipit-source-id: ffdd5bf9561dd38499052855a12ee5cf838a20b0
2021-08-12 14:43:00 -07:00
Rong Rong (AI Infra)
daba551922 [BE] shorten CI name part2 (#63030)
Summary:
Fixes https://github.com/pytorch/pytorch/issues/62357
there's no need to specify cudnn version since they are recommended from cuda version already.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/63030

Reviewed By: zhouzhuojie, driazati

Differential Revision: D30226354

Pulled By: walterddr

fbshipit-source-id: 7e2dc577810e0ce80ee27569c25a814566250ab1
2021-08-12 08:14:22 -07:00