Commit Graph

26 Commits

Author SHA1 Message Date
Yuanyuan Chen
8de85896e0 Enable ruff rule E721 (#165162)
`E721` checks for object type comparisons that use `==` and other comparison operators. This is useful because `is` is the recommended way to compare types.
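
A minimal before/after illustration of what the rule flags (hypothetical snippet, not taken from the PR):

```python
x = 1

# Flagged by E721: comparing types with an equality operator
if type(x) == int:
    ...

# Preferred: an identity check, or isinstance() when subclasses should match
if type(x) is int:
    ...
if isinstance(x, int):
    ...
```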

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165162
Approved by: https://github.com/Skylion007
2025-10-13 01:48:55 +00:00
PyTorch MergeBot
816fb7f48d Revert "Enable ruff rule E721 (#165162)"
This reverts commit 9e7c19f72b.

Reverted https://github.com/pytorch/pytorch/pull/165162 on behalf of https://github.com/pytorch-auto-revert due to Reverted automatically by pytorch's autorevert, to avoid this behaviour add the tag autorevert: disable ([comment](https://github.com/pytorch/pytorch/pull/165162#issuecomment-3393328271))
2025-10-11 13:25:40 +00:00
Yuanyuan Chen
9e7c19f72b Enable ruff rule E721 (#165162)
`E721` checks for object type comparisons that use `==` and other comparison operators. This is useful because `is` is the recommended way to compare types.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/165162
Approved by: https://github.com/Skylion007
2025-10-11 06:43:53 +00:00
Catherine Lee
4908fb53c3 [testing] Add test owner labels for some ao sparse tests (#163203)
I am trying to give some test files better owner labels than `module: unknown`. I am not sure about them, but they seem pretty reasonable.
Pull Request resolved: https://github.com/pytorch/pytorch/pull/163203
Approved by: https://github.com/jcaip
2025-09-18 16:08:13 +00:00
Xuehai Pan
6d5c789ad5 [BE][PYFMT] migrate PYFMT for test/[a-h]*/ to ruff format (#144555)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/144555
Approved by: https://github.com/ezyang
ghstack dependencies: #144551, #144554
2025-06-24 04:53:54 +00:00
Anthony Barbier
b1b8e57cda Add __main__ guards to ao tests (#154612)
This is the first PR of a series in an attempt to get the content of #134592 merged as smaller PRs (Given that the original one was closed due to a lack of reviewers).

This specific PR contains:
- Add and use a common `raise_on_run_directly` method for when a user directly runs a test file that should not be run this way; it prints the file the user should have run instead (see the sketch below).
- Update the ao tests.
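
A hedged sketch of what such a helper could look like (the name matches the PR description, but the body and placement here are assumptions, not the actual implementation):

```python
def raise_on_run_directly(correct_file: str) -> None:
    # Intended for `if __name__ == "__main__":` blocks of test files that
    # must not be executed directly; points the user at the right entry point.
    raise RuntimeError(
        "This test file is not meant to be run directly; "
        f"please run {correct_file} instead."
    )
```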

There will be follow-up PRs to update the other test suites, but I don't have permission to create branches directly on pytorch/pytorch, so I can't create a stack and will therefore have to create them one at a time.

Cc @jerryzh168
Pull Request resolved: https://github.com/pytorch/pytorch/pull/154612
Approved by: https://github.com/jcaip
2025-06-10 18:33:09 +00:00
Tom Ritchford
d25e6e623f Fix unused Python variables in test/[a-d]* (#134665)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/134665
Approved by: https://github.com/albanD
2024-12-13 22:13:12 +00:00
Oguz Ulgen
221350e3a4 Add None return type to init -- tests (#132352)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/132352
Approved by: https://github.com/ezyang
ghstack dependencies: #132335, #132351
2024-08-01 15:44:51 +00:00
Xuehai Pan
548c460bf1 [BE][Easy][7/19] enforce style for empty lines in import segments in test/[a-c]*/ and test/[q-z]*/ (#129758)
See https://github.com/pytorch/pytorch/pull/129751#issue-2380881501. Most changes are auto-generated by the linter.

You can review these PRs via:

```bash
git diff --ignore-all-space --ignore-blank-lines HEAD~1
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/129758
Approved by: https://github.com/ezyang
2024-07-31 10:54:03 +00:00
Arun Pa
f71e368969 UFMT formatting on test/autograd test/ao test/cpp test/backends (#123369)
Partially addresses #123062

Ran lintrunner on
- test/_test_bazel.py
- test/ao
- test/autograd test/backends test/benchmark_utils test/conftest.py test/bottleneck_test test/cpp

Pull Request resolved: https://github.com/pytorch/pytorch/pull/123369
Approved by: https://github.com/huydhn
2024-04-05 18:51:38 +00:00
atalman
244b124bb8 Add linux cpu test for 3.12 (#117853)
This is continuation of work: https://github.com/pytorch/pytorch/pull/113987

Co-authored-by: albanD <desmaison.alban@gmail.com>
Pull Request resolved: https://github.com/pytorch/pytorch/pull/117853
Approved by: https://github.com/albanD
2024-02-14 20:52:23 +00:00
Aaron Gokaslan
3fe437b24b [BE]: Update flake8 to v6.1.0 and fix lints (#116591)
Updates flake8 to v6.1.0 and fixes a few lints using sed and some ruff tooling.
- Replace `assert(0)` with `raise AssertionError()`
- Remove extraneous parentheses, e.g.
  - `assert(a == b)` -> `assert a == b`
  - `if(x > y or y < z):`->`if x > y or y < z:`
  - And `return('...')` -> `return '...'`

Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>

Pull Request resolved: https://github.com/pytorch/pytorch/pull/116591
Approved by: https://github.com/albanD, https://github.com/malfet
2024-01-03 06:04:44 +00:00
Aaron Gokaslan
ee5d981249 [BE]: Enable RUFF PERF402 and apply fixes (#115505)
* Enable PERF402. Makes code more efficient and succinct by removing useless list copies that could be accomplished either via a list constructor or an extend call. All test cases have noqa added, since performance is not as sensitive in that folder.
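
A small illustration of the kind of rewrite PERF402 suggests (hypothetical snippet, not from the PR):

```python
items = ["a", "b", "c"]

# Before: element-by-element copy, flagged by PERF402
result = []
for item in items:
    result.append(item)

# After: a list constructor (or result.extend(items)) does the same work
result = list(items)
```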

Pull Request resolved: https://github.com/pytorch/pytorch/pull/115505
Approved by: https://github.com/malfet
2023-12-20 18:01:24 +00:00
Kazuaki Ishizaki
deb800ee81 Fix typo under test directory (#111304)
This PR fixes typos in comments under the `test` directory.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/111304
Approved by: https://github.com/Skylion007
2023-10-16 23:06:06 +00:00
LINGAO XIAO
e7b2430818 add pruning method: Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (#95689)
add `class FPGMStructured`
add `function FPGM_structured()`
add `function _validate_distance_type()`
add `function _compute_distance()`

Implements the method described in issue #39765.

---
FPGMSparsifier is implemented with the new PyTorch pruning API `torch.ao.pruning`.
It is a structured pruning method and is added under `torch.ao.pruning._experimental`. Test cases are added in `test_structured_sparsifier.py`.
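
A minimal sketch of the FPGM criterion itself, using plain torch ops under assumed shapes (not the actual `_experimental` implementation):

```python
import torch

def fpgm_prune_indices(weight: torch.Tensor, amount: float) -> torch.Tensor:
    # weight: conv weight of shape (out_channels, in_channels, kH, kW)
    flat = weight.flatten(1)             # one row per filter
    dist = torch.cdist(flat, flat, p=2)  # pairwise L2 distances between filters
    scores = dist.sum(dim=1)             # total distance to all other filters
    # Filters closest to the geometric median (smallest total distance)
    # are the most redundant, so they are pruned first.
    return torch.argsort(scores)[: int(amount * flat.size(0))]
```
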
Pull Request resolved: https://github.com/pytorch/pytorch/pull/95689
Approved by: https://github.com/jcaip
2023-08-02 16:24:42 +00:00
Justin Chu
c0d8a4af0a [BE] Enable ruff's UP rules and autoformat ao/ (#105430)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/105430
Approved by: https://github.com/albanD, https://github.com/malfet
2023-07-19 13:44:37 +00:00
Jesse Cai
93063768da [pruning][core][feature] Implement convert for pruner (#97545)
Summary:

This PR implements `BaseSparsifier.convert()`, which performs module swapping.
The modules and mappings will be merged in a future PR.
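
A hedged usage sketch of where `convert()` sits in the flow (the surrounding calls follow the usual sparsifier API; the exact `convert` signature and its default mapping are assumptions, and the PR notes the mappings land later):

```python
import torch.nn as nn
from torch.ao.pruning import WeightNormSparsifier  # a concrete BaseSparsifier

model = nn.Sequential(nn.Linear(8, 8), nn.ReLU(), nn.Linear(8, 4))
sparsifier = WeightNormSparsifier(sparsity_level=0.5)

sparsifier.prepare(model, config=None)  # attach masks/parametrizations
sparsifier.step()                       # compute/update the masks
# The step this PR adds: swap parametrized modules for sparse equivalents.
model = sparsifier.convert(model)
```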

Test Plan:
`python test/test_ao_sparsity.py -- TestBaseSparsifier.test_convert`

Pull Request resolved: https://github.com/pytorch/pytorch/pull/97545
Approved by: https://github.com/jerryzh168
2023-04-05 16:57:11 +00:00
Jesse Cai
86ab4d49d4 [pruning][core][feature] LSTM Structured Pruning prune_functions + pattern (#90801)
Summary:

This PR adds in support for LSTM Structured Pruning.

- Adds in LSTMSaliencyPruner, an implemented pruner that splits the packed weights, finds the appropriate mask for each piece individually based on saliency, and then combines them to create an overall mask for the LSTM.
- Adds in pruning functions for LSTM pruning, which split the weights, apply the masks, and then recombine the pruned weights. Works for both single- and multi-layer LSTMs (see the sketch below).
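
A hedged sketch of the per-gate masking idea (assumes the packed `weight_ih_l0` layout of shape `(4 * hidden_size, input_size)`; the actual pruner's logic may differ):

```python
import torch

def lstm_saliency_mask(weight: torch.Tensor, amount: float) -> torch.Tensor:
    # Split the packed weight into its four gate blocks (i, f, g, o).
    masks = []
    for gate in torch.chunk(weight, 4, dim=0):
        saliency = gate.abs().sum(dim=1)  # L1 norm per output row
        pruned = torch.argsort(saliency)[: int(amount * saliency.numel())]
        mask = torch.ones_like(saliency, dtype=torch.bool)
        mask[pruned] = False
        masks.append(mask)
    # Recombine the per-gate masks into one row mask for the packed weight.
    return torch.cat(masks)
```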

Also added basic patterns to the default set of patterns:
- LSTM -> Linear pruning
- LSTM -> LayerNorm -> Linear pruning

Adds tests to check that LSTM pruning works, as well as tests for LSTMSaliencyPruner.

Test Plan:
`python test/test_ao_sparsity.py -- TestBaseStructuredSparsifier.test_prune_lstm_linear_single_layer`
`python test/test_ao_sparsity.py -- TestBaseStructuredSparsifier.test_prune_lstm_linear_multiple_layer`
`python test/test_ao_sparsity.py -- TestBaseStructuredSparsifier.test_prune_lstm_layernorm_linear_single_layer`
`python test/test_ao_sparsity.py -- TestBaseStructuredSparsifier.test_prune_lstm_layernorm_linear_multiple_layer`
`python test/test_ao_sparsity.py -- TestSaliencyPruner.test_lstm_saliency_pruner_update_mask`

Differential Revision: [D42199001](https://our.internmc.facebook.com/intern/diff/D42199001)
Pull Request resolved: https://github.com/pytorch/pytorch/pull/90801
Approved by: https://github.com/jerryzh168
2023-02-01 19:29:03 +00:00
Jesse Cai
32e9b29ce9 [pruning][core][feature] Add in SaliencyPruner to pruner._experimental (#91814)
Summary:

This PR adds in SaliencyPruner, an implementation of L1-norm pruning for structured pruning, as well as additional tests for the SaliencyPruner.
The README.md references this file, but I forgot to add it earlier when writing the tutorial.
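
A compact sketch of the L1-norm criterion such a pruner's mask update could use (simplified shapes and names; not the actual implementation):

```python
import torch

def l1_row_mask(weight: torch.Tensor, amount: float) -> torch.Tensor:
    # Rows with the smallest L1 norm are the least salient and get pruned.
    saliency = weight.abs().flatten(1).sum(dim=1)
    pruned = torch.argsort(saliency)[: int(amount * saliency.numel())]
    mask = torch.ones_like(saliency, dtype=torch.bool)
    mask[pruned] = False
    return mask
```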

Test Plan:
```
python test/test_ao_sparsity.py -- TestSaliencyPruner
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/91814
Approved by: https://github.com/jerryzh168
2023-01-10 04:04:55 +00:00
Eddie Yan
dabf515c18 [cuDNN][cuDNN V8 API] (re-re-re-open) cuDNN V8 API on by default (#91117)
Re-opening following #91025

CC @ptrblck @ngimel
Pull Request resolved: https://github.com/pytorch/pytorch/pull/91117
Approved by: https://github.com/ngimel
2022-12-20 18:52:29 +00:00
PyTorch MergeBot
ba7aeac37b Revert "[cuDNN][cuDNN V8 API] (re-re-open) cuDNN V8 API on by default (#89022)"
This reverts commit eecd621f06.

Reverted https://github.com/pytorch/pytorch/pull/89022 on behalf of https://github.com/ngimel due to breaks some convolution configurations #91025
2022-12-16 23:06:35 +00:00
Eddie Yan
eecd621f06 [cuDNN][cuDNN V8 API] (re-re-open) cuDNN V8 API on by default (#89022)
Testing V8 on by default again after fixes have been merged for, e.g., https://github.com/pytorch/torchdynamo/issues/1833.

One new failure that seems to surface with V8 enabled appears in halonext + amp:
```
RuntimeError: Internal Triton PTX codegen error:
Segmentation fault (core dumped)
```
But I'm not sure whether this points to a V8 issue or a Triton issue. CC @ngimel @ptrblck

Current dynamo benchmarks on A100:
v7 vs. v8
|dev |name                           |batch_size|abs_latency_v7|abs_latency_v8|
|----|-------------------------------|----------|--------------|--------------|
|cuda|adv_inception_v3               |128       |166.0240      |165.5798      |
|cuda|beit_base_patch16_224          |64        |123.5912      |123.0797      |
|cuda|botnet26t_256                  |128       |107.7343      |107.5948      |
|cuda|cait_m36_384                   |4         |184.5038      |184.0271      |
|cuda|coat_lite_mini                 |128       |142.3061      |140.5814      |
|cuda|convit_base                    |64        |165.2499      |161.0743      |
|cuda|convmixer_768_32               |32        |325.6984      |325.7094      |
|cuda|convnext_base                  |64        |237.4632      |238.0142      |
|cuda|crossvit_9_240                 |128       |72.2980       |72.4367       |
|cuda|cspdarknet53                   |64        |96.6862       |96.8308       |
|cuda|deit_base_distilled_patch16_224|64        |117.6045      |117.9616      |
|cuda|dla102                         |128       |182.3073      |182.2304      |
|cuda|dm_nfnet_f0                    |128       |133.6011      |133.6298      |
|cuda|dpn107                         |32        |148.5080      |148.5885      |
|cuda|eca_botnext26ts_256            |128       |113.8676      |113.1514      |
|cuda|eca_halonext26ts               |128       |119.2242      |119.1845      |
|cuda|ese_vovnet19b_dw               |128       |80.0217       |79.9438       |
|cuda|fbnetc_100                     |128       |91.4548       |91.4009       |
|cuda|fbnetv3_b                      |128       |115.4496      |115.5058      |
|cuda|gernet_l                       |128       |114.8365      |114.7870      |
|cuda|ghostnet_100                   |128       |58.5766       |58.5766       |
|cuda|gluon_inception_v3             |128       |165.5222      |165.7167      |
|cuda|gluon_xception65               |32        |165.8779      |165.7818      |
|cuda|gmixer_24_224                  |128       |116.3611      |113.4925      |
|cuda|gmlp_s16_224                   |128       |121.2607      |121.2534      |
|cuda|hrnet_w18                      |128       |246.5706      |246.7599      |
|cuda|inception_v3                   |128       |166.1096      |166.2034      |
|cuda|jx_nest_base                   |32        |93.6064       |93.4088       |
|cuda|lcnet_050                      |128       |21.4156       |21.4207       |
|cuda|levit_128                      |128       |27.2901       |27.2543       |
|cuda|mixer_b16_224                  |128       |157.8992      |158.2878      |
|cuda|mixnet_l                       |128       |197.3443      |197.2125      |
|cuda|mnasnet_100                    |128       |71.4604       |71.2997       |
|cuda|mobilenetv2_100                |128       |67.6080       |67.7515       |
|cuda|mobilenetv3_large_100          |128       |57.7224       |57.6591       |
|cuda|mobilevit_s                    |64        |93.0372       |93.0530       |
|cuda|nfnet_l0                       |128       |113.1664      |113.2853      |
|cuda|pit_b_224                      |64        |133.3333      |133.4153      |
|cuda|pnasnet5large                  |16        |238.9545      |238.8122      |
|cuda|poolformer_m36                 |64        |144.2353      |144.2375      |
|cuda|regnety_002                    |128       |32.8534       |32.9069       |
|cuda|repvgg_a2                      |128       |102.4150      |102.3827      |
|cuda|res2net101_26w_4s              |64        |120.8127      |120.8322      |
|cuda|res2net50_14w_8s               |128       |149.7052      |149.8969      |
|cuda|res2next50                     |128       |153.7439      |153.8215      |
|cuda|resmlp_12_224                  |128       |89.1918       |86.9226       |
|cuda|resnest101e                    |64        |159.4706      |159.3133      |
|cuda|rexnet_100                     |128       |88.0032       |88.0397       |
|cuda|sebotnet33ts_256               |64        |80.4635       |80.0120       |
|cuda|selecsls42b                    |128       |70.4430       |70.3663       |
|cuda|spnasnet_100                   |128       |78.0537       |78.1991       |
|cuda|swin_base_patch4_window7_224   |64        |212.9073      |213.0824      |
|cuda|swsl_resnext101_32x16d         |32        |193.0229      |193.0404      |
|cuda|tf_efficientnet_b0             |128       |97.1316       |97.0410       |
|cuda|tf_mixnet_l                    |128       |203.4956      |203.5340      |
|cuda|tinynet_a                      |128       |82.4038       |82.8733       |
|cuda|tnt_s_patch16_224              |128       |284.8576      |284.8867      |
|cuda|twins_pcpvt_base               |64        |118.3893      |119.2329      |
|cuda|visformer_small                |128       |126.0533      |126.0390      |
|cuda|vit_base_patch16_224           |64        |118.2873      |118.0573      |
|cuda|volo_d1_224                    |64        |108.7764      |108.2063      |
|cuda|xcit_large_24_p8_224           |5         |100.4656      |100.5209      |

v7 vs. v8 amp

|dev |name                           |batch_size|abs_latency_v7|abs_latency_v8|
|----|-------------------------------|----------|--------------|--------------|
|cuda|adv_inception_v3               |128       |104.9729      |105.1237      |
|cuda|beit_base_patch16_224          |64        |75.4330       |75.2039       |
|cuda|botnet26t_256                  |128       |74.5149       |74.8071       |
|cuda|cait_m36_384                   |4         |110.9788      |111.5170      |
|cuda|coat_lite_mini                 |128       |62.3618       |64.4965       |
|cuda|convit_base                    |64        |116.4054      |117.9129      |
|cuda|convmixer_768_32               |32        |264.4401      |264.4491      |
|cuda|convnext_base                  |64        |182.9009      |179.2136      |
|cuda|crossvit_9_240                 |128       |48.8586       |48.8359       |
|cuda|cspdarknet53                   |64        |80.0245       |80.0160       |
|cuda|deit_base_distilled_patch16_224|64        |66.5921       |66.7448       |
|cuda|dla102                         |128       |116.7780      |117.1683      |
|cuda|dm_nfnet_f0                    |128       |78.9322       |79.1135       |
|cuda|dpn107                         |32        |85.5206       |85.7514       |
|cuda|eca_botnext26ts_256            |128       |76.3672       |77.0050       |
|cuda|eca_halonext26ts               |128       |86.2458       |              |
|cuda|ese_vovnet19b_dw               |128       |43.2943       |43.3379       |
|cuda|fbnetc_100                     |128       |54.8479       |54.9251       |
|cuda|fbnetv3_b                      |128       |70.7504       |71.0188       |
|cuda|gernet_l                       |128       |66.1607       |66.0379       |
|cuda|ghostnet_100                   |128       |43.8882       |43.9336       |
|cuda|gluon_inception_v3             |128       |104.9297      |105.0204      |
|cuda|gluon_xception65               |32        |85.7118       |85.8370       |
|cuda|gmixer_24_224                  |128       |75.1214       |76.1170       |
|cuda|gmlp_s16_224                   |128       |76.4207       |76.6641       |
|cuda|hrnet_w18                      |128       |186.1326      |186.2435      |
|cuda|inception_v3                   |128       |105.0561      |105.0783      |
|cuda|jx_nest_base                   |32        |65.3066       |65.3245       |
|cuda|lcnet_050                      |128       |14.7991       |14.8687       |
|cuda|levit_128                      |128       |19.2893       |19.4772       |
|cuda|mixer_b16_224                  |128       |93.9826       |94.2056       |
|cuda|mixnet_l                       |128       |147.1245      |147.0435      |
|cuda|mnasnet_100                    |128       |39.1781       |39.2565       |
|cuda|mobilenetv2_100                |128       |42.3704       |42.3114       |
|cuda|mobilenetv3_large_100          |128       |37.2946       |37.2816       |
|cuda|mobilevit_s                    |64        |55.8930       |55.8934       |
|cuda|nfnet_l0                       |128       |64.0448       |64.4438       |
|cuda|pit_b_224                      |64        |80.6342       |80.2933       |
|cuda|pnasnet5large                  |16        |154.9611      |154.8654      |
|cuda|poolformer_m36                 |64        |101.7489      |101.8138      |
|cuda|regnety_002                    |128       |27.0939       |27.0309       |
|cuda|repvgg_a2                      |128       |60.9651       |61.2533       |
|cuda|res2net101_26w_4s              |64        |77.3291       |77.4739       |
|cuda|res2net50_14w_8s               |128       |93.6572       |93.7221       |
|cuda|res2next50                     |128       |112.4975      |112.3248      |
|cuda|resmlp_12_224                  |128       |59.5422       |60.7644       |
|cuda|resnest101e                    |64        |97.9894       |98.3358       |
|cuda|rexnet_100                     |128       |55.2218       |55.0718       |
|cuda|sebotnet33ts_256               |64        |60.4880       |60.8113       |
|cuda|selecsls42b                    |128       |41.4294       |41.5341       |
|cuda|spnasnet_100                   |128       |45.0037       |45.0304       |
|cuda|swin_base_patch4_window7_224   |64        |98.2561       |98.6925       |
|cuda|swsl_resnext101_32x16d         |32        |100.6179      |100.9195      |
|cuda|tf_efficientnet_b0             |128       |56.5344       |56.4591       |
|cuda|tf_mixnet_l                    |128       |153.0318      |152.9367      |
|cuda|tinynet_a                      |128       |54.1307       |53.9298       |
|cuda|tnt_s_patch16_224              |128       |142.4801      |142.6589      |
|cuda|twins_pcpvt_base               |64        |67.9027       |67.8325       |
|cuda|visformer_small                |128       |72.5589       |72.9427       |
|cuda|vit_base_patch16_224           |64        |71.4885       |71.7342       |
|cuda|volo_d1_224                    |64        |69.3539       |69.5910       |
|cuda|xcit_large_24_p8_224           |5         |59.9000       |59.9699       |

v7 vs. v8 float16
|dev |name                           |batch_size|abs_latency_v7|abs_latency_v8|
|----|-------------------------------|----------|--------------|--------------|
|cuda|adv_inception_v3               |128       |104.2544   |104.2677   |
|cuda|beit_base_patch16_224          |64        |85.3601    |85.3786    |
|cuda|botnet26t_256                  |128       |72.1476    |71.8277    |
|cuda|cait_m36_384                   |4         |108.3075   |108.5941   |
|cuda|coat_lite_mini                 |128       |61.2382    |61.6049    |
|cuda|convmixer_768_32               |32        |263.3818   |263.3598   |
|cuda|convnext_base                  |64        |172.6821   |173.8520   |
|cuda|crossvit_9_240                 |128       |44.6321    |44.6340    |
|cuda|cspdarknet53                   |64        |79.3165    |79.2964    |
|cuda|deit_base_distilled_patch16_224|64        |61.9816    |62.2109    |
|cuda|dla102                         |128       |115.7403   |115.9928   |
|cuda|dm_nfnet_f0                    |128       |77.5434    |77.7440    |
|cuda|dpn107                         |32        |83.6489    |83.5605    |
|cuda|eca_botnext26ts_256            |128       |73.9953    |74.1031    |
|cuda|eca_halonext26ts               |128       |81.7951    |81.7103    |
|cuda|ese_vovnet19b_dw               |128       |42.9618    |42.8853    |
|cuda|fbnetc_100                     |128       |54.3590    |54.3575    |
|cuda|fbnetv3_b                      |128       |69.7977    |70.1696    |
|cuda|gernet_l                       |128       |64.8684    |65.1726    |
|cuda|ghostnet_100                   |128       |43.2054    |43.1319    |
|cuda|gluon_inception_v3             |128       |104.1988   |104.3030   |
|cuda|gluon_xception65               |32        |84.2245    |84.5085    |
|cuda|gmixer_24_224                  |128       |82.0418    |82.7252    |
|cuda|gmlp_s16_224                   |128       |75.4792    |75.8374    |
|cuda|hrnet_w18                      |128       |184.1450   |184.1848   |
|cuda|inception_v3                   |128       |104.1203   |104.2536   |
|cuda|jx_nest_base                   |32        |58.2386    |58.4901    |
|cuda|lcnet_050                      |128       |14.6409    |14.5616    |
|cuda|levit_128                      |128       |22.3875    |22.4680    |
|cuda|mixer_b16_224                  |128       |98.9534    |98.4730    |
|cuda|mixnet_l                       |128       |146.1623   |146.1947   |
|cuda|mnasnet_100                    |128       |38.9208    |39.3463    |
|cuda|mobilenetv2_100                |128       |41.8946    |41.9847    |
|cuda|mobilenetv3_large_100          |128       |36.7810    |36.8264    |
|cuda|mobilevit_s                    |64        |55.3211    |55.3186    |
|cuda|nfnet_l0                       |128       |63.1302    |63.5544    |
|cuda|pit_b_224                      |64        |73.8752    |73.4602    |
|cuda|pnasnet5large                  |16        |151.6806   |151.6111   |
|cuda|poolformer_m36                 |64        |86.8341    |86.8021    |
|cuda|regnety_002                    |128       |26.6798    |26.5295    |
|cuda|repvgg_a2                      |128       |61.6652    |62.1482    |
|cuda|res2net101_26w_4s              |64        |75.8037    |75.7739    |
|cuda|res2net50_14w_8s               |128       |92.6362    |92.4338    |
|cuda|res2next50                     |128       |111.5371   |111.5832   |
|cuda|resmlp_12_224                  |128       |58.2349    |57.9807    |
|cuda|resnest101e                    |64        |96.1114    |96.2742    |
|cuda|rexnet_100                     |128       |54.8138    |54.7643    |
|cuda|sebotnet33ts_256               |64        |53.1524    |53.3823    |
|cuda|selecsls42b                    |128       |40.6070    |40.7104    |
|cuda|spnasnet_100                   |128       |44.5732    |44.4318    |
|cuda|swin_base_patch4_window7_224   |64        |98.6447    |98.8445    |
|cuda|swsl_resnext101_32x16d         |32        |97.0195    |97.2968    |
|cuda|tf_efficientnet_b0             |128       |56.0640    |56.0278    |
|cuda|tf_mixnet_l                    |128       |152.0958   |152.0874   |
|cuda|tinynet_a                      |128       |53.3694    |53.3762    |
|cuda|tnt_s_patch16_224              |128       |130.2981   |130.3726   |
|cuda|twins_pcpvt_base               |64        |62.5459    |62.6416    |
|cuda|visformer_small                |128       |68.8502    |69.1756    |
|cuda|vit_base_patch16_224           |64        |65.8587    |66.0285    |
|cuda|volo_d1_224                    |64        |64.5348    |64.6057    |

Pull Request resolved: https://github.com/pytorch/pytorch/pull/89022
Approved by: https://github.com/ngimel
2022-12-15 03:24:44 +00:00
Jesse Cai
de016b3799 [pruning][core][feature] Implement prune for structured pruning (#89777)
Summary:

This PR implements `prune` in BaseStructuredSparsifier:

`prune` is a function that takes in a model with structured sparsity parametrizations (the result of `prepare`) and returns a resized model with the masked-out weights removed.

`prune` is defined by a mapping from **patterns** to **pruning functions**:
- **patterns** are just sequences of operations, for example `(nn.Linear, activation, nn.Linear)`.
- **pruning functions** are functions that take a matched pattern as args and resize the appropriate layer sizes and weights:
  ```python
  def prune_linear_activation_linear(linear1, activation, linear2):
      pass
  ```
- This corresponds to one line in the pattern config: `(nn.Linear, activation, nn.Linear): prune_linear_activation_linear`.

At a high level `prune` works by finding instances of the graph that match different patterns and then calling the mapped pruning functions on those matched patterns.
This is unlike the previous code which attempted to do both at the same time.

There may be some gaps in the patterns compared to the previous implementation, but the conversion functionality support should be the same.

Currently we have pruning functions for the following patterns:
    - linear -> linear
    - linear -> activation -> linear
    - conv2d -> conv2d
    - conv2d -> activation -> conv2d
    - conv2d -> activation -> pool -> conv2d
    - conv2d -> pool -> activation -> conv2d
    - conv2d -> adaptive pool -> flatten -> linear

Also added MyPy type hints for `prune_functions`.
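
A hypothetical sketch of the pattern-to-function mapping described above (the names and the config shape here are illustrative, not the actual code in the PR):

```python
import torch.nn as nn

def prune_linear_linear(linear1: nn.Linear, linear2: nn.Linear) -> None:
    ...  # resize linear1's pruned output rows and linear2's matching inputs

def prune_linear_activation_linear(
    linear1: nn.Linear, activation: nn.Module, linear2: nn.Linear
) -> None:
    ...  # the same resize, skipping over the elementwise activation

PATTERNS = {
    (nn.Linear, nn.Linear): prune_linear_linear,
    (nn.Linear, nn.ReLU, nn.Linear): prune_linear_activation_linear,
}
```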

Pull Request resolved: https://github.com/pytorch/pytorch/pull/89777
Approved by: https://github.com/vkuzo
2022-12-08 07:13:24 +00:00
PyTorch MergeBot
1b1301f16a Revert "[pruning][core][feature] Implement prune for structured pruning (#89777)"
This reverts commit 3531e44307.

Reverted https://github.com/pytorch/pytorch/pull/89777 on behalf of https://github.com/clee2000 due to breaking test_ao_sparsity due to import 3531e44307 https://github.com/pytorch/pytorch/actions/runs/3641476330/jobs/6147830487, probably a landrace with 824641b083860df4d7ffef06a798ea2702bc4bde?
2022-12-07 19:41:15 +00:00
Jesse Cai
3531e44307 [pruning][core][feature] Implement prune for structured pruning (#89777)
Summary:

This PR implements `prune` in BaseStructuredSparsifier:

`prune` is a function that takes in a model with structured sparsity parametrizations (the result of `prepare`) and returns a resized model with the masked-out weights removed.

`prune` is defined by a mapping from **patterns** to **pruning functions**:
- **patterns** are just sequences of operations, for example `(nn.Linear, activation, nn.Linear)`.
- **pruning functions** are functions that take a matched pattern as args and resize the appropriate layer sizes and weights:
  ```python
  def prune_linear_activation_linear(linear1, activation, linear2):
      pass
  ```
- This corresponds to one line in the pattern config: `(nn.Linear, activation, nn.Linear): prune_linear_activation_linear`.

At a high level `prune` works by finding instances of the graph that match different patterns and then calling the mapped pruning functions on those matched patterns.
This is unlike the previous code which attempted to do both at the same time.

There may be some gaps in the patterns compared to the previous implementation, but the conversion functionality support should be the same.

Currently we have pruning functions for the following patterns:
    - linear -> linear
    - linear -> activation -> linear
    - conv2d -> conv2d
    - conv2d -> activation -> conv2d
    - conv2d -> activation -> pool -> conv2d
    - conv2d -> pool -> activation -> conv2d
    - conv2d -> adaptive pool -> flatten -> linear

Also added MyPy type hints for `prune_functions`.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/89777
Approved by: https://github.com/vkuzo
2022-12-07 17:52:01 +00:00
Jesse Cai
9a1c6fd506 [pruning][core][feature] Align BaseStructuredPruner with existing pruning flow (#88436)
Summary:

This PR aligns the "eager" mode of the structured pruning flow with the existing unstructured pruning flow.

The base pruner has been moved and renamed from BasePruner to BaseStructuredPruner:
`torch/ao/pruning/_experimental/pruner/base_pruner.py -> torch/ao/pruning/_experimental/pruner/base_structured_pruner.py`

Support for pruning batchnorm modules in the config has been removed, so the structured pruning code can now use more of the BaseSparsifier logic and we don't need to override as many functions.

Since we aim to support only a single flow, we have only updated ZeroesParametrizations (FakeStructuredSparsity) and BiasHook.
The parametrizations have also been rewritten to use a bool mask tensor for keeping track of pruned rows, instead of the sets used before.
This better aligns structured and unstructured sparsity (see the sketch below).
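
A simplified sketch of a bool-mask parametrization of this kind (the real FakeStructuredSparsity lives under `torch.ao.pruning._experimental` and differs; shapes here assume a 2-D linear weight, and the class name is illustrative):

```python
import torch
from torch import nn
from torch.nn.utils import parametrize

class RowMaskSparsity(nn.Module):
    def __init__(self, mask: torch.Tensor):
        super().__init__()
        self.register_buffer("mask", mask)  # one bool per output row

    def forward(self, weight: torch.Tensor) -> torch.Tensor:
        # Zero out pruned rows (mask == False) without resizing the weight.
        return weight * self.mask[:, None]

linear = nn.Linear(4, 8)
mask = torch.ones(8, dtype=torch.bool)
mask[0] = False  # mark the first output row as pruned
parametrize.register_parametrization(linear, "weight", RowMaskSparsity(mask))
```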

The BaseStructuredSparsifier tests have also been updated to reflect the above changes. I also removed `squash_mask` tests because they were breaking CI and `squash_mask` is no longer used.

We will migrate the structured pruning code out of this folder in a later PR.

Test Plan:
```
python test/test_ao_sparsity.py -- TestBaseStructuredPruner
```

Reviewers:
z-a-f vkuzo

Pull Request resolved: https://github.com/pytorch/pytorch/pull/88436
Approved by: https://github.com/vkuzo
2022-12-03 00:53:53 +00:00