Commit Graph

7 Commits

Author SHA1 Message Date
samdow
0a7d6f34b0 expanded weights: instance norm faster rule
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70141

Approved by: https://github.com/zou3519
2022-04-19 19:40:09 +00:00
samdow
72f7193f4d expanded weights: group norm faster rule
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73696

Approved by: https://github.com/zou3519
2022-03-31 20:06:54 +00:00
samdow
8b8f3e836b expanded weights: layer norm faster rule
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73694

Approved by: https://github.com/zou3519
2022-03-31 19:10:43 +00:00
samdow
fc47257b30 expanded weights: embedding faster rule
Pull Request resolved: https://github.com/pytorch/pytorch/pull/73693

Approved by: https://github.com/zou3519
2022-03-29 21:28:17 +00:00
Samantha Andow
78e17eaadc expanded weights: conv faster rule (#73692)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/73692

Test Plan: Imported from OSS

Reviewed By: bdhirsh

Differential Revision: D34719302

Pulled By: samdow

fbshipit-source-id: 2288320a5f5d6a442da78e9fbe722f300b844be9
(cherry picked from commit a4cf23383c16d3c61d53e9d21f426259d2dc2d37)
2022-03-10 04:06:08 +00:00
Philip Meier
0973c5a1cc align signature of make_tensor with other creation ops (#72702)
Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/72702

Test Plan: Imported from OSS

Reviewed By: mrshenli

Differential Revision: D34457729

Pulled By: mruberry

fbshipit-source-id: 83d580c4201eef946dc9cf4b9e28a3d36be55609
(cherry picked from commit aa4cf20fbeb4b795595729b8ac2e6ba7707d8283)
2022-02-25 06:30:31 +00:00
Samantha Andow
53faf78143 expanded weights without fast rules (#70140)
Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/70140

[Design Doc for Expanded Weights](https://gist.github.com/samdow/fa0a164fec7963f93ff45284989cfc55) <-- gives an overview of the design for Expanded Weights

Introduces the ExpandedWeights mechanism and user-facing API without any custom implemented, faster rules.
 - User facing API is in `_stateless.py` (with documentation)
 - Testing is in test_expanded_weights
 - The rest is the implementation of the erroring fallback + the mechanism for being able to register faster per sample grad rules. Only linear is implemented here, but they are all implemented in #70141

Test Plan: Imported from OSS

Reviewed By: mikaylagawarecki

Differential Revision: D34350950

Pulled By: samdow

fbshipit-source-id: 69c664b0bc3dff6951358d79d7e5d94882f7aef2
(cherry picked from commit ae1620d3b6)
2022-02-22 20:35:16 +00:00