pytorch

OSSForks/pytorch

Fork 0

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Commit Graph

Author	SHA1	Message	Date
Jongsoo Park	964732fa8d	use fbgemm gconv in dnnlowp (#16020 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/16020 Needs to go over more iterations. For conv, I think we need a high level interface that abstracts out low-level details of which code path will be taken (acc16, outlier-aware, depth-wise, group conv, ...) otherwise the client code will be complex as can be seen from DNNLOWP Conv ops. This will also help us to make interface more stable. Reviewed By: dskhudia, jianyuh Differential Revision: D13588996 fbshipit-source-id: 9afce9e441bcaf20437fcc2874fb9d4165a46bcb	2019-01-15 00:02:31 -08:00
Jongsoo Park	b039a715ce	pre-pack operation of dnnlowp conv with 16-bit accumulation (#14881 ) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/14881 This diff allows us to pre-quantize and pre-pack weight matrix used in DNNLOWP_ACC16 . The intended use pattern is run Int8ConvPackWeight in init_net that generates a packed weight and Int8Conv with DNNLOWP_ACC16 engine uses the the packed weight. Reviewed By: csummersea Differential Revision: D13374662 fbshipit-source-id: dd02b9a4eb7af1fe208aa857fcd0b445e6e395af	2018-12-10 01:08:21 -08:00

Author

SHA1

Message

Date

Jongsoo Park

964732fa8d

use fbgemm gconv in dnnlowp (#16020 )

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/16020

Needs to go over more iterations. For conv, I think we need a high level interface that abstracts out low-level details of which code path will be taken (acc16, outlier-aware, depth-wise, group conv, ...) otherwise the client code will be complex as can be seen from DNNLOWP Conv ops. This will also help us to make interface more stable.

Reviewed By: dskhudia, jianyuh

Differential Revision: D13588996

fbshipit-source-id: 9afce9e441bcaf20437fcc2874fb9d4165a46bcb

2019-01-15 00:02:31 -08:00

Jongsoo Park

b039a715ce

pre-pack operation of dnnlowp conv with 16-bit accumulation (#14881 )

Summary:
Pull Request resolved: https://github.com/pytorch/pytorch/pull/14881

This diff allows us to pre-quantize and pre-pack weight matrix used in DNNLOWP_ACC16 .
The intended use pattern is run Int8ConvPackWeight in init_net that generates a packed weight and Int8Conv with DNNLOWP_ACC16 engine uses the the packed weight.

Reviewed By: csummersea

Differential Revision: D13374662

fbshipit-source-id: dd02b9a4eb7af1fe208aa857fcd0b445e6e395af

2018-12-10 01:08:21 -08:00

2 Commits