pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Andrew Or 782f3489c6 [Quant][fx][bc-breaking] Integrate BackendConfig with quantization flow (part 2) (#82557 ) This is part 2 of the effort to replace `backend_config_dict` with a python config object, a more formal and robust API that leads to better user experience. This commit integrates the `BackendConfig` implemented in part 1 (https://github.com/pytorch/pytorch/pull/81469) with the existing FX graph mode quantization flow. Test Plan: python test/test_quantization.py TestQuantizeFx python test/test_quantization.py TestQuantizeFxOps BC-breaking Notes: Before: ``` import torch from torch.ao.quantization import get_default_qconfig_mapping from torch.ao.quantization.backend_config import ObservationType from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx dtype_config = { "input_dtype": torch.quint8, "output_dtype": torch.quint8 "weight_dtype": torch.qint8, "bias_dtype": torch.float, } backend_config_dict = { "name": "my_backend", "configs": [{ "pattern": torch.nn.Linear, "observation_type": ObservationType.OUTPUT_USE_DIFFERENT_OBSERVER_AS_INPUT, "dtype_configs": [dtype_config], "root_module": torch.nn.Linear, "reference_quantized_module": torch.nn.quantized._reference.Linear, "qat_module": torch.nn.qat.Linear, }] } m = MyModel() qconfig_mapping = get_default_qconfig_mapping() example_inputs = (torch.rand(3, 3),) m = prepare_fx( m, qconfig_mapping, example_inputs, backend_config_dict=backend_config_dict) m = convert_fx(m, backend_config_dict=backend_config_dict) ``` After: ``` import torch from torch.ao.quantization import get_default_qconfig_mapping from torch.ao.quantization.backend_config import ( BackendConfig, BackendPatternConfig, DTypeConfig, ObservationType, ) from torch.ao.quantization.quantize_fx import prepare_fx, convert_fx dtype_config = DTypeConfig( input_dtype=torch.quint8, output_dtype=torch.quint8 weight_dtype=torch.qint8, bias_dtype=torch.float, ) backend_config = BackendConfig("my_backend").set_backend_pattern_config( BackendPatternConfig(torch.nn.Linear) .set_observation_type(ObservationType.OUTPUT_USE_DIFFERENT_OBSERVER_AS_INPUT) .add_dtype_config(dtype_config) .set_root_module(torch.nn.Linear) .set_reference_quantized_module(torch.nn.quantized._reference.Linear) .set_qat_module(torch.nn.qat.Linear)) m = MyModel() qconfig_mapping = get_default_qconfig_mapping() example_inputs = (torch.rand(3, 3),) m = prepare_fx(m, qconfig_mapping, example_inputs, backend_config=backend_config) m = convert_fx(m, backend_config=backend_config) ``` Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Differential Revision: [D38471932](https://our.internmc.facebook.com/intern/diff/D38471932) Pull Request resolved: https://github.com/pytorch/pytorch/pull/82557 Approved by: https://github.com/jerryzh168		2022-08-08 18:55:50 +00:00
..
_static
_templates	Fix left nav (#78552 )	2022-06-01 00:49:53 +00:00
community	[ONNX] Update merge rules and persons of interest (#82673 )	2022-08-04 18:11:41 +00:00
elastic
notes	Improve autograd custom function docs (#81340 )	2022-07-21 19:54:30 +00:00
rpc
scripts	[ONNX] Clean up `onnx_supported_ops` (#79424 )	2022-06-23 20:44:51 +00:00
amp.rst	Remove operators that support BFloat16 in the fp32 cast policy list of AutocastCPU (#77623 )	2022-05-17 16:49:17 +00:00
autograd.rst	Enable Intel® VTune™ Profiler's Instrumentation and Tracing Technology APIs (ITT) to PyTorch (#63289 )	2022-07-13 13:50:15 +00:00
backends.rst	Update backends.rst (#82525 )	2022-08-03 18:33:15 +00:00
benchmark_utils.rst
bottleneck.rst	Enable Intel® VTune™ Profiler's Instrumentation and Tracing Technology APIs (ITT) to PyTorch (#63289 )	2022-07-13 13:50:15 +00:00
checkpoint.rst
complex_numbers.rst	Add a note on CUDA 11.6 (#80363 )	2022-06-27 21:34:24 +00:00
conf.py	[Quant][fx][bc-breaking] Integrate BackendConfig with quantization flow (part 2) (#82557 )	2022-08-08 18:55:50 +00:00
config_mod.rst
cpp_extension.rst
cpp_index.rst
cuda.rst	Python Jiterator supports multiple outputs (#78139 )	2022-05-24 21:52:56 +00:00
cudnn_persistent_rnn.rst
cudnn_rnn_determinism.rst
data.rst	[DataLoader] Minor documentation improvement	2022-05-31 15:59:46 +00:00
ddp_comm_hooks.rst	Fix two small typos in ddp_comm_hooks.rst (#82047 )	2022-07-23 19:10:57 +00:00
deploy.rst	Back out "Back out "[torch deploy] Update deploy.rst with working simple example"" (#76713 )	2022-05-03 14:12:18 +00:00
distributed.algorithms.join.rst
distributed.elastic.rst
distributed.optim.rst
distributed.rst	Add `TORCH_CPP_LOG_LEVEL` to the docs	2022-05-03 17:01:11 +00:00
distributions.rst
dlpack.rst
docutils.conf
fft.rst
fsdp.rst
futures.rst
fx.rst	CSE Pass and common pass Tests (#81742 )	2022-07-22 03:45:09 +00:00
hub.rst
index.rst	Add docs for Python Registration	2022-06-13 23:21:23 +00:00
jit_builtin_functions.rst
jit_language_reference_v2.rst
jit_language_reference.rst
jit_python_reference.rst
jit_unsupported.rst
jit_utils.rst	Create __init__.py (#78629 )	2022-06-03 18:14:21 +00:00
jit.rst	torch.jit doc link for nvfuser readme.md (#77780 )	2022-07-07 23:25:35 +00:00
library.rst	Add docs for Python Registration	2022-06-13 23:21:23 +00:00
linalg.rst	[Array API] Add linalg.vecdot (#70542 )	2022-07-12 14:28:54 +00:00
math-quantizer-equation.png
mobile_optimizer.rst
model_zoo.rst
monitor.rst
multiprocessing.rst
name_inference.rst
named_tensor.rst	Add torch.unflatten and improve its docs (#81399 )	2022-07-29 15:02:42 +00:00
nested.rst	Update NestedTensor docs (#80963 )	2022-07-07 22:15:39 +00:00
nn.functional.rst	Add Dropout1d module	2022-06-15 14:39:07 +00:00
nn.init.rst	update nn.init doc to reflect the no_grad (#80882 )	2022-07-07 17:19:29 +00:00
nn.rst	Add Dropout1d module	2022-06-15 14:39:07 +00:00
onnx_supported_aten_ops.rst	Add list of supported ATen ops by ONNX converter into torch.onnx page	2022-04-07 00:05:44 +00:00
onnx.rst	Revert "[ONNX] Type utility for converting among JIT, torch and ONNX data types (#81953 )"	2022-08-07 20:15:28 +00:00
optim.rst	Remove misleading statement in optim.Optimizer docs (#76967 )	2022-05-10 14:39:53 +00:00
package.rst	Update package.rst to not include hermetic claim (#81019 )	2022-07-07 18:40:55 +00:00
pipeline.rst
profiler.rst
quantization-accuracy-debugging.rst	quant docs: best practices for quantization accuracy debugging	2022-05-17 12:16:52 +00:00
quantization-backend-configuration.rst	quantization: autogenerate quantization backend configs for documentation (#75126 )	2022-04-04 22:22:30 +00:00
quantization-support.rst	[quant] Quantizable documentation (#79957 )	2022-06-24 16:55:15 +00:00
quantization.rst	corrects typo in quantization docs (#81687 )	2022-07-21 00:17:13 +00:00
random.rst
rpc.rst
sparse.rst	Revise sparse docs regarding Sparse Compressed tensors (#82108 )	2022-07-29 18:15:09 +00:00
special.rst	torch.special.scaled_modified_bessel_k0 (#78900 )	2022-06-29 14:53:37 +00:00
storage.rst	Rename `_Typed/_UntypedStorage` to `Typed/UntypedStorage` and update docs (#82438 )	2022-07-30 19:37:08 +00:00
tensor_attributes.rst
tensor_view.rst
tensorboard.rst
tensors.rst	Revise sparse docs regarding Sparse Compressed tensors (#82108 )	2022-07-29 18:15:09 +00:00
testing.rst	Fix links in `torch.testing` docs (#80353 )	2022-07-11 19:15:53 +00:00
torch.ao.ns._numeric_suite_fx.rst
torch.ao.ns._numeric_suite.rst
torch.overrides.rst	Revert "Revert "Implement sym_sizes to create proper IR for sym ints representing tensor sizes (#76836 )""	2022-05-18 18:40:57 +00:00
torch.rst	Add torch.unflatten and improve its docs (#81399 )	2022-07-29 15:02:42 +00:00
type_info.rst	ENH: Convert finfo.tiny to finfo.smallest_normal (#76292 )	2022-05-20 00:59:48 +00:00