pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

Author	SHA1	Message	Date
xiorcale	6a37582162	Fix misleading doc string in quint8.h (#48418) Summary: The doc string suggested that `quint8` is for signed 8-bit values. Pull Request resolved: https://github.com/pytorch/pytorch/pull/48418 Reviewed By: ngimel Differential Revision: D25181705 Pulled By: mrshenli fbshipit-source-id: 70e151b6279fef75505f80a7b0cd50032b4f1008	2020-11-25 20:48:39 -08:00
Igor Sugak	f1e89fbe53	[pytorch] add missing host-device attribute to fix clang build (#37358) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37358 Test Plan: ```lang=bash buck build mode/opt -c fbcode.cuda_use_clang=true //vision/fair/detectron2/tools:benchmark ``` Reviewed By: ngimel Differential Revision: D21262235 fbshipit-source-id: 00633352d87da0881b2cc90759265fa0d0bd96be	2020-04-27 18:24:20 -07:00
Jerry Zhang	385165ec67	[reland][quant] QuantizedCUDA implementation (#36936) (#37081) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/37081 Closes https://github.com/pytorch/pytorch/issues/30813 Relanding of https://github.com/pytorch/pytorch/pull/35463 1. Tensor quantization logic (quantize_) is moved to aten/native/quantized. Previously all logic for tensor quantization lived in the aten/quantized/Quantizer.cpp file, which had started to become complicated and hard to read. This problem should be addressed in a refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized. 2. Requirements to run CUDA_tensor_apply were eased to process any tensor that lives on the CUDA device (QuantizedCUDA included). 3. All quantized data types now have a default constructor. NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them. 4. Minor changes in many files to register the QuantizedCUDA backend. 5. test_quantized_tensor is extended to process the QuantizedCUDA backend where possible. Test Plan: Imported from OSS Differential Revision: D21206694 Pulled By: jerryzh168 fbshipit-source-id: c7433aad9c095a34c57e6dddd128b5c5d9292373	2020-04-24 10:21:59 -07:00
Mike Ruberry	4bbc49f53a	Revert D21143025: [reland][quant] QuantizedCUDA implementation Test Plan: revert-hammer Differential Revision: D21143025 Original commit changeset: 11405e2e8f87 fbshipit-source-id: ce471ec95c1fc6abff6d1bbdba11bef02f3a0d62	2020-04-21 20:36:12 -07:00
Jerry Zhang	97d3a8495d	[reland][quant] QuantizedCUDA implementation (#36936) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/36936 Closes https://github.com/pytorch/pytorch/issues/30813 Relanding of https://github.com/pytorch/pytorch/pull/35463 1. Tensor quantization logic (quantize_) is moved to aten/native/quantized. Previously all logic for tensor quantization lived in the aten/quantized/Quantizer.cpp file, which had started to become complicated and hard to read. This problem should be addressed in a refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized. 2. Requirements to run CUDA_tensor_apply were eased to process any tensor that lives on the CUDA device (QuantizedCUDA included). 3. All quantized data types now have a default constructor. NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them. 4. Minor changes in many files to register the QuantizedCUDA backend. 5. test_quantized_tensor is extended to process the QuantizedCUDA backend where possible. Test Plan: Imported from OSS Differential Revision: D21143025 Pulled By: jerryzh168 fbshipit-source-id: 11405e2e8f87e48fadc0a084c51db15f85ccb500	2020-04-21 13:18:52 -07:00
Alban Desmaison	49b10c58a3	Revert D20896697: [pytorch][PR] QuantizedCUDA implementation Test Plan: revert-hammer Differential Revision: D20896697 Original commit changeset: 163554efa23d fbshipit-source-id: e3e370ef7c8be68ea34368dfcc7a7efc9d1f8761	2020-04-19 12:41:51 -07:00
Aleksandr Fedorov	f6daa6220e	QuantizedCUDA implementation (#35463) Summary: Closes https://github.com/pytorch/pytorch/issues/30813 1. Tensor quantization logic (quantize_) is moved to aten/native/quantized. Previously all logic for tensor quantization lived in the aten/quantized/Quantizer.cpp file, which had started to become complicated and hard to read. This problem should be addressed in a refactoring PR. Still, I reworked this partially because I had to add tensor quantization logic for CUDA, and it was natural to move everything to aten/native/quantized. 2. Requirements to run CUDA_tensor_apply were eased to process any tensor that lives on the CUDA device (QuantizedCUDA included). 3. All quantized data types now have a default constructor. NVCC refuses to compile any gpu_kernel or CUDA_tensor_apply* without them. 4. Minor changes in many files to register the QuantizedCUDA backend. 5. test_quantized_tensor is extended to process the QuantizedCUDA backend where possible. Pull Request resolved: https://github.com/pytorch/pytorch/pull/35463 Differential Revision: D20896697 Pulled By: jerryzh168 fbshipit-source-id: 163554efa23d11a2b10bbc2492439db4798eb26b	2020-04-19 08:33:16 -07:00
Jerry Zhang	220e6894c5	Rename qint8 data type (#19932) Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/19932 In preparation to add int8_t data type for QTensor Reviewed By: zafartahirov Differential Revision: D15137838 fbshipit-source-id: 59462c36d6fc5982986d4196bf3f32f49bb294d7	2019-05-16 18:09:28 -07:00

8 Commits
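
The commits above touch three properties of the quantized 8-bit type: it wraps an unsigned value, it has a default constructor, and its constructors carry a host-device attribute. The following is a minimal sketch of what such a type could look like, not the contents of quint8.h. The `sketch` namespace and the `SKETCH_HOST_DEVICE` macro are hypothetical names introduced for illustration, and the macro expands to nothing when the file is not compiled by a CUDA compiler.

```cpp
#include <cstdint>
#include <type_traits>

// Hypothetical stand-in for a host/device attribute macro (an assumption,
// not the macro used in the repository). Outside a CUDA compiler it is empty.
#if defined(__CUDACC__)
#define SKETCH_HOST_DEVICE __host__ __device__
#else
#define SKETCH_HOST_DEVICE
#endif

namespace sketch {

// Illustrative quantized type wrapping an UNSIGNED 8-bit integer.
struct quint8 {
  using underlying = std::uint8_t;
  std::uint8_t val_;

  // Default constructor: the commit messages state that NVCC refuses to
  // compile the GPU kernels for quantized types without one.
  SKETCH_HOST_DEVICE quint8() : val_(0) {}

  // Explicit constructor from the raw storage value.
  SKETCH_HOST_DEVICE explicit quint8(std::uint8_t val) : val_(val) {}
};

// Compile-time checks of the three properties named in the lead-in.
static_assert(std::is_unsigned<quint8::underlying>::value,
              "underlying storage is unsigned");
static_assert(std::is_default_constructible<quint8>::value,
              "type has a default constructor");
static_assert(sizeof(quint8) == 1, "type occupies a single byte");

}  // namespace sketch
```

With this sketch, `sketch::quint8()` holds 0 and `sketch::quint8(200)` holds 200, a value a signed 8-bit type could not represent.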