pytorch

mirror of https://github.com/zebrajr/pytorch.git synced 2025-12-07 12:21:27 +01:00

History

Mark Santaniello 1a99689d71 [caffe2] Fix preprocessor checks for FMA Summary: I think this preprocessor check is incorrect. The fused multiply-add (FMA) instructions are not part of AVX2. Test Plan: CI Reviewed By: jspark1105 Differential Revision: D24237836 fbshipit-source-id: 44f9b9179918332eb85ac087827726300f56224e		2020-10-11 11:48:32 -07:00
..
__init__.py	remediation of S205607	2020-07-17 17:19:47 -07:00
adagrad_avx2.cc	implement L2 regularization for Adagrad in caffe2 and dper (#37705 )	2020-05-03 10:42:49 -07:00
adagrad.cc	implement L2 regularization for Adagrad in caffe2 and dper (#37705 )	2020-05-03 10:42:49 -07:00
adagrad.h	[caffe2] Fix preprocessor checks for FMA	2020-10-11 11:48:32 -07:00
CMakeLists.txt	Fix AVX detection with clang-cl (#35653 )	2020-03-30 07:53:37 -07:00
common_avx.cc
common_avx2.cc
common_avx512.cc
common.h	[caffe2] Use cpuinfo in perfkernels to simplify build dependency (#36371 )	2020-04-10 13:26:34 -07:00
cvtsh_ss_bugfix.h	Avoid MSVC _cvtsh_ss() workaround with clang-cl (#29726 )	2019-11-13 12:49:13 -08:00
embedding_lookup_avx2.cc	change the epilogue of SLS to match the simd section (#21439 )	2019-06-05 18:41:55 -07:00
embedding_lookup_fused_8bit_rowwise_avx2.cc	change the epilogue of SLS to match the simd section (#21439 )	2019-06-05 18:41:55 -07:00
embedding_lookup_fused_8bit_rowwise_idx_avx2.cc	[pytorch][embeddingbag_8bit] Add include_last_offset option to Fused 8bit EmbeddingBag and parallelize the op (#32683 )	2020-01-29 16:04:56 -08:00
embedding_lookup_idx_avx2.cc	[pytorch][embeddingbag] Parallelize the EmbeddingBag operator (#4049 )	2020-01-23 21:29:44 -08:00
embedding_lookup_idx.cc	[pytorch][embeddingbag] Parallelize the EmbeddingBag operator (#4049 )	2020-01-23 21:29:44 -08:00
embedding_lookup_idx.h	[pytorch][embeddingbag] Parallelize the EmbeddingBag operator (#4049 )	2020-01-23 21:29:44 -08:00
embedding_lookup.cc	remove dependency to caffe2::math and eigen (#21169 )	2019-05-31 11:55:16 -07:00
embedding_lookup.h	minor comment fix (#22140 )	2019-08-08 21:08:47 -07:00
fused_8bit_rowwise_embedding_lookup_idx.cc	[pytorch][embeddingbag_8bit] Add include_last_offset option to Fused 8bit EmbeddingBag and parallelize the op (#32683 )	2020-01-29 16:04:56 -08:00
fused_8bit_rowwise_embedding_lookup_idx.h	fix comment index_size->output_size (#29831 )	2019-11-16 01:49:02 -08:00
fused_8bit_rowwise_embedding_lookup.cc	remove dependency to caffe2::math and eigen (#21169 )	2019-05-31 11:55:16 -07:00
fused_8bit_rowwise_embedding_lookup.h	fix comment index_size->output_size (#29831 )	2019-11-16 01:49:02 -08:00
fused_nbit_rowwise_conversion.cc	[caffe2] Replace embedding conversion ops with fbgemm functions (#44843 )	2020-09-22 11:57:01 -07:00
fused_nbit_rowwise_conversion.h	[caffe2] optimize 2/4-bit row-wise quantization (#387 )	2020-06-19 21:28:31 -07:00
hp_emblookup_codegen.py	Remove __future__ imports for legacy Python2 supports (#45033 )	2020-09-23 17:57:02 -07:00
lstm_unit_cpu_avx2.cc	[caffe2] Explicit vectorization of LSTM operator (#35556 )	2020-04-01 17:19:56 -07:00
lstm_unit_cpu_common.cc	[caffe2] Explicit vectorization of LSTM operator (#35556 )	2020-04-01 17:19:56 -07:00
lstm_unit_cpu_common.h	[caffe2] Explicit vectorization of LSTM operator (#35556 )	2020-04-01 17:19:56 -07:00
lstm_unit_cpu-impl.h	[caffe2] Explicit vectorization of LSTM operator (#35556 )	2020-04-01 17:19:56 -07:00
lstm_unit_cpu.h	[caffe2] Explicit vectorization of LSTM operator (#35556 )	2020-04-01 17:19:56 -07:00
math_cpu_avx2.cc
math_cpu_base.cc
math.h
typed_axpy_avx.cc	remove dependency to caffe2::math and eigen (#21169 )	2019-05-31 11:55:16 -07:00
typed_axpy_avx2.cc	remove dependency to caffe2::math and eigen (#21169 )	2019-05-31 11:55:16 -07:00
typed_axpy.cc	remove dependency to caffe2::math and eigen (#21169 )	2019-05-31 11:55:16 -07:00
typed_axpy.h