Manojkumar Bhosale 83f460be33 Add MSA optimized ARGB Multiply/Add/Subtract row functions
R=fbarchard@google.com
BUG=libyuv:634

Performance Gain (vs C vectorized)
ARGBMultiplyRow_MSA       - 1.4x
ARGBAddRow_MSA            - 8.6x
ARGBSubtractRow_MSA       - 8.6x

ARGBMultiplyRow_Any_MSA   - 1.35x
ARGBAddRow_Any_MSA        - 7.3x
ARGBSubtractRow_Any_MSA   - 7.2x

Performance Gain (vs C non-vectorized)
ARGBMultiplyRow_MSA       - 4.4x
ARGBAddRow_MSA            - 27x
ARGBSubtractRow_MSA       - 22x

ARGBMultiplyRow_Any_MSA   - 3.5x
ARGBAddRow_Any_MSA        - 23x
ARGBSubtractRow_Any_MSA   - 18x

Review URL: https://codereview.chromium.org/2529983002 .
2016-12-02 15:21:10 +05:30
..
compare_common.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare_neon64.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_neon.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_from_argb.cc Add MSA optimized ARGBToRGB565Row_MSA, ARGBToARGB1555Row_MSA, ARGBToARGB4444Row_MSA, ARGBToUV444Row_MSA functions 2016-11-22 10:47:55 -08:00
convert_from.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_jpeg.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_i420.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
cpu_id.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_decoder.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_validate.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
planar_functions.cc Add MSA optimized ARGB Multiply/Add/Subtract row functions 2016-12-02 15:21:10 +05:30
rotate_any.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_any.cc Add MSA optimized ARGB Multiply/Add/Subtract row functions 2016-12-02 15:21:10 +05:30
row_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_gcc.cc clang-format row_gcc.cc with some functions disabled 2016-11-07 18:37:29 -08:00
row_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_msa.cc Add MSA optimized ARGB Multiply/Add/Subtract row functions 2016-12-02 15:21:10 +05:30
row_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_any.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
video_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00