Manojkumar Bhosale 288bfbefb5 Add MSA optimized remaining scale row functions
R=fbarchard@google.com
BUG=libyuv:634

Performance Gain (vs C vectorized)
ScaleRowDown2_MSA            - ~22.3x
ScaleRowDown2_Any_MSA        - ~19.9x
ScaleRowDown2Linear_MSA      - ~31.2x
ScaleRowDown2Linear_Any_MSA  - ~29.4x
ScaleRowDown2Box_MSA         - ~20.1x
ScaleRowDown2Box_Any_MSA     - ~19.6x
ScaleRowDown4_MSA            - ~11.7x
ScaleRowDown4_Any_MSA        - ~11.2x
ScaleRowDown4Box_MSA         - ~15.1x
ScaleRowDown4Box_Any_MSA     - ~15.1x
ScaleRowDown38_MSA           - ~1x
ScaleRowDown38_Any_MSA       - ~1x
ScaleRowDown38_2_Box_MSA     - ~1.7x
ScaleRowDown38_2_Box_Any_MSA - ~1.7x
ScaleRowDown38_3_Box_MSA     - ~1.7x
ScaleRowDown38_3_Box_Any_MSA - ~1.7x
ScaleAddRow_MSA              - ~1.2x
ScaleAddRow_Any_MSA          - ~1.15x

Performance Gain (vs C non-vectorized)
ScaleRowDown2_MSA            - ~22.4x
ScaleRowDown2_Any_MSA        - ~19.8x
ScaleRowDown2Linear_MSA      - ~31.6x
ScaleRowDown2Linear_Any_MSA  - ~29.4x
ScaleRowDown2Box_MSA         - ~20.1x
ScaleRowDown2Box_Any_MSA     - ~19.6x
ScaleRowDown4_MSA            - ~11.7x
ScaleRowDown4_Any_MSA        - ~11.2x
ScaleRowDown4Box_MSA         - ~15.1x
ScaleRowDown4Box_Any_MSA     - ~15.1x
ScaleRowDown38_MSA           - ~3.2x
ScaleRowDown38_Any_MSA       - ~3.2x
ScaleRowDown38_2_Box_MSA     - ~2.4x
ScaleRowDown38_2_Box_Any_MSA - ~2.3x
ScaleRowDown38_3_Box_MSA     - ~2.9x
ScaleRowDown38_3_Box_Any_MSA - ~2.8x
ScaleAddRow_MSA              - ~8x
ScaleAddRow_Any_MSA          - ~7.46x

Review-Url: https://codereview.chromium.org/2559683002
2016-12-21 13:39:44 +05:30
compare_common.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare_neon64.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_neon.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_argb.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
convert_from_argb.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
convert_from.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
convert_jpeg.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_i420.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert.cc Android420ToI420 - use ptrdiff_t for difference of u and v pointers 2016-12-14 11:53:55 -08:00
cpu_id.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_decoder.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_validate.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
planar_functions.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
rotate_any.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
rotate_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_msa.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
rotate_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
row_any.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
row_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_gcc.cc clang-format row_gcc.cc with some functions disabled 2016-11-07 18:37:29 -08:00
row_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_msa.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
row_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_any.cc Add MSA optimized remaining scale row functions 2016-12-21 13:39:44 +05:30
scale_argb.cc Add MSA optimized ARGB scaling functions 2016-12-07 11:47:15 +05:30
scale_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_msa.cc Add MSA optimized remaining scale row functions 2016-12-21 13:39:44 +05:30
scale_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale.cc Add MSA optimized remaining scale row functions 2016-12-21 13:39:44 +05:30
video_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00