Manojkumar Bhosale 09b8c971b3 Add MSA optimized NV12/21 To RGB row functions
R=fbarchard@google.com
BUG=libyuv:634

Performance Gain (vs C auto-vectorized)
NV12ToARGBRow_MSA       - ~1.5x
NV12ToARGBRow_Any_MSA   - ~1.4x
NV12ToRGB565Row_MSA     - ~1.4x
NV12ToRGB565Row_Any_MSA - ~1.4x
NV21ToARGBRow_MSA       - ~1.5x
NV21ToARGBRow_Any_MSA   - ~1.5x
SobelRow_MSA            - ~4.3x
SobelRow_Any_MSA        - ~3.4x
SobelToPlaneRow_MSA     - ~8.0x
SobelToPlaneRow_Any_MSA - ~4.7x
SobelXYRow_MSA          - ~3.0x
SobelXYRow_Any_MSA      - ~2.5x

Performance Gain (vs C non-vectorized)
NV12ToARGBRow_MSA       - ~6.5x
NV12ToARGBRow_Any_MSA   - ~6.5x
NV12ToRGB565Row_MSA     - ~6.2x
NV12ToRGB565Row_Any_MSA - ~6.1x
NV21ToARGBRow_MSA       - ~6.5x
NV21ToARGBRow_Any_MSA   - ~6.5x
SobelRow_MSA            - ~14.5x
SobelRow_Any_MSA        - ~11.3x
SobelToPlaneRow_MSA     - ~34.2x
SobelToPlaneRow_Any_MSA - ~19.4x
SobelXYRow_MSA          - ~11.1x
SobelXYRow_Any_MSA      - ~9.1x

Review-Url: https://codereview.chromium.org/2636483002 .
2017-01-18 09:24:39 +05:30
compare_common.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare_neon64.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_neon.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_argb.cc Add MSA optimized NV12/21 To RGB row functions 2017-01-18 09:24:39 +05:30
convert_from_argb.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
convert_from.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
convert_jpeg.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_argb.cc add Intel Code Analyst markers 2017-01-13 15:50:24 -08:00
convert_to_i420.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert.cc Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions 2017-01-13 15:43:37 +05:30
cpu_id.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_decoder.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_validate.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
planar_functions.cc Add MSA optimized NV12/21 To RGB row functions 2017-01-18 09:24:39 +05:30
rotate_any.cc Add MSA optimized rotate functions (used 16x16 transpose) 2017-01-13 15:50:02 +05:30
rotate_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_msa.cc Add MSA optimized rotate functions (used 16x16 transpose) 2017-01-13 15:50:02 +05:30
rotate_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate.cc Add MSA optimized rotate functions (used 16x16 transpose) 2017-01-13 15:50:02 +05:30
row_any.cc Add MSA optimized NV12/21 To RGB row functions 2017-01-18 09:24:39 +05:30
row_common.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
row_gcc.cc add Intel Code Analyst markers 2017-01-13 15:50:24 -08:00
row_mips.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
row_msa.cc Add MSA optimized NV12/21 To RGB row functions 2017-01-18 09:24:39 +05:30
row_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_win.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_any.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_argb.cc Add MSA optimized ARGB scaling functions 2016-12-07 11:47:15 +05:30
scale_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_mips.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_msa.cc Add MSA optimized remaining scale row functions 2016-12-21 13:39:44 +05:30
scale_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
video_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00