Manojkumar Bhosale 7c64163ff4 Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions
R=fbarchard@google.com
BUG=libyuv:634

Performance Gain (vs C vectorized)
ARGB1555ToARGBRow_MSA     - 1.85
ARGB1555ToARGBRow_Any_MSA - 1.82
RGB565ToARGBRow_MSA       - 2.14
RGB565ToARGBRow_Any_MSA   - 2.08
RGB24ToARGBRow_MSA        - 8.57
RGB24ToARGBRow_Any_MSA    - 7.42
RAWToARGBRow_MSA          - 8.57
RAWToARGBRow_Any_MSA      - 7.42
ARGB1555ToYRow_MSA        - 2.60
ARGB1555ToYRow_Any_MSA    - 2.47
RGB565ToYRow_MSA          - 2.45
RGB565ToYRow_Any_MSA      - 2.33
RGB24ToYRow_MSA           - 2.23
RGB24ToYRow_Any_MSA       - 2.01
RAWToYRow_MSA             - 2.25
RAWToYRow_Any_MSA         - 2.02
ARGB1555ToUVRow_MSA       - 1.40
ARGB1555ToUVRow_Any_MSA   - 1.37
RGB565ToUVRow_MSA         - 1.68
RGB565ToUVRow_Any_MSA     - 1.63
RGB24ToUVRow_MSA          - 3.02
RGB24ToUVRow_Any_MSA      - 2.87
RAWToUVRow_MSA            - 3.04
RAWToUVRow_Any_MSA        - 2.85

Performance Gain (vs C non-vectorized)
ARGB1555ToARGBRow_MSA     - 4.66
ARGB1555ToARGBRow_Any_MSA - 4.45
RGB565ToARGBRow_MSA       - 5.58
RGB565ToARGBRow_Any_MSA   - 5.34
RGB24ToARGBRow_MSA        - 8.57
RGB24ToARGBRow_Any_MSA    - 7.42
RAWToARGBRow_MSA          - 8.57
RAWToARGBRow_Any_MSA      - 7.42
ARGB1555ToYRow_MSA        - 6.38
ARGB1555ToYRow_Any_MSA    - 5.98
RGB565ToYRow_MSA          - 6.42
RGB565ToYRow_Any_MSA      - 6.05
RGB24ToYRow_MSA           - 7.87
RGB24ToYRow_Any_MSA       - 7.01
RAWToYRow_MSA             - 7.98
RAWToYRow_Any_MSA         - 7.01
ARGB1555ToUVRow_MSA       - 5.39
ARGB1555ToUVRow_Any_MSA   - 5.06
RGB565ToUVRow_MSA         - 6.39
RGB565ToUVRow_Any_MSA     - 5.90
RGB24ToUVRow_MSA          - 3.04
RGB24ToUVRow_Any_MSA      - 2.87
RAWToUVRow_MSA            - 3.04
RAWToUVRow_Any_MSA        - 2.88

Review-Url: https://codereview.chromium.org/2600713002
2017-01-13 15:43:37 +05:30
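
For readers unfamiliar with the row functions named in the commit message above, the following is a minimal scalar sketch of what an RGB565-to-ARGB row conversion computes, and of how an `_Any` wrapper can handle widths that are not a multiple of the SIMD block size. It is not the code from this commit. The function names, the block size of 16 pixels, and the scalar stand-in for the SIMD kernel are all assumptions made for illustration. The only facts relied on are the RGB565 bit layout and the B, G, R, A byte order in memory that libyuv calls ARGB.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical name, not libyuv's API: scalar RGB565 -> ARGB for one row.
// RGB565 is 16 bits per pixel, little-endian:
// bits 0-4 blue, bits 5-10 green, bits 11-15 red.
void Rgb565ToArgbRowSketch(const uint8_t* src_rgb565, uint8_t* dst_argb,
                           int width) {
  assert(width >= 0);
  for (int x = 0; x < width; ++x) {
    uint16_t p =
        static_cast<uint16_t>(src_rgb565[0] | (src_rgb565[1] << 8));
    uint8_t b = p & 0x1f;
    uint8_t g = (p >> 5) & 0x3f;
    uint8_t r = (p >> 11) & 0x1f;
    // Replicate the high bits into the low bits so full scale maps to 255.
    dst_argb[0] = static_cast<uint8_t>((b << 3) | (b >> 2));
    dst_argb[1] = static_cast<uint8_t>((g << 2) | (g >> 4));
    dst_argb[2] = static_cast<uint8_t>((r << 3) | (r >> 2));
    dst_argb[3] = 255;  // opaque alpha
    src_rgb565 += 2;
    dst_argb += 4;
  }
}

// Assumed number of pixels handled per SIMD iteration (illustrative only).
constexpr int kBlock = 16;

// Stand-in for a SIMD kernel that accepts only widths that are a multiple
// of kBlock. A real kernel would use vector intrinsics here.
void Rgb565ToArgbRowFastSketch(const uint8_t* src, uint8_t* dst, int width) {
  assert(width % kBlock == 0);
  Rgb565ToArgbRowSketch(src, dst, width);
}

// Sketch of an "Any" wrapper: run the fast kernel on the aligned part, then
// run it once more on a zero-padded copy of the leftover pixels and copy
// back only the valid output.
void Rgb565ToArgbRowAnySketch(const uint8_t* src, uint8_t* dst, int width) {
  int n = width & ~(kBlock - 1);  // pixels covered by whole blocks
  int r = width & (kBlock - 1);   // leftover pixels
  if (n > 0) {
    Rgb565ToArgbRowFastSketch(src, dst, n);
  }
  if (r > 0) {
    uint8_t tmp_in[kBlock * 2] = {0};
    uint8_t tmp_out[kBlock * 4];
    std::memcpy(tmp_in, src + n * 2, static_cast<size_t>(r) * 2);
    Rgb565ToArgbRowFastSketch(tmp_in, tmp_out, kBlock);
    std::memcpy(dst + n * 4, tmp_out, static_cast<size_t>(r) * 4);
  }
}
```

The extra copies through the temporary buffers are one plausible reason the `_Any` entries in the tables above show slightly lower gains than their fixed-width counterparts, though the commit message itself does not say so.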
compare_common.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare_neon64.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_neon.cc xmmword cast for clang 2015-08-18 11:13:12 -07:00
compare_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
compare.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_argb.cc Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions 2017-01-13 15:43:37 +05:30
convert_from_argb.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
convert_from.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
convert_jpeg.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert_to_argb.cc ConvertToARGB: Allows rotation on ARGB input 2017-01-11 14:38:25 -08:00
convert_to_i420.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
convert.cc Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions 2017-01-13 15:43:37 +05:30
cpu_id.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_decoder.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
mjpeg_validate.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
planar_functions.cc Add MSA optimized ARGB Attenuate/RGB565/Shuffle/Shader/Gray/Sepia row functions 2016-12-15 12:06:02 +05:30
rotate_any.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
rotate_argb.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_mips.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_msa.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
rotate_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
rotate.cc Add MSA optimized TransposeWx8_MSA and TransposeUVWx8_MSA functions 2016-12-15 10:06:01 +05:30
row_any.cc Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions 2017-01-13 15:43:37 +05:30
row_common.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
row_gcc.cc clang-format row_gcc.cc with some functions disabled 2016-11-07 18:37:29 -08:00
row_mips.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
row_msa.cc Add MSA optimized RAW/RGB/ARGB to ARGB/Y/UV row functions 2017-01-13 15:43:37 +05:30
row_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
row_win.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_any.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_argb.cc Add MSA optimized ARGB scaling functions 2016-12-07 11:47:15 +05:30
scale_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_gcc.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_mips.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
scale_msa.cc Add MSA optimized remaining scale row functions 2016-12-21 13:39:44 +05:30
scale_neon64.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_neon.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale_win.cc clang-format libyuv 2016-11-07 17:37:23 -08:00
scale.cc Libyuv MIPS DSPR2 optimizations. 2017-01-11 12:19:13 -08:00
video_common.cc clang-format libyuv 2016-11-07 17:37:23 -08:00