libyuv

mirror of https://chromium.googlesource.com/libyuv/libyuv synced 2026-06-16 08:56:09 +08:00

Go to file

George Steed 5d694bec38 [AArch64] Replace UQSHRN{,2} pair by UZP2 in YUVTORGB The existing Neon code makes use of a pair of UQSHRN and UQSHRN2 instructions to extract the top half of a widened multiply result. These instructions would ordinarily saturate, however saturation can never happen in this case since we are shifting by 16 to get the top half of each element, the top bits remain as-is. We could move this to using a slightly simpler non-saturating shift, however in this case it is simpler and faster to just use UZP2 to extract the top half of each 32-bit lane directly. Reduction in runtime for selected kernels: Kernel \| Cortex-A55 \| Cortex-A76 \| Cortex-X2 I400ToARGBRow_NEON \| -9.4% \| -14.9% \| -13.9% I422AlphaToARGBRow_NEON \| -7.9% \| -11.4% \| -11.5% I422ToARGB1555Row_NEON \| -7.3% \| -17.2% \| -14.7% I422ToARGB4444Row_NEON \| -7.6% \| -17.9% \| -13.7% I422ToARGBRow_NEON \| -8.2% \| -9.8% \| -11.9% I422ToRGB24Row_NEON \| -8.0% \| -13.3% \| -12.8% I422ToRGB565Row_NEON \| -7.5% \| -15.1% \| -14.6% I422ToRGBARow_NEON \| -8.3% \| -13.1% \| -12.2% I444AlphaToARGBRow_NEON \| -8.3% \| -7.6% \| -12.7% I444ToARGBRow_NEON \| -8.6% \| -3.5% \| -13.5% I444ToRGB24Row_NEON \| -8.5% \| -7.8% \| -13.4% NV12ToARGBRow_NEON \| -8.8% \| -1.4% \| -12.0% NV12ToRGB24Row_NEON \| -8.5% \| -11.5% \| -12.3% NV12ToRGB565Row_NEON \| -7.9% \| -15.0% \| -15.7% NV21ToARGBRow_NEON \| -8.7% \| -1.6% \| -12.3% NV21ToRGB24Row_NEON \| -8.4% \| -11.5% \| -12.0% UYVYToARGBRow_NEON \| -8.8% \| -8.9% \| -11.9% YUY2ToARGBRow_NEON \| -8.7% \| -10.8% \| -13.3% Bug: libyuv:976 Change-Id: I6c505fe722e5f91f93718b85fe881ad056d8602d Reviewed-on: https://chromium-review.googlesource.com/c/libyuv/libyuv/+/5366653 Reviewed-by: Frank Barchard <fbarchard@chromium.org> Commit-Queue: Frank Barchard <fbarchard@chromium.org>		2024-03-14 20:04:46 +00:00
build_overrides	Define enable_safe_libcxx in build_overrides/build.gni.	2023-05-03 06:08:40 +00:00
docs	Add AMXINT8 cpu detect	2024-02-15 21:44:47 +00:00
include	Revert "AMX detect OS support for linux kernel"	2024-02-29 00:33:29 +00:00
infra/config	infra/config: remove goma property	2023-08-17 06:06:53 +00:00
riscv_script	[RISC-V] Support CMake build with custom compiler flags	2023-07-25 09:21:59 +00:00
source	[AArch64] Replace UQSHRN{,2} pair by UZP2 in YUVTORGB	2024-03-14 20:04:46 +00:00
tools_libyuv	Do not roll the Fuchsia SDK.	2023-07-03 09:18:00 +00:00
unit_test	Add AMXINT8 cpu detect	2024-02-15 21:44:47 +00:00
util	Add AMXINT8 cpu detect	2024-02-15 21:44:47 +00:00
.clang-format	clang-format libyuv	2016-11-07 17:37:23 -08:00
.gitignore	DetilePlane and unittest for NEON	2022-01-31 20:05:55 +00:00
.gn	Roll chromium_revision 829c6df33d..7d683aeda8 (945687:1050091)	2022-09-22 14:56:57 +00:00
.vpython	remove swarming_client	2021-09-09 07:11:45 +00:00
.vpython3	Update vpython3 requests	2023-06-01 19:06:40 +00:00
Android.bp	Split scale_test and scale_plane_test to allow building on small devices	2023-12-09 18:39:41 +00:00
Android.mk	Split scale_test and scale_plane_test to allow building on small devices	2023-12-09 18:39:41 +00:00
AUTHORS	Fix compile errors for ARM targets when libyuv_use_neon = false	2022-05-13 10:12:38 +00:00
BUILD.gn	Split scale_test and scale_plane_test to allow building on small devices	2023-12-09 18:39:41 +00:00
cleanup_links.py	Update PRESUBMIT, cleanup_links and autoroller to py3	2022-02-24 13:34:14 +00:00
CM_linux_packages.cmake	Reduce cmake verbosity and update min version	2022-08-03 06:59:54 +00:00
CMakeLists.txt	Add cpuid target to CMakeList.txt	2023-12-11 18:45:32 +00:00
codereview.settings	[infra] remove no longer supported `git cl upload` setting.	2021-04-28 12:47:52 +00:00
DEPS	[Fuchsia] Add terminal.x64 image to default checkout	2023-09-19 01:50:07 +00:00
DIR_METADATA	Move metadata in OWNERS files to DIR_METADATA files	2021-02-09 19:34:43 +00:00
download_vs_toolchain.py	Update gclient instructions + environment	2022-02-24 15:19:23 +00:00
libyuv.gni	Add GN builds on loongarch platform.	2023-06-19 17:47:05 +00:00
libyuv.gyp	Add libyuv.gyp build files	2022-03-21 23:48:16 +00:00
libyuv.gypi	Add libyuv.gyp build files	2022-03-21 23:48:16 +00:00
LICENSE	Update Copyright notice to follow new chromium conventions.	2012-08-08 19:04:24 +00:00
linux.mk	Split convert_test and convert_argb_test to allow building on small systems that run out of memory compiling unittests.	2023-12-08 13:39:56 +00:00
OWNERS	add jansson@google.com to infra owners to cover when Mirko is OOO	2022-10-28 09:46:02 +00:00
PATENTS	LibYuv: Adding PATENT and LICENSE files	2011-10-25 16:15:49 +00:00
PRESUBMIT.py	Update PRESUBMIT, cleanup_links and autoroller to py3	2022-02-24 13:34:14 +00:00
public.mk	use unix line endings	2018-06-20 23:19:59 +00:00
pylintrc	Use DEPS for all dependencies + add PRESUBMIT.py	2017-02-03 11:36:53 +00:00
README.chromium	Revert "AMX detect OS support for linux kernel"	2024-02-29 00:33:29 +00:00
README.md	Add RAWToARGBRow_RVV,RAWToRGBARow_RVV,RAWToRGB24Row_RVV	2023-04-07 18:45:08 +00:00
winarm.mk	NV12 Copy, include scale_uv.h	2020-12-08 18:54:16 +00:00

README.md

libyuv is an open source project that includes YUV scaling and conversion functionality.

Scale YUV to prepare content for compression, with point, bilinear or box filter.
Convert to YUV from webcam formats for compression.
Convert to RGB formats for rendering/effects.
Rotate by 90/180/270 degrees to adjust for mobile devices in portrait mode.
Optimized for SSSE3/AVX2 on x86/x64.
Optimized for Neon on Arm.
Optimized for MSA on Mips.
Optimized for RVV on RISC-V.

Development

See Getting started for instructions on how to get started developing.

You can also browse the docs directory for more documentation.