AVX2 SAD Optimization: (0fa8b668) · Commits · BC / public / external / libvpx

Commit 0fa8b668 authored 11 years ago by

levytamar82 Committed by Yunqing Wang 11 years ago

AVX2 SAD Optimization:

2 functions were optimized for avx2 by using full 256 bit register
In order to handle 32 elements in parallel instead of only 16 in parallel:
1. vp9_sad32x32x4d
2. vp9_sad64x64x4d

The function level gain is 66% and the user level gain is ~1%.

Change-Id: I4efbb3bc7d8bc03b64b6c98f5cd5c4a9dd3212cb

parent 3967435f

v1.14.0-linphone

No related merge requests found

Hide whitespace changes

Inline Side-by-side

Showing with 170 additions and 2 deletions

Please register or to comment