Commit b84dc949 authored by Mans Rullgard's avatar Mans Rullgard
Browse files

vp9: neon: optimise convolve8_horiz functions

Each iteration of the horizontal loop reuses 7 of the 11 source
values.  Loading only the 4 new values saves some time.

Also add preload for source data.

Overall 4% faster on Chromebook.

Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44
parent e7c5ca89
Branches
Tags
No related merge requests found
Showing with 111 additions and 61 deletions
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment