- 31 May, 2017 4 commits
-
-
Jerome Jiang authored
Use only the average of center 2x2 pixels in vp8. Change-Id: I2b23ff19a90827226273e0fca49e90c734eda59b
-
Johann Koenig authored
-
Jerome Jiang authored
-
Linfeng Zhang authored
-
- 30 May, 2017 7 commits
-
-
Johann authored
BUG=webm:1423 Change-Id: I33de537f238f58f89b7a6c1c2d6e8110de4b8804
-
Jerome Jiang authored
Change-Id: Ica1b4e918aa759cd0ce65920f9d88452bbf9e3b4
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: I26e4b34ae9bc1ae80c24f56d740d737a95f1ab84
-
Johann Koenig authored
-
Johann Koenig authored
-
Johann authored
x86 requires 16 byte alignment for some vector loads/stores. arm does not have the same requirement. The asserts are still in avg_pred_sse2.c. This just removes them from the common code. Change-Id: Ic5175c607a94d2abf0b80d431c4e30c8a6f731b6
-
Jerome Jiang authored
-
- 26 May, 2017 7 commits
-
-
Johann authored
Unlike x86 neon only requires type alignment when loading into vectors. Change-Id: I7bbbe4d51f78776e499ce137578d8c0effdbc02f
-
Johann Koenig authored
-
Johann Koenig authored
-
Jerome Jiang authored
Split vp8/vp9 implementations on yv12_copy_frame_c. Remove high-bitdepth codes from vp8_yv12_extend_frame_borders_c. Clean up vp8 codes usage in vp9. BUG=webm:1435 Change-Id: Ic68e79e9d71e1b20ddfc451fb8dcf2447861236d
-
Marco authored
Fix the condition on usage of source_sad for temporal layers. FIx allows it to be used for the case of 1 temporal layer. Change-Id: I02b1b0ade67a7889d1b93cee66d27c0951131fc3
-
Marco Paniconi authored
-
Marco Paniconi authored
-
- 25 May, 2017 3 commits
-
-
Marco authored
For 1 pass CBR SVC mode. Change-Id: Ic026740f9d0ec5eee7c5845be9c5b15884fec48d
-
Jerome Jiang authored
Change-Id: If760f28cbbf22beac1cc9bd1546f13831e9dd3f0
-
Marco authored
Adjust the max_copied_frame setting for temporal layers. Keep the same setting for non-SVC at speed 8. This change also enables copy_partiton for non-SVC at speed 7, but with smaller value of max_copied_frame (=2). ~2% speedup for SVC speed 7, 3 layers, with little/no quality loss. Change-Id: Ic65ac9aad764ec65a35770d263424b2393ec6780
-
- 24 May, 2017 6 commits
-
-
Johann authored
Unlike x86, arm does not impose additional alignment restrictions on vector loads. For incoming values to the first pass, it uses vld1_u32() which typically does impose a 4 byte alignment. However, as the first pass operates on user-supplied values we must prepare for unaligned values anyway (and have, see mem_neon.h). But for the local temporary values there is no stride and the load will use vld1_u8 which does not require 4 byte alignment. There are 3 temporary structures. In the C, one is uint16_t. The arm saturates between passes but still passes tests. If this becomes an issue new functions will be needed. Change-Id: I3c9d4701bfeb14b77c783d0164608e621bfecfb1
-
Johann authored
Change-Id: Idb6248c1429b55176bb3e9f4e8365ea0ed2be62a
-
Johann Koenig authored
* changes: sub pel avg variance neon: 4x block sizes sub pel variance neon: 4x block sizes
-
Johann Koenig authored
* changes: sub pel avg variance neon: add neon optimizations sub pel variance neon: normalize variable names
-
James Zern authored
-
James Zern authored
promote coeff to signed 64-bit to avoid exceeding integer bounds when squaring the value Change-Id: If77bef6bc0a6a4c39ca3013e5e2ddb426a1c6e1f
-
- 23 May, 2017 3 commits
-
-
Linfeng Zhang authored
Extract shared code into inline functions. Change-Id: Iee1e5a4bc6396aeed0d301163095c9b21aa66b2f
-
Linfeng Zhang authored
Make it work in high bit depth. BUG=webm:1412 Change-Id: Ic5cfd410a69709f01e2924774356a108a349d273
-
Gregor Jasny authored
BUG=webm:1428 Change-Id: Iba98aef1159724d106cf39b94d7b69843d76cd48
-
- 22 May, 2017 10 commits
-
-
Johann authored
BUG=webm:1423 Change-Id: Iaab2b9a183fdb54aae5f717aba95d90dc36a9e3b
-
Johann authored
Add optimizations for blocks of width 4 BUG=webm:1423 Change-Id: Idfb458d36db3014d48fbfbe7f5462aa6eb249938
-
Johann authored
These are missing an optimized version of vpx_comp_avg_pred BUG=webm:1423 Change-Id: I31fa6ef842e98f7ff3ea079ffed51ae33178e2ed
-
Johann authored
match vpx_dsp/variance.c variable names Change-Id: I228c6f296c183af147b079b7c8bcdf97bd09cf3a
-
Linfeng Zhang authored
-
Johann authored
Change-Id: I12faca82d062eb33dc48dfeb39739b25112316cd
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: Ia338a6057d36f9ed7eaa9cbd4dfbf0c3cbdc6468
-
Johann authored
The sub pixel variance uses a temp buffer which guarantees width == stride. Take advantage of this with the 4x and avoid the very costly lane loads. Change-Id: Ia0c97eb8c29dc8dfa6e51a29dff9b75b3c6726f1
-
Johann Koenig authored
* changes: neon variance: process 4x blocks use memcpy for unaligned neon stores
-
Marco Paniconi authored
-