- 29 Jul, 2017 1 commit
-
-
Marco authored
Move the source_sad feature to speed 6 (from speed 7), and add speed feature to switch from the variance-based partition to reference_partition (which uses nonrd-pickmode for bsize selection) if source_sad is high. Currently used only for speed 6 for resoln <= 360p. About 4-5% improvement on 360p in RTC set. Some speed slowdown, but still ~30% faster than speed 5. Change-Id: Ib0330ee5fe9fdd2608aed91359a2a339d967491c
-
- 28 Jul, 2017 6 commits
-
-
Linfeng Zhang authored
* changes: Update vpx_idct16x16_10_add_sse2() Add vpx_idct16x16_38_add_sse2() Rewrite vpx_highbd_idct8x8_{12,64}_add_sse2 Refactor highbd idct 4x4 and 8x8 x86 functions
-
James Zern authored
This reverts commit 03f5e300. This causes test failures under OSX: SSSE3/VP9QuantizeTest.EOBCheck/0 SSSE3/VP9QuantizeTest.OperationCheck/0 Change-Id: I122732717ead1f7af5b04c529a6948e382e5e59b
-
Linfeng Zhang authored
Change-Id: Ia0e20f5fa47382af5785221eebb05212b40bd35c
-
Linfeng Zhang authored
Change-Id: I28150789feadc0b63d2fadc707e48971b41f9898
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: I35df041b757d42278ac7a5cdbd909e8ffcee1455
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: I221dff34dd5f71b390b5e043d0a137ccb0a01dec
-
- 27 Jul, 2017 2 commits
-
-
Johann Koenig authored
-
Jerome Jiang authored
-
- 26 Jul, 2017 3 commits
-
-
Jerome Jiang authored
Neutral on RTC metrics and speed on Pixel. Change-Id: I26b907483fe133e6e4c1009d147631f0d0e0f2fb
-
James Zern authored
visual studio reports tran_high_t (int64) -> short in calls to _mm_set1_epi16 Change-Id: Icb8d1baee77ad3d45edb1477a443d3e648f0b745
-
James Zern authored
visual studio reports tran_high_t (int64) -> int in calls to _mm_setr_epi32 Change-Id: Ic2247c8e3800991202151790d78bd94c4f4aed05
-
- 25 Jul, 2017 1 commit
-
-
James Zern authored
allow the right shift to operate on 64-bits, this matches the rest of the implementations previously: b0f1ae14 vpx_get16x16var_avx2: correct cast order Change-Id: I632ee5e418f3f9b30e79ecd05588eb172b0783aa
-
- 24 Jul, 2017 1 commit
-
-
James Zern authored
allow the right shift to operate on 64-bits, this matches the rest of the implementations missed in: 6acd061a variance_avx2: sync variance functions with c-code Change-Id: Icae436b881251ccb9f9ed64fcbf8d358c58a4617
-
- 22 Jul, 2017 1 commit
-
-
James Zern authored
For 8-bit the subtrahend is small enough to fit into uint32_t. For 10/12-bit apply: 63a37d16 Prevent negative variance previously: 47b9a091 Resolve -Wshorten-64-to-32 in highbd variance. c0241664 Resolve -Wshorten-64-to-32 in variance. Change-Id: I181c85f0b9a03da37c2e8b89482d48aa3dbc0aee
-
- 20 Jul, 2017 7 commits
-
-
Marco authored
Change-Id: I49c960179dfc1902aa5e5c99915789878c06bc3d
-
Johann Koenig authored
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
Avoid unsigned overflow warning: unsigned integer overflow: 19974 - 32703 cannot be represented in type 'unsigned int' Change-Id: Ifebee014342e4c6f3b53306c0cad6ae0b465ac12
-
Johann authored
qcoeff output looks OK but dqcoeff is no good. BUG=webm:1448 Change-Id: I07211db8a8b74f1f45fdd059852e2de0e5ee18fd
-
Johann Koenig authored
-
- 19 Jul, 2017 5 commits
-
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
eob values are generated by the function. Change-Id: I8ce92100e83022bff99888a5a7e6ef378c49fda3
-
Han Shen authored
Backend specific optimization for PPC VSX reads 16 bytes, whereas arm neon / sse2 only reads <= 8 bytes. Although the extra bytes read are actually never used, this is not a warrant for groping around. Fixed by allocating more when building for VSX. This is reported by asan. Also note - PPC does have assembly that loads 64-bit content from memory - lxsdx loads one 64-bit doubleword (whereas lxvd2x loads two 64-bit doubleword) from memory. However, we only have "vec_vsx_ld" builtins that mapped to lxvd2x, no builtins to lxsdx. The only way to access lxsdx is through inline assembly, which does not fit well in the origin paradigm. Refer: vsx: vpx_tm_predictor_4x4_vsx @ third_party/libvpx/git_root/vpx_dsp/ppc/intrapred_vsx.c neon: vpx_tm_predictor_4x4_neon @ third_party/libvpx/git_root/vpx_dsp/arm/intrapred_neon_asm.asm sse2: tm_predictor_4x4 @ third_party/libvpx/git_root/vpx_dsp/x86/intrapred_sse2.asm BUG=b/63112600 Tested: asan tests passed. Change-Id: I5f74b56e35c05b67851de8b5530aece213f2ce9d
-
Johann Koenig authored
-
- 18 Jul, 2017 13 commits
-
-
Jerome Jiang authored
BUG=webm:1447 Change-Id: I8defe45cb94eb9c209ba72ce446786f24c14c0b8
-
Jerome Jiang authored
Change-Id: I6702b7b11aa4ac9aac5fd54deef4377cdcb29c64
-
Jerome Jiang authored
-
Jerome Jiang authored
-
Johann Koenig authored
* changes: quantize test: test sse2 and avx optimizations quantize test: extend arrays quantize test: restrict and correct input
-
Johann authored
Keep optimized code out of the reference implementation. This matches the style of the other sub calls. Change-Id: I3da6acd4f2c647b029c420e22ac9410a18259689
-
Jerome Jiang authored
When SVC is used, allocate alt-ref in denoiser. Change-Id: I1b17221b55b9444cd23b97d481b54ff8d296d857
-
Johann authored
Copy missing line from avx implementation. Change-Id: I9755c5b4d4034867de6fa9f741c24bf49dce3a27
-
Johann authored
ssse3 does not pass either of the tests. avx 32x32 does not pass. Change-Id: I62c2e31336fd2327327afaa0da896ad79a3def44
-
Jerome Jiang authored
0.007% regression on rtc and 0.004% gain on rtc_derf. 1 thread on QVGA,VGA and HD has ~0.2% speed regression while 2 threads has ~0.2% speed gain on Google Pixel. Change-Id: Ia4a6ec904df670d7001e35e070b01e34149d23dc
-
Johann authored
Officially the quant structures are 8 elements, with one dc element and 7 repeated ac elements. The low bit depth optimizations take advantage of this to fill the xmm registers. The high bit depth version manually duplicates the values. If all the optimizations were unified, the structure sizes could be greatly reduced. Change-Id: Ibd7a0337a7832ce2a1a05ee433c310077e1059ae
-
Johann authored
Use only valid values for quantize inputs. These were determined by looping over vp9_init_quantizer and looking for max and min values. This allows extending the test to the low bit depth functions which were not designed to handle all possible inputs but only valid inputs. Change-Id: I94e1d8863a49ac227845b65c6b50130e10e6319e
-
Marco authored
To fix valgrind issueis with SVC tests. SVC encoding uses prune_evenmore which is causing uinit value. Will re-enable later when issue is resolved. Change-Id: I257ff878cf78197ddd813db056582a4d5fe94f44
-