- 31 Jul, 2017 1 commit
-
-
Marco authored
When the superblock partition is based on the nonrd-pickmode, we need to avoid the denoising. Current condition was based on the speed level. This change is to make the condition at the superblock level, as the switch in partitioning may be done at sb level based on source_sad (e.g., in speed 6). Change-Id: I12ece4f60b93ed34ee65ff2d6cdce1213c36de04
-
- 30 Jul, 2017 1 commit
-
-
James Zern authored
-
- 29 Jul, 2017 6 commits
-
-
James Zern authored
This reverts commit 064fc570. This causes an assertion failure in vp9_mcomp.c when running gtest_filter=VP9/MotionVectorTestLarge.OverallTest/41: `mv->col >= -((1 << (11 + 1 + 2)) - 1) && mv->col < ((1 << (11 + 1 + 2)) - 1)' Change-Id: I449e777bf18b661cb3f1d82253610c55c51687f6
-
James Zern authored
This reverts commit aa1c4cd1. This fails the following tests with extreme input coefficients: SSE2/InvTrans8x8DCT.CompareReference/0 SSE2/InvTrans8x8DCT.CompareReference/2 previously the optimized path was skipped in this range Change-Id: I9af015a46eba96208834a219fafd651d37556a80
-
Marco Paniconi authored
-
Marco Paniconi authored
-
Jerome Jiang authored
Change-Id: I917d106f4c95ea44e413e23881f6303982e1a6a3
-
Marco authored
Move the source_sad feature to speed 6 (from speed 7), and add speed feature to switch from the variance-based partition to reference_partition (which uses nonrd-pickmode for bsize selection) if source_sad is high. Currently used only for speed 6 for resoln <= 360p. About 4-5% improvement on 360p in RTC set. Some speed slowdown, but still ~30% faster than speed 5. Change-Id: Ib0330ee5fe9fdd2608aed91359a2a339d967491c
-
- 28 Jul, 2017 7 commits
-
-
Urvang Joshi authored
The greedy version was already enabled by default here: https://chromium-review.googlesource.com/c/546848/ And the speed+compression gains from greedy version were already mentioned here: https://chromium-review.googlesource.com/c/531675/ Change-Id: Iad9f7d03490c845ad1e230af028c9d39edddca97
-
Linfeng Zhang authored
* changes: Update vpx_idct16x16_10_add_sse2() Add vpx_idct16x16_38_add_sse2() Rewrite vpx_highbd_idct8x8_{12,64}_add_sse2 Refactor highbd idct 4x4 and 8x8 x86 functions
-
James Zern authored
This reverts commit 03f5e300. This causes test failures under OSX: SSSE3/VP9QuantizeTest.EOBCheck/0 SSSE3/VP9QuantizeTest.OperationCheck/0 Change-Id: I122732717ead1f7af5b04c529a6948e382e5e59b
-
Linfeng Zhang authored
Change-Id: Ia0e20f5fa47382af5785221eebb05212b40bd35c
-
Linfeng Zhang authored
Change-Id: I28150789feadc0b63d2fadc707e48971b41f9898
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: I35df041b757d42278ac7a5cdbd909e8ffcee1455
-
Linfeng Zhang authored
BUG=webm:1412 Change-Id: I221dff34dd5f71b390b5e043d0a137ccb0a01dec
-
- 27 Jul, 2017 2 commits
-
-
Johann Koenig authored
-
Jerome Jiang authored
-
- 26 Jul, 2017 3 commits
-
-
Jerome Jiang authored
Neutral on RTC metrics and speed on Pixel. Change-Id: I26b907483fe133e6e4c1009d147631f0d0e0f2fb
-
James Zern authored
visual studio reports tran_high_t (int64) -> short in calls to _mm_set1_epi16 Change-Id: Icb8d1baee77ad3d45edb1477a443d3e648f0b745
-
James Zern authored
visual studio reports tran_high_t (int64) -> int in calls to _mm_setr_epi32 Change-Id: Ic2247c8e3800991202151790d78bd94c4f4aed05
-
- 25 Jul, 2017 1 commit
-
-
James Zern authored
allow the right shift to operate on 64-bits, this matches the rest of the implementations previously: b0f1ae14 vpx_get16x16var_avx2: correct cast order Change-Id: I632ee5e418f3f9b30e79ecd05588eb172b0783aa
-
- 24 Jul, 2017 1 commit
-
-
James Zern authored
allow the right shift to operate on 64-bits, this matches the rest of the implementations missed in: 6acd061a variance_avx2: sync variance functions with c-code Change-Id: Icae436b881251ccb9f9ed64fcbf8d358c58a4617
-
- 22 Jul, 2017 1 commit
-
-
James Zern authored
For 8-bit the subtrahend is small enough to fit into uint32_t. For 10/12-bit apply: 63a37d16 Prevent negative variance previously: 47b9a091 Resolve -Wshorten-64-to-32 in highbd variance. c0241664 Resolve -Wshorten-64-to-32 in variance. Change-Id: I181c85f0b9a03da37c2e8b89482d48aa3dbc0aee
-
- 20 Jul, 2017 7 commits
-
-
Marco authored
Change-Id: I49c960179dfc1902aa5e5c99915789878c06bc3d
-
Johann Koenig authored
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
Avoid unsigned overflow warning: unsigned integer overflow: 19974 - 32703 cannot be represented in type 'unsigned int' Change-Id: Ifebee014342e4c6f3b53306c0cad6ae0b465ac12
-
Johann authored
qcoeff output looks OK but dqcoeff is no good. BUG=webm:1448 Change-Id: I07211db8a8b74f1f45fdd059852e2de0e5ee18fd
-
Johann Koenig authored
-
- 19 Jul, 2017 5 commits
-
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
eob values are generated by the function. Change-Id: I8ce92100e83022bff99888a5a7e6ef378c49fda3
-
Han Shen authored
Backend specific optimization for PPC VSX reads 16 bytes, whereas arm neon / sse2 only reads <= 8 bytes. Although the extra bytes read are actually never used, this is not a warrant for groping around. Fixed by allocating more when building for VSX. This is reported by asan. Also note - PPC does have assembly that loads 64-bit content from memory - lxsdx loads one 64-bit doubleword (whereas lxvd2x loads two 64-bit doubleword) from memory. However, we only have "vec_vsx_ld" builtins that mapped to lxvd2x, no builtins to lxsdx. The only way to access lxsdx is through inline assembly, which does not fit well in the origin paradigm. Refer: vsx: vpx_tm_predictor_4x4_vsx @ third_party/libvpx/git_root/vpx_dsp/ppc/intrapred_vsx.c neon: vpx_tm_predictor_4x4_neon @ third_party/libvpx/git_root/vpx_dsp/arm/intrapred_neon_asm.asm sse2: tm_predictor_4x4 @ third_party/libvpx/git_root/vpx_dsp/x86/intrapred_sse2.asm BUG=b/63112600 Tested: asan tests passed. Change-Id: I5f74b56e35c05b67851de8b5530aece213f2ce9d
-
Johann Koenig authored
-
- 18 Jul, 2017 5 commits
-
-
Jerome Jiang authored
BUG=webm:1447 Change-Id: I8defe45cb94eb9c209ba72ce446786f24c14c0b8
-
Jerome Jiang authored
Change-Id: I6702b7b11aa4ac9aac5fd54deef4377cdcb29c64
-
Jerome Jiang authored
-
Jerome Jiang authored
-
Johann Koenig authored
* changes: quantize test: test sse2 and avx optimizations quantize test: extend arrays quantize test: restrict and correct input
-