- 22 Jul, 2017 1 commit
-
-
James Zern authored
For 8-bit the subtrahend is small enough to fit into uint32_t. For 10/12-bit apply: 63a37d16 Prevent negative variance previously: 47b9a091 Resolve -Wshorten-64-to-32 in highbd variance. c0241664 Resolve -Wshorten-64-to-32 in variance. Change-Id: I181c85f0b9a03da37c2e8b89482d48aa3dbc0aee
-
- 20 Jul, 2017 7 commits
-
-
Marco authored
Change-Id: I49c960179dfc1902aa5e5c99915789878c06bc3d
-
Johann Koenig authored
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
Avoid unsigned overflow warning: unsigned integer overflow: 19974 - 32703 cannot be represented in type 'unsigned int' Change-Id: Ifebee014342e4c6f3b53306c0cad6ae0b465ac12
-
Johann authored
qcoeff output looks OK but dqcoeff is no good. BUG=webm:1448 Change-Id: I07211db8a8b74f1f45fdd059852e2de0e5ee18fd
-
Johann Koenig authored
-
- 19 Jul, 2017 5 commits
-
-
Johann Koenig authored
-
Jerome Jiang authored
-
Johann authored
eob values are generated by the function. Change-Id: I8ce92100e83022bff99888a5a7e6ef378c49fda3
-
Han Shen authored
Backend specific optimization for PPC VSX reads 16 bytes, whereas arm neon / sse2 only reads <= 8 bytes. Although the extra bytes read are actually never used, this is not a warrant for groping around. Fixed by allocating more when building for VSX. This is reported by asan. Also note - PPC does have assembly that loads 64-bit content from memory - lxsdx loads one 64-bit doubleword (whereas lxvd2x loads two 64-bit doubleword) from memory. However, we only have "vec_vsx_ld" builtins that mapped to lxvd2x, no builtins to lxsdx. The only way to access lxsdx is through inline assembly, which does not fit well in the origin paradigm. Refer: vsx: vpx_tm_predictor_4x4_vsx @ third_party/libvpx/git_root/vpx_dsp/ppc/intrapred_vsx.c neon: vpx_tm_predictor_4x4_neon @ third_party/libvpx/git_root/vpx_dsp/arm/intrapred_neon_asm.asm sse2: tm_predictor_4x4 @ third_party/libvpx/git_root/vpx_dsp/x86/intrapred_sse2.asm BUG=b/63112600 Tested: asan tests passed. Change-Id: I5f74b56e35c05b67851de8b5530aece213f2ce9d
-
Johann Koenig authored
-
- 18 Jul, 2017 12 commits
-
-
Jerome Jiang authored
BUG=webm:1447 Change-Id: I8defe45cb94eb9c209ba72ce446786f24c14c0b8
-
Jerome Jiang authored
Change-Id: I6702b7b11aa4ac9aac5fd54deef4377cdcb29c64
-
Jerome Jiang authored
-
Jerome Jiang authored
-
Johann Koenig authored
* changes: quantize test: test sse2 and avx optimizations quantize test: extend arrays quantize test: restrict and correct input
-
Johann authored
Keep optimized code out of the reference implementation. This matches the style of the other sub calls. Change-Id: I3da6acd4f2c647b029c420e22ac9410a18259689
-
Jerome Jiang authored
When SVC is used, allocate alt-ref in denoiser. Change-Id: I1b17221b55b9444cd23b97d481b54ff8d296d857
-
Johann authored
ssse3 does not pass either of the tests. avx 32x32 does not pass. Change-Id: I62c2e31336fd2327327afaa0da896ad79a3def44
-
Jerome Jiang authored
0.007% regression on rtc and 0.004% gain on rtc_derf. 1 thread on QVGA,VGA and HD has ~0.2% speed regression while 2 threads has ~0.2% speed gain on Google Pixel. Change-Id: Ia4a6ec904df670d7001e35e070b01e34149d23dc
-
Johann authored
Officially the quant structures are 8 elements, with one dc element and 7 repeated ac elements. The low bit depth optimizations take advantage of this to fill the xmm registers. The high bit depth version manually duplicates the values. If all the optimizations were unified, the structure sizes could be greatly reduced. Change-Id: Ibd7a0337a7832ce2a1a05ee433c310077e1059ae
-
Johann authored
Use only valid values for quantize inputs. These were determined by looping over vp9_init_quantizer and looking for max and min values. This allows extending the test to the low bit depth functions which were not designed to handle all possible inputs but only valid inputs. Change-Id: I94e1d8863a49ac227845b65c6b50130e10e6319e
-
Marco authored
To fix valgrind issueis with SVC tests. SVC encoding uses prune_evenmore which is causing uinit value. Will re-enable later when issue is resolved. Change-Id: I257ff878cf78197ddd813db056582a4d5fe94f44
-
- 17 Jul, 2017 2 commits
-
-
Marco authored
When content_state_sb is set to LowVarHighSumdiff, don't reset it to VeryHighSad. Visually better on clips with strong lighting changes. Small/negligible change in RTC metrics and speed. Change-Id: I20c383e3c4cf8d1149de5f9260449c0b7cf7c6aa
-
Marco authored
When int_pro_motion_estimation is done for superblock in choose_partitioning, use it to avoid the full_pixel_search for NEWMV mode, if bsize is >= 32X32. For speed > 7. Small/neutral change on RTC metrics. ~1-2% speedup on arm on high motion clip. Change-Id: I3cfe6833ff4bf75d4afa83eaf058ad45729de85b
-
- 15 Jul, 2017 1 commit
-
-
James Zern authored
-
- 14 Jul, 2017 2 commits
-
-
Jerome Jiang authored
Change-Id: Iddcb83a5968db57cfd312c5bc44b2a226a2a3264
-
Marco authored
Only affects speed 7. Improvement on high motion clips. Change-Id: Ibddb68fed9c63207df29ffd790f9205b1cecf687
-
- 13 Jul, 2017 2 commits
-
-
Johann authored
Although the low bitdepth functions are identical (excepting the need for larger intermediate values) they do not pass these tests. This improves the error output to aid debugging. Simplify buffer usage with Buffer and removing unnecessarily aligned variables. eob is a single element and never written using aligned instructions. BUG=webm:1426 Change-Id: Ic95789a135cf1e8a3846d85270f2b818f6ec7e35
-
James Zern authored
for tests that aren't explicitly testing 2-pass behavior use --passes=1 with this configuration Change-Id: I6a1520ecc65d0f626486604310af29dacb9f197f
-
- 12 Jul, 2017 8 commits
-
-
James Zern authored
-
Johann Koenig authored
-
Marco Paniconi authored
-
Johann Koenig authored
* changes: sad4d neon: 32x[16,32,64] sad4d neon: 16x[8,16,32] sad4d neon: 8x[4,8,16] sad4d neon: 4x4, 4x8
-
Johann authored
Rewrite 64x64. BUG=webm:1425 Change-Id: I336bf5a3aa4b783389c10b16a50f0f559346ecbf
-
Johann authored
Rewrite 32x32. Use half the accumulator registers. BUG=webm:1425 Change-Id: Ibf5e61dc4ba15056102aef8495f4a02c668c5d13
-
Johann authored
Rewrite 16x16. Use half the accumulator registers. BUG=webm:1425 Change-Id: I44b48512b1e3629505d83c2645e800f53878ccc2
-
Johann authored
BUG=webm:1425 Change-Id: I7de2500cca4b621f21478c4b0333c56d76dbc9a4
-