- 05 Sep, 2017 1 commit
-
-
Linfeng Zhang authored
so that the convolve functions are independent of table alignment. Change-Id: Ieab132a30d72c6e75bbe9473544fbe2cf51541ee
-
- 02 Sep, 2017 1 commit
-
-
James Zern authored
fixes test file builds Change-Id: Iaa725ad95d56cf77d9fef8994981a80102e9a966
-
- 01 Sep, 2017 7 commits
-
-
clang-format authored
Change-Id: If4c3e8a396d0fcb304f407b44e28cac3219f038c
-
James Zern authored
based on Google style with the following differences: 3a4 > # Generated with clang-format 4.0.1 13c14 < AllowShortCaseLabelsOnASingleLine: false --- > AllowShortCaseLabelsOnASingleLine: true 23c24 < BraceWrapping: --- > BraceWrapping: 43c44 < ConstructorInitializerAllOnOneLineOrOnePerLine: true --- > ConstructorInitializerAllOnOneLineOrOnePerLine: false 46,47c47,48 < Cpp11BracedListStyle: true < DerivePointerAlignment: true --- > Cpp11BracedListStyle: false > DerivePointerAlignment: false 51c52 < IncludeCategories: --- > IncludeCategories: 78c79 < PointerAlignment: Left --- > PointerAlignment: Right 80c81 < SortIncludes: true --- > SortIncludes: false Change-Id: Ibc0ef87a516b8eae88d426dfdd7624be57e7b87c
-
Peter Boström authored
-
James Zern authored
-
Peter Boström authored
Makes main thread wait for the filter level to be picked to avoid a race between the LPF thread and update_reference_frames(). This also re-enables the failing tests under thread_sanitizer where this data race was detected. BUG=webm:1460 Change-Id: I7f5797142ea0200394309842ce3e91a480be4fbc
-
Peter Boström authored
-
Peter Boström authored
Fixes issue on iPad Pro 10.5 (and probably other places) where threads are not properly synchronized. On x86 this data race was benign as load and store instructions are atomic, they were being atomic in practice as the program hasn't been observed to be miscompiled. Such guarantees are not made outside x86, and real problems manifested where libvpx reliably reproduced a broken bitstream for even just the initial keyframe. This was detected in WebRTC where this device started using multithreading (as its CPU count is higher than earlier devices, where the problem did not manifest as single-threading was used in practice). This issue was not detected under thread-sanitizer bots as mutexes were conditionally used under this platform to simulate the protected read and write semantics that were in practice provided on x86 platforms. This change also removes several mutexes, so encoder/decoder state is lighter-weight after this change and we do not need to initialize so many mutexes (this was done even on non-thread-sanitizer platforms where they were unused). Change-Id: If41fcb0d99944f7bbc8ec40877cdc34d672ae72a
-
- 31 Aug, 2017 5 commits
-
-
Scott LaVarnway authored
-
Jerome Jiang authored
-
Jerome Jiang authored
Neutral on rtc set for speed 8. Neutral on ytlive for speed 5. Saves some computation cycles but no speed gain observed on Pixel. Change-Id: I34c4642cd543aa89c5b9c4bff6b7113577c64c91
-
James Zern authored
vec_vsx_ld -> load_tran_low Change-Id: Id3144cdd528d2d406a515e5812e2ea9e4db64bf1
-
Jerome Jiang authored
-
- 30 Aug, 2017 3 commits
-
-
Jerome Jiang authored
This reverts commit df9ce122. Reason for revert: Re-enabled tests still fail tsan in high bitdepth. Original change's description: > Re-enable disabled tests under TSan. > > These tests point to an already-fixed bug, this should no longer have a > data race. > > BUG=webm:1049 > > Change-Id: Iaedc5db8df99362bdc501b70ff7fdebf8756fdb8 TBR=jzern@google.com,pbos@chromium.org,builds@webmproject.org # Not skipping CQ checks because original CL landed > 1 day ago. Bug: webm:1049 Change-Id: I232f1f7726bf795b301abfb2e07cad6756642e53
-
Scott LaVarnway authored
C vs SSE2 speed gains: _4x4 : ~6.49x _8x8 : ~10.82x _16x16 : ~7.61x _32x32 : ~5.29x BUG=webm:1411 Change-Id: Ibc30c50cb7139049bf05298010803499e6ef949b
-
Scott LaVarnway authored
-
- 29 Aug, 2017 6 commits
-
-
Scott LaVarnway authored
C vs SSE2 speed gains: _4x4 : ~7.39x _8x8 : ~11.36x _16x16 : ~8.68x _32x32 : ~4.33x BUG=webm:1411 Change-Id: I7f1487cd1531d4e7f0fbb4596fed3bfb72a59d58
-
Jerome Jiang authored
-
Peter Boström authored
-
Scott LaVarnway authored
-
Scott LaVarnway authored
C vs SSE2 speed gains: _4x4 : ~8.12x _8x8 : ~9.71x _16x16 : ~8.21x _32x32 : ~5.0x BUG=webm:1422 Change-Id: I5e8a1ed4db7b8dc539b3e2a728b0b34d8b4b1993
-
Jerome Jiang authored
Neutral in borg tests. Some clips show 3-4% speed gain on 2 threads on Pixel. Change-Id: Ic959f34e44892a854551de6e9a3d9ec819ffed00
-
- 28 Aug, 2017 2 commits
-
-
Peter Boström authored
These tests point to an already-fixed bug, this should no longer have a data race. BUG=webm:1049 Change-Id: Iaedc5db8df99362bdc501b70ff7fdebf8756fdb8
-
Jerome Jiang authored
Rev d1477715 fixed the test failure. So remove the resolution condition for using source_sad in speed 6. BUG=webm:1452 Change-Id: I1efba97e1ef5bd4de5f886299f6fcb907187abcd
-
- 25 Aug, 2017 7 commits
-
-
Marco Paniconi authored
-
Marco Paniconi authored
-
Marco authored
Enable adapt_partition for vbr mode for speed 6. This allows the usage of the pickmode-based partition (used in speed 5), but only selectively for superblocks with high source sad, otherwise the faster variance based partition scheme is used. For speed 6 on ytlive set: avgPSNR/SSIM metrics up by ~0.6%, several clips up by ~1.5%. Small/negligible decrease in speed. Change-Id: I12f3efef6b3e059391de330fdbe5a44c2587f1f8
-
Marco Paniconi authored
-
Marco authored
For SVC at speed >= 7: only use the improved mv search on base spatial layer, if top layer resolution is above 640x360. ~2.3% speedup Small/negligible loss in avgPSNR metrics on rtc set. Change-Id: Iaef75a57ebf1c248931bc1aa28d20b7fecac1851
-
Marco Paniconi authored
This reverts commit f60d1dcd. Reason for revert: <INSERT REASONING HERE> Failures in AVX/VP9QuantizeTest in nightly tests. Original change's description: > quantize avx: copy 32x32 implementation > > Ensure avx and ssse3 stay in sync by testing them against each other. > > Change-Id: I699f3b48785c83260825402d7826231f475f697c TBR=slavarnway@google.com,johannkoenig@google.com,builds@webmproject.org Change-Id: Ibd38636212269328317dd0721be9d25452113d1c No-Presubmit: true No-Tree-Checks: true No-Try: true
-
Shiyou Yin authored
Merge "vpx_dsp:loongson optimize vpx_varianceWxH_c,vpx_sub_pixel_varianceWxH_c and vpx_sub_pixel_avg_varianceWxH_c with mmi."
-
- 24 Aug, 2017 8 commits
-
-
Marco Paniconi authored
-
Tom Finegan authored
This avoids an endless build loop at vpx_version.h creation time when diff is not present. Change-Id: I16ae386dbdaf14f9a2b85e4c5d1aaa6c08f52a45
-
Johann Koenig authored
-
Shiyou Yin authored
vpx_dsp:loongson optimize vpx_varianceWxH_c,vpx_sub_pixel_varianceWxH_c and vpx_sub_pixel_avg_varianceWxH_c with mmi. Change-Id: Ia576a721df6312329b599c31cfe1fb1267a9f174
-
Marco authored
For speeds < 7, increase threshold that controls the split of 16x16->8x8 blocks, for resolutions 720p and higher. Minor change for speed 5 (since it uses reference partition scheme which only uses variance partition as first step). For speed 6: ~0.5% increase in avgPSNR/SSIM metrics on ytlvie set. No change in speed. Change-Id: I5126580973201538d8ca26a9256b93c4d11d685b
-
Johann Koenig authored
-
Johann authored
Ensure avx and ssse3 stay in sync by testing them against each other. Change-Id: I699f3b48785c83260825402d7826231f475f697c
-
Johann authored
Still does not pass tests. Does match the previous assembly, although saving the sign before multiplying is dubious. Change-Id: Ia163f18c755aba542d6e93f7bf7343184660df5a
-