1. 16 Oct, 2017 1 commit
    • Linfeng Zhang's avatar
      Add 4 to 3 scaling SSSE3 optimization · 580d3224
      Linfeng Zhang authored
      Note this change will trigger the different C version on SSSE3 and
      generate different scaled output.
      
      Its speed is 2x compared with the version calling vpx_scaled_2d_ssse3().
      
      Change-Id: I17fff122cd0a5ac8aa451d84daa606582da8e194
      580d3224
  2. 10 Oct, 2017 1 commit
  3. 04 Oct, 2017 2 commits
  4. 20 Sep, 2017 1 commit
  5. 06 Sep, 2017 1 commit
    • Linfeng Zhang's avatar
      Add ScaleFrameTest · d5d2cbcc
      Linfeng Zhang authored
      Move class VpxScaleBase to new file test/vpx_scale_test.h.
      Add new file test/vp9_scale_test.cc with ScaleFrameTest.
      
      BUG=webm:1419
      
      Change-Id: Iec2098eafcef99b94047de525e5da47bcab519c1
      d5d2cbcc
  6. 21 Aug, 2017 1 commit
  7. 22 Jun, 2017 1 commit
  8. 13 Jun, 2017 2 commits
  9. 10 May, 2017 1 commit
    • Marco's avatar
      vp9: SVC: Add option to set downsampling filter type. · 4e23998f
      Marco authored
      Add option in SVC to set the filter type and phase for
      the frame level downsampling filters.
      
      For 3 spatial layers: set downsampling filter type to bilinear
      and set phase to 8, for lowest spatial layer.
      
      Change-Id: Id81f4b1ba93db19c1cd37b6a46d1281a2c61bc43
      4e23998f
  10. 02 May, 2017 1 commit
  11. 01 May, 2017 3 commits
  12. 28 Apr, 2017 1 commit
    • Johann's avatar
      Use uint32_t for accumulator · 657f3e9f
      Johann authored
      Be specific about the data type size.
      
      Use convenience macro vp9_zero_array.
      
      Change-Id: I5fadf7dbd408befb73820d85db0be4832e8cfcbd
      657f3e9f
  13. 27 Apr, 2017 1 commit
    • Johann's avatar
      vp9 temporal filter: sse4 implementation · 6dfeea65
      Johann authored
      Approximates division using multiply and shift.
      
      Speeds up both sizes (8x8 and 16x16) by 30 times.
      
      Fix the call sites to use the RTCD function.
      
      Delete sse2 and mips implementation. They were based on a previous
      implementation of the filter. It was changed in Dec 2015:
      ece4fd5d
      
      BUG=webm:1378
      
      Change-Id: I0818e767a802966520b5c6e7999584ad13159276
      6dfeea65
  14. 18 Apr, 2017 1 commit
    • Marco's avatar
      vp9: Add phase to get averaging filter for 1:2 downsampling. · 348bdc01
      Marco authored
      The scaling filter with zero shift will give sub-sampling for
      2x downsampling. Allow for a phase shift to get an averaging filter.
      
      Usage is for source scaling in 1 pass SVC mode for 1:2 downscale.
      Reduces aliasing in downsampled image.
      
      Keep the phase to 0/off for now.
      
      Change-Id: Ic547ea0748d151b675f877527e656407fcf4d51e
      348bdc01
  15. 24 Feb, 2017 2 commits
    • Johann's avatar
      consolidate block_error functions · 904b957a
      Johann authored
      vp9_highbd_block_error_8bit_c was a very simple wrapper around
      vp9_block_error_c. The SSE2 implemention was practically identical to
      the non-HBD one. It was missing some minor improvements which only
      went into the original version.
      
      In quick speed tests, the AVX implementation showed minimal
      improvement over SSE2 when it does not detect overflow. However, when
      overflow is detected the function is run a second time. The
      OperationCheck test seems to trigger this case and reverses any
      speed benefits by running ~60% slower. AVX2 on the other hand is
      always 30-40% faster.
      
      Change-Id: I9fcb9afbcb560f234c7ae1b13ddb69eca3988ba1
      904b957a
    • Johann's avatar
      block error sse2: use tran_low_t · 3c16bbb7
      Johann authored
      Change-Id: Ib04990e4a7bda9fbf501f294da2057a2b2595deb
      3c16bbb7
  16. 16 Feb, 2017 5 commits
  17. 07 Feb, 2017 1 commit
  18. 06 Feb, 2017 1 commit
  19. 05 Nov, 2016 1 commit
  20. 08 Aug, 2016 1 commit
  21. 02 Aug, 2016 1 commit
  22. 29 Jun, 2016 1 commit
  23. 18 Jun, 2016 1 commit
    • James Zern's avatar
      remove vp10 · 67edc5e8
      James Zern authored
      development has moved to the nextgenv2 branch and a snapshot from here
      was used to seed aomedia
      
      BUG=b/29457125
      
      Change-Id: Iedaca11ec7870fb3a4e50b2c9ea0c2b056a0d3c0
      67edc5e8
  24. 10 Jun, 2016 1 commit
  25. 08 Jun, 2016 1 commit
  26. 27 May, 2016 1 commit
    • Linfeng Zhang's avatar
      Upgrade fwht4x4_mmx() to fwht4x4_sse2() for vp9 and vp10. · af7fb17c
      Linfeng Zhang authored
      Function level timing test shows about 27% time saving on
      a Xeon E5-2680 v2 desktop.
      
      Rename vp9_dct_sse2.c to vp9_dct_intrin_sse2.c for vp9 and
      rename dct_sse2.c to dct_intrin_sse2.c for vp10 to avoid
      duplicate basenames.
      
      Actually vp9_fwht4x4_mmx/sse2() and vp10_fwht4x4_mmx/sse2()
      are identical. TODO: They should be unified later if there is
      no intention to keep a duplicate.
      
      Change-Id: I3e537b7bbd9ba417c606cd7c68c4dbbfa583f77d
      af7fb17c
  27. 24 May, 2016 1 commit
  28. 04 May, 2016 1 commit
  29. 02 May, 2016 1 commit
  30. 26 Apr, 2016 1 commit
    • JackyChen's avatar
      vp9: Simplify the logic in denoiser SSE2 code. · 1a53c0c9
      JackyChen authored
      Block size passed into denoiser filter is always >= BLOCK_8X8 (in
      vp9_pick_inter_mode), it is not necessary to check smaller block
      size. Passed the bitexact test on clips with different resolutions and
      noise levels.
      
      Change-Id: I19fa3195d18c27d9e5de60dc11cff1522ef3714e
      1a53c0c9
  31. 09 Feb, 2016 1 commit
    • Alex Converse's avatar
      Restore previous motion search bit-error scale. · fac947df
      Alex Converse authored
      The bit to error transformation got doubled as a result of going from
      8-bit to 9-bit costs (change d13385ce).
      
      Use defines to derive the scale numbers and comment some of the fields.
      
      derf: -0.023 BDRATE
      hevcmr: +0.067 BDRATE
      stdhd: +0.098 BDRATE
      (These are substantially smaller than than the original gains from 8 to
      9 bit costing.)
      
      Change-Id: I6a2b3b029b2f1415e4f90a05709b2333ec0eea9b
      fac947df