1. 01 Dec, 2017 2 commits
  2. 29 Nov, 2017 3 commits
  3. 27 Nov, 2017 1 commit
  4. 17 Nov, 2017 1 commit
  5. 15 Nov, 2017 1 commit
    • Johann's avatar
      fwd txfm ssse3: use GLOBAL() for loading constants · 3e3a5686
      Johann authored
      Fixes a build issue when relocation is not allowed:
      relocation R_X86_64_32 against '.rodata' can not be used when making a shared object
      
      Change-Id: Ica3e90c926847bc384e818d7854f0030f4d69aa0
      3e3a5686
  6. 10 Nov, 2017 1 commit
    • Scott LaVarnway's avatar
      vpx: [x86] add vpx_satd_avx2() · 8e602284
      Scott LaVarnway authored
      SSE2 instrinsic vs AVX2 intrinsic speed gains:
      blocksize   16: ~1.33
      blocksize   64: ~1.51
      blocksize  256: ~3.03
      blocksize 1024: ~3.71
      
      Change-Id: I79b28cba82d21f9dd765e79881aa16d24fd0cb58
      8e602284
  7. 09 Nov, 2017 1 commit
  8. 03 Nov, 2017 1 commit
  9. 26 Oct, 2017 1 commit
  10. 24 Oct, 2017 1 commit
    • Kyle Siefring's avatar
      Optimize convolve8 SSSE3 and AVX2 intrinsics · ae35425a
      Kyle Siefring authored
      Changed the intrinsics to perform summation similiar to the way the assembly does.
      
      The new code diverges from the assembly by preferring unsaturated additions.
      
      Results for haswell
      
      SSSE3
      Horiz/Vert  Size  Speedup
      Horiz       x4    ~32%
      Horiz       x8    ~6%
      Vert        x8    ~4%
      
      AVX2
      Horiz/Vert  Size  Speedup
      Horiz       x16   ~16%
      Vert        x16   ~14%
      
      BUG=webm:1471
      
      Change-Id: I7ad98ea688c904b1ba324adf8eb977873c8b8668
      ae35425a
  11. 23 Oct, 2017 1 commit
  12. 20 Oct, 2017 1 commit
  13. 19 Oct, 2017 1 commit
  14. 17 Oct, 2017 1 commit
  15. 16 Oct, 2017 1 commit
    • Linfeng Zhang's avatar
      Add 4 to 3 scaling SSSE3 optimization · 580d3224
      Linfeng Zhang authored
      Note this change will trigger the different C version on SSSE3 and
      generate different scaled output.
      
      Its speed is 2x compared with the version calling vpx_scaled_2d_ssse3().
      
      Change-Id: I17fff122cd0a5ac8aa451d84daa606582da8e194
      580d3224
  16. 10 Oct, 2017 1 commit
  17. 09 Oct, 2017 1 commit
  18. 08 Oct, 2017 1 commit
    • Kyle Siefring's avatar
      Add AVX2 version of vpx_convolve8_avg. · 9ca06bcd
      Kyle Siefring authored
      vpx_convolve8_avg works by first running a normal horizontal filter then a
      vertical filter averages at the end.
      
      The added vpx_convolve8_avg_avx2 calls pre-existing AVX2 code for the
      horizontal step.
      
      vpx_convolve8_avg_vert_avx2 is also added, but only uses ssse3 code.
      
      Change-Id: If5160c0c8e778e10de61ee9bf42ee4be5975c983
      9ca06bcd
  19. 04 Oct, 2017 1 commit
  20. 03 Oct, 2017 4 commits
  21. 30 Sep, 2017 1 commit
  22. 29 Sep, 2017 1 commit
  23. 28 Sep, 2017 1 commit
  24. 27 Sep, 2017 2 commits
  25. 26 Sep, 2017 1 commit
  26. 22 Sep, 2017 1 commit
  27. 20 Sep, 2017 3 commits
  28. 19 Sep, 2017 4 commits