1. 03 Nov, 2017 1 commit
  2. 26 Oct, 2017 1 commit
  3. 24 Oct, 2017 1 commit
    • Kyle Siefring's avatar
      Optimize convolve8 SSSE3 and AVX2 intrinsics · ae35425a
      Kyle Siefring authored
      Changed the intrinsics to perform summation similiar to the way the assembly does.
      
      The new code diverges from the assembly by preferring unsaturated additions.
      
      Results for haswell
      
      SSSE3
      Horiz/Vert  Size  Speedup
      Horiz       x4    ~32%
      Horiz       x8    ~6%
      Vert        x8    ~4%
      
      AVX2
      Horiz/Vert  Size  Speedup
      Horiz       x16   ~16%
      Vert        x16   ~14%
      
      BUG=webm:1471
      
      Change-Id: I7ad98ea688c904b1ba324adf8eb977873c8b8668
      ae35425a
  4. 23 Oct, 2017 1 commit
  5. 20 Oct, 2017 1 commit
  6. 19 Oct, 2017 1 commit
  7. 17 Oct, 2017 1 commit
  8. 16 Oct, 2017 1 commit
    • Linfeng Zhang's avatar
      Add 4 to 3 scaling SSSE3 optimization · 580d3224
      Linfeng Zhang authored
      Note this change will trigger the different C version on SSSE3 and
      generate different scaled output.
      
      Its speed is 2x compared with the version calling vpx_scaled_2d_ssse3().
      
      Change-Id: I17fff122cd0a5ac8aa451d84daa606582da8e194
      580d3224
  9. 10 Oct, 2017 1 commit
  10. 09 Oct, 2017 1 commit
  11. 08 Oct, 2017 1 commit
    • Kyle Siefring's avatar
      Add AVX2 version of vpx_convolve8_avg. · 9ca06bcd
      Kyle Siefring authored
      vpx_convolve8_avg works by first running a normal horizontal filter then a
      vertical filter averages at the end.
      
      The added vpx_convolve8_avg_avx2 calls pre-existing AVX2 code for the
      horizontal step.
      
      vpx_convolve8_avg_vert_avx2 is also added, but only uses ssse3 code.
      
      Change-Id: If5160c0c8e778e10de61ee9bf42ee4be5975c983
      9ca06bcd
  12. 04 Oct, 2017 1 commit
  13. 03 Oct, 2017 4 commits
  14. 30 Sep, 2017 1 commit
  15. 29 Sep, 2017 1 commit
  16. 28 Sep, 2017 1 commit
  17. 27 Sep, 2017 2 commits
  18. 26 Sep, 2017 1 commit
  19. 22 Sep, 2017 1 commit
  20. 20 Sep, 2017 3 commits
  21. 19 Sep, 2017 4 commits
  22. 18 Sep, 2017 1 commit
  23. 14 Sep, 2017 1 commit
    • Kaustubh Raste's avatar
      mips msa clean-up msa macros · 4ca8f8f5
      Kaustubh Raste authored
      Removed inline for GP load-store in case of (__mips_isa_rev >= 6)
      Created one define LD_V for vector load and ST_V for vector store
      
      Change-Id: Ifec3570fa18346e39791b0dd622892e5c18bd448
      4ca8f8f5
  24. 13 Sep, 2017 1 commit
  25. 12 Sep, 2017 2 commits
    • Johann's avatar
      Revert "Revert "quantize avx: copy 32x32 implementation"" · eb4238ac
      Johann authored
      This reverts commit 8c42237b.
      
      Because ssse3 code is used for the reference, the qcoeff and dqcoeff
      reference buffers must be aligned.
      
      Original change's description:
      > quantize avx: copy 32x32 implementation
      >
      > Ensure avx and ssse3 stay in sync by testing them against each other.
      >
      > Change-Id: I699f3b48785c83260825402d7826231f475f697c
      
      Change-Id: Ieeef11b9406964194028b0d81d84bcb63296ae06
      eb4238ac
    • Kaustubh Raste's avatar
      Optimize mips msa vp9 average mc functions · 30f1ff94
      Kaustubh Raste authored
      Load the specific destination loads instead of vector load
      
      Change-Id: I65ca13ae8f608fad07121fef848e2a18f54171fe
      30f1ff94
  26. 11 Sep, 2017 2 commits
  27. 09 Sep, 2017 2 commits
  28. 07 Sep, 2017 1 commit