  1. 18 Jul, 2012 1 commit
  2. 23 Jun, 2012 2 commits
  3. 08 Jun, 2012 1 commit
  4. 21 May, 2012 1 commit
  5. 10 May, 2012 1 commit
    • rv40dsp x86: MMX/MMX2/3DNow/SSE2/SSSE3 implementations of MC · 110d0cdc
      Christophe Gisquet authored
      
      
      Code mostly inspired by vp8's MC, however:
      - its MMX2 horizontal filter is worse because it can't take advantage of
        the coefficient redundancy
      - that same coefficient redundancy allows better code for non-SSSE3 versions
      
      Benchmark (rounded to tens of units):
             V8x8  H8x8  2D8x8  V16x16  H16x16  2D16x16
      C       445   358    985    1785    1559     3280
      MMX*    219   271    478     714     929     1443
      SSE2    131   158    294     425     515      892
      SSSE3   120   122    248     387     390      763
      
      The end result is an overall speedup of around 15% for the SSSE3 version
      (measured on 6 sequences). All loop filter functions now take around 55%
      of decoding time, while luma MC dsp functions take around 6%, chroma ones
      1.3%, and biweight around 2.3%.
      Signed-off-by: Diego Biurrun <diego@biurrun.de>
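The "coefficient redundancy" mentioned in the commit above can be illustrated with a small scalar sketch. It assumes a 6-tap filter whose taps are (1, -5, c1, c2, -5, 1), so the two outer pairs share coefficients and can be summed before multiplying. The tap layout, the example values (52, 20, shift 6) and (20, 20, shift 5), and the names `sixtap` and `filter_row` are assumptions made for illustration; none of them is stated in the commit message, and this is not the committed assembly.

```c
#include <stdint.h>

/* Clamp an intermediate sum to the 8-bit pixel range. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* One output pixel of a 6-tap filter with taps (1, -5, c1, c2, -5, 1).
 * Assumed tap layout, for illustration only.
 * src needs 2 readable samples before it and 3 after it. */
static uint8_t sixtap(const uint8_t *src, int c1, int c2, int shift)
{
    /* Shared coefficients: add each symmetric pair first, multiply once. */
    int outer = src[-2] + src[3];   /* both weighted by  1 */
    int inner = src[-1] + src[2];   /* both weighted by -5 */
    int sum   = outer - 5 * inner + c1 * src[0] + c2 * src[1]
              + (1 << (shift - 1)); /* rounding term */

    /* Avoid shifting a negative value; it clamps to 0 anyway. */
    return sum < 0 ? 0 : clip_u8(sum >> shift);
}

/* Horizontal pass over one row of w pixels. */
static void filter_row(uint8_t *dst, const uint8_t *src, int w,
                       int c1, int c2, int shift)
{
    for (int x = 0; x < w; x++)
        dst[x] = sixtap(src + x, c1, c2, shift);
}
```

With paired taps, a SIMD version needs two additions and one multiply for the four outer samples instead of four multiplies, which is one plausible reading of why the non-SSSE3 paths benefit.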
  6. 28 Apr, 2012 1 commit
  7. 21 Apr, 2012 2 commits
  8. 04 Apr, 2012 1 commit
  9. 25 Mar, 2012 4 commits
  10. 05 Mar, 2012 1 commit
  11. 15 Feb, 2012 1 commit
  12. 30 Jan, 2012 1 commit
    • x86 dsputil: provide SSE2/SSSE3 versions of bswap_buf · 6b039003
      Christophe Gisquet authored
      
      
      While pshufb allows emulating bswap on XMM registers for SSSE3, more
      shuffling is needed for SSE2. Alignment is critical, so specific codepaths
      are provided for this case.
      
      For the huffyuv sequence "angels_480-huffyuvcompress.avi":
      C (using bswap instruction): ~ 55k cycles
      SSE2:                        ~ 40k cycles
      SSSE3 using unaligned loads: ~ 35k cycles
      SSSE3 using aligned loads:   ~ 30k cycles
      Signed-off-by: Diego Biurrun <diego@biurrun.de>
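To make the pshufb remark in the commit above concrete, here is a portable C sketch of the idea: a 16-byte shuffle with a mask that reverses the bytes inside each 32-bit lane gives the same result as a per-word byte swap. The function `shuffle16` only models the byte selection of a shuffle instruction in plain C (it does not model pshufb's zeroing on a set high bit), and all names here are hypothetical; the actual commit is x86 assembly and also handles alignment, which this sketch ignores.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Scalar reference: byte-swap each 32-bit word of src into dst. */
static void bswap_buf_ref(uint32_t *dst, const uint32_t *src, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        uint32_t v = src[i];
        dst[i] = (v >> 24) | ((v >> 8) & 0x0000ff00u) |
                 ((v << 8) & 0x00ff0000u) | (v << 24);
    }
}

/* Plain-C model of a 16-byte shuffle: out[i] = in[mask[i]]. */
static void shuffle16(uint8_t out[16], const uint8_t in[16],
                      const uint8_t mask[16])
{
    for (int i = 0; i < 16; i++)
        out[i] = in[mask[i] & 15];
}

/* Mask reversing the byte order within each of the four 32-bit lanes. */
static const uint8_t bswap32_mask[16] = {
    3, 2, 1, 0,  7, 6, 5, 4,  11, 10, 9, 8,  15, 14, 13, 12
};

/* Swap four words per step with the shuffle, then finish the tail
 * with the scalar reference. */
static void bswap_buf_shuffle(uint32_t *dst, const uint32_t *src, size_t n)
{
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        uint8_t in[16], out[16];
        memcpy(in, src + i, 16);
        shuffle16(out, in, bswap32_mask);
        memcpy(dst + i, out, 16);
    }
    bswap_buf_ref(dst + i, src + i, n - i);
}
```

Because the mask only reorders bytes within a word, the result does not depend on host endianness. SSE2 has no byte shuffle with an arbitrary mask, which matches the commit's note that more shuffling is needed there.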
  13. 29 Jan, 2012 1 commit
  14. 25 Jan, 2012 1 commit
  15. 14 Dec, 2011 1 commit
  16. 22 Nov, 2011 1 commit
  17. 11 Nov, 2011 1 commit
  18. 07 Nov, 2011 1 commit
  19. 26 Oct, 2011 1 commit
  20. 11 Oct, 2011 1 commit
  21. 15 Aug, 2011 1 commit
  22. 11 Aug, 2011 1 commit
  23. 29 Jul, 2011 1 commit
  24. 21 Jul, 2011 1 commit
  25. 20 Jul, 2011 1 commit
  26. 18 Jul, 2011 1 commit
  27. 10 Jul, 2011 1 commit
  28. 08 Jul, 2011 1 commit
  29. 04 Jul, 2011 2 commits
  30. 03 Jul, 2011 1 commit
  31. 01 Jul, 2011 1 commit
  32. 28 Jun, 2011 1 commit
  33. 18 Jun, 2011 2 commits