1. 30 Nov, 2015 1 commit
    • Jian Zhou's avatar
      SSE2 speed up of h_predictor_4x4 · 9d29d762
      Jian Zhou authored
      Relocate h_predictor_4x4 from SSSE3 to SSE2 with XMM registers.
      Speed up by ~25% in ./test_intra_pred_speed.
      
      Change-Id: I64e14c13b482a471449be3559bfb0da45cf88d9d
      9d29d762
  2. 25 Nov, 2015 1 commit
  3. 19 Nov, 2015 1 commit
    • Jian Zhou's avatar
      Speed up tm_predictor_4x4 · 79b68626
      Jian Zhou authored
      tm_predictor_4x4 is implemented with SSE2 using XMM registers.
      Speed up by ~25% in ./test_intra_pred_speed.
      
      Change-Id: I25074b78d476a2cb17f81cf654bdfd80df2070e0
      79b68626
  4. 18 Nov, 2015 1 commit
  5. 11 Nov, 2015 1 commit
  6. 10 Nov, 2015 1 commit
  7. 20 Oct, 2015 1 commit
    • Geza Lore's avatar
      Optimize vpx_quantize_{b,b_32x32} assembler. · 9cfba09a
      Geza Lore authored
      Added optimization of the 8 bit assembly quantizer routines. This makes
      these functions up to 100% faster, depending on encoding parameters.
      
      This patch maskes the encoder faster in both the high bitdepth and 8bit
      configurations. In the high bitdepth configuration, it effects profile 0
      only.
      
      Based on my profiling using 1080p input the net gain is between 1-3% for
      the 8 bit config, and around 2.5-4.5% for the high bitdepth config,
      depending on target bitrate. The difference between the 8 bit and high
      bitdepth configurations for the same encoder run is reduced by 1% in all
      cases I have profiled.
      
      Change-Id: I86714a6b7364da20cd468cd784247009663a5140
      9cfba09a
  8. 16 Oct, 2015 1 commit
  9. 09 Oct, 2015 1 commit
  10. 06 Oct, 2015 1 commit
    • Julia Robson's avatar
      SSSE3 optimisation for quantize in high bit depth · 37c68efe
      Julia Robson authored
      When configured with high bit detpth enabled, the 8bit quantize
      function stopped using optimised code. This made 8bit content
      decode slowly. This commit re-enables the SSSE3 optimisations.
      
      Change-Id: I194b505dd3f4c494e5c5e53e020f5d94534b16b5
      37c68efe
  11. 05 Oct, 2015 1 commit
    • Julia Robson's avatar
      SSE2 optimisation for quantize in high bit depth · 5e6533e7
      Julia Robson authored
      When configured with high bit detpth enabled, the 8bit quantize
      function stopped using optimised code. This made 8bit content
      decode slowly. This commit re-enables the SSE2 optimisation
      (but not the SSSE3 optimisation).
      
      Change-Id: Id015fe3c1c44580a4bff3f4bd985170f2806a9d9
      5e6533e7
  12. 01 Oct, 2015 1 commit
    • Ronald S. Bultje's avatar
      vp10: reimplement d45/4x4 to match vp8 instead of vp9. · 62a15795
      Ronald S. Bultje authored
      This is more a proof of concept than anything else. The problem here
      isn't so much how to code it, but rather where to place the resulting
      code. All intrapred DSP code lives in vpx_dsp, so do we want the vp10
      specific intra pred functions to live there, or in vp10/?
      
      See issue 1015.
      
      Change-Id: I675f7badcc8e18fd99a9553910ecf3ddf81f0a05
      62a15795
  13. 30 Sep, 2015 2 commits
    • Ronald S. Bultje's avatar
      vp8: change build_intra4x4_predictors() to use vpx_dsp. · c26a9eca
      Ronald S. Bultje authored
      I've added a few new functions (d45e, d63e, he, ve) to cover the
      filtered h/v 4x4 predictors that are vp8-specific, the "correct"
      d45 with the correctly filtered bottom-right pixel (as opposed to
      the unfiltered version in vp9), and the "broken" d63 with weirdly
      filtered bottom-right pixels (which is correctly filtered in vp9).
      
      There may be a minor performance impact on all systems because we
      have to do an extra copy of the Above pixel array to incorporate
      the topleft pixel in the same array (thus fitting the vpx_dsp API).
      In addition, armv6 will have a more serious performance impact b/c
      I removed the armv6/vp8-specific assembly. I'm not sure anyone
      cares...
      
      Change-Id: I7f9e5ebee11d8e21aca2cd517a69eefc181b2e86
      c26a9eca
    • Ronald S. Bultje's avatar
      vp8: change build_intra_predictors_mby_s to use vpx_dsp. · 54d48955
      Ronald S. Bultje authored
      Change-Id: I2000820e0c04de2c975d370a0cf7145330289bb2
      54d48955
  14. 29 Sep, 2015 1 commit
    • Julia Robson's avatar
      Accelerated transform in high bit depth · 406030d1
      Julia Robson authored
      When configured with high bitdepth enabled, the 8bit transform
      stopped using optimised code. This made 8bit content decode slowly.
      
      Change-Id: I67d91f9b212921d5320f949fc0a0d3f32f90c0ea
      406030d1
  15. 27 Aug, 2015 1 commit
    • Johann's avatar
      Add sse2 versions of halfpix variance · a28b2c6f
      Johann authored
      These were lost in the great sub pixel variance move of
      6a82f0d7
      
      Not having these functions caused a ~10% performance regression in
      some realtime vp8 encodes.
      
      Change-Id: I50658483d9198391806b27899f2c0d309233c4b5
      a28b2c6f
  16. 18 Aug, 2015 1 commit
  17. 17 Aug, 2015 1 commit
  18. 14 Aug, 2015 1 commit
  19. 12 Aug, 2015 1 commit
    • Jingning Han's avatar
      Fork VP9 and VP10 codebase · 3ee6db6c
      Jingning Han authored
      This commit folks the VP9 and VP10 codebase and makes libvpx
      support VP8, VP9, and VP10.
      
      Change-Id: I81782e0b809acb3c9844bee8c8ec8f4d5e8fa356
      3ee6db6c
  20. 07 Aug, 2015 3 commits
  21. 04 Aug, 2015 2 commits
    • Jingning Han's avatar
      Change vp9_quantize to vpx_quantize · d621de7e
      Jingning Han authored
      This commit clears all the vp9_ prefix use case in vpx_dsp. It gets
      the vp9 folder ready to branch out vp10.
      
      Change-Id: I2906eec179ee792b4af8c9b4161313653050e931
      d621de7e
    • Jingning Han's avatar
      Replace vp9_ prefix with vpx_ prefix in vpx_dsp function names · 08a453b9
      Jingning Han authored
      This commit clears the function naming convention in vpx_dsp. It
      replaces vp9_ prefix of global functions with vpx_ prefix. It also
      removes the vp9_ prefix from static functions.
      
      Change-Id: I6394359a63b71a51dda01342eec6a3cc08dfeedf
      08a453b9
  22. 03 Aug, 2015 1 commit
  23. 31 Jul, 2015 3 commits
    • Jingning Han's avatar
      Factor inverse transform functions into vpx_dsp · e8b133c7
      Jingning Han authored
      This commit moves the module inverse transform functions from vp9
      to vpx_dsp folder. The hybrid transform wrapper functions stay in
      the vp9 folder, since it involves codec-specific data structures.
      
      Change-Id: Ib066367c953d3d024c73ba65157bbd70a95c9ef8
      e8b133c7
    • Zoe Liu's avatar
      Refactor mips/dspr2 on convolution. · 7cfdc003
      Zoe Liu authored
      Change-Id: If59a39d5a92c261537342726f94bb7f7f26dfff3
      7cfdc003
    • Zoe Liu's avatar
      Code refactor on InterpKernel · 7186a2dd
      Zoe Liu authored
      It in essence refactors the code for both the interpolation
      filtering and the convolution. This change includes the moving
      of all the files as well as the changing of the code from vp9_
      prefix to vpx_ prefix accordingly, for underneath architectures:
      (1) x86;
      (2) arm/neon; and
      (3) mips/msa.
      The work on mips/drsp2 will be done in a separate change list.
      
      Change-Id: Ic3ce7fb7f81210db7628b373c73553db68793c46
      7186a2dd
  24. 30 Jul, 2015 1 commit
  25. 28 Jul, 2015 3 commits
  26. 27 Jul, 2015 2 commits
  27. 24 Jul, 2015 1 commit
  28. 22 Jul, 2015 1 commit
  29. 17 Jul, 2015 2 commits
    • Yunqing Wang's avatar
      Migrate quantization functions from vp9/ to vpx_dsp/ · 38f1fbbb
      Yunqing Wang authored
      The following quantization functions were moved:
      vp9_quantize_b
      vp9_quantize_b_32x32
      vp9_highbd_quantize_b
      vp9_highbd_quantize_b_32x32
      
      vp9_quantize_dc
      vp9_quantize_dc_32x32
      vp9_highbd_quantize_dc
      vp9_highbd_quantize_dc_32x32
      
      The purpose of doing that was to allow these functions to be shared
      by multiple codecs.
      
      Change-Id: Id8ab939f283353cdd07bd930d47db3d932a5d87f
      38f1fbbb
    • Jingning Han's avatar
      Rename loop filter function from vp9_ to vpx_ · 2992739b
      Jingning Han authored
      Change-Id: I6f424bb8daec26bf8482b5d75dd9b0e45c11a665
      2992739b
  30. 16 Jul, 2015 1 commit