1. 29 Oct, 2015 1 commit
  2. 23 Oct, 2015 1 commit
  3. 01 Oct, 2015 1 commit
  4. 30 Sep, 2015 1 commit
    • Debargha Mukherjee's avatar
      Speed up of DST and the search in ext_tx · 3e8cceb3
      Debargha Mukherjee authored
      Adds an early termination to the ext_tx search, and also
      implements the DST transforms more efficiently.
      
      About 4 times faster with the ext-tx experiment.
      
      There is a 0.09% drop in performance on derflr from 1.735% to
      1.648%, but worth it with the speedup achieved.
      
      Change-Id: I2ede9d69c557f25e0a76cd5d701cc0e36e825c7c
      3e8cceb3
  5. 29 Sep, 2015 1 commit
    • Yaowu Xu's avatar
      Merged branch 'master' into nextgenv2 · 7c514e2d
      Yaowu Xu authored
      Resolved Conflicts in the following files:
              configure
              vp10/common/idct.c
              vp10/encoder/dct.c
              vp10/encoder/encodemb.c
              vp10/encoder/rdopt.c
      
      Change-Id: I4cb3986b0b80de65c722ca29d53a0a57f5a94316
      7c514e2d
  6. 25 Sep, 2015 1 commit
  7. 15 Sep, 2015 1 commit
  8. 10 Sep, 2015 1 commit
    • Debargha Mukherjee's avatar
      Comprehensive support for symmetric DST · 4ce81d66
      Debargha Mukherjee authored
      Creates new hybrid transforms combining symmetric DST with
      ADST and DCT. Thus a total of 16 transforms are supported.
      
      derfl: +1.659% (up about 0.2%)
      
      Change-Id: Idde1cecdb59527890bf05da740099c3f6a5b9764
      4ce81d66
  9. 09 Sep, 2015 1 commit
  10. 24 Aug, 2015 1 commit
    • hui su's avatar
      Refactoring on transform types · d76e5b36
      hui su authored
      Prepare for adding more transform varieties (EXT_TX and TX_SKIP in nextgen).
      
      Change-Id: I2dfe024f6be7a92078775917092ed62abc2e7d1e
      d76e5b36
  11. 12 Aug, 2015 3 commits
  12. 10 Aug, 2015 1 commit
  13. 04 Aug, 2015 1 commit
  14. 31 Jul, 2015 1 commit
    • Jingning Han's avatar
      Factor inverse transform functions into vpx_dsp · e8b133c7
      Jingning Han authored
      This commit moves the module inverse transform functions from vp9
      to vpx_dsp folder. The hybrid transform wrapper functions stay in
      the vp9 folder, since it involves codec-specific data structures.
      
      Change-Id: Ib066367c953d3d024c73ba65157bbd70a95c9ef8
      e8b133c7
  15. 13 May, 2015 1 commit
    • Johann's avatar
      Relocate memory operations for common code · 1d7ccd53
      Johann authored
      With the sad functions, and hopefully the variance functions soon,
      moving to the vpx_dsp location, place the defines used in the
      reference C code in a common location.
      
      Change-Id: I4c8ce7778eb38a0a3ee674d2f1c488eda01cfeca
      1d7ccd53
  16. 29 Apr, 2015 1 commit
    • James Zern's avatar
      vpx_mem: remove vpx_memset · f58011ad
      James Zern authored
      vestigial. replace instances with memset() which they already were being
      defined to.
      
      Change-Id: Ie030cfaaa3e890dd92cf1a995fcb1927ba175201
      f58011ad
  17. 06 Jan, 2015 1 commit
  18. 24 Nov, 2014 1 commit
    • Peter de Rivaz's avatar
      Refactored idct routines and headers · 3a8c43a4
      Peter de Rivaz authored
      This change is made in preparation for a
      subsequent patch which adds acceleration
      for the highbitdepth transform functions.
      
      The highbitdepth transform functions attempt
      to use 16/32bit sse instructions where possible,
      but fallback to using the C implementations if
      potential overflow is detected.  For this reason
      the dct routines are made global so they can be
      called from the acceleration functions in the
      subsequent patch.
      
      Change-Id: Ia921f191bf6936ccba4f13e8461624b120c1f665
      (cherry picked from commit 454342d4)
      3a8c43a4
  19. 07 Nov, 2014 1 commit
  20. 05 Nov, 2014 1 commit
  21. 09 Oct, 2014 1 commit
  22. 06 Oct, 2014 1 commit
  23. 04 Oct, 2014 1 commit
  24. 03 Oct, 2014 1 commit
    • Deb Mukherjee's avatar
      Incorporate WRAPLOW macro into non-highbitdepth tx · d50716fa
      Deb Mukherjee authored
      Incorporates the WRAPLOW macro into the non-highbitdepth transforms
      to aid hardware verification between a software C model and an
      intended hardware implementation though the use of the configure
      options: --enable-experimental --enable-emulate-hardware.
      Note that to avoid further discrepancies between the sse/sse2
      implementations of the transforms and the C implementation, when the
      emulate hardware option is invoked, we also disable sse/sse2/etc.
      
      Also incudes some minor cleanups/renaming etc.
      
      Change-Id: Ib864d8493313927d429cce402982f1c8e45b3287
      d50716fa
  25. 30 Sep, 2014 1 commit
    • Jingning Han's avatar
      Remove redundant header file declaration · 0829d2be
      Jingning Han authored
      Some header file in vp9_idct.c has been included in vp9_idct.h.
      This commit removes these redundant declarations.
      
      Change-Id: I0238c27e4efff5c981eb437022c6bc6970c4e445
      0829d2be
  26. 12 Sep, 2014 1 commit
    • Deb Mukherjee's avatar
      Adds high bitdepth transform functions and tests · 10783d4f
      Deb Mukherjee authored
      Adds various high bitdepth transform functions and tests.
      Much of the changes are related to using typedefs tran_low_t
      and tran_high_t for the final transform cofficients and intermediate
      stages of the transform computation respectively rather than fixed
      types int16_t/int. When vp9_highbitdepth configure flag is off,
      these map tp int16_t/int32_t, but when the flag is on, they map
      to int32_t/int64_t to make space for needed extra precision.
      
      Change-Id: I3c56de79e15b904d6f655b62ffae170729befdd8
      10783d4f
  27. 08 May, 2014 1 commit
    • Jingning Han's avatar
      Change eob threshold for partial inverse 8x8 2D-DCT to 12 · 41a350a8
      Jingning Han authored
      The scanning order has the first 12 coefficients of the 8x8 2D-DCT
      sitting in the top left 4x4 block. Hence the partial inverse 8x8
      2D-DCT allows to handle cases with eob below 12.
      
      The overall runtime of the inverse 8x8 2D-DCT unit is reduced from
      166 cycles (using SSE2) to 150 cycles (using SSSE3).
      
      Change-Id: I4514f9748042809ac84df4c14382c00f313f1cd2
      41a350a8
  28. 28 Jan, 2014 1 commit
  29. 20 Nov, 2013 1 commit
  30. 15 Nov, 2013 1 commit
  31. 24 Oct, 2013 1 commit
    • Yunqing Wang's avatar
      Add 32x32 idct function for eob<=34 case · f88315cb
      Yunqing Wang authored
      When only upper-left 8x8 area has non-zero dct coefficients, we
      could skip 1D IDCT for 9th to 32th rows to save operations. This
      function is called when eob <= 34.
      
      Change-Id: I9684b75947bdde346cfe3720f08a953aa7a13fb5
      f88315cb
  32. 12 Oct, 2013 1 commit
  33. 11 Oct, 2013 3 commits
  34. 10 Oct, 2013 2 commits
    • Dmitry Kovalev's avatar
      Removing vp9_idct4_1d_sse2 function. · ddf1b762
      Dmitry Kovalev authored
      We have two SSE2-optimized functions for idct4_1d:
        vp9_idct4_1d_sse2 <-- removing this one
        idct4_1d_sse2
      
      vp9_idct4_1d_sse2 was used only by the following functions which already
      have SSE2 optimized variants:
        vp9_idct4x4_16_add_c   -> vp9_idct4x4_16_add_see2
        idct8_1d               -> vp9_idct8x8_{16, 10, 1}_see2
        vp9_short_iht4x4_add_c -> vp9_short_iht4x4_add_see2
      
      Change-Id: Ib0a7f6d1373dbaf7a4a41208cd9d0671fdf15edb
      ddf1b762
    • Dmitry Kovalev's avatar
      Giving consistent names to IDCT 32x32 functions. · 1e766b50
      Dmitry Kovalev authored
      Renames:
        vp9_short_idct32x32_add   -> vp9_idct32x32_1024_add
        vp9_short_idct32x32_1_add -> vp9_idct32x32_1_add
        vp9_idct_add_32x32        -> vp9_idct32x32_add
      
      Change-Id: Id85306f5814bac6c47463a6b5901a93082510666
      1e766b50
  35. 08 Oct, 2013 1 commit
    • Jingning Han's avatar
      All zero coeff skip in IDCT 32x32 · 6594ca88
      Jingning Han authored
      When all coefficients are zeros, skip the corresponding 1-D inverse
      transform. This practice has been used in the SSE2 implementation of
      inverse 32x32 DCT. This commit imports this algorithm into the C code.
      
      Change-Id: I0f58bfcb183a569fab85d524d5d9cf8ae8653f86
      6594ca88