  1. 16 Aug, 2013 - 5 commits
    • Reduce instructions of idct4x4. · df071520
      hkuang authored
      Change-Id: Ia26a2526804e7e2f656b0051618a615fca8fc79d
    • Reduce the instructions of idct8x8. Also add the saving and restoring of D registers. · 60ecd60c
      hkuang authored
      Change-Id: Id3630c90fcb160ef939fef55411342608af5f990
    • Fixed typos and formatting · 79f4c1b9
      Adrian Grange authored
      Change-Id: I3814984a624bc64147c57efa74fbdda8eda47262
    • vp9: neon: use aligned stores in convolve functions · 4fa93bce
      Mans Rullgard authored
      The destination is block-aligned so it is safe to use aligned
      stores.
      
      Change-Id: I38261e4fa40bc60e6472edffece59e372908da7e
    • Moving from ss_txfrm_size to tx_size. · afd9bd3e
      Dmitry Kovalev authored
      Updating foreach_transformed_block_visitor and corresponding functions
      to accept tx_size instead of ss_txfrm_size. List of functions per file:
      
      vp9_decodframe.c
        decode_block
        decode_block_intra
      
      vp9_detokenize.c
        decode_block
      
      vp9_encodemb.c
        optimize_block
        vp9_xform_quant
        vp9_encode_block_intra
      
      vp9_rdopt.c
        dist_block
        rate_block
        block_yrd_txfm
      
      vp9_tokenize.c
        set_entropy_context_b
        tokenize_b
        is_skippable
      
      Change-Id: I351bf563eb36cf34db71c3f06b9bbc9a61b55b73
  2. 15 Aug, 2013 - 11 commits
    • Added resizing & initialization of last frame segment map · d5bec522
      Adrian Grange authored
      When the frame size changes the last frame segment map must
      be resized to match and initialized to 0.
      
      Change-Id: Idc10de109f55dbe9af3a6caae355a2974712243d
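The resize-and-zero step described in this commit can be sketched as follows. This is only an illustration under stated assumptions: `SegMapState` and `resize_last_frame_seg_map` are hypothetical names, and the actual codec stores and allocates the map differently.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical container for the last frame segment map. */
typedef struct {
  uint8_t *last_frame_seg_map; /* one segment id per mode-info unit */
  int mi_rows;
  int mi_cols;
} SegMapState;

/* Reallocate the map for the new frame size and initialize it to 0.
 * Returns 0 on success, -1 on invalid size or allocation failure. */
int resize_last_frame_seg_map(SegMapState *s, int mi_rows, int mi_cols) {
  size_t count;
  uint8_t *map;
  if (mi_rows <= 0 || mi_cols <= 0) return -1;
  count = (size_t)mi_rows * (size_t)mi_cols;
  map = (uint8_t *)calloc(count, sizeof(*map)); /* calloc zero-fills */
  if (map == NULL) return -1;
  free(s->last_frame_seg_map); /* old map no longer matches the frame */
  s->last_frame_seg_map = map;
  s->mi_rows = mi_rows;
  s->mi_cols = mi_cols;
  return 0;
}
```

Allocating a fresh zeroed buffer, rather than growing the old one, guarantees that no segment id from the previous frame size survives the resize.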
    • Refactor rd loop for chroma components · 68369ca8
      Jingning Han authored
      This commit makes the rate-distortion optimization search of chroma
      components consistent across all block sizes. It removes redundant
      code.
      
      Change-Id: I7e76f54d045e8efdd41d84a164c71f55b484471b
    • Converting code from using ss_txfrm_size to tx_size. · bb3b817c
      Dmitry Kovalev authored
      Updated function signatures:
        txfrm_block_to_raster_block
        txfrm_block_to_raster_xy
        extend_for_intra
        vp9_optimize_b
      
      Change-Id: I7213f4c4b1b9ec802f90621d5ba61d5e4dac5e0a
    • Using { 0 } for initialization instead of memset. · 6f4fa44c
      Dmitry Kovalev authored
      Change-Id: I4fad357465022d14bfc7e13b348c6da267587314
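The two initialization styles this commit title refers to can be compared in a short sketch. `BlockStats` is a hypothetical struct used only for illustration; it does not come from the codebase.

```c
#include <string.h>

/* Hypothetical struct, used only for illustration. */
typedef struct {
  int rate;
  int dist;
  unsigned char skip[4];
} BlockStats;

/* Old style: declare the variable, then clear it with memset. */
int sum_after_memset(void) {
  BlockStats stats;
  memset(&stats, 0, sizeof(stats));
  return stats.rate + stats.dist + stats.skip[0] + stats.skip[3];
}

/* New style: the { 0 } initializer zeroes every member at declaration. */
int sum_after_brace_init(void) {
  BlockStats stats = { 0 };
  return stats.rate + stats.dist + stats.skip[0] + stats.skip[3];
}
```

Both functions return 0. One difference worth keeping in mind: memset also clears padding bytes, which `{ 0 }` is not required to do, so the two are interchangeable only when the code never inspects the raw bytes of the struct.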
    • Renaming d27 predictor to d207. · 81d7bd50
      Dmitry Kovalev authored
      The 27-degree intra predictor is actually a 207-degree predictor, so it is renamed.
      
      Change-Id: Ife96a910437eb80ccdc0b7a5b7a62c77542ae5be
    • vp9: neon: optimise vp9_wide_mbfilter_neon · 67e53716
      Mans Rullgard authored
      Break up long dependency chains to improve instruction scheduling.
      
      Change-Id: I0e0cb66943df24af920767bb4167b25c38af9630
    • Moving segmentation struct from MACROBLOCKD to VP9_COMMON. · b7616e38
      Dmitry Kovalev authored
      VP9_COMMON is the right place for the segmentation struct because it
      holds global segmentation parameters, not something specific to
      macroblock processing.
      
      Change-Id: Ib9ada0c06c253996eb3b5f6cccf6a323fbbba708
    • Remove unused RDCOST_8X8 macro · b0646f9e
      Jingning Han authored
      Change-Id: I17c7d7eaa60fe69c543403c340f7c1078bfd339f
    • Speed feature to skip split partition based on var · 24856b6a
      Deb Mukherjee authored
      Adds a speed feature to disable split partition search based on a
      given threshold on the source variance. A tighter threshold derived
      from the threshold provided is used to also disable horizontal and
      vertical partitions.
      
      Results on derfraw300:
      threshold = 16, psnr = -0.057%, speedup ~1% (football)
      threshold = 32, psnr = -0.150%, speedup ~4-5% (football)
      threshold = 64, psnr = -0.570%, speedup ~10-12% (football)
      
      Results on stdhdraw250:
      threshold = 32, psnr = -0.18%, speedup is somewhat more than derf
      because of a larger number of smoother blocks at higher resolution.
      
      Based on these results, a threshold of 32 is chosen for speed 1,
      and a threshold of 64 is chosen for speeds 2 and above.
      
      Change-Id: If08912fb6c67fd4242d12a0d094783a99f52f6c6
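The decision logic of this speed feature might look roughly like the sketch below. The commit message does not say how the tighter threshold is derived or whether the comparison is strict, so halving the threshold and using `<` are assumptions, and `PartitionSearch` and `choose_partition_search` are hypothetical names.

```c
/* Hypothetical search flags for one block. */
typedef struct {
  int do_split; /* search the split partition */
  int do_rect;  /* search horizontal and vertical partitions */
} PartitionSearch;

PartitionSearch choose_partition_search(unsigned int source_variance,
                                        unsigned int split_var_thresh) {
  PartitionSearch ps = { 1, 1 };
  /* Assumption: the tighter threshold is half of the given one. */
  const unsigned int rect_var_thresh = split_var_thresh / 2;
  if (source_variance < split_var_thresh) {
    ps.do_split = 0; /* smooth block: skip the split search */
    if (source_variance < rect_var_thresh) {
      ps.do_rect = 0; /* very smooth block: also skip horz/vert */
    }
  }
  return ps;
}
```

With a threshold of 32 (the value chosen for speed 1), a block with source variance 40 is searched fully, one with variance 20 skips only the split search, and one with variance 10 skips the split, horizontal, and vertical searches under the halving assumption.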
    • Unify luma and chroma rd-cost estimation · ec01f52f
      Jingning Han authored
      This commit unifies the rate-distortion cost calculation process of
      luma and chroma components. It allows early termination to be enabled
      later in the rd search loop of chroma components, consistent with the
      luma component.
      
      Change-Id: I2e52a7c6496176bf2a5e3ef338d34ceb8aad9b3d
    • vp9_dx_iface: check for NULL/0-size input · 20395189
      James Zern authored
      avoids a crash caused by issue #585
      
      Change-Id: I301595ee0227699b0da6f0dad6d870dd546e94ef
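A guard of the kind this commit describes could be sketched as below. The status enum and `decode_frame_checked` are hypothetical stand-ins, not the decoder interface's actual types or entry point.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical status codes; not the library's actual error enum. */
typedef enum { DEC_OK = 0, DEC_INVALID_PARAM = 1 } DecStatus;

DecStatus decode_frame_checked(const uint8_t *data, size_t data_sz) {
  /* Reject NULL or empty input before anything dereferences it. */
  if (data == NULL || data_sz == 0) return DEC_INVALID_PARAM;
  /* Real decoding would start here; the sketch only reads the first byte. */
  (void)data[0];
  return DEC_OK;
}
```

Checking both conditions matters because a non-NULL pointer with a zero size is just as unsafe to read from as a NULL pointer.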
  3. 14 Aug, 2013 - 6 commits
  4. 13 Aug, 2013 - 5 commits
  5. 12 Aug, 2013 - 12 commits
  6. 11 Aug, 2013 - 1 commit
    • vp9: neon: optimise convolve8_horiz functions · b84dc949
      Mans Rullgard authored
      Each iteration of the horizontal loop reuses 7 of the 11 source
      values.  Loading only the 4 new values saves some time.
      
      Also add preload for source data.
      
      Overall 4% faster on Chromebook.
      
      Change-Id: I8f69e749f2b7f79e9734620dcee51dbfcd716b44
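The data reuse described in this commit can be illustrated in scalar C. An 8-tap filter producing 4 outputs per iteration reads 4 + 8 - 1 = 11 source values, and consecutive iterations share 7 of them. The sketch below is a hypothetical stand-in for the NEON routine: it omits the rounding, shifting, and clipping a real convolve would apply, and `convolve8_horiz_sketch` is an assumed name.

```c
#include <stdint.h>
#include <string.h>

#define TAPS 8
#define STEP 4
#define WINDOW (STEP + TAPS - 1) /* 11 source values per iteration */

/* Unscaled 8-tap horizontal filter with a sliding window.
 * src must hold at least w + TAPS - 1 values; w must be a multiple of STEP. */
void convolve8_horiz_sketch(const uint8_t *src, int32_t *dst, int w,
                            const int16_t filter[TAPS]) {
  uint8_t win[WINDOW];
  int x, i, k;
  memcpy(win, src, WINDOW); /* the first iteration loads all 11 values */
  for (x = 0; x < w; x += STEP) {
    for (i = 0; i < STEP; ++i) {
      int32_t sum = 0;
      for (k = 0; k < TAPS; ++k) sum += win[i + k] * filter[k];
      dst[x + i] = sum;
    }
    if (x + STEP < w) {
      /* Keep the 7 values shared with the next iteration... */
      memmove(win, win + STEP, WINDOW - STEP);
      /* ...and load only the 4 new ones. */
      memcpy(win + WINDOW - STEP, src + x + WINDOW, STEP);
    }
  }
}
```

In the NEON version the window would live in registers rather than in a byte array, so the saving comes from issuing fewer loads per iteration; the scalar sketch only shows which values are kept and which are fetched.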