1. 19 Jul, 2013 - 3 commits
  2. 18 Jul, 2013 - 9 commits
  3. 17 Jul, 2013 - 14 commits
    • Dmitry Kovalev's avatar
      Removing kf_{y, uv}_mode_prob arrays from VP9Common. · f9f453ec
      Dmitry Kovalev authored
      These arrays have constant values (no any updates). Removing two
      corresponding memcpy calls. Making a little cleanup in vp9_entropymode.h
      as well: removing redundant 'extern' keyword and moving all function
      declarations at the end.
      
      Change-Id: Ia16b38b46aec2e2500f5df29c40a297ae241dede
      f9f453ec
    • Yunqing Wang's avatar
      Remove unnecessary calling of vp9_init_quantizer() · 3798db88
      Yunqing Wang authored
      vp9_init_quantizer() is called in vp9_create_compressor(), and
      should not be called in vp9_set_speed_features().
      
      Change-Id: Ic2f1f4b0531b9d46bb841d7e1d8da9812207dad6
      3798db88
    • Ronald S. Bultje's avatar
      Add a best_yrd shortcut in splitmv mode search. · c6917528
      Ronald S. Bultje authored
      Encoding of first 50 frames of bus (speed 0) @ 1500kbps goes from
      1min6.2 to 1min5.9, i.e. 0.5% faster overall.
      
      Change-Id: I59d8a3b2f0a75010fa041d5e2646c8caac5bd683
      c6917528
    • Ronald S. Bultje's avatar
      Skip redundant nearest/near/zero encodes in splitmv. · 161c9956
      Ronald S. Bultje authored
      Encode of first 50 frames of bus @ 1500kbps (speed 0) goes from
      1min7.3 to 1min6.2, i.e. 1.7% faster overall.
      
      Change-Id: I19d2deacfbffadd61d32551cee9586757ab4a987
      161c9956
    • Yaowu Xu's avatar
      changed mode checking order · 42facc29
      Yaowu Xu authored
      Change-Id: Ic4c4b363ed840935e42f495f13ea5e601a56f1b2
      42facc29
    • Ronald S. Bultje's avatar
      Skip nearest/near/zero redundant encodes. · 8fea880b
      Ronald S. Bultje authored
      Encode of first 50 frames of bus @ 1500kbps (speed 0) goes from 1min12.8
      to 1min7.3, i.e. 8% faster.
      
      Change-Id: Ia22d1c7b687316c553cc60eacae988b24e175b62
      8fea880b
    • Yunqing Wang's avatar
      Enable disable_splitmv feature for other speeds · 10e83b07
      Yunqing Wang authored
      Added disable_splitmv feature at other speed levels. For speed 3 or
      above, always turn it on.
      
      Change-Id: Ibb36f0a7ef12a34b4f8d0f9cb6193eab43b34360
      10e83b07
    • Ronald S. Bultje's avatar
      Best_rd breakout in rd partition search. · 9f427bfe
      Ronald S. Bultje authored
      About 15% faster for bus (speed 0) first 50 frames @ 1500kbps, which
      goes from 1min36 to 1min24. Results become slightly better (+0.2% on
      derf/yt, +0.4% on hd), probably because of a bugfix for skipmode in
      super_block_yrd(). Overall speed change (on derfraw300) is roughly
      -13%. This can probably be improved further by caching best_yrd
      between partition searches. Also, we might be able to get more
      speedups by always doing PARTITION_NONE before PARTITIONS_SPLIT, not
      just at the sb8x8 level.
      
      Change-Id: I83736949ebd5b4a3b400ee688d7661913fefc98b
      9f427bfe
    • Ronald S. Bultje's avatar
      Do a skip-block check for sub8x8 partitions also. · 83c7e13a
      Ronald S. Bultje authored
      +0.2% SSIM and glbPSNR on derfraw300.
      
      Change-Id: I9cba0bca55e606a22f557c7732b064f738efe84d
      83c7e13a
    • Yunqing Wang's avatar
      Speed up motion estimation using small partitions' result(experiment) · df90d58f
      Yunqing Wang authored
      Current partition checking starts from small sizes, and then goes up
      to large sizes. This experiment uses the small partitions' motion
      estimation result, which is already available, to speed up the
      large partition's motion estimation. We can decide to skip some
      patition checkings if they are unlikely choices. We could use the
      motion vector(MV) result as current partition's prediction MV, limit
      the search range and reference frame.
      
      Current result at speed 1:
      psnr loss: 1.19% for stdhd, 0.287% for derf.
      speed gain: 14% for sunflower(hd), 11% for akiyo.
      
      Further improvement will be done later.
      
      Change-Id: I5abfd070e9cace2e91e2a0247d1325df313887ab
      df90d58f
    • Paul Wilkins's avatar
      Move uv intra mode selection in rd loop. · 2ee338ce
      Paul Wilkins authored
      Use an estimate based on DC_PRED for intra uv cost
      within the rd loop then only do a full uv mode analysis
      if an intra mode is chosen.
      
      Significant speed gains in some cases. Currently only
      enabled for speed 2 pending speed/quality tests.
      
      Change-Id: Ie851a12400d5483bce47ec0e3ccb8516041e91c0
      2ee338ce
    • Paul Wilkins's avatar
      Limit transform sizes searched for uv intra. · 6c667f0f
      Paul Wilkins authored
      Apply limit if search_method == USE_LARGESTALL
      to the range of UV tx sizes searched.
      
      Change-Id: I6db29f0dd237285ffc50d75a37e8b68151ad821c
      6c667f0f
    • Jingning Han's avatar
      Skip redundant motion search in 4x4 level rd loop · a142d6fc
      Jingning Han authored
      This commit makes the encoder to perform motion search only once
      per reference frame type for each 4x4/4x8/8x4 block. For bus_cif
      at 2000 kbps, the runtime goes from 253812ms -> 217817ms
      (14% speed-up) for speed 0.
      
      Change-Id: I5f17599ccc8cfaf93ccb4f98fcb6008af6d79e92
      a142d6fc
    • Dmitry Kovalev's avatar
      Removing two unused arguments from vp9_inc_mv signature. · 41ae3d02
      Dmitry Kovalev authored
      Change-Id: Ieffea49eb7a5e5092f21f8694c546aff69b07c6d
      41ae3d02
  4. 16 Jul, 2013 - 9 commits
  5. 15 Jul, 2013 - 3 commits
    • Ronald S. Bultje's avatar
      Inline xform_quant() in encode_block_intra(). · 6fb41874
      Ronald S. Bultje authored
      Also inline some of the block calculations to assist the compiler to
      not do silly things like calculating the same offset (or converting
      between raster/transform block offset or block, mi and pixel unit)
      many, many, many times.
      
      Cycle times:
      4x4:     584 ->   505 cycles (16% faster)
      8x8:    1651 ->  1560 cycles (6% faster)
      16x16:  7897 ->  7704 cycles (2.5% faster)
      32x32: 16096 -> 15852 cycles (1.5% faster)
      
      Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the
      first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall.
      
      Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80
      6fb41874
    • Jingning Han's avatar
      Skip inter-coded block reconstruction in rd loop · 043e0f9d
      Jingning Han authored
      Skip the inverse transform and reconstruction of inter-mode coded
      blocks in the rate-distortion optimization loop, when skip_encode_sb
      feature is turned on. This provides about 1% speed-up at speed 0,
      and 1.5% speed-up at speed 1. No performance change in both settings.
      
      Change-Id: I2932718bf4d007163702b61b16b6ff100cf9d007
      043e0f9d
    • Jingning Han's avatar
      Skip duplicate block encoding in the rd loop · faff6ed0
      Jingning Han authored
      This speed feature allows the encoder to largely remove the spatial
      dependency between blocks inside a 64x64 superblock, thereby removing
      the need to repeatedly encode superblocks per partition type in the
      rate-distortion optimization loop.
      
      A major challenge lies in the intra modes tested in the rate-distortion
      optimization loop. The subsequent blocks do not have access to the
      reconstructed boundary pixels without the intermediate coding steps.
      This was resolved by using the original pixels for intra prediction
      in the rd loop, followed by an appropriately designed distortion
      modeling on the quantization parameters. Experiments also suggested
      that the performance impact is more discernible at lower bit-rate/psnr
      settings. Hence a quantizer dependent threshold is applied to deactivate
      skip of block coding.
      
      For bus_cif at 2000 kbps,
      speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB
               performance loss.
      
      speed 1: runtime 65312ms  -> 61536ms, (7...
      faff6ed0
  6. 14 Jul, 2013 - 1 commit
  7. 13 Jul, 2013 - 1 commit