Commits · b02c4d364f53e14ddae000552a1ddadbc7ceef8f · BC / public / external / libvpx

16 Jul, 2013 - 4 commits

Increase border size from 96 to 160. · b02c4d36

Ronald S. Bultje authored 11 years ago

This is required because upon downscaling, if a motion vector points
partially into the UMV (e.g. all minus 1 of 64+7 pixels, i.e. 70),
then we can point up to 140 pixels into the larger-resolution (2x)
reference buffer UMV, which means the UMV for reference buffers in
downscaling needs to be 140 rounded up to the nearest multiple of 32,
i.e. 160.

Longer-term, we should probably handle the UMV differently by detecting
edge coverage on-the-fly and using a temporary buffer for edge extensions
instead of adding 160 pixels on all sides of the image (which means a
CIF image uses 3x its own area size for borders).

Change-Id: I5184443e6731cd6721fc6a5d430a53e7d91b4f7e

b02c4d36

Inline vp9_quantize() in xform_quant(). · 1ff94fea

Ronald S. Bultje authored 11 years ago

Cycle times:
4x4:    151 to  131 cycles (15% faster)
8x8:    334 to  306 cycles (9% faster)
16x16: 1401 to 1368 cycles (2.5% faster)
32x32: 7403 to 7367 cycles (0.5% faster)

Total encode time of first 50 frames of bus @ 1500kbps (speed 0)
goes from 1min39.2 to 1min38.6, i.e. a 0.67% overall speedup.

Change-Id: I799a49460e5e3fcab01725564dd49c629bfe935f

1ff94fea

Merge "Inline xform_quant() in encode_block_intra()." · 7e684e20
Ronald S. Bultje authored 11 years ago

7e684e20
Merge "Neon: Update mbfilter if all vectors follow one branch." · ce1d69ae
Frank Galligan authored 11 years ago

ce1d69ae

15 Jul, 2013 - 5 commits

Inline xform_quant() in encode_block_intra(). · 6fb41874

Ronald S. Bultje authored 11 years ago

Also inline some of the block calculations to assist the compiler to
not do silly things like calculating the same offset (or converting
between raster/transform block offset or block, mi and pixel unit)
many, many, many times.

Cycle times:
4x4:     584 ->   505 cycles (16% faster)
8x8:    1651 ->  1560 cycles (6% faster)
16x16:  7897 ->  7704 cycles (2.5% faster)
32x32: 16096 -> 15852 cycles (1.5% faster)

Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the
first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall.

Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80

6fb41874

Code cleanup inside vp9_decodeframe.c. · 2c317298

Dmitry Kovalev authored 11 years ago

Removing unused DEC_DEBUG define and dec_debug variable. Changing function
signatures to eliminate code duplication, renaming function
mb_init_dequantizer to init_dequantizer. Also removing redundant curly
braces, and comments.


Change-Id: Ia56ee1b0be5f24abb0e878581845be8a4773c298

2c317298

Neon: Update mbfilter if all vectors follow one branch. · f4f60f60

Frank Galligan authored 11 years ago

Change the mbfilter Neon code from executing both branches if all
vectors follow only one branch.

The code is about 5% faster when executing only one branch and about
1% slower when executing both branches.

-PS5: Remove local stack space from mbfilter.

Change-Id: I6a23f9b318a9f4568a2718b4c9348db988fe2182

f4f60f60

Skip duplicate block encoding in the rd loop · faff6ed0

Jingning Han authored 11 years ago

This speed feature allows the encoder to largely remove the spatial
dependency between blocks inside a 64x64 superblock, thereby removing
the need to repeatedly encode superblocks per partition type in the
rate-distortion optimization loop.

A major challenge lies in the intra modes tested in the rate-distortion
optimization loop. The subsequent blocks do not have access to the
reconstructed boundary pixels without the intermediate coding steps.
This was resolved by using the original pixels for intra prediction
in the rd loop, followed by an appropriately designed distortion
modeling on the quantization parameters. Experiments also suggested
that the performance impact is more discernible at lower bit-rate/psnr
settings. Hence a quantizer dependent threshold is applied to deactivate
skip of block coding.

For bus_cif at 2000 kbps,
speed 0: runtime 269854ms -> 237774ms (12% speed-up) at 0.05dB
         performance loss.

speed 1: runtime 65312ms  -> 61536ms, (7...

faff6ed0

Merge "Fixing vp9_get_pred_context_comp_ref_p function." · 1f14bbb6
Dmitry Kovalev authored 11 years ago

1f14bbb6

13 Jul, 2013 - 3 commits
- Using vp9_copy and vp9_zero instead of custom code. · 42907098
  Dmitry Kovalev authored 11 years ago
```
Change-Id: Id9b6ceeddca3f9b34bfada5c499b1e7a2f42c30b
```
  42907098
- Fixing vp9_get_pred_context_comp_ref_p function. · 31a68bcd
  Dmitry Kovalev authored 11 years ago
```
Adding missed parenthesis around boolean expressions. Bitstream is changed.
Regenerating test vectors.

Change-Id: I4cc00b761e9473f92f180a9fc3a0c607f0aaae56
```
  31a68bcd
- Merge "Removing redundant call to set_mi_row_col." · 31403080
  Dmitry Kovalev authored 11 years ago
  
  31403080
12 Jul, 2013 - 24 commits
- Removing redundant call to set_mi_row_col. · 3c94fffd
  Dmitry Kovalev authored 11 years ago
```
This function is actually called from set_offsets which is called right
before vp9_read_mode_info.

Change-Id: Ibb9d5ad606194bc80eab264fad85b31c9dfd8f77
```
  3c94fffd
- Merge "Fix a build issue" · cdea4a7c
  Yaowu Xu authored 11 years ago
  
  cdea4a7c
- Merge "Adding struct tx_probs and struct tx_counts to cleanup the code." · aa518af8
  Dmitry Kovalev authored 11 years ago
  
  aa518af8
- Merge "Making functions read_{inter, intra}_segment_id more similar." · 444c8d4c
  Dmitry Kovalev authored 11 years ago
  
  444c8d4c
- Merge "vp9_postproc: remove useless self-assign" · c9a2a06c
  James Zern authored 11 years ago
  
  c9a2a06c
- Adding struct tx_probs and struct tx_counts to cleanup the code. · cc662dd7
  Dmitry Kovalev authored 11 years ago
```
Also removing unused declarations from vp9_entropymode.h file.

Change-Id: Ib9c5826db3584a32f6bb3297a76c522b99d83402
```
  cc662dd7
- Merge "Code cleanup in vp9_pred_common.c" · 60969da5
  Dmitry Kovalev authored 11 years ago
  
  60969da5
- Making functions read_{inter, intra}_segment_id more similar. · db0d603b
  Dmitry Kovalev authored 11 years ago
```
Change-Id: I51f9ac910834f2d7aba2be4f7ffbce597e61a144
```
  db0d603b
- vp9_postproc: remove useless self-assign · cca973a1
  James Zern authored 11 years ago
```
Change-Id: I0bc5d2d8c9fec8be18263b0dc2528886bb5b7b61
```
  cca973a1
- Code cleanup in vp9_pred_common.c · 3ab86adb
  Dmitry Kovalev authored 11 years ago
```
No bitstream changes. Using MB_MODE_INFO temp variables instead of
MODE_INFO variables. Removing redundant curly braces.

Change-Id: Ib9d1bedfbd8af97ecc722ccf697ea8177bbe287c
```
  3ab86adb
- Fix a build issue · fb754b18
  Yaowu Xu authored 11 years ago
```
Change-Id: I23a75c495ed7ea917d7f312bef0990e20a6b53d9
```
  fb754b18
- vp9: consistent 'log2' variable naming · 0195fb53
  James Zern authored 11 years ago
```
lg2 -> log2

Change-Id: I0602ddff49e42c9c40c29c084d04b7592b9f8edf
```
  0195fb53
- Merge changes I33e76c42,I24aeac1e,If4192b40 · 37c0a1a8
  James Zern authored 11 years ago
```
* changes:
  vp9_dx_iface: s/vp8/vp9/ where possible
  vp[89]_dx_iface: delete unused function
  vp[89]_dx_iface: factorize vp8_mmap_*()
```
  37c0a1a8
- vp9_dx_iface: s/vp8/vp9/ where possible · 563b4b20
  James Zern authored 11 years ago
```
drop 'vp9_' from most static functions unrelated to the codec interface
itself.

Change-Id: I33e76c425bb7373570a57a61662a56d65ab4bdf3
```
  563b4b20
- Merge "msvs-build: use msbuild for vs >= 2005" · 29080913
  James Zern authored 11 years ago
  
  29080913
- Some minor cleanups for efficiency · 94c481f9
  Deb Mukherjee authored 11 years ago
```
Implements some of the helper functions more efficiently with
lookups rathers than branches. Modeling function is consolidated
to reduce some computations.

Also merged the two enums BLOCK_SIZE_TYPES and BlockSize into
one because there is no need to keep them separate (even though
the semantics are a little different).

No bitstream or output change.

About 0.5% speedup

Change-Id: I7d71a66e8031ddb340744dc493f22976052b8f9f
```
  94c481f9
- Merge "Removing redundant code mostly from vp9_pred_common.{h, c}." · 72763187
  Dmitry Kovalev authored 11 years ago
  
  72763187
- Merge "Speed 2 feature adjustment." · b8ddc9f0
  Paul Wilkins authored 11 years ago
  
  b8ddc9f0
- vp[89]_dx_iface: delete unused function · e202a2be
  James Zern authored 11 years ago
```
static mmap_lkup

Change-Id: I24aeac1eca8453e28d58bc06925e58efc228a0a6
```
  e202a2be
- vp[89]_dx_iface: factorize vp8_mmap_*() · b088998e
  James Zern authored 11 years ago
```
s/vp8/vpx/ -> vpx_codec_internal.h / vpx_codec.c

Change-Id: If4192b40206276a761b01d44e334fe15bcb81128
```
  b088998e
- Merge "Cosmetic changes in 16x16 ADST/DCT unit test" · 119decde
  Jingning Han authored 11 years ago
  
  119decde
- Merge "Remove unnecessary tx_type branch in encode_block" · 84c3ac04
  Jingning Han authored 11 years ago
  
  84c3ac04
- Removing redundant code mostly from vp9_pred_common.{h, c}. · dd150e8e
  Dmitry Kovalev authored 11 years ago
```
Removing redundant function arguments and curly braces.

Change-Id: I46e02561f33fe02e84a3b19756f03b9504bd6a1b
```
  dd150e8e
- Remove unused function block_error(). · ee09dd99
  Ronald S. Bultje authored 11 years ago
```
Change-Id: I78a79fc51c2d7cc3c261f35b569155397f3dc0c4
```
  ee09dd99
11 Jul, 2013 - 4 commits
- Merge "vp9: fix peek_si for version==0" · 30bac896
  James Zern authored 11 years ago
  
  30bac896
- Merge "small update to peek_si/get_si documentation" · 5b11e38a
  James Zern authored 11 years ago
  
  5b11e38a
- Merge "Calling is_inter_mode() instead of custom code." · cae3fb72
  Dmitry Kovalev authored 11 years ago
  
  cae3fb72
- Merge "SSE2 4x4 invserse ADST/DCT transform" · dac5891a
  Jingning Han authored 11 years ago
  
  dac5891a