- 03 Apr, 2013 - 1 commit
-
-
Dmitry Kovalev authored
Renaming sb32_coded to prob_sb32_coded and sb64_coded to prob_sb64_coded. Change-Id: I6de5cad00a57c3e066d53467f8c38cb6073dce11
-
- 02 Apr, 2013 - 3 commits
-
-
Dmitry Kovalev authored
Lower case member names inside VP9D_CONFIG and VP9D_COMP structs. Change-Id: I75af9ad2d929a35c357207a3fd9ebedddabf79c3
-
Dmitry Kovalev authored
Adding decode_sb_8x8 and decode_sb_4x4 with common code for superblock decoding. Renaming decode_superblock32 to decode_sb32 and decode_superblock64 to decode_sb64. Change-Id: Id006d7e398b9bfa3acec4326e1e0c537ebfefdd3
-
Dmitry Kovalev authored
Adding multiple16 function, removing redundant code, better formatting. Change-Id: I50195b78ac8ab803e3d05c8fb05a7ca134fab386
-
- 28 Mar, 2013 - 1 commit
-
-
Deb Mukherjee authored
The patch adds the flexibility to use standard EOB-based coding on smaller block sizes and nzc-based coding on larger block sizes. The transform sizes that use nzc-based coding and those that use EOB-based coding are controlled by a function get_nzc_used(). By default, this function uses nzc-based coding for 16x16 and 32x32 transform blocks, which seems to bridge the performance gap substantially. All sets are now lower by 0.5% to 0.7%, as opposed to ~1.8% before. Change-Id: I06abed3df57b52d241ea1f51b0d571c71e38fd0b
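
A minimal C sketch of the size gate described above; get_nzc_used() is named in the message, but the signature and the TX_SIZE enum values here are assumptions, not the actual code.

#include <stdbool.h>

/* Transform sizes as assumed for this sketch. */
typedef enum { TX_4X4, TX_8X8, TX_16X16, TX_32X32 } TX_SIZE;

/* Use nzc-based coefficient coding only for the larger transforms
   (16x16 and 32x32); smaller transforms keep EOB-based coding. */
static bool get_nzc_used(TX_SIZE tx_size) {
  return tx_size >= TX_16X16;
}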
-
- 27 Mar, 2013 - 3 commits
-
-
Dmitry Kovalev authored
Extracting decode_tiles function from vp9_decode_frame. Change-Id: I02a465eeaf76138ef3559e1d46deb452c10e1219
-
Dmitry Kovalev authored
Almost all arguments for vp9_build_inter32x32_predictors_sb and vp9_build_inter64x64_predictors_sb can be deduced from the first macroblock argument. Change-Id: I5d477a607586d05698d5b3b9b9bc03891dd3fe83
-
Dmitry Kovalev authored
Extracting setup_frame_size and update_frame_context functions. Introducing vp9_read_prob function as shortcut for (vp9_prob)vp9_read_literal(r, 8). Change-Id: Ia5c68fd725b2d1b9c5eb20f69cacb62361b5a3dd
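
The shortcut is exactly as the message states; a sketch, with the reader type and the vp9_read_literal prototype written out as assumptions so the snippet stands alone.

typedef unsigned char vp9_prob;
typedef struct vp9_reader vp9_reader;            /* bool-decoder state  */
int vp9_read_literal(vp9_reader *r, int bits);   /* assumed prototype   */

/* Read an 8-bit literal from the bool decoder as a probability. */
static vp9_prob vp9_read_prob(vp9_reader *r) {
  return (vp9_prob)vp9_read_literal(r, 8);
}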
-
- 26 Mar, 2013 - 6 commits
-
-
Deb Mukherjee authored
Adds an experiment to use a weighted prediction of two INTER predictors, where the weight is one of (1/4, 3/4), (3/8, 5/8), (1/2, 1/2), (5/8, 3/8) or (3/4, 1/4), and is chosen implicitly based on the consistency of the predictors with the already reconstructed pixels to the top and left of the current macroblock or superblock. Currently the weighting is not applied to SPLITMV modes, which default to the usual (1/2, 1/2) weighting; however, the code is in place, controlled by a macro. The same weighting is used for the Y and UV components, with the weight derived from analyzing the Y component only. Results (over the compound inter-intra experiment): derf +0.18%, yt +0.34%, hd +0.49%, stdhd +0.23%. The experiment suggests a bigger benefit for explicitly signaled weights. Change-Id: I5438539ff4485c5752874cd1eb078ff14bf5235a
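
A sketch of how such an implicit weight could be picked: blend the two predictors' border pixels at each candidate weight and keep the weight that best matches the reconstructed pixels above and to the left. Function names, the SAD measure, and the border layout are assumptions, not the experiment's actual code.

#include <limits.h>
#include <stdlib.h>

/* Candidate weights for the first predictor, in eighths: 1/4 .. 3/4. */
static const int kWeights8[5] = { 2, 3, 4, 5, 6 };

/* Return the index of the weight whose blend of the two predictors'
   top/left border pixels best matches the reconstructed border. */
static int choose_implicit_weight(const unsigned char *recon_border,
                                  const unsigned char *pred0_border,
                                  const unsigned char *pred1_border,
                                  int n) {
  int best = 2;                 /* default to the (1/2, 1/2) blend */
  int best_err = INT_MAX;
  for (int w = 0; w < 5; ++w) {
    int err = 0;
    for (int i = 0; i < n; ++i) {
      const int blend = (kWeights8[w] * pred0_border[i] +
                         (8 - kWeights8[w]) * pred1_border[i] + 4) >> 3;
      err += abs(blend - recon_border[i]);
    }
    if (err < best_err) {
      best_err = err;
      best = w;
    }
  }
  return best;                  /* index into kWeights8 */
}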
-
Ronald S. Bultje authored
Pearson correlation for above or left is significantly higher than for previous-in-scan-order (absolute values depend on position in scan, but in general, we gain about 0.1-0.2 by using either above or left; using both basically just makes this even better). For eob branch skipping, we continue to use the previous token in scan order. This helps about 0.9% on derf after re-training on a limited data set. Full re-training and results on larger-resolution clips are pending. Note that this commit breaks trellis, so we can probably get further gains out of it by fixing trellis at some later point. Change-Id: Iead68e296fc3a105cca746b5e3da9555d6010cfe
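
A sketch of the context derivation this describes: combine the above and left neighbours of the current coefficient instead of using the previous coefficient in scan order. The function name and the exact combination rule (rounded average) are assumptions.

/* Form the coefficient context from the above and left neighbour values
   within the transform block, rounding the average up. */
static int get_coef_context(int above_val, int left_val) {
  return (above_val + left_val + 1) >> 1;
}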
-
Dmitry Kovalev authored
Moving code from vp9_decode_frame function into setup_loopfilter and setup_segmentation functions. A little bit of cleanup. Change-Id: I2cce1813e4d7aeec701ccf752bf57e3bdd41b51c
-
John Koleszar authored
Adds a per-frame, strength-adjustable, in-loop deringing filter. Uses the existing vp9_post_proc_down_and_across 5-tap thresholded blur code, with a brute-force search for the threshold. Results are almost strictly positive on the YT HD set, either having no effect or helping PSNR in the range of 1-3% (overall average 0.8%). Results are more mixed for the CIF set (-0.5 min, 1.4 max, 0.1 avg). This has an almost strictly negative impact on SSIM, so examining a different filter or a more balanced search heuristic is in order. Other test set results pending. Change-Id: I5ca6ee8fe292dfa3f2eab7f65332423fa1710b58
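
A sketch of the brute-force threshold search described; the frame type, helper names, and error metric are assumptions (only vp9_post_proc_down_and_across is named in the message, here abstracted behind dering_filter).

typedef struct Frame Frame;   /* stand-in for the codec's frame buffer type */

/* Assumed helpers: apply the 5-tap thresholded blur at a given strength,
   and measure a frame's error against the source. */
void dering_filter(const Frame *in, Frame *out, int threshold);
long long frame_sse(const Frame *a, const Frame *b);

/* Keep the threshold that minimizes error against the source frame;
   0 means the deringing filter stays off for this frame. */
static int pick_dering_threshold(const Frame *source, const Frame *coded,
                                 Frame *scratch, int max_threshold) {
  int best_t = 0;
  long long best_err = frame_sse(source, coded);
  for (int t = 1; t <= max_threshold; ++t) {
    dering_filter(coded, scratch, t);
    const long long err = frame_sse(source, scratch);
    if (err < best_err) {
      best_err = err;
      best_t = t;
    }
  }
  return best_t;
}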
-
Deb Mukherjee authored
Fixes an issue with model based update that got into the original patch that was merged. Change-Id: Ie42d3d0aff2e48cd187d96664dbd3e9d6d3ac22f
-
Deb Mukherjee authored
Replaces the default tables for single coefficient magnitudes with those obtained from an appropriate distribution. The EOB node is left unchanged. The model is represented as a 256-size codebook where the index corresponds to the probability of the Zero or the One node. Two variations are implemented, corresponding to whether the Zero node or the One node is used as the peg. The main advantage is that the default prob tables will become considerably smaller and more manageable. Besides, there is substantially less risk of over-fitting to a training set. Various distributions were tried, and the one that gives the best results is the family of Generalized Gaussian distributions with shape parameter 0.75. The results are within about 0.2% of fully trained tables for the Zero-peg variant, and within 0.1% for the One-peg variant. The forward updates are optionally (controlled by a macro) model-based, i.e. restricted to only convey probabilities from the codebook. Backward updates can also be optionally (controlled by another macro) model-based, but this is turned off by default. Currently model-based forward updates work about the same as unconstrained updates, but there is a drop in performance with backward updates being model-based. The model-based approach also allows the probabilities for the key frames to be adjusted from the defaults based on the base_qindex of the frame. Currently the adjustment function is a placeholder that adjusts the prob of EOB and the Zero node from the nominal one at the higher-quality (lower qindex) or lower-quality (higher qindex) ends of the range. The rest of the probabilities are then derived based on the model from the adjusted prob of zero. Change-Id: Iae050f3cbcc6d8b3f204e8dc395ae47b3b2192c9
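
A sketch of the codebook idea, Zero-peg variant: the Zero-node probability indexes a 256-entry table from which the remaining branch probabilities are read out. The table name, its contents, and the expansion helper are assumptions; the real table is fitted from the Generalized Gaussian family with shape parameter 0.75 as described above.

typedef unsigned char vp9_prob;

/* Assumed 256-entry model codebook: indexed by the Zero-node probability,
   each entry is the fitted probability of the One node. */
extern const vp9_prob vp9_model_one_given_zero[256];

/* Expand a single model index into branch probabilities: the Zero-node
   probability is the index itself, the One-node probability comes from
   the codebook; further branches would be derived the same way. */
static void expand_model_probs(vp9_prob zero_prob, vp9_prob *one_prob) {
  *one_prob = vp9_model_one_given_zero[zero_prob];
}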
-
- 25 Mar, 2013 - 1 commit
-
-
Dmitry Kovalev authored
Lower case variable names, code simplification by using already defined clamp and read_le16 functions. Change-Id: I8fd544365bd8d1daed86d7b2ae0843e4ef80df08
-
- 20 Mar, 2013 - 1 commit
-
-
Dmitry Kovalev authored
Renaming Width to width, Height to height and Version to version in several structs and function signatures. Change-Id: I084c3f7e747cb2ce3345aff27a3dff9b13a87543
-
- 18 Mar, 2013 - 1 commit
-
-
John Koleszar authored
If the intended display size is different than the size the frame is coded at, then send that size explicitly in the bitstream. Adds a new bit to the frame header to indicate whether the extra size fields are present. Change-Id: I525c66f22d207efaf1e5f903c6a2a91b80245854
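
A sketch of the header logic described: one new bit says whether explicit display dimensions follow. The writer helper prototypes and the 16-bit field width are assumptions.

typedef struct vp9_writer vp9_writer;           /* bitstream writer      */
void vp9_write_bit(vp9_writer *w, int bit);     /* assumed prototypes    */
void vp9_write_literal(vp9_writer *w, int value, int bits);

/* Signal the display size only when it differs from the coded size. */
static void write_display_size(vp9_writer *w,
                               int coded_w, int coded_h,
                               int display_w, int display_h) {
  const int different = display_w != coded_w || display_h != coded_h;
  vp9_write_bit(w, different);
  if (different) {
    vp9_write_literal(w, display_w, 16);
    vp9_write_literal(w, display_h, 16);
  }
}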
-
- 16 Mar, 2013 - 1 commit
-
-
Deb Mukherjee authored
This fix resolves some of the mismatch issues being seen recently. While this is the right thing to do when tiling is used for this experiment, it is not the underlying cause of the mismatches. Something else is causing writes outside of the allowable frame area in the encoder, leading to this mismatch. Change-Id: If52c6f67555aa18ab8762865384e323b47237277
-
- 14 Mar, 2013 - 2 commits
-
-
Dmitry Kovalev authored
Change-Id: I25424904fb8541fc19d00d9fbc592379374b98c0
-
John Koleszar authored
Updates the YV12_BUFFER_CONFIG structure to be crop-aware. The existing width/height parameters are left unchanged, storing the width and height aligned to a 16-byte boundary. The cropped dimensions are added as new fields. This fixes a nasty visual pulse when switching between scaled and unscaled frame dimensions due to a mismatch between the scaling ratio and the 16-byte aligned sizes. Change-Id: Id4a3f6aea6b9b9ae38bdfa1b87b7eb2cfcdd57b6
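
A sketch of the crop-aware layout: allocation uses dimensions rounded up to a multiple of 16, while the new crop fields carry the true frame size. The field names beyond width/height are assumptions for this sketch.

typedef struct {
  int y_width;        /* luma width aligned to 16, used for allocation */
  int y_height;       /* luma height aligned to 16                     */
  int y_crop_width;   /* cropped (true) luma width                     */
  int y_crop_height;  /* cropped (true) luma height                    */
  unsigned char *y_buffer;
} CROP_AWARE_BUFFER;

static int align16(int v) {
  return (v + 15) & ~15;  /* round up to the next multiple of 16 */
}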
-
- 13 Mar, 2013 - 1 commit
-
-
Yaowu Xu authored
The commit changes the names of files and functions to remove obsolete references to LLM and x8. Change-Id: I973b20fc1a55149ed68b5408b3874768e6f88516
-
- 12 Mar, 2013 - 1 commit
-
-
Dmitry Kovalev authored
Removing redundant code, introducing new functions for better decomposition, adding 'clamp' function to vp9_common.h. Change-Id: Ic3b8ca13bbc38f60f0c9c43910b5802005e31aaf
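
A sketch of the clamp helper the message adds to vp9_common.h; the function name is from the message, the exact signature is assumed here.

/* Clamp value to the inclusive range [low, high],
   e.g. clamp(mv_component, min_edge, max_edge). */
static inline int clamp(int value, int low, int high) {
  return value < low ? low : (value > high ? high : value);
}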
-
- 09 Mar, 2013 - 1 commit
-
-
Deb Mukherjee authored
Adds probability updates for extra bits for the nzcs, code for getting nzc stats, plus some minor cleanups and fixes. Change-Id: If2814e7f04fb52f5025ad9f400f3e6c50a00b543
-
- 07 Mar, 2013 - 2 commits
-
-
Ronald S. Bultje authored
This also changes the RD search to take account of the correct block index when searching (this is required for ADST positioning to work correctly in combination with tx_select). Change-Id: Ie50d05b3a024a64ecd0b376887aa38ac5f7b6af6
-
Deb Mukherjee authored
This patch revamps the entropy coding of coefficients to code first a non-zero count per coded block and correspondingly remove the EOB token from the token set. STATUS: Main encode/decode code achieving encode/decode sync - done. Forward and backward probability updates to the nzcs - done. RD costing updates for nzcs - done. Note: The dynamic programming approach used in trellis quantization is not exactly compatible with nzcs. A suboptimal approach has been used instead, where branch costs are updated to account for changes in the nzcs. TODO: Training the default probs/counts for nzcs. Change-Id: I951bc1e22f47885077a7453a09b0493daa77883d
-
- 05 Mar, 2013 - 1 commit
-
-
Ronald S. Bultje authored
Split macroblock and superblock tokenization and detokenization functions and coefficient-related data structs so that the bitstream layout and related code of superblock coefficients looks less like a hack to fit macroblocks in superblocks. In addition, unify chroma transform size selection with luma transform size (i.e. always use the same size, as long as it fits the predictor); in practice, this means 32x32 and 64x64 superblocks using the 16x16 luma transform will now use the 16x16 (instead of the 8x8) chroma transform, and 64x64 superblocks using the 32x32 luma transform will now use the 32x32 (instead of the 16x16) chroma transform. Lastly, add a trellis optimize function for 32x32 transform blocks. HD gains about 0.3%, STDHD about 0.15% and derf about 0.1%. There are a few negative points here and there that I might want to analyze a little closer. Change-Id: Ibad7c3ddfe1acfc52771dfc27c03e9783e054430
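
A sketch of the unified chroma transform-size rule: reuse the luma transform size, stepped down only until it fits the half-sized (4:2:0) chroma block. Enum values and the helper name are assumptions.

typedef enum { TX_4X4, TX_8X8, TX_16X16, TX_32X32 } TX_SIZE;

/* chroma_block_size is the chroma block width in pixels (half the luma
   block width for 4:2:0); cap the luma transform size to fit it. */
static TX_SIZE get_uv_tx_size(TX_SIZE luma_tx, int chroma_block_size) {
  int tx = luma_tx;
  while (tx > TX_4X4 && (4 << tx) > chroma_block_size)
    --tx;
  return (TX_SIZE)tx;
}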
-
- 04 Mar, 2013 - 1 commit
-
-
Jingning Han authored
Fixed a couple of variable/function definitions, as well as header handling, to support 16K sequence coding at high bit-rates. The width and height are each specified by two bytes in the header. An extra byte explicitly indicates the scaling factors in both directions, each ranging from 0 to 15. Tested coding up to 16400x16400 dimensions. Change-Id: Ibc2225c6036620270f2c0cf5172d1760aaec10ec
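
A sketch of the header layout described: two bytes each for width and height, plus one byte carrying the two 4-bit scaling factors. Byte order and nibble placement are assumptions.

/* Parse: little-endian 16-bit width and height, then one byte whose
   high/low nibbles hold the horizontal/vertical scaling factors (0..15). */
static void parse_size_and_scaling(const unsigned char *hdr,
                                   int *width, int *height,
                                   int *scale_x, int *scale_y) {
  *width   = hdr[0] | (hdr[1] << 8);
  *height  = hdr[2] | (hdr[3] << 8);
  *scale_x = hdr[4] >> 4;
  *scale_y = hdr[4] & 0x0f;
}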
-
- 28 Feb, 2013 - 1 commit
-
-
Yunqing Wang authored
Provided a wrapper and removed duplicate code. Change-Id: Iaef842226ec348422e459202793b001d0983ea30
-
- 27 Feb, 2013 - 4 commits
-
-
Ronald S. Bultje authored
Consistent with VP8. Change-Id: I8c316ee49f072e15abbb033a80e9c36617891f07
-
John Koleszar authored
This patch extends the previous support for using references of a different resolution in ZEROMV mode to all inter prediction modes. Subpixel based best-mv scoring is disabled when the reference frame differs in resolution from the current frame. Change-Id: Id4dc3e5e6692de98d9857fd56bfad3ac57e944ac
-
John Koleszar authored
This patch allows coding frames using references of different resolution, in ZEROMV mode. For compound prediction, either reference may be scaled. To test, I use the resize_test and enable WRITE_RECON_BUFFER in vp9_onyxd_if.c. It's also useful to apply this patch to test/i420_video_source.h:

--- a/test/i420_video_source.h
+++ b/test/i420_video_source.h
@@ -93,6 +93,7 @@ class I420VideoSource : public VideoSource {
   virtual void FillFrame() {
     // Read a frame from input_file.
+    if (frame_ != 3)
     if (fread(img_->img_data, raw_sz_, 1, input_file_) == 0) {
       limit_ = frame_;
     }

This forces the frame that the resolution changes on to be coded with no motion, only scaling, and improves the quality of the result. Change-Id: I1ee75d19a437ff801192f767fd02a36bcbd1d496
-
Yunqing Wang authored
Wrote an SSE2 version of the vp9_dc_only_idct_add_c function. In order to improve performance, clipped the absolute diff values to [0, 255]; this allowed us to keep the additions/subtractions in 8 bits. Tests showed an over 2% decoder performance increase. Change-Id: Ie1a236d23d207e4ffcd1fc9f3d77462a9c7fe09d
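
For reference, a C sketch of the scalar behaviour being vectorized: only the DC coefficient is present, so one rounded offset is added to every predictor pixel and clamped to [0, 255]. The rounding shift is an assumption for this sketch. Because the offset saturates to 8 bits, the SSE2 version can stay in 8-bit saturated adds/subtracts, which is where the speedup described above comes from.

#include <stdint.h>

static uint8_t clip_pixel(int v) {
  return (uint8_t)(v < 0 ? 0 : (v > 255 ? 255 : v));
}

/* Add the dequantized, rounded DC value to each pixel of the 4x4
   predictor block and store the clamped result. */
static void dc_only_idct_add(int input_dc, const uint8_t *pred, int pitch,
                             uint8_t *dst, int stride) {
  const int a1 = (input_dc + 16) >> 5;  /* rounding shift assumed */
  int r, c;
  for (r = 0; r < 4; ++r) {
    for (c = 0; c < 4; ++c)
      dst[c] = clip_pixel(pred[c] + a1);
    pred += pitch;
    dst += stride;
  }
}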
-
- 26 Feb, 2013 - 1 commit
-
-
John Koleszar authored
Ensure that all inter prediction goes through a common code path that takes scaling into account. Removes a bunch of duplicate 1st/2nd predictor code. Also introduces a 16x8 mode for 8x8 MVs, similar to the 8x4 trick we were doing before. This has an unexpected effect with EIGHTTAP_SMOOTH, so it's disabled in that case for now. Change-Id: Ia053e823a8bc616a988a0af30452e1e75a739cba
-
- 25 Feb, 2013 - 1 commit
-
-
Dmitry Kovalev authored
Lower case names of variables. Removing redundant spaces, parentheses, casts, and variables. Change-Id: I55b80c55b7d5adca44c1e8adb40a124c0680f229
-
- 23 Feb, 2013 - 1 commit
-
-
Ronald S. Bultje authored
Change-Id: I5416455f8f129ca0f450d00e48358d2012605072
-
- 21 Feb, 2013 - 1 commit
-
-
Ronald S. Bultje authored
The information is a duplicate of "eob" in BLOCKD. Change-Id: Ia6416273bd004611da801e4bfa6e2d328d6f02a3
-
- 20 Feb, 2013 - 1 commit
-
-
Yaowu Xu authored
Change-Id: I7b7b8d4fda3a23699e0c920d727f8c15d37d43aa
-
- 15 Feb, 2013 - 3 commits
-
-
Ronald S. Bultje authored
Change-Id: I7a5314daca993d46b8666ba1ec2ff3766c1e5042
-
Ronald S. Bultje authored
Since there is no Y2, these values are always zero. This changes the bitstream results slightly, hence a separate commit. Change-Id: I2f838f184341868f35113ec77ca89da53c4644e0
-
Ronald S. Bultje authored
Change-Id: I4f46d142c2a8d1e8a880cfac63702dcbfb999b78
-