Commits · 66533b1a8d694a5d9b482352ebcaf0b189da186e · BC / public / external / libvpx

09 Jun, 2011 - 3 commits

remove one set of 16x16 variance funcations · 361717d2

Yaowu Xu authored 13 years ago

call to this set of functions are replaced by var16x16.

Change-Id: I5ff1effc6c1358ea06cda1517b88ec28ef551b0d

361717d2

bug fix mode_info_context not initialized for error-resilient · 45feea4c

James Berry authored 13 years ago

uninitialized xd->mode_info_context would crash
vpxenc for --error-resilient=1.

Change-Id: I31849e40281e3d65ab63257cfec5e93398997f0b

45feea4c

Update keyframe activity in non-RD mode · af49c112

John Koleszar authored 13 years ago

Activity update is no longer dependent on being in RD mode, so update
it unconditionally.

Change-Id: Ib617a6fc210dfc045455e3e4467d7ee5e3d1fa0e

af49c112

08 Jun, 2011 - 8 commits

use GCC inline magic · 79327be6

Johann authored 13 years ago

Better fix for #326. ICC happens to support the inline magic

Change-Id: Ic367eea608c88d89475cb7b05d73500d2a1bc42b

79327be6

vp8_pick_inter_mode: remove best_bmodes · 001bd51c

John Koleszar authored 13 years ago

Since BPRED will be tested at most once, and SPLITMV is not enabled,
there's nothing to clobber the subblock modes, so there's no need to
save and restore them.

Change-Id: I7c3615b69190c10bd068a44df5488d6e8b85a364

001bd51c

vp8_pick_intra_mode: correct returned rate · 91907e0b

John Koleszar authored 13 years ago

The returned rate was always the 4x4 rate, instead of the rate
matching the selected mode.

Change-Id: I51da31f80884f5e37f3bcc77d1047d31e612ded4

91907e0b

Removed unused function parameters · 69d8d386
Scott LaVarnway authored 13 years ago
```
Change-Id: Ib641c624faec28ad9eb99e2b5de51ae74bbcb2a2
```
69d8d386

Adjust errorperbit according to RDMULT in activity masking · 1fba1e38

Yaowu Xu authored 13 years ago

In activity masking, RDO constant RDMULT is adjusted on a per MB basis
adaptive to activity with the MB. errorperbit, which is defined as
RDMULT/RDDIV, is a constant used in motion estimation. Previously, in
activity masking, errorperbit is not changed even when RDMULT is changed.
This commit changed to adjust errorperbit according to the change in
RDMULT.

Test in cif set showed a very small but consistent gain by all quality
metrics (average, overall psnr and ssim) when activity masking is on.

Change-Id: I07ded3e852919ab76757691939fe435328273823

1fba1e38

Move RD intra block mode selection to rdopt.c · 96a42aaa

John Koleszar authored 13 years ago

This change is analogous to I0b67dae1f8a74902378da7bdf565e39ab832dda7,
which made the move for the non-RD path.

Change-Id: If63fc1b0cd1eb7f932e710f83ff24d91454f8ed1

96a42aaa

Move intra block mode selection to pickinter.c · e90d17d2

John Koleszar authored 13 years ago

This commit moves the intra block mode selection from encodeframe.c
to pickinter.c (in the non-RD case). This allowed pick_intra_mbuv_mode
and pick_intra4x4mby_modes to be made static, and is a step towards
refactoring intra mode selection in the main pickinter loop. Gave a
small perf increase (~0.5%).

Change-Id: I0b67dae1f8a74902378da7bdf565e39ab832dda7

e90d17d2

Further activity masking changes: · 4e81a68a

Paul Wilkins authored 13 years ago

Some further re-structuring of activity masking code.
Still has various experimental switches.
Supports a metric based on intra encode.
Experimental comparison against a fixed activity target  rather
than a frame average, for altering rd and zbin.

Overall the SSIM performance is similar  to TT's original
code but there is a much smaller PSNR hit of circa
0.5% instead of 3.2%

Change-Id: I0fd53b2dfb60620b3f74d7415e0b81c1ac58c39a

4e81a68a

07 Jun, 2011 - 3 commits

adjust sad per bit constants · 221e00ea

Yaowu Xu authored 13 years ago

While investigating the effect of DC values on SAD and SSE in motion
estimation, a side finding indicates the two table of constants need
be adjusted. The adjustment was done by multiplying old constants by
90% with rounding. Also absorb the 1/2 scaling constant into the two
tables. Refer to change Ifa285c3e for background of the 1/2 factor.

Cif set test showed a very small gain on all metric.

Change-Id: I04333527a823371175dd46cb04a817e5b9a8b752

221e00ea

Wrapped asserts in critical code with CONFIG_DEBUG · 0e3bcc6f
Scott LaVarnway authored 13 years ago
```
Change-Id: I5b0aaca06f2e0f40588cb24fb0642b6865da8970
```
0e3bcc6f
Removed unused function vp8_treed_read_num · 1374a4db
Scott LaVarnway authored 13 years ago
```
Change-Id: Id66e70540ee7345876f099139887c1843093907f
```
1374a4db

06 Jun, 2011 - 2 commits

remove redundant functions · d4700731

Yaowu Xu authored 13 years ago

The encoder defined about 4 set of similar functions to calculate sum,
variance or sse or a combination of them. This commit removed one set
of these functions, get8x8var and get16x16var, where calls to the later
function are replaced with var16x16 by using the fact on a 16x16 MB:
    variance == sse - sum*sum/256

Change-Id: I803eabd1fb3ab177780a40338cbd596dffaed267

d4700731

Remove hex search's variance calculation while in real-time mode · 03973017

Yunqing Wang authored 13 years ago

In real-time mode motion search, there is no need to calculate
variance. This change improved encoding speed by 1% ~ 2%(speed=-5).

Change-Id: I65b874901eb599ac38fe8cf9cad898c14138d431

03973017

03 Jun, 2011 - 3 commits

Reduce overshoot in 1 pass rate control · 212f6183

John Koleszar authored 13 years ago

This patch attempts to reduce the peak bitrate hit by the encoder
when using small buffer windows.

Tested on the CIF set over 200-500kbps using these settings:

  --buf-sz=500 --buf-initial-sz=250 --buf-optimal-sz=250 \
  --undershoot-pct=100

Two pass encodes were tested at best quality. One pass encodes were
tested only at realtime speed 4:

  --rt --cpu-used=-4

The peak datarate (over the specified 500ms window) was measured
for each encode, and averaged together to get metric for
"average peak," computed as SUM(peak)/SUM(target). This patch
reduces the average peak datarate as follows:

  One pass:
    baseline:   1.29715
    this patch: 1.23664

  Two pass:
    baseline:   1.32702
    this patch: 1.37824

This change had a positive effect on our quality metrics as well:

  One pass CBR:
                    Min  / Mean / Max (pct)
    Average PSNR    -0.42 / 2.86 / 27.32
    Overall PSNR    -0.90 / 2.00 / 17.27
    SSIM            -0.05 / 3.95 / 37.46

  Two pass CBR:
                    Min  / Mean / Max (pct)
    Average PSNR    -4.47 / 4.35 / 35.99
    Overall PSNR    -3.40 / 4.18 / 36.46
    SSIM            -4.56 / 6.98 / 53.67

  One pass VBR:
                    Min  / Mean / Max (pct)
    Average PSNR    -5.21 /  0.01 / 3.30
    Overall PSNR    -8.10 / -0.38 / 1.21
    SSIM            -7.38 / -0.11 / 3.17
    (note: most values here were close to the mean, there were a few
     outliers on files that were very sensitive to golden frame size)

  Two pass VBR:
                    Min  / Mean / Max (pct)
    Average PSNR    0.00 / 0.00 / 0.00
    Overall PSNR    0.00 / 0.00 / 0.00
    SSIM            0.00 / 0.00 / 0.00

Neither one pass or two pass CBR mode adheres particularly strictly
to the short term buffer constraints, and two pass is less
consistent, even in the baseline commit. This should be addressed
in a later commit. This likely will hurt the quality numbers, as it
will have to reduce the burstiness of golden frames.

Aside: My work on this commit makes it clear that we need to make
rate control modes "pluggable", where you can easily write a new
one or work on one in isolation.

Change-Id: I1ea9a48f2beedd59891f1288aabf7064956b4716

212f6183

Removed unnecessary bmi motion vector stores. · f1d6cc79

Scott LaVarnway authored 13 years ago

left_block_mv and above_block_mv will return the MB
motion vector for non SPLITMV macro blocks.

Change-Id: I58dbd7833b4fdcd44b6b72e98ec732c93c2ce4f4

f1d6cc79

Adjust bounds checking for hex search in real-time mode · e5c236c2

Yunqing Wang authored 13 years ago

Currently, hex search couldn't guarantee the motion vector(MV)
found is within the limit of maximum MV. Therefore, very large
motion vectors resulted from big motion in the video could cause
encoding artifacts. This change adjusted hex search bounds
checking to make sure the resulted motion vector won't go out
of the range. James Berry, thank you for finding the bug.

Change-Id: If2c55edd9019e72444ad9b4b8688969eef610c55

e5c236c2

02 Jun, 2011 - 2 commits

Removed B_MODE_INFO · 773768ae

Scott LaVarnway authored 13 years ago

Declared the bmi in BLOCKD as a union instead of B_MODE_INFO.
Then removed B_MODE_INFO completely.

Change-Id: Ieb7469899e265892c66f7aeac87b7f2bf38e7a67

773768ae

Don't allow very short GF groups even when the GF is predicted from an ARF. · 9f002bee

Ronald S. Bultje authored 13 years ago

This is basically a slightly modified version of the previous patch,
and it has a moderately positive effect (SSIM/PSNR both +0.08% avg
on derf-set). Most clips show no change, except waterfall/coastguard,
each ~ +0.8% SSIM/PSNR. You can see similar effects in other clips
by shortening their length to terminate at a very short last group
of frames.

Change-Id: I7a70de99ca1f9fe6a8b6ca7a6e30e8a4b64383e4

9f002bee

01 Jun, 2011 - 5 commits

further clean up of errorperbit and sadperbit · 5b2fb329

Yaowu Xu authored 13 years ago

this commit makes the usage errorperbit and sadperbit consistent for
encoding modes and passes. Removed all different magic weight factors
associated with errorperbit. Now 1/2 is used for both sadperbit16 and
sadperbit4, the /2 operation is merged into initializations of the 2
variables.

Tests on cif set show .23%, 0.18% and 0.19% gain by avg psnr, overall
psnr and ssim respectively.

Change-Id: Ifa285c3e065ce0a5a77addfc9f95aabf54ee270d

5b2fb329

Bugfix in vp8dx_set_reference · 69ba6bd1

Henrik Lundin authored 13 years ago

The fb_idx_ref_cnt book-keeping was in error. Added an assert to
prevent future errors in the reference count vector. Also fixed a
pointer syntax error.

Change-Id: I563081090c78702d82199e407df4ecc93da6f349

69ba6bd1

Fix code under #if CONFIG_INTERNAL_STATS. · 34ba1876
Ronald S. Bultje authored 13 years ago
```
Change-Id: Iccbd78d91c3071b16fb3b2911523a22092652ecd
```
34ba1876

remove some magic weights associated with sad_per_bit · 50916c6a

Yaowu Xu authored 14 years ago

sad_per_bit has been used for a number of motion vector search routines
with different magic weights: 1, 1/2 and 1/4. This commit remove these
magic numbers and use 1/2 for all motion search routines, also reformat
a number of source code lines to within 80 column limit.

Test on cif set shows overall effect is neutral on all metrics. <=0.01%

Change-Id: I8a382821fa4cffc9c0acf8e8431435a03df74885

50916c6a

neon fast quantize block pair · 61f0c090

Tero Rintaluoma authored 13 years ago

vp8_fast_quantize_b_pair_neon function added to quantize
two adjacent blocks at the same time to improve performance.
 - Additional 3-6% speedup compared to neon optimized fast
   quantizer (Tanya VGA@30fps, 1Mbps stream, cpu-used=-5..-16)

Change-Id: I3fcbf141e5d05e9118c38ca37310458afbabaa4e

61f0c090

31 May, 2011 - 2 commits

vp8_pick_inter_mode code cleanup · 1a5a1903

Scott LaVarnway authored 13 years ago

Small code cleanups before attempting to reduce the size
of bmi found in BLOCKD.

Change-Id: Ie9c14adb53afd847716a75bcce067d0e6c04f225

1a5a1903

Initialize first_time_stamp_ever · 0a72f568

John Koleszar authored 13 years ago

Misplaced #endif caused first_time_stamp_ever to only be initialized if
CONFIG_INTERNAL_STATS was set.

Change-Id: I2296a4ab00f7dfb767583edcc5d59b94f48c0621

0a72f568

30 May, 2011 - 1 commit

adds preload for armv6 encoder asm · 5305e79e

Tero Rintaluoma authored 13 years ago

Added preload instructions to armv6 encoder optimizations.
About 5% average speed-up on Tegra2 for VGA@30fps sequence.

Change-Id: I41d74737720fb71ce7a316f07555357822f3347e

5305e79e

27 May, 2011 - 3 commits

bug fix check frame buffer index before copy · 8795b525

James Berry authored 13 years ago

in onyx_if.c update_reference_frames() make
sure that frame buffer indexes are not equal
before preforming a buffer copy.  If two frames
share the same buffer the flags will already be
set correctly.

Change-Id: Ida9b5516d08e3435c90f131d2dc19d842cfb536e

8795b525

Use hex search for realtime mode speed>4 · 4d052bdd

Yunqing Wang authored 13 years ago

Test showed using hex search in realtime mode largely speed up
encoding process, and still achieves similar quality like the
diamond search we have. Therefore, removed the diamond search
option.

Change-Id: I975767d0ec0539f9f6ed7fdfc09506e39761b66c

4d052bdd

Remove unused code · 2dc24635

Yunqing Wang authored 13 years ago

Hex search is not called in rdopt.c

Change-Id: I67347f03e13684147a7c77fb9e9147e440bb5e8e

2dc24635

26 May, 2011 - 1 commit

Broken EC after MODE_INFO size reduction · 4f586f7b

Scott LaVarnway authored 13 years ago

This patch fixes the compiler errors and the seg fault
when running decode_with_partial_drops.

Change-Id: I7c75369e2fef81d53b790d5dabc327218216838b

4f586f7b

25 May, 2011 - 4 commits

fix the mix use of errorperbit and sadperbit · d8c525b8

Yaowu Xu authored 13 years ago

error_per_bit and sad_per_bit were designed as estimates of a bit worth
of sum squared error and sum absolute difference respectively. Under
this assumption, error_per_bit should be used in combination with 2nd
order errors (variance or sum squared error) while sad_per_bit should
be used in combination with 1st order SADs in motion estimation. There
were a few places where sad_per_bit has been misused with variances,
this commit changes to use error_per_bit for those places, also changes
parameter names to properly indicate which constant is being used.

On cif set, the change has a universal gain by all metrics: 0.13% by
average/overall psnr and 0.1% by ssim.

Change-Id: I4850fdcc3fd6886b30f784bd843f13dd401215fb

d8c525b8

remove code not in use · 22c05c05
Yaowu Xu authored 13 years ago
```
Change-Id: I6e5e86235d341cce3b02abda26dbeb71940ed955
```
22c05c05
Return sse value in vp8_variance SSE2 functions · b6679879
Yunqing Wang authored 13 years ago
```
Minor modification.

Change-Id: I09511d38fd1451d5c4106a48acdb3f766ce59cb7
```
b6679879

Use var8x8 instead of get8x8var in VP8_UVSSE · a615c404

Attila Nagy authored 13 years ago

  'sum' returned by get8x8var is not used and var8x8 has optimizations
  for more platforms.

Change-Id: I4a907fb1a05f285669fb0b95dc71d42182c980f6

a615c404

24 May, 2011 - 3 commits

Fix a bug happening while encoding at profile=3 · d75eb736

Yunqing Wang authored 13 years ago

While profile=3, there is no sub-pixel search. Distortion and SSE
have to calculated using get_inter_mbpred_error().

Change-Id: Ifb36e17eef7750af93efa7d0e2870142ef540184

d75eb736

Use int_mv instead of MV in vp8_mv_cont · a39321f3
Scott LaVarnway authored 13 years ago
```
Less operations.

Change-Id: Ibb9cd5ae66b8c7c681c9a654d551c8729c31c3ae
```
a39321f3
Removed unused variable warnings · cfab2cae
Scott LaVarnway authored 13 years ago
```
Change-Id: I6e5e921f03dc15a72da89a457848d519647677a3
```
cfab2cae