Commits · 64baa8df2e3f5cd8036fac9715d3f7d348620fa6 · BC / public / external / libvpx

16 Dec, 2010 - 2 commits

Changed segmentation check order · 64baa8df

Scott LaVarnway authored 14 years ago

In SPLITMV, the 8x8 segment will be checked first.  If the 8x8 rd
is better than the best, we check the other segments.  Otherwise
bail.  Adjustments to the thresh_mult were necessary to make
up for the initial quality loss.
The performance improved by 20% (average) for good quality,
speed 0 and speed 1, while the overall quality remained the same.

Change-Id: I717aef401323c8a254fba3e9777d2a316c774cc3

64baa8df

Adjusted breakout RD for SPLITMV · 81cdeb71

Scott LaVarnway authored 14 years ago

vp8_rd_pick_best_mbsegmentation looks at y only.  The new
breakout does not include the frame cost, the prob_skip_false
cost, or the uv rate.  Performance improved by a few percent
and the quality remained the same.

Change-Id: I94ff013998ac51e8ecce7130870f7b6600758e15

81cdeb71

14 Dec, 2010 - 4 commits

Fix a bug in motion search code(2) · 08706a3e

Yunqing Wang authored 14 years ago

This fix added MV range checks for NEWMV mode as suggested by Jim.
To reduce unnecessary MV range checks, I tried Yaowu's suggestion.
Update UMV borders in NEWMV mode to also cover MV range check.
Also, in this way, every MV that is valid gets checked in diamond
search function.

Change-Id: I95a89ce0daf6f178c454448f13d4249f19b30f3a

08706a3e

Fix a bug in motion search code · 7fb0f868

Yunqing Wang authored 14 years ago

The MV's range is 256. Since the new motion search uses a different
starting MV than the center ref MV, a MV range checking needs to
be done to avoid corruption.

Change-Id: I8ae0721d1bd203639e13891e2e54a2e87276f306

7fb0f868

fix a bug that "optimize" flag is not set for sub-threads · 64f3d915

Yaowu Xu authored 14 years ago

The flag for quantization optimization was not properly propagated to
mb row encoding threads.

Change-Id: Ic561599c35acd94cd5698c9b314bccd596ac2deb

64f3d915

shrink TOKENEXTRA and vp8_extra_bit_struct · 825adc46

Johann authored 14 years ago

Per John's previous change, shrink TOKENEXTRA from 20 to 8 bytes
original: b7b1e6fb
reverted: 41f4458a

Also drop unused field from vp8_extra_bit_struct

Update ARM ASM to deal with this change. In particular, Extra is signed
and needs to be sign-extended when loaded.

Change-Id: Ibd0ddc058432bc7bb09222d6ce4ef77e93a30b41

825adc46

13 Dec, 2010 - 3 commits

Revert "Reduce size of TOKENEXTRA struct" · 41f4458a

John Koleszar authored 14 years ago

This reverts commit b7b1e6fb. Previous
fix is incomplete, breaks ARM. Itchy submit finger.

Change-Id: I939dc0d3bf4173cf951c1d152338ab6ea2184bb9

41f4458a

remove unused temporal preproc code · b1aa54ab

John Koleszar authored 14 years ago

This code is unused, as the current preproc implementation uses the
same spatial filter that postproc uses.

Change-Id: Ia06d5664917d67283f279e2480016bebed602ea7

b1aa54ab

Reduce size of TOKENEXTRA struct · b7b1e6fb

John Koleszar authored 14 years ago

Change the size of structure elements to reduce memory utilization.
Removed the 'section' member entirely, as it is set but never read.

Change-Id: Iad043830392fb4168cb3cd6075fb0eb70c7f691c

b7b1e6fb

10 Dec, 2010 - 1 commit

fix a bug in multithreaded encoding with active_map enabled · 97a86c5b

Yaowu Xu authored 14 years ago

Added the initialization of the pointer to active map. Also added the
same logic for cyclic refresh in mbrow encoding threads.

Change-Id: Ic48d0849dc706b27fba72d07dcc498075725663d

97a86c5b

09 Dec, 2010 - 3 commits

vp8 fast quantizer sse2 optimizations for eob. · e0cf330c

Fritz Koenig authored 14 years ago

Changed the end of block computation to use pmaxw.  Removed
additional pushing and popping of registers that was not needed.

Change-Id: I08cb9b424513cd8a2c7ad8cea53b4e2adc66ef98

e0cf330c

fix uninitialized read in encode breakout · cb969895

John Koleszar authored 14 years ago

Change I3430820b performed an uninitialized read when
encode_breakout == 0, since the sum and sse wouldn't be set:

   if(x->encode_breakout)
       VARIANCE_INVOKE(..., get16x16var)(..., &sum, &sse);
   if (cpi->active_map_enabled && x->active_ptr[0] == 0) {
       ...
   } else if (sse < x->encode_breakout)

Change-Id: I915eb76d1227b4b6d1137a0dedf2c143860098a2

cb969895

Correct q_low and q_high limits for the recode loop · c63fc881

Paul Wilkins authored 14 years ago

Corrected the initial Q range limits for the recode loop
to reflect the current allowed range for the frame.

In experimental work on constrained quality this bug was
causing unnecessary recodes.

Change-Id: I7e256fbfa681293b0223fe21ec329933d76c229f

c63fc881

07 Dec, 2010 - 2 commits

vp8e - static threshold play · 718c1971

Jim Bankoski authored 14 years ago

Realized no need for new assembly code sum is already
calculated.

Change-Id: Ie2d94feb4b7c1f77c5359bca29b66228e41638c9

718c1971

adjust RDMULT for UV plane in quantization RDO · 7c03a1c3

Yaowu Xu authored 14 years ago

This patch adds a weighting factor on RDMULT for UV blocks. The change
has an overall gain about 0.5% based on ssim, between 0.1 and 0.2% by
psnr numbers.

Change-Id: I97781b077ce3bb7e34241b03268491917e8d1d72

7c03a1c3

06 Dec, 2010 - 3 commits

Fix a memory leak problem in encoder · 9520f4b3

Yunqing Wang authored 14 years ago

Deallocating the buffers before re-allocating them.

The fix passed James Berry's test program for memory
leak check.

Change-Id: I18c3cf665412c0e313a523e3d435106c03ca438d

9520f4b3

vp8_rd_pick_best_mbsegmentation code restructure · 2fa5d5a2

Scott LaVarnway authored 14 years ago

Moved the code from the segmentation loop into a function
which is now called for each segment. This will allow us
to change the segment order checking more easily.

Change-Id: I9510d26f0acae5a73043fcca8f1984b121d3e052

2fa5d5a2

Fix for manual Golden frame frequency · 8534071d

Patrik Westin authored 14 years ago

When auto_golden wasn't set it forced all frames to be a golden
frame. Now the manual configured frequency is adhered to.

Change-Id: I360acac9bc487db0d9c4d4da6ee41f70c227c539

8534071d

04 Dec, 2010 - 1 commit

Change to inter_minq table. · cec6a596

Paul Wilkins authored 14 years ago

The inter_minq table controls the range of quantizers available
for a particular frame in two pass relative to a max Q value.

The changes reduces the range somewhat. The effect of this
was a small increase (0.3% average) in psnr for the test set
but it should also help encode speed somewhat for higher
quality modes as it will reduce the number of iterations in the
recode loop.

The change damps the range of quantizers available locally
within a section of a clip and should therefore help keep quality
more uniform. If there is systematic overshoot or undershoot the
range can shift gradually to accommodate. However, there is
some increased risk of overshoot or undershoot against the target
bit rate in VBR mode and this risk will be more pronounced for short
clips.

The change damps the range of quantizers available locally
within a section of a clip and should therefore help keep quality
more uniform. If there is systematic overshoot or undershoot the
range can shift gradually to accommodate. However, there is
some increased risk of overshoot or undershoot against the
target bit rate in VBR mode and this risk will be more
pronounced for short clips.

Change-Id: I84465567d49ae767c6c73ff2a2aac30c895adb52

cec6a596

03 Dec, 2010 - 1 commit

Improve MV prediction accuracy to achieve performance gain · c3bbb291

Yunqing Wang authored 14 years ago

Add vp8_mv_pred() to better predict starting MV for NEWMV
mode in vp8_rd_pick_inter_mode(). Set different search
ranges according to MV prediction accuracy, which improves
encoder performance without hurting the quality. Also,
as Yaowu suggested, using diamond search result as full
search starting point and therefore adjusting(reducing)
full search range helps the performance.

Change-Id: Ie4a3c8df87e697c1f4f6e2ddb693766bba1b77b6

c3bbb291

01 Dec, 2010 - 1 commit

Set refresh_alt_ref_frame on keyframe encode. · 9c8ad79f

Fritz Koenig authored 14 years ago

On a keyframe alt ref and golden are refreshed.  The flag was
not being set and so on the frame after a keyframe, motion
search would occur on the alt ref frame.  This is not necessary
because the alt ref frame identical to the last frame in this
scenario.

Handle corner case where a forward alt-ref frame is put
directly after a keyframe.

Change-Id: I9be4cf290d694f8cf2f9a31852014b5ccf1504d3

9c8ad79f

27 Nov, 2010 - 1 commit
- vp8e - remove unnecessary variance calc · 3430820b
  Jim Bankoski authored 14 years ago
```
only do the variance calculation if necessary
( eg needed for breakout test)
```
  3430820b
22 Nov, 2010 - 1 commit

Recalibration of bits per MB tables · ad6150f7

Paul Wilkins authored 14 years ago

The baseline bits per MB prediction tables have been
re calibrated based on the assumption that bits per mb
is inversely proportional to the quantizer level.

Change-Id: Ibd355c7acac4b8053dda1baf1032fe35f11da7f7

ad6150f7

19 Nov, 2010 - 1 commit

Added extra two pass stats gathering. · 70b885a0

Paul Wilkins authored 14 years ago

Added code to record spend so far against planed budget.

Change-Id: I5a3335346fa1771b2b1219df9f6127f9993d2594

70b885a0

18 Nov, 2010 - 1 commit

remove warning · ed5ab7fa

Pascal Massimino authored 14 years ago

was having: "vp8/encoder/onyx_if.c:5365: warning: comparison of unsigned expression >= 0 is always true"

ed5ab7fa

17 Nov, 2010 - 2 commits

Removed unnecessary checks. · f7670acc

Scott LaVarnway authored 14 years ago

macro_block_yrd and vp8_rdcost_mby are not called for SPLITMV.

Change-Id: I2224d3c8725df526d48426447482768d543752f1

f7670acc

Replaced recode loop test with a function call · f874391e

Paul Wilkins authored 14 years ago

Replaced existing code to decide if a frame recode is required
with a function call. This is to simplify addition of extra clauses
that may be needed for the planned constrained quality mode.

Also fixed a bug where by alt ref not considered in the test.

Change-Id: I3d40bb21abe3e19f8456761e6849deb171738b60

f874391e

16 Nov, 2010 - 1 commit

Comments for alt ref flags. · 69ee697f

Fritz Koenig authored 14 years ago

Clarify what the alt ref flags do when encoding.

Change-Id: I71f78e0f42edae633fb91840f29dfbe64362c44c

69ee697f

15 Nov, 2010 - 2 commits

Remove stack shadowing for x86-x64 for SAD functions. · e1802553

Fritz Koenig authored 14 years ago

x86-64 passes arguments in registers.  There is no need to push
them to the stack before using them.

This fixes 15acc84f where ebx
was not getting preserved on x86.

Change-Id: I1214b5f818a0201f75ab6ad7d5c6f448e09b16c2

e1802553

Bad cost tables used in ARNR filtering. · 373f5c31

Paul Wilkins authored 14 years ago

The use of incorrect mv costing tables in the ARNR sub-pel
filtering code led to corruption of the altref buffer in some cases,
particularly at low data rates.

The average gain from this fix is about 0.3% but there are a few
extreme cases where nasty and visible artifacts manifested and
for these few data points the improvement is > 10%.

PGW and AWG

Change-Id: I95cc02b196a433e71d0d2bd2b933fe68ed31e796

373f5c31

11 Nov, 2010 - 3 commits

make rdmult adaptive for intra in quantizer RDO · ef2f27f1

Yaowu Xu authored 14 years ago

This intends to correct the tendency that VP8 aggressively favors rate
on intra coded frames. Experiments tested different numbers in [0, 1]
and found 9/16 overall provided about 2-4% gains for all-intra coded
clips based on vpx-ssim metric. The impact on regular encoded clips
is much smaller but positive overall. Overall impact on psnr is also
positive even though very small.

Change-Id: If808553aaaa87fdd44691f9787820ac9856d9f8a

ef2f27f1

quantizer: fix assertion in fast quantizer path · 0a49747b

John Koleszar authored 14 years ago

The fast quantizer assembly code has not been updated to match the new
exact quantizer, which was made the default in commit 6adbe090.
Specifically, they are not aware of the potential for the coefficient
to be scaled, which results in the quantized result exceeding the range
of the DCT. This patch restores the previous behavior of using the
non-shifted coefficients when in the fast quantizer code path, but
unfortunately requires rebuilding the tables when switching between the
two.

Change-Id: I0a33f5b3850335011a06906f49fafed54dda9546

0a49747b

Revert "Remove stack shadowing for x86-64" · 58083cb3
Fritz Koenig authored 14 years ago
```
This reverts commit 15acc84f.

Change-Id: Ia640be8cbc134432914849c1750f62575ea084e6
```
58083cb3

10 Nov, 2010 - 4 commits

FDCT optimizations. · 5f0e0617

Fritz Koenig authored 14 years ago

Fixed up the fdct for mmx and 8x4 sse2 to match them
most recent changes.

Change-Id: Ibee2d6c536fe14dcf75cd6eb1c73f4848a56d719

5f0e0617

postproc : Re-work posproc calling to allow more flags. · 647df00f

Fritz Koenig authored 14 years ago

Debugging in postproc needs more flags to allow for specific
block types to be turned on or off in the visualizations.

Must be enabled with --enable-postproc-visualizer during
configuration time.

Change-Id: Ia74f357ddc3ad4fb8082afd3a64f62384e4fcb2d

647df00f

Relax rate control for last few frames · 513f8e68

Paul Wilkins authored 14 years ago

VBR rate control can become very noisy for the last few frames.
If there are a few bits to spare or a small overshoot then the
target rate and hence quantizer may start to fluctuate wildly.

This patch prevents further adjustment of the active Q limits for
the last few frames.

Patch also removes some redundant variables and makes one small bug fix.

Change-Id: Ic167831bec79acc9f0d7e4698bcc4bb188840c45

513f8e68

Tuning for the more exact quantizer. · 6adbe090

Paul Wilkins authored 14 years ago

Small changes to the default zero bin and rounding tables.
Though the tables are currently the same for the Y1 and Y2 cases
I have left them as separate tables in case we want to tune this later.

There is now some adjustment of the zbin based on the prediction mode.
Previously this was restricted to an adjustment for gf/arf 0,0 MV.

The exact quantizer now marginal outperforms and is the default.

The overall average gain is about 0.5%

Change-Id: I5e4353f3d5326dde4e86823684b236a1e9ea7f47

6adbe090

05 Nov, 2010 - 1 commit

improve average framerate calculation · f7e187d3

John Koleszar authored 14 years ago

Change Ice204e86 identified a problem with bitrate undershoot due to
low precision in the timestamps passed to the library. This patch
takes a different approach by calculating the duration of this frame
and passing it to the library, rather than using a fixed duration
and letting the library average it out with higher precision
timestamps. This part of the fix only applies to vpxenc.

This patch also attempts to fix the problem for generic applications
that may have made the same mistake vpxenc did. Instead of
calculating this frame's duration by the difference of this frame's
and the last frame's start time, we use the end times instead. This
allows the framerate calculation to scavenge "unclaimed" time from
the last frame. For instance:

  start |  end  | calculated duration
  ======+=======+====================
    0ms    33ms   33ms
   33ms    66ms   33ms
   66ms    99ms   33ms
  100ms   133ms   34ms

Change-Id: I92be4b3518e0bd530e97f90e69e75330a4c413fc

f7e187d3

01 Nov, 2010 - 1 commit

SSSE3 version of fast quantizer · ff4a71f4

Scott LaVarnway authored 14 years ago

(test clip: tulip)
For good quality mode with speed=1, this gave the encoder
a small (2 - 3%) performance boost.

Change-Id: I8a1d4269465944ac0819986c2f0be4b0a2ee0b35

ff4a71f4

29 Oct, 2010 - 1 commit

Finding first label · dcee88ea

Scott LaVarnway authored 14 years ago

Using tables for the label count and label offset.

Change-Id: Iac3d5b292c37341a881be0af282f5cac3b3e01eb

dcee88ea