Commits · b0519a26b120c8a5763f89f097f8b128d9257a6c · BC / public / external / libvpx

02 Sep, 2010 - 5 commits

Whitespace: nuke CRLFs · 4496db45
John Koleszar authored 14 years ago
```
Change-Id: I8b9fdf9875a8fcff4cb49a3357ce44f18108c2e7
```
4496db45

encoder: remove postproc dependency · 76640f85

James Zern authored 14 years ago

Remove the dependency on postproc.c for the encoder in general, the only
unchecked need for it is when CONFIG_PSNR is enabled. All other cases
are already wrapped in CONFIG_POSTPROC. In the CONFIG_PSNR case the file
will still be included.

Additionally, when VP8_SET_POSTPROC is used with the encoder when post
processing has been disabled an error will be returned.

This addresses issue #153.

Change-Id: Ia6dfe20167f7077734a6058cbd1d794550346089

76640f85

added separate rounding/zbin constants for 2nd order · fca12920

Yaowu Xu authored 14 years ago

This allows experiments of using different rounding and
zerobin constants for 2nd order blocks.

Change-Id: Idd829adba3edd1f713c66151a8d29bb245e33a71

fca12920

Disable frame dropping by default · 23216211

John Koleszar authored 14 years ago

This is not the behavior that most users expect.

Change-Id: I226126ea400c22cf1f7918e80ea7fe0771c569cb

23216211

Fix rare deadlock before loop filter · d45e5501

Frank Galligan authored 14 years ago

There was an extremely rare deadlock that happened when one thread
was waiting to start the loop filter on frame n while the other
threads were starting to work on frame n+1.

Change-Id: Icc94f728b3b6663405435640d9a2996735ba19ef

d45e5501

01 Sep, 2010 - 1 commit
- Replace sleep(0) calls in multi-threaded decoder · 0e78efad
  Yunqing Wang authored 14 years ago
```
This is a workaround for gLucid problem.

Change-Id: I188a016a07e4c2ea212444c5a6284ff3c48a5caa
```
  0e78efad
31 Aug, 2010 - 3 commits

Improved Force Key Frame Behaviour · c239a1b6

Paul Wilkins authored 14 years ago

These changes improve the behaviour of the code with
forced key frames sent in by a calling application.

The sizing of the frames is still suboptimal for two pass in
particular but the behaviour is much better than it was.

Change-Id: I35fae610c67688ccc69d11f385e87dfc884e65a1

c239a1b6

followup arm patch · 0b94f5d6

Johann authored 14 years ago

make the arm asm detokenizer work with the new structures

Change-Id: I7cd92c2a018ec24032bb1cfd1bb9739bc84b444a

0b94f5d6

Changed above and left context data layout · e85e6315

Scott LaVarnway authored 14 years ago

The main reason for the change was to reduce cycles in the token
decoder. (~1.5% gain for 32 bit)  This layout should be more
cache friendly.

As a result of this change, the encoder had to be updated.

Change-Id: Id5e804169d8889da0378b3a519ac04dabd28c837
Note: dixie uses a similar layout

e85e6315

27 Aug, 2010 - 1 commit

Fix harmless off-by-1 error. · 7a8e0a29

Timothy B. Terriberry authored 14 years ago

The memory being zeroed in vp8_update_mode_info_border() was just
 allocated with calloc, and so the entire function is actually
 redundant, but it should be made correct in case someone expects
 it to actually work in the future.

Change-Id: If7a84e489157ab34ab77ec6e2fe034fb71cf8c79

7a8e0a29

24 Aug, 2010 - 1 commit

clean up compiler warnings · 5c244398

Johann authored 14 years ago

did a test compile with clang and got rid of some warnings that have
been annoying me for a while:
vp8/decoder/detokenize.c: In function 'vp8_init_detokenizer':
vp8/decoder/detokenize.c:121: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:122: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:123: warning: assignment from incompatible pointer type
vp8/decoder/detokenize.c:124: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:125: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:128: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:129: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:130: warning: assignment discards qualifiers from pointer target type
vp8/decoder/detokenize.c:131: warning: assignment discards qualifiers from pointer target type

Change-Id: I78ddab176fe47cbeed30379709dc7bab01c0c2e4

5c244398

23 Aug, 2010 - 2 commits

update structures · d73217ab

Johann authored 14 years ago

mbmi and eob moved in previous commits

Change-Id: I30a2eba36addf89ee50b406ad4afdd059a832711

d73217ab

Rework idct calling structure. · 93c32a55

Fritz Koenig authored 14 years ago

Moving the eob structure allows for a non-struct based
function to handle decoding an entire mb of
idct/dequant/recon data.  This allows for SIMD functions
to idct/dequant/recon multiple blocks at once.

SSE2 implementation gives 3% gain on Atom.

Change-Id: I8a8f3efd546ea4e0535f517d94f347cfb737c9c2

93c32a55

20 Aug, 2010 - 1 commit

increase rate control buffer level precision · 8e7ebacb

John Koleszar authored 14 years ago

The external API exposes the RC initial/optimal/full buffer level in
milliseconds, but this value was truncated internally to seconds. This
patch allows the use of the full precision during the conversion from
time to bits.

Change-Id: If8dd2a87614c05747f81432cbe75dd9e6ed2f04e

8e7ebacb

19 Aug, 2010 - 3 commits

Revert "Removed ssse3 sixtap code" · b0660457
Jim Bankoski authored 14 years ago
```
This reverts commit 6ea5bb85.
```
b0660457

cleanup simple loop filter · 52852da7

Johann authored 14 years ago

move some things around, reorder some instructions

constant 0 is used several times. load it once per call in horiz,
once per loop in vert.

separate saturating instructions to avoid stalls.

just use one usub8 call to set GE flags, rather than uqsub8 followed by
usub8 w/ 0

document some stalls for further consideration

Change-Id: Ic3877e0ddbe314bb8a17fd5db73501a7d64570ec

52852da7

fix armv6 simpleloop filter · 467a0b99

Johann authored 14 years ago

test cases were causing a crash because the count was being read
incorrectly. after fixing that, noticed that the output was not
matching. fixed that.

Change-Id: Idb0edb887736bd566a3cf6d4aa1a03ea8d20eb27

467a0b99

18 Aug, 2010 - 1 commit
- Removed ssse3 sixtap code · 6ea5bb85
  Scott LaVarnway authored 14 years ago
```
Change-Id: I0f20fbb898ee31eb94a143471aa6f1ca17a229a4
```
  6ea5bb85
16 Aug, 2010 - 2 commits

store more vars than we removed · c75f3993

Johann authored 14 years ago

only saved r4-11+lr, but were storing r4-r12+lr

Change-Id: If77df1998af50e9badee7d99ef53543046434675

c75f3993

arm: fix missing dependency with --enable-shared · 9aa498b8

John Koleszar authored 14 years ago

The C version of the dequant/idct/add function depends on the C
version of the IDCT, but this isn't compiled in on ARM. Since this
code has asm version, we can just remove this file to eliminate the
link error.

Change-Id: I21de74d89d3765a1db2da27292b20727c53178e9

9aa498b8

13 Aug, 2010 - 1 commit

move segmentation_common to encoder · 80d3923a

John Koleszar authored 14 years ago

vp8_update_gf_useage_maps() is only used by the encoder. This patch
fixes the ability to build in decode-only or encode-only
configurations.

Change-Id: I3a5211428e539886ba998e09e8abd747ac55c9aa

80d3923a

12 Aug, 2010 - 4 commits

framework for assembly version of the detokenizer · 9602799c

Johann authored 14 years ago

adds a compile time option: --enable-arm-asm-detok which pulls in
vp8/decoder/arm/detokenize.asm

currently about break even speed wise, but changes are pending to
the fill code (branch and load 3 bytes versus conditionally always
load one) and the error handling. Currently it doesn't handle zero
runs or overrunning the buffer.

this is really just so i don't have to rebase my changes all the
time to run benchmarks - now just need to replace one file!

Change-Id: I56d0e2354dc0ca3811bffd0e88fe1f952fa6c797

9602799c

update structure · 633646b7

Johann authored 14 years ago

mode_info_context->mbmi no longer gets copied up a level

Change-Id: Icd2d27d381909721326c34594a1ccdc26d48a995

633646b7

remove unused definition · 1ec7981c

Johann authored 14 years ago

asm_offsets contains some definitions which are no longer used. this
was one of them. v6 build works now

Change-Id: If370cfa8acd145de4fead2d9a11b048fccc090df

1ec7981c

Removed unnecessary MB_MODE_INFO copies · 9c7a0090

Scott LaVarnway authored 14 years ago

These copies occurred for each macroblock in the encoder and decoder.
Thetemp MB_MODE_INFO mbmi was removed from MACROBLOCKD.  As a result,
a large number compile errors had to be fixed.

Change-Id: I4cf0ffae3ce244f6db04a4c217d52dd256382cf3

9c7a0090

11 Aug, 2010 - 8 commits

cosmetics: add missing 2D array braces · d22e2968
John Koleszar authored 14 years ago
```
Silences compile warning.

Change-Id: I4b207d97f8570fe29aa2710e4ce4f02e7e43b57a
```
d22e2968

avoid negative array subscript warnings · 392a9582

John Koleszar authored 14 years ago

The mv_ref and sub_mv_ref token encodings are indexed from NEARESTMV
and LEFT4X4, respectively, rather than being zero-based like the
other token encodings.

Change-Id: I3699c3f84111209ecfb91097c4b900773e9a3ad5

392a9582

Finished vp8_sixtap_predict4x4_ssse3 function · b07e5b6f

Scott LaVarnway authored 14 years ago

Added vp8_filter_block1d4_h6_ssse3 and vp8_filter_block1d4_v6_ssse3
assembly routines.  Also removed unused assembly.

Change-Id: I01c1021835f2edda9da706822345f217087ca0d0

b07e5b6f

rename DETOK_[AL] · c0ba42d3

Johann authored 14 years ago

everything else uses lowercase detok

Change-Id: I9671e2e90eb2961208dfa81c00b3accb5749ec04

c0ba42d3

Moved gf_active code to encoder only · 99f46d62

Scott LaVarnway authored 14 years ago

The gf_active code is only used by the encoder, so it was moved from
common and decoder.

Change-Id: Iada15acd5b2b33ff70c34668ca87d4cfd0d05025

99f46d62

Removed duplicate functions · c404fa42
Yaowu Xu authored 14 years ago
```
Change-Id: Ie587972ccefd3c762b8cdf8ef39345cd22924b9b
```
c404fa42

Normalize quantizer's zero bin and rounding factors · 3b95a46c

Yaowu Xu authored 14 years ago

This patch changes a few numbers in the two constant arrays
for quantizer's zerobin and rounding factors, in general to
make the sum of the two factors for any Q to be 128.  While
it might be beneficial to calibrate the two arrays for best
quantizer performance, it is not the purpose of this patch.
Normalizing the two arrays will enable quick optimization
of the current faster quantizer, i.e .zerobin check can be
removed.

Change-Id: If9abfd7929bf4b8e9ecd64a79d817c6728c820bd

3b95a46c

Add trellis quantization. · 8fa38096

Timothy B. Terriberry authored 14 years ago

Replace the exponential search for optimal rounding during
 quantization with a linear Viterbi trellis and enable it
 by default when using --best.
Right now this operates on top of the output of the adaptive
 zero-bin quantizer in vp8_regular_quantize_b() and gives a small
 gain.
It can be tested as a replacement for that quantizer by
 enabling the call to vp8_strict_quantize_b(), which uses
 normal rounding and no zero bin offset.
Ultimately, the quantizer will have to become a function of lambda
 in order to take advantage of activity masking, since there is
 limited ability to change the quantization factor itself.
However, currently vp8_strict_quantize_b() plus the trellis
 quantizer (which is lambda-dependent) loses to
 vp8_regular_quantize_b() alone (which is not) on my test clip.

Patch Set 3:

Fix an issue related to the cost evaluation of successor
states when a coefficient is reduced to zero. With this
issue fixed, now the trellis search almost exactly matches
the exponential search.

Patch Set 2:

Overall, the goal of this patch set is to make "trellis"
search to produce encodings that match the exponential
search version. There are three main differences between
Patch Set 2 and 1:
a. Patch set 1 did not properly account for the scale of
2nd order error, so patch set 2 disable it all together
for 2nd blocks.
b. Patch set 1 was not consistent on when to enable the
the quantization optimization. Patch set 2 restore the
condition to be consistent.
c. Patch set 1 checks quantized level L-1, and L for any
input coefficient was quantized to L. Patch set 2 limits
the candidate coefficient to those that were rounded up
to L. It is worth noting here that a strategy to check
L and L+1 for coefficients that were truncated down to L
might work.

(a and b get trellis quant to basically match the exponential
search on all mid/low rate encodings on cif set, without
a, b, trellis quant can hurt the psnr by 0.2 to .3db at
200kbps for some cif clips)
(c gets trellis quant  to match the exponential search
to match at Q0 encoding, without c, trellis quant can be
1.5 to 2db lower for encodings with fixed Q at 0 on most
derf cif clips)

Change-Id:	Ib1a043b665d75fbf00cb0257b7c18e90eebab95e

8fa38096

10 Aug, 2010 - 2 commits

Added ssse3 version of sixtap filters · e4fe8669

Scott LaVarnway authored 14 years ago

Improved decoder performance by 9% for the clip used.

Change-Id: I8fc5609213b7bef10248372595dc85b29f9895b9

e4fe8669

First modification of multi-thread decoder · ba2e107d

Yunqing Wang authored 14 years ago

This is the first modification of VP8 multi-thread decoder, which uses
same threads to decode macroblocks and then do loopfiltering for each
frame.

Inspired by Rob Clark, synchronization was done on every 8 macroblocks
instead of every macroblock to reduce lock contention.

Comparing with the original code, this implementation gave about 15%-
20% performance gain while decoding my test clips on a Core2 Quad
platform (Linux).

The work is not done yet.

Test on other platforms are needed.

Change-Id: Ice9ddb0b511af1359b9f71e65066143c04fef3b5

ba2e107d

09 Aug, 2010 - 1 commit

Mark loopfilter C functions as static · 618c7d27

John Koleszar authored 14 years ago

Clang defaults to C99 mode, and inline works differently in C99.
(gcc, on the other hand, defaults to a special gnu-style inlining,
which uses different syntax.)   Making the functions static makes sure
clang doesn't decide to discard a function because it's too large to
inline.

Thanks to eli.friedman for the patch.

Fixes http://code.google.com/p/webm/issues/detail?id=114

Change-Id: If3c1c3c176eb855a584a60007237283b0cc631a4

618c7d27

02 Aug, 2010 - 4 commits

nasm: avoid space before the :data symbol type. · 0e8f108f

Jan Kratochvil authored 14 years ago

global label:data
           ^^

Provide nasm compatibility.  No binary change by this patch with yasm
on {x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id:	I10f17eb1e4d4a718d4ebd1d0ccddc807c365e021

0e8f108f

nasm: end labels with colon (':') · 0327d3df

Jan Kratochvil authored 14 years ago

Labels should end by colon (':'), nasm requires it.

Provide nasm compatibility.  No binary change by this patch with yasm
on {x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id: I0b2ec6f01afb061d92841887affb5ca0084f936f

0327d3df

nasm: use OWORD vs DQWORD · c8134bc5

Jan Kratochvil authored 14 years ago

nasm knows only OWORD.  yasm knows both OWORD and DQWORD.

Provide nasm compatibility.  No binary change by this patch with yasm on
{x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id: I62151390089e90df9a7667822fa594ac20b00e78

c8134bc5

Replace pinsrw (SSE) with MMX instructions · 7d243701

Philip Jägenstedt authored 14 years ago

Fixes http://code.google.com/p/webm/issues/detail?id=136

Change-Id:	I5a3e294061644a1a9718e8ba4a39548ede25cc42

7d243701