Commits · f5615b6149037aa1c18703d6272c7273c43b5c6a · BC / public / external / libvpx

11 Aug, 2010 - 9 commits

Merge "Finished vp8_sixtap_predict4x4_ssse3 function" · f5615b61
Scott LaVarnway authored 14 years ago

f5615b61
cosmetics: add missing 2D array braces · d22e2968
John Koleszar authored 14 years ago
```
Silences compile warning.

Change-Id: I4b207d97f8570fe29aa2710e4ce4f02e7e43b57a
```
d22e2968

avoid negative array subscript warnings · 392a9582

John Koleszar authored 14 years ago

The mv_ref and sub_mv_ref token encodings are indexed from NEARESTMV
and LEFT4X4, respectively, rather than being zero-based like the
other token encodings.

Change-Id: I3699c3f84111209ecfb91097c4b900773e9a3ad5

392a9582

Finished vp8_sixtap_predict4x4_ssse3 function · b07e5b6f

Scott LaVarnway authored 14 years ago

Added vp8_filter_block1d4_h6_ssse3 and vp8_filter_block1d4_v6_ssse3
assembly routines.  Also removed unused assembly.

Change-Id: I01c1021835f2edda9da706822345f217087ca0d0

b07e5b6f

rename DETOK_[AL] · c0ba42d3

Johann authored 14 years ago

everything else uses lowercase detok

Change-Id: I9671e2e90eb2961208dfa81c00b3accb5749ec04

c0ba42d3

Moved gf_active code to encoder only · 99f46d62

Scott LaVarnway authored 14 years ago

The gf_active code is only used by the encoder, so it was moved from
common and decoder.

Change-Id: Iada15acd5b2b33ff70c34668ca87d4cfd0d05025

99f46d62

Removed duplicate functions · c404fa42
Yaowu Xu authored 14 years ago
```
Change-Id: Ie587972ccefd3c762b8cdf8ef39345cd22924b9b
```
c404fa42

Normalize quantizer's zero bin and rounding factors · 3b95a46c

Yaowu Xu authored 14 years ago

This patch changes a few numbers in the two constant arrays
for quantizer's zerobin and rounding factors, in general to
make the sum of the two factors for any Q to be 128.  While
it might be beneficial to calibrate the two arrays for best
quantizer performance, it is not the purpose of this patch.
Normalizing the two arrays will enable quick optimization
of the current faster quantizer, i.e .zerobin check can be
removed.

Change-Id: If9abfd7929bf4b8e9ecd64a79d817c6728c820bd

3b95a46c

Add trellis quantization. · 8fa38096

Timothy B. Terriberry authored 14 years ago

Replace the exponential search for optimal rounding during
 quantization with a linear Viterbi trellis and enable it
 by default when using --best.
Right now this operates on top of the output of the adaptive
 zero-bin quantizer in vp8_regular_quantize_b() and gives a small
 gain.
It can be tested as a replacement for that quantizer by
 enabling the call to vp8_strict_quantize_b(), which uses
 normal rounding and no zero bin offset.
Ultimately, the quantizer will have to become a function of lambda
 in order to take advantage of activity masking, since there is
 limited ability to change the quantization factor itself.
However, currently vp8_strict_quantize_b() plus the trellis
 quantizer (which is lambda-dependent) loses to
 vp8_regular_quantize_b() alone (which is not) on my test clip.

Patch Set 3:

Fix an issue related to the cost evaluation of successor
states when a coefficient is reduced to zero. With this
issue fixed, now the trellis search almost exactly matches
the exponential search.

Patch Set 2:

Overall, the goal of this patch set is to make "trellis"
search to produce encodings that match the exponential
search version. There are three main differences between
Patch Set 2 and 1:
a. Patch set 1 did not properly account for the scale of
2nd order error, so patch set 2 disable it all together
for 2nd blocks.
b. Patch set 1 was not consistent on when to enable the
the quantization optimization. Patch set 2 restore the
condition to be consistent.
c. Patch set 1 checks quantized level L-1, and L for any
input coefficient was quantized to L. Patch set 2 limits
the candidate coefficient to those that were rounded up
to L. It is worth noting here that a strategy to check
L and L+1 for coefficients that were truncated down to L
might work.

(a and b get trellis quant to basically match the exponential
search on all mid/low rate encodings on cif set, without
a, b, trellis quant can hurt the psnr by 0.2 to .3db at
200kbps for some cif clips)
(c gets trellis quant  to match the exponential search
to match at Q0 encoding, without c, trellis quant can be
1.5 to 2db lower for encodings with fixed Q at 0 on most
derf cif clips)

Change-Id:	Ib1a043b665d75fbf00cb0257b7c18e90eebab95e

8fa38096

10 Aug, 2010 - 2 commits

Added ssse3 version of sixtap filters · e4fe8669

Scott LaVarnway authored 14 years ago

Improved decoder performance by 9% for the clip used.

Change-Id: I8fc5609213b7bef10248372595dc85b29f9895b9

e4fe8669

First modification of multi-thread decoder · ba2e107d

Yunqing Wang authored 14 years ago

This is the first modification of VP8 multi-thread decoder, which uses
same threads to decode macroblocks and then do loopfiltering for each
frame.

Inspired by Rob Clark, synchronization was done on every 8 macroblocks
instead of every macroblock to reduce lock contention.

Comparing with the original code, this implementation gave about 15%-
20% performance gain while decoding my test clips on a Core2 Quad
platform (Linux).

The work is not done yet.

Test on other platforms are needed.

Change-Id: Ice9ddb0b511af1359b9f71e65066143c04fef3b5

ba2e107d

09 Aug, 2010 - 1 commit

Mark loopfilter C functions as static · 618c7d27

John Koleszar authored 14 years ago

Clang defaults to C99 mode, and inline works differently in C99.
(gcc, on the other hand, defaults to a special gnu-style inlining,
which uses different syntax.)   Making the functions static makes sure
clang doesn't decide to discard a function because it's too large to
inline.

Thanks to eli.friedman for the patch.

Fixes http://code.google.com/p/webm/issues/detail?id=114

Change-Id: If3c1c3c176eb855a584a60007237283b0cc631a4

618c7d27

02 Aug, 2010 - 7 commits

Merge "Issue 150: Fixing linker warning in extend.c." · cfb204ea
John Koleszar authored 14 years ago

cfb204ea

configure: support directories containing .o · 4e6827a0

John Koleszar authored 14 years ago

Fixes http://code.google.com/p/webm/issues/detail?id=96

The regex which postprocesses the gcc make-deps (-M) output was too
greedy and matching in the dependencies part of the rule rather than
the target only. The patch provided with the issue was not correct, as
it tried to match the .o at the end of the line, which isn't correct
at least for my GCC version. This patch matches word characters
instead of .*

Thanks to raimue and the MacPorts community for isolating this issue.

Change-Id: I28510da2252e03db910c017101d9db12e5945a27

4e6827a0

nasm: avoid space before the :data symbol type. · 0e8f108f

Jan Kratochvil authored 14 years ago

global label:data
           ^^

Provide nasm compatibility.  No binary change by this patch with yasm
on {x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id:	I10f17eb1e4d4a718d4ebd1d0ccddc807c365e021

0e8f108f

nasm: end labels with colon (':') · 0327d3df

Jan Kratochvil authored 14 years ago

Labels should end by colon (':'), nasm requires it.

Provide nasm compatibility.  No binary change by this patch with yasm
on {x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id: I0b2ec6f01afb061d92841887affb5ca0084f936f

0327d3df

nasm: use OWORD vs DQWORD · c8134bc5

Jan Kratochvil authored 14 years ago

nasm knows only OWORD.  yasm knows both OWORD and DQWORD.

Provide nasm compatibility.  No binary change by this patch with yasm on
{x86_64,i686}-fedora13-linux-gnu.  Few longer opcodes with nasm on
{x86_64,i686}-fedora13-linux-gnu have been checked as safe.

Change-Id: I62151390089e90df9a7667822fa594ac20b00e78

c8134bc5

Merge "Replace pinsrw (SSE) with MMX instructions" · 67529821
John Koleszar authored 14 years ago

67529821

Replace pinsrw (SSE) with MMX instructions · 7d243701

Philip Jägenstedt authored 14 years ago

Fixes http://code.google.com/p/webm/issues/detail?id=136

Change-Id:	I5a3e294061644a1a9718e8ba4a39548ede25cc42

7d243701

29 Jul, 2010 - 2 commits
- apple: include proper mach primatives · 38a20e03
  John Koleszar authored 14 years ago
```
Fixes implicit declaration warning for 'mach_task_self'.

Patch courtesy of timeless at gmail.com

Change-Id: I9991dedd1ccfddc092eca86705ecbc3b764b799d
```
  38a20e03
- Merge "Enable the switch between two versions of quantizer" · c2a8d8b5
  Yaowu Xu authored 14 years ago
  
  c2a8d8b5
28 Jul, 2010 - 2 commits

Removed two unused global variables. · 062e6c18

Frank Galligan authored 14 years ago

Removed the global variables vp8_an and vp8_cd. vp8_an was causing problems
because it was increasing the .bss by 1572864 bytes.

Change-Id: I6c12e294133c7fb6e770c0e4536d8287a5720a87

062e6c18

Enable the switch between two versions of quantizer · f95c80b6

Yaowu Xu authored 14 years ago

To facilitate more testing related to quantizer and rate
control, the old version quantizer is added back. old and
new quantizer can be switched back and forth by define or
un-define the macro "EXACT_QUANT".

Change-Id: Ia77e687622421550f10e9d65a9884128a79a65ff

f95c80b6

27 Jul, 2010 - 5 commits

configure: pass original arguments through to make dist · 23d68a5f

John Koleszar authored 14 years ago

When running configure automatically through the make dist target,
reuse the arguments passed to the original configure command.

Change-Id: I40e5b8384d6485a565b91e6d2356d5bc9c4c5928

23d68a5f

Merge "msvs: fix install of codec sources" · aa82363c
John Koleszar authored 14 years ago

aa82363c

x86/sse2: disable asm quantizer · a570bbd4

Johann authored 14 years ago

follow up to Change I0e51492d: neon: disable asm quantizer

Now x86 doesn't segfault with --disable-runtime-cpu-detect and -p=2

Change-Id: I8ca127bb299198efebbcbd5a661e81788361933f

a570bbd4

Fix build w/o RTCD · b9a038a5

Johann authored 14 years ago

So many places to update ...

Change-Id: Ide957b40cc833f99c2d1849acade6850fbf7585d

b9a038a5

neon: disable asm quantizer · d8009c07

John Koleszar authored 14 years ago

The assembly version of the quantizer has not been updated to match
the new exact quantizer introduced in commit e04e2935. That commit tried
to disable this code but missed the non-RTCD case.

Thanks to David Baker <david.baker at openmarket.com> for isolating the
issue and testing this fix.

Change-Id: I0e51492dc6f8e44d2c10b587427448bf94135c65

d8009c07

26 Jul, 2010 - 3 commits

Merge "update arm idct functions" · 1743f948
Fritz Koenig authored 14 years ago

1743f948

Merge changes I896fe6f9,I90d8b167 · 3de8a958

Fritz Koenig authored 14 years ago

* changes:
  Change the x86 idct functions to do reconstruction at the same time
  Combine idct and reconstruction steps

3de8a958

update arm idct functions · 56f5a9a0

Johann authored 14 years ago

Jeff Muizelaar posted some changes to the idct/reconstruction c code.
This is the equivalent update for the arm assembly.

This shows a good boost on v6, and a minor boost on neon.
Here are some numbers for highway in qcif, 2641 frames:
HEAD neon: ~161 fps
new neon:  ~162 fps
HEAD v6:   ~102 fps
new v6:    ~106 fps

The following functions have been updated for armv6 and neon:
vp8_dc_only_idct_add
vp8_dequant_idct_add
vp8_dequant_dc_idct_add

Conflicts:

	vp8/decoder/arm/armv6/dequantdcidct_v6.asm
	vp8/decoder/arm/armv6/dequantidct_v6.asm

Resolved by removing these files. When I rewrote the functions, I also
moved the files to dequant_dc_idct_v6.asm/dequant_idct_v6.asm

Change-Id: Ie3300df824d52474eca1a5134cf22d8b7809a5d4

56f5a9a0

23 Jul, 2010 - 9 commits

Issue 150: Fixing linker warning in extend.c. · 1d8277f8
Justin Lebar authored 14 years ago

1d8277f8
Don't dereference ctx->priv if it hasn't been setup correctly. · 2add72d9
Fredrik Söderquist authored 14 years ago

2add72d9
Only touch ctx->priv if vp8_mmap_alloc succeeded. · eafcf918
Fredrik Söderquist authored 14 years ago

eafcf918
Change the x86 idct functions to do reconstruction at the same time · 98fcccfe
Jeff Muizelaar authored 14 years ago
```
Change-Id: I896fe6f9664e6849c7cee2cc6bb4e045eb42540f
```
98fcccfe

Combine idct and reconstruction steps · b2fa74ac

Jeff Muizelaar authored 14 years ago

This moves the prediction step before the idct and combines the idct and
reconstruction steps into a single step. Combining them seems to give an
overall decoder performance improvement of about 1%.

Change-Id: I90d8b167ec70d79c7ba2ee484106a78b3d16e318

b2fa74ac

Swap alt/gold/new/last frame buffer ptrs instead of copying. · 0ce39012

Fritz Koenig authored 14 years ago

At the end of the decode, frame buffers were being copied.
The frames are not updated after the copy, they are just
for reference on later frames.  This change allows multiple
references to the same frame buffer instead of copying it.

Changes needed to be made to the encoder to handle this.  The
encoder is still doing frame buffer copies in similar places
where pointer reference could be done.

Change-Id: I7c38be4d23979cc49b5f17241ca3a78703803e66

0ce39012

Merge commit 'refs/changes/51/351/1' of... · 68cf2431

Paul Wilkins authored 14 years ago

Merge commit 'refs/changes/51/351/1' of ssh://review.webmproject.org:29418/libvpx into KfRateBugMerged

68cf2431

Merge "Make the quantizer exact." · f5cf8553
Yaowu Xu authored 14 years ago

f5cf8553

Rate control bug with long key frame interval. · 9404c7db

Paul Wilkins authored 14 years ago

In two pass encodes, the calculation of the number of bits
allocated to a KF group had the potential to overflow for high data
rates if the interval is very long.

We observed the problem in one test clip where there was one
section where there was an 8000 frame gap between key frames.

Change-Id: Ic48eb86271775d7573b4afd166b567b64f25b787

9404c7db