Commits · 1600707d350f572d6768a9adb57248ba44c29b82 · BC / public / external / libvpx

18 Sep, 2013 - 4 commits

Fix x86inc.asm to build PIC code correctly · 9d901217

Yunqing Wang authored 11 years ago

Current x86inc.asm didn't handle 32bit PIC build properly.
TEXTRELs were seen in the library built. The PIC macros from
libvpx's x86_abi_support.asm was used to fix this problem.
The assembly code was modified to use the macros.

Notes: We need this fix in for decoder building. Functions in
encoder will be fixed later.

Change-Id: Ifa548d37b1d0bc7d0528db75009cc18cd5eb1838

9d901217

Removing redundant coef calculation + cleanup. · 98cf0145

Dmitry Kovalev authored 11 years ago

Adding temp variable for &x->plane[0], inlining src_diff values.

Change-Id: I24c08a5425a6da6fd66f5b0278f2fce74f9989b2

98cf0145

Removing redundant code from vp9_mcomp.c. · 72fd127f

Dmitry Kovalev authored 11 years ago

Replacing ((1 << MV_MAX_BITS) - 1) with MV_MAX, adding const
qualifiers, reusing computed values.

Change-Id: I7b46d47f6c644b079d9c3478116a9de465a9baec

72fd127f

Fixing typo in the encoder. · 245ca04b
Dmitry Kovalev authored 11 years ago
```
Change-Id: I168efdc366eecf638694f357ccad2f4eba7e2fdb
```
245ca04b

17 Sep, 2013 - 3 commits

Clean up second ref check in sub8x8 rd loop · c437bbcd

Jingning Han authored 11 years ago

This commit cleans up the second reference check in the
rate-distortion optimization loop of sub8x8 blocks.

Change-Id: Ife68feaa4cddbfad2878c9b44d3012788d634f97

c437bbcd

Silence a bunch of MSVC warnings · a783da80
Yaowu Xu authored 11 years ago
```
Change-Id: I16633269582a640809dca27572bbe99efa6369fc
```
a783da80

Remove redundant argument in get_sub_block_mv · 2b3bfaa9

Jingning Han authored 11 years ago

The sub8x8 check can be directly inferred from block_idx, hence
removed from the arguments if get_sub_block_mv.

Change-Id: Ib766d57e81248fb92df0f6d9b163e6c77b933ccd

2b3bfaa9

16 Sep, 2013 - 4 commits

Speed up iht8x8 by rearranging instructions. · 23e1a29f

hkuang authored 11 years ago

Speed improves from 282% to 302% faster based on assembly-perf.

Change-Id: I08c5c1a542d43361611198f750b725e4303d19e2

23e1a29f

fix a problem where an invalid mv used in search · eeae6f94

Yaowu Xu authored 11 years ago

The commit added reset of pred_mv at the beginning of each SB64x64
partition mv search, also limited the usage of pred_mv only when
search on the largest partition is already done. This is to fix
a crash at speed 1/2 encoder where an invalid mv is used in mv
search.

Change-Id: I39010177da76d054e3c90b7899a44feb2e3a5b1b

eeae6f94

Minor clean up. · cb50dc7f

Paul Wilkins authored 11 years ago

Removed some unused code and minor cleanup
/ reordering.

Change-Id: I4083ae56aeb8edfe9b85aa2f42a16aa28d19da94

cb50dc7f

Adjustment to mode_skip_start. · 3b017784

Paul Wilkins authored 11 years ago

Corrected values relating to modified mode order.

Change-Id: I24fccba3af4bc16721d5e7e51888a66305bfa7fe

3b017784

13 Sep, 2013 - 3 commits

Revert "Improved 8t filters" · 2d587619

James Zern authored 11 years ago

This is incompatible with most toolchains other than gcc.

Revert "Deleted #include <inttypes.h>"

This reverts commit 4d018be9.

This reverts commit d22a504d.

Change-Id: I1751dc6831f4395ee064e6748281418e967e1dcf

2d587619

Adaptive motion search control · c4826c59

Jingning Han authored 11 years ago

This commit enables adaptive constraint on motion search range for
smaller partitions, given the motion vectors of collocated larger
partition as a candidate initial search point.

It makes speed 0 runtime of bus at CIF and 2000 kbps goes from
167s down to 162s (3% speed-up), at 0.01dB performance gains. In
the settings of speed 1, this makes the runtime goes from 33687 ms
to 32142 ms (4.5% speed-up), at 0.03dB performance gains.

Compression performance wise, it gains at speed 1:
derf  0.118%
yt    0.237%
hd    0.203%
stdhd 0.438%

Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454

c4826c59

Fix VP9_mode_order[] · 1407cf85

Paul Wilkins authored 11 years ago

Mis-merge of the following change managed to break mode order
and delete two mode options (new alt ref and near alt ref)
It also created a situation where we could test two undefined
modes off the end of the VP9_mode_order[] data structure.
  "clang warnings : remove split and i4x4_pred fake modes"
  "Change Id: I8ef3c*"

Initial testing on Akiyo at speed 2.
101.35	 44.567	 44.447 improves to
96.82	 44.915	 44.815

Approx 0.3-0.4db gain and 2.5% size reduction

Change-Id: Icff813e7c0778d140ad4f0eea18cf1ed203c4e34

1407cf85

12 Sep, 2013 - 4 commits

Add neon optimize iht8x8 which is 282% faster than C. · 182366c7
hkuang authored 11 years ago
```
Change-Id: I963dd4a6e8671957403ccbb9a16ea7de703e3530
```
182366c7

fix clang warning in rdopt · 9ee9918d

Jim Bankoski authored 11 years ago

either missed this or it crept back in

Change-Id: I6cc1519d09e558be7250254c25bde2ae720555ea

9ee9918d

clang warnings : ref frame enum mismatch · e7f2aa0f

Jim Bankoski authored 11 years ago

Convert from refframe_type_t to VP9_REFFRAME

Change-Id: Iff4043c3fdb3e1c9c2b412bdffd5da8ed913ec13

e7f2aa0f

Deleted #include <inttypes.h> · 4d018be9

Paul Wilkins authored 11 years ago

This seems not to be needed and is not supported
in the Windows build.

Change-Id: Iaca3bbf8cca283aee6bc336cb31ba9dd4610322b

4d018be9

11 Sep, 2013 - 7 commits

clang warnings : remove split and i4x4_pred fake modes · 7fb42d90
Jim Bankoski authored 11 years ago
```
Change-Id: I8ef3c7c0f08f0f1f4ccb8ea4deca4cd8143526ee
```
7fb42d90

First draft of vp9_short_idct32x32_add_neon. · 6a501462

Christian Duvivier authored 11 years ago

Lots of TODO which will be taken care in upcoming changes. As is,
about 6x faster than C version.

Change-Id: Ie2557b72fd2d8edca376dbf400a4d173aa5e63e0

6a501462

Clean up of the search best filter speed feature · b9646467

Deb Mukherjee authored 11 years ago

Removes this speed feature since it is very slow and unlikely
to be used in practice. This cleanup removes a bunch of unnecessary
complications in the outer encode loop.

Change-Id: I3c66ef1ca924fbfad7dadff297c9e7f652d308a1

b9646467

Improved 8t filters · d22a504d

Scott LaVarnway authored 11 years ago

Reformatted version of a patch submitted by Erik/Tamar
from Intel.  For the test clips used, the decoder
performance improved by ~2%.

Change-Id: Ifbc37ac6311bca9ff1cfefe3f2e9b7f13a4a511b

d22a504d

Changes in speed 2 settings · 69fe840e

Deb Mukherjee authored 11 years ago

Propose some changes to the speed 2 settings to improve quality.
In particular, turns off the adjust_thresholds_by_speed feature
which improves results by 6%. Also removes the code for
adjust_thresholds_by_speed since it conflicts with the adaptive
rd thresh feature.

Overall, with this change speed 2 is -15.2% from speed 0 settings,
on derf, which is significantly better than -21.6% down before.

Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505

69fe840e

New mode_info_context storage -- undo revert · ac6093d1

Scott LaVarnway authored 11 years ago

mode_info_context was stored as a grid of MODE_INFO structs.
The grid now constists of pointers to MODE_INFO structs.  The
MODE_INFO structs are now stored as a stream (decoder only),
eliminating unnecessary copies and is a little more cache
friendly.

Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d

ac6093d1

code cleanup · 079183c1

Yunqing Wang authored 11 years ago

Removed unused function.

Change-Id: Icb12a09e4d303968be6aec9fae1ef05935913a4f

079183c1

10 Sep, 2013 - 7 commits

Remove redundant condition check in 32x32 quant · 5d93feb6

Jingning Han authored 11 years ago

The c code implementation of 32x32 quantization does the zbin check
of all coefficients prior to the quant/dequant loop, hence removing
the redundant zbin check inside the loop. This only affects the
c code version. SSSE3 version does not separate the zbin check out.

Change-Id: Ic197a7d61d0b25fcac3cc092987651378cb56e4e

5d93feb6

Small tweaks on the constant quality mode · 09830aa0

Deb Mukherjee authored 11 years ago

Improves results a little.
derf is now +1.078% over bitrate control.

Change-Id: I4812136f3e67be21d14ec089419976a32a841785

09830aa0

Stop partition checking when distortion is small · 0607abc3

Yunqing Wang authored 11 years ago

If the current obtained distortion is very small, which happens
for static image case, we pick the current partition type without
further split checking.

This won't affect regular videos. For static videos, we got 10%~12%
encoding speed gain. PSNR was better for some clips, and worse for
others. Overall it was even.

Change-Id: If787a57bedf46fc595ca4f5ded2b0c0a69e9fdef

0607abc3

Modify encode breakout for static frames · 939791a1

Yunqing Wang authored 11 years ago

Thank Paul for the suggestions. While turning on static-thresh
for static-image videos, a big jump on bitrate was seen. In this
patch, we detected static frames in the video using first-pass
stats. For different cases, disable encode breakout or reduce
encode breakout threshold to limit the skipping.

More modification need be done to break incorrect partition
picking pattern for static frames while skipping happens.

Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f

939791a1

Modified mode skip functionality. · 4f660cc0

Paul Wilkins authored 11 years ago

A previous speed feature skipped modes not used in earlier
partitions but this not longer worked as intended following
changes to the partition coding order and in conjunction
with some other speed features (Especially speed 2 and above).

This modified mode skip feature sets a mask after the first X
modes have been tested in each partition depending on the
reference frame of the current best case.

This patch also makes some changes to the order modes are
tested to fit better with this skip functionality.

Initial testing suggests speed and rd hit count improvements
of up to 20% at speed 1. Quality results. (derf -1.9%, std hd  +0.23%).

Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1

4f660cc0

Added extra check to rd_auto_partition_range() · 901c4954

Paul Wilkins authored 11 years ago

Added check that the returned max and minimum are
valid in bottom and right border cases.

Change-Id: I2d6cdc9b5f04c7d0ff512ddcf3228331e028bf9b

901c4954

Speed up idct16x16 by rearrange instructions. · fc5ec206

hkuang authored 11 years ago

Speed improve from 376% to 400% faster base on assembly-perf.

Change-Id: If0b2eccc39d5793dc101ce9feb7fcadf88396ea2

fc5ec206

09 Sep, 2013 - 3 commits

API extensions and sample app for spacial scalable encoder · 01b35c3c

Ivan Maltz authored 11 years ago

Sample app: vp9_spatial_scalable_encoder
vpx_codec_control extensions:
  VP9E_SET_SVC
  VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER
  VP9E_SET_MIN_Q, VP9E_SET_MAX_Q
expanded buffer size for vp9_convolve

modified setting of initial width in vp9_onyx_if.c so that layer size
can be set prior to initial encode

Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS)
Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers

Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197

01b35c3c

Remove the use of uninitialized_safe in encode_sb_ · 18c780a0

Jingning Han authored 11 years ago

Initialize the probability model context with default value in
encode_sb.

Change-Id: Id826114024dfc21c7ef41aea9f4a0316d4a5cb95

18c780a0

Revert "New mode_info_context storage" · 54a03e20

James Zern authored 11 years ago

This reverts commit dae17734

Encode crashes, leaks and increases integer overflow errors.

Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2

54a03e20

08 Sep, 2013 - 2 commits

Reduce the amount of extension in src frames · 65c2444e

Yaowu Xu authored 11 years ago

The commit changes the border pixel extension from 160 pixel each side
to what is necessary in arnr filter or motion estimation portion, i.e.
16 pixel on top and left side. For right or bottom side, the extension
is changed to either round up image size to multiple of 64 or at least
16 pixels.

Change-Id: Ic05e19b94368c1ab4df568723aae5734e6c3d2c5

65c2444e

resolve clang issue : implicit convert tx_mode -> tx_size · 9faa7e81
Jim Bankoski authored 11 years ago
```
Change-Id: Ifc9da470358f58e800e3d0d70a565b61e5f7834a
```
9faa7e81

07 Sep, 2013 - 1 commit

Fix overflow issue in 16x16 quantization SSSE3 · 09bc942b

Jingning Han authored 11 years ago

The 16x16 transform unit test suggested that the peak coefficient
value can reach 32639. This could cause potential overflow issue
in the SSSE3 implmentation of 16x16 block quantization. This commit
fixes this issue by replacing addition with saturated addition.

Change-Id: I6d5bb7c5faad4a927be53292324bd2728690717e

09bc942b

06 Sep, 2013 - 2 commits

Enable kf restrictions at speed 4 · f15cdc74
Paul Wilkins authored 11 years ago
```
Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20
```
f15cdc74

Support a constant quality mode in VP9 · e378a89b

Deb Mukherjee authored 11 years ago

Adds a new end-usage option for constant quality encoding in vpx. This
first version implemented for VP9, encodes all regular inter frames
using the quality specified in the --cq-level= option, while encoding
all key frames and golden/altref frames at a quality better than that.

The current performance on derfraw300 is +0.910% up from bitrate control,
but achieved without multiple recode loops per frame.

The decision for qp for each altref/golden/key frame will be improved
in subsequent patches based on better use of stats from the first pass.
Further, the qp for regular inter frames may also be varied around the
provided cq-level.

Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3

e378a89b