Commits · a8f9b9c94ff077a95d0572c79601956d4599db67 · BC / public / external / libvpx

17 Jul, 2013 - 1 commit
- added missed replacement · a8f9b9c9
  Yaowu Xu authored 11 years ago
```
Change-Id: I2bce6f381fef0729b4dd5eb09ccb609f2eddd7ef
```
  a8f9b9c9
16 Jul, 2013 - 37 commits

Merge "Loop filter code cleanup." · f53d007b
Dmitry Kovalev authored 11 years ago

f53d007b

Dmitry Kovalev authored 11 years ago

Removing tile_rows and tile_columns from VP9Common, removing redundant
constants MIN_TILE_WIDTH and MAX_TILE_WIDTH, changing signature of
vp9_get_tile_n_bits.

Change-Id: I8ff3104a38179b2c6900df965c144c1d6f602267

9482a0bf

Loop filter code cleanup. · 2de3c8d2

Dmitry Kovalev authored 11 years ago

Cosmetic code changes, renaming 'flat' local var to 'mask', removing
unused field 'blim' from loopfilter_info_n and loop_filter_info structs.

Change-Id: I51e6ccf727fe361ad9a08e29e1201aa7abd4987f

2de3c8d2

Merge changes I40454d26,I892e76d5,I865ab3f9,I4a4bec17,I61c4351e,I37eb3559,I1031c556,I8c8f1f42 · 98e132bd

James Zern authored 11 years ago

* changes:
  delete vp9_loopfilter_sse2.asm
  vp9_loopfilter_intrin_sse2: cosmetics: fix indent
  delete x86/vp9_loopfilter_x86.h
  vp9_loopfilter_intrin_sse2: make some funcs static
  vp9_loopfilter_intrin_sse2: remove unused uv funcs
  vp9_loopfilter: remove uv function typedef
  filter_block_plane: reuse some constants
  vp9_loopfilter.c: make some functions static

98e132bd

Merge "use consistent framerate naming" · 39ce4b13
James Zern authored 11 years ago

39ce4b13
use consistent framerate naming · 9581eb6e
James Zern authored 11 years ago
```
s/frame_rate/framerate/g

Change-Id: I6fc3e088e419c5f46e3a9390dd8a2cad2677a2fc
```
9581eb6e
Merge "SSE2 16x16 inverse ADST/DCT hybrid transform" · 5e8e2bf4
Jingning Han authored 11 years ago

5e8e2bf4
Merge "Rewriting vp9_set_pred_flag_{seg_id, mbskip}." · 5de96b3c
Dmitry Kovalev authored 11 years ago

5de96b3c
Merge "Moving vp9_kf_default_bmode_probs to vp9_entropymode.c." · 85a0d8e8
Dmitry Kovalev authored 11 years ago

85a0d8e8

delete vp9_loopfilter_sse2.asm · 50015f6e

James Zern authored 11 years ago

sse2 functions are provided by vp9_loopfilter_intrin_sse2.c

Change-Id: I40454d26034e3ef915eeaf889937fe7d1b519b9b

50015f6e

vp9_loopfilter_intrin_sse2: cosmetics: fix indent · 8f4787a3
James Zern authored 11 years ago
```
Change-Id: I892e76d5ad1443b2ea0d1a7839fe26afe9c68ffb
```
8f4787a3

delete x86/vp9_loopfilter_x86.h · af582542

James Zern authored 11 years ago

also remove prototype_loopfilter{,_block} defines from vp9_loopfilter.h

Change-Id: I865ab3f9436c7b1ca166f76630328abf01389405

af582542

Merge "vp9: remove frames_{since,till}.. from MACROBLOCKD" · 5baa416b
James Zern authored 11 years ago

5baa416b
Merge "Cosmetic changes in 4x4 and 8x8 fdct unit tests" · 70fe2b3e
James Zern authored 11 years ago

70fe2b3e

SSE2 16x16 inverse ADST/DCT hybrid transform · d05f66aa

Jingning Han authored 11 years ago

This commit enables SSE2 implementation of 16x16 inverse ADST/DCT
hybrid transform. The runtime goes from 5742 cycles -> 1821 cycles.
This provides about 1% encoding speed-up at speed 0.

Change-Id: I1678d0988bf30b9efd524877705bbb3645edb17b

d05f66aa

Merge "VP[89]_COMMON: remove unused near_boffset" · c0562d08
James Zern authored 11 years ago

c0562d08
Merge "VP9_COMMON: remove unused framerate/bitrate" · 63e914bd
James Zern authored 11 years ago

63e914bd
Merge "yv12config: remove YUV_TYPE" · 3a7c2665
James Zern authored 11 years ago

3a7c2665
Merge "Replace generated quant tables with static lookup tables." · 58a20053
Ronald S. Bultje authored 11 years ago

58a20053

Replace generated quant tables with static lookup tables. · e965cccc

Ronald S. Bultje authored 11 years ago

This prevents possible float rounding issues between architectures.

Change-Id: I6ed260aebd49feb4cfb5596a5370c44be5f72167

e965cccc

Merge "Fix above context pointers" · cc1aac1b
John Koleszar authored 11 years ago

cc1aac1b
Merge "SSE2 8x8 inverse ADST/DCT transform" · 58519047
Jingning Han authored 11 years ago

58519047
Moving vp9_kf_default_bmode_probs to vp9_entropymode.c. · baf0c959
Dmitry Kovalev authored 11 years ago
```
Removing vp9_modelcontext.c.

Change-Id: If2316c58dead2708d9f95b52d9494ba4c1dd7427
```
baf0c959

Rewriting vp9_set_pred_flag_{seg_id, mbskip}. · 863138a2

Dmitry Kovalev authored 11 years ago

Making implementation of vp9_set_pred_flag_{seg_id, mbskip} consistent
with vp9_get_segment_id without using confusing sub(a, b) macro. Passing
mi_row and mi_col to functions explicitly instead of replying on
mb_to_right_edge and mb_to_bottom_edge.

Change-Id: I54c1087dd2ba9036f8ba7eb165b073e807d00435

863138a2

Fix above context pointers · 5efd9609

John Koleszar authored 11 years ago

In the prior code, the above context pointers used for entropy
decoding were initialized on the first frame, and not updated when
the frame size changed. The per-frame code which initializes the
contexts assumes that the contexts are contiguous, leading to an
incomplete initialization when the frame is smaller. This commit
updates the pointers so that the context is contigous whenever
the frame size changes.

Change-Id: I08b53e3a30c8289491212311682ff1b8028cff6c

5efd9609

Merge "vp9_convolve8_[horiz|vert]_avg" · 90ebfe62
Johann authored 11 years ago

90ebfe62
Merge "Skip inter-coded block reconstruction in rd loop" · dd97c62a
Jingning Han authored 11 years ago

dd97c62a
Merge "Removing and moving around constant definitions." · e8e7620a
Dmitry Kovalev authored 11 years ago

e8e7620a
Merge "Change to extend full border only when needed" · c5b0cd84
Yaowu Xu authored 11 years ago

c5b0cd84

Change to extend full border only when needed · 5b915ebd

Yaowu Xu authored 11 years ago

This is a short term optimization till we work out a decoder
implementation requiring no frame border extension.

Change-Id: I02d15bfde4d926b50a4e58b393d8c4062d1be70f

5b915ebd

Removing and moving around constant definitions. · ca75f125

Dmitry Kovalev authored 11 years ago

Removing unused and duplicated constants, moving them from *.h to *.c
if possible.

Change-Id: Ief4d6b984a3ca2e9b38504f0d855ed072cf7133f

ca75f125

Merge "Consistent naming for loop-filter filters." · 65762849
Dmitry Kovalev authored 11 years ago

65762849
Merge "Remove print_nmvcounts" · 6eae37f4
Johann authored 11 years ago

6eae37f4

Increase border size from 96 to 160. · b02c4d36

Ronald S. Bultje authored 11 years ago

This is required because upon downscaling, if a motion vector points
partially into the UMV (e.g. all minus 1 of 64+7 pixels, i.e. 70),
then we can point up to 140 pixels into the larger-resolution (2x)
reference buffer UMV, which means the UMV for reference buffers in
downscaling needs to be 140 rounded up to the nearest multiple of 32,
i.e. 160.

Longer-term, we should probably handle the UMV differently by detecting
edge coverage on-the-fly and using a temporary buffer for edge extensions
instead of adding 160 pixels on all sides of the image (which means a
CIF image uses 3x its own area size for borders).

Change-Id: I5184443e6731cd6721fc6a5d430a53e7d91b4f7e

b02c4d36

Inline vp9_quantize() in xform_quant(). · 1ff94fea

Ronald S. Bultje authored 11 years ago

Cycle times:
4x4:    151 to  131 cycles (15% faster)
8x8:    334 to  306 cycles (9% faster)
16x16: 1401 to 1368 cycles (2.5% faster)
32x32: 7403 to 7367 cycles (0.5% faster)

Total encode time of first 50 frames of bus @ 1500kbps (speed 0)
goes from 1min39.2 to 1min38.6, i.e. a 0.67% overall speedup.

Change-Id: I799a49460e5e3fcab01725564dd49c629bfe935f

1ff94fea

Merge "Inline xform_quant() in encode_block_intra()." · 7e684e20
Ronald S. Bultje authored 11 years ago

7e684e20
Merge "Neon: Update mbfilter if all vectors follow one branch." · ce1d69ae
Frank Galligan authored 11 years ago

ce1d69ae

15 Jul, 2013 - 2 commits

Consistent naming for loop-filter filters. · e973b4e2

Dmitry Kovalev authored 11 years ago

Renaming flatmask4 to flat_mask4, flatmask5 to flat_mask5, hevmask to
hev_mask, filter to filter4, mbfilter to filter8, wide_mbfilter to
filter16.

Change-Id: Ic61c73e59c2eee505257584867aafac99833cea1

e973b4e2

Inline xform_quant() in encode_block_intra(). · 6fb41874

Ronald S. Bultje authored 11 years ago

Also inline some of the block calculations to assist the compiler to
not do silly things like calculating the same offset (or converting
between raster/transform block offset or block, mi and pixel unit)
many, many, many times.

Cycle times:
4x4:     584 ->   505 cycles (16% faster)
8x8:    1651 ->  1560 cycles (6% faster)
16x16:  7897 ->  7704 cycles (2.5% faster)
32x32: 16096 -> 15852 cycles (1.5% faster)

Overall, this saves about 0.5 seconds (1min49.8 -> 1min49.3) on the
first 50 frames of bus (speed 0) @ 1500kbps, i.e. 0.5% overall.

Change-Id: If3dd62453f8e2ab9d4ee616bc4ea956fb8874b80

6fb41874