- 10 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: Iebd182d7aeebc0f8964b6fd35057449bb25b00c1
-
- 06 Mar, 2014 - 3 commits
-
-
James Zern authored
clang reports gcc-4.2.1 in e.g., 3.3, 3.4; add a specific clang version check for _mm256_broadcastsi128_si256 fixes issue #720 Change-Id: I5c8e3c27fdea05d8a5b050e8cb74894b595f4709
-
Dmitry Kovalev authored
Change-Id: I03451c88536bc498edddbe0cd9773ff79da085c2
-
James Zern authored
fixes a warning in vs9/x64 related to ceil() Change-Id: Ic4bde9d0b7e961546dbe304de74aa37fc02fcf94
-
- 05 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: I7f3bcca103f0b1f6b3c064b61472543de9a8288a
-
- 04 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: Ib49d8dbc67c590f22a1a70251ff607c9f38febd7
-
- 03 Mar, 2014 - 5 commits
-
-
Jim Bankoski authored
Change-Id: I916944950deb22f4c2301d83a803b732bf3ecd77
-
Jim Bankoski authored
Change-Id: Ib65fb0679156960305b10fbf590254ff6bf1bfe1
-
James Zern authored
significantly speeds up file generation. the goal of this change is to convert rtcd.sh to perl as directly as possible to allow for simple comparison. future changes can make it more perl-like. --- Linux [CREATE] vpx_scale_rtcd.h real 0m0.485s -> 0m0.022s [CREATE] vp8_rtcd.h real 0m4.619s -> 0m0.060s [CREATE] vp9_rtcd.h real 0m10.102s -> 0m0.087s Windows [CREATE] vpx_scale_rtcd.h real 0m8.360s -> 0m0.080s [CREATE] vp8_rtcd.h real 1m8.083s -> 0m0.160s [CREATE] vp9_rtcd.h real 2m6.489s -> 0m0.233s Change-Id: Idfb71188206c91237d6a3c3a81dfe00d103f11ee
-
Dmitry Kovalev authored
Change-Id: I4a54b12e5229705222c5a101258b9d1f81e2948d
-
Dmitry Kovalev authored
Change-Id: I9423b543e1be414e5c9e10480b813f06e6b88f8a
-
- 01 Mar, 2014 - 3 commits
-
-
Yaowu Xu authored
There were two parameters not in use, this commit removed them. Change-Id: Ia03a73b9a2521400bed539df45574e34214ed93a
-
Yaowu Xu authored
The function has evolved over time, now only calls vp9_rtcd(), so this commit removes the function and changes to call vp9_rtcd() directly. Change-Id: I8cfa6190daa4b28f6f3d1e11bb3a07f9c95322bf
-
levytamar82 authored
Optimizing 2 functions to process 32 elements in parallel instead of 16: 1. vp9_sub_pixel_avg_variance64x64 2. vp9_sub_pixel_avg_variance32x32 both of those function were calling vp9_sub_pixel_avg_variance16xh_ssse3 instead of calling that function, it calls vp9_sub_pixel_avg_variance32xh_avx2 that is written in avx2 and process 32 elements in parallel. This Optimization gave 80% function level gain and 2% user level gain Change-Id: Iea694654e1b7612dc6ed11e2626208c2179502c8
-
- 28 Feb, 2014 - 5 commits
-
-
Dmitry Kovalev authored
different_ref_found is always equal to one (if calculated) because ref_frame[0] != ref_frame[1] for each mi-block. Change-Id: Ibd7625b7b29dec2fd3c40edbc3de1169abb78585
-
Dmitry Kovalev authored
Change-Id: I1fe71e35b1e44da693b43d26607abb33efd56820
-
Dmitry Kovalev authored
Change-Id: I7d11c6ae259aff6560710d16fea3032c661e5b02
-
Dmitry Kovalev authored
Change-Id: Id145da99259866109cfee8b47a1d8f309944b937
-
Dmitry Kovalev authored
Change-Id: Iaac57b24f79cd205a8c62bc1177412d22f5787a8
-
- 27 Feb, 2014 - 2 commits
-
-
hkuang authored
Change-Id: Ie21b5ae89100389b80f919710839084f935a8545
-
Dmitry Kovalev authored
Change-Id: I7b9738a7113c0c4687e5d320581ff69d98a8b271
-
- 26 Feb, 2014 - 2 commits
-
-
Minghai Shang authored
Change-Id: Ia7677fb436667bc4f76db71f65e4784f433f7826
-
- 24 Feb, 2014 - 1 commit
-
-
Yaowu Xu authored
for ABSDATA mode, so segment loop filter level always fall in valid range for both Absolute and delta modes. Change-Id: If90df3411479533dbdab63f8ae088d2f5dd174a9
-
- 22 Feb, 2014 - 1 commit
-
-
Yaowu Xu authored
The qindex for a segment was not clamped in ABSDATA mode, which may cause invalid memory access if an ill-formed stream has a negative value in ABSDATA mode. This commit added clamp to make sure qindex for a segment always fall into valid range. Change-Id: I0a74d00f4ef40aec7edaeca1d03c8645e23ab08c
-
- 21 Feb, 2014 - 1 commit
-
-
Alex Converse authored
Change-Id: I8cc38fdef6a2a0968af8dfe15e7c2b3c46c531ea
-
- 20 Feb, 2014 - 1 commit
-
-
Yaowu Xu authored
Change-Id: I8eeec70a7d4403243762f14d0b560792801645e8
-
- 19 Feb, 2014 - 4 commits
-
-
Dmitry Kovalev authored
Hiding vp9_find_mv_refs_idx() inside vp9_mvref_common.c, moving definition of vp9_find_mv_refs() to vp9_mvref_common.c. Change-Id: I0c9f34b03648785a7d18edf6d4fddd34e55dfcc5
-
James Zern authored
- place extern within the macro - use in the header only Change-Id: I4274b345d8af9ef329c0eb9553a3ddaad70d1d26
-
James Zern authored
+ fix formatting Change-Id: I344d4de089d03e403f0c7b3e64aeb7086cce86ac
-
James Zern authored
+ fix formatting Change-Id: Ia62610bff3d63855104366d7860749b6a3cf4577
-
- 17 Feb, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Passing block MV pointer instead of block index into vp9_full_search_sad{, x3, x8} functions. Change-Id: Ica07356633471c2c8f81b583a7aeba85a436bafb
-
- 15 Feb, 2014 - 2 commits
-
-
James Zern authored
avoids mismatched alignment warnings in visual studio builds Change-Id: I2cedb8042fd47e708bde3f7168a6fb4bd9aaa569
-
James Yu authored
Change-Id: Ifabb8c7ec0c327fea9d6739cab10addb060ff435 Signed-off-by:
James Yu <james.yu@linaro.org>
-
- 14 Feb, 2014 - 4 commits
-
-
levytamar82 authored
Optimizing 2 functions to process 32 elements in parallel instead of 16: 1. vp9_sub_pixel_variance64x64 2. vp9_sub_pixel_variance32x32 both of those function were calling vp9_sub_pixel_variance16xh_ssse3 instead of calling that function, it calls vp9_sub_pixel_variance32xh_avx2 that is written in avx2 and process 32 elements in parallel. This Optimization gave 70% function level gain and 2% user level gain Change-Id: I4f5cb386b346ff6c878a094e1c3b37e418e50bde
-
Adrian Grange authored
Change-Id: I568861ba1d43620865ad9a98a97eef37a51fd856
-
levytamar82 authored
Optimizing all SSSE3 assembly for convolution: 1. vp9_filter_block1d4_h8_sse2 2. vp9_filter_block1d8_h8_sse2 3. vp9_filter_block1d16_h8_sse2 4. vp9_filter_block1d4_v8_sse2 5. vp9_filter_block1d8_v8_sse2 6. vp9_filter_block1d16_v8_sse2 my optimization include: -processing 2x8 elements in one 128 bit register instead of processing 8 elements in one 128 bit register. -removing unecessary loads. This optimization gives between 2.4% user level gain for 480p input and 1.6% user level gain for 720p. This Optimization is done only for 64 bit Change-Id: Ic07fce2f9360329b4f2d956efda1480ae958766b
-
Dmitry Kovalev authored
Change-Id: I48ad06e3e1ae9720a0683022621f4504e3bebce6
-
- 13 Feb, 2014 - 2 commits
-
-
Frank Galligan authored
Added support for external frame buffers to libvpx's VP9 decoder. If the external frame buffer functions are set then libvpx will call the get function whenever it needs a new frame buffer to decode a frame into. And it will call the release function whenever there are no more references to that buffer. Change-Id: Id2934d005f606af6e052fb6db0d5b7c02f567522
-
Yaowu Xu authored
Prior to this commit, both encoder and decoder reset mode/mv info from previous frame in error resilient mode to ensure bitstreams are able to decode when there is loss of frame in decoder side. However, this is not necessary. This commit changed to remove the reset, so encoder can continue to use mode/mv/partition information from previously encoded frame without affecting decodeablilty under loss of frame. Change-Id: I0279f862900dc647fb471ae3389770bb1b9f454f
-