- 26 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: I32beb5f18c10b5771146c55933b5555487f53633
-
- 25 Mar, 2014 - 2 commits
-
-
Dmitry Kovalev authored
Change-Id: I713af99d1e17e05a20eab20df51d74ebfd1a68d2
-
Yaowu Xu authored
Adding the missed include file. Change-Id: I7e48df6b0633afbebaf1ccb3062ae404e7203dc9
-
- 24 Mar, 2014 - 3 commits
-
-
Dmitry Kovalev authored
Change-Id: I47a8b4bf9a6cc0063d1a6785eaaad641d0659e24
-
Dmitry Kovalev authored
Change-Id: I01b3530779da89acb84c71bac5ccac456f00c5ac
-
hkuang authored
Let the calculation to be compatible with Google's HW implementation. Change-Id: I22e179888cdb0419e230351c0a47661b37051fef
-
- 21 Mar, 2014 - 5 commits
-
-
Dmitry Kovalev authored
Change-Id: I7c7cf7d3c7b00d1c74ffa8aa8fb8d78a0e48326f
-
Dmitry Kovalev authored
Change-Id: Ibb72a29cae9ca9443aae56fc4c5458d190eae279
-
levytamar82 authored
2 functions were optimized for avx2 by using full 256 bit register In order to handle 32 elements in parallel instead of only 16 in parallel: 1. vp9_sad32x32x4d 2. vp9_sad64x64x4d The function level gain is 66% and the user level gain is ~1%. Change-Id: I4efbb3bc7d8bc03b64b6c98f5cd5c4a9dd3212cb
-
Yunqing Wang authored
Fixed dr memory errors reported in Issue 736: https://code.google.com/p/webm/issues/detail?id=736 All elements in left_col buffer need to be initialized to ensure the correctness of SIMD operations in x86 optimized code. Change-Id: I8e7f26ab45cca8099c1f9342bcf852f828bda7e4
-
Dmitry Kovalev authored
Change-Id: Ib3c1746e61220c629cbd971b2458aa686b5c9e36
-
- 20 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: Id1fa36c92cb007b73a450cc8552e810cedad38b9
-
- 19 Mar, 2014 - 3 commits
-
-
Dmitry Kovalev authored
Change-Id: I3b944884c048f589c86e0169aeb3c3855bc8b729
-
Yaowu Xu authored
Change-Id: Ic8e52a89e0df816c38cd8ff1b7c53862b9a6dff2
-
Dmitry Kovalev authored
Change-Id: If674140e30c223c88894b983fd22a583efb99dcf
-
- 18 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: Ia561dda53f2dd10e3a10a2df2adb8027ab19397a
-
- 17 Mar, 2014 - 1 commit
-
-
hkuang authored
Fixes issue #731 Change-Id: Id313e84b8fb4ff20f6a4e1ed11cb601927888318
-
- 13 Mar, 2014 - 5 commits
-
-
Dmitry Kovalev authored
Change-Id: I652d17f7bff84f75d015f4f39652472e14eb3134
-
Dmitry Kovalev authored
Change-Id: I18b3939f0b51085cdd25c9182c3a9c7536ca7e3e
-
Dmitry Kovalev authored
Change-Id: I9a19eb96907f674e3ce1e573f5dd49f0fbf2ae4f
-
- 10 Mar, 2014 - 4 commits
-
-
Dmitry Kovalev authored
Change-Id: I7ad79c061ad4efbc4914ac49723b48183fdbdd47
-
Dmitry Kovalev authored
Change-Id: Ied12b39c55667b26fd3bf90eb331e601c53a10f6
-
Dmitry Kovalev authored
Change-Id: I224e49955ad6c833d204feb8efc4056e37d206be
-
Dmitry Kovalev authored
Change-Id: Iebd182d7aeebc0f8964b6fd35057449bb25b00c1
-
- 06 Mar, 2014 - 3 commits
-
-
James Zern authored
clang reports gcc-4.2.1 in e.g., 3.3, 3.4; add a specific clang version check for _mm256_broadcastsi128_si256 fixes issue #720 Change-Id: I5c8e3c27fdea05d8a5b050e8cb74894b595f4709
-
Dmitry Kovalev authored
Change-Id: I03451c88536bc498edddbe0cd9773ff79da085c2
-
James Zern authored
fixes a warning in vs9/x64 related to ceil() Change-Id: Ic4bde9d0b7e961546dbe304de74aa37fc02fcf94
-
- 05 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: I7f3bcca103f0b1f6b3c064b61472543de9a8288a
-
- 04 Mar, 2014 - 1 commit
-
-
Dmitry Kovalev authored
Change-Id: Ib49d8dbc67c590f22a1a70251ff607c9f38febd7
-
- 03 Mar, 2014 - 5 commits
-
-
Jim Bankoski authored
Change-Id: I916944950deb22f4c2301d83a803b732bf3ecd77
-
Jim Bankoski authored
Change-Id: Ib65fb0679156960305b10fbf590254ff6bf1bfe1
-
James Zern authored
significantly speeds up file generation. the goal of this change is to convert rtcd.sh to perl as directly as possible to allow for simple comparison. future changes can make it more perl-like. --- Linux [CREATE] vpx_scale_rtcd.h real 0m0.485s -> 0m0.022s [CREATE] vp8_rtcd.h real 0m4.619s -> 0m0.060s [CREATE] vp9_rtcd.h real 0m10.102s -> 0m0.087s Windows [CREATE] vpx_scale_rtcd.h real 0m8.360s -> 0m0.080s [CREATE] vp8_rtcd.h real 1m8.083s -> 0m0.160s [CREATE] vp9_rtcd.h real 2m6.489s -> 0m0.233s Change-Id: Idfb71188206c91237d6a3c3a81dfe00d103f11ee
-
Dmitry Kovalev authored
Change-Id: I4a54b12e5229705222c5a101258b9d1f81e2948d
-
Dmitry Kovalev authored
Change-Id: I9423b543e1be414e5c9e10480b813f06e6b88f8a
-
- 01 Mar, 2014 - 3 commits
-
-
Yaowu Xu authored
There were two parameters not in use, this commit removed them. Change-Id: Ia03a73b9a2521400bed539df45574e34214ed93a
-
Yaowu Xu authored
The function has evolved over time, now only calls vp9_rtcd(), so this commit removes the function and changes to call vp9_rtcd() directly. Change-Id: I8cfa6190daa4b28f6f3d1e11bb3a07f9c95322bf
-
levytamar82 authored
Optimizing 2 functions to process 32 elements in parallel instead of 16: 1. vp9_sub_pixel_avg_variance64x64 2. vp9_sub_pixel_avg_variance32x32 both of those function were calling vp9_sub_pixel_avg_variance16xh_ssse3 instead of calling that function, it calls vp9_sub_pixel_avg_variance32xh_avx2 that is written in avx2 and process 32 elements in parallel. This Optimization gave 80% function level gain and 2% user level gain Change-Id: Iea694654e1b7612dc6ed11e2626208c2179502c8
-
- 28 Feb, 2014 - 1 commit
-
-
Dmitry Kovalev authored
different_ref_found is always equal to one (if calculated) because ref_frame[0] != ref_frame[1] for each mi-block. Change-Id: Ibd7625b7b29dec2fd3c40edbc3de1169abb78585
-