- 03 Oct, 2013 1 commit
-
-
Yunqing Wang authored
In subpixel filters, prefetched source data, unrolled loops, and interleaved instructions. In HORIZx4, integrated the idea in Scott's CL (commit: d22a504d), which was suggested by Erik/Tamar from Intel. Further tweaking was done to combine row 0, 2, and row 1, 3 in registers to do more 2-row-in-1 operations until the last add. Test showed a ~2% decoder speedup. Change-Id: Ib53d04ede8166c38c3dc744da8c6f737ce26a0e3
-
- 01 Oct, 2013 3 commits
-
-
Yunqing Wang authored
-
Yunqing Wang authored
Interleaved the instructions, reduced register dependency, and prefetched the source data. This improved the decoder speed by 0.6% - 2%. Change-Id: I568067aa0c629b2e58219326899c82aedf7eccca
-
Yaowu Xu authored
near is a key word, changed to use nearmv instead. Change-Id: Ib54438c431b2b2521a62fc7b61a9c127dd7bc01e
-
- 30 Sep, 2013 36 commits
-
-
Dmitry Kovalev authored
-
Dmitry Kovalev authored
-
Jingning Han authored
Make encoder skip rectangular partition check in speed 1 and above, when early termination was triggered in partition split. Thanks Guillaume (gmartres@) for catching this issue. This change makes bus_cif at 2000kbps speed 1 runtime goes down from 25612ms to 23438ms (about 9% speed-up), at the expense of -0.235% performance down. Change-Id: I98613fad081a261d30d5fa206f934ca70601c180
-
Dmitry Kovalev authored
Change-Id: I7380a089105f658257bbb3e30a525da168e76952
-
Dmitry Kovalev authored
-
Dmitry Kovalev authored
We don't need these functions anymore. The only one which was actually used is vp9_add_constant_residual_32x32. Addition of vp9_short_idct32x32_1_add eliminates this single usage. SSE2 optimized version of vp9_short_idct32x32_1_add will be added in the next patch set, right now it is only C implementation. Now we have all idct functions implemented in a consistent manner. Change-Id: I63df79a13cf62aa2c9360a7a26933c100f9ebda3
-
Jim Bankoski authored
-
Jim Bankoski authored
* changes: fixed cpp lint issue in vp9_postproc_x86 nolintify intrinsic idct file
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
* changes: cpplint issues in vp9_filter.h cpplint issues with onyxc_int.h
-
Jim Bankoski authored
-
Paul Wilkins authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
-
Jim Bankoski authored
* changes: fixed cpplint issue with vp9_scale.h vp9_entropymv.c cpplint issues resolved cpplint fixes to debug modes cpplint issues in vp9_onyx.h cpplint issues resolved in vp9_dx_iface.c
-
Jim Bankoski authored
-
Deb Mukherjee authored
-
Paul Wilkins authored
This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41
-
Jim Bankoski authored
Change-Id: I4e163cc4ce9ec2f3a5a8b9da478049c71b08d71f
-
Jim Bankoski authored
Change-Id: I8b496191c6a60a60a52c929adca305db47058a84
-
Jim Bankoski authored
Change-Id: I97bf241c0d981721cc74a50be47c9db8a00f6be3
-
Jim Bankoski authored
Change-Id: I442609f689aa9381e1e208012305cf62a6b31eee
-
Jim Bankoski authored
Change-Id: I07fbb32d5cee0003d04b2369cfafcb03c371cd4f
-
Jim Bankoski authored
Change-Id: If6a7a8c48fefc69349c792d8ed52a6e1d374e46e
-