• Yi Luo's avatar
    Improve idct32x32_34_add SSSE3 intrinsics performance · 07c48ccf
    Yi Luo authored
    - Split the transform into first half and second half.
    - Reschedule the instructions to avoid stack spillover.
    - Function level speed improves ~16%.
    
    Change-Id: I166889840d23aa8a273eca00f6fbdae8b4566f35
    07c48ccf