    quantize: copy ssse3 optimizations to intrinsics · d52cb597
    Johann authored
    Fairly minor differences from sse2. pabsw and psignw are the big gains.
    Also re-uses some values in eob calculation to avoid an extra pcmp.
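
    As a rough illustration of why pabsw and psignw help, here is a scalar
    model of a single 16-bit lane. It assumes the sse2 path uses the usual
    sign-mask idiom (build a 0/-1 mask, then xor and subtract) for both the
    absolute value and the sign restore. All function names are hypothetical
    and this is not the libvpx code itself.

```c
#include <assert.h>
#include <stdint.h>

/* Scalar model of one 16-bit lane. Names are hypothetical, not libvpx's. */

/* sse2-style absolute value: build a 0/-1 sign mask, then xor and subtract
 * (three vector ops per register in the real code, by assumption). */
int16_t abs_sse2_style(int16_t x) {
  int16_t sign = (int16_t)(x < 0 ? -1 : 0);
  return (int16_t)((x ^ sign) - sign);
}

/* Per-lane behaviour of a single pabsw (_mm_abs_epi16). */
int16_t abs_pabsw(int16_t x) { return (int16_t)(x < 0 ? -x : x); }

/* sse2-style sign restore: reapply the sign mask with xor and subtract. */
int16_t restore_sign_sse2_style(int16_t mag, int16_t orig) {
  int16_t sign = (int16_t)(orig < 0 ? -1 : 0);
  return (int16_t)((mag ^ sign) - sign);
}

/* Per-lane behaviour of a single psignw (_mm_sign_epi16): negate when the
 * second operand is negative, zero when it is zero, else pass through. */
int16_t restore_sign_psignw(int16_t mag, int16_t orig) {
  if (orig < 0) return (int16_t)-mag;
  if (orig == 0) return 0;
  return mag;
}
```

    The two variants agree for nonzero inputs. They differ only when the
    original coefficient is zero, where psignw forces the lane to zero.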
    
    Fixes test failures in HBD and OS X builds.
    
    Allows using it in 32-bit builds, where it is about 40% faster than sse2.
    
    Substantially faster than the assembly for skip_block. 10-20% faster the
    rest of the time.
    
    Change-Id: If783bb3567e561e47667e10133b9c84414a334e2