    quantize avx: copy implementation to intrinsics · 7c278721
    Johann authored
    Adds an early exit based on ptest. Slightly slower than the ssse3
    version in the full case (no rows skipped) because of the extra check,
    but potentially faster if lots of rows can be skipped.
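
A minimal sketch of what a ptest-based early exit can look like, not the code from this change. The helper names `row_below_zbin` and `count_skippable_rows`, the single scalar `zbin` threshold, and the 8-coefficient row width are assumptions made for illustration. `_mm_testz_si128` is the SSE4.1 intrinsic that compiles to ptest; the `target` attribute is a GCC/Clang extension, and a scalar fallback covers other toolchains.

```c
#include <stdint.h>

#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
#include <immintrin.h>

/* Hypothetical helper: returns 1 when all 8 coefficients in the row satisfy
 * |c| <= zbin, meaning the row would quantize to zero and can be skipped.
 * Assumes no coefficient equals INT16_MIN. */
__attribute__((target("sse4.1")))
static int row_below_zbin(const int16_t *coeff, int16_t zbin) {
  const __m128i c = _mm_loadu_si128((const __m128i *)coeff);
  const __m128i abs_c = _mm_abs_epi16(c);
  /* Each lane becomes all ones where |c| > zbin, otherwise zero. */
  const __m128i cmp = _mm_cmpgt_epi16(abs_c, _mm_set1_epi16(zbin));
  /* ptest: yields 1 only when (cmp & cmp) has no bits set. */
  return _mm_testz_si128(cmp, cmp);
}
#else
/* Scalar fallback with the same result for non-x86 builds. */
static int row_below_zbin(const int16_t *coeff, int16_t zbin) {
  for (int i = 0; i < 8; ++i) {
    const int v = coeff[i] < 0 ? -(int)coeff[i] : (int)coeff[i];
    if (v > zbin) return 0;
  }
  return 1;
}
#endif

/* Hypothetical driver: counts how many 8-coefficient rows could be skipped. */
static int count_skippable_rows(const int16_t *coeff, int rows, int16_t zbin) {
  int skipped = 0;
  for (int i = 0; i < rows; ++i) {
    skipped += row_below_zbin(coeff + 8 * i, zbin);
  }
  return skipped;
}
```

The check costs one extra compare and ptest per row, which is the overhead mentioned above when nothing can be skipped.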
    
    Very close in speed to the assembly.
    
    Can run in 32 bit, unlike the assembly. Allows reworking the function
    prototype to use structs.
    
    Change-Id: If80e2b9ba059370a4cad3c973196e82a97b4330e