    In the function mb_lpf_horizontal_edge_w_avx2_16, the use of the intrinsic
    _mm256_cvtepu8_epi16 triggers a compiler bug in gcc 4.9.1.
    Until it is fixed, I have added a workaround that performs the up-conversion
    using broadcast128 + shuffle.
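
    A minimal sketch of the broadcast128 + shuffle idea, not the code from
    this change: the 16 source bytes are copied into both 128-bit lanes, then
    a per-lane byte shuffle interleaves them with zeros so that each byte
    becomes a 16-bit value. The names widen_u8_to_u16 and kWidenMask are
    hypothetical, and the target("avx2") attribute assumes GCC or Clang.

```c
#include <immintrin.h>
#include <stdint.h>

/* Shuffle control for _mm256_shuffle_epi8. An index byte with its high bit
 * set (0x80) produces a zero byte. The low lane spreads source bytes 0..7,
 * the high lane spreads source bytes 8..15. */
static const uint8_t kWidenMask[32] = {
  0, 0x80, 1,  0x80, 2,  0x80, 3,  0x80, 4,  0x80, 5,  0x80, 6,  0x80, 7,  0x80,
  8, 0x80, 9,  0x80, 10, 0x80, 11, 0x80, 12, 0x80, 13, 0x80, 14, 0x80, 15, 0x80
};

/* Hypothetical helper: zero-extends 16 bytes at src into 16 uint16_t at dst
 * without calling _mm256_cvtepu8_epi16. Requires a CPU with AVX2. */
__attribute__((target("avx2")))
static void widen_u8_to_u16(const uint8_t *src, uint16_t *dst) {
  /* Load the 16 source bytes (unaligned load, SSE2). */
  const __m128i in = _mm_loadu_si128((const __m128i *)src);
  /* Copy the same 128 bits into both lanes of a 256-bit register. */
  const __m256i both = _mm256_broadcastsi128_si256(in);
  const __m256i mask = _mm256_loadu_si256((const __m256i *)kWidenMask);
  /* The shuffle works within each lane: (byte, 0) pairs form the
   * little-endian 16-bit results. */
  const __m256i wide = _mm256_shuffle_epi8(both, mask);
  _mm256_storeu_si256((__m256i *)dst, wide);
}
```

    Because _mm256_shuffle_epi8 shuffles within each 128-bit lane, the
    broadcast is what makes source bytes 8..15 reachable from the high lane.
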
    The bug was reported here:
