Fix alignment bugs in genericComposite() in 128bit mode
- When params.srcRowStride is zero we should prepare a whole line of pixels that will fill the destination in one go.
- pixelsAlignmentMask should be 32 bytes for AVX CPUs, no need to make it more
- blockAlign should have a correct size, measured in pixels, not floats
Fixes T862