pub fn rcrsa16(a: usize, b: usize) -> usize
stdsimd
Cross subtraction and addition of packed 16-bit signed numbers, with each result halved and its least significant bit dropped
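
The lane arithmetic can be sketched as a portable software model. This is an illustration only, not the intrinsic's implementation: the name `rcrsa16_model` is hypothetical, it takes `u32` instead of `usize` so that it behaves the same on any host, and the lane pairing (high half of `a` minus low half of `b`, low half of `a` plus high half of `b`) is an assumption based on the RISC-V "P" extension draft's description of `RCRSA16`.

```rust
/// Hypothetical software model of `rcrsa16` on a single 32-bit word.
/// Assumed lane pairing (from the RISC-V "P" extension draft):
///   result[31:16] = (a[31:16] - b[15:0])  >> 1   (arithmetic shift)
///   result[15:0]  = (a[15:0]  + b[31:16]) >> 1   (arithmetic shift)
pub fn rcrsa16_model(a: u32, b: u32) -> u32 {
    // Split each operand into two signed 16-bit lanes, widened to i32
    // so that the intermediate sum or difference cannot overflow.
    let a_hi = (a >> 16) as u16 as i16 as i32;
    let a_lo = a as u16 as i16 as i32;
    let b_hi = (b >> 16) as u16 as i16 as i32;
    let b_lo = b as u16 as i16 as i32;

    // Cross subtract and add, then halve. `>>` on i32 is an arithmetic
    // shift, so the least significant bit is dropped (rounds toward
    // negative infinity).
    let hi = (a_hi - b_lo) >> 1;
    let lo = (a_lo + b_hi) >> 1;

    // Repack the two 16-bit results into one 32-bit word.
    (((hi as u16) as u32) << 16) | ((lo as u16) as u32)
}
```

Under these assumptions, `rcrsa16_model(0x0007_0003, 0x0002_0001)` yields `0x0003_0002`: the high lane is `(7 - 1) >> 1 = 3` and the low lane is `(3 + 2) >> 1 = 2`. Because the intermediate values are one bit wider than the lanes, the halved results always fit back into 16 bits without saturation.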