pub fn weight(x: &[u8]) -> u64
Computes the Hamming weight of x, i.e. the number of bits set to 1 across all bytes. Uses a highly optimized AVX2 implementation if available.
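
For reference, the result can be checked against a plain scalar version. The sketch below is an assumption for illustration only: the name `weight_reference` is hypothetical, and it is not the crate's AVX2 code path, just a portable loop over the standard library's `u8::count_ones` that should yield the same count.

```rust
/// Hypothetical scalar reference for the Hamming weight of a byte slice.
/// Not the optimized implementation; it only illustrates the expected result.
pub fn weight_reference(x: &[u8]) -> u64 {
    // count_ones() returns the number of 1 bits in each byte as a u32;
    // widen to u64 before summing so long inputs cannot overflow.
    x.iter().map(|b| u64::from(b.count_ones())).sum()
}
```

For example, the bytes `[0xFF, 0x0F, 0x01, 0x00]` contain 8 + 4 + 1 + 0 set bits, so the weight is 13; an empty slice has weight 0.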