Use movups to lower memcpy and memset even if it's not fast (like corei7).