AMDGPU/SI: use S_MOV_B64 for larger copies in copyPhysReg