AMDGPU/SI: Better handle s_wait insertion