NVPTX: support direct f16 <-> f64 conversions via intrinsics.