Fix scheduling for vldm/vstm instructions that load/store more than 32 bytes on Corte...