Hi, thanks for your work. I'd be interested to know whether the model also provides phoneme-level timing information at inference?