`return_timestamps` argument only supports segment/sentence level timestamps. Are there current plans to add support for word level timestamps? For many use cases (including for example speech editing) this is required.