Skip to content

Deprecation Warnings from Hugging Face Transformers in Whisper Transcription #9

Open
@siddanth-6365

Description

@siddanth-6365

When running transcription using the Whisper model, several deprecation warnings are displayed:

  • The input name inputs is deprecated—should use input_features instead.
  • A bug fix ([Whisper] Refactor forced_decoder_ids & prompt ids huggingface/transformers#28687) now causes multilingual Whisper to default to language detection followed by transcription instead of translation to English.
  • Passing a tuple of past_key_values is deprecated and will be removed in Transformers v4.43.0. The recommendation is to use an instance of EncoderDecoderCache (e.g., past_key_values=EncoderDecoderCache.from_legacy_cache(past_key_values)).
  • The attention mask is not set because the pad token is the same as the eos token. Explicitly passing an attention_mask is recommended to avoid unexpected behavior.
Image

Next Steps:

  • Should we update our implementation (e.g., switching to input_features, setting an explicit attention mask, and handling past_key_values accordingly) to future-proof the code and avoid these warnings?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions