Skip to content

Support for subtitles in other languages #3

Closed
@antomal

Description

@antomal

Hi, this is a very useful app!

BUT, unfortunately it only downloads English subtitles ("EN").

For example, here is a video in Russian — https://www.youtube.com/watch?v=cjNz4ISyxtg and as a result I get an error in the console:

Starting transcript extraction...

Error processing https://www.youtube.com/watch?v=cjNz4ISyxtg: Could not retrieve a transcript for the video https://www.youtube.com/watch?v=cjNz4ISyxtg! This is most likely caused by: No transcripts were found for any of the requested language codes: ('en',) For this video (cjNz4ISyxtg) transcripts are available in the following languages: (MANUALLY CREATED) - ru ("Russian")[TRANSLATABLE] (GENERATED) - ru ("Russian (auto-generated)")[TRANSLATABLE] (TRANSLATION LANGUAGES) - ab ("Abkhazian") - aa ("Afar") - af ("Afrikaans") - ak ("Akan") - sq ("Albanian") - am ("Amharic") - ar ("Arabic") - hy ("Armenian") - as ("Assamese") - ay ("Aymara") - az ("Azerbaijani") - bn ("Bangla") - ba ("Bashkir") - eu ("Basque") - be ("Belarusian") - bho ("Bhojpuri") - bs ("Bosnian") - br ("Breton") - bg ("Bulgarian") - my ("Burmese") - ca ("Catalan") - ceb ("Cebuano") - zh-Hans ("Chinese (Simplified)") - zh-Hant ("Chinese (Traditional)") - co ("Corsican") - hr ("Croatian") - cs ("Czech") - da ("Danish") - dv ("Divehi") - nl ("Dutch") - dz ("Dzongkha") - en ("English") - eo ("Esperanto") - et ("Estonian") - ee ("Ewe") - fo ("Faroese") - fj ("Fijian") - fil ("Filipino") - fi ("Finnish") - fr ("French") - gaa ("Ga") - gl ("Galician") - lg ("Ganda") - ka ("Georgian") - de ("German") - el ("Greek") - gn ("Guarani") - gu ("Gujarati") - ht ("Haitian Creole") - ha ("Hausa") - haw ("Hawaiian") - iw ("Hebrew") - hi ("Hindi") - hmn ("Hmong") - hu ("Hungarian") - is ("Icelandic") - ig ("Igbo") - id ("Indonesian") - iu ("Inuktitut") - ga ("Irish") - it ("Italian") - ja ("Japanese") - jv ("Javanese") - kl ("Kalaallisut") - kn ("Kannada") - kk ("Kazakh") - kha ("Khasi") - km ("Khmer") - rw ("Kinyarwanda") - ko ("Korean") - kri ("Krio") - ku ("Kurdish") - ky ("Kyrgyz") - lo ("Lao") - la ("Latin") - lv ("Latvian") - ln ("Lingala") - lt ("Lithuanian") - lua ("Luba-Lulua") - luo ("Luo") - lb ("Luxembourgish") - mk ("Macedonian") - mg ("Malagasy") - ms ("Malay") - ml ("Malayalam") - mt ("Maltese") - gv ("Manx") - mi ("Māori") - mr ("Marathi") - mn ("Mongolian") - mfe ("Morisyen") - ne ("Nepali") - new ("Newari") - nso ("Northern Sotho") - no ("Norwegian") - ny ("Nyanja") - oc ("Occitan") - or ("Odia") - om ("Oromo") - os ("Ossetic") - pam ("Pampanga") - ps ("Pashto") - fa ("Persian") - pl ("Polish") - pt ("Portuguese") - pt-PT ("Portuguese (Portugal)") - pa ("Punjabi") - qu ("Quechua") - ro ("Romanian") - rn ("Rundi") - ru ("Russian") - sm ("Samoan") - sg ("Sango") - sa ("Sanskrit") - gd ("Scottish Gaelic") - sr ("Serbian") - crs ("Seselwa Creole French") - sn ("Shona") - sd ("Sindhi") - si ("Sinhala") - sk ("Slovak") - sl ("Slovenian") - so ("Somali") - st ("Southern Sotho") - es ("Spanish") - su ("Sundanese") - sw ("Swahili") - ss ("Swati") - sv ("Swedish") - tg ("Tajik") - ta ("Tamil") - tt ("Tatar") - te ("Telugu") - th ("Thai") - bo ("Tibetan") - ti ("Tigrinya") - to ("Tongan") - ts ("Tsonga") - tn ("Tswana") - tum ("Tumbuka") - tr ("Turkish") - tk ("Turkmen") - uk ("Ukrainian") - ur ("Urdu") - ug ("Uyghur") - uz ("Uzbek") - ve ("Venda") - vi ("Vietnamese") - war ("Waray") - cy ("Welsh") - fy ("Western Frisian") - wo ("Wolof") - xh ("Xhosa") - yi ("Yiddish") - yo ("Yoruba") - zu ("Zulu") If you are sure that the described cause is not responsible for this error and that a transcript should be retrievable, please create an issue at https://github.com/jdepoix/youtube-transcript-api/issues. Please add which version of youtube_transcript_api you are using and provide the information needed to replicate the error. Also make sure that there are no open issues which already describe your problem!

Transcript extraction complete! Starting Gemini processing...

All Gemini responses for all videos have been saved to /Users/antomal/Downloads/gemini.txt.

Perhaps some optional language selection is needed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions