VestigiaLunae/extract-subtitle

Hard-burned subtitles OCR to SRT extractor

The do-all.sh script runs the following steps in order:

  1. Generate a cropped video with ffmpeg (you will have to adjust the crop area to your video size).
  2. Generate PNG snapshots with ffmpeg (ffmpeg ... fps=1, i.e. one snapshot per second).
  3. Run optical character recognition on the snapshots with macOCR (Apple Silicon only), which outputs a JSON file.
  4. Convert the JSON to SRT, then normalize and deduplicate the subtitles using https://github.com/cdown/srt.
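
To illustrate step 4, here is a minimal sketch of how per-second OCR results could be merged into SRT cues. It is not the code from this repository: the input shape (a list of dicts with "index" and "text" keys) and the function names are hypothetical, and it uses only the standard library rather than the srt package so that it stays self-contained.

```python
from datetime import timedelta


def format_timestamp(t: timedelta) -> str:
    """Format a timedelta as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = t // timedelta(milliseconds=1)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def frames_to_srt(frames: list[dict], seconds_per_frame: float = 1.0) -> str:
    """Merge consecutive snapshots that carry the same text into SRT cues.

    ASSUMPTION: `frames` looks like [{"index": 0, "text": "..."}, ...],
    where "index" is the zero-based snapshot number. The real macOCR JSON
    layout may differ and would need adapting.
    """
    cues = []  # each cue is [start_index, end_index_exclusive, text]
    for frame in sorted(frames, key=lambda f: f["index"]):
        text = frame["text"].strip()
        if not text:
            continue  # no subtitle recognized in this snapshot
        idx = frame["index"]
        if cues and cues[-1][2] == text and cues[-1][1] == idx:
            cues[-1][1] = idx + 1  # same text on the next snapshot: extend the cue
        else:
            cues.append([idx, idx + 1, text])

    blocks = []
    for number, (start, end, text) in enumerate(cues, start=1):
        begin = format_timestamp(timedelta(seconds=start * seconds_per_frame))
        finish = format_timestamp(timedelta(seconds=end * seconds_per_frame))
        blocks.append(f"{number}\n{begin} --> {finish}\n{text}\n")
    return "\n".join(blocks)
```

With fps=1, snapshot number N is treated as covering the interval from N to N+1 seconds, so a line that stays on screen for three snapshots becomes a single three-second cue. Exact string comparison is the simplest deduplication rule; OCR noise between frames would likely call for fuzzier matching.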

NOTE

  • This collection of scripts is a work in progress and will require tweaking for each specific scenario; the places that need editing are marked with TODO comments in the code. Use at your own risk.
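
As an example of the per-video tweaking involved, the crop area for step 1 could be derived from the frame size instead of being hard-coded. The sketch below assumes the subtitles sit in a strip at the bottom of the frame; the function names and the default fraction of 0.2 are hypothetical and not taken from the scripts. It only builds the ffmpeg argument list and does not run ffmpeg.

```python
def bottom_strip_crop(width: int, height: int, fraction: float = 0.2) -> str:
    """Return an ffmpeg crop filter (crop=w:h:x:y) for the bottom part of the frame.

    ASSUMPTION: subtitles occupy the bottom `fraction` of the picture.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    strip_height = round(height * fraction)
    strip_height -= strip_height % 2  # keep the height even, which many codecs expect
    if strip_height <= 0:
        raise ValueError("crop strip is empty; increase fraction")
    y = height - strip_height  # top edge of the strip
    return f"crop={width}:{strip_height}:0:{y}"


def ffmpeg_crop_args(src: str, dst: str, width: int, height: int,
                     fraction: float = 0.2) -> list[str]:
    """Build the argument list for cropping `src` into `dst` (audio dropped)."""
    return [
        "ffmpeg", "-i", src,
        "-vf", bottom_strip_crop(width, height, fraction),
        "-an",  # audio is not needed for OCR
        dst,
    ]
```

The resulting list can be passed to subprocess.run. For a 1920x1080 video with the default fraction, the filter is crop=1920:216:0:864.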

Languages

  • Python 91.1%
  • Shell 8.9%