Description
Context
- Users want to do OCR on the Zone
- We don't have an OCR engine installed
- If we install the Tesseract OCR engine (the Ubuntu package tesseract-ocr), users can install the Python and R Tesseract libraries themselves and run OCR.
- Users need guidance on how to use the engine and which libraries to use.
Todo
- Install tesseract-ocr (sudo apt install tesseract-ocr)
- Make example notebooks showing how to use the engine and the libraries
- Add here:
RUN apt-get update && \
Expected Result
- We have the tesseract-ocr engine installed, and users can install Python and R packages to use the engine
- This keeps our base image small while enabling OCR
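
As a starting point for the example notebooks, here is a minimal sketch of how a user might check that the engine is available and call it from Python. It uses only the standard library and the tesseract command-line interface (`tesseract <image> stdout -l <lang>`). The helper names `find_tesseract`, `build_ocr_command`, and `ocr_image` are hypothetical, chosen for illustration, and the sketch assumes the `eng` language data is installed alongside the engine.

```python
import shutil
import subprocess
from typing import List, Optional


def find_tesseract() -> Optional[str]:
    """Return the path to the tesseract binary, or None if it is not on PATH."""
    return shutil.which("tesseract")


def build_ocr_command(image_path: str, lang: str = "eng") -> List[str]:
    """Build the CLI call that prints the recognized text to stdout."""
    # "stdout" as the output base tells tesseract to print instead of writing a file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path: str, lang: str = "eng") -> str:
    """Run OCR on one image and return the recognized text."""
    if find_tesseract() is None:
        # Assumption: the engine is installed system-wide via the tesseract-ocr package.
        raise RuntimeError("tesseract binary not found; is tesseract-ocr installed?")
    result = subprocess.run(
        build_ocr_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

A call such as `ocr_image("scan.png")` would return the text found in a hypothetical `scan.png`. Python wrapper libraries such as pytesseract call the same binary, so the `find_tesseract` check is also a quick way to confirm the engine is present before a user installs a wrapper.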
Reference
This epic is based on https://gitlab.k8s.cloud.statcan.ca/zone/zone-pulse/-/issues/21