Minimum LibreOffice packages to parse files #2610
Unanswered · nikoszaf41 asked this question in Q&A · Replies: 0 comments
I am looking into using unstructured to parse `.doc` files in an application that will run in a Docker container. Since I want the image to be as small as possible, I've been trying to find the LibreOffice packages needed for `ppt` and `doc` partitioning.

What are the minimum required packages I need to install in the container for this `doc` to `docx` conversion to run?

`soffice --headless --convert-to docx example.doc`

I've checked `libreoffice-nogui`, but it also includes many packages that are not required for these file types.

The base image I'm using is `python:3.10-slim`.
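For reference, here is a minimal sketch of how the conversion command above could be run from Python, which may help when checking whether a trimmed-down package set still works inside the container. The function names `build_convert_command` and `convert`, the `/tmp/out` directory, and the timeout value are my own illustrative choices, not part of unstructured. The sketch assumes `soffice` is on `PATH` and that LibreOffice writes the result as `<input stem>.<target>` into the directory given with `--outdir`.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(src, out_dir, target="docx", binary="soffice"):
    """Return the argv list for a headless LibreOffice conversion.

    Hypothetical helper: it mirrors the command from the question and
    adds --outdir so the output location is explicit.
    """
    return [
        binary,
        "--headless",
        "--convert-to", target,
        "--outdir", str(out_dir),
        str(src),
    ]


def convert(src, out_dir, target="docx", timeout=120):
    """Run the conversion and return the expected output path.

    Assumption: LibreOffice names the result <stem>.<target> in out_dir.
    """
    if shutil.which("soffice") is None:
        raise RuntimeError("soffice not found on PATH")
    cmd = build_convert_command(src, out_dir, target)
    # check=True raises CalledProcessError on a non-zero exit code
    subprocess.run(cmd, check=True, timeout=timeout)
    return Path(out_dir) / f"{Path(src).stem}.{target}"


if __name__ == "__main__":
    # Only prints the command; call convert() where soffice is installed.
    print(" ".join(build_convert_command("example.doc", "/tmp/out")))
```

Calling `convert("example.doc", "/tmp/out")` after each package change and checking that the returned path exists would be one way to verify a candidate package list. A zero exit code alone may not guarantee that a file was produced.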