Repositories list
39 repositories
- Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.
base-images (Public)
.github (Public)
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
unstructured.pytesseract (Public)
wolfi-dev-os (Public)
pipeline-sec-filings (Public archive)
pipeline-template (Public)
pipeline-oer (Public)
pipeline-paddleocr (Public)
langchain (Public)
unstructured-api-tools (Public archive)