Phase 1: Technology proving ground
Our initial milestone is to prove that layout data can be extracted in the browser via some combination of pdf.js and tesseract.js.
- Load pdf.js with a Svelte app & render a test PDF
- Extract text & layout from PDF (with pdf.js or with tesseract.js)
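As a rough sketch of the "extract text & layout" step, the helper below turns pdf.js text items into simple positioned boxes. It assumes the items come from `page.getTextContent()` and that only the `str`, `transform`, `width`, `height`, and `fontName` fields are needed. The `TextItemLike` and `LayoutBox` types and the `toLayoutBoxes` name are hypothetical, and the vertical flip is an approximation that ignores font descent.

```typescript
// Subset of the fields pdf.js exposes on a text item (assumed shape).
interface TextItemLike {
  str: string;
  transform: number[]; // [a, b, c, d, e, f]; e and f are the x / baseline-y origin
  width: number;
  height: number;
  fontName: string;
}

// Hypothetical output type: one box per non-empty text run, top-left origin.
interface LayoutBox {
  text: string;
  x: number;
  y: number;
  width: number;
  height: number;
  fontName: string;
}

// Convert text items to boxes, flipping the PDF bottom-left origin
// to a top-left origin so the boxes match screen coordinates.
function toLayoutBoxes(items: TextItemLike[], pageHeight: number): LayoutBox[] {
  return items
    .filter((item) => item.str.trim().length > 0) // drop whitespace-only runs
    .map((item) => ({
      text: item.str,
      x: item.transform[4],
      y: pageHeight - item.transform[5] - item.height, // approximate top edge
      width: item.width,
      height: item.height,
      fontName: item.fontName,
    }));
}

// In the app, the inputs would come from pdf.js, roughly:
//   const pdf = await pdfjsLib.getDocument(url).promise;
//   const page = await pdf.getPage(1);
//   const viewport = page.getViewport({ scale: 1 });
//   const content = await page.getTextContent();
//   const boxes = toLayoutBoxes(content.items, viewport.height);
```

For scanned PDFs with no text layer, the same `LayoutBox` shape could in principle be filled from tesseract.js word bounding boxes instead, which would keep later stages independent of the extraction backend.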
Phase 2: Main Functionality
- Loading PDFs from the user (and perhaps by URL?)
- Layout analysis to find main content
- Font style identification?
- Site layout designs
- Decisions about output formats
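One possible starting point for the layout analysis item is a font-size heuristic: treat the text height that covers the most characters as body text, and keep only runs close to that height. This is only a sketch under that assumption, not a settled design. The `SizedText` type, the function names, and the 0.5-unit bucket and tolerance are all hypothetical choices.

```typescript
// Hypothetical minimal input: a text run and its rendered height.
interface SizedText {
  text: string;
  height: number;
}

// Find the text height that accounts for the most characters.
// Heights are bucketed to the nearest 0.5 so small rounding differences group together.
function dominantHeight(boxes: SizedText[]): number | undefined {
  const weight = new Map<number, number>();
  boxes.forEach((b) => {
    const bucket = Math.round(b.height * 2) / 2;
    weight.set(bucket, (weight.get(bucket) ?? 0) + b.text.length);
  });

  let best: number | undefined;
  let bestWeight = -1;
  weight.forEach((w, h) => {
    if (w > bestWeight) {
      best = h;
      bestWeight = w;
    }
  });
  return best;
}

// Keep only runs whose height is within `tolerance` of the dominant height.
// Headings, footers, and page numbers usually differ in size and are dropped.
function mainContent(boxes: SizedText[], tolerance = 0.5): SizedText[] {
  const body = dominantHeight(boxes);
  if (body === undefined) return [];
  return boxes.filter((b) => Math.abs(b.height - body) <= tolerance);
}
```

A size-only filter will misclassify captions or footnotes set at body size, so position on the page and font name would likely need to feed into the decision as well.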
Phase 3: App & Deployment
- Analytics configuration
- Stress testing & performance improvement
- Site Copy
- Ad hoc user testing & further refinements