Skip to content

Lightweight HTML Triplifier #484

Open
@jmkeil

Description

@jmkeil

The package io.github.sparql-anything.sparql-anything-html has a heavy storage footprint (>160MB) due to its dependency on com.microsoft.playwright.driver-bundle, which basically five times ships Node.js binaries (Windows, Linux, Linux ARM, Mac and Mac ARM). To my understanding, this is needed to run a headless browser that interprets JS in the triplified HTML.

I guess this is not needed in many use cases.

Therefore, I would like to ask you to consider providing an additional lightweight HTML Triplifier that just triplifies the static HTML document. This would result in significantly smaller binaries of upstream projects and would probably also improve the execution time.

Metadata

Metadata

Assignees

No one assigned

    Labels

    ImprovementDoing the same thing but better

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions