Skip to content

Collect HTML responses upstream of annotation #324

Closed
@josh-chamberlain

Description

@josh-chamberlain

Context

The tag collector compresses HTML responses into something we think will be useful. However, it's incomplete—we've decided to store the full HTML response and use that for training, perhaps still weighting based on header tags and whatnot. This gives us flexibility.

Requirements

  • At the same time as we run the tag collector, upstream of annotation, store the full HTML response. This will stay with the data as it moves through the pipeline.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions