Agreed with @peterbjorgensen that it would be a great idea to create an overview of what taggers might be relevant for cleaning.
Outline:
- Create a .md table with relevant taggers + a short description
- Check which filters were used in existing cleaning strategies and at least try to match them (see here)
- Potentially add an estimate of speed (e.g. time to process the Danish Gigaword Wikipedia section, ~55M tokens); see the sketch after this list
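
A minimal sketch of how the speed estimate could be produced, assuming the Wikipedia section is available locally as one document per line and that a tagger is just a callable taking a document string (the path, `WIKI_PATH`, and `dummy_tagger` are placeholders, not part of any existing setup):

```python
import time
from pathlib import Path

# Assumption: path to a local dump of the Danish Gigaword Wikipedia
# section, with one document per line.
WIKI_PATH = Path("danish_gigaword/wiki.txt")


def dummy_tagger(text: str) -> bool:
    """Placeholder tagger; swap in the tagger under test."""
    return len(text.split()) > 10


def benchmark(tagger, path: Path) -> None:
    n_docs = 0
    n_tokens = 0
    start = time.perf_counter()
    with path.open(encoding="utf-8") as f:
        for line in f:
            tagger(line)
            n_docs += 1
            n_tokens += len(line.split())  # rough whitespace token count
    elapsed = time.perf_counter() - start
    print(
        f"{n_docs} docs, ~{n_tokens / 1e6:.1f}M tokens in {elapsed:.1f}s "
        f"({n_tokens / elapsed:,.0f} tokens/s)"
    )


if __name__ == "__main__":
    benchmark(dummy_tagger, WIKI_PATH)
```

The tokens/s number from a run like this could go straight into the speed column of the .md table.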