Skip to content

Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

License

Notifications You must be signed in to change notification settings

crwlrsoft/robots-txt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

420aea3 · Jan 27, 2025

History

28 Commits
Nov 6, 2024
Oct 24, 2021
Nov 7, 2021
Jan 27, 2025
Jan 27, 2025
Oct 24, 2021
Oct 24, 2021
Oct 24, 2021
Nov 6, 2024
Jan 27, 2025
Sep 22, 2022
Jan 27, 2025
Sep 22, 2022
Nov 6, 2024
Nov 7, 2021
Oct 24, 2021

Repository files navigation

crwlr.software logo

Robots Exclusion Standard/Protocol Parser

for Web Crawling/Scraping

Use this library within crawler/scraper programs to parse robots.txt files and check if your crawler user-agent is allowed to load certain paths.

Documentation

You can find the documentation at crwlr.software.

Contributing

If you consider contributing something to this package, read the contribution guide (CONTRIBUTING.md).