    Repositories list

    • Storage-focused database written in Rust that aims for near-zero RAM usage and customizable indexes.
      Rust
      1000 Updated Jul 3, 2025
    • An implementation of RFC6265
      Rust
      27000 Updated Jun 4, 2025
    • Make websites accessible for AI agents
      Python
      7.5k000 Updated Nov 18, 2024
    • HTML
      4000 Updated Oct 17, 2024
    • hudsucker

      Public
      Intercepting HTTP/S proxy
      Rust
      45000 Updated Oct 10, 2024
    • DataHen API Documentation
      HTML
      0001 Updated May 28, 2024
    • DataHen Client for Ruby
      Ruby
      1200 Updated May 1, 2024
    • Proxy benchmark script
      Ruby
      0000 Updated Mar 5, 2024
    • QA library that runs on Fetch
      Ruby
      2000 Updated Jan 12, 2024
    • Rust
      15000 Updated Sep 14, 2023
    • henqa

      Public
      HenQA is a standalone tool for validating massive amounts of data against a JSON schema.
      Go
      0100 Updated Jun 15, 2023
    • DataHen Easy Core Toolkit
      Ruby
      1000 Updated Feb 14, 2023
    • A Rust library that generates a random combination of millions of user-agent strings.
      Rust
      0000 Updated Dec 16, 2022
    • HenQA shared components
      Go
      0000 Updated Sep 28, 2022
    • A Stream that links Reqwest and Actix-web.
      Rust
      1000 Updated Mar 28, 2022
    • till

      Public
      DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any scraper in 5 minutes.
      Go
      2281510 Updated Dec 5, 2021
    • useragent

      Public
      The DataHen useragent tool is a Go package and standalone tool that generates a random combination of millions of user-agent strings. Currently used in production at DataHen to crawl and scrape billions of pages.
      Go
      11000 Updated Jun 3, 2021
    • license

      Public
      The license package signs and verifies responses using a public and private key and a timestamp.
      Go
      0100 Updated Apr 21, 2021
    • ujson

      Public
      The ujson package marshals data like json, but without escaping HTML.
      Go
      0000 Updated Mar 1, 2021
    • gid

      Public
      gid is a Go package that generates globally unique IDs (GIDs) for web pages (HTTP requests). Useful for troubleshooting web scrapers and reusing web page caches.
      0000 Updated Mar 1, 2021
    • Minimal PgBouncer image that is easy to configure
      Shell
      267000 Updated Jan 23, 2021
    • Go
      0000 Updated Nov 23, 2020
    • A lean and fast 'fs' for the browser
      JavaScript
      58000 Updated Nov 9, 2020
    • Ruby
      0000 Updated Nov 2, 2020
    • Crawls a web site
      Ruby
      0001 Updated Oct 18, 2020
    • afero

      Public
      A FileSystem Abstraction System for Go
      Go
      534000 Updated Aug 24, 2020
    • Test Scraper
      Ruby
      0000 Updated Jul 2, 2020
    • Minimal environment variable parser for Go
      Go
      15000 Updated Apr 6, 2020
    • DataHen Python Library
      Python
      0100 Updated Mar 30, 2020
    • 0000 Updated Feb 18, 2020