Feature parity: Support for running Crawlee in a web server environment #1148

SeeYangZhi · 2025-04-10T08:01:27Z

I’m currently exploring the Python version of Crawlee and was wondering if there is support (or planned support) for running Crawlee in a web server environment — similar to what’s described in the JavaScript guide here: https://crawlee.dev/js/docs/guides/running-in-web-server

This functionality would be useful for exposing Crawlee crawlers via an API endpoint (e.g., using FastAPI or Flask), enabling dynamic start/stop of crawls and job monitoring through HTTP.

A few questions:

Is such functionality currently supported in crawlee-python?
If not, are there any plans to support this use case?
Any recommended workarounds or best practices in the meantime?

janbuchar · 2025-04-10T08:08:36Z

Hi @SeeYangZhi and thank you for your interest in Crawlee! Thios is already possible, but not exactly straightforward - the approach depends on your exact use case.

Could you describe what you have in mind? Perhaps we can compile this info into a guide later...

SeeYangZhi · 2025-04-10T08:24:19Z

Hi @janbuchar, thanks for the quick response!

My use case involves exposing a Crawlee crawler as a web API, where users can send a POST request with a URL, and receive the extracted data (e.g. page title, presence of specific elements, etc.) as a JSON response.

Essentially, I’m trying to replicate the functionality described in the JS guide for running Crawlee in a web server, but using the Python version of Crawlee with FastAPI.

janbuchar · 2025-04-10T09:12:52Z

Right, so one API call leads to the extraction of a single page. We'll see if we can add a guide for that 🙂

SeeYangZhi · 2025-04-10T13:31:34Z

Right, so one API call leads to the extraction of a single page. We'll see if we can add a guide for that 🙂

Looking forward for the guide!

github-actions bot added the t-tooling Issues with this label are in the ownership of the tooling team. label Apr 10, 2025

Pijukatel self-assigned this Apr 25, 2025

Pijukatel linked a pull request Apr 25, 2025 that will close this issue

docs: Add guide for running crawler in web server #1174

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature parity: Support for running Crawlee in a web server environment #1148

Feature parity: Support for running Crawlee in a web server environment #1148

SeeYangZhi commented Apr 10, 2025

janbuchar commented Apr 10, 2025

SeeYangZhi commented Apr 10, 2025

janbuchar commented Apr 10, 2025

SeeYangZhi commented Apr 10, 2025

Feature parity: Support for running Crawlee in a web server environment #1148

Feature parity: Support for running Crawlee in a web server environment #1148

Comments

SeeYangZhi commented Apr 10, 2025

janbuchar commented Apr 10, 2025

SeeYangZhi commented Apr 10, 2025

janbuchar commented Apr 10, 2025

SeeYangZhi commented Apr 10, 2025