Skip to content
This repository was archived by the owner on Jun 10, 2024. It is now read-only.
This repository was archived by the owner on Jun 10, 2024. It is now read-only.

is there alternative project like this project? #976

@socialpercon

Description

@socialpercon
  • pyspider version:
  • Operating system:
  • Start up command:

Expected behavior

Actual behavior

How to reproduce

is there alternative project like this project?

i don't understand why this project no longer maintainece. i think alternative project more powerful... but i don't know...

Activity

Chaffy-0

Chaffy-0 commented on Dec 6, 2021

@Chaffy-0

Scrapy

JermellB

JermellB commented on Dec 19, 2021

@JermellB

This project isn't maintained any more because their javascript rendering capability is done by phantomjs which is no longer maintained.

Like @Chaffy-0 said, Scrapy is likely the best option if you wanted to do a spider like this.

These days, elasticsearch comes paired with one if you were doing something simple and didn't need to collect and process your own data from the wild.

Most places I've done stuff @ will use things like selenium + chrome or firefox, paired with beautiful soup for the rendered html parsing. Then you could keep track of where you'd spider with simple things like a bloom filter implemented on top of redis or something.

But yeah, Scrapy if you don't feel like getting too dirty.

milahu

milahu commented on Apr 18, 2022

@milahu
roniemartinez

roniemartinez commented on Jun 2, 2022

@roniemartinez

Just in case people will be interested in my project 🙇 : https://github.com/roniemartinez/dude

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

      Development

      No branches or pull requests

        Participants

        @roniemartinez@socialpercon@JermellB@milahu@Chaffy-0

        Issue actions

          is there alternative project like this project? · Issue #976 · binux/pyspider