This repository was archived by the owner on Jun 10, 2024. It is now read-only.

<base href='relative_path'> is not interpreted properly. #978

Open
@agarwal-nitesh

Description

  • pyspider version: latest (master branch, built with Docker)
  • Operating system: macOS Catalina
  • Start up command: docker-compose -f docker-compose.yaml build && docker-compose up

Expected behavior

Either the response should honor <base> tags when resolving links, or setting response.url and response.orig_url should cause relative hrefs to be resolved into correct absolute URLs.
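To make the expected resolution concrete, here is a minimal sketch using only the Python standard library. It is not pyspider code: the names BaseAwareLinkParser and resolve_links, and the example.com URLs, are hypothetical and exist only for illustration. It assumes the usual HTML rule that the first <base href> in the document, itself resolved against the page URL, becomes the base for every relative link.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class BaseAwareLinkParser(HTMLParser):
    """Collects <a href> values and the first <base href> in a document."""

    def __init__(self):
        super().__init__()
        self.base_href = None  # href of the first <base> tag, if any
        self.hrefs = []        # raw href values of <a> tags, in document order

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "base" and self.base_href is None and attrs.get("href"):
            self.base_href = attrs["href"]
        elif tag == "a" and attrs.get("href"):
            self.hrefs.append(attrs["href"])


def resolve_links(page_url, html):
    """Return absolute URLs for all <a href> links, honoring <base href>."""
    parser = BaseAwareLinkParser()
    parser.feed(html)
    # The base href may itself be relative, so resolve it against the page URL.
    if parser.base_href:
        base_url = urljoin(page_url, parser.base_href)
    else:
        base_url = page_url
    return [urljoin(base_url, href) for href in parser.hrefs]


if __name__ == "__main__":
    # Hypothetical page used only to show the difference.
    page = "http://example.com/a/b/page.html"
    doc = (
        '<html><head><base href="/static/"></head>'
        '<body><a href="item.html">item</a></body></html>'
    )
    print(resolve_links(page, doc))
    # Resolving against the page URL alone would instead give
    # http://example.com/a/b/item.html
    print(urljoin(page, "item.html"))
```

With a base of /static/, the relative link item.html resolves under /static/ rather than under the page's own directory, which is the behavior this issue asks for.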

Actual behavior

Relative hrefs are not resolved into correct absolute URLs.

How to reproduce

Crawl a website whose pages contain a <base href="...."> tag.
