url_parse doesn't work with URL containing non-ASCII characters #442

Open
@MarekProkop

xml2::url_parse("https://www.spa.cz/spacz/images/procedures/Slatinná%20koupel.jpg") returns a port of -541335376 and leaves all other URL components blank. The cause is the non-ASCII character á in the path. Without it (or if it is percent-encoded), the result is correct.

It may be a feature, but I think it is more likely a bug. Such URLs are quite common in some languages, and all major web clients (browsers, search engines, etc.) handle them fine.
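
Since the parse succeeds once the á is percent-encoded, a possible workaround is to encode the non-ASCII characters before calling url_parse. The sketch below is only an illustration: `encode_non_ascii` is a hypothetical helper (not part of xml2), and it assumes the URL should be encoded as UTF-8. Existing escapes such as %20 are left untouched.

```r
# Hypothetical helper (not part of xml2): percent-encode every non-ASCII
# character as its UTF-8 bytes, leaving ASCII (including "%20") unchanged.
encode_non_ascii <- function(url) {
  chars <- strsplit(enc2utf8(url), "")[[1]]
  encoded <- vapply(chars, function(ch) {
    bytes <- charToRaw(ch)
    if (all(as.integer(bytes) < 128L)) {
      ch  # plain ASCII: keep as-is
    } else {
      # e.g. "á" (bytes c3 a1) becomes "%C3%A1"
      paste0("%", toupper(as.character(bytes)), collapse = "")
    }
  }, character(1), USE.NAMES = FALSE)
  paste(encoded, collapse = "")
}

# The URL from this report; "\u00e1" is the character á.
url <- "https://www.spa.cz/spacz/images/procedures/Slatinn\u00e1%20koupel.jpg"
safe_url <- encode_non_ascii(url)

# Parse the encoded form only if xml2 is installed.
if (requireNamespace("xml2", quietly = TRUE)) {
  print(xml2::url_parse(safe_url))
}
```

This only sidesteps the problem on the caller's side. It does not change how url_parse itself treats unencoded input.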

Labels: bug (an unexpected problem or unintended behavior)
