irl
is a URL parser library that has a mixture of very strict rules around
a URL's host and port section but very relaxed in all other sections making it
the ideal balance of secure, tolerant, and performant.
It's primary use-case is for being used in HTTP client libraries as they have to deal with the Internet and lots URLs that are definitely not compliant but users expect them to work anyways!
Supports Python 3.6 or later.
python -m pip install irl
>>> import irl
>>> url = irl.URL.parse("https://user:[email protected]:1234/path?q=ue&r=&y#frag")
>>> print(repr(url))
URL(scheme="https", userinfo="user:pass", host="example.com", port=1234, path="/path", query="q=ue&r=&y", fragment="frag")
>>> url.target()
b"/path?qu=e&r=&y"
>>> url.host_header()
b"example.com:1234"
>>> url.query_to_items()
[("q", "ue"), ("r", ""), ("y", None)
>>> url.address()
("example.com", 1234)
This URL parser library wouldn't be possible without the rfc3986 library or the test suite from urllib3. This parser is based heavily on techniques used in both libraries and they directly inspired this libraries creation.
MIT