26 Aug 06:26

github-actions

8ecf09c

2025.08.25 Latest

Latest

Breaking or important changes

Brings in full lxml 6.0.x support. Additional exported constants were already present in earlier types-lxml release, here are the remaining features:
- xmlfile.write() supports writing CDATA object directly
- (#94, thanks to @udifuchs) Element() and ElementTree() used to be factory functions to generate _Element and _ElementTree correspondingly, but now become virtual superclasses themselves
No more tested against lxml 4.9.x. Doesn't mean it will break immediately, but will not have any guarantee that types-lxml completely matches 4.9.x API over time.

Features

Also test against lxml 5.4 and newest 5.3.x
(#92, thanks to @macro1) Apply mypy.stubtest check to help guarantee stub implementation doesn't deviate too much from runtime signatures and types, except intentional ones. Helps finding many of the bug fixes below.
Compatible with mypy 1.16+ and pyright 1.1.399+
(#86) Revive custom target parser support (stub-only ParserTarget as target object, and CustomTargetParser as stub-only variant of XMLParser)
- Functions involved: fromstring(), parse(), _ElementTree.parse(), ElementTree(), fromstringlist(), HTML(), XML()
- Params of all target object methods are positional
- attribute is a dict in target object .start() method
- Leave the capability of creating custom target parser to only XMLParser and HTMLParser, and drop target= param from all parser subclasses (such as lxml.html ones)
- C14NWriterTarget inherits from ParserTarget

Bug Fixes

Sync or add __all__ in various submodules

Fixes for `lxml.etree`

(#85, thanks to @BrandonStudio) cleanup_namespaces() shouldn't warn without keep_ns_prefixes arg
Allow specifying default value of output_parent arg for XSLTExtension.apply_template() and .process_children()
Mark _Attrib as final
Add missing XMLSyntaxAssertionError.__init__()
set_default_parser() arg missing default value
strip_elements() with_tail arg should be keyword-only
Use original param name in tag cleanup functions
Strip unnecessary arguments in XSLTExtension overloads
Give users a rough idea about XSLTExtension method arguments, such as using _Element to approximately represent _ReadOnlyElementProxy. Avoids creating even more stub-only classes and requiring user to poke into them

Fixes for `lxml.html`

FormElement._name is a method, not property

Fixes for `lxml.isoschematron`

some Schematron variables are Literal constants

Fixes for `lxml.objectify`

enable_recursive_str() arg missing default value
parse() file parameter name was wrong

Minor changes

Trim down canonicalize(), etree.tostring() and Extension() overloads to avoid confusion
Implement objectify.NumberElement after all, in rare case where somebody wants to implement new type of number related to DataElement
Move NumberElement._setValueParser() to subclasses
(#71) Remove last traces of _AnyStr
Reorder _ElementTree.write() overloads, with the most generic overload presented first for UX
Fix XMLParser and HTMLParser API doc links
Better docstring and warning for C14NWriterTarget
Drop unused _HtmlElemParser alias

Contributors

macro1, udifuchs, and BrandonStudio

Assets 6

29 Mar 23:11

github-actions

2025.03.30

defc7e5

2025.03.30

Features

(#82) Add buffer type support for upcoming lxml 6.0.
HtmlElement.text_content() result will become plain str since lxml 6.0. This change shouldn't break much compatibility for users of previous lxml versions.
Warn user about str input and guess_charset combo bug in html.html5parser functions
Warn user about incorrect usage of specifying single element as .extend() argument
lxml 6.0 exports LIBXML_COMPILED_FEATURES constant

Bug fixes

(#84) Tag selector supports iterator but not bytearray
A few combinations of QName construction argument were actually disallowed; second argument can't be QName or _Element if first argument is non-empty
Multiple issues for Resolver class
- Don't annotate opaque internal context object
- Drop _ResolverRegistry.resolve() which can't possibly appear in user land code
- Missing default value for Resolver.resolve_file() keyword arguments
- Resolver.resolve() arguments can be None
Drop unused keyword arguments from iterparse() html mode overload
namespaces arg of .xpath() method accepts tuple form. Change for XPath classes already done earlier.
Confine the type of public element (subclass of ElementBase) class attributes
_Element.findtext() didn't allow default argument in certain overload form
RelaxNG.from_rnc_string() base_url argument accepts bytes
html.html5parser guess_charset bug revisited
- parse() is not affected as it always open files/URL in binary mode
- For other functions, even guess_charset=False triggers the bug
Some html5parser.HTMLParser initialisation arguments should be keyword only
Corrected import of typing.Never in html module and html.html5parser submodule
.extend() and __setitem__() of _Element and HtmlElement support iterator as value
_Element.index() had wrong parameter name
Continued verification of properties and arguments supporting bytearray:
- _Element .text and .tail properties
- Content-only elements
- XPath input expression
- _IDDict mixin arguments
- xmlfile.write*() methods and encoding argument

Minor changes

Drop _ElemClsLookupArg alias, which is almost unused
Rename _StrictNSMap to more aptly named _StrOnlyNSMap
Don't include superclass attributes in ParseError definition
Continue getting rid of _AnyStr in most places
Mark constants as Final

Tests related

Migrate following tests to property based runtime testing:
- All basic validators: DTD, RelaxNG, ISO Schematron (XMLSchema done in earlier release)
- All existing _Element method / property tests and content-only elements
- html.html5parser submodule
- XMLID() and friends
- QName
For all negative tests on properties or arguments bombarded with random objects, also add iterables of correct objects to the list, to make sure iterables of correct argument or value would become incorrect arguments.

Documentation

Fill in docstring for all _Element properties and methods

Assets 6

04 Mar 02:25

github-actions

2025.03.04

2c73e69

2025.03.04

Features and breaking changes

Depends on beautifulsoup4 itself because version 4.13 has bundled inline annotation. Dropping types-beautifulsoup4 dependency as result.
Multi subclass patch includes change in CSSSelector result
Implement ErrorTypes constants as enum

Bug fixes

Additional type: ignores that improve compatibility with older versions of mypy and pyright
For soupparser submodule input arguments, copy definition from beautifulsoup4 code directly
html.fragment_fromstring create_parent argument can be string (#83, thanks to @sciyoshi)
XPath namespaces argument can accept namespace tuples
Fixes compatibility with mypy 1.14+
bytes not allowed as html.diff.htmldiff() argument
Parser encoding arguments do support bytearray
_ListErrorLog.filter_from_level() supports real numbers

Minor changes and tests

Migrate beautifulsoup and ErrorLog tests to property based
Migrate cssselect and XMLSchema tests to runtime ones
Add mocked HTTP response to file input fixture; introduces urllib3 and pook as test dependency

Contributors

sciyoshi

Assets 6

24 Feb 06:34

github-actions

2025.02.24

7ceca43

2025.02.24

Features and breaking changes

Add basedpyright type checker support
Incorporate changes from lxml 5.3.1 and (pending) 6.0
- More html.builder shorthands
- libxml feature constants
- etree.DTD(external_id=...) support str now
- Deprecate some Memdebug methods

Bug fixes

html.submit_form() always return HTTPResponse for default handler
Instance attributes are converted to properties because they are not deletable:
- html.SelectElement.multiple
- html.InputElement.type
More function arguments supports bytearray:
- register_namespace()
- inclusive_ns_prefixes parameter of etree.tostring()

Minor changes

Add docstring for some etree module function overloads
Drop _AnyStr from etree module level functions

Assets 6

13 Dec 12:59

github-actions

2024.12.13

525e6f8

2024.12.13

Breaking changes and features

bytearray accepted as tag names, attribute names and attribute values
- Related change: create _TextArg type alias to slowly replace existing _AnyStr (#71)
Warn IDE users via warnings.deprecated about exception upon certain argument combinations in HTML link functions

Bug fixes

Property deleter missing for HTML elements (#73)
etree.strip_attributes() support bytes and QName as input
Completion of #64 for remaining known cases
Corrected link replacement function return type in html.rewrite_links()
etree.canonicalize() shouldn't accept bytes as input

Tests related

Use hypothesis for extensive tests on function arguments, currently used in _Attrib and HTML link function tests (#75)
reveal_type() injector has been split into its own project and pulled via dependency

Internal changes

Folder structure changes for the whole repository (#70)
Remove _HANDLE_FAILURES type alias and show values directly to users
Rename type-only protocol SupportsLaxedItems to SupportsLaxItems

Full Changelog: 2024.11.08...2024.12.13

Assets 6

08 Nov 10:48

github-actions

2024.11.08

99f984d

2024.11.08

Breaking and important changes

pyright users (and IDE that can make use of pyright) will see warning if a single string is supplied where collection of string is expected (tuple, set, list etc). In terms of typing, a single str itself is valid as a Sequence, so type checkers normally would not raise alarm when using str in such function parameters, but can induce unexpected runtime behavior. (#64)
- _ElementTree.write(), etree.fromstringlist(), etree.tostring(), html.soupparser.fromstring(), html.soupparser.parse()
It is possible to verify release files indeed come from GitHub and not maliciously altered. See Release file attestation for detail.
Runtime tests support comparing with mypy results, therefore officially making static stub tests obsolete

Bug fixes

Element tag names, attribute names and attribute values support bytearray. This is discovered via hypothesis testing, which is intended to be utilized in next release
Compatibility with pyright ⩾ 1.1.378, which imposes additional overload warning for etree.iterparse()
Use relative import in lxml.ElementInclude, otherwise mypy triggers --install-type behavior.
ObjectifiedElement __getitem()__ and __setitem()__ should accept str as key, which behaves mostly like __getattr__() and __setattr__(). That means, elem["foo"] is equivalent to elem.foo for non-repeating subelements.

fixes for etree submodule

_Element.tag property is not just a str. It is str after initial document or string parsing, but can be set manually to any type supported by tag name and returns the same object.
When QName is initialized with first argument set to None, _Element can be used as second argument (which is promoted to first argument in implementation)
Relax single argument usage in _Element.iter*() method family, doesn't need tag= keyword when argument is None
FunctionNamespace() should generate an _XPathFunctionNamespaceRegistry object, not its superclass
For decorator usage of _XPathFunctionNamespaceRegistry and _ClassNamespaceRegistry, decorator signature included an extraneous argument, though it doesn't affect any existing correct usage.
indent() first parameter has wrong name

fixes for html submodule

soupparser.parse() should accept pathlib.Path object as input
.value property of SelectElement can't be set to bytes
.action property of FormElement can have a value of None, and can be set to None. They have different meanings though.

Small and internal changes

Declare python 3.13 support and perform CI tests.
Separation of pyright and mypy ignore comments: in previous releases # type: ignore[code] was enabled in pyright settings. Now it only uses # pyright: ignore[code] so mypy comment won't affect pyright behavior.
Add ._name property to html.FormElement for form name
Eliminate typing.TypeAlias usage (declared obsolete, and we can do without it)

Test related changes

Stub tests migration to runtime:
- Most of remaining etree._Element methods, now only .makeelement() and .xpath() left in stub test
Runtime test additions:
- ElementNamespaceClassLookup()
tox config migrated to pyproject.toml, thus requiring tox ⩾ 4.22
Runtime tests are now executed within test-rt folder due to python/mypy#8400
Some tests need to be performed conditionally when multi-subclass patch is applied
Some tests or syntaxes need to be turned off to cope with mypy deficiencies
Usage of Rust-based uv as well as related tox plugin to speed up test environment recreation
Don't force users installing tox-gh-actions when checkout out repository, it is only useful for GitHub workflows

Docstring additions

etree submodule: parse(), fromstringlist(), tostring(), indent(), iselement(), adopt_external_document(), DocInfo properties, QName, CData, some exception classes
html.soupparser submodule: fromstring(), parse(), convert_tree()

Assets 6

16 Sep 07:07

github-actions

2024.09.16

470f1bf

2024.09.16

Bug fix and small changes

Namespace argument in Elementpath methods should allow None (#60 thanks to @cukiernick)

Internal changes

Perform runtime tests against lxml 5.3

Contributors

cukiernick

Assets 6

07 Aug 08:04

github-actions

2024.08.07

9187118

2024.08.07

Breaking changes

Multiple builds available, with the alternative build enhancing multiple XML subclassing scenario. See relevant README section for detail. Thanks to @scanny for the driving force behind #51.
Mypy 1.11 required, which introduced backward incompatible @typing.overload changes.
lxml.html.clean stub depreated, lxml 5.2.0 completely removes the submodule due to multiple security issues. Corresponding code and type definitions are split into a new independent repo.

Features

(#56) Replace typing.TypeGuard with typing.TypeIs
Use callback protocol for more precise element and ElementMaker factory function typing
lxml.etree.ICONV_COMPILED_VERSION exported since 5.2.2
Special handling for ObjectifiedElement and HTMLElement in lxml.cssselect.CSSSelector and various cssselect() methods
html.builder shorthands return more precise element type for certain HTML elements. For example, html.builder.LABEL(), corresponding to <LABEL> tag, yields LabelElement.
More precise etree.Extension() annotation depending on supplied namespace
Stricter namespace argument type in _Element ElementPath methods
For lxml.builder.ElementMaker class:
- Provide better hint in __call__() argument
- Accepts namespace tuple in nsmap argument
- Export private properties
For lxml.sax module:
- Export private properties in various classes
- Explicitly list all inherited methods in ElementTreeContentHandler class, as method arguments names are different from superclass ones
Alert etree.HTMLParser users to remove deprecated strip_cdata argument

Bug fix and small changes

Some _Element related input arguments fixed to use typing.Sequence instead of Interable, as _Element is already an Iterable itself. Supplying _Element where a proper Iterable is expected would cause problem.
Similar situation arises for str or byte in tag selector argument; use typing.Collection to alert user more clearly.
None can't be used as etree.strip_*() argument
Some etree.DocInfo read-only properties can't be None
Fix etree.Resolver method return types
Avoid exception raising arg combinations in html.html5parser.HTMLParser

Internal changes

The usual static stub to runtime test migration:
- Part of basic _Element tests and its find*() methods
- More extensive _Attrib tests
Use ruff to replace black and isort as code formatter
Migrate stub tests to support pytest-mypy-plugins ⩾ 2.0
Use pdm-backend as build backend due to its more versatile versioning support

Contributors

scanny

Assets 6

14 Apr 04:45

github-actions

2024.04.14

8335d33

2024.04.14

Breaking changes

Mypy 1.9 is required, dropping 1.5 support. 1.6 - 1.8 was never supported.
lxml.ElementInclude completely reworked

Features

PEP 696 support, simplifying usage of some subscripted types (#42)
- As a convenient side effect, lxml.html parser constructor signatures can be removed
All annotations do provide default values in their signatures now instead of ...

Bug fix and small changes

Type of _Comment.text property (and those of similar elements) is always str (#46, thanks to @eemeli)
Tag selector argument in element iterator methods should support keyword with a single tag (#45, thanks to @eemeli)
html.fragments_fromstring() should receive same fix as html.html5parser.fragments_fromstring() do (#43, thanks to @Wuestengecko)
@overload for etree.SubElement() on handling of HtmlElement and ObjectifiedElement
Some exported constants were missing from lxml.ElementInclude stub
html.soupparser module functions return type depends on makeelement argument
Keyword arguments in html.soupparser module functions are explicitly listed now (instead of generic **kwargs before)
The 2 arguments in html.diff.html_annotate() should align their annotation types
html.submit_form() return type depends on the result of open_http function argument
Add missing exported variable for lxml.isoschematron
Uppercase variants of output method arguments ("HTML", "TEXT", "XML") were dropped

Internal changes

Usual runtime test additions: lxml.html.soupparser, lxml.ElementInclude, various exported constants
Runtime tests also do test against lxml 5.2

Contributors

eemeli and Wuestengecko

Assets 4

27 Mar 17:13

github-actions

2024.03.27

138fcaf

2024.03.27

Breaking change

Requires cssselect ⩾ 1.2 for annotation in lxml.cssselect, since cssselect is now inline annotated.

Bug fix and small changes

Compatibility with pyright ⩾ 1.1.353
In etree.clean_* functions, first argument (the Element or ElementTree to be processed) must be strictly positional
etree._LogEntry.filename property is never empty, as it uses the value <string> as fallback
etree._BaseErrorLog.receive() argument name was wrong
Self brewed SupportsReadClose protocol dropped, replacing with more standardized SupportsRead
html.html5parser.parse() should support data stream as input
html.html5parser.fragments_fromstring() return type is dependent on no_leading_text argument
encoding arguments in various methods / functions used to only support ASCII and UTF-8 as byte encodings, now the restriction is lifted
Place some typing usage under python version check (if sys.version_info >= (3, x))
etree.PyErrorLog constructor shouldn't accept 2 logger arguments simultaneously
etree.PyErrorLog.level_map property reverted to vanilla type (int) instead of our fake enum

Internal changes

Some runtime tests are lxml version dependent (#34, thanks to @fabaff)
Adds stub check for _Element, _Comment and _ElementTree (#33, thanks to @udifuchs)
Following stub tests migrated to runtime: _Attrib, _ErrorLog and friends, html5lib

Contributors

fabaff and udifuchs

Assets 4

Releases: abelcheung/types-lxml

2025.08.25

Breaking or important changes

Features

Bug Fixes

Fixes for lxml.etree

Fixes for lxml.html

Fixes for lxml.isoschematron

Fixes for lxml.objectify

Minor changes

Contributors

Uh oh!

2025.03.30

Features

Bug fixes

Minor changes

Tests related

Documentation

Uh oh!

2025.03.04

Features and breaking changes

Bug fixes

Minor changes and tests

Contributors

Uh oh!

2025.02.24

Features and breaking changes

Bug fixes

Minor changes

Uh oh!

2024.12.13

Breaking changes and features

Bug fixes

Tests related

Internal changes

Uh oh!

2024.11.08

Breaking and important changes

Bug fixes

fixes for etree submodule

fixes for html submodule

Small and internal changes

Test related changes

Docstring additions

Uh oh!

2024.09.16

Bug fix and small changes

Internal changes

Contributors

Uh oh!

2024.08.07

Breaking changes

Features

Bug fix and small changes

Internal changes

Contributors

Uh oh!

2024.04.14

Breaking changes

Features

Bug fix and small changes

Internal changes

Contributors

Uh oh!

2024.03.27

Breaking change

Bug fix and small changes

Internal changes

Contributors

Uh oh!

Fixes for `lxml.etree`

Fixes for `lxml.html`

Fixes for `lxml.isoschematron`

Fixes for `lxml.objectify`