Skip to content

Feature/etree xml parser#38

Draft
lukavdplas wants to merge 6 commits intodevelopfrom
feature/etree-xml-parser
Draft

Feature/etree xml parser#38
lukavdplas wants to merge 6 commits intodevelopfrom
feature/etree-xml-parser

Conversation

@lukavdplas
Copy link
Contributor

Defines an alternative XML reader based on lxml (without Beautiful soup). The class is very simple; extractors are defined using XPath. The unit tests include some examples.

This reader is generally much faster than the XMLReader, but offers fewer features. (The speed gain is mostly in the initial parsing of the XML file.)

This was built as a proof-of-concept, but we're not planning to merge it currently. We don't want to replace the XMLReader with this one (due to the lack of features), or have two readers side-by-side.

The tests currently fail for Python 3.9, due to a compatibility issue in the test itself; the reader should work fine in Python 3.9.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant