-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Datastore Dump XML Element Names #180
Conversation
- Added regex substitutions for XML element names.
- Added change log file.
Co-authored-by: Ian Ward <[email protected]>
- Added in accented characters to allow list. - Used space special char. - Added uniqueness to the element names after regex substitutions.
- Added rule to start with underscore or letter.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. unicodedata.normalize('NFC', …)
would future-proof this code but is also something we could enforce when creating the columns in the datastore.
Good for upstream too.
@wardi would the |
Yes, just before replacing word characters so the regex works regardless of the source normalization |
- Normalize string.
Fix/ds writer xml element names