Lambda Soup 0.6
Switched to Markup.ml for the underlying parser. As a result:
- There is now a high degree of conformance to the HTML5 specification, including in error recovery.
- Now parsing HTML entities.
- Character encodings are detected automatically.
- All strings are transcoded to UTF-8.
- Empty attributes are handled correctly.
- Parsing is now much slower. This depends on Markup.ml being optimized.
Added functions signals
and from_signals
to convert between Lambda Soup node trees and Markup.ml signal streams.