Open
Description
Hello,
In version https://github.com/html5lib/html5lib-python/releases/tag/0.999999999 , html5lib.tokenizer
was made private
The wpull
project (https://github.com/ArchiveTeam/wpull ) uses this library, and if we were to ever migrate to using the 1.X versions, it would negatively impact the application, because instead of just tokenizing a webpage (see https://github.com/ArchiveTeam/wpull/blob/a4ff4a93f613ce18ad3c515aa3d4f5848a88b98c/wpull/document/htmlparse/html5lib_.py ), we would have to use the full tree parsing which is slower and uses more ram
is there any reason this was made private when the 1.x branch was released?
Metadata
Metadata
Assignees
Labels
No labels