Open
Labels
enhancement (New feature or request)
Description
It would be great if the endianness of the input buffer could be selected for each match() call.
In our use case, strings arrive in both little-endian and big-endian encodings, and we must support both.
Normalizing the encoding to UTF-8 before matching costs a lot of runtime.
Having this built into PCRE2 would be a blessing.
I am aware that the docs say:
UTF-16 and UTF-32 strings can indicate their endianness by special code knows as a byte-order mark (BOM).
The PCRE2 functions do not handle this, expecting strings to be in host byte order.
Would this be a possible extension, or is it unrealistic because it would be too complicated to implement?
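
For context, the kind of per-buffer conversion this request would make unnecessary can be sketched roughly as follows. This is only an illustration under assumptions: `utf16_order` and `utf16_to_host` are hypothetical names, not part of PCRE2, and the sketch handles UTF-16 only. It copies raw bytes of a known byte order into a buffer of host-order 16-bit code units.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper type, not part of PCRE2: the byte order of the raw input. */
typedef enum { UTF16_LE, UTF16_BE } utf16_order;

/*
 * Hypothetical helper, not part of PCRE2.
 * Reads `units` UTF-16 code units from the raw bytes in `src`, which are stored
 * in the byte order given by `order`, and writes them to `dst` as host-order
 * 16-bit values. Each value is assembled arithmetically from its two bytes, so
 * the result does not depend on the endianness of the host.
 * No BOM detection and no UTF-16 validation is done here.
 */
static void utf16_to_host(const unsigned char *src, size_t units,
                          utf16_order order, uint16_t *dst)
{
    assert(src != NULL && dst != NULL);
    for (size_t i = 0; i < units; i++) {
        /* Pick the low and high byte of this code unit according to `order`. */
        unsigned lo = src[2 * i + (order == UTF16_LE ? 0 : 1)];
        unsigned hi = src[2 * i + (order == UTF16_LE ? 1 : 0)];
        dst[i] = (uint16_t)((hi << 8) | lo);
    }
}
```

The resulting `dst` buffer is in host byte order, which is what the 16-bit PCRE2 library expects as its subject string. The cost is one extra pass and one extra copy per subject, which is cheaper than a full conversion to UTF-8 but still overhead that native support would avoid.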
notxvilka