| [SE-1750] ADH - Remove invalid bytes from UTF-8 encoded XML input | |
|---|---|
| Product Line: | P&C |
| Component/s: | Other |
| Fix Version/s: | SICS 4.9.4 |
Aim of function
Enable importing an XML file which is supposed to be UTF-8 encoded, while it actually contains not-UTF-8 byte sequences.
A new ADH processing option / system parameter, 'Handling of invalid byte sequences when importing UTF-8 encoded XML input files' will be introduced.
This system parameter will determine what action to take should an invalid multi-byte sequence be detected (possibly because the document is actually Windows-1252 encoded). Fail=fail the import. Remove=remove the first byte of the sequence. Replace=replace the first byte with the given replacement character. Convert=import the document assuming Windows-1252 encoding.
System Parameters Affected
"ADH Document Handling - Processing Options - Handling of invalid UTF-8 multi-byte sequences in XML documents - Action"
Existing functionality affected
ADH XML file import, from SICS Workstation as well as SICS ADH Server.