[DFDL-WG] Approaches for handling illegal XML 1.0 characters in DFDL Infoset for XML interchange
Mike Beckerle
mbeckerle.dfdl at gmail.com
Wed Oct 31 16:37:48 EDT 2012
Thought I'd mention that the preferred approach to this in other software
systems seems to be to map the illegal characters to and from the Unicode
Private Use Area.
For example, the illegal XML character 11 (0xB) becomes code point 0xE00B.
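A minimal sketch of that mapping in Python, to make the idea concrete. It assumes a fixed offset of 0xE000, which is inferred only from the single 0xB example, and it covers only the C0 control characters that XML 1.0 disallows (everything below 0x20 except tab, line feed, and carriage return). XML 1.0 also disallows surrogates and U+FFFE/U+FFFF, which this sketch does not handle. The names to_xml_safe, from_xml_safe, and PUA_BASE are hypothetical, not taken from DFDL or any other product.

```python
# Illegal C0 control characters in XML 1.0: everything below 0x20
# except tab (0x9), line feed (0xA), and carriage return (0xD).
ILLEGAL_C0 = frozenset(range(0x20)) - {0x9, 0xA, 0xD}

# Assumed offset, inferred from the single example 0xB -> 0xE00B.
PUA_BASE = 0xE000

# Translation tables keyed by code point, as str.translate expects.
_TO_PUA = {cp: PUA_BASE + cp for cp in ILLEGAL_C0}
_FROM_PUA = {pua: cp for cp, pua in _TO_PUA.items()}


def to_xml_safe(text: str) -> str:
    """Replace illegal C0 controls with Private Use Area code points."""
    return text.translate(_TO_PUA)


def from_xml_safe(text: str) -> str:
    """Reverse the mapping when reading the XML back."""
    return text.translate(_FROM_PUA)
```

One limitation of this sketch: if the original data already contains code points in the range 0xE000 to 0xE01F, the reverse mapping cannot tell them apart from remapped control characters, so the round trip is not lossless for such data.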
Apparently this approach is used by some pieces of commercial software,
notably Microsoft Visio.
http://msdn.microsoft.com/en-us/library/office/aa218415%28v=office.10%29.aspx
...mikeb
--
Mike Beckerle | OGF DFDL WG Co-Chair
Tel: 781-330-0412