[DFDL-WG] clarification on when escape characters are needed

Mike Beckerle mbeckerle.dfdl at gmail.com
Tue Jun 18 22:42:05 EDT 2013


Suppose I have a sequence. It has an infix separator which is "=".

<sequence dfdl:separator="=" dfdl:separatorPosition="infix">
   <element name="a" type="xs:string"/>
   <element name="b" type="xs:string"/>
</sequence>

Now, consider this data:

password=f82+=7&%q

I want

<a>password</a>
<b>f82+=7&%q</b>

Notice how the b element contains an '=' which was not escaped in any way
in the sequence. Element b is statically known to be last, the separator is
infix; hence, things are unambiguous even if there is no escaping.

However, there is an alternative interpretation, which is that the above
data should fail, because it produces <a>password</a><b>f82+</b> but then
does not find the expected stuff next. Rather it finds the '=7&%q' data. In
other words, the sequence separator divides the sequence content into 3
content regions, but there aren't 3 things to consume those, so it is a
processing error.

Which is correct?

-- 
Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology |
www.tresys.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ogf.org/pipermail/dfdl-wg/attachments/20130618/d8f694d1/attachment.html>


More information about the dfdl-wg mailing list