[DFDL-WG] lengthUnits bits not allowed for complex type - Unnecessary restriction on length units?

Steve Hanson smh at uk.ibm.com
Fri Apr 11 12:27:16 EDT 2014


Agreed that for complex elements use lengthKind 'implicit' and for simple 
BLOBs model as unsigned integer of specified bit length.

Regards
 
Steve Hanson
Architect, IBM DFDL
Co-Chair, OGF DFDL Working Group
IBM SWG, Hursley, UK
smh at uk.ibm.com
tel:+44-1962-815848



From:   Steve Hanson/UK/IBM
To:     Mike Beckerle <mbeckerle.dfdl at gmail.com>, 
Cc:     "dfdl-wg at ogf.org" <dfdl-wg at ogf.org>
Date:   26/03/2014 10:22
Subject:        Re: [DFDL-WG] lengthUnits bits not allowed for complex 
type - Unnecessary restriction on length units?


Mike, 

I think the expectation was that such fields would be modelled as unsigned 
integers.  The issue then becomes one of max size.

From your MIL spec quote below, it looks like this type is dfdl:lengthKind 
'prefixed', with dfdl:prefixLengthIncludesPrefix 'yes' ?

Regards
 
Steve Hanson
Architect, IBM DFDL
Co-Chair, OGF DFDL Working Group
IBM SWG, Hursley, UK
smh at uk.ibm.com
tel:+44-1962-815848




From:   Mike Beckerle <mbeckerle.dfdl at gmail.com>
To:     "dfdl-wg at ogf.org" <dfdl-wg at ogf.org>, 
Date:   25/03/2014 18:08
Subject:        Re: [DFDL-WG] lengthUnits bits not allowed for complex 
type - Unnecessary restriction on length units?
Sent by:        dfdl-wg-bounces at ogf.org



I know on the call I agreed that complex types with length bits wouldn't 
be needed, but I rechecked the format I am trying to implement, which is 
MIL-STD-2045-47001D (which is public) and I found this (emphasis mine):

5.6.42 Group Size field.
This field shall be a 12-bit binary number indicating the size, in bits, 
of the Future Use Group in which this field is
contained. A value of “0” should not be used for this field. If the parent 
group is specified present then this child
field is mandatory.

Turns out that these things are all just binary blobs in this spec 
document. There is no sub-structure provided as these are "Future Use", 
but it's always possible they will get specified or already are and I am 
just unaware of the document which gives the format of some of these. 
There are a flock of related standards to this one. Most are, 
unfortunately, not publicly available.

I am at a bit of a loss how to model these bit-length fields. HexBinary 
type allows length only in bytes to be compatible with XSD hexBinary which 
has a string-of-hex representation. 

I suppose I could use an xs:nonNegativeInteger with up to 4096 bits as the 
blob. Is there any other viable option? (array of bit is not viable)

My concrete suggestion: this data format is the same one that motivates 
the proposed dfdl:bitOrder property. I suggest that dfdl:bitOrder, along 
with any other issues needed to implement this standard, all be addressed 
at once when I've had time to complete an initial implementation in the 
daffodil code base. 

Lifting the restriction on these lengthUnits bits for complex types may 
well be required.

...mikeb



Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | 
www.tresys.com
Please note: Contributions to the DFDL Workgroup's email discussions are 
subject to the OGF Intellectual Property Policy



On Tue, Mar 25, 2014 at 9:01 AM, Mike Beckerle <mbeckerle.dfdl at gmail.com> 
wrote:

In section 12.3.7.3 we have this sentence: 

"The dfdl:lengthUnits may be 'bytes' or 'characters' and it is a schema 
definition error otherwise. "

Does anyone recall why we have this? 

I have data formats which are bit oriented and there are complex types 
which would naturally not be a multiple of 8 bytes long, e.g., 1 bit 
field, 3 bit field, 10 bit field, 6 bit field = 20 bits. 

I can't think of any reason for this restriction other than to explain how 
fillByte is used to fill in unused bits. But I think we can say that any 
unused bits are filled in with bits from the fillByte, and we don't have 
to be specific about which bits from the fillByte.


Mike Beckerle | OGF DFDL Workgroup Co-Chair | Tresys Technology | 
www.tresys.com
Please note: Contributions to the DFDL Workgroup's email discussions are 
subject to the OGF Intellectual Property Policy

--
  dfdl-wg mailing list
  dfdl-wg at ogf.org
  https://www.ogf.org/mailman/listinfo/dfdl-wg

Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU

Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ogf.org/pipermail/dfdl-wg/attachments/20140411/f8c5e162/attachment.html>


More information about the dfdl-wg mailing list