[DFDL-WG] Minutes: OGF DFDL Working Group Call, April-08-2009

Steve Hanson smh at uk.ibm.com
Thu Apr 9 06:43:54 CDT 2009


Open Grid Forum: Data Format Description Language Working Group

OGF DFDL Working Group Call, April-08-2009

Attendees
Suman Kalia (IBM)
Steve Hanson (IBM)
Mike Beckerle (Oco)

Apologies
Alan Powell (IBM)
Dave Glick (drac)


1. Escape Schemes 
Alan has mailed the latest refinement.
Overall
Agreed with the scope of the escape scheme support, i.e., support three 
well-known variants, and not provide overly complex open-ended support.
Annotation structure. 
Why dfdl:defineEscapeScheme and dfdl:escapeScheme, instead of just 
dfdl:escapeScheme and an optional name attribute?   For consistency with 
dfdl:defineFormat, dfdl:defineNumberFormat, etc, and it makes it clear 
that the top level scope of the naming is a peer to dfdl:defineFormat not 
inside them.
Annotation properties.
These need a careful review to make sure that they behave in the expected 
manner. 
For example, should escape start/end bracketing be at the start/end of the 
field, or anywhere in the field?
Action raised to review in detail for next call.
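
To make the open question concrete, here is a minimal Python sketch (not 
DFDL, and not taken from Alan's proposal) that splits delimited text under 
the two readings of escape bracketing. The function name split_fields, the 
anywhere flag, and the choice of double quotes as the bracketing characters 
are all assumptions made for illustration.

```python
def split_fields(text, sep=",", start='"', end='"', anywhere=False):
    """Split text on sep, treating start...end as an escape block.

    anywhere=False: a block is recognised only at the start of a field.
    anywhere=True:  a block may open at any position within a field.
    Hypothetical helper; the bracketing characters are kept in the output.
    """
    fields, current, in_block = [], "", False
    for ch in text:
        if in_block:
            # Inside a block every character is data until the end marker.
            current += ch
            if ch == end:
                in_block = False
        elif ch == start and (anywhere or current == ""):
            in_block = True
            current += ch
        elif ch == sep:
            fields.append(current)
            current = ""
        else:
            current += ch
    fields.append(current)
    return fields
```

With the input x"a,b"y,z the two readings disagree: bracketing anywhere 
yields two fields, while bracketing only at the start of a field yields 
three, because the embedded separator is no longer protected.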

2. Validation ranges
Need to decide whether to allow restricted use of xs:union to model this.
Agreed that this should be supported. For such a union:
- The member types must all be derived from the same schema simple type
- Any DFDL annotations on member types are a schema definition error
Will be added to draft 0.34.
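
As a rough illustration of the intended validation behaviour, the Python 
sketch below models each member type as an inclusive range restriction on 
the same base type and accepts a value if any member accepts it, which is 
how xs:union validates. The Range class, union_accepts, and the example 
ranges are hypothetical names and values, not part of the spec.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Range:
    """One member type: a restriction with inclusive min/max facets."""
    low: int
    high: int

    def accepts(self, value: int) -> bool:
        return self.low <= value <= self.high


def union_accepts(members, value):
    """A union validates a value if at least one member type accepts it."""
    return any(m.accepts(value) for m in members)


# Hypothetical example: valid codes are 0-9 or 100-199.
codes = [Range(0, 9), Range(100, 199)]
```

Here union_accepts(codes, 5) and union_accepts(codes, 150) hold, while 50 
falls between the two ranges and is rejected.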

3. Specialized annotations
Need to decide whether to drop specialized annotations altogether, or use 
the scheme below, which does not affect scoping in any way but which makes 
it clear what is allowed where.

        Specialized annotations on xsd objects only, dfdl:format on 
scoping constructs only
                dfdl:defineFormat  =>  dfdl:format
                xs:complexType => dfdl:format 
                xs:sequence => dfdl:sequence 
                xs:choice => dfdl:choice 
                xs:group ref => dfdl:group
                xs:element or xs:element ref => dfdl:element 
                xs:any => dfdl:any 
                xs:simpleType => dfdl:simpleType 
        dfdl:format is exactly as specified in draft 0.33, its properties 
apply to all relevant objects
Scoping rules as specified in draft 0.33

Agreed that this scheme provided the best balance between simplicity and 
validation capability.
Will be added to draft 0.34.
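
The validation benefit of this scheme can be sketched as a simple lookup: 
each schema object permits exactly one annotation element, so a misplaced 
annotation is detectable without reference to scoping. The Python below is 
only an illustration of that check; the table restates the mapping from 
these minutes, and the function name annotation_allowed is hypothetical.

```python
# Mapping as listed in the minutes: schema object -> permitted annotation.
ALLOWED_ANNOTATION = {
    "dfdl:defineFormat": "dfdl:format",
    "xs:complexType": "dfdl:format",
    "xs:sequence": "dfdl:sequence",
    "xs:choice": "dfdl:choice",
    "xs:group ref": "dfdl:group",
    "xs:element": "dfdl:element",
    "xs:element ref": "dfdl:element",
    "xs:any": "dfdl:any",
    "xs:simpleType": "dfdl:simpleType",
}


def annotation_allowed(schema_object, annotation):
    """Return True if the annotation is the one permitted on the object."""
    return ALLOWED_ANNOTATION.get(schema_object) == annotation
```

For example, annotation_allowed("xs:sequence", "dfdl:sequence") is true, 
whereas dfdl:format on an xs:sequence would be flagged.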

4. Exclusion lists.
XML Schema only allows inclusion lists of enumerations. Does DFDL need to 
support exclusion lists of enumerations?  It would be nice if it did, but 
if we provided a DFDL property that said 'treat enums as exclusion instead 
of inclusion', removal of the DFDL annotations would change the validation 
semantics. Agreed that exclusion provision is something that DFDL would 
inherit from XML Schema, when and if it gets added there.
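
The objection can be shown in a few lines of Python. The sketch assumes a 
hypothetical treat_as_exclusion flag standing in for the DFDL property that 
was discussed and rejected; a plain XML Schema validator would never see 
that flag, so the same value gets opposite verdicts.

```python
def validate(value, enums, treat_as_exclusion=False):
    """Validate a value against an enumeration list.

    treat_as_exclusion is a hypothetical stand-in for the rejected DFDL
    property; with the annotation stripped it reverts to False.
    """
    if treat_as_exclusion:
        return value not in enums
    return value in enums


enums = {"RED", "GREEN"}
with_annotation = validate("RED", enums, treat_as_exclusion=True)
without_annotation = validate("RED", enums)
```

With the annotation present "RED" is rejected; with it removed the same 
schema accepts "RED".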

5. Consuming extraneous data that occurs at the end of the stream
This is where the DFDL model matches input data ok, except that there is 
some extra data in the stream. This can be explicitly modelled, using a 
hidden optional element. Agreed that whether such a hidden optional 
element is needed, or whether the data is simply ignored, is up to 
individual DFDL implementations. The spec will not take a position.

6. 'Floating' definitions
A known element, the position of which can be anywhere in a sequence of 
other elements - is this something DFDL needs to support?  Capability is 
offered by IBM's WTX product.  Can be used for comments, but DFDL plans to 
handle comments post 1.0 using an explicit mechanism or using layering. 
Real purpose of floating component is for older EDI formats where there is 
a segment that can appear anywhere, and can appear any number of times. 
Action raised for IBM to provide a concrete example for discussion. The 
issue for DFDL is how does a floating component appear in the DFDL 
infoset, and how does it validate in the sequence. One possibility is a 
property dfdl:floating=yes/no and if an element has that property set, it 
can be expected anywhere when parsing, but appears in the correct point in 
the sequence in the parsed infoset. On unparsing it must appear in the 
correct point in the sequence, and is output in that place.
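
The possibility described above can be sketched in Python as follows. This 
is only one reading of the dfdl:floating idea, under the assumption that 
the input is already tokenised into (name, value) pairs; the function name 
and the segment names used in the usage note are hypothetical.

```python
def parse_with_floating(declared_order, floating, tokens):
    """Return the parsed items in declared (schema) order.

    declared_order: element names in schema sequence order.
    floating: set of names that may appear anywhere in the input.
    tokens: (name, value) pairs in the order they occur in the data.
    Raises ValueError if an unknown element occurs or if non-floating
    elements are out of order.
    """
    position = {name: i for i, name in enumerate(declared_order)}
    last = -1
    for name, _ in tokens:
        if name not in position:
            raise ValueError(f"unknown element {name!r}")
        if name in floating:
            continue  # floating elements are exempt from the order check
        if position[name] < last:
            raise ValueError(f"element {name!r} out of sequence")
        last = position[name]
    # Stable sort: input order is preserved among repeats of one element.
    return sorted(tokens, key=lambda t: position[t[0]])
```

Given a declared order of HDR, NTE, LIN, TRL with NTE floating, input in 
the order HDR, LIN, NTE, LIN, NTE, TRL produces an infoset ordered HDR, 
NTE, NTE, LIN, LIN, TRL. Unparsing would then write the elements in that 
infoset order.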

7. Recursive use of DFDL for variable markup
Use of a DFDL annotated element/type to describe an initiator, length 
prefix, terminator, separator, etc. Steve suggested the most important use 
of "variable markup-like mechanism" in IBM's WTX product is to reference a 
location earlier in the bit stream where a delimiter value is found. We 
handle this already by use of a path expression. The additional variable 
markup mechanism was to avoid proliferation of keywords for various corner 
cases on initiator, terminator and separator. E.g., what if you want the 
initiator to be "Name" or "name" only, not "NAME", "nAmE", etc. So case 
insensitive is not expressive enough. This can always be modeled, just not 
as an initiator tag. Feeling was to leave out variable markup (other than 
for prefix lengths) for v1.0, and to propose the minimum set of extra 
properties that can be used to address the common use cases, but that IBM 
needed to see whether this satisfied all WTX use cases. 
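
The initiator corner case is easy to demonstrate in Python. The sketch 
contrasts a case-insensitive match with an explicit list of permitted 
spellings; the explicit list is just one conceivable shape for an extra 
property and is an assumption here, not something the call decided. Both 
function names are hypothetical.

```python
def starts_with_ignoring_case(data, initiator):
    """Case-insensitive initiator match: accepts every casing."""
    return data[:len(initiator)].lower() == initiator.lower()


def starts_with_any(data, initiators):
    """Match only the exact spellings listed as alternatives."""
    return any(data.startswith(i) for i in initiators)
```

With the data NAME=Bob, the case-insensitive match on "name" succeeds, 
which is too permissive, while starts_with_any with ["Name", "name"] 
correctly rejects it and still accepts name=Bob.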
 
(Post-call update. It doesn't, there is a use case from WTX, Steve will 
mail this out before next call).

Actions updated below.

Next call 15 April 14:00 UK

Meeting closed, 15:05

Actions raised at this meeting

No
Action 
035
AP: Add validation ranges to spec, update specialized annotations in spec.
08/04: Raised. For draft 0.34
036
SH: Provide use case for floating component in a sequence
08/04: Raised


Current Actions:
No
Action 
012
AP/SH: Update decimalCalendarScheme
10/9: Not allocated yet
17/9: No update
24/9: Add calendar binary formats to actions
22/10: No progress
16/1: proposal distributed and discussed. Will be redistributed
21/1: add locale, 
04/02: changed from locale to specific properties
18/2: Need more investigation of ICU strict/lax behaviour.
08/04: Not discussed
020
SH: Resolve packedDecimalSignCodes behaviour depends on NumberCheckPolicy 
22/10: No progress
10/12: added how to decide to overpunch and sign position
11/02: proposal largely agreed. SH to make minor changes
18/02: AP to document unsigned type behaviour
25/02: no progress
08/04: Not discussed
023
MB: Review Schema 1.1
29/1: AP and SH to talk to Sandy Gao
04/02: Call arranged for Friday
11/02: Call took place. Identified useful changes. Consolidate with 
previous list.
04/03: decided to stay on Schema 1.0.
08/04: Not discussed
024
String XML type
08/04: Not discussed
025
Escape schemes 
21/1: discussed requirements
04/02: AP/SH to describe behaviour for known length text fields. Need to 
discuss if comment escapes should be supported.
11/02: new draft distributed
18/02: SH to document concerns
25/02: SH and AP have refined proposal ready for approval.
04/03: SH and AP have further refined proposal.
11/03: discussed. suggested a simplified proposal be evaluated.
18/03: SH and AP had further discussions on simplified proposal
08/04: See minutes, review in detail for next call 
026
SH: Envelopes and Payloads
08/04: Not discussed explicitly, but recursive use of DFDL is tied up with 
this
027
Property precedence tables
08/04: Not discussed
028
SH: Variable markup 
08/04: Discussed briefly at end of call, IBM to see whether there are any use 
cases that require recursive use of DFDL.
029
valueCalc (output length calculation)
08/04: Not discussed
032
DG: Investigate compatibility between DFDL infoset and XDM
08/04: No update
033
AP/TK: Assert/Discriminator semantics. AP to document. TK to check uses of 
discriminator besides choice.
08/04: In progress within IBM
034
AP: Remove redundant properties, correct old examples
08/04: No update

Closed actions:

031
DG: Review dfdl v033
11/02:  Initial comments received
18/02: Will include work items 5 and 12.
11/03: complete

Work items:
No
Item

001
String XML type (Ian P) - Apr 30, 2008 

002
Escape schemes (Ian P) - Apr 30, 2008 

003
Variables - ??, 2008 (Mike) 

005
Improvements on property descriptions - ??, 2008 (All - split TBD) 

006
Envelopes and Payloads (Steve) - Apr 30, 2008

007
(from draft 32) valueCalc (Mike) - ??, 2008  
mostly complete
008
(from draft 32) Property precedence for writing (Steve) - 
under review
009
(from draft 32) Variable markup (Steve) - Mar 31, 2008  
proposal needs writing up
010
(from draft 32) Assertions, discriminators and choice, including 
discussion of timing option (Suman) - Mar 31, 2008 * in progress *

011
(from draft 32) How speculative parsing works (combining choice and 
variable-occurrence - currently these are separate) ??, 2008 (IBM) 
 in progress 
012
(from draft 32) Reordering the properties discussion: move representation 
earlier, improve flow of topics ??, 2008 (Alan) * not started *

025
Augmented infoset and unparsing (Alan) 
added but needs work
026
 Remove duration


Regards

Steve Hanson
Programming Model Architect
WebSphere Message Brokers
Hursley, UK
Internet: smh at uk.ibm.com
Phone (+44)/(0) 1962-815848
