[DFDL-WG] Minutes for OGF DFDL Working Group Call, March 17-2010
Alan Powell
alan_powell at uk.ibm.com
Thu Mar 18 11:56:54 CDT 2010
Open Grid Forum: Data Format Description Language Working Group
OGF DFDL Working Group Call, March 17-2010
Attendees
Suman Kalia (IBM)
Steve Hanson (IBM)
Alan Powell (IBM)
Steve Marting (Progeny)
Stephanie Fetzer (IBM)
Mike Beckerle (Oco)
Apologies
Tim Kimber(IBM)
Steve H reported on the DFDL presentation at OGF 28
Overall OGF registered attendees thought to be around 250.
Number at DFDL session: 7. Apart from Erwin (Data AD), DFDL was new to
everyone.
Notes from session (from minutes taken by Erwin Laure based on questions
asked):
Validation on input and output? Means conforming to the schema defined
(e.g. integer between 0 and 100). dfdl assert could be used for asserting
correctness of the data. Further complex validation would be via an
external step (eg, Schematron invocation).
Using DFDL to model data structures generated by OO code, particularly use
of inheritance? Could be a use case for allowing complex type inheritance
in DFDL in the future.
Data often comes with units, infoset would want to reflect this? Could be
achieved using expressions, dfdl:inputValueCalc and dfdl:hidden to apply a
scaling factor based on units.
What's the efficiency of DFDL? Particularly, can tests be turned off for
fast read/write? The spec defines the behavior of the parser but not how
to implement it. For instance, validation is not mandatory.
Open source reference implementation would be good to have. (Lots of
nodding).
Scientific floating point data compresses badly. Knowing data structure
can allow a more intelligent compression. DFDL is not intending to do
transformations but that could be done on top.
Encryption/compression could also be a use case for multi-layers, or for
additional functions in the expression language.
Need to reach out to DAIS-WG and DR-WG for (public) comments and to see
whether DFDL will be actively used by those groups. Open source
implementation would obviously help here.
Comment on the spec: Should be a proposed recommendation (GFD-P-R) not
informational (GFD-I).
Draft 040 has been submitted to OGF and will be reviewed by the technical
committee on March 30th. It is expected to go out for public comment in
early April.
There needs to be some publicity to ensure that enough comments are made.
Mike suggested a press release from his company with IBM comment. Action
raised
May be able to get IBM representatives on other standards bodies such as
ACCORD. OMG, to interest their members,
Should contact all the contributors to the WG to ask them to review.
Mike will update the information he sent in response to an RFI from OMG
1 Process for dealing with internal issues during Public comment phase.
The public comments process will be used to make updates
2. Nils and Defaults during unparsing
The table 17 in section 13.16.2 was corrected in draft 40 but here are
still some ambiguities, for example when nil is the default.
Logical Value
nilValueInitiatorPolicy
Has default value specified
missingValueInitiatorPolicy
initiator region contains
content region contains
Nil
(implies nillable)
prohibited
don't care
don't care
empty
representation of nil based on nilKind, nilValue, etc.
required
initiator string
"" (empty string)
Note that this implies that the element type is xs:string
don't care
prohibited
empty
empty string
required
initiator string
a non-nil non-empty-string value
don't care
don't care
initiator string
The representation of the logical value
Not supplied
don?t care
Yes
(non-empty sting)
don't care
Initiator string
The representation of the default value.
Yes
(empty string)
prohibited
empty
empty
required
initiator string
empty
Alan will go update the table.
3 dfdl:choiceKind
The main issues are:
a) The calculation of the length of the longest branch is not obvious.
b) The length units to use - the dfdl:lengthUnits property does not exist
on a choice
c) The name could be better
Proposal is therefore to retain the property but to:
i) State the conditions that must apply to use this property, and enforce
them in the validator => schema definition error otherwise
ii) Decouple the choice from its parent by calculating the length of each
branch based solely on the properties of the branches components,
irrespective of any parent dfdl:lengthKind
Alan document the problem. Name change agreed.
4 DFDL time functions
fn:timezone-from-dateTime
Returns the timezone from an xs:dateTime value.
fn:timezone-from-date
Returns the timezone from an xs:date value.
fn:timezone-from-time
Returns the timezone from an xs:time value.
Function
Meaning
fn:adjust-dateTime-to-timezone
Adjusts an xs:dateTime value to a specific timezone, or to no timezone at
all.
fn:adjust-date-to-timezone
Adjusts an xs:date value to a specific timezone, or to no timezone at all.
fn:adjust-time-to-timezone
Adjusts an xs:time value to a specific timezone, or to no timezone at all.
All return an xs:duration. Do we need these functions?
As there is no known use case for these functions they will be dropped.
Meeting closed, 14:10
Next call Wednesday 17 March January 2010 13:00 UK (9:00 ET)
Call will be for one hour only
NOTE: East coast is 4 hours behind the UK for the next two weeks
Next action: 087
Actions raised at this meeting
No
Action
085
ALL: publicize Public comments phase to ensure a good review..
086
AP: Nils and Defaults during unparsing - update table
Current Actions:
No
Action
066
Investigate format for defining test cases
25/11:IBM to see if it is possible to publish its test case format.
04/12: no update
09/12: no update
16/12: reminded dent to project manager
23/12: SH will send another reminder.
06/01: Another reminder will be sent
13/01: no update
20/01: no update
27/01: no progress
29/01: no progress
03/02: IBM is still investigating
10/02: IBM is still investigating
17/02: IBM is willing in principle to publish the test case format and
some of the test cases. May need some time to build a 'compliance suite'
24/03: No progress
03/03: Discussions have been taking place on the subset of tests that will
be provided.
10/03: work is progressing
17/03: work is progressin
084
Check behaviour of dfdl:inputValueCalc and outputValueCalc.
085
ALL: publicize Public comments phase to ensure a good review..
086
AP: Nils and Defaults during unparsing - update table
Closed actions
No
Action
Work items:
No
Item
target version
status
005
Improvements on property descriptions
not started
012
Reordering the properties discussion: move representation earlier, improve
flow of topics
not started
036
Update dfdl schema with change properties
ongoing
042
Mapping of the DFDL infoset to XDM
none
not required for V1 specification
070
Write DFDL primer
071
Write test cases.
083
Implement RFC2116
097
Remove functions that returns duration
Regards
Alan Powell
Development - MQSeries, Message Broker, ESB
IBM Software Group, Application and Integration Middleware Software
-------------------------------------------------------------------------------------------------------------------------------------------
IBM
MP211, Hursley Park
Hursley, SO21 2JN
United Kingdom
Phone: +44-1962-815073
e-mail: alan_powell at uk.ibm.com
Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number
741598.
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.ogf.org/pipermail/dfdl-wg/attachments/20100318/dbb79839/attachment-0001.html
More information about the dfdl-wg
mailing list