[DFDL-WG] Agenda for OGF DFDL WG call 20 January 2010- 13:00 UK (8:00 ET) - TLOG

Steve Hanson smh at uk.ibm.com
Wed Jan 20 05:11:46 CST 2010


Finally had some time to look at the TLOG stuff again (the format emitted 
by IBM 4680 & 4690 POS controllers). This time I've looked at the MRM 
code, & spoken with domain expert David Bennett, to establish the full 
behaviour. Unfortunately there are some issues which mean DFDL 1.0 is not 
capable of handling it.

1. The individual fields are a mixture of ASCII strings, proprietary 
packed strings (rare), proprietary packed decimals, binary integers 
(rare). All fields are delimited by a separator. 

2. The fields are all defined with a length in bytes, but most of the 
string and decimal fields are actually variable length. If the data 
exceeds the length in bytes when parsing or unparsing the MRM throws an 
error. However, talking to David B, the length is really intended to be 
used if there was ever a fixed length equivalent format, so is really for 
validation only. To parse the current TLOG formats it is sufficient to use 
the delimiter. 
Note to WG: Validation using the specified length only works for strings. 
Should we allow dfdl:length to be specified when dfdl:lengthKind=delimited 
or pattern, and use it as an extra constraint when parsing/unparsing?

3. Scanning for the separator, or maybe use of a data pattern, is needed 
to extract the variable length data, including packed decimal data (which 
we would consider a binary type). 
Note to WG: Should we allow binary scanning when the users says it is safe 
to do so?

4. Packed strings. This is a packed data type used when the range of 
possible chars in the string is limited to 0-9, A-F. You can view this as 
a BCD that can also carry A-F. MRM does not try to turn this into an 
integer, instead it treats it as character data. That is x12x34 would 
result in string '1234'. Odd numbers of digits are padded with a x0 
nibble. On unparsing the MRM throws an error if a character other than 
0-9, A-F is encountered.  In practice, these packed strings are rare and 
invariably only carry 0-9 anyway. I'm still trying to establish why this 
data type is needed, as it could be treated as a logical integer and 
modelled as BCD.

5. Packed decimals. Like a packed decimal in the IBM sense. These can 
carry negative numbers but use a leading xD sign nibble. No sign nibble if 
positive. Odd number of digits (including sign if present) are padded with 
xF nibble. This is best illustrated using examples. 
1234     =>     x12x34
123      =>     xF1x23
-1234    =>     xFDx12x34
-123     =>     xD1x23
Note to WG: Should we support this data type natively?

6. Some of the packed decimals are interpreted as an array of bit flags. I 
assume that DFDL would model these using dfdl:hidden.

Regards

Steve Hanson
Programming Model Architect, WebSphere Message  Brokers,
OGF DFDL WG Co-Chair,
Hursley, UK,
Internet: smh at uk.ibm.com,
Phone (+44)/(0) 1962-815848



From:
Alan Powell/UK/IBM at IBMGB
To:
dfdl-wg at ogf.org
Date:
19/01/2010 18:40
Subject:
[DFDL-WG] Agenda for OGF DFDL WG call 20 January        2010-   13:00   UK 
(8:00 ET)
Sent by:
dfdl-wg-bounces at ogf.org




1.   045  - Disciminators 
Mike was writing up proposal 

2  Unparsing lengthKind = 'pattern' 
Review email from Alan 

3. TLOG 
SteveH has been investigating 

4, Empty Sequences 
Is the current section correct 

1.1        Empty Sequences 
A sequence having no children is syntactically legal in DFDL; however, a 
sequence having no children must have content length zero. It can still 
have non-zero length Prefix and Suffix regions, but the SequenceContent 
region in between must be of length zero. It is a schema definition error 
if the SequenceContent region of an empty sequence is not length zero. 

5. Mike review issues 
Attached document gives replies to comments/issues. Ones without replies 
need to be discussed 


6. Go through remaining actions 
7 Review Schedule 
Activity

Schedule
Who
Complete Action items 

             - 18 Dec 2009 
 WG 
Complete Spec 
Write up work items 
            – 23 Dec 2009 
AP 
Restructure and complete specification 
              - 23 Dec 2009 
AP 
Issue Draft 038 
23 Dec 2009

WG review 
WG review 
7 Dec – 08 Jan 2010 
WG 
Incorporate review comments 
4 Jan - 29 Jan 2010 
AP + 
Issue Draft 039 
15 Jan 2010

Incorporate review comments 
4 Jan - 29 Jan 2010 
AP + 
Issue Draft 040 
29 Jan 2010

Initial OGF Editor Review 
Initial Editor review 
1 Feb - 1 Mar 2010 
OGF 
Initial GFSG review 
1 Feb - 1 Mar 2010

Issue Draft 041 
1 Mar 2010

OGF Public Comment period (60 days) 

1 Mar - 30 Apr 2010 
OGF 
OGF 28 Munich 

15-19 March 2010 

Incorporate comments 
Incorporate comments 
28 May 2010

Issue Draft 042 
28 May 2010

Final OGF Editor Review 
Final  Editor review 
June  2010 
OGF 
final GFSG review 
June  2010

Issue Final specification 
30 June 2010

Publish proposed recommendation 

1 July 2010





Grid recommendation process 

1 Jan - 1 April 2011



Current Actions: 
No
Action 


045
20/05 AP: Speculative Parsing 
27/05: Psuedo code has been circulated. Review for next call 
03/06: Comments received and will be incorporated 
09/06: Progress but not discussed 
17/06: Discussed briefly 
24/06: No Progress 
01/07: No Progress 
15/07: No progress. MB not happy with the way the algorithm is documented, 
need to find a better way. 
29/07: No Progress 
05/08: No Progress. Will document behaviour as a set of rules. 
12/08: No Progress 
... 
16/09: no progress 
30/09: AP distributed proposal and others commented. Brief discussion AP 
to incorporate update and reissue 
07/10: Updated proposal was discussed.Comments will be incorporated into 
the next version. 
14/10: Alan to update proposal to include array scenario where minOccurs > 
0 
21/10: Updated proposal reviewed 
28/10: Updated proposal reviewed see minutes 
04/11: Discussed semantics of disciminators on arrays. MB to produce 
examples 
11/11: Absorbing action 033 into 045.  Maybe decorated discrminator kinds 
are needed after all. MB and SF to continue with examples.   
18/11: Went through WTX implementation of example. SF to gather more 
documentation about WTX discriminator rules. 
25/11: Further discussion. Will get more WTX documentation. Need to 
confirm that no changes need to Resolving Uncertainty doc. 
04/11: Further discussion about arrays. 
09/12: Reviewed proposed discriminator semantic. 
16/12: Reviewed discriminator examples and WTX semantic. 
23/12: SF to provide better description of WTX behaviour and invite B 
Connolley to next call 
06/01:B Connolly not available. SF to provide more complete description. 
13/01: Stephaine took us through a description of WTX identifiers. Mike 
agreed to write up in DFDL terms. 
049
20/05 AP Built-in specification description and schemas 
03/06: not discussed 
24/06: No Progress 
24/06: No Progress (hope to get these from test cases) 
15/07: No progress. Once available, the examples in the spec should use 
the dfdl:defineFormat annotations they provide. 
... 
14/10: no progress 
21/10: Discussed the real need for this being in the specification. It 
seemed that the main value is it define a schema location for downloading 
'known' defaults from the web. 
28/10: no progress 
04/11: no progress 
11/11: no update 
18/11: no update 
25/11: Agreed to try to produce for CSV and fixed formats 
04/12: no update 
09/12: no update 
16/12: no update 
23/12: no update 
06/01: no progress. If there is no resource to complete this action it can 
be deferred 
13/01:no progess 
064
MB/SH Request WG presentation at OGF 28 
25/11: Session requested 
04/12: no update 
09/12: no update 
16/12: SH has changed request to a general session rather tha WG in the 
hope of attracting more people. 
23/12: no update 
06/01: not heard anything yet 
13/01: no update 
066
Investigate format for defining test cases 
25/11:IBM to see if it is possible to publish its test case format. 
04/12: no update 
09/12: no update 
16/12: reminded dent to project manager 
23/12: SH will send another reminder. 
06/01: Another reminder will be sent 
13/01: no update 
068
Should the roots of messages be designated.? 
09/12: Yes. New dfdl:documentRoot property 
Closed 
16/12: reopened and decided to drop property subject to agreement from SKK 
and SF 
23/12: SKK review decision to drop  dfdl:documentRoot 
13/01: closed 
071
Semantics of length=0, nil handling and defaults. 
23/12:SH no update 
06/01: SH has started 
13/01: SH proposal review. Minor updates to be made 
073
SH: Control of overpunching zoned positive sign 
13/01: no update 




Alan Powell

MP 211, IBM UK Labs, Hursley,  Winchester, SO21 2JN, England
Notes Id: Alan Powell/UK/IBM     email: alan_powell at uk.ibm.com 
Tel: +44 (0)1962 815073                  Fax: +44 (0)1962 816898





Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU 





--
  dfdl-wg mailing list
  dfdl-wg at ogf.org
  http://www.ogf.org/mailman/listinfo/dfdl-wg







Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU






-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.ogf.org/pipermail/dfdl-wg/attachments/20100120/ff82ec94/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ogf-dfdl-v1.0-Core-037.final.MB.Comments.doc
Type: application/octet-stream
Size: 66048 bytes
Desc: not available
Url : http://www.ogf.org/pipermail/dfdl-wg/attachments/20100120/ff82ec94/attachment-0001.obj 


More information about the dfdl-wg mailing list