[DFDL-WG] Zoned decimals: spec errata 2.92 & 2.88

Steve Hanson smh at uk.ibm.com
Fri Jun 7 06:32:41 EDT 2013


2.92. Section 13.6. When property textNumberRep is ‘zoned’, the property 
description should state that ‘zoned’ is only allowed for SBCS encodings 
(schema definition error otherwise). 

When I came to implement this for IBM DFDL, I noticed there were already 
tests for UTF-8 and Shift_JIS which succeeded. The point being that both 
these character sets are ASCII compatible for the first 128 code points 
(Shift_JIS has two code points that differ, x5C and x7E).  I am wondering 
if this errata is therefore too strict?  I am particularly concerned that 
there might be Japanese users who will have COBOL data in Shift_JIS or 
MS_Kanji.


2.88. Section 13.5. Add support for HP NonStop Tandem zoned decimals. In 
this architecture, the negative sign is incorporated in the last byte of 
the number in the usual manner, but the overpunching occurs on the highest 
bit (ie, value 8) of the nibble. Consequently, a new enum value 
'asciiTandemModified’ is added to property textZonedSignStyle.
 
The range of ASCII code points that are used in a zoned number is x30-x39 
and either x70-x79 (standard overpunch) or x7B, x41-x49 (translated EBCDIC 
overpunch) or x20-x29 (CA Realia overpunch). This errata adds x80-x89. But 
these are not code points in standard ASCII, so the modeller must specify 
something like ISO-8859-1 in order for this to parse without an encoding 
error. The wording in the spec for this errata alludes to this but could 
make this clearer.
asciiTandemModified: In this style the ascii characters ‘0-9’ represent 
positive sign and digits 0 to 9, but bytes from 0x80 to 0x89 are used to 
represent overpunched negative sign and a digit. There are no 
corresponding character codepoints in the standard ASCII encoding since 
these values are all above 128 (decimal). (Note that neither ISO-8859-1 
encoding nor Unicode have assigned glyphs for these codepoints. They are 
considered control characters.)

Regards

Steve Hanson
Architect, IBM Data Format Description Language (DFDL)
Co-Chair, OGF DFDL Working Group
IBM SWG, Hursley, UK
smh at uk.ibm.com
tel:+44-1962-815848
Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number 
741598. 
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.ogf.org/pipermail/dfdl-wg/attachments/20130607/0722472e/attachment.html>


More information about the dfdl-wg mailing list