Type: Bug
Resolution: Fixed
Priority: Major
Fix Version/s: ODF 1.2 CD 06
Affects Version/s: ODF 1.2 CD 05
Component/s: Locale, Text
Labels: None

We should review the ODF 1.2 specification, in particular for the following:

1) Do all character literals specify their code points, e.g., '1' (U+0031)? Remember, not every reader of the standard will be a native English speaker or even a native user of Latin-1 characters. Since Unicode defines several characters that may look like a plus sign or a dash, we need to be explicit.

2) Are we crystal clear on whitespace treatment?

3) Bidi?

4) Whenever we talk about sorting, are we clear on whether this is lexical or a locale-dependent collation order?

5) What Unicode version?

6) For most of ODF we can deal with Unicode characters and strings of Unicode characters without discussing encodings. For serialization we permit whatever XML permits, and we don't need to deal with encoded characters. However, there are some exceptions where we need to be more explicit. One is passwords entered during encryption. Since the encryption algorithms work at the bit level, both encoding and byte ordering need to be specified.

7) Any functions that deal with upper case/lower case conversions, such as those in OpenFormula, must be specified correctly with respect to Unicode.

8) Anything else?
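
To illustrate item 1, here is a minimal Python sketch that prints the code point and Unicode name of a few characters that can be mistaken for a plus sign or a dash. It uses only the standard `unicodedata` module. The helper name `describe` and the list of look-alikes are assumptions made for illustration; the list is not exhaustive.

```python
import unicodedata


def describe(ch: str) -> str:
    """Return 'U+XXXX NAME' for a single character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


# Illustrative look-alikes only; not an exhaustive list.
LOOKALIKES = ["+", "\uFF0B", "-", "\u2010", "\u2013", "\u2212"]

for ch in LOOKALIKES:
    # Print each character next to its explicit code point and name.
    print(ch, describe(ch))
```

Writing the literal together with its code point, as in '-' (U+002D HYPHEN-MINUS), removes the ambiguity that the glyph alone leaves open.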

Suggested search phrases: character*, sort, search, collation, unicode, encod*, encrypt*, string (unless it is xsd:string), *space, dash, hyphen.
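
To illustrate the password concern in item 6, the following Python sketch shows that the same password string yields different byte sequences, and therefore different digests, depending on the encoding and byte order. SHA-256 is used purely for illustration; it is not taken from the specification, and the function name `digests_by_encoding` is hypothetical.

```python
import hashlib


def digests_by_encoding(password: str) -> dict:
    """Map each encoding to (hex bytes, SHA-256 hex digest) of the password."""
    result = {}
    for encoding in ("utf-8", "utf-16-le", "utf-16-be"):
        data = password.encode(encoding)
        # SHA-256 is an illustrative choice, not the algorithm the spec mandates.
        result[encoding] = (data.hex(), hashlib.sha256(data).hexdigest())
    return result


# 'p', U+00E4 (a with diaeresis), 's', 's'
for encoding, (raw, digest) in digests_by_encoding("p\u00E4ss").items():
    print(encoding, raw, digest[:16])
```

Two implementations that disagree on the encoding or byte order would derive different keys from the same password, which is why both need to be stated explicitly.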

Progress

1.	"Text character data" OpenDocument-v1.2-cd05-part1-editor-revision_04.odt	Applied	Michael Brauer (Inactive)
2.	Part 1, section 6.1.2 -- Are we saying that we normalize SPACE to itself?	Applied	Robert Weir (Inactive)
3.	Section 6.1.6 -- can we make that table so it doesn't split across pages? It messes up the 2nd row	Applied	Patrick Durusau
4.	Section 19.135.1 says "this name may contain arbitrary characters".	Applied	Patrick Durusau
5.	Section 19.364 -- I think we need a reference for the Unicode data base text file	Applied	Robert Weir (Inactive)
6.	Section 19.598 -- "string comprises one or more characters surrounded by quotation marks."	Applied	Unassigned
7.	Section 19.762 -- I would delete the "reference" column in that table	Applied	Patrick Durusau
8.	Missing definition of CJK and CTL	Applied	Patrick Durusau

Assignee: Robert Weir (Inactive)
Reporter: Robert Weir (Inactive)
Votes: 0
Watchers: 0

Created: 15/Jul/09 6:54 PM
Updated: 02/Nov/10 8:03 PM
Resolved: 01/Nov/10 7:01 PM
