Menu
|
Setting Type in MarshalleseMarshallese is the language of the Marshall Islands in the Eastern Pacific Ocean. The modern version of the written language uses the Latin character set plus sixteen combinations of Latin characters with diacritical marks. To properly set type in Marshallese, the following issues should be addressed, in the order listed: Character DefinitionThe sixteen specialty characters include eight having a macron over them, and eight having an undefined diacritical mark under them. We have been unable to get a definitive answer as to which mark is "correct" for the "below" mark. However, in common practice, the cedilla is used, and this appears to be the recommended practice. The remainder of this discussion assumes use of the cedilla. Caveat: It is possible that some entity with authority on the subject may change this preference, which may result in documents needing to be encoded with a different scheme. Character EncodingFor consistency and usability, it is desirable, where possible, to use Unicode as the basis for encoding the characters. Of the sixteen special Marshallese characters, six of them do not have Unicode code points assigned, and it appears that they never will: To avoid creating every conceivable combination of Latin character and diacritical mark, the intent of the Unicode standard is to have the base character and diacritical mark keyed separately, as two characters. The rendering software should then combine the two characters together. The table below summarizes the Unicode code points for the correct combined characters, and also for the character combinations that produce the same effect. The Decimal equivalent of the Unicode code point is included as well, as it is useful for keyboard entry in some applications (for example, holding the Alt-key down in Microsoft Word while keying the Decimal value will enter the character):
Notes: For the six non-Unicode characters, if your software or font does not handle combining diacriticals properly there are at least two workarounds, both deprecated:
The following table summarizes these options. Please note that the private-use Unicode code points listed are purely arbitrary:
The following table summarizes the Unicode code points for the various combining diacriticals that are acceptable:
The combining diacriticals can be (or at least should be) usable even for characters that have Unicode code points for the combined characters. For example, even though A-macron has the Unicode code point U+0100 assigned, it could also be entered by first keying an uppercase A, then keying a combining macron. This may simplify keyboard entry for some users, because only two special characters need to be maintained and remembered, rather than sixteen. FontsTo use the encoding described above, your font must:
Even though the OpenType font format supports the concept of combining diacriticals discussed above, current searches (April, 2004) reveal no commercial fonts that have implemented the necessary features. Furthermore, it is only with great difficulty that such features can be added to commercial fonts (see OpenType fonts for discussion). ApplicationsEven after appropriate fonts are licensed or created, there is an amazing dearth of software applications that support the combining diacriticals. Adobe InDesign 2.0 does not, and we are told that Adobe InDesign CS does not either. Microsoft Word 2003 appears to be the only widely available package to do so, and its implementation appears to be a bit buggy. Convenient Keyboard EntryWe believe that convenient keyboard entry can be achieved in most applications and environments without significant effort. However, until the fonts and applications are more widely available, we will probably not address this issue. ConclusionUsing Unicode for setting type in Marshallese is not ready for "prime time" yet. It is probably best to continue using the current practice of arbitrarily-encoded precomposed glyphs instead. However, this situation is likely to change in the near future, as fonts and applications adopt more of the OpenType feature set. This article was written by Victor Mote, April 20, 2004. |