This article describes how to configure the sorting of index entries for users of the DITA Open Toolkit (DITA OT). The DITA OT is installed automatically with XMetaL Author Enterprise.
BackgroundRules for sorting index entries vary by language, and often also by region (
locale). For details, see:
http://en.wikipedia.org/wiki/Collation. For example, using English sorting rules, the word "Ñacunday" comes before the word "natural", whereas in Spanish sorting rules all words starting with "n" come before words starting with "ñ", so "natural" comes first.
PrerequisitesIndex sorting rules for non-PDF OutputTo our knowledge, index sorting rules for non-PDF output are not documented. We are not aware of any major issues with the pre-configured index sorting rules for non-PDF output.
Index sorting rules for PDF OutputFor "XMetaL Enhanced PDF via RenderX" output, index sorting rules are defined for each language in the following folder:
DITA_OT\demo\xmfo\cfg\common\index. They are overridden by any files in the following folder:
DITA_OT\demo\xmfo\Customization\common\index.
If you are generating PDF output using a standalone installation of the DITA Open Toolkit, index sorting rules are in the following folder:
DITA_OT\demo\fo\cfg\common\index. They are overridden by any files in the following folder:
\demo\fo\Customization\common\index.
The default index sorting rules for some languages are poor. For example, index terms which begin with accented letters might not appear in the index at all. The DITA OT treats
a,
A,
à, and
À as separate letters, so uppercase and lowercase versions of each possible accented letter must be explicitly included in the index sorting rules. You can download a
set of index rules files here. The files, which are free but require Yahoo! registration, are much more complete than the default set. Copy the files to your
DITA_OT\demo\xmfo\cfg\common\index folder. These files have been generously made available by other DITA users.
For sorting indexes in Chinese and Japanese documents, we have heard from some customers that the
Antenna House I18n Index Library gives best results.