Pages: 1
Author Topic: Prevent named entities in HTML output  (Read 3707 times)

Posts: 2

« on: June 15, 2009, 08:14:30 AM »


XMetaL 5.5

Generating a single HTML file

In my translated files, characters such as the German u-umlaut (ü) are converted to
named entities (for example, &uuml;) in the HTML.

How can I stop this?

When I process the files to multiple HTML files, the characters are kept as UTF-8 characters, which is what I want.

There seems to be something different in the scripts for the single HTML generation, but I can't find it.

Any help greatly appreciated.

Many thanks,
Derek Read
Program Manager (XMetaL)

Posts: 2621

« Reply #1 on: June 16, 2009, 11:04:48 AM »

I don't have a quick answer for how to fix this, but I will log it as a defect in our tracking system. I would expect Single HTML to behave like Multiple HTML, at least in using UTF-8 encoding and writing all characters as themselves rather than changing them to named character entities. All browsers must be able to render these entities (to comply with the W3C recommendations), but there doesn't seem to be any really good reason for converting the characters, so my feeling is that it is just an oversight on our side.

Note that Single HTML is not offered as an output type by the DITA OT. Multiple HTML is included with the DITA OT (where it is referred to as the "xhtml" transtype). So Single and Multiple differ because Single is our own creation.

If development is able to research this and provide an answer (before our next release), I will post it here for people who wish to attempt to alter the behavior themselves.
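
In the meantime, one possible workaround is to post-process the generated file. The following is only a minimal sketch, not part of XMetaL or the DITA OT: it assumes the Single HTML output is saved as UTF-8 and declares that encoding, and the function names (replace_named_entities, convert_file) are made up for illustration. It replaces named entities such as &uuml; with the characters themselves, but leaves &amp;, &lt;, &gt; and &quot; alone because those are needed to keep the markup valid.

```python
import re
from html.entities import name2codepoint

# Entities that must stay escaped because they are significant in HTML markup.
KEEP_ESCAPED = {"amp", "lt", "gt", "quot", "apos"}

# Matches named entities such as &uuml; (numeric references like &#252; are not touched).
_ENTITY = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def replace_named_entities(html_text: str) -> str:
    """Replace named character entities with the characters they stand for."""

    def _sub(match):
        name = match.group(1)
        if name in KEEP_ESCAPED or name not in name2codepoint:
            # Leave markup-significant and unknown entities exactly as they are.
            return match.group(0)
        return chr(name2codepoint[name])

    return _ENTITY.sub(_sub, html_text)


def convert_file(path: str) -> None:
    """Rewrite one HTML file in place (hypothetical helper; assumes UTF-8 on disk)."""
    with open(path, encoding="utf-8") as f:
        text = f.read()
    with open(path, "w", encoding="utf-8") as f:
        f.write(replace_named_entities(text))
```

To try it, call convert_file() with the path to a copy of the generated HTML file, then check the result in a browser to confirm the characters display correctly.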