Skip navigation.

Accessibility & Inclusion
Your Online Presence > Managing Content

Multilingual Content Management

By Danny Sofer, Kitsite.com

This short article gives a quick summary of the issues of managing multilingual websites.

Introduction

There are seven key issues to consider:

  • Translation
  • Localisation
  • Culture
  • Feedback
  • Design
  • Workflow
  • Non-Latin character sets

Translation

Translation is essential to the running of a multilingual website. While machine translation does exist, there is no adequate substitute for employing qualified personnel or using an external translation service. Proof reading of translated text is also likely to be necessary.

Because of this, translation costs are likely to dwarf all other running costs on any site with significant translation requirements.

The use of machine translation should be considered with extreme caution, but it may be a plausible alternative for infrequently-accessed pages containing nonessential content. In this case, the use of short, unambiguously structured sentences and the avoidance of idiomatic phrases are essential, and sub-editing is likely to be required.

Localisation

A multilingual website is usually a mixture of global and local content. Local content presents no particular content management issues; global content - which has to be translated across all language locales - does.

Deciding where multiple language versions of content are going to be required and where content can be maintained separately for different locales is a critical decision that will affect how a site should be maintained and what it will cost.

Culture

Differences in language are only part of what distinguishes different locales. Graphical conventions, matters of taste, sense of humour, socially acceptable forms of address and issues of privacy all vary from place to place.

Also, some important concepts (think of Home Office or maternity pay) have no useful meaning if translated literally.

Feedback

Responses to any website feedback will need to be addressed in the language of the initial communication. User feedback should not be solicited in a language if it cannot be routed to a suitably qualified person who can answer in the appropriate language.

Scripts that handle interactivity, such as discussion forums, search results and feedback forms, will also need to be configured appropriately.

Design

Perhaps the most common, and an easily overlooked, difficulty encountered in developing multi-lingual websites is the maintenance of a consistent design across different language versions of a site, and in particular the layout of navigation: text or graphic labels that fit the design constraints in one language may not work well in translation.

The only sensible way to tackle this issue is to ensure that the initial design brief for a site includes all language variations of site branding and of the major navigational elements.

Also, links between pages should not lead unsuspecting users from one language locale into another.

Workflow

Simple workflow mechanisms usually offer some kind of notification when some action is performed on a page or when the page moves from one state to another.

Translation workflow, on the other hand, requires that changes to a page trigger appropriate notification of required changes to the other language versions of that page. It is important to remember that each time something in a document changes, the major effort will be in getting the changes translated again and to allow for this.

In addition, it is usually helpful to have some mechanism for identifying which elements within the page have changed.

Non-Latin character sets

There are some interesting challenges associated with the creation and rendering of non-Latin alphabets, although modern browsers have better support for them than in the past.

Unicode has now emerged as a recognised (and growing) international standard that includes most non-Latin characters and makes storage and retrieval of non-Latin characters in distributed environments (such as the web) much easier.

Unicode support for some character sets (such as Bengali) is still not universal, so the use of legacy character sets may occasionally be necessary (at least in the short term), but, ideally, website content should be stored and edited as unicode.

In addition, website pages should be published with an appropriate character set (either UTF-8 or a language-specific character set, such as windows-1256) and language META tags; any characters that are not in the publishing character set should be published as html entities; and direction tags should be specified, where appropriate.

It should also be possible to mix languages on a single page, allowing links to other language versions of the page to be handled simply.


About the author

Danny Sofer, Kitsite.com
Danny Sofer is a Director of Kitsite.com, developers of content management systems for websites.

Glossary

HTML, Storage, Website

Related articles

Published: 19th September 2003 Reviewed: 4th August 2006

Copyright © 2003 Danny Sofer, Kitsite.com

All rights reserved

User comments and discussion

If you have useful information to add to this article please Add a comment. Comments will appear after they have been moderated.

Discuss this topic in the Knowledgebase forums. This is a useful place to share knowledge, experiences, and ask questions.

Please sign in or register to be able to post a comment or discussion.