Localization and Word Processing



 

Introduction

It is a little remarked irony, or rather scandal, of word processing and the Internet that, although the smallest unit of a document is the individual letter or "character", it is impossible to guarantee correct transfer of these units from one language-domain to another. Only unaccented letters, the numerical digits, and a few punctuation marks are unambiguously defined in bits by the quasi-universal ASCII code.

This means that separate software is needed to guarantee an acceptable display for almost every language in the world. An implication is that is usually impossible to find software that will simultaneously display text in a variety of different languages' typefaces.

Localization is the process whereby software packages are made acceptable to markets which use different languages. Besides conversion of the fonts in application programmes, it also involves translation of all the system messages and help files, in application and operating systems. This can have major knock-on effects in the soource code, especially where this has been written with only one interaction language in mind.

Within word processors, the computer can give various linguistic aids to the user. It can check spelling, offer comments on the grammar and style of the text, and look up words and phrases in dictionaries and thesauri.

 

Where the Progress is Being Made

This is not a field for targeted research, more an area for specific improvements, often requiring administrative, rather than technical progress. The kind of issues that are involved are set out in  David Brooks: Translating into Success .

There is a particular centre of localization companies in Ireland. See:   Niceone.

More radical approaches need to be taken in order to provide effective representation for the vast number of languages which are only recently literate, and may still be fertile fields for linguistic fieldwork. See: Exploration .

 

Sources for Products

A wide range of contributing technologies is showcased at   LISA .

Examples of what can be achieved for two smaller languages are set out at   MEU,  and  Everson Gunn .

 

Things to Watch Out for

Since the majority of internet users is predicted to be using languages other than English by early 2001, the World Wide Web will soon be an important field of challenge for providers of English-based software products.

 

If you'd like to learn more about the potential of this technology, from an experienced but completely impartial source, it's time you got in touch with  Linguacubun Ltd  itself.



Linguacubun Ltd. Batheaston Villa, Bailbrook Lane, Bath BA1 7AA UK Tel:+44(0)1225 852865 Fax: +44(0)1225 859258