Machine Translation



 

Introduction

This is the oldest application of language technology, dating back to first experiments in Russian-English translation in the 1950s. It has had a chequered history, often attracting considerable government funding, but just as often blighted by discouraging reports about its long-term prospects. Nevertheless systems which have resulted from development over this long period are widely and intensively used in international organizations (such as the European Commission, the United Nations and the Panamerican Health Organization), and are now often freely available to casual users.

Machine translation is generally agreed to be best applied to generate rough drafts of translated documents. As such, it can assist translators to speed production; and it can also aid users directly when all that is required is a rough indication of the content of documents in unknown languages. It is currently in vogue, therefore, to provide access to web pages.

With the advent of memory-based translation, a new aid to translators has become available: they can store, and conveniently re-use, translations of words, phrases and paragraphs from previous work. This effects a massive saving of time, and gain in consistency, over the long term.

 

Where the Progress is Being Made

There are a number of approaches to machine translation. They are often presented as competitors, but probably all of them will make contributions to different aspects of translation performance.

  • Grammar-based machine translation.   This is the traditional approach, involving dictionary look-up and syntactical analysis. In the 1980s, it was pursued Europe-wide in the   EUROTRA   project, and the Japanese national programmes   MU  and   EDR .  It is capable of continuing gradual improvement, which is going on at the same centres.

  • Knowledge-based machine translation.   This is an extension of the grammar-based approach, but emphasizes the building of a coherent conceptual structure, or ontology, which helps to interpret a text, and so give a basis for choosing among translation-equivalents. Noted centres for this are CMU   and   NMSU   in the USA.

  • Statistically-based machine translation.  This approach is radically different, by-passing any explicit analysis of grammar, and using pre-existing bilingual texts to predict, on the basis of probability, what would be an appropriate translation for a given word in a given context. The approach has been pioneered in the USA, especially at  IBM .

  • Terminology.  Technical texts are an important area where translation is often necessary. However, they are often written in specialized language which is not available in bilingual dictionaries. (Automatic processes have been developed to identify these terms. See the   University of Surrey . )  It therefore becomes necessary, and cost-efficient, to compile special lists of terms and their equivalents in foreign languages. These lists are typically much longer than general-language dictionaries.

  • Memory-based machine translation.  This approach, like the statistically-based, relies on records of previous translation, but is deterministic, providing automatic access to translation equivalents which have already been selected in a translator's past work. It was first developed in   Kyoto University ,  but has since been take up as an approach within Europe, notably at   The Trados Company..

 

Sources for Products

There is a growing number of machine translation systems on the market. Some established names include  Trados,   Lernout & Hauspie,   Logos,   Systran,   and   Alpnet . There are also a large number of Japanese companies offering products. Evidently, choice must depend above all on which language-pairs a company's system will translate between. There is also a wide variety of prices (and hence system capabilities) to choose from.

A general list of MT products (including systems, dictionaries and terminologies) can be found at   Translation Tools .  If  help is needed in choosing amongst them, a number of companies provide assistance and evaluation studies.

 

Things to Watch Out for

  • Embarrassing mis-translations often derive from unexpected associations ("Please enter through the back side."), or particular idioms that happen to coincide with system output ("If this is your first time in Russia, you are welcome to it."). These can derive from any aspect of language, and so will never be eliminated. As a result, significant messages produced by a machine translation system always need to checked by people who know the language well.

  • The use of machine translation tools, resulting machine-aided translation, may be a cost-effective way to speed and lighten the task of human translators, reducing repetitive and time-consuming aspects of their work without compromising the reliability of their output.

  • Automatic interpreting, the spoken analogue of machine translation, is not yet a practical possibility. It is possible, however, to gain speedy access to human interpreters as needed. See for instance:   LanguageLine

     

    If you'd like to learn more about the potential of this technology, from an experienced but completely impartial source, it's time you got in touch with  Linguacubun Ltd  itself.



Linguacubun Ltd. Batheaston Villa, Bailbrook Lane, Bath BA1 7AA UK Tel:+44(0)1225 852865 Fax: +44(0)1225 859258