Free open source translation memory software, OmegaT vs Anaphraseus

We focus on free translation memory (TM) tools. While CAT tools such as TRADOS and Wordfast are widely used in the translation industry, free, open source TM software can serve as an alternative to these mainstream tools.

We selected two programs, OmegaT and Anaphraseus, and analyzed their advantages and disadvantages.

OmegaT™

OmegaT is a CAT tool whose features include user-customizable regular expression segmentation, translation memory, fuzzy matching, match propagation, glossary matching, translation memory context search, and keyword searching in reference materials.

OmegaT uses TMX (Translation Memory eXchange), an international standard that most CAT tools can import and export, as its native translation memory format. TMX is an open XML standard that makes it easier for translators who use different tools to exchange TMs.
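To make the TMX format more concrete, here is a minimal Python sketch that reads source/target pairs from a small TMX document. The sample data and the function name `read_tmx` are made up for illustration. Only the basic `tu`/`tuv`/`seg` structure of TMX is assumed, and real files exported by a CAT tool may carry extra attributes and inline tags.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical sample data, not taken from a real project.
SAMPLE_TMX = """<?xml version="1.0" encoding="UTF-8"?>
<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1.0"
          segtype="sentence" o-tmf="example" adminlang="en"
          srclang="en" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en"><seg>Open the file.</seg></tuv>
      <tuv xml:lang="fr"><seg>Ouvrez le fichier.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en"><seg>Save the file.</seg></tuv>
      <tuv xml:lang="fr"><seg>Enregistrez le fichier.</seg></tuv>
    </tu>
  </body>
</tmx>
"""


def read_tmx(tmx_text, source_lang, target_lang):
    """Return a list of (source, target) segment pairs from TMX text."""
    root = ET.fromstring(tmx_text.encode("utf-8"))
    pairs = []
    for tu in root.iter("tu"):
        # Collect the text of each language variant in this translation unit.
        segments = {}
        for tuv in tu.findall("tuv"):
            lang = tuv.get(XML_LANG)
            seg = tuv.find("seg")
            if lang and seg is not None:
                segments[lang] = "".join(seg.itertext())
        # Keep only units that contain both requested languages.
        if source_lang in segments and target_lang in segments:
            pairs.append((segments[source_lang], segments[target_lang]))
    return pairs


for source, target in read_tmx(SAMPLE_TMX, "en", "fr"):
    print(source, "->", target)
```

Because the format is plain XML, any tool that can read this structure can reuse the same memory, which is what makes exchange between different CAT tools possible.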

Although it has fewer features than other CAT tools, OmegaT is fast at segmenting a source file, compiling target files (known as “cleanup”), and fuzzy matching. Earlier versions of OmegaT segmented the source file by paragraph rather than by sentence; now you can select segmentation at either the paragraph or the sentence level. With the Edit Behavior feature under Options, you can choose to display either the source text or the best partial match. This is a CAT tool worth trying.
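The difference between paragraph-level and sentence-level segmentation can be illustrated with a short Python sketch. This is not OmegaT's actual rule set. It assumes a deliberately simple rule (a sentence ends at ".", "!" or "?" followed by whitespace), and the function name `segment` is hypothetical.

```python
import re

# Assumed rule: a sentence ends at ".", "!" or "?" followed by whitespace.
SENTENCE_BREAK = re.compile(r"(?<=[.!?])\s+")
# Paragraphs are separated by one or more blank lines.
PARAGRAPH_BREAK = re.compile(r"\n\s*\n")


def segment(text, level="sentence"):
    """Split text into paragraph-level or sentence-level segments."""
    paragraphs = [p.strip() for p in PARAGRAPH_BREAK.split(text) if p.strip()]
    if level == "paragraph":
        return paragraphs
    segments = []
    for paragraph in paragraphs:
        # Split each paragraph further at the assumed sentence boundaries.
        segments.extend(s for s in SENTENCE_BREAK.split(paragraph) if s)
    return segments


text = "Open the file. Save it!\n\nClose the program."
print(segment(text, level="paragraph"))
print(segment(text, level="sentence"))
```

A rule this simple breaks on abbreviations such as "e.g.", which is one reason user-customizable segmentation rules are useful in practice.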

OmegaT includes the following features:

– Fuzzy matching

– Match propagation

– Simultaneous processing of projects from multiple files

– Simultaneous use of multiple translation memories

– External glossaries

– Document file formats include:

XHTML and HTML

Microsoft Office 2007

XML

OpenOffice.org/StarOffice

XLIFF (Okapi)

MediaWiki (Wikipedia)

Plain text

– Unicode (UTF-8) support: can be used with non-Latin alphabets

– Support for right-to-left languages

– Compatible with other translation memory (TMX) applications

Some third-party programs that can be used with OmegaT are OpenOffice.org, Rainbow, bitext2tmx, and Samuel Murray’s scripts and procedures.

Download site: http://www.omegat.org/en/downloads.html

Anaphraseus™

Anaphraseus is a CAT tool for creating, managing, and using bilingual translation memories. Its main features include text segmentation, terminology recognition, plain text TM (Unicode UTF-16), fuzzy search in the TM, UTF-16 TMX export/import, a user glossary, and packaging as an OpenOffice.org extension.

Anaphraseus runs as an extension to OpenOffice.org and works seamlessly with TMs built in Wordfast. You can install it using the Extension Manager under Tools in OpenOffice.org. During translation, Anaphraseus divides the text being translated into segments (sentences). When a segment is selected, Anaphraseus displays the closest match found in the TM.
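The idea of looking up the closest match in a TM can be sketched in a few lines of Python. This is an illustration only, not the algorithm Anaphraseus uses: it scores similarity with the standard library's `difflib.SequenceMatcher`, and the function name `best_match`, the sample memory, and the 0.7 threshold are all assumptions.

```python
from difflib import SequenceMatcher

# Hypothetical translation memory: (source, target) pairs.
MEMORY = [
    ("Open the file.", "Ouvrez le fichier."),
    ("Save the file.", "Enregistrez le fichier."),
]


def best_match(segment, memory, threshold=0.7):
    """Return (source, target, score) for the closest TM entry, or None."""
    best = None
    for source, target in memory:
        # ratio() returns a similarity between 0.0 and 1.0.
        score = SequenceMatcher(None, segment.lower(), source.lower()).ratio()
        if score >= threshold and (best is None or score > best[2]):
            best = (source, target, score)
    return best


match = best_match("Open the files.", MEMORY)
if match is not None:
    source, target, score = match
    print(f"{score:.0%} match: {source} -> {target}")
```

A segment with no entry above the threshold returns `None`, which corresponds to the translator starting that segment from scratch.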

As an extension of OpenOffice.org, it allows you to use all the functions of the word processor “on the fly”. When it comes to performance, it is slower than Wordfast, especially when handling large files. Anaphraseus works with translation memory in TMX format, which allows it to work with most CAT tools on the market.

Anaphraseus includes the following features:

– Text segmentation

– Recognition of terminology

– Plain text TM (Unicode UTF-16)

– Fuzzy lookup in the translation memory

– Export/import Unicode UTF-16 TMX

– User glossary

Since Anaphraseus is still in beta testing, it currently only works with plain text, ignoring all formulas, images, and similar objects. If you need a CAT tool that doesn’t cost you anything and works with OpenOffice.org, Anaphraseus is worth a try.

Download site: http://anaphraseus.sourceforge.net/

Professional translators talk about their experience with translation memories in the translator column of our monthly newsletter at http://www.synergy-focus.com

*List of translation jobs available in the newsletter
