Friday, November 14, 2014

Some additional answers about Xbench

At the ATA Conference in Chicago I gave a presentation on how to use Xbench for terminology management and translation QA (you can see and download the presentation from the Xbench tab in this blog).

I believe that the presentation was well received, and that most people found the program very useful, but I was stumped by a few questions. I've now inquired with the Xbench developers at ApSIC, and they have provided the missing information:

Q. Is Xbench compatible with languages that use non-Roman alphabets (e.g., languages that use the Cyrillic alphabet)?
A. Yes, Xbench 3.0 uses Unicode, and is therefore compatible with other alphabets.

Q. Is Xbench compatible with double-byte languages?
A. Xbench's compatibility with double-byte languages is quite good (Japan is ApSIC's largest customer base after Spain, and Korea is quite big as well, China is the country with most active users and downloads), but there are some caveats. Xbench does not have heuristics in place to identify words within a DBCS strings, so some features that rely on whole words identification do not work well (for example if Chinese is the source language in a key terms check).

Q. Is Xbench compatible with bi-directional languages?
A. With Xbench 3.0 build 1266 (the current build as of now), compatibility is still poor, but ApSIC is actively working to improve bi-directional compatibility.

Q. What are the size limits for files loaded in Xbench?
A. For the 32-bit version, there is a limit of 2GB per file (and a maximum for all files loaded of 2 or 4 GB). For the 64-bit version the limit is the available memory and available swap disk. ApSIC recommends installing the 64-bit version if you have a 64-bit Windows. The 64-bit version used to have a limitation of 2GB per file (however, with an unlimited number of files), but now that limitation has been lifted, and files in excess of 2GB should work.

Please note that all these answers refer to version 3.0 of Xbench (the commercial version of the program).


  1. Helped me a lot - thanks from Japan! ;-)

  2. The All uppercase check does not recognize double byte characters and outputs a mismatch even if the target has same characters in single byte. Can this have a solution as the report output is unnecessary and long to check.
    eg: TMI-2号機 Unit TMI-2: This will be show as an All uppercase mismatch.
    Please let me know if there is a solution to avoid this.


Thank you for your comment!

Unfortunately, comment spam has grown to the point that all comments need to be moderated. All legitimate comments will be published as soon as possible.