Unicode and ANSI troubles
Thread poster: elm0505

elm0505
Spain
Local time: 13:10
French to Spanish
Aug 24, 2011

Hello everyone
I am translating texts from Spanish into French and Portuguese. Some of the source texts contain special characters from other languages, such as Turkish and Hungarian. Following recommendations (and an alert message when saving the texts), I chose Unicode to keep the special characters. The problem comes up when I open the project in OmegaT: the original text is riddled with strange symbols, and the characters are oddly spaced. This doesn't happen if I save the text in ANSI encoding, although some special characters and accents then disappear.

Does anyone know how to solve this?


 

Dragomir Kovacevic  Identity Verified
Italy
Local time: 13:10
Italian to Serbian
OmegaT + UTF-8 exclusively Aug 24, 2011

With OmegaT, use UTF-8 encoding exclusively, so that all characters are guaranteed to display correctly.

You probably used UTF-16. In Windows, it is simply labelled "Unicode".
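
To illustrate why UTF-16 text looks "strangely spaced" when it is read as ANSI, here is a minimal Python sketch. It assumes that "ANSI" means the Windows-1252 code page (cp1252), which is common on Western European systems but is not stated in the thread; the sample strings are invented for the demonstration.

```python
import codecs

text = "año"  # sample text; any string behaves the same way

# What Windows calls "Unicode" when saving: UTF-16 little-endian with a
# byte order mark (BOM) at the start of the file.
utf16_bytes = codecs.BOM_UTF16_LE + text.encode("utf-16-le")

# Reading those bytes as ANSI (assumed here to be cp1252) turns the BOM into
# two stray symbols and leaves a NUL character after every letter, which is
# what shows up as odd symbols and extra spacing.
garbled = utf16_bytes.decode("cp1252")
print(repr(garbled))

# Saving as ANSI instead loses characters that cp1252 does not contain, such
# as Hungarian "ő" and Turkish "ş". Python substitutes "?" for them here;
# an editor may substitute something else.
lossy = "ő ş".encode("cp1252", errors="replace")
print(lossy)

# UTF-8 can represent all of these characters, so nothing is lost.
roundtrip = "ő ş ñ".encode("utf-8").decode("utf-8")
print(roundtrip)
```

The same text survives intact in UTF-8, which is why it is the safer choice for source files with mixed-language characters.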


 

Didier Briel  Identity Verified
France
Local time: 13:10
Member (2007)
English to French
What formats are your source texts? Aug 24, 2011

elm0505 wrote:
I am translating texts from Spanish into French and Portuguese. Some of the source texts contain special characters from other languages, such as Turkish and Hungarian. Following recommendations (and an alert message when saving the texts), I chose Unicode to keep the special characters. The problem comes up when I open the project in OmegaT: the original text is riddled with strange symbols, and the characters are oddly spaced. This doesn't happen if I save the text in ANSI encoding, although some special characters and accents then disappear.

I assume you use text files.

By default, OmegaT reads .txt files using the system encoding, which means ANSI under Windows.

If your files are UTF-16 (Unicode), you must configure OmegaT so that the extension you use (e.g., .utf16) is mapped to that encoding. This is done in Options > File Filters > Text Files > Edit..., and is documented (along with the concept of encoding) in the chapter "Working with plain text" of the documentation.

Didier
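
Before mapping an extension to an encoding, it helps to know which encoding a file is actually in. The sketch below is one possible way to check, outside OmegaT, by looking for a byte order mark at the start of the file. The function name `guess_encoding_from_bom` is hypothetical, and a file without a BOM (plain ANSI, or UTF-8 saved without one) cannot be identified this way.

```python
import codecs
from typing import Optional


def guess_encoding_from_bom(data: bytes) -> Optional[str]:
    """Return a Python codec name if data starts with a known BOM, else None."""
    # UTF-32 is checked first because its little-endian BOM begins with the
    # same two bytes as the UTF-16 little-endian BOM.
    if data.startswith((codecs.BOM_UTF32_LE, codecs.BOM_UTF32_BE)):
        return "utf-32"
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"
    # No BOM: the file may be ANSI or BOM-less UTF-8; this check cannot tell.
    return None


# Example: the first bytes of a file saved as "Unicode" on Windows.
sample = codecs.BOM_UTF16_LE + "texto".encode("utf-16-le")
print(guess_encoding_from_bom(sample))
```

To check a real file, read only its first few bytes in binary mode and pass them to the function.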


 

elm0505
Spain
Local time: 13:10
French to Spanish
TOPIC STARTER
Solved Aug 29, 2011

You were all right. I ended up naming the files manually with the .utf8 extension before saving, and it works now. Thank you all. Once again, Windows has been messing around!
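
For files that were already saved as "Unicode" (UTF-16), re-saving them one by one can be avoided with a small script. This is only a sketch: the function name `convert_to_utf8` and the file names are made up for the example, and it assumes the source really is UTF-16 with a BOM and that the .utf8 extension is read as UTF-8 by OmegaT, as the thread suggests.

```python
from pathlib import Path


def convert_to_utf8(source: Path, target: Path, source_encoding: str = "utf-16") -> None:
    """Read source in source_encoding and write the same text to target as UTF-8."""
    # Python's "utf-16" codec uses the BOM to pick the byte order and drops it.
    text = source.read_text(encoding=source_encoding)
    target.write_text(text, encoding="utf-8")


if __name__ == "__main__":
    # Hypothetical file names; adjust them to the files in the project's source folder.
    src = Path("source.txt")
    if src.exists():
        convert_to_utf8(src, src.with_suffix(".utf8"))
```

The original file is left untouched, so the conversion can be checked in a text editor before the UTF-16 copy is removed from the project.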

 

