How do I 'save files as Unicode UTF-8'?
Thread poster: Fuad Yahya
Fuad Yahya  Identity Verified
Arabic
+ ...
Mar 30, 2003

A current job posting on ProZ states the following:

"Because the completed translation has to be placed within a complex database, it needs to be in a format called Unicode UTF-8. . .

Please send a current CV with a cover letter explaining your . . . knowledge and ability to save files as Unicode UTF-8."

As I understand it, Unicode is a standard that "provides a unique number for every character, no matter what the platform, no matter what the program, no matter what the language" (the Unicode web site).

If that is the case, then what does it mean to "save files as Unicode UTF-8," as this job posting demands? It is not as though Unicode UTF-8 is a file format, or is it? Does this job posting mean what it says, or is this just shorthand for something much more complicated? If so, can someone decipher it?

More to the point, what tools and techniques are called for to accomplish such a requirement?



Fuad



Clarisa Moraña  Identity Verified
Argentina
Local time: 04:33
Member (2002)
English to Spanish
+ ...
It's quite easy on my PC Mar 30, 2003

I don't know if this is the same on all PCs, but on mine, when you save a web page (File > Save As), a pop-up window prompts you for (a) the file name, (b) the file type (HTML, etc.), and (c) the encoding. One of the last options in the encoding list is Unicode (UTF-8).



Regards,



Clarisa Moraña

[ This Message was edited by: Clarisa on 2003-03-30 15:01]


Fuad Yahya  Identity Verified
Arabic
+ ...
TOPIC STARTER
I see the same in my browser. . . Mar 30, 2003

. . . but is this all they mean? It seems simple enough. I had the impression it was much more complicated. It shows my utter ignorance, doesn't it?

Would this mean that the files that the job poster intends to supply would have to be html files?




Jack Doughty  Identity Verified
United Kingdom
Local time: 08:33
Member (2000)
Russian to English
+ ...
Saving Word documents as Unicode UTF-8 Mar 30, 2003

I have never been asked to do this, but I have just been experimenting, and I find that I can save a Word document as an html file in Word, exit Word, find the file in Windows Explorer, open it there, and then I can save it with the encoding changed to Unicode UTF-8. Is this what they mean?
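
What this experiment amounts to is reading the text in its old encoding and writing it back out as UTF-8. A minimal Python sketch of that step follows. It assumes the original file used the Windows-1252 code page (an assumption; the thread does not say which code page was involved), and `reencode` is a hypothetical helper name, not part of any tool mentioned here.

```python
def reencode(data: bytes, source_encoding: str = "cp1252") -> bytes:
    """Decode bytes from a legacy code page and re-encode them as UTF-8."""
    # Assumption: the caller knows the source encoding; it cannot be
    # detected reliably from the bytes alone.
    text = data.decode(source_encoding)
    return text.encode("utf-8")


legacy = "café".encode("cp1252")   # one byte per character in Windows-1252
converted = reencode(legacy)

print(legacy)      # b'caf\xe9'
print(converted)   # b'caf\xc3\xa9'
```

The accented letter grows from one byte to two, which is why a file saved with the wrong encoding label shows garbled characters when it is opened as UTF-8.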




#41698 (LSF)
Malaysia
Local time: 15:33
Japanese to English
+ ...
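UTF-8 is an 8-bit encoding form of Unicode Mar 30, 2003

Unicode text is often stored in 16-bit units (UTF-16), that is, 2 bytes at a time. But some systems (including some e-mail systems) cannot handle that, so there are encoding forms built from single bytes, and UTF-8 is one of them: plain ASCII characters take 1 byte each, and other characters take 2 to 4 bytes. The name stands for 'UCS Transformation Format, 8-bit form'.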
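
To see how many bytes each encoding form actually uses per character, a short Python sketch can help. The sample characters are chosen only for illustration, and `utf-16-le` is used for the comparison so that no byte order mark is counted.

```python
def encoded_length(character: str, encoding: str) -> int:
    """Number of bytes the character occupies in the given encoding."""
    return len(character.encode(encoding))


samples = ["A", "é", "ع", "日", "😀"]

for character in samples:
    utf8 = encoded_length(character, "utf-8")
    utf16 = encoded_length(character, "utf-16-le")
    print(f"U+{ord(character):04X}: UTF-8 = {utf8} bytes, UTF-16 = {utf16} bytes")

# U+0041: UTF-8 = 1 bytes, UTF-16 = 2 bytes
# U+00E9: UTF-8 = 2 bytes, UTF-16 = 2 bytes
# U+0639: UTF-8 = 2 bytes, UTF-16 = 2 bytes
# U+65E5: UTF-8 = 3 bytes, UTF-16 = 2 bytes
# U+1F600: UTF-8 = 4 bytes, UTF-16 = 4 bytes
```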










xxxMarc P  Identity Verified
Local time: 09:33
German to English
+ ...
Unicode Mar 30, 2003

As you say, Unicode isn't a file format but a character encoding standard. To answer the question of whether you can supply your translation in Unicode UTF-8, you need to know what file format is required. If the translation is to be entered into a database, it is very likely to be needed as plain text in UTF-8, but the customer needs to confirm that first.
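
For the plain text case, "saving as UTF-8" just means choosing UTF-8 when the characters are turned into bytes on disk. A minimal Python sketch, in which the file name `translation.txt` and the helper `save_as_utf8` are hypothetical names used only for illustration:

```python
import tempfile
from pathlib import Path


def save_as_utf8(path: Path, text: str) -> None:
    """Write text to a plain text file, encoding it explicitly as UTF-8."""
    path.write_text(text, encoding="utf-8")


with tempfile.TemporaryDirectory() as folder:
    target = Path(folder) / "translation.txt"   # hypothetical file name
    save_as_utf8(target, "Señor Moraña")

    raw = target.read_bytes()
    print(len("Señor Moraña"), "characters,", len(raw), "bytes on disk")
    print(raw.decode("utf-8"))
```

Naming the encoding explicitly matters because the default used when none is given depends on the operating system and its settings.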



Marc



Clarisa Moraña  Identity Verified
Argentina
Local time: 04:33
Member (2002)
English to Spanish
+ ...
I don't know! Mar 31, 2003

Quote:


On 2003-03-30 15:04, FuadMYahya wrote:



Would this mean that the files that the job poster intends to supply would have to be html files?







I don't know. Sorry!





[ This Message was edited by: Clarisa on 2003-03-31 00:25]

Fuad Yahya  Identity Verified
Arabic
+ ...
TOPIC STARTER
Thank you very much Mar 31, 2003

. . . for enlightening me about this matter. Based on your input, I have sent an e-mail to Mondial (the agency that posted the job ad) asking them for clarification. I hope they reply. I will share with you any information I get. Thank you very much for helping.


Luca Tutino  Identity Verified
Italy
Local time: 09:33
Member (2002)
English to Italian
+ ...
I have this option under Word 2000 Mar 31, 2003

Shiong Fong Lew explained it all. Actually it should not be difficult.



In Word 2000 (9.0.28.12) I can Save as..., choose the file format "coded" .txt and select the option UTF-8.


Fuad Yahya  Identity Verified
Arabic
+ ...
TOPIC STARTER
In Word 2000, but not in Word 2002 Mar 31, 2003

Quote:


On 2003-03-31 01:26, ITS wrote:

In Word 2000 (9.0.28.12) I can Save as..., choose the file format "coded" .txt and select the option UTF-8.





I have two computers, one running Word 2000, the other running Word 2002 (Office XP). "Encoded .txt" is available in Word 2000, but I could not find it in Word 2002. I wonder if there is an add-on that I can install to give Word 2002 that capability. I would appreciate any information in this regard.



I have also found that Notepad offers several Unicode encodings, including UTF-8.
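
One thing to watch with Notepad's UTF-8 option: as far as I know, Notepad in these Windows versions puts a three-byte marker (the byte order mark, or BOM) at the start of the file, and some databases and scripts do not expect it. A small Python sketch for spotting and skipping it, with `has_utf8_bom` and `decode_utf8` as hypothetical helper names:

```python
import codecs


def has_utf8_bom(data: bytes) -> bool:
    """True if the bytes start with the UTF-8 byte order mark (EF BB BF)."""
    return data.startswith(codecs.BOM_UTF8)


def decode_utf8(data: bytes) -> str:
    """Decode UTF-8 bytes, dropping a leading BOM if one is present."""
    # The "utf-8-sig" codec skips the BOM when it is there and otherwise
    # behaves like plain "utf-8".
    return data.decode("utf-8-sig")


with_bom = codecs.BOM_UTF8 + "Fuad".encode("utf-8")

print(has_utf8_bom(with_bom))   # True
print(decode_utf8(with_bom))    # Fuad
```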



The agency has not responded to my query yet, so I still don't know what file format we are talking about.



Fuad

[ This Message was edited by: FuadMYahya on 2003-03-31 17:15]

Fuad Yahya  Identity Verified
Arabic
+ ...
TOPIC STARTER
The agency has confirmed the information Apr 1, 2003

I want to thank everybody for helping me. The agency has responded to my query and confirmed that the files are indeed in html.



I have also noticed that in FrontPage, you can specify the encoding of the page you are authoring in the Properties dialog.
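
For HTML files, the encoding chosen in such a dialog normally shows up as a `charset` value in a `<meta>` tag near the top of the page, and it should agree with the encoding the file was really saved in. A rough Python sketch for reading that declaration; the pattern is a simplification for illustration rather than a full HTML parser, and `declared_charset` is a hypothetical helper name:

```python
import re
from typing import Optional

# Simplified pattern: finds charset=... inside a <meta> tag, with or
# without quotes around the value.
CHARSET_PATTERN = re.compile(
    r'<meta[^>]*charset\s*=\s*["\']?([\w-]+)', re.IGNORECASE
)


def declared_charset(html: str) -> Optional[str]:
    """Return the charset named in a <meta> tag, lowercased, or None."""
    match = CHARSET_PATTERN.search(html)
    return match.group(1).lower() if match else None


page = '<html><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"></head></html>'
print(declared_charset(page))   # utf-8
```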



Thanks again to everybody.



Valentina Pecchiar  Identity Verified
Italy
Member
English to Italian
+ ...
Unicode in Word XP May 6, 2003

Quote:




On 2003-03-31 17:14, FuadMYahya wrote:

"Encoded .txt" is available in Word 2000, but I could not find it in Word 2002. I wonder if there is an add-on that I can install to give Word 2002 that capability. I would appreciate any information in this regard.







Hi Fuad

Sorry to pop in so late, I just saw your post and tested. Here it comes:



In the Save As dialog, select Plain Text (.txt). The File Conversion window opens, and in it you can choose the text encoding (on the right-hand side). I have no special add-in on my PC (OS in English), and UTF-8 is available.
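
After saving this way, it is possible to double-check that the result really is UTF-8 before sending it off. A minimal Python sketch, assuming the file's bytes have already been read into memory; `is_valid_utf8` is a hypothetical helper name:

```python
def is_valid_utf8(data: bytes) -> bool:
    """True if the bytes can be decoded as UTF-8 without errors."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


print(is_valid_utf8("Moraña".encode("utf-8")))    # True
print(is_valid_utf8("Moraña".encode("cp1252")))   # False
```

Note that a file containing only plain ASCII characters passes this check whatever encoding was selected, so the test is only meaningful for text with accented or non-Latin characters.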



HTH



Fuad Yahya  Identity Verified
Arabic
+ ...
TOPIC STARTER
Better Late Than Never May 6, 2003

Thank you, Muja. I tried it, and it worked, just as you said. I appreciate your help.


Valentina Pecchiar  Identity Verified
Italy
Member
English to Italian
+ ...
Happy to Help :-) May 7, 2003

Quote:


On 2003-05-06 19:23, FuadMYahya wrote:

Thank you, Muja. I tried it, and it worked, just as you said. I appreciate your help.




Arkady Vysotsky  Identity Verified
Local time: 10:33
English to Ukrainian
+ ...
If it is not too late May 9, 2003

There is a small but annoying glitch in Word 2000/XP. When converting to UTF-8, it mangles some special symbols, such as curly quotes, the TM and (C) signs, non-breaking spaces, and en and em dashes, converting them to ASCII characters. In the new Office 11 version, this bug seems to be fixed. For now, I use a small free utility, RWS Rainbow, to convert between encodings.
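
A quick way to check whether a conversion has flattened such symbols is to compare the text before and after. The Python sketch below is only an illustration: the character list mirrors the symbols named above, and `lost_characters` is a hypothetical helper, not a feature of Word or Rainbow.

```python
from typing import List

# Symbols named in the post above, keyed by character.
SPECIAL_CHARACTERS = {
    "\u2018": "left single quote",
    "\u2019": "right single quote",
    "\u201c": "left double quote",
    "\u201d": "right double quote",
    "\u2122": "trademark sign",
    "\u00a9": "copyright sign",
    "\u00a0": "non-breaking space",
    "\u2013": "en dash",
    "\u2014": "em dash",
}


def lost_characters(original: str, converted: str) -> List[str]:
    """Names of special characters found in the original but not after conversion."""
    return [
        name
        for character, name in SPECIAL_CHARACTERS.items()
        if character in original and character not in converted
    ]


before = "\u201cAcme\u201d\u2122 \u2014 2003"
after = '"Acme"(TM) - 2003'   # what an ASCII fallback might produce
print(lost_characters(before, after))
# ['left double quote', 'right double quote', 'trademark sign', 'em dash']
```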
