Problem with docx format
Thread poster: James McVay

James McVay  Identity Verified
United States
Local time: 18:02
Russian to English
+ ...
May 18, 2010

I just finished a translation using OmegaT on a docx Word file after converting it to ODT format using OpenOffice.org. When I generated the finished translation and tried to open it with OpenOffice.org I got the following error message: "Format error discovered in the file in sub-document content.xml at 2,15093(row, col)."

I was able to partially rescue the translation using WordPad -- down to where the error was located, I presume. I saved that as an RTF file, opened it in Word 2007, then manuallly copied and pasted the missing segments out of OmegaT. I had to do some reformatting by hand to get the format right, but luckily it wasn't a lengthy or complicated document.

That's the second time this has happened to me. The first time I tried working directly with a docx file in OmegaT. It can handle the format, but has problems with the tags. I'm about ready to give up on OmegaT, although there is much about it I like.

But my question is this: is there something I should be doing with docx files that I don't know about?


 

PCovs
Denmark
Local time: 00:02
Member (2003)
English to Danish
+ ...
Try saving it as .doc first May 18, 2010

I don't know anything about OmegaT, but Trados has issues with the .docx format, and it seems these can be sorted simply by first saving the document as .doc, then translating, then saving back as .docx, so I jus thought this might also be of help to you, because this copy/paste-thingy is really not an option in the long run.

Good luck.


 

James McVay  Identity Verified
United States
Local time: 18:02
Russian to English
+ ...
TOPIC STARTER
Good suggestion May 18, 2010

Maybe I'll give it one more shot. It means adding another step to my workflow, but it's a lot quicker than the copy/paste business.

 

Samuel Murray  Identity Verified
Netherlands
Local time: 00:02
Member (2006)
English to Afrikaans
+ ...
Have you validated the tags? May 18, 2010

James McVay wrote:
I just finished a translation using OmegaT on a docx Word file after converting it to ODT format using OpenOffice.org. When I generated the finished translation and tried to open it with OpenOffice.org I got the following error message: "Format error discovered in the file in sub-document content.xml at 2,15093(row, col)."


This may be caused by a tag mismatch. Another user had a similar problem:
http://tech.groups.yahoo.com/group/OmegaT/message/16262

This is what OmegaT's tag validation feature is for. In OmegaT, go to Tools > Validate Tags (or press Ctrl+T). It will then show you which segments have mismatched tags. Correct these errors and try to create the file again.

Was this the problem, does this help?


 

James McVay  Identity Verified
United States
Local time: 18:02
Russian to English
+ ...
TOPIC STARTER
Thanks, Samuel May 18, 2010

Good advice -- it worked. It's still a bit tedious, though. Maybe if I pay more attention to the tags as I go through the document it will go more smoothly.

 

Susan Welsh  Identity Verified
United States
Local time: 18:02
Member (2008)
Russian to English
+ ...
Docx how-to May 19, 2010

Did you read this? It's a "how-to" on the OmegaT site.

http://www.omegat.org/en/howtos/docx.html

I've had trouble with docx also, but have not had a job requiring it since I learned about this how-to.


 

James McVay  Identity Verified
United States
Local time: 18:02
Russian to English
+ ...
TOPIC STARTER
Thanks, Susan May 19, 2010

I have not seen that. I'll take a look at it before I begin another translation using OmegaT.

 


There is no moderator assigned specifically to this forum.
To report site rules violations or get help, please contact site staff »


Problem with docx format

Advanced search






SDL MultiTerm 2019
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2019 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2019 you can automatically create term lists from your existing documentation to save time.

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search