TagEditor segments words when opening a file converted from .pdf to .doc with Solid Converter PDF
Thread poster: Jorgen1977

Local time: 00:38
English to Norwegian (Bokmal)
+ ...
Mar 30, 2009

This has happened twice in no time, I've received a file converted from .pdf to .doc with Solid Converter PDF. So far so good, but when I open it in TagEditor, it somehow splits not only sentences, but even one word into several segments. Any solution to this one?

[Subject edited by staff or moderator 2009-03-30 17:01 GMT]

Direct link Reply with quote

Local time: 23:38
English to Slovenian
+ ...
Just a suggestion Mar 30, 2009

Since it's quite comfortable and reliable (depends on the size and complexity of the pdf file), I'm using Solid Converter PDF (v5) quite often, and it's best to deselect "Use custom character spacing to retain original layout" option, otherwise there will be numerous spaces and/or line breaks all over the file, producing exactly the type of file you described in your post - a rather useless and a very large one. (And TE starts new segment after a double space ...)

If it's possible, the client should (re)deliver a workable file, and if they can't provide it, the source files are always the best option (there are few recent posts stating this).


ps. If (although I doubt) I can help with conversion, you can send me an e-mail through my profile.

Direct link Reply with quote

Stanislav Pokorny  Identity Verified
Czech Republic
Local time: 23:38
English to Czech
+ ...
Remove paragraphs Mar 30, 2009

Hi Jorgen,
you should remove any excessive paragraph marks and soft breaks. The file should segment nicely then.

Direct link Reply with quote

To report site rules violations or get help, contact a site moderator:

You can also contact site staff by submitting a support request »

TagEditor segments words when opening a file converted from .pdf to .doc with Solid Converter PDF

Advanced search

Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

More info »
SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »

  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search