Deleting spaces in Chinese word documents
Thread poster: Iris Kleinophorst

Iris Kleinophorst
Germany
Chinese to German
Oct 14, 2013

Hi

Does anyone know of a tool, function or regex to delete unnecessary spaces between Chinese characters, or between Chinese characters and Arabic numerals/Latin letters, e.g. in text from scanned PDF files? The document also contains numerous other expressions where the spaces have to be kept, so a plain search and replace of spaces in Word does not work.

TIA
Iris


 

lbone
China
Member (2006)
English to Chinese
regex case by case Oct 14, 2013

I think this is a job that needs a person's time rather than one or several expressions. Chinese characters are not easy to define with a simple regular expression (and Microsoft Word does not support standard regular expressions), and sometimes blank spaces are meaningful, as in this title:

  第十四回 林如海捐馆扬州城 贾宝玉路谒北静王

Besides Chinese characters, spaces, digits, English letters and common English punctuation marks, there are also Korean/Japanese characters and non-standard/double-byte symbols. You will need to judge and handle the spaces involved separately.
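
If the text can be taken out of Word (for example saved as plain text), a rough first pass could look like the Python sketch below. It is only a starting point under some assumptions: "Chinese character" is taken to mean the basic CJK Unified Ideographs block plus Extension A, the spaces are ASCII or full-width spaces, and the function name strip_cjk_spaces is made up for this example.

```python
import re

# Assumption: only these two Unicode blocks count as "Chinese characters".
# Kana, Hangul, CJK punctuation and rarer extension blocks are NOT covered.
HAN = r"\u4e00-\u9fff\u3400-\u4dbf"

# One or more spaces (ASCII or full-width U+3000) between two Han characters.
HAN_HAN = re.compile(rf"(?<=[{HAN}])[ \u3000]+(?=[{HAN}])")

# Spaces between a Han character and an ASCII letter/digit, in either order.
HAN_ALNUM = re.compile(
    rf"(?<=[{HAN}])[ \u3000]+(?=[0-9A-Za-z])"
    rf"|(?<=[0-9A-Za-z])[ \u3000]+(?=[{HAN}])"
)


def strip_cjk_spaces(text: str) -> str:
    """Remove spaces next to Han characters; keep spaces between Latin words."""
    text = HAN_HAN.sub("", text)
    return HAN_ALNUM.sub("", text)


if __name__ == "__main__":
    print(strip_cjk_spaces("第 14 回 中 文 test case 字"))
```

Note that this would also collapse the meaningful spaces in a title like the one above, so the result still has to be checked by a person, or such lines have to be excluded before running it.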



 

Lawrence Lam
China
English to Chinese
wildcards Nov 2, 2013

Try finding and replacing characters using wildcards in Word.
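
One thing to watch with a capture-group style replace (find two characters with a space between them, put back the two characters) is that a single pass misses every second space, because the right-hand character of one match cannot serve as the left-hand character of the next. The Python sketch below only imitates that behaviour to show why the replace has to be repeated until nothing changes; the character range and the name replace_until_stable are assumptions for illustration, not Word syntax.

```python
import re

# Assumption: basic CJK Unified Ideographs only; a single ASCII space between.
PAIR = re.compile(r"([\u4e00-\u9fff]) ([\u4e00-\u9fff])")


def replace_until_stable(text: str) -> str:
    """Repeat the group-based replace until a pass changes nothing."""
    while True:
        new_text = PAIR.sub(r"\1\2", text)
        if new_text == text:
            return new_text
        text = new_text


if __name__ == "__main__":
    sample = "中 文 字 ok go"
    print(PAIR.sub(r"\1\2", sample))     # single pass leaves one space behind
    print(replace_until_stable(sample))  # repeated passes remove both
```

Spaces between Latin words are never matched by this pattern, so they stay in place.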


 

