Deleting spaces in Chinese word documents
Thread poster: Iris Kleinophorst

Iris Kleinophorst  Identity Verified
Germany
Local time: 01:18
Chinese to German
+ ...
Oct 14, 2013

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Direct link Reply with quote
 

lbone  Identity Verified
China
Local time: 08:18
English to Chinese
+ ...
regex case by case Oct 14, 2013

I think this is a work that needs the time by some person rather than by one or several expressions. Chinese characters are not easy to define simply by a regular expression (standard regular expressions are not supported by Microsoft Word), and sometimes blank spaces are meaningful such as in the title:

  第十四回 林如海捐馆扬州城 贾宝玉路谒北静王

Besides Chinese characters, spaces, digits, English letters and common English punctuation marks, there are also Korean/Japanese characters, non-standard/double-byte symbols. You will need to judge and handle spaces involved separately.

Iris Kleinophorst wrote:

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Direct link Reply with quote
 

Frank Lin  Identity Verified
China
Local time: 08:18
English to Chinese
+ ...
wildcards Nov 2, 2013

Iris Kleinophorst wrote:

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Finding and replacing characters using wildcards.


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Deleting spaces in Chinese word documents

Advanced search






SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »
SDL Trados Studio 2017 Freelance
The leading translation software used by over 250,000 translators.

SDL Trados Studio 2017 helps translators increase translation productivity whilst ensuring quality. Combining translation memory, terminology management and machine translation in one simple and easy-to-use environment.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search