Deleting spaces in Chinese word documents
Thread poster: Iris Kleinophorst
Iris Kleinophorst  Identity Verified
Germany
Local time: 23:59
Chinese to German
+ ...
Oct 14, 2013

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Direct link Reply with quote
 

lbone  Identity Verified
China
Local time: 05:59
English to Chinese
+ ...
regex case by case Oct 14, 2013

I think this is a work that needs the time by some person rather than by one or several expressions. Chinese characters are not easy to define simply by a regular expression (standard regular expressions are not supported by Microsoft Word), and sometimes blank spaces are meaningful such as in the title:

  第十四回 林如海捐馆扬州城 贾宝玉路谒北静王

Besides Chinese characters, spaces, digits, English letters and common English punctuation marks, there are also Korean/Japanese characters, non-standard/double-byte symbols. You will need to judge and handle spaces involved separately.

Iris Kleinophorst wrote:

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Direct link Reply with quote
 

Frank Lin  Identity Verified
China
Local time: 05:59
English to Chinese
+ ...
wildcards Nov 2, 2013

Iris Kleinophorst wrote:

Hi

does anyone know a tool, function or regex to delete unnecessary spaces between Chinese characters resp. Chinese characters and Arabic numbers/Latin letters, e.g. in scanned PDF files? That is, a document with numerous other expressions where the spaces have to be kept, so that search and replace of spaces in Word does not work?

TIA
Iris


Finding and replacing characters using wildcards.


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Deleting spaces in Chinese word documents

Advanced search






Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »
BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search