ProZ.com global directory of translation services
 The translation workplace
Thread poster: bhavaniprakash
How to convert scanned text in jpg format into editable text
bhavaniprakash
English
Jul 5, 2006

How can I convert a scanned image of printed text (a JPG file) into a Word file that can be edited? I tried SimpleOCR and other programs, but they often recognize one word as a different word. Please suggest a solution, with download links for free software.

[Subject edited by staff or moderator 2006-07-05 12:20]



Samuel Murray  Identity Verified
Netherlands
Local time: 09:46
Member (2006)
English to Afrikaans
+ ...
GOCR Jul 5, 2006


bhavaniprakash wrote:
I tried in SimpleOCR software and others...


AFAIK SimpleOCR uses an old version of GOCR as its OCR engine. However, there is a newer version of GOCR available, which you can run from the command line. It is better than nothing, but it is still nowhere near the quality you'd get from paid products.

GOCR:
http://jocr.sourceforge.net/
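
For anyone who wants to script the command-line version, here is a minimal Python sketch rather than a tested recipe. It assumes the `gocr` binary is installed and on your PATH, and that the scan has already been saved in a format GOCR reads directly, such as PNM (whether JPG works may depend on extra helper tools being installed). The function names and file names are made up for illustration.

```python
import shutil
import subprocess


def build_gocr_command(image_path, text_path):
    """Return the argument list for one GOCR run.

    -i names the input image, -o names the file that receives the text.
    """
    return ["gocr", "-i", image_path, "-o", text_path]


def run_gocr(image_path, text_path):
    """Run GOCR if it is installed; return True on success, False otherwise."""
    if shutil.which("gocr") is None:
        # Assumption: gocr is expected on PATH; nothing is run without it.
        return False
    result = subprocess.run(build_gocr_command(image_path, text_path))
    return result.returncode == 0


if __name__ == "__main__":
    # "scan.pnm" and "scan.txt" are hypothetical file names.
    ok = run_gocr("scan.pnm", "scan.txt")
    print("text written to scan.txt" if ok else "gocr missing or failed")
```

The recognized text lands in a plain text file, which you can then open in Word and correct by hand.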



Fernando Toledo  Identity Verified
Germany
Local time: 09:46
Member (2005)
German to Spanish
Trial Jul 5, 2006

The trial version gives you a few days to try it, so why not?

Finereader
http://download.abbyy.com/content/default.aspx?visitor=2effd83d-9e63-4ffd-a35d-02b41ce8d445&choose=en&x=16&y=13



Carvallo
Mexico
Local time: 02:46
Member (2006)
English to Spanish
Convert JPG into TIFF format... Jul 5, 2006

And then use Microsoft Office Document Imaging.
The rest is easy.



Zsanett Rozendaal-Pandur  Identity Verified
Hungary
Local time: 09:46
Dutch to Hungarian
+ ...
ABBYY Jul 5, 2006

I use ABBYY, the one that Fernando also suggested. I ended up buying it after the trial period because I found it superior to other software such as SimpleOCR. I had an image with a fairly low resolution that SimpleOCR turned into gibberish, and ABBYY read it all beautifully. I'm very impressed with it so far. If converting images to text is something you have to do often, it's worth buying; it will save you a lot of time.


Samuel Murray  Identity Verified
Netherlands
Local time: 09:46
Member (2006)
English to Afrikaans
+ ...
Which version of MS Office...? Jul 6, 2006


Tadzio wrote:
And then use your Microsoft Office Document Imaging.


Which version of MS Office are you on about?



Carvallo
Mexico
Local time: 02:46
Member (2006)
English to Spanish
Either 2002 or 2003 versions... Jul 6, 2006

Both Microsoft Office 2002 and 2003 include the Document Imaging application.
This is how you open it:
Start > Programs > Microsoft Office > Microsoft Office Tools > Microsoft Office Document Imaging.

Be sure you have already converted the JPG file to TIFF. Once you are in "MS Office Document Imaging", open the File menu and choose "Import" to load the TIFF document. Then choose "Recognize text using OCR", and finally send the result to a Word document.
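
If you have many scans, the JPG-to-TIFF step can be scripted. The following is only a sketch: it assumes ImageMagick 7 is installed and provides the `magick` command (older versions used `convert` instead), and the function names and file name are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def tiff_path_for(jpg_path):
    """Return the same path with a .tif extension, e.g. scan.jpg -> scan.tif."""
    return Path(jpg_path).with_suffix(".tif")


def convert_to_tiff(jpg_path):
    """Convert a JPG to TIFF with ImageMagick; return the TIFF path or None."""
    # Assumption: ImageMagick 7 is installed and "magick" is on PATH.
    tool = shutil.which("magick")
    if tool is None:
        return None
    target = tiff_path_for(jpg_path)
    # ImageMagick picks the output format from the target file extension.
    result = subprocess.run([tool, str(jpg_path), str(target)])
    return target if result.returncode == 0 else None


if __name__ == "__main__":
    # "scan.jpg" is a hypothetical file name.
    print(convert_to_tiff("scan.jpg"))
```

Any image editor that can "Save as" TIFF does the same job for a single file.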

Hope you make it!
Tadzio.



Tom Liska
Local time: 09:46
English to Czech
+ ...
freeware TopOCR Apr 17, 2008

http://www.topocr.com/download.html


