JPG to .doc converter?
Thread poster: Paula Hernández
I am trying to find a converter that changes images to text format. Does anyone know about such a program?
| Please use the forum search || May 20, 2006 |
This topic has been discussed numerous times here in forum. Try an advanced search with the relevant keywords here : http://www.proz.com/forumsearch and you will find plenty of useful advices
| ABBYY Fine Reader || May 20, 2006 |
I also use ABBYY Fine Reader. The trial version is more than enough. It came with an epson scanner that I bought for slides. I hate when they send you texts in jpg or in pdf. where you cannot select the text but the whole image. Depending on the quality of the text, of course, you will have to mend it after scanning it. E.g, m often becomes ni or viceversa, ll>ii, 8>B, etc. It may take some time, but a solution is agreeing an extra with the agency or just advertise the service with different quotes.
| | ViktoriaG
Local time: 05:21
English to French
| If you have a 4-in-1 machine... || May 20, 2006 |
Many of us don't realize, when we buy a 4-in-1 machine (copy-scan-fax-print) that the interface that comes with it already has a built-in OCR.
I agree that ABBYY is great - I think they are the market leader for OCR, as I see on the forums here that that's what everybody prefers. I also have a lite version of that, it came with my WordPerfect that came with my laptop. It seems that there are lots of demo and lite versions attached to programs and products...
The OCR that came with my Lexmark multitask machine is much like ABBYY and the documents produced are the same quality (maybe Lexmark got theirs from ABBYY), except that it doesn't ask for the document's language before processing it.
Tip: if you have a trial, demo or lite version of this or other OCR software and you have not used anything like this before, please make sure of one thing. Ususally, pictures copied from PDFs are very low resolution (96dpi). This is insufficient for ABBYY and most OCRs to extract text. What you may want to do is to open your image in a picture editor (there are some very neat free ones on the net), and change the resolution of the image to 300dpi. Also, it's best that at the same time, you change the image to JPEG format if it isn't already. Then, you can safely run the OCR on it and your document will most likely turn out near perfect (except for the above mentioned character misreadings and some formatting issues).
If you have formatting issues, instead of fiddling with formatting in the freshly OCRed document, copy all of its text and paste it as text (NOT as rich text). The formatting will go away, and you will start formatting with a clean slate (following the formatting of the original image document, of course).
I you want a real nice free image editor, go to http://www.freeserifsoftware.com/software/PhotoPlus/default.asp
[Edited at 2006-05-20 17:01]
| || || |
| | Ritu Bhanot
Local time: 11:21
French to Hindi
| | Paula Hernández
Local time: 10:21
English to Spanish
thanks guys, I'll try those and see which one I like the most^_^
| | Claudia Alvis
Local time: 05:21
| Free software - SimpleOCR || May 20, 2006 |
SimpleOCR is also a fantastic little program. It doesn't have as many features as ABBYY but it's 100% free and does a great job.
| | Elene P.
Local time: 14:21
Georgian to English
| | Brian Young
Local time: 02:21
Danish to English
| To OCR or not || Apr 29, 2008 |
Two of my recent jobs required translating from a pdf. On the first one, the outsourcer supplied the pdf, and an OCR'ed version in word, and asked me to proofread the OCR'ed version, which I did. Then I translated directly in Word, by splitting the screen into two panes, keeping the source text in the upper pane, and writing the translation in the lower screen.
The most recent translation was presented as eight pages of pdf text, and eight pages of old tax returns. I skipped the OCR business entirely. Again, I had both Word and Acrobat up, split across the screen. The pdf in the top, Word in the bottom. When I got to the tax returns I did them in Excel, and later pasted the Excel into Word. I delivered the finished document in both Word and a pdf, but never converted the original pdf. It is a lot of work to do that, and it may not be necessary. Translating a text is one job, and OCR'ing and proofing the original is another. It might be convenient for the outsourcer to get his original document converted to a digital file, and proofread as well, but that seems to me to be a scope of work beyond just the translation.
| || || |