How to extract images from Word files? (And put them back afterwards...)
Thread poster: Jabberwock

Jabberwock  Identity Verified
Poland
Local time: 22:56
Member (2004)
English to Polish
May 30, 2007

I remember that there was a topic considering extracting all embedded images from Word files and then putting them back after editing. However, I cannot find it...

Therefore, what is the preferred procedure/macro/tool for that?


Direct link Reply with quote
 
xxxOlaf
Local time: 22:56
English to German
You can link pictures through INCLUDEPICTURE field codes May 30, 2007

If you don't have many pictures you could simply replace them with INCLUDEPICTURE field codes:

To insert an INCLUDEPICTURE field, point to Picture on the Insert menu, click From File, click the arrow next to the Insert button, and then click Link to File.

You could then enable the display of field codes on the View tab of the Options dialog box to prevent the pictures from being displayed.

Olaf


Direct link Reply with quote
 

Tjasa Kuerpick  Identity Verified
Slovenia
Local time: 22:56
Member (2006)
Slovenian to German
+ ...
May be this helps May 30, 2007

I actually do not know any macro for that, but if there is really a need for an extraction you can either save the file (a copy of the file of course) as html, where while this process of conversion is running, all images are saved in an separated order as "image001.jpg, image002.jpg", and so on.
The problem is only that pictures will have a low resolution - which is not desired.

A second way is to save/transform the document as a pdf file, if you have a pdf converter, like Adobe. Adobe offers a possibility to extract all pictures from a pdf file, it does not even change the names, which is afterwards far more practical as you have less work by renaming all pics back to their original name, considering that the file has to be send back with the original names of the pictures. The good things is - the resolution is kept as in the original file.

There is a third way that is to extract them all one by one and safe them in Photo editor or Picture Publisher. But really, I would not do that, as it is too time extensive.


Direct link Reply with quote
 

Giles Watson  Identity Verified
Italy
Local time: 22:56
Italian to English
Is this the technique? May 30, 2007

Hi Jabberwock,

Perhaps you were thinking of this Trados Workbench procedure to get round the file bloat that afflicts Word files containing embedded images.

To translate texts with images and/or text boxes:

1. Create an empty TM
2. Tools/Analyse the file
3. Export unknown segments (99%) - save as RTF
4. Translate the RTF file (using other TMs if you like)
5. Tools/Cleanup the RTF into the empty TM
6. Tools/Translate the original file
7. Clean up into the TM of your choice.

It might be a good idea to print out a copy of the file beforehand, or open it on a second computer/monitor, so that you have an idea of the layout while you're working.

http://www.proz.com/post/354215#354215

HTH

Giles


Direct link Reply with quote
 

Jabberwock  Identity Verified
Poland
Local time: 22:56
Member (2004)
English to Polish
TOPIC STARTER
No, it's the images that need editing... May 30, 2007

No, I can deal with the text without problems... It's the images that need editing!

That is why I would like to have something to extract them easily. What is more important, I would like an easy way to get them back into Word files, when I'm done with them...


Direct link Reply with quote
 

Tony M  Identity Verified
France
Local time: 22:56
Member
French to English
+ ...
Formats? May 30, 2007

Jabberwock wrote:
It's the images that need editing!


I guess a lot will depend on what type / format of images they are, and how you have to edit them.

You could easily create a Word DOC with just the images in simply be deleting the text using search-&-replace, but maybe you can't edit them in a Word doc?

In any case, you've still got to find a way of recombining them with the text afterwards


Direct link Reply with quote
 

Giles Watson  Identity Verified
Italy
Local time: 22:56
Italian to English
SnagIt? May 30, 2007

Hi again,

Is this the application you were thinking of, then?

http://www.techsmith.com/snagit.asp

HTH

Giles


Direct link Reply with quote
 

Jabberwock  Identity Verified
Poland
Local time: 22:56
Member (2004)
English to Polish
TOPIC STARTER
Haven't thought of that... May 30, 2007

Nice tip, Olaf, haven't thought of that...

I can make a macro that will simply substitute the graphics with the code (after I extract them, of course). I have checked several of them and they are inline, so they might even stay where they are...


Direct link Reply with quote
 

Jabberwock  Identity Verified
Poland
Local time: 22:56
Member (2004)
English to Polish
TOPIC STARTER
Quite simple solution! May 30, 2007

The solution, turns out, was quite simple... I think I will describe it in detail in an article, here comes a short summary:

1. Save the Word document to xml.

2. Extract the images according to the procedure described here:

http://www.tkachenko.com/blog/archives/000195.html

i.e. run the tool nxslt on your xml file with the Word2HTML-.NET-script.xsl.

3. Open the xml file in a text/xml editor and change the picture links from internal to external, e.g.:

v:imagedata src="wordml://03000002.png"

to

v:imagedata src="test_files/03000002.png"

I used regular expression for this, but I suppose just changing

v:imagedata src="wordml://

to

v:imagedata src="test_files/

might work, too...


After that whenever you change the external graphics files, they get updated in the Word (xml) document. I think that for delivery the linked files would have to be embedded, but that is simple to do in Word (Edit/Links).

Of course, if you have a simpler solution for changing internal images into linked images, I would be glad to know...

[Zmieniono 2007-05-30 18:34]


Direct link Reply with quote
 

Giles Watson  Identity Verified
Italy
Local time: 22:56
Italian to English
You might want to look at this message... Jun 4, 2007

... from the Déjà Vu Yahoo group:

http://tech.groups.yahoo.com/group/dejavu-l/message/79618

which links you to some macros you can download from:

http://www.necco.ca/dv/word.htm#Removing_and_inserting_graphics_in_a_file

HTH

Giles


Direct link Reply with quote
 
freda_fan
English
Mercury Document System Jun 4, 2007

Mercury Document System
more at: http://www.qweas.com/download/business/info_managers/mercury_document_system.htm

It is a personal document management (PDM) / personal knowledge management (PKM) application for Microsoft Windows.


Direct link Reply with quote
 

Jabberwock  Identity Verified
Poland
Local time: 22:56
Member (2004)
English to Polish
TOPIC STARTER
Only the OpenOffice solution might work Jun 4, 2007

Unfortunately, these solutions are for people who want to edit only the text, not the other way round.

Of the three suggestions, only the second one might be used (the first one puts graphics in another Word file, the third one does not deal with graphics at all), as it allows editing the graphics outside of Word.

I will check it out!


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

How to extract images from Word files? (And put them back afterwards...)

Advanced search






Déjà Vu X3
Try it, Love it

Find out why Déjà Vu is today the most flexible, customizable and user-friendly tool on the market. See the brand new features in action: *Completely redesigned user interface *Live Preview *Inline spell checking *Inline

More info »
SDL MultiTerm 2017
Guarantee a unified, consistent and high-quality translation with terminology software by the industry leaders.

SDL MultiTerm 2017 allows translators to create one central location to store and manage multilingual terminology, and with SDL MultiTerm Extract 2017 you can automatically create term lists from your existing documentation to save time.

More info »



All of ProZ.com
  • All of ProZ.com
  • Term search
  • Jobs