How do I save a PDF-dictionary as Excel for use in MultiTerm?
Thread poster: Fredrik Pettersson

Fredrik Pettersson  Identity Verified
Hong Kong
Member (2009)
English to Swedish
+ ...
Jun 26, 2015

How do I save a PDF-dictionary as Excel for use in MultiTerm?

I managed to OCR the whole PDF successfully so I can search each term. The page layout is 4 columns like this:

source term translation source term translation

Now I would like to move these terms into an Excel that I then can create a termbase of with MultiTerm Desktop. Have anyone tried this before, to create an Excel from a PDF wordlíst and then create a termbase?

I have Adobe Acrobat XI Pro that I tried to save directly as Excel, but it didn't work very well.


 

Radian Yazynin  Identity Verified
Local time: 20:30
Member (2004)
English to Russian
+ ...
If you want I will try to help you out with it Jun 26, 2015

Just send it to my email.
As far as I understand you need a 2-column xls file, right?

[Edited at 2015-06-26 18:54 GMT]


 

Miguel Carmona  Identity Verified
United States
Local time: 10:30
English to Spanish
... Jun 26, 2015

Hi Frederik,

Here is the process:

1) Make the 4-column format into a 2-column format
2) Save 2-column document as Excel file
3) Use MultiTerm Convert to convert Excel file into an XML file
4) Create termbase in MultiTerm
5) Import XML file into newly created termbase

Good luck


 

Fredrik Pettersson  Identity Verified
Hong Kong
Member (2009)
English to Swedish
+ ...
TOPIC STARTER
Here comes the PDF Jun 26, 2015

Thanks Radian and Miguel for the help. I'll try this, sounds like it should work. The current Excel result was completely messed up because of the four columns probably.

 

Fredrik Pettersson  Identity Verified
Hong Kong
Member (2009)
English to Swedish
+ ...
TOPIC STARTER
How do I make the PDF in 2-column format? Jun 27, 2015

Miguel,

I wonder if you could advice how to make the PDF into a 2-column format directly in Adobe Acrobat Pro XI? Is there a way I can edit the PDF in Adobe what regards the layout with columns?


 

Miguel Carmona  Identity Verified
United States
Local time: 10:30
English to Spanish
... Jun 29, 2015

Fredrik Pettersson wrote:

I wonder if you could advice how to make the PDF into a 2-column format directly in Adobe Acrobat Pro XI? Is there a way I can edit the PDF in Adobe what regards the layout with columns?


Fredrik,

You cannot do it directly in Acrobat Pro.

You need to get out of Acrobat Pro, and work in Word.

First, in Acrobat Pro save/export the 4-column file as a Word document, so you have an editable text file, just like any regular Word document.

Then, open the Word document, select the 3rd and 4th columns, cut them and paste them at the bottom of the 1st and 2nd columns, so you end up with a 2-column document.

This is the 2-column table you need to export to Excel (via copy and paste, etc.), so you can process it in MultiTerm Converter.

Good luck.

===================
Editted to add:

Also, depending on how you get the 4-column table in Word, you can export it as a CSV file and open it directly in Excel.

When you open the file in Word, what kind of separation do you have between columns? Tabs?

It might be even possible with some manipulation of the separation characters to make the CSV file open as a 2-column table in Excel.

[Edited at 2015-06-29 17:28 GMT]


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

How do I save a PDF-dictionary as Excel for use in MultiTerm?

Advanced search







BaccS – Business Accounting Software
Modern desktop project management for freelance translators

BaccS makes it easy for translators to manage their projects, schedule tasks, create invoices, and view highly customizable reports. User-friendly, ProZ.com integration, community-driven development – a few reasons BaccS is trusted by translators!

More info »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search