Searching through multiple PDF files
Thread poster: Pablo Bouvier

Pablo Bouvier  Identity Verified
Local time: 10:06
German to Spanish
+ ...
Sep 15, 2010

I wrote myself :

I have a lot of information in PDF format (dictionaries, glossaries, technical books, etc.) that I use as reference for my translations.

However, searching through all this material is very time-consuming. Can someone tell me whether there is a tool that can read multiple PDF files and open them, highlighting the information I am looking for, in one go?

Something like searching the Google Books libraries with Google...




Addendum: I would like to give you all a big thank-you for the insight into this issue. I will now try all these tools and decide which one best suits my needs.



[Edited at 2010-09-16 08:53 GMT]



Samuel Murray  Identity Verified
Netherlands
Local time: 10:06
Member (2006)
English to Afrikaans
+ ...
Wilbur Sep 15, 2010

Pablo Bouvier wrote:
Can someone tell me whether there is a tool that can read multiple PDF files and open them, highlighting the information I am looking for, in one go?


The desktop search tool "Wilbur" indexes PDF files after converting them (in the background) to plain text.

http://s3.amazonaws.com/redtree/wilbur/index.html
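
To illustrate the general idea of indexing text that has already been extracted from PDFs, here is a minimal Python sketch of a word index. It is not Wilbur's actual implementation, which is not described in this thread; the function names `build_index` and `search` and the sample documents are made up for illustration, and the PDF-to-text conversion step is assumed to have happened already.

```python
import re
from collections import defaultdict


def build_index(texts):
    """Map each lowercased word to the set of document names containing it.

    `texts` is a dict of {document name: plain text}. The text is assumed
    to have been extracted from the PDFs beforehand.
    """
    index = defaultdict(set)
    for name, text in texts.items():
        for word in re.findall(r"\w+", text.lower()):
            index[word].add(name)
    return index


def search(index, query):
    """Return the sorted names of documents containing every query word."""
    words = re.findall(r"\w+", query.lower())
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


# Hypothetical sample data standing in for converted PDFs.
docs = {
    "glossary_de_es.txt": "Getriebe: caja de cambios\nKupplung: embrague",
    "manual.txt": "Das Getriebe wird über die Kupplung getrennt.",
    "notes.txt": "Nothing about gearboxes here.",
}
index = build_index(docs)
print(search(index, "getriebe kupplung"))
```

Building the index once and querying it many times is what makes this kind of tool faster than opening each file for every search.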


Claudio Porcellana  Identity Verified
Italy
Searching through multiple PDF files Sep 15, 2010

Logiterm, and you'll have a powerful alignment tool too,
one that can even use PDFs for aligning

as an example, I was able to index ICD-9, which is mission impossible otherwise, and have it almost bilingual (with the number codes as a Rosetta stone)

a lot of huge EU PDFs
DOD Dictionary 2009
all REACH docs
and many, many others

and you can manage 3 kinds of docs
bilingual, even Trados TM converted
monolingual
multilingual

furthermore, Logiterm handles almost every kind of document, not only PDFs

Claudio

[Edited at 2010-09-15 22:06 GMT]



Hynek Palatin  Identity Verified
Czech Republic
Local time: 10:06
English to Czech
+ ...
Desktop Search Applications Sep 15, 2010

Pablo,

I think you need a "desktop search" application. You can try the following:

Google Desktop

Copernic Desktop Search

Windows Search for Windows XP/Vista
(I am not sure how good the PDF support is.)
There is a built-in indexing service in Windows 7.

There is also an older article on Lifehacker.

You could also consider converting your glossaries to a text format, which is much easier to search using many tools.
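
As a rough illustration of how easy plain text is to search, here is a minimal Python sketch that scans every .txt file under a folder for a term. The function name `search_folder` and the folder name `glossaries` are hypothetical, and the glossaries are assumed to have been converted to UTF-8 text files already.

```python
from pathlib import Path


def search_folder(folder, term):
    """Yield (file name, line number, line) for each line containing `term`.

    Matching is case-insensitive. Bytes that are not valid UTF-8 are read
    as replacement characters instead of stopping the search.
    """
    needle = term.casefold()
    for path in sorted(Path(folder).rglob("*.txt")):
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line.casefold():
                yield path.name, number, line.strip()


if __name__ == "__main__":
    # "glossaries" is a hypothetical folder name; point it at your own files.
    for name, number, line in search_folder("glossaries", "embrague"):
        print(f"{name}:{number}: {line}")
```

Each hit is reported with its file name and line number, so you can jump straight to the entry in a text editor.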

Hynek



Anna Sylvia Villegas Carvallo
Mexico
Local time: 03:06
English to Spanish
Acrobat Reader Pro... Sep 16, 2010

...can do the job in a simple way, provided the PDFs are on your PC.




Tomás Cano Binder, BA, CT  Identity Verified
Spain
Local time: 10:06
Member (2005)
English to Spanish
+ ...
Effective File Search Sep 16, 2010

In Spain we say "Cada maestrillo tiene su librillo", "Each little teacher has his little book". I very much like Effective File Search. It searches really fast in big directory structures and any kind of files.


Pablo Bouvier  Identity Verified
Local time: 10:06
German to Spanish
+ ...
TOPIC STARTER
Searching through multiple PDF files Sep 16, 2010

Tomás Cano Binder, CT wrote:

In Spain we say "Cada maestrillo tiene su librillo", "Each little teacher has his little book". I very much like Effective File Search. It searches really fast in big directory structures and any kind of files.


Indeed, the name of the application is promising.
Thanks for the info, Tomás!



tectranslate ITS GmbH
Local time: 10:06
German
+ ...
So THAT's why it didn't work for me! Sep 16, 2010

Tadzio Carvallo wrote:

...can do the job in a simple way, provided the PDFs are on your PC.


Just the other day I tried searching a whole network directory full of PDFs that I KNEW had the search term in them, with zero results. This was quite baffling to me, but next time I'll copy them over to my HDD first (which is of course a major nuisance and shouldn't be necessary, but if it works...)

Best regards,
Benjamin


FarkasAndras
Local time: 10:06
English to Hungarian
+ ...
EU pdfs? Sep 16, 2010

The question has pretty much been answered: get a good desktop search tool.
A somewhat knowledgeable friend says dtSearch is really great for this. Customizable search profiles, in-context hits in dtSearch's own window, search in many formats including PDF, etc. I haven't tried it though.

I'm not sure it makes much sense to collect many large EU PDFs though. In my experience, almost everything is also published in HTML, which is far better for our purposes. You can just grab two HTML files, autoalign them and go on with life. They match so closely you can just use autoaligned TMs without checking them.
For some reason, the treaties aren't easy to find in HTML, and the Council insists on producing its awful PDFs, but everything else is out there in HTML or at least DOC.
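
To show why closely matching HTML files are so convenient, here is a naive Python sketch that pairs the paragraphs of two pages by position. It is only an illustration: real alignment tools use smarter heuristics, the names `ParagraphExtractor`, `paragraphs` and `align` are made up, and the sample pages are invented rather than real EU texts.

```python
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collect the text of each <p> element, in document order."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []
        self._buffer = None  # None while outside a <p> element

    def _flush(self):
        if self._buffer is not None:
            # Join the pieces and collapse runs of whitespace.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.paragraphs.append(text)
        self._buffer = None

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._flush()  # an unclosed <p> ends where the next one starts
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "p":
            self._flush()

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)


def paragraphs(html):
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return parser.paragraphs


def align(source_html, target_html):
    """Pair paragraphs by position; refuse if the counts differ."""
    source, target = paragraphs(source_html), paragraphs(target_html)
    if len(source) != len(target):
        raise ValueError(f"paragraph counts differ: {len(source)} vs {len(target)}")
    return list(zip(source, target))


# Hypothetical sample pages, not real EU texts.
en = "<html><body><p>Article 1</p><p>This Regulation shall enter into force.</p></body></html>"
hu = "<html><body><p>1. cikk</p><p>Ez a rendelet hatályba lép.</p></body></html>"
for pair in align(en, hu):
    print(pair)
```

Positional pairing only works when both language versions have the same paragraph structure, which is why the sketch raises an error instead of guessing when the counts differ.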



