Download text from website
Thread poster: Paul Sinfield

Paul Sinfield  Identity Verified
Local time: 00:32
French to English
Oct 16, 2009

Hi all

Does anyone know of any (preferably free!) tools out there for quickly downloading all text from a website? I have a translation job on and such a tool would really save time. I appreciate that the effectiveness of the tool may well depend on how the site is built but it's worth a go!

Many thanks

Paul


Direct link Reply with quote
 

Samuel Murray  Identity Verified
Netherlands
Local time: 01:32
Member (2006)
English to Afrikaans
+ ...
Offline browser Oct 16, 2009

Paul Sinfield wrote:
Does anyone know of any (preferably free!) tools out there for quickly downloading all text from a website?


The program you need is called an offline browser (for some weird reason). Google for "offline browser" to see a few of them. Here is one of them that I've used in the past: http://www.httrack.com/

Also see here:
http://www.proz.com/forum/software_applications/132076-software_used_to_extract_html_files_from_websites.html


Direct link Reply with quote
 

Gerard de Noord  Identity Verified
France
Local time: 01:32
Member (2003)
German to Dutch
+ ...
HTTrack Oct 16, 2009

Hi Paul,

HTTrack does what you want. You can download it for free at:
http://www.httrack.com/

The eternal caveat is that the HTML pages you dowload can very well have been created on the fly. You can only calculate a quote and translate dynamic websites when you receive all necessary files from from the webmaster.

Regards,
Gerard


Direct link Reply with quote
 

Paul Sinfield  Identity Verified
Local time: 00:32
French to English
TOPIC STARTER
Thanks all Oct 16, 2009

Many thanks for the suggestions. Will give it a try although I think the site has secured content as well which I doubt will download without full access. Guess I'll have to get on to the webmaster as you suggest.

Dankie / Bedankt

Paul


Direct link Reply with quote
 

Hynek Palatin  Identity Verified
Czech Republic
Local time: 01:32
English to Czech
+ ...
Ask the client Oct 16, 2009

Paul,

You should ask your client for the source HTML files. That way there will be no misunderstanding as to what should be translated. Downloading the files could be considered an extra service, but there is a risk of missing something or maybe translating too much.

If you download the files yourself, have your client confirm that this is exactly what they want to be translated.

Hynek


Direct link Reply with quote
 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Download text from website

Advanced search






PerfectIt consistency checker
Faster Checking, Greater Accuracy

PerfectIt helps deliver error-free documents. It improves consistency, ensures quality and helps to enforce style guides. It’s a powerful tool for pro users, and comes with the assurance of a 30-day money back guarantee.

More info »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search