Download text from website
Thread poster: Paul Sinfield

Paul Sinfield  Identity Verified
Local time: 21:49
French to English
Oct 16, 2009

Hi all

Does anyone know of any (preferably free!) tools out there for quickly downloading all text from a website? I have a translation job on and such a tool would really save time. I appreciate that the effectiveness of the tool may well depend on how the site is built but it's worth a go!

Many thanks

Paul


 

Samuel Murray  Identity Verified
Netherlands
Local time: 22:49
Member (2006)
English to Afrikaans
+ ...
Offline browser Oct 16, 2009

Paul Sinfield wrote:
Does anyone know of any (preferably free!) tools out there for quickly downloading all text from a website?


The program you need is called an offline browser (for some weird reason). Google for "offline browser" to see a few of them. Here is one of them that I've used in the past: http://www.httrack.com/

Also see here:
http://www.proz.com/forum/software_applications/132076-software_used_to_extract_html_files_from_websites.html


 

Gerard de Noord  Identity Verified
France
Local time: 22:49
Member (2003)
German to Dutch
+ ...
HTTrack Oct 16, 2009

Hi Paul,

HTTrack does what you want. You can download it for free at:
http://www.httrack.com/

The eternal caveat is that the HTML pages you dowload can very well have been created on the fly. You can only calculate a quote and translate dynamic websites when you receive all necessary files from from the webmaster.

Regards,
Gerard


 

Paul Sinfield  Identity Verified
Local time: 21:49
French to English
TOPIC STARTER
Thanks all Oct 16, 2009

Many thanks for the suggestions. Will give it a try although I think the site has secured content as well which I doubt will download without full access. Guess I'll have to get on to the webmaster as you suggest.

Dankie / Bedankt

Paul


 

Hynek Palatin  Identity Verified
Czech Republic
Local time: 22:49
English to Czech
+ ...
Ask the client Oct 16, 2009

Paul,

You should ask your client for the source HTML files. That way there will be no misunderstanding as to what should be translated. Downloading the files could be considered an extra service, but there is a risk of missing something or maybe translating too much.

If you download the files yourself, have your client confirm that this is exactly what they want to be translated.

Hynek


 

Michael Beijer  Identity Verified
United Kingdom
Local time: 21:49
Member (2009)
Dutch to English
+ ...
I just found out about Omni-Crawler... Apr 25

See: https://www.omni-lookup.de/

Haven't tried it myself, but it's from the developer of Omni-Lookup (formerly known as Multifultor).

Michael


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

Download text from website

Advanced search






TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use SDL Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

More info »



Forums
  • All of ProZ.com
  • Term search
  • Jobs
  • Forums
  • Multiple search