Text analysis tool
Thread poster: Barnaby Capel-Dunn
I'm not quite sure what use you could put this tool but it's certainly intriguing. "Textalyser ( http://textalyser.net/ ) allows you to analyze either blocks of text or entire Web pages.
Either specify an URL or enter a block of text. I entered The Gettysburg Address. You can also specify the minimum number of character to constitute a word, number of words to analyze, whether to ignore numbers (this one is on by default) and whether your query (if a Web site) should be logged (there's a "recent queries" list.)
The results give you the number of words total, number of unique words, two different estimates of difficulty, a sentence count, the longest and shortest sentences, and other details. There's also a word frequency count, a word length (in characters) count, and a syllable count. "
See what you think!
| | xxxLia Fail
Local time: 14:25
Spanish to English
A really interesting site.
Text analysis is the stuff of corpus linguistics. There are/were a couple of similar sites around but they seemed to come and go.
Problem is, although it's fascinating, I haven't found a real use for corpus linguistics in translation. It doesn't help DO the job, it's all about analyising text beforehand and real-life translators don't usually have the time for this.
Nonethless, a site like the one you gave us could be useful for highlighting the key topic and key words of a text.
The key words could be used to search for similar texts on the WWW. One might also be able to highlight 'candidate terms' according to the frequency (one approach is a stop list, which excludes non-content words like the, a, it's etc, but I don't this site offers this option).
Thanks again, it's going to be fun:-)
Now, textalyser includes stoplists,
just french and english for now, but If You have other Suggestions or queries, the are welcome (as for example other stoplists)
To report site rules violations or get help, contact a site moderator:
You can also contact site staff by submitting a support request »