This software scans one or more text or text-like files (e.g., HTML and XML files) and counts the occurrences of the different words or phrases; pattern matching can be used. There is no limit on the size of an input file. The words found can be displayed alphabetically or by frequency. The program can be told to allow or disallow words with numerals, hyphens, apostrophes, underscores or colons, to ignore words which are short or which occur infrequently, and to ignore words (e.g., common words such as 'the') contained in a specified file. It can also be told to count only words or phrases in a specified list. Results can be written to an output file, which can be read into a spreadsheet such as Excel. The program can also automatically create an Excel-readable file of words/phrases versus files. It may be used with text in French, German, Italian, Spanish, Portuguese and other languages.
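To make the counting options described above concrete, here is a minimal Python sketch of the general technique. It is not the program's actual implementation. The function names (`count_words`, `by_frequency`, `alphabetical`) and parameters (`stop_words`, `min_length`, `min_count`, `allow_chars`) are hypothetical, chosen only to mirror the options listed: allowed in-word characters, a minimum word length, a minimum frequency, and a list of words to ignore.

```python
import re
from collections import Counter


def count_words(text, stop_words=frozenset(), min_length=1, min_count=1,
                allow_chars="'-"):
    """Count words in `text` (illustrative sketch; all names are hypothetical).

    allow_chars: characters permitted inside a word (e.g. apostrophe, hyphen).
    stop_words:  lowercase words to ignore, such as 'the'.
    min_length:  ignore words shorter than this.
    min_count:   ignore words occurring fewer times than this.
    """
    # [^\W\d_] matches any Unicode letter, so accented text (French,
    # German, Spanish, ...) is handled without extra configuration.
    letters = r"[^\W\d_]+"
    if allow_chars:
        pattern = rf"{letters}(?:[{re.escape(allow_chars)}]{letters})*"
    else:
        pattern = letters
    words = (w.lower() for w in re.findall(pattern, text))
    counts = Counter(
        w for w in words if len(w) >= min_length and w not in stop_words
    )
    return {w: n for w, n in counts.items() if n >= min_count}


def by_frequency(counts):
    """Most frequent first; ties are broken alphabetically."""
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def alphabetical(counts):
    """Sort (word, count) pairs by word."""
    return sorted(counts.items())


if __name__ == "__main__":
    sample = "The cat's hat; the cat sat. Très bien, très bien!"
    result = count_words(sample, stop_words={"the"}, min_length=3)
    for word, n in by_frequency(result):
        print(f"{word}\t{n}")
```

The tab-separated lines printed by the usage example can be redirected to a file and opened in a spreadsheet, which is one simple way to get the kind of Excel-readable output mentioned above.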
JFreq is a tool that takes plain text documents and turns them into a word frequency matrix. JFreq tries to be quick and not to take too much memory. It could be better at both, but it's quite usable. The plain text files can be added directly, or b
Tabiteuea is a useful application for users who want to analyze a text corpus and calculate word frequency. The user can create multiple projects and select the TXT or XML files that contain the text. The program can calculate the freque
cb's Japanese Text Analysis Tool is designed to help you analyze Japanese text files and generate reports on the word frequency, the kanji frequency and readability. cb's Japanese Text Analysis Tool uses two parsing methods: MeCab and JParser. It su
Reckoning is a lightweight, easy-to-use application that lets you find the number of words and their frequency in any given text. There are two ways of submitting a text for analysis, namely by loading a TXT file or by pasting the te
This software scans a file, or text on the clipboard, and counts the occurrences of different words. What counts as a 'word' is user-definable. The words found can be displayed alphabetically or by frequency. Non-English text is supported as well.