Word Frequencies: Analyze Word Frequencies

The simplest function of MAXDictio determines the vocabulary of all of a current project’s texts.

This function can be accessed by navigating to the MAXDictio menu tab and clicking on the Word Frequencies icon.

The following dialogue window appears. Here you may select all the options you need.

Word Frequency options in MAXQDA 2018

Selection of texts to be analyzed

Only for activated documents – the frequencies procedure will be restricted to the activated text files

Only in retrieved segments – the frequencies procedure will be restricted to the coded segments actually displayed in the “Retrieved Segments” window

If neither option is selected, all text and table documents in the MAXQDA project will be analyzed.

Differentiation of results

None: The results table does not differentiate the results, providing only the totals over all analyzed texts.

By documents, document groups, document sets, focus group speakers: The results table contains additional columns that can be used to compare word frequency within individual documents, document groups, document sets, or focus group speakers (see Differentiation by Documents, Document Groups, Document Sets, and Focus Group Speakers). When the Only for activated documents option is selected, only activated documents within the document groups or document sets are taken into account, and only document groups or document sets containing activated documents will be analyzed.

By Codes: This option is available only if the analysis is restricted to the segments in the "Retrieved Segments" and a "Simple Coding Query" has been performed. The results table contains additional columns of recurring frequencies for each code that appears in the "Code System". This option is particularly helpful when texts have been divided into text units using codes for MAXDictio analysis, as it allows you to compare the word frequencies within different codes.

Ignore

In the "Ignore" section you can select various text elements that should not be taken into account, such as hyperlinks and e-mail addresses.

More options

Min. number of characters – words with fewer characters will be skipped.

Apply stop word list – If a stop word list is to be used, the corresponding box must be checked. Click on the button with the three dots to open and edit the stop lists.

Case sensitivity – If this setting is activated, "Give" and "give", for example, will be counted as different words. If the setting is inactive, all words will be displayed in lowercase in the results list.

Reduce words to base form (lemmatize) – when this box is checked, the identified words in the texts will be simplified to their word stems (lemmas) by using a lemma lexicon in the chosen language. For example, if a text contains the words “gave”, “given”, and “gives”, MAXDictio will list the base form “give” in the results table only.

Click OK, to begin the analysis of word frequencies. Depending on the size of the texts, this process may take a few moments. A display informs you about the progress of the analysis.

Recall last result once again

If you have already used the Word Frequencies function, you can click on the Word Frequencies label (not on the corresponding icon) in the MAXDictio menu tab and select Last Result: Word Frequencies from drop-down menu. This will display the last count results of the Word Frequency function without having to perform the count again.



Was this article helpful?