Repository | Book | Chapter

231772

(2018) Taming the corpus, Dordrecht, Springer.

Morphological richness of text

Radek Čech, Miroslav Kubát

pp. 63-77

This study proposes a method for measuring the morphological richness of text. The method enables us to characterize the morphological complexity of a text (or a corpus). It is based on a computation of the difference between two measurements — the vocabulary richness of lemmas and the vocabulary richness of word forms. The greater the difference, the higher the morphological complexity of a text. The Moving Average Type Token Ratio (MATTR) is used for the computation of vocabulary richness. We hypothesize that the proposed indicator, known as Moving Average Morphological Richness (MAMR), should reflect the style of a text, and could therefore be used in stylometry. To verify this assumption, MAMR is applied in analyses of both genre and authorship.

Publication details

DOI: 10.1007/978-3-319-98017-1_4

Full citation:

Čech, R. , Kubát, M. (2018)., Morphological richness of text, in M. Fidler & V. Cvrček (eds.), Taming the corpus, Dordrecht, Springer, pp. 63-77.

This document is unfortunately not available for download at the moment.