TTR Type-token ratio
Definition
Ratio of distinct word types to total word tokens, calculated over first 400 words. Measures lexical diversity.
Detection Rules
Type-token ratio. Number of unique word types divided by total tokens. Not a pattern match.
Normalization
Per words
Sources
- biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
- mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
- pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
- xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes
Notes
MFTE calculates over first 400 words to control for text length.