TTR Type-token ratio

Definition

The ratio of distinct word types to total word tokens, calculated over the first 400 words of a text. It measures lexical diversity.

Detection Rules

Computed rather than pattern-matched: the number of unique word types divided by the total number of word tokens.

Requires: word
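
The calculation can be sketched in a few lines of Python. This is a minimal illustration, not the MFTE or pybiber implementation: the function name type_token_ratio, the window parameter, the lowercasing step, and the regex tokenizer are all assumptions made for the example, and the cited tools may tokenize and count types differently.

```python
import re


def type_token_ratio(text, window=400):
    """Return unique types / total tokens over the first `window` word tokens.

    Hypothetical helper for illustration only.
    """
    # Assumption: case-insensitive matching, where a token is a run of
    # letters or digits, optionally with internal apostrophes (e.g. "don't").
    tokens = re.findall(r"[a-z0-9]+(?:'[a-z0-9]+)*", text.lower())

    # Truncate to the first `window` tokens so texts of different
    # lengths are compared on the same basis.
    tokens = tokens[:window]

    if not tokens:
        return 0.0  # Assumption: an empty text gets a ratio of 0.

    return len(set(tokens)) / len(tokens)
```

For example, type_token_ratio("the cat sat on the mat") has 5 types over 6 tokens. Texts shorter than the window are scored on however many tokens they contain, so very short texts are not directly comparable under this sketch.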

Normalization

Per words

Sources

  • biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
  • mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
  • pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
  • xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes

Notes

MFTE calculates the ratio over the first 400 words of each text to control for text length, because raw TTR tends to fall as texts get longer.