AWL Average word length

Definition

Mean number of characters per word token. A proxy for lexical sophistication and formality.

Detection Rules

Sum the character counts of all word tokens and divide by the number of word tokens. Not a pattern match: every word token must be visited and the average computed over the whole text.

Requires: word
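
The computation can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used by any of the sources listed in this entry. The regex tokenizer (`WORD_RE`) and the function name `average_word_length` are assumptions made for the example. A real pipeline would take its word tokens from the tagger, and the tagger's decisions on clitics, hyphens, and digits will shift the result.

```python
import re

# Assumption: a word token is a run of ASCII letters, optionally joined by
# internal apostrophes or hyphens ("don't", "well-known"). This is a
# stand-in for a tagger's own tokenization, not a reproduction of it.
WORD_RE = re.compile(r"[A-Za-z]+(?:['-][A-Za-z]+)*")


def average_word_length(text: str) -> float:
    """Return the mean number of characters per word token in `text`."""
    tokens = WORD_RE.findall(text)
    if not tokens:
        # No word tokens: return 0.0 rather than dividing by zero.
        return 0.0
    total_chars = sum(len(token) for token in tokens)
    return total_chars / len(tokens)


print(average_word_length("The cat sat"))  # 3.0
```

Punctuation is excluded from both the character total and the token count here. Internal apostrophes and hyphens are counted as characters, which is one of several defensible choices.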

Normalization

Per word token (the character total is divided by the number of word tokens)

Sources

  • biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
  • mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
  • pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
  • xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes
  • bohmann_2019 — Bohmann, Axel (2019) : Variation in English Worldwide: Varieties and Genres in a Quantitative Perspective