# WORDCOUNT YAML source

code: WORDCOUNT
mfte_code: Words
name: Word count
definition: >-
  Total number of words in the text, counted as non-punctuation tokens. Used as
  the denominator for per-word normalisation of feature counts.

normalization: none
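# Illustrative arithmetic only (hypothetical figures, not taken from MFTE):
# WORDCOUNT itself is left unnormalised because it is the denominator for
# other features. Assuming plain division, a feature that occurs 12 times
# in a text with WORDCOUNT = 400 has a per-word rate of 12 / 400 = 0.03.
# The scale actually reported (per word, per 100 words, etc.) depends on
# the tool's settings.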

detection:
  - source: mfte
    requires:
    - word
    - upos
    cql: '[word=".*" & upos!="PUNCT"]'
    description: All non-punctuation tokens (MFTE counts words, not punctuation)
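    # Illustrative example only (the sentence and UPOS tags are assumed for
    # demonstration and do not come from MFTE output):
    #   The/DET cat/NOUN sat/VERB on/ADP the/DET mat/NOUN ./PUNCT
    # The query above matches every token except "./PUNCT", so this
    # sentence would contribute 6 to WORDCOUNT.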

sources:
  - mfte

notes: >-
  A fundamental text property rather than a linguistic feature. Most taggers
  report it; MFTE outputs it in the "Words" column.