SPLIT Split auxiliaries

Xiao N100 MFTE SPLIT child of SPLIT_ALL

Definition

Adverb or other material inserted between auxiliary and main verb (e.g., “has never seen”, “will probably go”, “is always talking”).

Parent

SPLIT_ALL

Detection Rules

Auxiliary + adverb + main verb.

semgrex{pos:/VB.*/}=verb >aux {}=aux . {pos:RB}=adv

Requires: pos, dep

pybiber

pybiber anchors on auxiliary token (dep_rel contains “aux”), then checks forward for ADV + VERB (or ADV + ADV + VERB). Uses UPOS tags (ADV, VERB) not fine-grained Penn tags.

p1
cql[dep="aux.*"] [upos="ADV"] [upos="VERB"]
p2
cql[dep="aux.*"] [upos="ADV"] [upos="ADV"] [upos="VERB"]
combine: p1 | p2

Requires: pos, upos, dep

Normalization

Per finite_verbs

Examples

I would actually drive.

Source: le_foll_2024

You can just so tell.

Source: le_foll_2024

I can’t ever imagine arguing with Jill.

Source: le_foll_2024

Sources

  • biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
  • mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
  • pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
  • xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes

Notes

D4 loading .44 (persuasion). MFTE merges with split infinitives (SPINF).