SPLIT Split auxiliaries
Definition
Adverb or other material inserted between auxiliary and main verb (e.g., “has never seen”, “will probably go”, “is always talking”).
Parent
Detection Rules
Auxiliary + adverb + main verb.
semgrex{pos:/VB.*/}=verb >aux {}=aux . {pos:RB}=adv
pybiber
pybiber anchors on auxiliary token (dep_rel contains “aux”), then checks forward for ADV + VERB (or ADV + ADV + VERB). Uses UPOS tags (ADV, VERB) not fine-grained Penn tags.
p1
cql[dep="aux.*"] [upos="ADV"] [upos="VERB"]
p2
cql[dep="aux.*"] [upos="ADV"] [upos="ADV"] [upos="VERB"]
combine: p1 | p2
Normalization
Per finite_verbs
Examples
I would actually drive.
Source: le_foll_2024
You can just so tell.
Source: le_foll_2024
I can’t ever imagine arguing with Jill.
Source: le_foll_2024
Sources
- biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
- mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
- pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
- xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes
Notes
D4 loading .44 (persuasion). MFTE merges with split infinitives (SPINF).