SPLIT Split auxiliaries
Definition
Adverb or other material inserted between auxiliary and main verb (e.g., “has never seen”, “will probably go”, “is always talking”).
Parent
Detection Rules
Auxiliary + adverb + main verb.
semgrex{pos:/VB.*/}=verb >aux {}=aux . {pos:RB}=adv
mfte
MFTE surface pattern: (MD|DOAUX|have|be) + RB (not n’t/not) + V. Uses pos not cat for adverb since refinement happens in parallel.
md3
cql[pos="MD"] [pos="RB" & word!="n't|not"] [pos="VB.*"]
do3
cql[cat="DOAUX"] [pos="RB" & word!="n't|not"] [pos="VB.*"]
have3
cql[word="have|has|ve|had|having" & pos="VB.*"] [pos="RB" & word!="n't|not"] [pos="VB.*"]
be3
cql[word="be|am|is|are|was|were|been|being|m|re" & pos="VB.*"] [pos="RB" & word!="n't|not"] [pos="VB.*"]
md4
cql[pos="MD"] [pos="RB"] [pos="RB"] [pos="VB.*"]
do4
cql[cat="DOAUX"] [pos="RB"] [pos="RB"] [pos="VB.*"]
have4
cql[word="have|has|ve|had|having" & pos="VB.*"] [pos="RB"] [pos="RB"] [pos="VB.*"]
be4
cql[word="be|am|is|are|was|were|been|being|m|re" & pos="VB.*"] [pos="RB"] [pos="RB"] [pos="VB.*"]
combine: md3 | do3 | have3 | be3 | md4 | do4 | have4 | be4
pybiber
pybiber anchors on auxiliary token (dep_rel contains “aux”), then checks forward for ADV + VERB (or ADV + ADV + VERB). Uses UPOS tags (ADV, VERB) not fine-grained Penn tags.
p1
cql[dep="aux.*"] [upos="ADV"] [upos="VERB"]
p2
cql[dep="aux.*"] [upos="ADV"] [upos="ADV"] [upos="VERB"]
combine: p1 | p2
Normalization
Per finite_verbs
Examples
I would actually drive.
Source: le_foll_2024
You can just so tell.
Source: le_foll_2024
I can’t ever imagine arguing with Jill.
Source: le_foll_2024
Sources
- biber_1988 — Biber, Douglas (1988) : Variation across Speech and Writing
- mfte — Le Foll, Elen & Shakir, Muhammad (2023/2025) : Multi-Feature Tagger of English (MFTE) — Python version
- pybiber — Brown, David West & Reinhart, Alex (2026) : pybiber — Python package for linguistic feature extraction and Multi-Dimensional Analysis
- xiao_2009 — Xiao, Richard (2009) : Multidimensional analysis and the study of world Englishes
Notes
D4 loading .44 (persuasion). MFTE merges with split infinitives (SPINF).