Bohmann 2019 Features

Features from Bohmann's (2019) 236-feature set, mapped to CLEF codes.

# Name Code Status
1 first person pronouns FPP mapped
2 second person pronouns SPP2 mapped
3 third person pronouns TPP mapped
4 pronoun it PIT mapped
5 demonstrative pronouns DEMO_ALL mapped
6 indefinite/quantifying pronouns QUPR mapped
7 non-standard 2nd person plural excluded dialectometric
8 archaic 2nd person ARCH2P added relevant for Tolkien/historical fiction
9 standard reflexives excluded dialectometric framing
10 whom WHOM added formality marker, Grieve
11 object pronoun + being excluded niche construction
12 each other EACHOTHER added Grieve, reciprocal marker
13 one another ONEANOTHER added Grieve, formal reciprocal
14 can MDCA mapped
15 may/might MDMM mapped
16 may/might MDMM mapped
17 could MDCO mapped
18 ought MDOU added Grieve + Leech, distinct modal
19 should MDNE mapped
20 must MDNE mapped
21 would MDWO mapped
22 shall MDSL added Grieve + Leech, formality marker
23 contracted will WILL_FULL added Leech, informality marker
24 uncontracted will WILL_FULL added Grieve, formality contrast with 'll
25 used to excluded dialectometric
26 BE about to BE_ABOUT_TO added Leech + Quirk, aspect marker
27 BE able to BE_ABLE_TO added Leech + Quirk, modal equivalent
28 BE supposed to BE_SUPPOSED_TO added Leech + Quirk, obligation
29 KEEP V-ing KEEP_VING added Leech + Quirk, aspect
30 going to GOING_TO added multiple sources, futurity
31 HAVE to HAVE_TO added multiple sources, obligation
32 possessive HAVE got excluded dialectometric framing
33 passive with GET GET_PASSIVE added Leech, voice alternation
34 NEED to NEED_TO added multiple sources, obligation
35 HAVE + deverbal noun HAVE_DEVERBAL added light verb, register marker
36 TAKE + deverbal noun TAKE_DEVERBAL added light verb, register marker
37 MAKE + deverbal noun MAKE_DEVERBAL added light verb, register marker
38 BE to be + past participle excluded rare construction
39 WANT to WANT_TO added Collins + Mair, desire/volition
40 habitual BE excluded dialectometric
41 by-passive PASSBY mapped
42 agentless be-passive PASS_ALL mapped
43 if + were IF_WERE added Leech, formality/register
44 if + was IF_WAS added Leech, informality contrast
45 if + would excluded dialectometric framing
46 progressive aspect PROG added multiple sources, aspect
47 always-type progressive PROG_ALWAYS added Leech, expressive progressive
48 progressive with stative verbs excluded dialectometric
49 will + progressive WILL_PROG added Leech, tentative/polite
50 progressive passives PROG_PASS added Leech, complex aspect
51 present perfect PEAS mapped
52 past perfect PAST_PERF added standard tense, narrative-relevant
53 strong verbal past STRONG_PAST added Grieve, British/formality marker
54 regularized verbal past REG_PAST added Grieve, American/informality marker
55 do-support with have excluded niche dialectometric
56 gotten as past participle excluded dialectometric
57 analytic negation: not NEG_ALL mapped
58 negative concord / multiple negation excluded dialectometric
59 ain't excluded dialectometric
60 comparative with as/than what excluded dialectometric
61 verb + to-infinitive INFIN mapped
62 verb + gerund complementation GERUND_COMP added multiple sources, complement alternation
63 verbal complement that excluded narrow verb list, overlaps CLEF
64 try and TRY_AND added register alternation with try to
65 try to TRY_TO added register alternation with try and
66 wh + to-infinitive WH_TOINF added Leech, complexity marker
67 prevent/stop + from PREVENT_FROM added Leech + Mair
68 help + to HELP_TO added Leech + Mair, complement alternation
69 the same X as excluded niche alternation
70 the same X that excluded niche alternation
71 BE like as quotative excluded dialectometric/youth speech
72 because CUZ mapped
73 although ALTHOUGH added Grieve + Biber, concessive
74 though ALTHOUGH added Grieve + Biber, informal concessive
75 if/unless COND mapped
76 adverbial subordinators OTHADVSUB mapped
77 conjuncts CONJUNCTS mapped
78 whether WHETHER added Grieve, complement clause
79 discourse-structuring so excluded dialectometric framing
80 actually ACTUALLY added Grieve, discourse marker
81 in fact IN_FACT added Grieve, conjunct
82 in addition IN_ADDITION added Grieve, conjunct
83 as if AS_IF added Grieve, comparison clause
84 as though AS_THOUGH added Grieve, comparison clause
85 downtoners DWNT mapped
86 amplifiers AMP mapped
87 emphatics EMPH mapped
88 epistemic certainty adverbs CERT_ADV added Quirk, stance marker
89 judgment adverbials JUDG_ADV added Quirk, stance marker
90 intensifier so INTENS_SO added common register marker
91 maybe FW_MAYBE added Grieve, epistemic
92 perhaps FW_PERHAPS added Grieve, epistemic (formal)
93 probably FW_PROBABLY added Grieve, epistemic
94 likely FW_LIKELY added Grieve, epistemic
95 especially FW_ESPECIALLY added Grieve, focus adverb
96 particularly FW_PARTICULARLY added Grieve, focus adverb (formal)
97 completely FW_COMPLETELY added Grieve, amplifier
98 entirely FW_ENTIRELY added Grieve, amplifier (formal)
99 almost FW_ALMOST added Grieve, approximator
100 nearly FW_NEARLY added Grieve, approximator
101 standardness excluded Twitter-specific
102 mean word length AWL mapped
103 communication/public verbs COMM mapped
104 private/mental verbs MENTAL mapped
105 definite article the DEF_ART added Leech, basic register feature
106 non-standard definite article da/di excluded dialectometric
107 indefinite article a INDEF_ART added Leech, basic register feature
108 a before vowel-initial words excluded dialectometric variation
109 about + numeral excluded niche quantifier alternation
110 around + numeral excluded niche quantifier alternation
111 more than + numeral excluded niche quantifier alternation
112 over + numeral excluded niche quantifier alternation
113 half/all of excluded niche alternation
114 half/all excluded niche alternation
115 plenty of excluded niche quantifier
116 many/much MANY_MUCH added Grieve, common quantifiers
117 lots of LOTS_OF added Grieve, informal quantifier
118 a lot of A_LOT_OF added Grieve, informal quantifier
119 past + numeral excluded niche alternation
120 last + numeral excluded niche alternation
121 s-genitives S_GENITIVE added Leech, genitive alternation
122 kind of N KIND_OF_N added Grieve, hedging/vagueness
123 type of N excluded less common than kind of
124 sort of N SORT_OF_N added Grieve, hedging/vagueness
125 BE kind of/kinda Adj BE_KIND_OF added Grieve, hedge + informality
126 BE sort of/sorta Adj BE_SORT_OF added Grieve, hedge + informality
127 not only NOT_ONLY added Grieve, correlative
128 not just NOT_JUST added Grieve, correlative (informal)
129 how come excluded niche
130 persons excluded niche lexical
131 place adverbials PLACE mapped
132 time adverbials TIME mapped
133 usually FW_USUALLY added Grieve, frequency adverb
134 normally FW_NORMALLY added Grieve, frequency adverb
135 previously FW_PREVIOUSLY added Grieve, temporal adverb
136 frequently FW_FREQUENTLY added Grieve, frequency adverb
137 often FW_OFTEN added Grieve, frequency adverb
138 sometimes FW_SOMETIMES added Grieve, frequency adverb
139 immediately FW_IMMEDIATELY added Grieve, temporal adverb
140 suddenly FW_SUDDENLY added Grieve, temporal (narrative!)
141 at the same time excluded multi-word, hard to detect cleanly
142 currently FW_CURRENTLY added Grieve, temporal adverb
143 right now excluded multi-word
144 words ending in -where WHERE_WORDS added Grieve, place suffix
145 words ending in -ward WARDS_WORDS added Grieve, directional suffix
146 words ending in -wards WARDS_WORDS added Grieve, directional suffix (British)
147 till TILL added Grieve, temporal (informal)
148 until UNTIL added Grieve, temporal (formal)
149 attributive adjectives JJAT mapped
150 comparative adjective forms JJR added standard grammatical feature
151 superlative adjective forms JJS added standard grammatical feature
152 real + adjective excluded dialectometric
153 in a excluded niche
154 much + comparative adj MUCH_COMP added Grieve, degree modification
155 far + comparative adj FAR_COMP added Grieve, degree modification
156 be as main verb BEMA mapped
157 existential there EX mapped
158 total prepositions IN mapped
159 preposition sequences PREP_SEQ added complexity marker
160 among AMONG added Grieve, formal alternation
161 amongst AMONG added Grieve, formal alternation (British)
162 Xside of excluded niche
163 with no excluded niche alternation
164 without any excluded niche alternation
165 contractions CONT mapped
166 gotta GOTTA added Grieve + Leech, informality
167 wanna WANNA added Grieve + Leech, informality
168 gonna GONNA added Grieve + Leech, informality
169 hyphenation excluded experimental/orthographic
170 gh sequences excluded experimental
171 word-initial a + double consonant excluded experimental
172 initial CCC clusters excluded experimental/phonological
173 final CCC clusters excluded experimental/phonological
174 words ending in -ect excluded experimental
175 anti- PREF_ANTI added
176 be- PREF_BE added
177 co- PREF_CO added
178 con-/com- PREF_CON added
179 counter- PREF_COUNTER added
180 de- PREF_DE added
181 dis- PREF_DIS added
182 en- PREF_EN added
183 ex- PREF_EX added
184 inter- PREF_INTER added
185 mis- PREF_MIS added
186 pre- PREF_PRE added
187 pro- PREF_PRO added
188 re- PREF_RE added
189 semi- PREF_SEMI added
190 speci- PREF_SPECI added
191 spect- PREF_SPECT added
192 specu- PREF_SPECU added
193 sub-/sup- PREF_SUB added
194 super- PREF_SUPER added
195 trans- PREF_TRANS added
196 under-/over- PREF_UNDER_OVER added
197 uni- PREF_UNI added
198 un- PREF_UN added
199 with- PREF_WITH added
200 V + -er/-or SUFF_ER_OR added
201 V + -ed + -ly SUFF_EDLY added
202 -ible/-able SUFF_ABLE added
203 -age SUFF_AGE added
204 -al SUFF_AL added
205 -ance SUFF_ANCE added
206 -ant/-ants SUFF_ANT added
207 -ary/-aries SUFF_ARY added
208 -ation NOMZ mapped
209 -dent SUFF_DENT added
210 -dom SUFF_DOM added
211 -ful SUFF_FUL added
212 -hood SUFF_HOOD added
213 -ian SUFF_IAN added
214 -ial SUFF_IAL added
215 -ic SUFF_ICAN added
216 -ical SUFF_ICAL added
217 -ican SUFF_ICAN added
218 -ify SUFF_IFY added
219 -ion SUFF_ION added
220 -ish SUFF_ISH added
221 -ism SUFF_ISM added
222 -ist SUFF_IST added
223 -ity/-ities SUFF_ITY added
224 -ize SUFF_IZE added
225 -ive SUFF_IVE added
226 -less SUFF_LESS added
227 -like SUFF_LIKE added
228 -ment SUFF_MENT added
229 -ness SUFF_NESS added
230 -ory/-ories SUFF_ORY added
231 -ous SUFF_OUS added
232 -ship SUFF_SHIP added
233 -tor SUFF_TOR added
234 -ture SUFF_TURE added
235 -ular SUFF_ULAR added
236 -wise SUFF_WISE added