Bohmann 2019 Features
Features from Bohmann's (2019) 236-feature set, mapped to CLEF codes.
| # | Name | Code | Status |
|---|---|---|---|
| 1 | first person pronouns | FPP | mapped |
| 2 | second person pronouns | SPP2 | mapped |
| 3 | third person pronouns | TPP | mapped |
| 4 | pronoun it | PIT | mapped |
| 5 | demonstrative pronouns | DEMO_ALL | mapped |
| 6 | indefinite/quantifying pronouns | QUPR | mapped |
| 7 | non-standard 2nd person plural | excluded | |
| 8 | archaic 2nd person | ARCH2P | added |
| 9 | standard reflexives | excluded | |
| 10 | whom | WHOM | added |
| 11 | object pronoun + being | excluded | |
| 12 | each other | EACHOTHER | added |
| 13 | one another | ONEANOTHER | added |
| 14 | can | MDCA | mapped |
| 15 | may/might | MDMM | mapped |
| 16 | may/might | MDMM | mapped |
| 17 | could | MDCO | mapped |
| 18 | ought | MDOU | added |
| 19 | should | MDNE | mapped |
| 20 | must | MDNE | mapped |
| 21 | would | MDWO | mapped |
| 22 | shall | MDSL | added |
| 23 | contracted will | WILL_FULL | added |
| 24 | uncontracted will | WILL_FULL | added |
| 25 | used to | excluded | |
| 26 | BE about to | BE_ABOUT_TO | added |
| 27 | BE able to | BE_ABLE_TO | added |
| 28 | BE supposed to | BE_SUPPOSED_TO | added |
| 29 | KEEP V-ing | KEEP_VING | added |
| 30 | going to | GOING_TO | added |
| 31 | HAVE to | HAVE_TO | added |
| 32 | possessive HAVE got | excluded | |
| 33 | passive with GET | GET_PASSIVE | added |
| 34 | NEED to | NEED_TO | added |
| 35 | HAVE + deverbal noun | HAVE_DEVERBAL | added |
| 36 | TAKE + deverbal noun | TAKE_DEVERBAL | added |
| 37 | MAKE + deverbal noun | MAKE_DEVERBAL | added |
| 38 | BE to be + past participle | excluded | |
| 39 | WANT to | WANT_TO | added |
| 40 | habitual BE | excluded | |
| 41 | by-passive | PASSBY | mapped |
| 42 | agentless be-passive | PASS_ALL | mapped |
| 43 | if + were | IF_WERE | added |
| 44 | if + was | IF_WAS | added |
| 45 | if + would | excluded | |
| 46 | progressive aspect | PROG | added |
| 47 | always-type progressive | PROG_ALWAYS | added |
| 48 | progressive with stative verbs | excluded | |
| 49 | will + progressive | WILL_PROG | added |
| 50 | progressive passives | PROG_PASS | added |
| 51 | present perfect | PEAS | mapped |
| 52 | past perfect | PAST_PERF | added |
| 53 | strong verbal past | STRONG_PAST | added |
| 54 | regularized verbal past | REG_PAST | added |
| 55 | do-support with have | excluded | |
| 56 | gotten as past participle | excluded | |
| 57 | analytic negation: not | NEG_ALL | mapped |
| 58 | negative concord / multiple negation | excluded | |
| 59 | ain't | excluded | |
| 60 | comparative with as/than what | excluded | |
| 61 | verb + to-infinitive | INFIN | mapped |
| 62 | verb + gerund complementation | GERUND_COMP | added |
| 63 | verbal complement that | excluded | |
| 64 | try and | TRY_AND | added |
| 65 | try to | TRY_TO | added |
| 66 | wh + to-infinitive | WH_TOINF | added |
| 67 | prevent/stop + from | PREVENT_FROM | added |
| 68 | help + to | HELP_TO | added |
| 69 | the same X as | excluded | |
| 70 | the same X that | excluded | |
| 71 | BE like as quotative | excluded | |
| 72 | because | CUZ | mapped |
| 73 | although | ALTHOUGH | added |
| 74 | though | ALTHOUGH | added |
| 75 | if/unless | COND | mapped |
| 76 | adverbial subordinators | OTHADVSUB | mapped |
| 77 | conjuncts | CONJUNCTS | mapped |
| 78 | whether | WHETHER | added |
| 79 | discourse-structuring so | excluded | |
| 80 | actually | ACTUALLY | added |
| 81 | in fact | IN_FACT | added |
| 82 | in addition | IN_ADDITION | added |
| 83 | as if | AS_IF | added |
| 84 | as though | AS_THOUGH | added |
| 85 | downtoners | DWNT | mapped |
| 86 | amplifiers | AMP | mapped |
| 87 | emphatics | EMPH | mapped |
| 88 | epistemic certainty adverbs | CERT_ADV | added |
| 89 | judgment adverbials | JUDG_ADV | added |
| 90 | intensifier so | INTENS_SO | added |
| 91 | maybe | FW_MAYBE | added |
| 92 | perhaps | FW_PERHAPS | added |
| 93 | probably | FW_PROBABLY | added |
| 94 | likely | FW_LIKELY | added |
| 95 | especially | FW_ESPECIALLY | added |
| 96 | particularly | FW_PARTICULARLY | added |
| 97 | completely | FW_COMPLETELY | added |
| 98 | entirely | FW_ENTIRELY | added |
| 99 | almost | FW_ALMOST | added |
| 100 | nearly | FW_NEARLY | added |
| 101 | standardness | excluded | |
| 102 | mean word length | AWL | mapped |
| 103 | communication/public verbs | COMM | mapped |
| 104 | private/mental verbs | MENTAL | mapped |
| 105 | definite article the | DEF_ART | added |
| 106 | non-standard definite article da/di | excluded | |
| 107 | indefinite article a | INDEF_ART | added |
| 108 | a before vowel-initial words | excluded | |
| 109 | about + numeral | excluded | |
| 110 | around + numeral | excluded | |
| 111 | more than + numeral | excluded | |
| 112 | over + numeral | excluded | |
| 113 | half/all of | excluded | |
| 114 | half/all | excluded | |
| 115 | plenty of | excluded | |
| 116 | many/much | MANY_MUCH | added |
| 117 | lots of | LOTS_OF | added |
| 118 | a lot of | A_LOT_OF | added |
| 119 | past + numeral | excluded | |
| 120 | last + numeral | excluded | |
| 121 | s-genitives | S_GENITIVE | added |
| 122 | kind of N | KIND_OF_N | added |
| 123 | type of N | excluded | |
| 124 | sort of N | SORT_OF_N | added |
| 125 | BE kind of/kinda Adj | BE_KIND_OF | added |
| 126 | BE sort of/sorta Adj | BE_SORT_OF | added |
| 127 | not only | NOT_ONLY | added |
| 128 | not just | NOT_JUST | added |
| 129 | how come | excluded | |
| 130 | persons | excluded | |
| 131 | place adverbials | PLACE | mapped |
| 132 | time adverbials | TIME | mapped |
| 133 | usually | FW_USUALLY | added |
| 134 | normally | FW_NORMALLY | added |
| 135 | previously | FW_PREVIOUSLY | added |
| 136 | frequently | FW_FREQUENTLY | added |
| 137 | often | FW_OFTEN | added |
| 138 | sometimes | FW_SOMETIMES | added |
| 139 | immediately | FW_IMMEDIATELY | added |
| 140 | suddenly | FW_SUDDENLY | added |
| 141 | at the same time | excluded | |
| 142 | currently | FW_CURRENTLY | added |
| 143 | right now | excluded | |
| 144 | words ending in -where | WHERE_WORDS | added |
| 145 | words ending in -ward | WARDS_WORDS | added |
| 146 | words ending in -wards | WARDS_WORDS | added |
| 147 | till | TILL | added |
| 148 | until | UNTIL | added |
| 149 | attributive adjectives | JJAT | mapped |
| 150 | comparative adjective forms | JJR | added |
| 151 | superlative adjective forms | JJS | added |
| 152 | real + adjective | excluded | |
| 153 | in a | excluded | |
| 154 | much + comparative adj | MUCH_COMP | added |
| 155 | far + comparative adj | FAR_COMP | added |
| 156 | be as main verb | BEMA | mapped |
| 157 | existential there | EX | mapped |
| 158 | total prepositions | IN | mapped |
| 159 | preposition sequences | PREP_SEQ | added |
| 160 | among | AMONG | added |
| 161 | amongst | AMONG | added |
| 162 | Xside of | excluded | |
| 163 | with no | excluded | |
| 164 | without any | excluded | |
| 165 | contractions | CONT | mapped |
| 166 | gotta | GOTTA | added |
| 167 | wanna | WANNA | added |
| 168 | gonna | GONNA | added |
| 169 | hyphenation | excluded | |
| 170 | gh sequences | excluded | |
| 171 | word-initial a + double consonant | excluded | |
| 172 | initial CCC clusters | excluded | |
| 173 | final CCC clusters | excluded | |
| 174 | words ending in -ect | excluded | |
| 175 | anti- | PREF_ANTI | added |
| 176 | be- | PREF_BE | added |
| 177 | co- | PREF_CO | added |
| 178 | con-/com- | PREF_CON | added |
| 179 | counter- | PREF_COUNTER | added |
| 180 | de- | PREF_DE | added |
| 181 | dis- | PREF_DIS | added |
| 182 | en- | PREF_EN | added |
| 183 | ex- | PREF_EX | added |
| 184 | inter- | PREF_INTER | added |
| 185 | mis- | PREF_MIS | added |
| 186 | pre- | PREF_PRE | added |
| 187 | pro- | PREF_PRO | added |
| 188 | re- | PREF_RE | added |
| 189 | semi- | PREF_SEMI | added |
| 190 | speci- | PREF_SPECI | added |
| 191 | spect- | PREF_SPECT | added |
| 192 | specu- | PREF_SPECU | added |
| 193 | sub-/sup- | PREF_SUB | added |
| 194 | super- | PREF_SUPER | added |
| 195 | trans- | PREF_TRANS | added |
| 196 | under-/over- | PREF_UNDER_OVER | added |
| 197 | uni- | PREF_UNI | added |
| 198 | un- | PREF_UN | added |
| 199 | with- | PREF_WITH | added |
| 200 | V + -er/-or | SUFF_ER_OR | added |
| 201 | V + -ed + -ly | SUFF_EDLY | added |
| 202 | -ible/-able | SUFF_ABLE | added |
| 203 | -age | SUFF_AGE | added |
| 204 | -al | SUFF_AL | added |
| 205 | -ance | SUFF_ANCE | added |
| 206 | -ant/-ants | SUFF_ANT | added |
| 207 | -ary/-aries | SUFF_ARY | added |
| 208 | -ation | NOMZ | mapped |
| 209 | -dent | SUFF_DENT | added |
| 210 | -dom | SUFF_DOM | added |
| 211 | -ful | SUFF_FUL | added |
| 212 | -hood | SUFF_HOOD | added |
| 213 | -ian | SUFF_IAN | added |
| 214 | -ial | SUFF_IAL | added |
| 215 | -ic | SUFF_ICAN | added |
| 216 | -ical | SUFF_ICAL | added |
| 217 | -ican | SUFF_ICAN | added |
| 218 | -ify | SUFF_IFY | added |
| 219 | -ion | SUFF_ION | added |
| 220 | -ish | SUFF_ISH | added |
| 221 | -ism | SUFF_ISM | added |
| 222 | -ist | SUFF_IST | added |
| 223 | -ity/-ities | SUFF_ITY | added |
| 224 | -ize | SUFF_IZE | added |
| 225 | -ive | SUFF_IVE | added |
| 226 | -less | SUFF_LESS | added |
| 227 | -like | SUFF_LIKE | added |
| 228 | -ment | SUFF_MENT | added |
| 229 | -ness | SUFF_NESS | added |
| 230 | -ory/-ories | SUFF_ORY | added |
| 231 | -ous | SUFF_OUS | added |
| 232 | -ship | SUFF_SHIP | added |
| 233 | -tor | SUFF_TOR | added |
| 234 | -ture | SUFF_TURE | added |
| 235 | -ular | SUFF_ULAR | added |
| 236 | -wise | SUFF_WISE | added |