How much vocabulary a student needs beyond the Dickinson (DCC) Latin Core 1000.
Total running words: 5,818
Distinct headwords: ~986
Known from core 1000: 465
New words to learn: 466
[Chart: Distinct vocabulary in the speech]
[Chart: Running-word coverage by the core 1000]
Method.
Text: Perseus phi0474.phi004 (Divinatio in Q. Caecilium, 73 sections), apparatus / Greek / deleted text stripped.
Core: DCC Latin Core Vocabulary (997 headwords).
Lemmatisation: via the Latin-macronizer morphological database. Enclitics are split, ambiguity is resolved by corpus frequency, and capitalisation is used to route proper nouns, so Verrem → Verres, not the verb verro.
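The lemmatisation rules described here (whole-form lookup, enclitic splitting, capitalisation routing, frequency tie-breaking) can be illustrated with a minimal sketch. This is not the actual pipeline: the `ANALYSES` and `LEMMA_FREQ` tables are hypothetical toy data standing in for the morphological database and corpus frequencies, and the function names are invented for illustration.

```python
# Hypothetical toy tables; a real run would load these from the
# morphological database and a corpus frequency list.
ANALYSES = {
    "verrem": ["Verres", "verro"],  # surface form -> candidate lemmas
    "senatus": ["senatus"],
    "neque": ["neque"],
}
LEMMA_FREQ = {"Verres": 50, "verro": 3, "senatus": 40, "neque": 90}
ENCLITICS = ("que", "ne", "ve")


def candidates(form):
    """Look up candidate lemmas for a surface form, ignoring case."""
    return ANALYSES.get(form.lower(), [])


def pick(form, cands):
    """Choose one lemma: route by capitalisation, then by frequency."""
    is_cap = form[:1].isupper()
    # Capitalised tokens prefer proper-noun lemmas; others prefer common ones.
    preferred = [c for c in cands if c[:1].isupper() == is_cap]
    pool = preferred or cands  # fall back if nothing matches the routing
    return max(pool, key=lambda c: LEMMA_FREQ.get(c, 0))


def lemmatise(token):
    """Return a list of lemmas for one token (empty if unrecognised)."""
    cands = candidates(token)
    if cands:  # try the whole form first, so 'neque' is not split
        return [pick(token, cands)]
    for enc in ENCLITICS:
        if token.lower().endswith(enc) and len(token) > len(enc):
            stem = token[: -len(enc)]
            cands = candidates(stem)
            if cands:
                return [pick(stem, cands), enc]
    return []


print(lemmatise("Verrem"))      # capitalised, routed to the proper noun
print(lemmatise("senatusque"))  # enclitic split off
```

Looking up the whole form before trying enclitics is a design choice in this sketch: it keeps lexicalised words such as neque intact rather than splitting them.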
Verification found 0 false misses and only 5 unrecognised token-forms, so the new-word count is robust to within about ±5–10 headwords.
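For readers who want to reproduce the headline counts, the following is a rough sketch of how they could be derived from a lemmatised token stream. It assumes that "new" simply means "not in the core list"; the function name `coverage` and the sample lemmas are hypothetical, and the real analysis may treat proper nouns or unrecognised forms differently.

```python
from collections import Counter


def coverage(lemmas, core):
    """Summarise how much of a lemmatised text a core list covers.

    lemmas: iterable of headwords, one per running word.
    core:   set of core-list headwords.
    """
    counts = Counter(lemmas)
    total = sum(counts.values())
    distinct = set(counts)
    # Running words whose headword is in the core list.
    running_known = sum(n for lemma, n in counts.items() if lemma in core)
    return {
        "running_words": total,
        "distinct_headwords": len(distinct),
        "known_from_core": len(distinct & core),
        "new_to_learn": len(distinct - core),  # assumption: new = not in core
        "running_coverage": running_known / total if total else 0.0,
    }


# Toy input, not data from the speech.
sample = ["sum", "sum", "et", "Verres", "iudex", "et"]
stats = coverage(sample, core={"sum", "et", "iudex"})
print(stats)
```

Headword coverage and running-word coverage are reported separately because frequent core words account for a larger share of running text than of the distinct vocabulary.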
Source: DCC Latin Core Vocabulary.