Corpus manager 

LIVAC Synchronous Corpus 
PolyAnalyst 
Text mining 
Corpus linguistics 
General Internet Corpus of Russian 
Linguistic Data Consortium 
Tehran Monolingual Corpus 
American National Corpus 
TIMIT 
Google Ngram Viewer 
N-gram 
PropBank 
Manually Annotated Sub-Corpus 
Native-language identification
Textual entailment
Meaning–text theory
International Computer Archive of Modern and Medieval English
Survey of English Usage
Bank of English
Bergen Corpus of London Teenage Language
Bijankhan Corpus
British National Corpus
Brown Corpus
Cambridge English Corpus
CHILDES
CorCenCC
Corpus of Contemporary American English
Croatian Language Corpus
Croatian National Corpus
German Reference Corpus
Hamshahri Corpus
International Corpus of English
Lancaster-Oslo-Bergen Corpus
Neo-Assyrian Text Corpus Project
Oxford English Corpus
Quranic Arabic Corpus
Russian National Corpus
Scottish Corpus of Texts and Speech
Slovenian National Corpus
Spoken English Corpus
Switchboard Telephone Speech Corpus
Wellington Corpus of Spoken New Zealand English
AsoSoft text corpus
CLAWS (linguistics)
List of text corpora
Yarowsky algorithm
Collocation extraction
FrameNet
Text corpus
© 2026 • Data source: Wikipedia