Tools for Corpus Linguistics

A comprehensive list of 111 tools used in corpus analysis.

Please feel free to contribute by suggesting new tools or by pointing out mistakes in the data.

Tool | Description | Categories | Platform | Pricing
@nnotate | Semi-automatic annotation of corpus data | Annotation | Solaris, Linux | Free (with licence agreement)
AMALGAM | Tool for grammatical annotation (POS and phrase structure). Tags text submitted via email. | Annotation | Web | Free
Atomic | Multi-layer corpus annotation platform | Annotation | Linux, MacOSX, Windows | Free
Dexter | Tool for text annotation | Annotation | Linux, MacOSX, Windows | Free
DISCO | Corpus pre-processing tool for a variety of languages that retrieves the semantic similarity between arbitrary words and phrases | Tokenization, Annotation | Windows, Linux, Solaris, and MacOS | Free
ELAN | Transcription and annotation of sound or video files | Transcription, Annotation | Linux, MacOSX, Windows | Free
EXMARaLDA | Tool for transcription, annotation, and corpus analysis of spoken data | Transcription, Annotation, Analysis | | Free
PALinkA | Annotation tool | Annotation | | Down
RSTTool | Tool for annotating texts for constituency and rhetorical structure | Annotation | Windows, Macintosh, UNIX and LINUX | Free
SPre | Tool for segmenting and annotating texts | Annotation | | Free
Synpathy | Tool for manual syntactic annotation | Annotation | Windows, MacOSX, Linux | Free
TreeTagger | Tool for annotating text with part-of-speech and lemma information | POS Tagger, Annotation | Windows, MacOSX, Linux | Free
UAM CorpusTool | Text annotation tool and statistics for various types of linguistic analysis | Annotation | | Free
UAM ImageTool | Image annotation tool for visual data corpora | Annotation | | Free
Worldbuilder (should soon be available) | Tool for annotation and visualisation in analyses applying text-world theory | Annotation, Visualization | ? | ?