Tools for Corpus Linguistics

A comprehensive list of 93 tools used in corpus analysis.


Tool | Description | Categories | Platform | Pricing
AntConc | Corpus analysis toolkit | Wordlists, Concordancer | Linux, MacOSX, Windows | Free
AntPConc | Corpus analysis toolkit for files encoded with UTF-8 | Wordlists, Concordancer | Windows, MacOSX | Free
CorpKit | An advanced modern corpus toolkit with an emphasis on visualization and annotated corpora. | Wordlists, Parsing, Concordancer, Visualization | Linux, MacOSX, Windows (Python) | Free
Corpus Presenter | Tree tagger and corpus analysis software | Wordlists, Parsing, Concordancer, Visualization | Windows | Free
HeidelGram Web-Based Tools | Basic corpus analysis toolkit for the HeidelGram Corpus | Wordlists, Concordancer | Web | Free
HGSimpleCorpusNetwork | Batch frequency analysis on corrupted (e.g. OCR) corpus data and generation of network analysis data. | Wordlists, Network Analysis | Multi (Python) | Free, Open Source
IMS Corpus Workbench | Tool for sorting frequencies in corpora | Wordlists, Concordancer | Web and local version | Free
LancsBox | Software package for the analysis of language data and corpora | Wordlists, Concordancer, Statistical analysis, Visualization | | Free
PhraseContext | Tool for wordlists, concordancing, collocation, TTR, | Wordlists, Concordancer | | 35€
ProtAnt | Tool for prototypical text analysis | Wordlists | Windows, MacOSX | Free
Wmatrix | Tool for corpus analysis and comparison | Wordlists, Concordancer, POS Tagger, Semantic Tagger | Web | £50 per username per year
Wordsmith | One of the most established corpus toolkits | Concordancer, Wordlists, Statistics | Windows | 60€ per licence