Tools for Corpus Linguistics

A comprehensive list of 93 tools used in corpus analysis.
Suggest a Tool

Everything
Category
Annotation
Concordancer
Parser
POS Tagger
Search
Visualization
Wordlists
Converter
Text analysis
Assessing text complexity
Analysis
Compilation
Parsing
?
Database
Tokenization
Transcription
Downloader
converter
Semantic Parser
ngrams
Network Analysis
Semantic Tagger
Tokenizer
Boilerplate remover
concordancer
statistical analysis
visualization
Segmentation
Morphological tagger
Statistical NLP
Metaphor identifier
Sentence Boundary Detector
Morphological Tagger
Tagger
Multilevel Tagger
Corpus creation
semantic analysis
word lists
Duplicate remover
Conversion
Meta modelling
Editing
searching
Tokenizing
Crawler
Search tool
Variant detector
Statistics
Indexing
Phonology
word2vec

Tool Description Categories Platform Pricing
aConCordeMultilingual concordance tool (English and Arabic)ConcordancerLinux, MacOSX, WindowsFree
AntConcCorpus analysis toolkitWordlists, ConcordancerLinux, MacOSX, WindowsFree
AntPConcCorpus analysis toolkit for files encoded with UTF-8Wordlists, ConcordancerWindows, MacOSXFree
BNCWebBNCweb is a web-based client program for searching and retrieving lexical, grammatical and textual data from the British National Corpus (BNC).Analysis, ConcordancerWebFree
CollocateTool for the extraction of concordances and collocationsConcordancerWindows35 USD
ConcordancerOnline tool for frequency counts and text cloudsConcordancerWebFree
CorpKitAn advanced modern corpus toolkit with an emphasis on visualization and annotated corpora.Wordlists, Parsing, Concordancer, VisualizationLinux, MacOSX, Windows (Python)Free
Corpus PresenterTree tagger and corpus analysis softwareWordlists, Parsing, Concordancer, VisualizationWindowsFree
HeidelGram Web-Based ToolsBasic corpus analysis toolkit for the HeidelGram CorpusWordlists, ConcordancerWebFree
IMS Corpus WorkbenchTool for sorting frequencies in corporaWordlists, ConcordancerWeb and local versionFree
MLCTTool for building and processing corporaConcordancer, Sentence Boundary DetectorFree
MonoConc EsyConcordancing and text search tool that allows primary and secondary concordancingConcordancer, Sentence Boundary DetectorFree for non-commerical research
OpenConcTool for concordancingConcordancerFree
ParaConcA bilingual/multilingual concordancerConcordancerNon-Free
PhraseContextTool for wordlists, concordancing, collocation, TTR, Wordlists, Concordancer35€
Simple Concordance ProgramTool for concordance and word listing that works with many languagesConcordancerWindows, MacOSXFree
WConcord 3.0A full featured concordancerConcordancerFree
WmatrixTool for corpus analysis and comparisonWordlists, Concordancer, POS Tagger, Semantic TaggerWeb£50 per username per year
WordsmithOne of the most established corpus toolkitsConcordancer, Wordlists, StatisticsWindows60€ per licence
WordstatixCorpus analysis toolConcordancerFree
PyXMLConcConcordancer for XML files with automatic tag and attribute detection.ConcordancerMulti (Python), WindowsFree, Open Source