Please feel free to contribute by suggesting new tools or by pointing out mistakes in the data.
|jTokenizer||Tokenizing natural language||Tokenizer||Free|
|Natural Language Toolkit||Platform for building Python programs to work with human language data||Tokenizer, Tagger||Unix, MacOSX, Windows (+Python 3.4)||Free|
|Tweet NLP||Tweet tokenizer, POS Tagger, hierarchical word clusters, and a dependency parser for tweets, along with annotated corpora and web-based annotation tools. Clusters: http://www.cs.cmu.edu/~ark/TweetNLP/cluster_viewer.html||POS Tagger, Tokenizer, Parser||Free|
|Unitok||Tool that splits texts into tokens||Tokenizer||Free|