- statistics
- Fast and Numerically Stable Statistical Analysis Utilities

Github**·**Documentation

- nlp utils
- NLP functions for amplifying negations, managing elisions, creating n-grams, stems, phonetic codes, tokens, and more.

Github**·**Documentation

- tokenizer
- Multilingual tokenizer that automatically tags each token with its type, such as word, email, time, or hashtag.

Github**·**Documentation

- naive bayes text classifier
- Configurable Naive Bayes Classifier for text with cross-validation support

Github**·**Documentation

- sentiment
- Accurate and fast sentiment scoring of phrases with emoticons & emojis

Github**·**Documentation

- perceptron
- Multi-class averaged perceptron

Github**·**Documentation

- pos tagger
- English Part-of-speech (POS) tagger

Github**·**Documentation

- porter2 stemmer
- JavaScript Implementation of the Porter Stemmer Algorithm V2 by Dr. Martin F. Porter

Github**·**Documentation

- lemmatizer
- English lemmatizer

Github**·**Documentation

- regression tree
- Decision Tree to predict the value of a continuous target variable

Github**·**Documentation

- bm25 text search
- Fast Full Text Search based on BM25

Github**·**Documentation

- distance
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more

Github**·**Documentation

- jaro distance
- An Implementation of the Jaro Distance Algorithm by Matthew A. Jaro

Github**·**Documentation

- lexicon
- English lexicon useful in NLP/NLU

Github**·**Documentation

- helpers
- Helper functions for JavaScript arrays and objects

Github**·**Documentation

- ner
- Language-agnostic named entity recognizer

Github**·**Documentation