Distance/Similarity functions for Bag of Words, Strings, Numbers, Dates and Vectors.
NLP functions for amplifying negations, managing elisions, creating n-grams, converting tokens to stems and phonetic codes, and more.
Feature hashing (also known as the hashing trick), a fast and space-efficient way of vectorizing features.
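
To make the hashing trick concrete, here is a minimal Python sketch of the general technique. It is not this library's API: the function name `hash_features`, the `n_buckets` parameter, and the use of MD5 as the hash are assumptions chosen for illustration (production implementations often use a faster non-cryptographic hash such as MurmurHash). Each token is hashed to a bucket index in a fixed-length vector, and one further bit of the hash picks a sign so that colliding tokens tend to cancel rather than always add up.

```python
import hashlib


def hash_features(tokens, n_buckets=16):
    """Map a list of string tokens to a fixed-length signed count vector.

    Hypothetical helper for illustration only; not part of any library API.
    """
    vector = [0] * n_buckets
    for token in tokens:
        # MD5 is used only because it is stable across runs and platforms
        # (an assumption of this sketch, not a recommendation).
        digest = hashlib.md5(token.encode("utf-8")).digest()
        # The first four bytes choose the bucket.
        index = int.from_bytes(digest[:4], "big") % n_buckets
        # One bit of the fifth byte chooses the sign.
        sign = 1 if digest[4] & 1 == 0 else -1
        vector[index] += sign
    return vector


if __name__ == "__main__":
    print(hash_features(["the", "quick", "brown", "fox", "the"], n_buckets=8))
```

The vector length depends only on `n_buckets`, never on the vocabulary size, which is where the space efficiency comes from. The cost is that distinct tokens can collide in the same bucket, so the mapping cannot be inverted.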