You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
jellyfish (Python, C) - Fast string distance and phonetic matching.
py_stringmatching (Python, C) - Large set of string comparison functions and tokenizaztion methods.
textdistance (Python) - Very large collection of sequence comparison functions, including token-based distances.
SecondString (Java) - Java implementation of string comparison functions.
StringCompare (Python, C++) - Time and space efficient implementation of common string distance functions. Architectured for maintainability and extendability.
Comparator (R, C++) - Efficient string comparison functions in R.
Embeddings (for pairwise comparison)
Entity Embed (Python, PyTorch) - Pytorch text embedding model for blocking.
FaceNet-PyTorch (Python, PyTorch) - Embeddings for facial identity resolution.