We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
A Japanese Tokenizer for Business
Java 842 72
Python version of Sudachi, a Japanese tokenizer.
Python 405 52
Sudachi in Rust 🦀 and new generation of SudachiPy
Rust 350 38
A lexicon for Sudachi
Python 249 19
The Japanese analysis plugin for elasticsearch
Kotlin 205 42
Japanese word embedding with Sudachi and NWJC 🌿
Python 163 6
A synonym token filter plugin for Elasticsearch
A library of TRIE structure using Double-Array
Japanese tokenizer for Transformers