TopWORDS (Deng et al., 2016) is an unsupervised Chinese word segmentation method.
The method uses the EM algorithm and dynamic programming.
TopWORDS not only segments a corpus but also constructs a dictionary.
This package is inspired by qf6101's TopWORDS Scala package.
This package also contains the famous Chinese novel SOS (Story of the Stone).
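
To illustrate the dynamic programming part mentioned above, here is a minimal Python sketch, not this package's actual code. It assumes a unigram word model in which a text is a concatenation of independently drawn dictionary words, and it finds the most probable segmentation for a fixed dictionary. The function name `segment`, the `max_len` parameter, and the toy dictionary `toy_probs` are all hypothetical and chosen only for illustration. In the full method the word probabilities would be estimated by EM rather than given by hand.

```python
import math


def segment(text, word_probs, max_len=4):
    """Return the most probable split of `text` into dictionary words.

    Assumption: words are independent, so the score of a segmentation
    is the sum of the log-probabilities of its words.
    """
    n = len(text)
    best = [-math.inf] * (n + 1)  # best[i]: best log-score for text[:i]
    back = [0] * (n + 1)          # back[i]: start index of the last word in that split
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            p = word_probs.get(text[start:end])
            if p is None or best[start] == -math.inf:
                continue  # not a dictionary word, or the prefix is unreachable
            score = best[start] + math.log(p)
            if score > best[end]:
                best[end] = score
                back[end] = start

    if best[n] == -math.inf:
        raise ValueError("text cannot be segmented with this dictionary")

    # Walk the back-pointers from the end of the text to recover the words.
    words = []
    end = n
    while end > 0:
        start = back[end]
        words.append(text[start:end])
        end = start
    return words[::-1]


# Hypothetical toy dictionary with hand-picked probabilities.
toy_probs = {"a": 0.1, "b": 0.1, "c": 0.1, "ab": 0.3, "bc": 0.05, "abc": 0.02}
print(segment("abcab", toy_probs))
```

The sketch keeps only the single best path. An EM-based estimator would instead need to sum over all segmentations, which the same table structure can support by replacing the maximum with a sum.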
kkuanhui/TopWORDS
An unsupervised Chinese word segmentation method using the EM algorithm and WDM.