pip install pynonjishokei
import pynonjishokei
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
- Yasuharu Den, Toshinobu Ogiso, Hideki Ogura, Atsushi Yamada, Nobuaki Menematsu, Kiyotaka Uchimoto, Hanae Koiso. The development of an electronic dictionary for morphological analysis and its application to Japanese corpus linguistics, Japanese Linguistics, Vol.22, pp.101-123 (2007). [in Japanese]
- Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama, Masaya Yamaguchi, Hideki Ogura, Wakako Kashino, Toshinobu Ogiso, Hanae Koiso and Yasuharu Den. Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese, In_Proceedings of the Seventh International Conference on Language Resources and Evaluation_(LREC 2010), pp.1483-1486 (2010).
- Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura. A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation, In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), pp.1019-1024 (2008).
The dictionary used in this project is compiled from the following sources.
JMDICT: This package uses the JMdict/EDICT and KANJIDIC dictionary files. These files are the property of the Electronic Dictionary Research and Development Group, and are used in conformance with the Group's licence.
UniDic for Contemporary Written Japanese (ver.202302-full): GPL v2.0/LGPL v2.1/修正BSD
UniDic for Contemporary Spoken Japanese (ver.202302-full): GPL v2.0/LGPL v2.1/修正BSD
Compound Verb Lexicon: CC BY 4.0 DEED
Ling Yang, for not letting me give up learning Japanese, and also for guiding me through the completion of this project paper.
Haiying Li, Hui Chen, for giving a lot of useful suggestion.
JetBrains, for offering the PyCharm Community Edition for free.