Skip to content

Releases: OpenPecha/Botok

v0.9.0

09 Mar 06:57
Compare
Choose a tag to compare

Feature

  • Improve test coverage to exceed 90% and ensure Python 3.12 compatibility (7240c90)

Fix

  • Improve test coverage to 90% and resolve Python 3.12 compatibility issues (e8b401c)

v0.8.15

09 Mar 06:33
Compare
Choose a tag to compare

Fix

  • Sync version number with latest release (0.8.14) (04fdcf7)
  • Pin semantic-release action to specific version (5315685)

v0.8.14

09 Mar 06:15
Compare
Choose a tag to compare

v0.8.14 (2025-03-09)

Bug Fixes

  • Improve test coverage to 90% and fix Python 3.12 compatibility (8170f40)

Detailed Changes: v0.8.13...v0.8.14

v0.8.13

24 Jan 05:48
Compare
Choose a tag to compare

v0.8.13 (2025-01-24)

Bug Fixes

  • Sentence normalizer test case updated (9d0af85)

  • Versin bump from 8.12 to 8.13 (249b667)

  • normalization: Fix for short strings (85a3155)


Detailed Changes: v0.8.12...v0.8.13

v0.8.12

17 May 11:36
Compare
Choose a tag to compare

Fix

  • Imports (986ce4b)
  • Better handling of invalid start of string (27128fd)

v0.8.11

11 May 07:37
Compare
Choose a tag to compare

Fix

  • Create new release manually (8cef179)
  • Create new release manually (e1ed718)

v0.8.10

05 Apr 18:34
Compare
Choose a tag to compare

Fix

  • sent-tokenizer: Normalised sentence is included in sentence tokens (7cf455d)
  • NO_POS typos (5d521a8)
  • Auto release (65b0931)
  • Auto release (159d39e)
  • Auto release (c961147)
  • Auto release (1d56341)
  • Auto release (b48e017)
  • Auto release (ab9cf0a)
  • trie: Non inflectible word type error bug fix (36ad9ca)
  • tokenize: Another case of unexpected skip of syl (deacd59)
  • tokenize: Syl of remove word getting skipped unexpectly (161445e)
  • split_affixztion: Keyerror bug fix by using pop method to delete items (a981693)
  • adjusttoken: Rules having invalid index are not considered and error msg is shown in that case (bfe6218)
  • WordTokenizer: Adjusted token list assigned to token list returned by tokenize method (dc898a9)
  • Set config kwargs to None (04eaa9f)
  • Select on .tsv files in dialect pack (e24369a)
  • Set part_lemmas attr to WordTokenizer (56734a1)
  • Running test under coverage (69b2334)
  • Use fully directory base config (b292077)
  • Use full directory base config (38107fe)
  • Skip rules when populating trie (a16ef54)

Documentation

v0.8.8

12 Oct 07:59
Compare
Choose a tag to compare

Fix

  • sent-tokenizer: Normalised sentence is included in sentence tokens (7cf455d)

v0.8.7

21 Jun 18:15
Compare
Choose a tag to compare

Fix

v0.8.6

20 May 06:15
Compare
Choose a tag to compare

Fix